Publication Date
In 2025: 0
Since 2024: 0
Since 2021 (last 5 years): 2
Since 2016 (last 10 years): 8
Since 2006 (last 20 years): 20
Descriptor
Simulation: 65
Testing Problems: 65
Test Items: 20
Computer Assisted Testing: 19
Adaptive Testing: 15
Test Construction: 15
Item Response Theory: 14
Evaluation Methods: 12
Scores: 10
Scoring: 9
Testing: 9
Author
Sinharay, Sandip: 5
Davey, Tim: 3
Stocking, Martha L.: 3
Choi, Seung W.: 2
Guarino, Cassandra M.: 2
Kim, Dong-In: 2
Parshall, Cynthia G.: 2
Pommerich, Mary: 2
Reckase, Mark D.: 2
Robitzsch, Alexander: 2
Stacy, Brian W.: 2
Education Level
Elementary Secondary Education: 2
Secondary Education: 2
Audience
Researchers: 1
Assessments and Surveys
Indiana Statewide Testing for…: 2
Program for International…: 2
SAT (College Admission Test): 2
Armed Services Vocational…: 1
Minnesota Teacher Attitude…: 1
National Assessment of…: 1
Test of English as a Foreign…: 1
Lee, Chansoon; Qian, Hong – Educational and Psychological Measurement, 2022
Using classical test theory and item response theory, this study applied sequential procedures to a real operational item pool in variable-length computerized adaptive testing (CAT) to detect items whose security may be compromised. Moreover, this study proposed a hybrid threshold approach to improve the detection power of the sequential…
Descriptors: Computer Assisted Testing, Adaptive Testing, Licensing Examinations (Professions), Item Response Theory
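As a hedged illustration of the general idea behind such item-security monitoring (not the authors' procedure): under an item's calibrated IRT parameters, a compromised item tends to be answered correctly more often than the model predicts, so a running standardized difference between observed and expected correct responses can be checked against a threshold. All parameter values below are invented for the sketch.

```python
import numpy as np

rng = np.random.default_rng(0)

def p_correct(theta, a, b):
    """2PL probability of a correct response."""
    return 1.0 / (1.0 + np.exp(-a * (theta - b)))

# Hypothetical calibrated parameters for one operational item.
a, b = 1.2, 0.3

# Simulate a stream of examinees; after index 500 the item is "leaked",
# so examinees answer it correctly more often than the model predicts.
n = 1000
theta = rng.normal(size=n)
p = p_correct(theta, a, b)
p_leaked = np.where(np.arange(n) >= 500, np.minimum(p + 0.25, 0.98), p)
x = rng.binomial(1, p_leaked)

# Sequential monitoring: cumulative standardized excess of observed over
# expected correct responses (a CUSUM-like statistic, for illustration only).
excess = np.cumsum(x - p)
se = np.sqrt(np.cumsum(p * (1 - p)))
z = excess / se

threshold = 3.0   # illustrative flagging threshold
flag_at = np.argmax(z > threshold) if np.any(z > threshold) else None
print("item flagged at examinee index:", flag_at)
```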
Robitzsch, Alexander; Lüdtke, Oliver – Large-scale Assessments in Education, 2023
One major aim of international large-scale assessments (ILSA) like PISA is to monitor changes in student performance over time. To accomplish this task, a set of common items (i.e., link items) is repeatedly administered in each assessment. Linking methods based on item response theory (IRT) models are used to align the results from the different…
Descriptors: Educational Trends, Trend Analysis, International Assessment, Achievement Tests
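As a minimal sketch of common-item linking (one standard approach, not necessarily the specific methods the authors compare), a mean-sigma transformation puts one cycle's item difficulties onto another cycle's scale using the link items; the difficulty values below are invented.

```python
import numpy as np

# Hypothetical 2PL difficulty estimates for the same link items
# calibrated separately in two assessment cycles.
b_old = np.array([-1.2, -0.4, 0.1, 0.8, 1.5])
b_new = np.array([-0.9, -0.1, 0.4, 1.1, 1.9])

# Mean-sigma linking constants: theta_old = A * theta_new + B.
A = b_old.std(ddof=1) / b_new.std(ddof=1)
B = b_old.mean() - A * b_new.mean()

# Transform the new calibration onto the old scale.
b_new_linked = A * b_new + B
print("A =", round(A, 3), "B =", round(B, 3))
print("linked difficulties:", np.round(b_new_linked, 3))
```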
Chun Wang; Ping Chen; Shengyu Jiang – Journal of Educational Measurement, 2020
Many large-scale educational surveys have moved from linear form design to multistage testing (MST) design. One advantage of MST is that it can provide more accurate latent trait [theta] estimates using fewer items than required by linear tests. However, MST generates incomplete response data by design; hence, questions remain as to how to…
Descriptors: Test Construction, Test Items, Adaptive Testing, Maximum Likelihood Statistics
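A minimal sketch of the estimation issue raised here: in MST only a subset of items is administered, so the likelihood for an examinee's latent trait uses just the items they saw. Everything below (item parameters, responses) is invented for illustration and is not the authors' estimator.

```python
import numpy as np
from scipy.optimize import minimize_scalar

def p2pl(theta, a, b):
    """2PL probability of a correct response."""
    return 1.0 / (1.0 + np.exp(-a * (theta - b)))

# Hypothetical 2PL item pool; NaN marks items not administered on this
# examinee's MST route, so they simply drop out of the likelihood.
a = np.array([1.0, 1.4, 0.8, 1.2, 1.1, 0.9])
b = np.array([-1.0, -0.3, 0.0, 0.5, 1.0, 1.6])
resp = np.array([1, 1, np.nan, 0, np.nan, 0], dtype=float)

seen = ~np.isnan(resp)

def neg_loglik(theta):
    p = p2pl(theta, a[seen], b[seen])
    x = resp[seen]
    return -np.sum(x * np.log(p) + (1 - x) * np.log(1 - p))

# Maximum likelihood estimate of theta from the administered items only.
res = minimize_scalar(neg_loglik, bounds=(-4, 4), method="bounded")
print("theta_hat =", round(res.x, 3))
```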
Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2018
Wollack, Cohen, and Eckerly suggested the "erasure detection index" (EDI) to detect fraudulent erasures for individual examinees. Wollack and Eckerly extended the EDI to detect fraudulent erasures at the group level. The EDI at the group level was found to be slightly conservative. This article suggests two modifications of the EDI for…
Descriptors: Deception, Identification, Testing Problems, Cheating
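A hedged sketch of the general idea behind standardized erasure indices (illustrative only; not the published EDI formula): compare the observed count of wrong-to-right erasures with its model-implied expectation and standardize, with a continuity correction as one common conservative adjustment. All values below are invented.

```python
import numpy as np

def p2pl(theta, a, b):
    return 1.0 / (1.0 + np.exp(-a * (theta - b)))

# Hypothetical data for one group: for each erased response we have the
# 2PL probability that the post-erasure answer would be correct anyway.
a = np.array([1.1, 0.9, 1.3, 1.0, 1.2, 0.8, 1.1, 1.0])
b = np.array([0.2, -0.5, 0.7, 0.0, 1.1, -0.2, 0.4, 0.9])
theta = np.array([0.1, 0.1, -0.3, -0.3, 0.5, 0.5, 0.0, 0.0])
wtr = np.array([1, 0, 1, 1, 0, 1, 1, 1])   # observed wrong-to-right erasures

p = p2pl(theta, a, b)

# Standardized excess of wrong-to-right erasures over expectation, with a
# 0.5 continuity correction (illustrative; not the exact EDI).
stat = (wtr.sum() - p.sum() - 0.5) / np.sqrt(np.sum(p * (1 - p)))
print("standardized erasure statistic:", round(stat, 3))
```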
Schneider, W. Joel; Roman, Zachary – Journal of Psychoeducational Assessment, 2018
We used data simulations to test whether composites consisting of cohesive subtest scores are more accurate than composites consisting of divergent subtest scores. We demonstrate that when multivariate normality holds, divergent and cohesive scores are equally accurate. Furthermore, excluding divergent scores results in biased estimates of…
Descriptors: Statistical Data, Simulation, Testing, Scores
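A small simulation sketch of the comparison described, under invented parameters: draw subtest scores that measure a common true score with independent normal errors, split examinees into cohesive and divergent profiles by subtest spread, and check that the composite is about equally accurate in both groups (since the sample mean and variance are independent under normality).

```python
import numpy as np

rng = np.random.default_rng(1)

# Simulate 100,000 examinees, each with 4 subtest scores that measure the
# same true score with independent normal errors (invented reliabilities).
n, k = 100_000, 4
true = rng.normal(size=n)
scores = true[:, None] + rng.normal(scale=0.5, size=(n, k))

composite = scores.mean(axis=1)
spread = scores.std(axis=1, ddof=1)

# Split examinees into "cohesive" and "divergent" profiles by subtest spread.
divergent = spread > np.median(spread)

def rmse(err):
    return np.sqrt(np.mean(err ** 2))

print("RMSE, cohesive profiles :", round(rmse(composite[~divergent] - true[~divergent]), 4))
print("RMSE, divergent profiles:", round(rmse(composite[divergent] - true[divergent]), 4))
# Under normality the two RMSEs come out essentially equal.
```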
Sinharay, Sandip – Grantee Submission, 2017
Wollack, Cohen, and Eckerly (2015) suggested the "erasure detection index" (EDI) to detect fraudulent erasures for individual examinees. Wollack and Eckerly (2017) extended the EDI to detect fraudulent erasures at the group level. The EDI at the group level was found to be slightly conservative. This paper suggests two modifications of…
Descriptors: Deception, Identification, Testing Problems, Cheating
Sinharay, Sandip – Applied Measurement in Education, 2017
Karabatsos compared the power of 36 person-fit statistics using receiver operating characteristic curves and found the "H[superscript T]" statistic to be the most powerful in identifying aberrant examinees. He found three statistics, "C", "MCI", and "U3", to be the next most powerful. These four statistics,…
Descriptors: Nonparametric Statistics, Goodness of Fit, Simulation, Comparative Analysis
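As a hedged illustration of the kind of nonparametric person-fit measure compared in this line of work, the sketch below counts Guttman errors, a simpler relative of U3 and H^T rather than the H^T computation itself; the p-values and response patterns are invented.

```python
import numpy as np

def guttman_errors(x, p_values):
    """Count item pairs (easy, hard) where the easier item is missed
    but the harder item is answered correctly."""
    order = np.argsort(-p_values)          # easiest (highest p-value) first
    x_sorted = x[order]
    errors = 0
    for i in range(len(x_sorted)):
        for j in range(i + 1, len(x_sorted)):
            if x_sorted[i] == 0 and x_sorted[j] == 1:
                errors += 1
    return errors

# Hypothetical classical p-values and two response patterns of equal score.
p_vals = np.array([0.9, 0.8, 0.7, 0.5, 0.4, 0.2])
typical_pattern = np.array([1, 1, 1, 1, 0, 0])    # consistent with difficulty
aberrant_pattern = np.array([0, 0, 1, 1, 1, 1])   # misses the easiest items

print("Guttman errors, typical examinee :", guttman_errors(typical_pattern, p_vals))
print("Guttman errors, aberrant examinee:", guttman_errors(aberrant_pattern, p_vals))
```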
Guarino, Cassandra M.; Reckase, Mark D.; Stacy, Brian W.; Wooldridge, Jeffrey M. – Journal of Research on Educational Effectiveness, 2015
We study the properties of two specification tests that have been applied to a variety of estimators in the context of value-added measures (VAMs) of teacher and school quality: the Hausman test for choosing between student-level random and fixed effects, and a test for feedback (sometimes called a "falsification test"). We discuss…
Descriptors: Teacher Effectiveness, Educational Quality, Evaluation Methods, Tests
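A hedged sketch of the Hausman statistic mentioned here, computed from already-estimated fixed-effects and random-effects coefficients and covariance matrices; the numbers are invented and this is not the authors' VAM application.

```python
import numpy as np
from scipy.stats import chi2

# Hypothetical coefficient estimates and covariance matrices from a
# fixed-effects and a random-effects fit of the same value-added model.
b_fe = np.array([0.42, -0.10])
b_re = np.array([0.38, -0.06])
V_fe = np.array([[0.0040, 0.0004],
                 [0.0004, 0.0025]])
V_re = np.array([[0.0025, 0.0002],
                 [0.0002, 0.0016]])

# Hausman statistic: H = d' (V_fe - V_re)^{-1} d, chi-square with k df.
d = b_fe - b_re
H = d @ np.linalg.inv(V_fe - V_re) @ d
p_value = chi2.sf(H, df=len(d))
print("H =", round(H, 3), "p =", round(p_value, 4))
# A small p-value favors fixed effects; a large one permits random effects.
```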
Debeer, Dries; Janssen, Rianne; De Boeck, Paul – Journal of Educational Measurement, 2017
When dealing with missing responses, two types of omissions can be discerned: items can be skipped or not reached by the test taker. When the occurrence of these omissions is related to the proficiency process, the missingness is nonignorable. The purpose of this article is to present a tree-based IRT framework for modeling responses and omissions…
Descriptors: Item Response Theory, Test Items, Responses, Testing Problems
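A minimal sketch of the tree-recoding idea behind IRTree-style models for omissions: each observed response is split into pseudo-items, one for whether the item was attempted and one for correctness given an attempt, while not-reached items stay missing at both nodes. The response codes below are invented for the illustration.

```python
import numpy as np

# Illustrative response codes: 1 = correct, 0 = incorrect,
# -1 = skipped (omitted), -9 = not reached.
responses = np.array([
    [1, 0, -1, 1, -9, -9],
    [1, 1,  0, -1, 0, -9],
])

def irtree_recode(resp):
    """Split each response into two pseudo-items:
    node 1: attempted (1) vs skipped (0);
    node 2: correct (1) vs incorrect (0), defined only when attempted.
    Not-reached responses stay missing at both nodes."""
    attempted = np.where(resp == -9, np.nan,
                         np.where(resp == -1, 0.0, 1.0))
    correct = np.where((resp == 1) | (resp == 0), resp.astype(float), np.nan)
    return attempted, correct

node1, node2 = irtree_recode(responses)
print("node 1 (attempt):\n", node1)
print("node 2 (correct | attempted):\n", node2)
# Each node matrix can then be fit with an ordinary IRT model.
```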
Sinharay, Sandip; Wan, Ping; Choi, Seung W.; Kim, Dong-In – Journal of Educational Measurement, 2015
With an increase in the number of online tests, the number of interruptions during testing due to unexpected technical issues seems to be on the rise. For example, interruptions occurred during several recent state tests. When interruptions occur, it is important to determine the extent of their impact on the examinees' scores. Researchers such as…
Descriptors: Computer Assisted Testing, Testing Problems, Scores, Statistical Analysis
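A hedged sketch of one regression-based way to gauge interruption impact, in the spirit of (but not identical to) the analyses discussed: predict later section scores from earlier ones using the uninterrupted group, then inspect the interrupted examinees' residuals. All data below are simulated for the sketch.

```python
import numpy as np

rng = np.random.default_rng(2)

# Invented data: section scores before and after the point of interruption.
n = 2000
pre = rng.normal(50, 10, size=n)
post = 0.8 * pre + rng.normal(10, 6, size=n)
interrupted = rng.random(size=n) < 0.1
post[interrupted] -= 2.0          # built-in score drop for the sketch

# Fit the prediction line on uninterrupted examinees only.
slope, intercept = np.polyfit(pre[~interrupted], post[~interrupted], 1)
residuals = post - (slope * pre + intercept)

print("mean residual, uninterrupted:", round(residuals[~interrupted].mean(), 2))
print("mean residual, interrupted  :", round(residuals[interrupted].mean(), 2))
# A markedly negative mean residual for the interrupted group suggests impact.
```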
Sinharay, Sandip; Wan, Ping; Whitaker, Mike; Kim, Dong-In; Zhang, Litong; Choi, Seung W. – Journal of Educational Measurement, 2014
With an increase in the number of online tests, interruptions during testing due to unexpected technical issues seem unavoidable. For example, interruptions occurred during several recent state tests. When interruptions occur, it is important to determine the extent of their impact on the examinees' scores. There is a lack of research on this…
Descriptors: Computer Assisted Testing, Testing Problems, Scores, Regression (Statistics)
Longford, Nicholas T. – Journal of Educational and Behavioral Statistics, 2014
A method for medical screening is adapted to differential item functioning (DIF). Its essential elements are explicit declarations of the level of DIF that is acceptable and of the loss function that quantifies the consequences of the two kinds of inappropriate classification of an item. Instead of a single level and a single function, sets of…
Descriptors: Test Items, Test Bias, Simulation, Hypothesis Testing
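As a hedged illustration of the standard DIF quantification to which such loss-based classification could be attached (a Mantel-Haenszel odds ratio on the ETS delta scale, with a simple size rule standing in for the author's loss-function screening; all counts invented):

```python
import numpy as np

# Invented 2x2 tables per ability stratum: columns are
# [ref_correct, ref_incorrect, focal_correct, focal_incorrect].
strata = np.array([
    [80, 20, 60, 40],
    [70, 30, 55, 45],
    [50, 50, 35, 65],
])

A, B, C, D = strata.T          # reference right/wrong, focal right/wrong
N = strata.sum(axis=1)

# Mantel-Haenszel common odds ratio and the ETS delta-scale effect size.
alpha_mh = np.sum(A * D / N) / np.sum(B * C / N)
delta_mh = -2.35 * np.log(alpha_mh)

# Simple ETS-style size classification (A/B/C) as the decision to which a
# loss function could attach different misclassification costs.
size = abs(delta_mh)
category = "A" if size < 1.0 else ("B" if size < 1.5 else "C")
print("alpha_MH =", round(alpha_mh, 3),
      "delta_MH =", round(delta_mh, 3),
      "class:", category)
```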
Feinberg, Richard A. – ProQuest LLC, 2012
Subscores, also known as domain scores, diagnostic scores, or trait scores, can help determine test-takers' relative strengths and weaknesses and appropriately focus remediation. However, subscores often have poor psychometric properties, particularly reliability and distinctiveness (Folske, Gessaroli, & Swanson, 1999; Monaghan, 2006;…
Descriptors: Simulation, Tests, Testing, Scores
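A minimal sketch of the reliability concern the abstract raises: a subscore built from a few items typically has much lower internal consistency than the full test. The sketch computes Cronbach's alpha on invented simulated data and is not the dissertation's methodology.

```python
import numpy as np

rng = np.random.default_rng(3)

def cronbach_alpha(items):
    """Cronbach's alpha for an examinees-by-items score matrix."""
    k = items.shape[1]
    item_var = items.var(axis=0, ddof=1).sum()
    total_var = items.sum(axis=1).var(ddof=1)
    return k / (k - 1) * (1 - item_var / total_var)

# Simulate 40 dichotomous items driven by one latent trait (invented model).
n, k = 5000, 40
theta = rng.normal(size=n)
b = rng.normal(size=k)
p = 1 / (1 + np.exp(-(theta[:, None] - b[None, :])))
x = rng.binomial(1, p)

print("alpha, full 40-item test:", round(cronbach_alpha(x), 3))
print("alpha, 5-item subscore  :", round(cronbach_alpha(x[:, :5]), 3))
# The subscore's much lower alpha illustrates why subscores are often too
# unreliable to report on their own.
```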
Guarino, Cassandra M.; Reckase, Mark D.; Stacy, Brian W.; Wooldridge, Jeffrey M. – Education Policy Center at Michigan State University, 2014
We study the properties of two specification tests that have been applied to a variety of estimators in the context of value-added measures (VAMs) of teacher and school quality: the Hausman test for choosing between random and fixed effects and a test for feedback (sometimes called a "falsification test"). We discuss theoretical…
Descriptors: Achievement Gains, Evaluation Methods, Teacher Effectiveness, Educational Quality
Zopluoglu, Cengiz; Davenport, Ernest C., Jr. – Online Submission, 2011
The purpose of this study was to examine the effects of answer copying on the ability level estimates of cheater examinees in answer copying pairs. The study generated answer copying pairs for each of 1440 conditions: source ability (12) x cheater ability (12) x amount of copying (10). The average difference between the ability level estimates…
Descriptors: Cheating, Multiple Choice Tests, Ability, High Stakes Tests
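A small simulation in the spirit of this design, for a single source-copier pair under a Rasch model with invented values: copy a fraction of the source's answers into the copier's response vector and compare the copier's ability estimate before and after copying.

```python
import numpy as np
from scipy.optimize import minimize_scalar

rng = np.random.default_rng(4)

def p_rasch(theta, b):
    return 1 / (1 + np.exp(-(theta - b)))

def mle_theta(x, b):
    """Maximum likelihood ability estimate under the Rasch model."""
    nll = lambda t: -np.sum(x * np.log(p_rasch(t, b)) +
                            (1 - x) * np.log(1 - p_rasch(t, b)))
    return minimize_scalar(nll, bounds=(-4, 4), method="bounded").x

# Invented 60-item test; low-ability copier, high-ability source.
b = rng.normal(size=60)
source_theta, copier_theta = 1.5, -1.0
source = rng.binomial(1, p_rasch(source_theta, b))
copier = rng.binomial(1, p_rasch(copier_theta, b))

# The copier copies 40% of the items from the source.
copied_items = rng.choice(60, size=24, replace=False)
copier_copied = copier.copy()
copier_copied[copied_items] = source[copied_items]

print("theta_hat without copying:", round(mle_theta(copier, b), 3))
print("theta_hat with copying   :", round(mle_theta(copier_copied, b), 3))
# The inflated second estimate shows how copying biases the cheater's score.
```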