ERIC - Search Results

Showing all 6 results

Performance of Person-Fit Statistics under Model Misspecification

Peer reviewed

Hong, Seong Eun; Monroe, Scott; Falk, Carl F. – Journal of Educational Measurement, 2020

In educational and psychological measurement, a person-fit statistic (PFS) is designed to identify aberrant response patterns. For parametric PFSs, valid inference depends on several assumptions, one of which is that the item response theory (IRT) model is correctly specified. Previous studies have used empirical data sets to explore the effects…

Descriptors: Educational Testing, Psychological Testing, Goodness of Fit, Error of Measurement

Effect of Violating Unidimensional Item Response Theory Vertical Scaling Assumptions on Developmental Score Scales

Topczewski, Anna Marie – ProQuest LLC, 2013

Developmental score scales represent the performance of students along a continuum, where as students learn more they move higher along that continuum. Unidimensional item response theory (UIRT) vertical scaling has become a commonly used method to create developmental score scales. Research has shown that UIRT vertical scaling methods can be…

Descriptors: Item Response Theory, Scaling, Scores, Student Development

A Review of ETS Differential Item Functioning Assessment Procedures: Flagging Rules, Minimum Sample Size Requirements, and Criterion Refinement. Research Report. ETS RR-12-08

Peer reviewed

Zwick, Rebecca – ETS Research Report Series, 2012

Differential item functioning (DIF) analysis is a key component in the evaluation of the fairness and validity of educational tests. The goal of this project was to review the status of ETS DIF analysis procedures, focusing on three aspects: (a) the nature and stringency of the statistical rules used to flag items, (b) the minimum sample size…

Descriptors: Test Bias, Sample Size, Bayesian Statistics, Evaluation Methods

Detecting Answer Copying Using the Kappa Statistic

Peer reviewed

Sotaridona, Leonardo S.; van der Linden, Wim J.; Meijer, Rob R. – Applied Psychological Measurement, 2006

A statistical test for detecting answer copying on multiple-choice tests based on Cohen's kappa is proposed. The test is free of any assumptions on the response processes of the examinees suspected of copying and having served as the source, except for the usual assumption that these processes are probabilistic. Because the asymptotic null and…

Descriptors: Cheating, Test Items, Simulation, Statistical Analysis

Assigning Adaptive NAEP Booklets Based on State Assessment Scores: A Simulation Study of the Impact on Standard Errors

Linn, Bob; McLaughlin, Don; Jiang, Tao; Gallagher, Larry – American Institutes for Research, 2004

The purpose of this simulation was to assess the improvements in estimates of standard errors that could be expected if students participating in NAEP were pre-assigned to test booklets that were adapted to their level of performance based on their state assessment scores. Students in extreme quartiles would receive one regular NAEP block and…

Descriptors: Educational Improvement, Educational Assessment, Error of Measurement, Educational Testing

Operational Characteristics of a Rasch Model Tailored Testing Procedure when Program Parameters and Item Pool Attributes are Varied.

Patience, Wayne M.; Reckase, Mark D. – 1979

Simulated tailored tests were used to investigate the relationships between characteristics of the item pool and the computer program, and the reliability and bias of the resulting ability estimates. The computer program was varied to provide for various step sizes (differences in difficulty between successive steps) and different acceptance…

Descriptors: Adaptive Testing, Computer Assisted Testing, Computer Programs, Educational Testing