Showing all 5 results
Peer reviewed
Direct link
Dimitrov, Dimiter M.; Atanasov, Dimitar V.; Luo, Yong – Measurement: Interdisciplinary Research and Perspectives, 2020
This study examines and compares four person-fit statistics (PFSs) in the framework of the "D"-scoring method (DSM): (a) van der Flier's "U3" statistic; (b) "Ud" statistic, as a modification of "U3" under the DSM; (c) "Zd" statistic, as a modification of the "Z3" ("l_z")…
Descriptors: Goodness of Fit, Item Analysis, Item Response Theory, Scoring
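The abstract above names van der Flier's "U3" statistic. As a rough illustration of the idea behind such person-fit statistics, the following is a minimal sketch of a standard U3 computation (function name, bandwidth-free log-odds weighting, and edge-case convention are ours; this is not the paper's implementation):

```python
import numpy as np

def u3_statistic(x, p):
    """van der Flier's U3 person-fit statistic for one 0/1 response
    vector x, given item proportion-correct values p.

    Illustrative sketch: U3 = 0 for a perfect Guttman pattern (all
    correct answers on the easiest items), U3 = 1 for the fully
    reversed pattern.
    """
    x = np.asarray(x, dtype=float)
    p = np.asarray(p, dtype=float)
    w = np.log(p / (1 - p))       # item log-odds weights
    order = np.argsort(-p)        # sort items from easiest to hardest
    x, w = x[order], w[order]
    r = int(x.sum())              # number-correct score
    if r == 0 or r == len(x):
        return 0.0                # U3 undefined at extreme scores; 0 by convention here
    best = w[:r].sum()            # weight sum for the Guttman pattern
    worst = w[-r:].sum()          # weight sum for the reversed pattern
    return (best - (x * w).sum()) / (best - worst)

# A Guttman-consistent pattern on items ordered easy -> hard:
print(u3_statistic([1, 1, 1, 0, 0], [0.9, 0.8, 0.6, 0.4, 0.2]))  # -> 0.0
```

Larger values flag response vectors that are inconsistent with the item difficulty ordering, which is the behavior the "Ud" and "Zd" modifications described above adapt to the D-scoring framework.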
Peer reviewed
Direct link
Abulela, Mohammed A. A.; Rios, Joseph A. – Applied Measurement in Education, 2022
When there are no personal consequences associated with test performance for examinees, rapid guessing (RG) is a concern and can differ between subgroups. To date, the impact of differential RG on item-level measurement invariance has received minimal attention. To that end, a simulation study was conducted to examine the robustness of the…
Descriptors: Comparative Analysis, Robustness (Statistics), Nonparametric Statistics, Item Analysis
Peer reviewed
PDF on ERIC Download full text
Guo, Hongwen; Zu, Jiyun; Kyllonen, Patrick; Schmitt, Neal – ETS Research Report Series, 2016
In this report, systematic applications of statistical and psychometric methods are used to develop and evaluate scoring rules in terms of test reliability. Data collected from a situational judgment test are used to facilitate the comparison. For a well-developed item with appropriate keys (i.e., the correct answers), agreement among various…
Descriptors: Scoring, Test Reliability, Statistical Analysis, Psychometrics
Peer reviewed
Direct link
Guo, Hongwen; Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2011
Nonparametric or kernel regression estimation of item response curves (IRCs) is often used in item analysis in testing programs. These estimates are biased when the observed scores are used as the regressor because the observed scores are contaminated by measurement error. Accuracy of this estimation is a concern theoretically and operationally.…
Descriptors: Testing Programs, Measurement, Item Analysis, Error of Measurement
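The abstract above concerns kernel regression estimates of item response curves (IRCs) that use the observed total score as the regressor. A minimal Nadaraya-Watson sketch of that estimator follows (function name, Gaussian kernel, and bandwidth are our illustrative choices, not the authors'):

```python
import numpy as np

def kernel_irc(scores, responses, grid, h=2.0):
    """Nadaraya-Watson kernel estimate of an item response curve,
    P(item correct | total score), evaluated at each point of `grid`.

    Illustrative sketch. Note that `scores` here are *observed* totals:
    as the abstract points out, observed scores carry measurement
    error, which biases this kind of estimate.
    """
    scores = np.asarray(scores, dtype=float)
    responses = np.asarray(responses, dtype=float)
    out = np.empty(len(grid))
    for j, t in enumerate(grid):
        k = np.exp(-0.5 * ((scores - t) / h) ** 2)  # Gaussian weights around t
        out[j] = (k * responses).sum() / k.sum()     # weighted proportion correct
    return out
```

For a well-behaved item the estimated curve rises with the score; the bias issue the paper studies arises because the regressor is the error-contaminated observed score rather than the true score.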
Peer reviewed
Direct link
Lei, Pui-Wa; Dunbar, Stephen B.; Kolen, Michael J. – Educational and Psychological Measurement, 2004
This study compares the parametric multiple-choice model and the nonparametric kernel smoothing approach to estimating option characteristic functions (OCCs) using an empirical criterion: the stability of curve estimates over occasions, which represents random error. The potential utility of graphical OCCs in item analysis was illustrated with…
Descriptors: Nonparametric Statistics, Multiple Choice Tests, Item Analysis, Item Response Theory
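The abstract above mentions kernel smoothing of option characteristic functions, which extends item-level smoothing to every answer option of a multiple-choice item. A minimal sketch of that idea follows (function name, Gaussian kernel, and bandwidth are our assumptions; this is not the authors' procedure):

```python
import numpy as np

def kernel_occs(scores, choices, options, grid, h=2.0):
    """Kernel-smoothed option characteristic functions: for each answer
    option, the smoothed proportion of examinees near each score point
    who selected that option.

    Illustrative sketch only. Returns a dict mapping option label to an
    array of smoothed proportions over `grid`.
    """
    scores = np.asarray(scores, dtype=float)
    choices = np.asarray(choices)
    occs = {}
    for opt in options:
        ind = (choices == opt).astype(float)  # 1 if this option was chosen
        curve = []
        for t in grid:
            k = np.exp(-0.5 * ((scores - t) / h) ** 2)  # Gaussian weights
            curve.append((k * ind).sum() / k.sum())
        occs[opt] = np.array(curve)
    return occs
```

Because every examinee picks exactly one option, the smoothed curves sum to one at each score point, so plotting them together shows how distractor choices shift with ability, which is the kind of graphical item analysis the abstract describes.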