Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 3 |
Since 2006 (last 20 years) | 4 |
Descriptor
Comparative Analysis | 5 |
Item Analysis | 5 |
Nonparametric Statistics | 5 |
Item Response Theory | 4 |
Error of Measurement | 2 |
Guessing (Tests) | 2 |
Regression (Statistics) | 2 |
Responses | 2 |
Scoring | 2 |
Test Items | 2 |
Achievement Tests | 1 |
Source
Applied Measurement in… | 1 |
ETS Research Report Series | 1 |
Educational and Psychological… | 1 |
Journal of Educational and… | 1 |
Measurement:… | 1 |
Author
Guo, Hongwen | 2 |
Abulela, Mohammed A. A. | 1 |
Atanasov, Dimitar V. | 1 |
Dimitrov, Dimiter M. | 1 |
Dunbar, Stephen B. | 1 |
Kolen, Michael J. | 1 |
Kyllonen, Patrick | 1 |
Lei, Pui-Wa | 1 |
Luo, Yong | 1 |
Rios, Joseph A. | 1 |
Schmitt, Neal | 1 |
Publication Type
Journal Articles | 5 |
Reports - Research | 4 |
Reports - Evaluative | 1 |
Education Level
Secondary Education | 1 |
Assessments and Surveys
Program for International… | 1 |
Dimitrov, Dimiter M.; Atanasov, Dimitar V.; Luo, Yong – Measurement: Interdisciplinary Research and Perspectives, 2020
This study examines and compares four person-fit statistics (PFSs) in the framework of the "D"-scoring method (DSM): (a) van der Flier's "U3" statistic; (b) "Ud" statistic, as a modification of "U3" under the DSM; (c) "Zd" statistic, as a modification of the "Z3 (l_z)"…
Descriptors: Goodness of Fit, Item Analysis, Item Response Theory, Scoring
Abulela, Mohammed A. A.; Rios, Joseph A. – Applied Measurement in Education, 2022
When there are no personal consequences associated with test performance for examinees, rapid guessing (RG) is a concern and can differ between subgroups. To date, the impact of differential RG on item-level measurement invariance has received minimal attention. To that end, a simulation study was conducted to examine the robustness of the…
Descriptors: Comparative Analysis, Robustness (Statistics), Nonparametric Statistics, Item Analysis
Guo, Hongwen; Zu, Jiyun; Kyllonen, Patrick; Schmitt, Neal – ETS Research Report Series, 2016
In this report, systematic applications of statistical and psychometric methods are used to develop and evaluate scoring rules in terms of test reliability. Data collected from a situational judgment test are used to facilitate the comparison. For a well-developed item with appropriate keys (i.e., the correct answers), agreement among various…
Descriptors: Scoring, Test Reliability, Statistical Analysis, Psychometrics
Guo, Hongwen; Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2011
Nonparametric or kernel regression estimation of item response curves (IRCs) is often used in item analysis in testing programs. These estimates are biased when the observed scores are used as the regressor because the observed scores are contaminated by measurement error. Accuracy of this estimation is a concern theoretically and operationally.…
Descriptors: Testing Programs, Measurement, Item Analysis, Error of Measurement
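As a rough illustration of the kernel regression idea named in the abstract above, the sketch below estimates an item response curve with a generic Nadaraya-Watson smoother and a Gaussian kernel. It is not the authors' procedure and does not correct for the measurement-error bias the abstract describes; the function name `kernel_irc`, the bandwidth value, and the toy data are all hypothetical.

```python
import math


def kernel_irc(scores, responses, grid, bandwidth):
    """Nadaraya-Watson estimate of P(correct | score) at each grid point.

    scores    -- observed test scores used as the regressor (assumed input)
    responses -- 0/1 responses to a single item, one per examinee
    grid      -- score values at which to evaluate the curve
    bandwidth -- Gaussian kernel bandwidth (an illustrative choice)
    """
    estimates = []
    for g in grid:
        # Gaussian kernel weight of each examinee relative to grid point g.
        weights = [math.exp(-0.5 * ((s - g) / bandwidth) ** 2) for s in scores]
        total = sum(weights)
        # Weighted proportion correct near g.
        weighted_correct = sum(w * r for w, r in zip(weights, responses))
        estimates.append(weighted_correct / total)
    return estimates


# Toy data (hypothetical): observed total scores and 0/1 item responses.
scores = [2, 3, 5, 6, 8, 9, 11, 12, 14, 15]
responses = [0, 0, 0, 1, 0, 1, 1, 1, 1, 1]
grid = [3, 6, 9, 12, 15]

curve = kernel_irc(scores, responses, grid, bandwidth=2.0)
for g, p in zip(grid, curve):
    print(f"score {g:>2}: estimated P(correct) = {p:.3f}")
```

Because the regressor here is the observed score rather than the true score, a curve estimated this way carries the bias that the abstract attributes to measurement error.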
Lei, Pui-Wa; Dunbar, Stephen B.; Kolen, Michael J. – Educational and Psychological Measurement, 2004
This study compares the parametric multiple-choice model and the nonparametric kernel smoothing approach to estimating option characteristic functions (OCCs) using an empirical criterion, the stability of curve estimates over occasions that represents random error. The potential utility of graphical OCCs in item analysis was illustrated with…
Descriptors: Nonparametric Statistics, Multiple Choice Tests, Item Analysis, Item Response Theory