ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	2
Since 2006 (last 20 years)	3

Descriptor

Error Patterns	5
Evaluation Methods	5
Models	3
Evaluators	2
Test Items	2
Bayesian Statistics	1
Cognitive Measurement	1
Comparative Analysis	1
Computation	1
Criteria	1
Data Collection	1
Educational Assessment	1
Elementary Education	1
Elementary School Students	1
Goodness of Fit	1
Grade 8	1
Higher Education	1
Hypothesis Testing	1
Interrater Reliability	1
Measurement Techniques	1
Medical Education	1
Medical Students	1
Monte Carlo Methods	1
Performance Based Assessment	1
Prediction	1
More ▼

Source

Journal of Educational…

Author

Clauser, Brian E.	1
Clyman, Stephen G.	1
Engelhard, George, Jr.	1
Hou, Likun	1
Jones, Eli	1
Joo, Seang-Hwane	1
Lee, Philseok	1
Nandakumar, Ratna	1
Swanson, David B.	1
Wind, Stefanie A.	1
de la Torre, Jimmy	1
More ▼

Publication Type

Journal Articles	5
Reports - Research	3
Reports - Evaluative	2

Education Level

Audience

Location

Georgia

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 5 results Save | Export

Detecting Differential Item Functioning Using Posterior Predictive Model Checking: A Comparison of Discrepancy Statistics

Peer reviewed

Direct link

Joo, Seang-Hwane; Lee, Philseok – Journal of Educational Measurement, 2022

Abstract This study proposes a new Bayesian differential item functioning (DIF) detection method using posterior predictive model checking (PPMC). Item fit measures including infit, outfit, observed score distribution (OSD), and Q1 were considered as discrepancy statistics for the PPMC DIF methods. The performance of the PPMC DIF method was…

Descriptors: Test Items, Bayesian Statistics, Monte Carlo Methods, Prediction

The Effects of Incomplete Rating Designs in Combination with Rater Effects

Peer reviewed

Direct link

Wind, Stefanie A.; Jones, Eli – Journal of Educational Measurement, 2019

Researchers have explored a variety of topics related to identifying and distinguishing among specific types of rater effects, as well as the implications of different types of incomplete data collection designs for rater-mediated assessments. In this study, we used simulated data to examine the sensitivity of latent trait model indicators of…

Descriptors: Rating Scales, Models, Evaluators, Data Collection

Differential Item Functioning Assessment in Cognitive Diagnostic Modeling: Application of the Wald Test to Investigate DIF in the DINA Model

Peer reviewed

Direct link

Hou, Likun; de la Torre, Jimmy; Nandakumar, Ratna – Journal of Educational Measurement, 2014

Analyzing examinees' responses using cognitive diagnostic models (CDMs) has the advantage of providing diagnostic information. To ensure the validity of the results from these models, differential item functioning (DIF) in CDMs needs to be investigated. In this article, the Wald test is proposed to examine DIF in the context of CDMs. This study…

Descriptors: Test Bias, Models, Simulation, Error Patterns

Components of Rater Error in a Complex Performance Assessment.

Peer reviewed

Clauser, Brian E.; Clyman, Stephen G.; Swanson, David B. – Journal of Educational Measurement, 1999

Two studies focused on aspects of the rating process in performance assessment. The first, which involved 15 raters and about 400 medical students, made the "committee" facet of raters working in groups explicit, and the second, which involved about 200 medical students and four raters, made the "rating-occasion" facet…

Descriptors: Error Patterns, Evaluation Methods, Evaluators, Higher Education

Examining Rater Errors in the Assessment of Written Composition with a Many-Faceted Rasch Model.

Peer reviewed

Engelhard, George, Jr. – Journal of Educational Measurement, 1994

Rater errors (rater severity, halo effect, central tendency, and restriction of range) are described, and criteria are presented for evaluating rating quality based on a many-faceted Rasch (FACETS) model. Ratings of 264 compositions from the Eighth Grade Writing Test in Georgia by 15 raters illustrate the discussion. (SLD)

Descriptors: Criteria, Educational Assessment, Elementary Education, Elementary School Students