ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	4
Since 2006 (last 20 years)	7

Descriptor

Data Collection	7
Equated Scores	3
Statistical Analysis	3
Comparative Analysis	2
Computation	2
Models	2
Test Items	2
Accuracy	1
Bias	1
Criteria	1
Documentation	1
Educational Assessment	1
Error Patterns	1
Evaluation	1
Evaluation Methods	1
Evaluators	1
Eye Movements	1
Feedback (Response)	1
Goodness of Fit	1
Innovation	1
Measurement	1
Methods	1
National Competency Tests	1
Network Analysis	1
Problem Solving	1
More ▼

Source

Journal of Educational…

Author

Baldwin, Peter	1
Bradlow, Eric T.	1
Branberg, Kenny	1
Clauser, Brian E.	1
Daria Gerasimova	1
Jones, Eli	1
Shu, Zhan	1
Wiberg, Marie	1
Wind, Stefanie A.	1
Zhu, Mengxiao	1
van der Linden, Wim J.	1
von Davier, Alina A.	1
More ▼

Publication Type

Journal Articles	7
Reports - Research	3
Reports - Descriptive	2
Opinion Papers	1
Reports - Evaluative	1

Education Level

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

National Assessment of…

What Works Clearinghouse Rating

Showing all 7 results Save | Export

Argument-Based Approach to Validity: Developing a Living Document and Incorporating Preregistration

Peer reviewed

Direct link

Daria Gerasimova – Journal of Educational Measurement, 2024

I propose two practical advances to the argument-based approach to validity: developing a living document and incorporating preregistration. First, I present a potential structure for the living document that includes an up-to-date summary of the validity argument. As the validation process may span across multiple studies, the living document…

Descriptors: Validity, Documentation, Methods, Research Reports

Historical Perspectives on Score Comparability Issues Raised by Innovations in Testing

Peer reviewed

Direct link

Baldwin, Peter; Clauser, Brian E. – Journal of Educational Measurement, 2022

While score comparability across test forms typically relies on common (or randomly equivalent) examinees or items, innovations in item formats, test delivery, and efforts to extend the range of score interpretation may require a special data collection before examinees or items can be used in this way--or may be incompatible with common examinee…

Descriptors: Scoring, Testing, Test Items, Test Format

The Effects of Incomplete Rating Designs in Combination with Rater Effects

Peer reviewed

Direct link

Wind, Stefanie A.; Jones, Eli – Journal of Educational Measurement, 2019

Researchers have explored a variety of topics related to identifying and distinguishing among specific types of rater effects, as well as the implications of different types of incomplete data collection designs for rater-mediated assessments. In this study, we used simulated data to examine the sensitivity of latent trait model indicators of…

Descriptors: Rating Scales, Models, Evaluators, Data Collection

Comments on "Some Conceptual Issues in Observed-Score Equating" by Wim J. van der Linden

Peer reviewed

Direct link

Bradlow, Eric T. – Journal of Educational Measurement, 2013

The van der Linden article (this issue) provides a roadmap for future research in equating. My belief is that the roadmap begins and ends with collecting auxiliary data that can be utilized to provide improved equating, especially when data are sparse or equating beyond simple moments is desired.

Descriptors: Equated Scores, Data Collection, Statistical Analysis, Research

Using Networks to Visualize and Analyze Process Data for Educational Assessment

Peer reviewed

Direct link

Zhu, Mengxiao; Shu, Zhan; von Davier, Alina A. – Journal of Educational Measurement, 2016

New technology enables interactive and adaptive scenario-based tasks (SBTs) to be adopted in educational measurement. At the same time, it is a challenging problem to build appropriate psychometric models to analyze data collected from these tasks, due to the complexity of the data. This study focuses on process data collected from SBTs. We…

Descriptors: Measurement, Data Collection, National Competency Tests, Scoring Rubrics

Some Conceptual Issues in Observed-Score Equating

Peer reviewed

Direct link

van der Linden, Wim J. – Journal of Educational Measurement, 2013

In spite of all of the technical progress in observed-score equating, several of the more conceptual aspects of the process still are not well understood. As a result, the equating literature struggles with rather complex criteria of equating, lack of a test-theoretic foundation, confusing terminology, and ad hoc analyses. A return to Lord's…

Descriptors: Equated Scores, Statistical Analysis, Computation, Data Collection

Observed Score Linear Equating with Covariates

Peer reviewed

Direct link

Branberg, Kenny; Wiberg, Marie – Journal of Educational Measurement, 2011

This paper examined observed score linear equating in two different data collection designs, the equivalent groups design and the nonequivalent groups design, when information from covariates (i.e., background variables correlated with the test scores) was included. The main purpose of the study was to examine the effect (i.e., bias, variance, and…

Descriptors: Equated Scores, Data Collection, Models, Accuracy