ERIC - Search Results

Publication Date

In 2025	1
Since 2024	1
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	2
Since 2006 (last 20 years)	4

Descriptor

Data Collection	5
Test Construction	3
Educational Assessment	2
Scores	2
Test Items	2
Test Validity	2
Tests	2
Accuracy	1
Automation	1
COVID-19	1
Correlation	1
Data Analysis	1
Documentation	1
Equated Scores	1
Error of Measurement	1
Evaluation Methods	1
Federal Legislation	1
Generalizability Theory	1
Inferences	1
Item Response Theory	1
Longitudinal Studies	1
Measurement	1
Measures (Individuals)	1
Pandemics	1
Psychometrics	1
More ▼

Source

Educational Measurement:…

Author

Kolen, Michael J.	2
An, Lily Shiao	1
Davis, Laurie Laughlin	1
Ho, Andrew Dean	1
Nichols, Paul D.	1
Stella Y. Kim	1
Sungyeun Kim	1
Tong, Ye	1
Williams, Natasha	1

Publication Type

Journal Articles	5
Reports - Descriptive	3
Reports - Research	2
Speeches/Meeting Papers	1

Education Level

Elementary Secondary Education

Audience

Location

Laws, Policies, & Programs

No Child Left Behind Act 2001

Assessments and Surveys

Iowa Tests of Basic Skills	1
Iowa Tests of Educational…	1
National Assessment of…	1

What Works Clearinghouse Rating

Showing all 5 results Save | Export

Generalizability Theory Approach to Analyzing Automated-Item Generated Test Forms

Peer reviewed

Direct link

Stella Y. Kim; Sungyeun Kim – Educational Measurement: Issues and Practice, 2025

This study presents several multivariate Generalizability theory designs for analyzing automatic item-generated (AIG) based test forms. The study used real data to illustrate the analysis procedure and discuss practical considerations. We collected the data from two groups of students, each group receiving a different form generated by AIG. A…

Descriptors: Generalizability Theory, Automation, Test Items, Students

Disrupted Data: Using Longitudinal Assessment Systems to Monitor Test Score Quality

Peer reviewed

Direct link

An, Lily Shiao; Ho, Andrew Dean; Davis, Laurie Laughlin – Educational Measurement: Issues and Practice, 2022

Technical documentation for educational tests focuses primarily on properties of individual scores at single points in time. Reliability, standard errors of measurement, item parameter estimates, fit statistics, and linking constants are standard technical features that external stakeholders use to evaluate items and individual scale scores.…

Descriptors: Documentation, Scores, Evaluation Methods, Longitudinal Studies

Consequences of Test Score Use as Validity Evidence: Roles and Responsibilities

Peer reviewed

Direct link

Nichols, Paul D.; Williams, Natasha – Educational Measurement: Issues and Practice, 2009

This article has three goals. The first goal is to clarify the role that the consequences of test score use play in validity judgments by reviewing the role that modern writers on validity have ascribed for consequences in supporting validity judgments. The second goal is to summarize current views on who is responsible for collecting evidence of…

Descriptors: Tests, Test Validity, Scores, Data Collection

Scaling: An Items Module

Peer reviewed

Direct link

Tong, Ye; Kolen, Michael J. – Educational Measurement: Issues and Practice, 2010

"Scaling" is the process of constructing a score scale that associates numbers or other ordered indicators with the performance of examinees. Scaling typically is conducted to aid users in interpreting test results. This module describes different types of raw scores and scale scores, illustrates how to incorporate various sources of…

Descriptors: Test Results, Scaling, Measures (Individuals), Raw Scores

Linking Assessments Effectively: Purpose and Design.

Peer reviewed

Kolen, Michael J. – Educational Measurement: Issues and Practice, 2001

Discusses some practical issues in linking educational assessments, focusing on the importance of clarity of purpose when assessments are linked. Also stresses the importance of the design used to collect data for linking. Uses linking studies from a variety of situations to illustrate these points. (SLD)

Descriptors: Data Collection, Educational Assessment, Equated Scores, Research Design