Showing all 6 results
Woodruff, David; Traynor, Anne; Cui, Zhongmin; Fang, Yu – ACT, Inc., 2013
Professional standards for educational testing recommend that both the overall standard error of measurement and the conditional standard error of measurement (CSEM) be computed on the score scale used to report scores to examinees. Several methods have been developed to compute scale score CSEMs. This paper compares three methods, based on…
Descriptors: Comparative Analysis, Error of Measurement, Scores, Scaling
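The truncated abstract does not name the three methods the paper compares, so as a hedged illustration only: one classical way to compute a raw-score CSEM is Lord's binomial-error formula, under which the error variance of a number-correct score x on an n-item test is x(n - x)/(n - 1).

```python
import math

def binomial_csem(raw_score: int, n_items: int) -> float:
    """Lord's binomial-error CSEM for a number-correct score x on an
    n-item test: sqrt(x * (n - x) / (n - 1)). Illustrative sketch; the
    paper's actual methods operate on the reported score scale."""
    if not 0 <= raw_score <= n_items:
        raise ValueError("raw score must lie between 0 and n_items")
    return math.sqrt(raw_score * (n_items - raw_score) / (n_items - 1))

# CSEM is largest near the middle of the score range, zero at the extremes.
print(binomial_csem(0, 40))   # 0.0
print(binomial_csem(20, 40))  # ~3.20
```

Scale-score CSEM methods then transform this conditional error through the raw-to-scale conversion rather than reporting it on the raw metric.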
Peer reviewed
Full text available on ERIC (PDF)
Gorard, Stephen; Hordosy, Rita; Siddiqui, Nadia – International Education Studies, 2013
This paper re-considers the widespread use of value-added approaches to estimate school "effects", and shows the results to be very unstable over time. The paper uses as an example the contextualised value-added scores of all secondary schools in England. The study asks how many schools with at least 99% of their pupils included in the…
Descriptors: Foreign Countries, Outcomes of Education, Secondary Education, Educational Testing
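A minimal sketch of the value-added idea the paper examines, under the simplifying assumption that value-added is just the residual from regressing outcome scores on prior attainment (England's contextualised value-added model adjusted for many more pupil characteristics):

```python
def value_added(prior: list[float], outcome: list[float]) -> list[float]:
    """Value-added as residuals from a simple least-squares regression of
    outcome on prior attainment. Illustrative only: a positive residual
    means a unit scored above what its intake predicted."""
    n = len(prior)
    mx = sum(prior) / n
    my = sum(outcome) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(prior, outcome))
    sxx = sum((x - mx) ** 2 for x in prior)
    slope = sxy / sxx
    intercept = my - slope * mx
    return [y - (intercept + slope * x) for x, y in zip(prior, outcome)]
```

The instability the paper reports would show up here as residuals that correlate weakly when the same schools are re-estimated in successive years.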
Peer reviewed
Full text available on ERIC (PDF)
Han, Kyung T. – Practical Assessment, Research & Evaluation, 2012
For several decades, the "three-parameter logistic model" (3PLM) has been the dominant choice for practitioners in the field of educational measurement for modeling examinees' response data from multiple-choice (MC) items. Past studies, however, have pointed out that the c-parameter of 3PLM should not be interpreted as a guessing…
Descriptors: Statistical Analysis, Models, Multiple Choice Tests, Guessing (Tests)
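For reference, the 3PLM the abstract discusses gives the probability of a correct response as P(theta) = c + (1 - c) / (1 + exp(-a(theta - b))); a short sketch (parameter values below are made up for illustration):

```python
import math

def p_correct_3pl(theta: float, a: float, b: float, c: float) -> float:
    """Probability of a correct response under the three-parameter
    logistic model: a is discrimination, b is difficulty, and c is the
    lower asymptote, often called the 'guessing' parameter -- the very
    interpretation the abstract questions. Some formulations multiply
    the exponent by the scaling constant D = 1.7; it is omitted here."""
    return c + (1.0 - c) / (1.0 + math.exp(-a * (theta - b)))

# The lower asymptote keeps the curve above c even at very low ability:
print(round(p_correct_3pl(-5.0, a=1.2, b=0.0, c=0.2), 3))  # close to 0.2
print(round(p_correct_3pl(0.0, a=1.2, b=0.0, c=0.2), 3))   # 0.6 at theta == b
```

Note that at theta = b the probability is c + (1 - c)/2, not 0.5, which is one reason c is better read as a lower asymptote than as a literal guessing rate.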
Peer reviewed
Direct link
Chang, Yuan-chin Ivan; Lu, Hung-Yi – Psychometrika, 2010
Item calibration is an essential issue in modern psychological and educational testing based on item response theory. Due to the popularity of computerized adaptive testing, methods to efficiently calibrate new items have become more important than in the era when paper-and-pencil test administration was the norm. There are many calibration…
Descriptors: Test Items, Educational Testing, Adaptive Testing, Measurement
Peer reviewed
Full text available on ERIC (PDF)
Olsen, Robert B.; Unlu, Fatih; Price, Cristofer; Jaciw, Andrew P. – National Center for Education Evaluation and Regional Assistance, 2011
This report examines the differences in impact estimates and standard errors that arise when these are derived using state achievement tests only (as pre-tests and post-tests), study-administered tests only, or some combination of state- and study-administered tests. State tests may yield different evaluation results relative to a test that is…
Descriptors: Achievement Tests, Standardized Tests, State Standards, Reading Achievement
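The report's core quantities, an impact estimate and its standard error, can be sketched in their simplest form as a difference in means with an unpooled two-sample SE (an illustrative assumption; evaluations like this one typically use regression adjustment for the pre-test instead):

```python
import math

def impact_and_se(treatment: list[float], control: list[float]) -> tuple[float, float]:
    """Difference-in-means impact estimate and its unpooled two-sample
    standard error: SE = sqrt(s_t^2/n_t + s_c^2/n_c). Sketch only."""
    def mean(xs: list[float]) -> float:
        return sum(xs) / len(xs)

    def var(xs: list[float]) -> float:
        m = mean(xs)
        return sum((x - m) ** 2 for x in xs) / (len(xs) - 1)

    impact = mean(treatment) - mean(control)
    se = math.sqrt(var(treatment) / len(treatment) + var(control) / len(control))
    return impact, se
```

Swapping the test that supplies the scores (state vs. study-administered) changes both numbers, which is exactly the comparison the report makes.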
Peer reviewed
Direct link
Haberman, Shelby J. – Journal of Educational and Behavioral Statistics, 2008
In educational tests, subscores are often generated from a portion of the items in a larger test. Guidelines based on mean squared error are proposed to indicate whether subscores are worth reporting. Alternatives considered are direct reports of subscores, estimates of subscores based on total score, combined estimates based on subscores and…
Descriptors: Testing Programs, Regression (Statistics), Scores, Student Evaluation
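Haberman's mean-squared-error criterion can be sketched, in heavily simplified form, as a comparison of two proportional reductions in MSE: the subscore-based estimate of the true subscore (whose PRMSE equals the subscore reliability) against the total-score-based estimate (whose PRMSE is the squared correlation between true subscore and observed total). The function name and inputs below are illustrative, not the paper's notation.

```python
def subscore_has_added_value(subscore_reliability: float,
                             r_true_subscore_total: float) -> bool:
    """Simplified Haberman-style check: report the subscore only when it
    predicts the true subscore better than the total score does, i.e.
    when its reliability exceeds the squared correlation between the
    true subscore and the observed total score."""
    return subscore_reliability > r_true_subscore_total ** 2
```

Under this criterion, a highly reliable subscore can still fail to add value if the total score predicts its true score almost as well, which happens when subscores are strongly correlated with the rest of the test.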