ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	1
Since 2006 (last 20 years)	7

Descriptor

Error of Measurement	8
Scores	8
Testing Programs	8
Academic Achievement	3
Measurement	3
Accuracy	2
Computation	2
Educational Improvement	2
Educational Testing	2
Effect Size	2
Expertise	2
Federal Programs	2
Grade 3	2
Inferences	2
Item Response Theory	2
Regression (Statistics)	2
Sampling	2
State Standards	2
Student Evaluation	2
Test Results	2
Testing	2
Academic Degrees	1
Academic Persistence	1
Academic Records	1
Achievement Tests	1
More ▼

Source

Council of Chief State School…	2
Journal of Educational and…	2
Applied Measurement in…	1
ETS Research Report Series	1
National Center for Education…	1
National Center for Education…	1

Author

Haberman, Shelby J.	2
Brockmann, Frank	1
Chen, Wen-Hung	1
Doorey, Nancy A.	1
Ferrara, Steve	1
Guo, Hongwen	1
Horn, Laura	1
Jaciw, Andrew P.	1
Johnson, Eugene	1
Olsen, Robert B.	1
Price, Cristofer	1
Radford, Alexandria Walton	1
Sinharay, Sandip	1
Unlu, Fatih	1
More ▼

Publication Type

Journal Articles	4
Reports - Evaluative	3
Reports - Research	3
Numerical/Quantitative Data	2
Reports - Descriptive	2

Education Level

Elementary Secondary Education	2
Grade 3	2
Higher Education	2
Postsecondary Education	2
Elementary Education	1
Grade 2	1
High Schools	1
Secondary Education	1

Audience

Policymakers	1
Teachers	1

Location

Arizona	1
California	1
Missouri	1

Laws, Policies, & Programs

Assessments and Surveys

Graduate Record Examinations	1
Praxis Series	1
Test of English as a Foreign…	1

What Works Clearinghouse Rating

Showing all 8 results Save | Export

Application of Best Linear Prediction and Penalized Best Linear Prediction to ETS Tests. Research Report. ETS RR-20-08

Peer reviewed
PDF on ERIC

Download full text

Haberman, Shelby J. – ETS Research Report Series, 2020

Best linear prediction (BLP) and penalized best linear prediction (PBLP) are techniques for combining sources of information to produce task scores, section scores, and composite test scores. The report examines issues to consider in operational implementation of BLP and PBLP in testing programs administered by ETS [Educational Testing Service].

Descriptors: Prediction, Scores, Tests, Testing Programs

Nonparametric Item Response Curve Estimation with Correction for Measurement Error

Peer reviewed

Direct link

Guo, Hongwen; Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2011

Nonparametric or kernel regression estimation of item response curves (IRCs) is often used in item analysis in testing programs. These estimates are biased when the observed scores are used as the regressor because the observed scores are contaminated by measurement error. Accuracy of this estimation is a concern theoretically and operationally.…

Descriptors: Testing Programs, Measurement, Item Analysis, Error of Measurement

Commonly Unrecognized Error Variance in Statewide Assessment Programs: Sources of Error Variance and What Can Be Done to Reduce Them

Download full text

Brockmann, Frank – Council of Chief State School Officers, 2011

State testing programs today are more extensive than ever, and their results are required to serve more purposes and high-stakes decisions than one might have imagined. Assessment results are used to hold schools, districts, and states accountable for student performance and to help guide a multitude of important decisions. This report describes…

Descriptors: Accuracy, Measurement, Testing, Expertise

Addressing Two Commonly Unrecognized Sources of Score Instability in Annual State Assessments

Download full text

Doorey, Nancy A. – Council of Chief State School Officers, 2011

The work reported in this paper reflects a collaborative effort of many individuals representing multiple organizations. It began during a session at the October 2008 meeting of TILSA when a representative of a member state asked the group if any of their programs had experienced unexpected fluctuations in the annual state assessment scores, and…

Descriptors: Testing, Sampling, Expertise, Testing Programs

Estimating the Impacts of Educational Interventions Using State Tests or Study-Administered Tests. NCEE 2012-4016

Peer reviewed
PDF on ERIC

Download full text

Olsen, Robert B.; Unlu, Fatih; Price, Cristofer; Jaciw, Andrew P. – National Center for Education Evaluation and Regional Assistance, 2011

This report examines the differences in impact estimates and standard errors that arise when these are derived using state achievement tests only (as pre-tests and post-tests), study-administered tests only, or some combination of state- and study-administered tests. State tests may yield different evaluation results relative to a test that is…

Descriptors: Achievement Tests, Standardized Tests, State Standards, Reading Achievement

When Can Subscores Have Value?

Peer reviewed

Direct link

Haberman, Shelby J. – Journal of Educational and Behavioral Statistics, 2008

In educational tests, subscores are often generated from a portion of the items in a larger test. Guidelines based on mean squared error are proposed to indicate whether subscores are worth reporting. Alternatives considered are direct reports of subscores, estimates of subscores based on total score, combined estimates based on subscores and…

Descriptors: Testing Programs, Regression (Statistics), Scores, Student Evaluation

An Overview of Classes Taken and Credits Earned by Beginning Postsecondary Students. WEB Tables. NCES 2013-151rev

Peer reviewed
PDF on ERIC

Download full text

Radford, Alexandria Walton; Horn, Laura – National Center for Education Statistics, 2012

These Web Tables provide an overview of classes taken and credits earned by a nationwide sample of first-time beginning postsecondary students based on data from the Postsecondary Education Transcript Study (PETS) of the 2004/09 Beginning Postsecondary Students Longitudinal Study. PETS collected transcripts from all the postsecondary institutions…

Descriptors: Postsecondary Education, College Freshmen, Academic Records, Longitudinal Studies

Vertically Articulated Performance Standards: Logic, Procedures, and Likely Classification Accuracy

Peer reviewed

Direct link

Ferrara, Steve; Johnson, Eugene; Chen, Wen-Hung – Applied Measurement in Education, 2005

Psychometricians continue to develop and evaluate methods for linking test scores, both horizontally and vertically. This article describes a social moderation process for articulating (i.e., linking) performance standards across grade levels for an operational state assessment program. The researchers used generated data to evaluate the likely…

Descriptors: Grade 2, Grade 3, Scores, Error of Measurement