Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 1 |
Since 2006 (last 20 years) | 7 |
Descriptor
Error of Measurement | 8 |
Scores | 8 |
Testing Programs | 8 |
Academic Achievement | 3 |
Measurement | 3 |
Accuracy | 2 |
Computation | 2 |
Educational Improvement | 2 |
Educational Testing | 2 |
Effect Size | 2 |
Expertise | 2 |
More ▼ |
Source
Council of Chief State School… | 2 |
Journal of Educational and… | 2 |
Applied Measurement in… | 1 |
ETS Research Report Series | 1 |
National Center for Education… | 1 |
National Center for Education… | 1 |
Author
Haberman, Shelby J. | 2 |
Brockmann, Frank | 1 |
Chen, Wen-Hung | 1 |
Doorey, Nancy A. | 1 |
Ferrara, Steve | 1 |
Guo, Hongwen | 1 |
Horn, Laura | 1 |
Jaciw, Andrew P. | 1 |
Johnson, Eugene | 1 |
Olsen, Robert B. | 1 |
Price, Cristofer | 1 |
More ▼ |
Publication Type
Journal Articles | 4 |
Reports - Evaluative | 3 |
Reports - Research | 3 |
Numerical/Quantitative Data | 2 |
Reports - Descriptive | 2 |
Education Level
Elementary Secondary Education | 2 |
Grade 3 | 2 |
Higher Education | 2 |
Postsecondary Education | 2 |
Elementary Education | 1 |
Grade 2 | 1 |
High Schools | 1 |
Secondary Education | 1 |
Audience
Policymakers | 1 |
Teachers | 1 |
Location
Arizona | 1 |
California | 1 |
Missouri | 1 |
Laws, Policies, & Programs
Assessments and Surveys
Graduate Record Examinations | 1 |
Praxis Series | 1 |
Test of English as a Foreign… | 1 |
What Works Clearinghouse Rating
Haberman, Shelby J. – ETS Research Report Series, 2020
Best linear prediction (BLP) and penalized best linear prediction (PBLP) are techniques for combining sources of information to produce task scores, section scores, and composite test scores. The report examines issues to consider in operational implementation of BLP and PBLP in testing programs administered by ETS [Educational Testing Service].
Descriptors: Prediction, Scores, Tests, Testing Programs
Guo, Hongwen; Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2011
Nonparametric or kernel regression estimation of item response curves (IRCs) is often used in item analysis in testing programs. These estimates are biased when the observed scores are used as the regressor because the observed scores are contaminated by measurement error. Accuracy of this estimation is a concern theoretically and operationally.…
Descriptors: Testing Programs, Measurement, Item Analysis, Error of Measurement
Brockmann, Frank – Council of Chief State School Officers, 2011
State testing programs today are more extensive than ever, and their results are required to serve more purposes and high-stakes decisions than one might have imagined. Assessment results are used to hold schools, districts, and states accountable for student performance and to help guide a multitude of important decisions. This report describes…
Descriptors: Accuracy, Measurement, Testing, Expertise
Doorey, Nancy A. – Council of Chief State School Officers, 2011
The work reported in this paper reflects a collaborative effort of many individuals representing multiple organizations. It began during a session at the October 2008 meeting of TILSA when a representative of a member state asked the group if any of their programs had experienced unexpected fluctuations in the annual state assessment scores, and…
Descriptors: Testing, Sampling, Expertise, Testing Programs
Olsen, Robert B.; Unlu, Fatih; Price, Cristofer; Jaciw, Andrew P. – National Center for Education Evaluation and Regional Assistance, 2011
This report examines the differences in impact estimates and standard errors that arise when these are derived using state achievement tests only (as pre-tests and post-tests), study-administered tests only, or some combination of state- and study-administered tests. State tests may yield different evaluation results relative to a test that is…
Descriptors: Achievement Tests, Standardized Tests, State Standards, Reading Achievement
Haberman, Shelby J. – Journal of Educational and Behavioral Statistics, 2008
In educational tests, subscores are often generated from a portion of the items in a larger test. Guidelines based on mean squared error are proposed to indicate whether subscores are worth reporting. Alternatives considered are direct reports of subscores, estimates of subscores based on total score, combined estimates based on subscores and…
Descriptors: Testing Programs, Regression (Statistics), Scores, Student Evaluation
Radford, Alexandria Walton; Horn, Laura – National Center for Education Statistics, 2012
These Web Tables provide an overview of classes taken and credits earned by a nationwide sample of first-time beginning postsecondary students based on data from the Postsecondary Education Transcript Study (PETS) of the 2004/09 Beginning Postsecondary Students Longitudinal Study. PETS collected transcripts from all the postsecondary institutions…
Descriptors: Postsecondary Education, College Freshmen, Academic Records, Longitudinal Studies
Ferrara, Steve; Johnson, Eugene; Chen, Wen-Hung – Applied Measurement in Education, 2005
Psychometricians continue to develop and evaluate methods for linking test scores, both horizontally and vertically. This article describes a social moderation process for articulating (i.e., linking) performance standards across grade levels for an operational state assessment program. The researchers used generated data to evaluate the likely…
Descriptors: Grade 2, Grade 3, Scores, Error of Measurement