von Davier, Alina A. – ETS Research Report Series, 2012
Maintaining comparability of test scores is a major challenge faced by testing programs that have almost continuous administrations. Among the potential problems are scale drift and rapid accumulation of errors. Many standard quality control techniques for testing programs, which can effectively detect and address scale drift for small numbers of…
Descriptors: Quality Control, Data Analysis, Trend Analysis, Scaling
Quellmalz, Edys S.; Timms, Michael J.; Silberglitt, Matt D.; Buckley, Barbara C. – Journal of Research in Science Teaching, 2012
This article reports on the collaboration of six states to study how simulation-based science assessments can become transformative components of multi-level, balanced state science assessment systems. The project studied the psychometric quality, feasibility, and utility of simulation-based science assessments designed to serve formative purposes…
Descriptors: State Programs, Educational Assessment, Simulated Environment, Grade 6
Rock, Donald A. – ETS Research Report Series, 2012
This paper provides a history of ETS's role in developing assessment instruments and psychometric procedures for measuring change in large-scale national assessments funded by the Longitudinal Studies branch of the National Center for Education Statistics. It documents the innovations developed during more than 30 years of working with…
Descriptors: Models, Educational Change, Longitudinal Studies, Educational Development
Ferrara, Steve; Perie, Marianne; Johnson, Eugene – Journal of Applied Testing Technology, 2008
Psychometricians continue to introduce new approaches to setting cut scores for educational assessments in an attempt to improve on current methods. In this paper we describe the Item-Descriptor (ID) Matching method, a method based on IRT item mapping. In ID Matching, test content area experts match items (i.e., their judgments about the knowledge…
Descriptors: Test Results, Test Content, Testing Programs, Educational Testing

Congdon, Peter J.; McQueen, Joy – Journal of Educational Measurement, 2000
Studied the stability of rater severity over an extended rating period by applying multifaceted Rasch analysis to ratings of 16 raters of writing performances of 8,285 elementary school students. Findings cast doubt on the practice of using a single calibration of rater severity as the basis for adjustment of person measures. (SLD)
Descriptors: Educational Assessment, Elementary Education, Elementary School Students, Interrater Reliability
Sireci, Stephen G. – 1996
Test developers continue to struggle with the technical and logistical problems inherent in assessing achievement across different languages. Many testing programs offer separate language versions of a test to evaluate the achievement of examinees in different language groups. However, comparison of individuals who took different language versions…
Descriptors: Achievement Tests, Bilingual Education, Comparative Analysis, Educational Assessment

Forsyth, Robert A. – Educational Measurement: Issues and Practice, 1991
The scales of the National Assessment of Educational Progress (NAEP), as constructed, do not yield meaningful criterion-referenced interpretations. Poorly defined NAEP goals and the present knowledge base do not allow the measurement of what examinees can and cannot do. Inappropriate interpretations of NAEP data are discussed, with specific…
Descriptors: Achievement Tests, Criterion Referenced Tests, Educational Assessment, Item Response Theory
Bock, R. Darrell – 1991
The scoring method that will be applied in the current 12th-grade science assessment project of the National Science Foundation and the Office of Educational Research and Assessment is described. The method, "graded mark-point" scoring, is modeled after procedures developed by P. Tamir for use in the performance exercises of the Israeli…
Descriptors: Educational Assessment, Evaluators, Grade 12, Grading
Kahl, Stuart R.; And Others – 1995
The assessment instruments of the Maine Educational Assessment emphasize extended constructed-response questions. The results from these assessments are reported in terms of percentages of students at four performance levels. The Student-Based Constructed-Response Method was used to establish performance standards for these levels on the…
Descriptors: Academic Standards, Achievement Tests, Constructed Response, Cutting Scores
Linn, Robert L., Ed. – 1993
This collection explores the theory and applications of educational testing. It is divided into sections on theory and general principles of educational measurement, administration of tests and scoring, and applications of testing. The following chapters present information on test theory and use: (1) "Current Perspectives and Future…
Descriptors: Ability, Achievement Tests, Admission Criteria, Cognitive Psychology