ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	0
Since 2006 (last 20 years)	9

Descriptor

Educational Testing	13
Error of Measurement	13
Scores	7
Correlation	5
Measurement	5
Teacher Effectiveness	5
Teacher Evaluation	5
Educational Assessment	4
Educational Policy	4
Academic Achievement	3
Computation	3
Evaluation Problems	3
Longitudinal Studies	3
Measurement Techniques	3
Student Evaluation	3
Test Theory	3
Achievement Gains	2
Credentials	2
Cutting Scores	2
Educational Research	2
Effect Size	2
Evaluation Methods	2
Item Response Theory	2
Models	2
Personnel Policy	2
More ▼

Source

National Center for Analysis…	2
ACT, Inc.	1
American Educational Research…	1
American Institutes for…	1
Applied Psychological…	1
Carnegie Foundation for the…	1
Educational Measurement:…	1
Journal of Educational and…	1
Policy Analysis for…	1
Psychological Assessment	1
Psychometrika	1
More ▼

Publication Type

Reports - Evaluative	13
Journal Articles	6
Speeches/Meeting Papers	3
Numerical/Quantitative Data	1

Education Level

Elementary Secondary Education	6
Elementary Education	1
Grade 3	1
Grade 4	1
Grade 5	1
Higher Education	1
Postsecondary Education	1

Audience

Location

California	2
New York	2
Germany	1
Illinois	1
New Jersey	1
North Carolina	1
Tennessee	1
Texas	1

Laws, Policies, & Programs

No Child Left Behind Act 2001

Assessments and Surveys

ACT Assessment	1
Stanford Achievement Tests	1

What Works Clearinghouse Rating

Showing all 13 results Save | Export

A Comparison of Three Methods for Computing Scale Score Conditional Standard Errors of Measurement. ACT Research Report Series, 2013 (7)

Download full text

Woodruff, David; Traynor, Anne; Cui, Zhongmin; Fang, Yu – ACT, Inc., 2013

Professional standards for educational testing recommend that both the overall standard error of measurement and the conditional standard error of measurement (CSEM) be computed on the score scale used to report scores to examinees. Several methods have been developed to compute scale score CSEMs. This paper compares three methods, based on…

Descriptors: Comparative Analysis, Error of Measurement, Scores, Scaling

How Stable Are Value-Added Estimates across Years, Subjects and Student Groups? What We Know Series: Value-Added Methods and Applications. Knowledge Brief 3

Download full text

Loeb, Susanna; Candelaria, Christopher A. – Carnegie Foundation for the Advancement of Teaching, 2012

Value-added models measure teacher performance by the test score gains of their students, adjusted for a variety of factors such as the performance of students when they enter the class. The measures are based on desired student outcomes such as math and reading scores, but they have a number of potential drawbacks. One of them is the…

Descriptors: Academic Achievement, Teacher Effectiveness, Scores, Peer Influence

Online Calibration via Variable Length Computerized Adaptive Testing

Peer reviewed

Direct link

Chang, Yuan-chin Ivan; Lu, Hung-Yi – Psychometrika, 2010

Item calibration is an essential issue in modern item response theory based psychological or educational testing. Due to the popularity of computerized adaptive testing, methods to efficiently calibrate new items have become more important than that in the time when paper and pencil test administration is the norm. There are many calibration…

Descriptors: Test Items, Educational Testing, Adaptive Testing, Measurement

When Can Subscores Have Value?

Peer reviewed

Direct link

Haberman, Shelby J. – Journal of Educational and Behavioral Statistics, 2008

In educational tests, subscores are often generated from a portion of the items in a larger test. Guidelines based on mean squared error are proposed to indicate whether subscores are worth reporting. Alternatives considered are direct reports of subscores, estimates of subscores based on total score, combined estimates based on subscores and…

Descriptors: Testing Programs, Regression (Statistics), Scores, Student Evaluation

Different Tests, Different Answers: The Stability of Teacher Value-Added Estimates across Outcome Measures

Peer reviewed

Direct link

Papay, John P. – American Educational Research Journal, 2011

Recently, educational researchers and practitioners have turned to value-added models to evaluate teacher performance. Although value-added estimates depend on the assessment used to measure student achievement, the importance of outcome selection has received scant attention in the literature. Using data from a large, urban school district, I…

Descriptors: Urban Schools, Teacher Effectiveness, Reading Achievement, Achievement Tests

Value-Added Measures of Education Performance: Clearing Away the Smoke and Mirrors. Policy Brief 10-4

Direct link

Harris, Douglas N. – Policy Analysis for California Education, PACE (NJ3), 2010

In this policy brief, the author explores the problems with attainment measures when it comes to evaluating performance at the school level, and explores the best uses of value-added measures. These value-added measures, the author writes, are useful for sorting out-of-school influences from school influences or from teacher performance, giving…

Descriptors: Principals, Observation, Teacher Evaluation, Measurement Techniques

Using Value-Added Measures of Teacher Quality. Brief 9

Download full text

Hanushek, Eric A.; Rivkin, Steven G. – National Center for Analysis of Longitudinal Data in Education Research, 2010

Extensive education research on the contribution of teachers to student achievement produces two generally accepted results. First, teacher quality varies substantially as measured by the value added to student achievement or future academic attainment or earnings. Second, variables often used to determine entry into the profession and…

Descriptors: Credentials, Teacher Effectiveness, Models, Teacher Qualifications

Performance Assessments with Microworlds and Their Difficulty

Peer reviewed

Direct link

Kluge, Annette – Applied Psychological Measurement, 2008

The use of microworlds (MWs), or complex dynamic systems, in educational testing and personnel selection is hampered by systematic measurement errors because these new and innovative item formats are not adequately controlled for their difficulty. This empirical study introduces a way to operationalize an MW's difficulty and demonstrates the…

Descriptors: Personnel Selection, Self Efficacy, Educational Testing, Computer Uses in Education

Classical Test Theory in Historical Perspective.

Peer reviewed

Traub, Ross E. – Educational Measurement: Issues and Practice, 1997

Classical test theory is founded on the proposition that measurement error, a random latent variable, is a component of the observed score random variable. This article traces the history of the development of classical test theory, beginning in the early 20th century. (SLD)

Descriptors: Educational History, Educational Testing, Error of Measurement, Psychometrics

Cut Scores and Testing: Statistics, Judgment, Truth, and Error.

Peer reviewed

Dwyer, Carol Anne – Psychological Assessment, 1996

The uses and abuses of cut scores are examined. The article demonstrates (1) that cut scores always entail judgment; (2) that cut scores inherently result in misclassification; (3) that cut scores impose an artificial dichotomy on an essentially continuous distribution of knowledge, skill, or ability; and (4) that no true cut scores exist. (SLD)

Descriptors: Classification, Cutting Scores, Educational Testing, Error of Measurement

Assigning Adaptive NAEP Booklets Based on State Assessment Scores: A Simulation Study of the Impact on Standard Errors

Download full text

Linn, Bob; McLaughlin, Don; Jiang, Tao; Gallagher, Larry – American Institutes for Research, 2004

The purpose of this simulation was to assess the improvements in estimates of standard errors that could be expected if students participating in NAEP were pre-assigned to test booklets that were adapted to their level of performance based on their state assessment scores. Students in extreme quartiles would receive one regular NAEP block and…

Descriptors: Educational Improvement, Educational Assessment, Error of Measurement, Educational Testing

Measuring Effect Sizes: The Effect of Measurement Error. Working Paper 19

Download full text

Boyd, Donald; Grossman, Pamela; Lankford, Hamilton; Loeb, Susanna; Wyckoff, James – National Center for Analysis of Longitudinal Data in Education Research, 2008

Value-added models in education research allow researchers to explore how a wide variety of policies and measured school inputs affect the academic performance of students. Researchers typically quantify the impacts of such interventions in terms of "effect sizes", i.e., the estimated effect of a one standard deviation change in the…

Descriptors: Credentials, Teacher Effectiveness, Models, Teacher Qualifications

A Compromise Grading Model for Classroom Tests.

Download full text

Johanson, George A. – 1992

Most educational measurement texts distinguish between norm-referenced (NR), or relative, methods of assigning letter grades to objective test scores, and criterion-referenced (CR), or absolute, methods. Both NR and CR approaches have serious limitations in typical classroom situations, and neither approach, in its pure form, may be entirely…

Descriptors: Criterion Referenced Tests, Cutting Scores, Educational Testing, Error of Measurement

Loeb, Susanna	2
Boyd, Donald	1
Candelaria, Christopher A.	1
Chang, Yuan-chin Ivan	1
Cui, Zhongmin	1
Dwyer, Carol Anne	1
Fang, Yu	1
Gallagher, Larry	1
Grossman, Pamela	1
Haberman, Shelby J.	1
Hanushek, Eric A.	1
Harris, Douglas N.	1
Jiang, Tao	1
Johanson, George A.	1
Kluge, Annette	1
Lankford, Hamilton	1
Linn, Bob	1
Lu, Hung-Yi	1
McLaughlin, Don	1
Papay, John P.	1
Rivkin, Steven G.	1
Traub, Ross E.	1
Traynor, Anne	1
Woodruff, David	1
Wyckoff, James	1
More ▼