Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 0 |
Since 2006 (last 20 years) | 9 |
Descriptor
Educational Testing | 13 |
Error of Measurement | 13 |
Scores | 7 |
Correlation | 5 |
Measurement | 5 |
Teacher Effectiveness | 5 |
Teacher Evaluation | 5 |
Educational Assessment | 4 |
Educational Policy | 4 |
Academic Achievement | 3 |
Computation | 3 |
More ▼ |
Source
Author
Loeb, Susanna | 2 |
Boyd, Donald | 1 |
Candelaria, Christopher A. | 1 |
Chang, Yuan-chin Ivan | 1 |
Cui, Zhongmin | 1 |
Dwyer, Carol Anne | 1 |
Fang, Yu | 1 |
Gallagher, Larry | 1 |
Grossman, Pamela | 1 |
Haberman, Shelby J. | 1 |
Hanushek, Eric A. | 1 |
More ▼ |
Publication Type
Reports - Evaluative | 13 |
Journal Articles | 6 |
Speeches/Meeting Papers | 3 |
Numerical/Quantitative Data | 1 |
Education Level
Elementary Secondary Education | 6 |
Elementary Education | 1 |
Grade 3 | 1 |
Grade 4 | 1 |
Grade 5 | 1 |
Higher Education | 1 |
Postsecondary Education | 1 |
Audience
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
ACT Assessment | 1 |
Stanford Achievement Tests | 1 |
What Works Clearinghouse Rating
Woodruff, David; Traynor, Anne; Cui, Zhongmin; Fang, Yu – ACT, Inc., 2013
Professional standards for educational testing recommend that both the overall standard error of measurement and the conditional standard error of measurement (CSEM) be computed on the score scale used to report scores to examinees. Several methods have been developed to compute scale score CSEMs. This paper compares three methods, based on…
Descriptors: Comparative Analysis, Error of Measurement, Scores, Scaling
Loeb, Susanna; Candelaria, Christopher A. – Carnegie Foundation for the Advancement of Teaching, 2012
Value-added models measure teacher performance by the test score gains of their students, adjusted for a variety of factors such as the performance of students when they enter the class. The measures are based on desired student outcomes such as math and reading scores, but they have a number of potential drawbacks. One of them is the…
Descriptors: Academic Achievement, Teacher Effectiveness, Scores, Peer Influence
Chang, Yuan-chin Ivan; Lu, Hung-Yi – Psychometrika, 2010
Item calibration is an essential issue in modern item response theory based psychological or educational testing. Due to the popularity of computerized adaptive testing, methods to efficiently calibrate new items have become more important than that in the time when paper and pencil test administration is the norm. There are many calibration…
Descriptors: Test Items, Educational Testing, Adaptive Testing, Measurement
Haberman, Shelby J. – Journal of Educational and Behavioral Statistics, 2008
In educational tests, subscores are often generated from a portion of the items in a larger test. Guidelines based on mean squared error are proposed to indicate whether subscores are worth reporting. Alternatives considered are direct reports of subscores, estimates of subscores based on total score, combined estimates based on subscores and…
Descriptors: Testing Programs, Regression (Statistics), Scores, Student Evaluation
Papay, John P. – American Educational Research Journal, 2011
Recently, educational researchers and practitioners have turned to value-added models to evaluate teacher performance. Although value-added estimates depend on the assessment used to measure student achievement, the importance of outcome selection has received scant attention in the literature. Using data from a large, urban school district, I…
Descriptors: Urban Schools, Teacher Effectiveness, Reading Achievement, Achievement Tests
Harris, Douglas N. – Policy Analysis for California Education, PACE (NJ3), 2010
In this policy brief, the author explores the problems with attainment measures when it comes to evaluating performance at the school level, and explores the best uses of value-added measures. These value-added measures, the author writes, are useful for sorting out-of-school influences from school influences or from teacher performance, giving…
Descriptors: Principals, Observation, Teacher Evaluation, Measurement Techniques
Hanushek, Eric A.; Rivkin, Steven G. – National Center for Analysis of Longitudinal Data in Education Research, 2010
Extensive education research on the contribution of teachers to student achievement produces two generally accepted results. First, teacher quality varies substantially as measured by the value added to student achievement or future academic attainment or earnings. Second, variables often used to determine entry into the profession and…
Descriptors: Credentials, Teacher Effectiveness, Models, Teacher Qualifications
Kluge, Annette – Applied Psychological Measurement, 2008
The use of microworlds (MWs), or complex dynamic systems, in educational testing and personnel selection is hampered by systematic measurement errors because these new and innovative item formats are not adequately controlled for their difficulty. This empirical study introduces a way to operationalize an MW's difficulty and demonstrates the…
Descriptors: Personnel Selection, Self Efficacy, Educational Testing, Computer Uses in Education

Traub, Ross E. – Educational Measurement: Issues and Practice, 1997
Classical test theory is founded on the proposition that measurement error, a random latent variable, is a component of the observed score random variable. This article traces the history of the development of classical test theory, beginning in the early 20th century. (SLD)
Descriptors: Educational History, Educational Testing, Error of Measurement, Psychometrics

Dwyer, Carol Anne – Psychological Assessment, 1996
The uses and abuses of cut scores are examined. The article demonstrates (1) that cut scores always entail judgment; (2) that cut scores inherently result in misclassification; (3) that cut scores impose an artificial dichotomy on an essentially continuous distribution of knowledge, skill, or ability; and (4) that no true cut scores exist. (SLD)
Descriptors: Classification, Cutting Scores, Educational Testing, Error of Measurement
Linn, Bob; McLaughlin, Don; Jiang, Tao; Gallagher, Larry – American Institutes for Research, 2004
The purpose of this simulation was to assess the improvements in estimates of standard errors that could be expected if students participating in NAEP were pre-assigned to test booklets that were adapted to their level of performance based on their state assessment scores. Students in extreme quartiles would receive one regular NAEP block and…
Descriptors: Educational Improvement, Educational Assessment, Error of Measurement, Educational Testing
Boyd, Donald; Grossman, Pamela; Lankford, Hamilton; Loeb, Susanna; Wyckoff, James – National Center for Analysis of Longitudinal Data in Education Research, 2008
Value-added models in education research allow researchers to explore how a wide variety of policies and measured school inputs affect the academic performance of students. Researchers typically quantify the impacts of such interventions in terms of "effect sizes", i.e., the estimated effect of a one standard deviation change in the…
Descriptors: Credentials, Teacher Effectiveness, Models, Teacher Qualifications
Johanson, George A. – 1992
Most educational measurement texts distinguish between norm-referenced (NR), or relative, methods of assigning letter grades to objective test scores, and criterion-referenced (CR), or absolute, methods. Both NR and CR approaches have serious limitations in typical classroom situations, and neither approach, in its pure form, may be entirely…
Descriptors: Criterion Referenced Tests, Cutting Scores, Educational Testing, Error of Measurement