Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 1 |
Since 2006 (last 20 years) | 2 |
Descriptor
Error of Measurement | 10 |
Measurement Techniques | 10 |
Testing Problems | 10 |
Scores | 6 |
Test Reliability | 4 |
Interrater Reliability | 3 |
Latent Trait Theory | 3 |
Test Items | 3 |
Academic Achievement | 2 |
Correlation | 2 |
Cutting Scores | 2 |
More ▼ |
Source
International Journal of… | 1 |
Journal of Educational and… | 1 |
Journal of Research and… | 1 |
Journal of Studies in… | 1 |
Author
Busch, John Christian | 1 |
De Santi, Roger J. | 1 |
Doss, David | 1 |
Foster, Jeff L. | 1 |
Jaeger, Richard M. | 1 |
Ligon, Glynn | 1 |
Linn, Robert L. | 1 |
Lord, Frederic M. | 1 |
Luftig, Jeffrey T. | 1 |
Meyer, Kevin D. | 1 |
Norton, Willis P. | 1 |
More ▼ |
Publication Type
Reports - Research | 7 |
Journal Articles | 4 |
Speeches/Meeting Papers | 4 |
Reports - Descriptive | 2 |
Reports - Evaluative | 1 |
Education Level
Audience
Researchers | 4 |
Location
Laws, Policies, & Programs
Assessments and Surveys
New Jersey College Basic… | 1 |
Sequential Tests of… | 1 |
What Works Clearinghouse Rating
Thissen, David – Journal of Educational and Behavioral Statistics, 2016
David Thissen, a professor in the Department of Psychology and Neuroscience, Quantitative Program at the University of North Carolina, has consulted and served on technical advisory committees for assessment programs that use item response theory (IRT) over the past couple decades. He has come to the conclusion that there are usually two purposes…
Descriptors: Item Response Theory, Test Construction, Testing Problems, Student Evaluation

Luftig, Jeffrey T.; Norton, Willis P. – Journal of Studies in Technical Careers, 1981
The purpose of this article is to review applications of reliability formulas and to recommend more appropriate methods of determining the reliability of affective instruments. (SK)
Descriptors: Affective Measures, Error of Measurement, Measurement Techniques, Test Reliability

De Santi, Roger J.; Sullivan, Vicki Gallo – Journal of Research and Development in Education, 1985
Cloze-based evaluations of reading comprehension present room for a greater amount of subjectivity in rating reader response. A study was designed to ascertain the nature of potential subjectivity within a single-rater's ratings of cloze-based assessments of reading comprehension. (DF)
Descriptors: Cloze Procedure, Elementary Secondary Education, Error of Measurement, Interrater Reliability
Doss, David; Ligon, Glynn – 1985
Upon learning that a form of the Sequential Tests of Educational Progress was incorrectly distributed to an unidentified number of high school students along with an answer sheet pregridded with an alternate test form, the Austin Independent School District performed the following research analyses: (1) scored the tests using the key for each…
Descriptors: Educational Testing, Error of Measurement, Latent Trait Theory, Measurement Techniques
Meyer, Kevin D.; Foster, Jeff L. – International Journal of Testing, 2008
With the increasing globalization of human resources practices, a commensurate increase in demand has occurred for multi-language ("global") personality norms for use in selection and development efforts. The combination of data from multiple translations of a personality assessment into a single norm engenders error from multiple sources. This…
Descriptors: Global Approach, Cultural Differences, Norms, Human Resources
Shale, Doug – 1986
This study is an attempt at a cohesive characterization of the concept of essay reliability. As such, it takes as a basic premise that previous and current practices in reporting reliability estimates for essay tests have certain shortcomings. The study provides an analysis of these shortcomings--partly to encourage a fuller understanding of the…
Descriptors: Analysis of Variance, Correlation, Error of Measurement, Essay Tests
Lord, Frederic M. – 1983
If a loss function is available specifying the social cost of an error of measurement in the score on a unidimensional test, an asymptotic method, based on item response theory, is developed for optimal test design for a specified target population of examinees. Since in the real world such loss functions are not available, it is more useful to…
Descriptors: Cutting Scores, Decision Making, Error of Measurement, Estimation (Mathematics)
Smith, Richard M. – 1983
Previous studies of test item bias have investigated how different groups of examinees perform differently on a given set of items. These studies imply that examinees should be treated in a certain way because they are of a particular sex or race rather than as individuals in their own right, but it is unrealistic and unfair to assume such an…
Descriptors: Academic Ability, Error of Measurement, Error Patterns, Higher Education
Jaeger, Richard M.; Busch, John Christian – 1986
This study explores the use of the modified caution index (MCI) for identifying judges whose patterns of recommendations suggest that their judgments might be based on incomplete information, flawed reasoning, or inattention to their standard-setting tasks. It also examines the effect on test standards and passing rates when the test standards of…
Descriptors: Criterion Referenced Tests, Error of Measurement, Evaluation Methods, High Schools
Werts, Charles E.; Linn, Robert L. – 1975
Forming a sequence covering the various aspects of the simplex model, four articles are presented here under the following titles: "A Simplex Model for Analyzing Academic Growth", "Analyzing Ratings With Correlated Intrajudge Measurement Errors", "The Correlation of States With Gain", and "The Reliability of…
Descriptors: Academic Achievement, Achievement Gains, Analysis of Covariance, College Students