ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	1
Since 2006 (last 20 years)	2

Descriptor

Error of Measurement	10
Measurement Techniques	10
Testing Problems	10
Scores	6
Test Reliability	4
Interrater Reliability	3
Latent Trait Theory	3
Test Items	3
Academic Achievement	2
Correlation	2
Cutting Scores	2
Evaluation Methods	2
Goodness of Fit	2
Psychometrics	2
Research Methodology	2
Scoring	2
Test Construction	2
Test Interpretation	2
Academic Ability	1
Achievement Gains	1
Adaptive Testing	1
Affective Measures	1
Analysis of Covariance	1
Analysis of Variance	1
Business Administration	1
More ▼

Source

International Journal of…	1
Journal of Educational and…	1
Journal of Research and…	1
Journal of Studies in…	1

Author

Busch, John Christian	1
De Santi, Roger J.	1
Doss, David	1
Foster, Jeff L.	1
Jaeger, Richard M.	1
Ligon, Glynn	1
Linn, Robert L.	1
Lord, Frederic M.	1
Luftig, Jeffrey T.	1
Meyer, Kevin D.	1
Norton, Willis P.	1
Shale, Doug	1
Smith, Richard M.	1
Sullivan, Vicki Gallo	1
Thissen, David	1
Werts, Charles E.	1
More ▼

Publication Type

Reports - Research	7
Journal Articles	4
Speeches/Meeting Papers	4
Reports - Descriptive	2
Reports - Evaluative	1

Education Level

Audience

Researchers

Location

Laws, Policies, & Programs

Assessments and Surveys

New Jersey College Basic…	1
Sequential Tests of…	1

What Works Clearinghouse Rating

Showing all 10 results Save | Export

Bad Questions: An Essay Involving Item Response Theory

Peer reviewed

Direct link

Thissen, David – Journal of Educational and Behavioral Statistics, 2016

David Thissen, a professor in the Department of Psychology and Neuroscience, Quantitative Program at the University of North Carolina, has consulted and served on technical advisory committees for assessment programs that use item response theory (IRT) over the past couple decades. He has come to the conclusion that there are usually two purposes…

Descriptors: Item Response Theory, Test Construction, Testing Problems, Student Evaluation

Affective Instrumentation: Determining Its Reliability.

Peer reviewed

Luftig, Jeffrey T.; Norton, Willis P. – Journal of Studies in Technical Careers, 1981

The purpose of this article is to review applications of reliability formulas and to recommend more appropriate methods of determining the reliability of affective instruments. (SK)

Descriptors: Affective Measures, Error of Measurement, Measurement Techniques, Test Reliability

Reliability of Single-Rater Judgments of Semantic and Syntactic Classifications of Cloze Test Responses.

Peer reviewed

De Santi, Roger J.; Sullivan, Vicki Gallo – Journal of Research and Development in Education, 1985

Cloze-based evaluations of reading comprehension present room for a greater amount of subjectivity in rating reader response. A study was designed to ascertain the nature of potential subjectivity within a single-rater's ratings of cloze-based assessments of reading comprehension. (DF)

Descriptors: Cloze Procedure, Elementary Secondary Education, Error of Measurement, Interrater Reliability

Empty Bubbles: What Test Form Did They Take?

Download full text

Doss, David; Ligon, Glynn – 1985

Upon learning that a form of the Sequential Tests of Educational Progress was incorrectly distributed to an unidentified number of high school students along with an answer sheet pregridded with an alternate test form, the Austin Independent School District performed the following research analyses: (1) scored the tests using the key for each…

Descriptors: Educational Testing, Error of Measurement, Latent Trait Theory, Measurement Techniques

Considerations for Creating Multi-Language Personality Norms: A Three-Component Model of Error

Peer reviewed

Direct link

Meyer, Kevin D.; Foster, Jeff L. – International Journal of Testing, 2008

With the increasing globalization of human resources practices, a commensurate increase in demand has occurred for multi-language ("global") personality norms for use in selection and development efforts. The combination of data from multiple translations of a personality assessment into a single norm engenders error from multiple sources. This…

Descriptors: Global Approach, Cultural Differences, Norms, Human Resources

Essay Reliability: Form and Meaning.

Download full text

Shale, Doug – 1986

This study is an attempt at a cohesive characterization of the concept of essay reliability. As such, it takes as a basic premise that previous and current practices in reporting reliability estimates for essay tests have certain shortcomings. The study provides an analysis of these shortcomings--partly to encourage a fuller understanding of the…

Descriptors: Analysis of Variance, Correlation, Error of Measurement, Essay Tests

Estimating the Imputed Social Cost of Errors of Measurement.

Download full text

Lord, Frederic M. – 1983

If a loss function is available specifying the social cost of an error of measurement in the score on a unidimensional test, an asymptotic method, based on item response theory, is developed for optimal test design for a specified target population of examinees. Since in the real world such loss functions are not available, it is more useful to…

Descriptors: Cutting Scores, Decision Making, Error of Measurement, Estimation (Mathematics)

Test Fairness Is a Personal Issue!

Smith, Richard M. – 1983

Previous studies of test item bias have investigated how different groups of examinees perform differently on a given set of items. These studies imply that examinees should be treated in a certain way because they are of a particular sex or race rather than as individuals in their own right, but it is unrealistic and unfair to assume such an…

Descriptors: Academic Ability, Error of Measurement, Error Patterns, Higher Education

The Use and Effect of Caution Indices in Detecting Aberrant Patterns of Standard-Setting Recommendations.

Jaeger, Richard M.; Busch, John Christian – 1986

This study explores the use of the modified caution index (MCI) for identifying judges whose patterns of recommendations suggest that their judgments might be based on incomplete information, flawed reasoning, or inattention to their standard-setting tasks. It also examines the effect on test standards and passing rates when the test standards of…

Descriptors: Criterion Referenced Tests, Error of Measurement, Evaluation Methods, High Schools

Study of Academic Growth Using Simplex Models. Final Report.

Download full text

Werts, Charles E.; Linn, Robert L. – 1975

Forming a sequence covering the various aspects of the simplex model, four articles are presented here under the following titles: "A Simplex Model for Analyzing Academic Growth", "Analyzing Ratings With Correlated Intrajudge Measurement Errors", "The Correlation of States With Gain", and "The Reliability of…

Descriptors: Academic Achievement, Achievement Gains, Analysis of Covariance, College Students