ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	0
Since 2006 (last 20 years)	1

Descriptor

Probability	6
Reliability	6
True Scores	6
Classification	5
Error of Measurement	3
Criterion Referenced Tests	2
Item Response Theory	2
Measurement	2
Raw Scores	2
Test Reliability	2
Ability	1
Academic Achievement	1
Academic Standards	1
Achievement	1
Educational Research	1
Elementary School Students	1
Elementary Secondary Education	1
Evaluation	1
Foreign Countries	1
Goodness of Fit	1
Guessing (Tests)	1
High School Students	1
Item Analysis	1
Mathematical Models	1
Multiple Choice Tests	1
More ▼

Source

Educational Research

Author

Bramley, Tom	1
Hoffman, R. Gene	1
Kane, Michael T.	1
Livingston, Samuel A.	1
Moloney, James M.	1
Wang, Tianyou	1
Wise, Lauress L.	1
Yoon, Bokhee	1
Young, Michael James	1

Publication Type

Reports - Research	4
Speeches/Meeting Papers	3
Reports - Evaluative	2
Journal Articles	1
Numerical/Quantitative Data	1

Education Level

Audience

Location

United Kingdom (England)

Laws, Policies, & Programs

Assessments and Surveys

Work Keys (ACT)

What Works Clearinghouse Rating

Showing all 6 results Save | Export

A Response to an Article Published in "Educational Research"'s Special Issue on Assessment (June 2009). What Can Be Inferred about Classification Accuracy from Classification Consistency?

Peer reviewed

Direct link

Bramley, Tom – Educational Research, 2010

Background: A recent article published in "Educational Research" on the reliability of results in National Curriculum testing in England (Newton, "The reliability of results from national curriculum testing in England," "Educational Research" 51, no. 2: 181-212, 2009) suggested that: (1) classification accuracy can be…

Descriptors: National Curriculum, Educational Research, Testing, Measurement

Establishing the Reliability of Student Proficiency Classifications: The Accuracy of Observed Classifications.

Download full text

Hoffman, R. Gene; Wise, Lauress L. – 2000

Classical test theory is based on the concept of a true score for each examinee, defined as the expected or average score across an infinite number of repeated parallel tests. In most cases, there is only a score from a single administration of the test in question. The difference between this single observed score and the underlying true score is…

Descriptors: Achievement, Classification, Observation, Probability

The Criterion-Referenced Reliability of a Single Score. Report 76-01.

Livingston, Samuel A. – 1976

A distinction is made between reliability of measurement and reliability of classification; the "criterion-referenced reliability coefficient" describes the former. Application of this coefficient to the probability distribution of possible scores for a single student yields a meaningful way to describe the reliability of a single score. (Author)

Descriptors: Classification, Criterion Referenced Tests, Error of Measurement, Measurement

Conditional Standard Errors, Reliability and Decision Consistency of Performance Levels Using Polytomous IRT.

Wang, Tianyou; And Others – 1996

M. J. Kolen, B. A. Hanson, and R. L. Brennan (1992) presented a procedure for assessing the conditional standard error of measurement (CSEM) of scale scores using a strong true-score model. They also investigated the ways of using nonlinear transformation from number-correct raw score to scale score to equalize the conditional standard error along…

Descriptors: Ability, Classification, Error of Measurement, Goodness of Fit

Estimating the Consistency and Accuracy of Classifications in a Standards-Referenced Assessment. CSE Technical Report 475.

Download full text

Young, Michael James; Yoon, Bokhee – 1998

An important feature of recent large-scale performance assessments has been the reporting of pupil and school performance in terms of performance or proficiency categories. When an assessment uses such ordered categories as the primary means of reporting results, the natural way of reporting on the quality of the assessment is through the…

Descriptors: Academic Achievement, Academic Standards, Classification, Criterion Referenced Tests

Item Reliabilities for a Family of Answer-Until-Correct (AUC) Scoring Rules.

PDF pending restoration

Kane, Michael T.; Moloney, James M. – 1976

The Answer-Until-Correct (AUC) procedure has been proposed in order to increase the reliability of multiple-choice items. A model for examinees' behavior when they must respond to each item until they answer it correctly is presented. An expression for the reliability of AUC items, as a function of the characteristics of the item and the scoring…

Descriptors: Guessing (Tests), Item Analysis, Mathematical Models, Multiple Choice Tests