ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	1
Since 2006 (last 20 years)	3

Descriptor

Error of Measurement	7
Probability	7
True Scores	7
Classification	3
Reliability	3
Statistical Analysis	3
Goodness of Fit	2
Item Response Theory	2
Mathematical Models	2
Measurement	2
Scores	2
Statistical Distributions	2
Ability	1
Achievement Tests	1
Comparative Analysis	1
Computation	1
Computer Simulation	1
Correlation	1
Criterion Referenced Tests	1
Educational Research	1
Equated Scores	1
Error Correction	1
Evaluation	1
Foreign Countries	1
Guidelines	1
More ▼

Source

Educational Research	1
Journal of Educational and…	1
Practical Assessment,…	1

Author

Bramley, Tom	1
Brennan, Robert L.	1
Dayton, C. Mitchell	1
Jiang, Tao	1
Kolen, Michael J.	1
Lee, Won-Chan	1
Livingston, Samuel A.	1
Macready, George B.	1
Phillips, Gary W.	1
Wang, Tianyou	1
Wilcox, Rand R.	1
More ▼

Publication Type

Reports - Evaluative	4
Reports - Research	4
Journal Articles	3
Speeches/Meeting Papers	1

Education Level

Audience

Location

United Kingdom (England)

Laws, Policies, & Programs

Assessments and Surveys

Work Keys (ACT)

What Works Clearinghouse Rating

Showing all 7 results Save | Export

Measurement Error and Equating Error in Power Analysis

Peer reviewed
PDF on ERIC

Download full text

Phillips, Gary W.; Jiang, Tao – Practical Assessment, Research & Evaluation, 2016

Power analysis is a fundamental prerequisite for conducting scientific research. Without power analysis the researcher has no way of knowing whether the sample size is large enough to detect the effect he or she is looking for. This paper demonstrates how psychometric factors such as measurement error and equating error affect the power of…

Descriptors: Error of Measurement, Statistical Analysis, Equated Scores, Sample Size

A Response to an Article Published in "Educational Research"'s Special Issue on Assessment (June 2009). What Can Be Inferred about Classification Accuracy from Classification Consistency?

Peer reviewed

Direct link

Bramley, Tom – Educational Research, 2010

Background: A recent article published in "Educational Research" on the reliability of results in National Curriculum testing in England (Newton, "The reliability of results from national curriculum testing in England," "Educational Research" 51, no. 2: 181-212, 2009) suggested that: (1) classification accuracy can be…

Descriptors: National Curriculum, Educational Research, Testing, Measurement

Interval Estimation for True Raw and Scale Scores under the Binomial Error Model

Peer reviewed

Direct link

Lee, Won-Chan; Brennan, Robert L.; Kolen, Michael J. – Journal of Educational and Behavioral Statistics, 2006

Assuming errors of measurement are distributed binomially, this article reviews various procedures for constructing an interval for an individual's true number-correct score; presents two general interval estimation procedures for an individual's true scale score (i.e., normal approximation and endpoints conversion methods); compares various…

Descriptors: Probability, Intervals, Guidelines, Computer Simulation

The Criterion-Referenced Reliability of a Single Score. Report 76-01.

Livingston, Samuel A. – 1976

A distinction is made between reliability of measurement and reliability of classification; the "criterion-referenced reliability coefficient" describes the former. Application of this coefficient to the probability distribution of possible scores for a single student yields a meaningful way to describe the reliability of a single score. (Author)

Descriptors: Classification, Criterion Referenced Tests, Error of Measurement, Measurement

Statistical Comparisons Among Hierarchies Based on Latent Structure Models. Research Monograph 77-1.

Download full text

Macready, George B.; Dayton, C. Mitchell – 1977

A probabilistic hypothesis testing procedure to assess the fit of hypothesized hierarchical structures for test item data is discussed. Statistical procedures are presented which are useful for evaluating the fit of data of a certain class of probabilistic models. These models apply to sets of dichotomous (O,1) responses for which there are…

Descriptors: Error of Measurement, Goodness of Fit, Hypothesis Testing, Mathematical Models

Conditional Standard Errors, Reliability and Decision Consistency of Performance Levels Using Polytomous IRT.

Wang, Tianyou; And Others – 1996

M. J. Kolen, B. A. Hanson, and R. L. Brennan (1992) presented a procedure for assessing the conditional standard error of measurement (CSEM) of scale scores using a strong true-score model. They also investigated the ways of using nonlinear transformation from number-correct raw score to scale score to equalize the conditional standard error along…

Descriptors: Ability, Classification, Error of Measurement, Goodness of Fit

An Alternative Interpretation of Three Stability Models. Measurement and Methodology, Work Unit 2: Technical Adequacy of Tests.

Wilcox, Rand R. – 1978

Two fundamental problems in mental test theory are to estimate true score and to estimate the amount of error when testing an examinee. In this report, three probability models which characterize a single test item in terms of a population of examinees are described. How these models may be modified to characterize a single examinee in terms of an…

Descriptors: Achievement Tests, Comparative Analysis, Error of Measurement, Mathematical Models