ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	2
Since 2006 (last 20 years)	2

Descriptor

Simulation	6
Test Reliability	6
True Scores	6
Statistical Analysis	3
Computer Programs	2
Criterion Referenced Tests	2
Error of Measurement	2
Evaluation Methods	2
Mathematical Models	2
Test Bias	2
Test Theory	2
Accuracy	1
Achievement Gains	1
Achievement Tests	1
Analysis of Covariance	1
Career Development	1
Comparative Analysis	1
Decision Making	1
Elementary Secondary Education	1
Equated Scores	1
Evaluation Research	1
Factor Analysis	1
Grade 8	1
Individual Differences	1
International Assessment	1
More ▼

Source

Applied Measurement in…	2
Educational Sciences: Theory…	1

Author

Algina, James	1
Boughton, Keith A.	1
Gierl, Mark J.	1
Gotzmann, Andrea	1
Hau, Kit-Tai	1
Kelecioglu, Hülya	1
Marshall, J. Laird	1
Marston, Paul T., Borich,…	1
Noe, Michael J.	1
Xiao, Leifeng	1
Öztürk-Gübes, Nese	1
More ▼

Publication Type

Reports - Research	5
Journal Articles	3
Reports - Evaluative	1

Education Level

Elementary Education	1
Elementary Secondary Education	1
Grade 8	1
Junior High Schools	1
Middle Schools	1
Secondary Education	1

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

Trends in International…

What Works Clearinghouse Rating

Showing all 6 results Save | Export

Accuracy and Sensitivity of Coefficient Alpha and Its Alternatives with Unidimensional and Contaminated Scales

Peer reviewed

Direct link

Xiao, Leifeng; Hau, Kit-Tai – Applied Measurement in Education, 2023

We compared coefficient alpha with five alternatives (omega total, omega RT, omega h, GLB, and coefficient H) in two simulation studies. Results showed for unidimensional scales, (a) all indices except omega h performed similarly well for most conditions; (b) alpha is still good; (c) GLB and coefficient H overestimated reliability with small…

Descriptors: Test Theory, Test Reliability, Factor Analysis, Test Length

The Impact of Test Dimensionality, Common-Item Set Format, and Scale Linking Methods on Mixed-Format Test Equating

Peer reviewed
PDF on ERIC

Download full text

Öztürk-Gübes, Nese; Kelecioglu, Hülya – Educational Sciences: Theory and Practice, 2016

The purpose of this study was to examine the impact of dimensionality, common-item set format, and different scale linking methods on preserving equity property with mixed-format test equating. Item response theory (IRT) true-score equating (TSE) and IRT observed-score equating (OSE) methods were used under common-item nonequivalent groups design.…

Descriptors: Test Format, Item Response Theory, True Scores, Equated Scores

An Investigation of a Single Administration Estimate of a Criterion- Referenced Reliability Index.

Download full text

Noe, Michael J.; Algina, James – 1977

Single-administration procedures for estimating the coefficient of agreement, a reliability index for criterion referenced tests, were recently developed by Subkoviak. The procedures require a distributional assumption for errors of measurement and an estimate of each examinee's true score. A computer simulation of tests composed of items that…

Descriptors: Computer Programs, Criterion Referenced Tests, Simulation, Test Reliability

Performance of SIBTEST When the Percentage of DIF Items Is Large

Peer reviewed

Direct link

Gierl, Mark J.; Gotzmann, Andrea; Boughton, Keith A. – Applied Measurement in Education, 2004

Differential item functioning (DIF) analyses are used to identify items that operate differently between two groups, after controlling for ability. The Simultaneous Item Bias Test (SIBTEST) is a popular DIF detection method that matches examinees on a true score estimate of ability. However in some testing situations, like test translation and…

Descriptors: True Scores, Simulation, Test Bias, Student Evaluation

The Mean Split-Half Coefficient of Agreement and its Relation to Other Single-Administration Test Indices: A Study Based on Simulated Data. Technical Report No. 350.

Download full text

Marshall, J. Laird – 1976

A summary is provided of the rationale for questioning the applicability of classical reliability measures to criterion referenced tests; an extension of the classical theory of true and error scores to incorporate a theory of dichotomous decisions; a presentation of the mean split-half coefficient of agreement, a single-administration test index…

Descriptors: Career Development, Computer Programs, Criterion Referenced Tests, Decision Making

Analysis of Covariance: Is It the Appropriate Model to Study Change?

Download full text

Marston, Paul T., Borich, Gary D. – 1977

The four main approaches to measuring treatment effects in schools; raw gain, residual gain, covariance, and true scores; were compared. A simulation study showed true score analysis produced a large number of Type-I errors. When corrected for this error, this method showed the least power of the four. This outcome was clearly the result of the…

Descriptors: Achievement Gains, Analysis of Covariance, Comparative Analysis, Error of Measurement