Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 2 |
Since 2006 (last 20 years) | 2 |
Descriptor
Simulation | 6 |
Test Reliability | 6 |
True Scores | 6 |
Statistical Analysis | 3 |
Computer Programs | 2 |
Criterion Referenced Tests | 2 |
Error of Measurement | 2 |
Evaluation Methods | 2 |
Mathematical Models | 2 |
Test Bias | 2 |
Test Theory | 2 |
More ▼ |
Author
Algina, James | 1 |
Boughton, Keith A. | 1 |
Gierl, Mark J. | 1 |
Gotzmann, Andrea | 1 |
Hau, Kit-Tai | 1 |
Kelecioglu, Hülya | 1 |
Marshall, J. Laird | 1 |
Marston, Paul T., Borich,… | 1 |
Noe, Michael J. | 1 |
Xiao, Leifeng | 1 |
Öztürk-Gübes, Nese | 1 |
More ▼ |
Publication Type
Reports - Research | 5 |
Journal Articles | 3 |
Reports - Evaluative | 1 |
Education Level
Elementary Education | 1 |
Elementary Secondary Education | 1 |
Grade 8 | 1 |
Junior High Schools | 1 |
Middle Schools | 1 |
Secondary Education | 1 |
Audience
Location
Laws, Policies, & Programs
Assessments and Surveys
Trends in International… | 1 |
What Works Clearinghouse Rating
Xiao, Leifeng; Hau, Kit-Tai – Applied Measurement in Education, 2023
We compared coefficient alpha with five alternatives (omega total, omega RT, omega h, GLB, and coefficient H) in two simulation studies. Results showed for unidimensional scales, (a) all indices except omega h performed similarly well for most conditions; (b) alpha is still good; (c) GLB and coefficient H overestimated reliability with small…
Descriptors: Test Theory, Test Reliability, Factor Analysis, Test Length
Öztürk-Gübes, Nese; Kelecioglu, Hülya – Educational Sciences: Theory and Practice, 2016
The purpose of this study was to examine the impact of dimensionality, common-item set format, and different scale linking methods on preserving equity property with mixed-format test equating. Item response theory (IRT) true-score equating (TSE) and IRT observed-score equating (OSE) methods were used under common-item nonequivalent groups design.…
Descriptors: Test Format, Item Response Theory, True Scores, Equated Scores
Noe, Michael J.; Algina, James – 1977
Single-administration procedures for estimating the coefficient of agreement, a reliability index for criterion referenced tests, were recently developed by Subkoviak. The procedures require a distributional assumption for errors of measurement and an estimate of each examinee's true score. A computer simulation of tests composed of items that…
Descriptors: Computer Programs, Criterion Referenced Tests, Simulation, Test Reliability
Gierl, Mark J.; Gotzmann, Andrea; Boughton, Keith A. – Applied Measurement in Education, 2004
Differential item functioning (DIF) analyses are used to identify items that operate differently between two groups, after controlling for ability. The Simultaneous Item Bias Test (SIBTEST) is a popular DIF detection method that matches examinees on a true score estimate of ability. However in some testing situations, like test translation and…
Descriptors: True Scores, Simulation, Test Bias, Student Evaluation
Marshall, J. Laird – 1976
A summary is provided of the rationale for questioning the applicability of classical reliability measures to criterion referenced tests; an extension of the classical theory of true and error scores to incorporate a theory of dichotomous decisions; a presentation of the mean split-half coefficient of agreement, a single-administration test index…
Descriptors: Career Development, Computer Programs, Criterion Referenced Tests, Decision Making
Marston, Paul T., Borich, Gary D. – 1977
The four main approaches to measuring treatment effects in schools; raw gain, residual gain, covariance, and true scores; were compared. A simulation study showed true score analysis produced a large number of Type-I errors. When corrected for this error, this method showed the least power of the four. This outcome was clearly the result of the…
Descriptors: Achievement Gains, Analysis of Covariance, Comparative Analysis, Error of Measurement