Zumbo, Bruno D.; Kroc, Edward – Educational and Psychological Measurement, 2019
Chalmers recently published a critique of the use of ordinal alpha proposed in Zumbo et al. as a measure of test reliability in certain research settings. In this response, we take up the task of refuting Chalmers' critique. We identify three broad misconceptions that characterize Chalmers' criticisms: (1) confusing assumptions with…
Descriptors: Test Reliability, Statistical Analysis, Misconceptions, Mathematical Models

Lewis, Charles – Psychometrika, 1986
On the occasion of Psychometrika's fiftieth anniversary, the past twenty-five years' developments in mental test theory are reviewed. Psychometrika articles treating topics in test theory are listed in a bibliography. (Author/LMO)
Descriptors: Cognitive Measurement, Mathematical Models, Psychological Testing, Psychometrics

Kingma, Johannes; Van Den Bos, Kees P. – Educational and Psychological Measurement, 1987
Fifteen FORTRAN 77 programs are contained in the described package. Three programs are available for each of five forgetting models, with 5, 7, 8, 9, and 10 parameters. The programs compute parameter estimates and test parameter estimates both between and within experimental conditions. (Author/GDC)
Descriptors: Computer Software, Educational Experiments, Hypothesis Testing, Learning

Burton, Richard F. – Assessment & Evaluation in Higher Education, 2001
Describes four measures of test unreliability that quantify effects of question selection and guessing, both separately and together--three chosen for immediacy and one for greater mathematical elegance. Quantifies their dependence on test length and number of answer options per question. Concludes that many multiple choice tests are unreliable…
Descriptors: Guessing (Tests), Mathematical Models, Multiple Choice Tests, Objective Tests

Choppin, Bruce – Evaluation in Education: An International Review Series, 1985
Using the analogy of temperature measurement, the Rasch model is presented with arguments for its adoption as the basic scaling technique for achievement measures. Three extensions of the Rasch model for more complex testing are developed. Test development for the British national assessment program and the promise of item banking are also…
Descriptors: Academic Achievement, Achievement Tests, Educational Assessment, Item Banks