ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	1
Since 2006 (last 20 years)	7

Descriptor

Computation	7
Test Theory	7
Item Response Theory	4
Error of Measurement	3
Comparative Analysis	2
Foreign Countries	2
Methods	2
Reliability	2
Scores	2
Statistical Analysis	2
Test Items	2
Ability	1
Achievement Tests	1
Cloze Procedure	1
College Entrance Examinations	1
Criteria	1
Data Collection	1
Difficulty Level	1
Educational Assessment	1
Educational Testing	1
Equated Scores	1
Equations (Mathematics)	1
Exit Examinations	1
Factor Analysis	1
Grade 3	1
More ▼

Source

ACT, Inc.	1
Applied Measurement in…	1
Applied Psychological…	1
Behavioral Research and…	1
Journal of Educational…	1
Practical Assessment,…	1
Research Papers in Education	1

Author

Beauducel, Andre	1
Bramley, Tom	1
Cui, Zhongmin	1
Dhawan, Vikas	1
Fang, Yu	1
Haberman, Shelby	1
Ketterlin-Geller, Leanne R.	1
Larkin, Kevin	1
Liu, Kimy	1
Naples, Adam	1
Puhan, Gautam	1
Sinharay, Sandip	1
Stemler, Steven E.	1
Sundstrom-Hebert, Krystal	1
Tindal, Gerald	1
Traynor, Anne	1
Woodruff, David	1
van der Linden, Wim J.	1
More ▼

Publication Type

Reports - Evaluative	7
Journal Articles	5
Numerical/Quantitative Data	1

Education Level

Secondary Education	2
Elementary Education	1
Grade 3	1
Grade 4	1
Grade 5	1
Grade 6	1
Grade 7	1
Grade 8	1
Higher Education	1
Junior High Schools	1
Middle Schools	1
Postsecondary Education	1
More ▼

Audience

Location

Germany	1
United Kingdom (England)	1

Laws, Policies, & Programs

Assessments and Surveys

ACT Assessment	1
Eysenck Personality Inventory	1

What Works Clearinghouse Rating

Showing all 7 results Save | Export

Rasch Measurement v. Item Response Theory: Knowing When to Cross the Line

Peer reviewed
PDF on ERIC

Download full text

Stemler, Steven E.; Naples, Adam – Practical Assessment, Research & Evaluation, 2021

When students receive the same score on a test, does that mean they know the same amount about the topic? The answer to this question is more complex than it may first appear. This paper compares classical and modern test theories in terms of how they estimate student ability. Crucial distinctions between the aims of Rasch Measurement and IRT are…

Descriptors: Item Response Theory, Test Theory, Ability, Computation

Some Conceptual Issues in Observed-Score Equating

Peer reviewed

Direct link

van der Linden, Wim J. – Journal of Educational Measurement, 2013

In spite of all of the technical progress in observed-score equating, several of the more conceptual aspects of the process still are not well understood. As a result, the equating literature struggles with rather complex criteria of equating, lack of a test-theoretic foundation, confusing terminology, and ad hoc analyses. A return to Lord's…

Descriptors: Equated Scores, Statistical Analysis, Computation, Data Collection

A Comparison of Three Methods for Computing Scale Score Conditional Standard Errors of Measurement. ACT Research Report Series, 2013 (7)

Download full text

Woodruff, David; Traynor, Anne; Cui, Zhongmin; Fang, Yu – ACT, Inc., 2013

Professional standards for educational testing recommend that both the overall standard error of measurement and the conditional standard error of measurement (CSEM) be computed on the score scale used to report scores to examinees. Several methods have been developed to compute scale score CSEMs. This paper compares three methods, based on…

Descriptors: Comparative Analysis, Error of Measurement, Scores, Scaling

Problems in Estimating Composite Reliability of "Unitised" Assessments

Peer reviewed

Direct link

Bramley, Tom; Dhawan, Vikas – Research Papers in Education, 2013

This paper discusses the issues involved in calculating indices of composite reliability for "modular" or "unitised" assessments of the kind used in GCSEs, AS and A level examinations in England. The increasingly widespread use of on-screen marking has meant that the item-level data required for calculating indices of…

Descriptors: Foreign Countries, Exit Examinations, Secondary Education, Test Reliability

Taking the Error Term of the Factor Model into Account: The Factor Score Predictor Interval

Peer reviewed

Direct link

Beauducel, Andre – Applied Psychological Measurement, 2013

The problem of factor score indeterminacy implies that the factor and the error scores cannot be completely disentangled in the factor model. It is therefore proposed to compute Harman's factor score predictor that contains an additive combination of factor and error variance. This additive combination is discussed in the framework of classical…

Descriptors: Factor Analysis, Predictor Variables, Reliability, Error of Measurement

The Utility of Augmented Subscores in a Licensure Exam: An Evaluation of Methods Using Empirical Data

Peer reviewed

Direct link

Puhan, Gautam; Sinharay, Sandip; Haberman, Shelby; Larkin, Kevin – Applied Measurement in Education, 2010

Will subscores provide additional information than what is provided by the total score? Is there a method that can estimate more trustworthy subscores than observed subscores? To answer the first question, this study evaluated whether the true subscore was more accurately predicted by the observed subscore or total score. To answer the second…

Descriptors: Licensing Examinations (Professions), Scores, Computation, Methods

Instrument Development Procedures for Maze Measures. Technical Report # 08-06

Download full text

Liu, Kimy; Sundstrom-Hebert, Krystal; Ketterlin-Geller, Leanne R.; Tindal, Gerald – Behavioral Research and Teaching, 2008

The purpose of this study was to document the instrument development of maze measures for grades 3-8. Each maze passage contained twelve omitted words that students filled in by choosing the best-fit word from among the provided options. In this technical report, we describe the process of creating, reviewing, and pilot testing the maze measures.…

Descriptors: Test Construction, Cloze Procedure, Multiple Choice Tests, Reading Tests