Showing all 6 results
Woodruff, David; Traynor, Anne; Cui, Zhongmin; Fang, Yu – ACT, Inc., 2013
Professional standards for educational testing recommend that both the overall standard error of measurement and the conditional standard error of measurement (CSEM) be computed on the score scale used to report scores to examinees. Several methods have been developed to compute scale-score CSEMs. This paper compares three methods, based on…
Descriptors: Comparative Analysis, Error of Measurement, Scores, Scaling
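For context, one common raw-score CSEM is Lord's binomial estimate; scale-score methods like those compared here transform such conditional errors onto the reporting scale. A minimal sketch, for background only (not one of the three methods the paper compares):

```python
import math

def binomial_csem(x: int, n: int) -> float:
    """Lord's binomial CSEM for a number-correct score x on an n-item test."""
    return math.sqrt(x * (n - x) / (n - 1))

# Example: raw score of 21 on a 30-item test
print(round(binomial_csem(21, 30), 2))  # 2.55
```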
Peer reviewed
Murphy, Daniel L.; Beretvas, S. Natasha – Applied Measurement in Education, 2015
This study examines the use of cross-classified random effects models (CCrem) and cross-classified multiple membership random effects models (CCMMrem) to model rater bias and estimate teacher effectiveness. Effect estimates are compared using classical test theory (CTT) versus item response theory (IRT) scaling methods and three models (i.e., conventional multilevel…
Descriptors: Teacher Effectiveness, Comparative Analysis, Hierarchical Linear Modeling, Test Theory
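For readers new to the terminology, a generic cross-classified random effects model for a score on teacher j assigned by rater k can be written as below; this is a textbook form, not necessarily the exact specification estimated in the article:

```latex
Y_{i(jk)} = \gamma_{0} + u_{j} + r_{k} + e_{i(jk)}, \qquad
u_{j} \sim N(0, \tau_{u}^{2}), \quad
r_{k} \sim N(0, \tau_{r}^{2}), \quad
e_{i(jk)} \sim N(0, \sigma^{2})
```

Here u_j is the teacher effect of interest and r_k is the rater effect; the rater variance component captures systematic rater differences (rater bias).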
Peer reviewed
Almehrizi, Rashid S. – Applied Psychological Measurement, 2013
The majority of large-scale assessments develop various score scales that are either linear or nonlinear transformations of raw scores for better interpretations and uses of assessment results. The current formula for coefficient alpha (α; the commonly used reliability coefficient) only provides internal consistency reliability estimates of raw…
Descriptors: Raw Scores, Scaling, Reliability, Computation
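For reference, the standard coefficient alpha on raw scores, which this article generalizes to transformed scale scores, depends only on the item variances and the total-score variance. A minimal sketch of the raw-score formula (the article's generalization is not reproduced here):

```python
import numpy as np

def cronbach_alpha(items: np.ndarray) -> float:
    """Coefficient alpha for an (examinees x items) matrix of item scores."""
    k = items.shape[1]
    item_vars = items.var(axis=0, ddof=1)       # per-item variances
    total_var = items.sum(axis=1).var(ddof=1)   # variance of raw total scores
    return (k / (k - 1)) * (1.0 - item_vars.sum() / total_var)

# Demo on simulated 0/1 responses driven by a common ability, so items correlate
rng = np.random.default_rng(0)
theta = rng.normal(size=(500, 1))  # examinee ability
resp = (rng.random((500, 10)) < 1 / (1 + np.exp(-theta))).astype(int)
print(round(cronbach_alpha(resp), 3))
```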
Peer reviewed
Cresswell, Mike – Measurement: Interdisciplinary Research and Perspectives, 2010
Paul Newton (2010), with his characteristic concern about theory, has set out two different ways of thinking about the basis upon which equivalences of one sort or another are established between test score scales. His reason for doing this is a desire to establish "the defensibility of linkages lower on the continuum than concordance."…
Descriptors: Foreign Countries, Measurement Techniques, Psychometrics, Comparative Analysis
Peer reviewed
Newton, Paul E. – Measurement: Interdisciplinary Research and Perspectives, 2010
This article presents the author's rejoinder to the commentaries on "Thinking About Linking" from issue 8(1). Particularly within the more embracing linking frameworks, e.g., Holland & Dorans (2006) and Holland (2007), there appears to be a major disjunction between (1) classification discourse: the supposed basis for classification, that is, the underlying theory…
Descriptors: Foreign Countries, Measurement Techniques, Psychometrics, Comparative Analysis
Morrison, Carol A.; Fitzpatrick, Steven J. – 1992
An attempt was made to determine which item response theory (IRT) equating method results in the least amount of equating error or "scale drift" when equating scores across one or more test forms. An internal anchor test design was employed with five different test forms, each consisting of 30 items, 10 in common with the base test and 5…
Descriptors: Comparative Analysis, Computer Simulation, Equated Scores, Error of Measurement
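As background on the kind of transformation being evaluated, one simple IRT linking method places a new form's item difficulties on the base scale through mean/sigma constants estimated from the common (anchor) items. A minimal sketch with hypothetical difficulty values (not the study's data or its specific equating methods):

```python
import numpy as np

def mean_sigma_link(b_new: np.ndarray, b_base: np.ndarray) -> tuple[float, float]:
    """Mean/sigma linking constants mapping the new form onto the base scale.

    b_new, b_base: difficulty estimates for the same anchor items on each form.
    """
    A = b_base.std(ddof=1) / b_new.std(ddof=1)
    B = b_base.mean() - A * b_new.mean()
    return A, B

# Hypothetical anchor-item difficulties for ten common items
b_base = np.array([-1.2, -0.8, -0.3, 0.0, 0.2, 0.5, 0.9, 1.1, 1.4, 1.8])
b_new  = np.array([-1.0, -0.7, -0.2, 0.1, 0.3, 0.6, 1.0, 1.2, 1.5, 1.9])
A, B = mean_sigma_link(b_new, b_base)
print(A, B)  # rescale new-form difficulties as b_star = A * b + B
```

Repeating such a link across a chain of forms and comparing the result with a direct link back to the base form is one way to quantify scale drift.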