ERIC - Search Results

Publication Date

In 2025	1
Since 2024	4
Since 2021 (last 5 years)	27
Since 2016 (last 10 years)	72
Since 2006 (last 20 years)	134

Descriptor

Scores	137
Test Items	40
Cutting Scores	38
Testing Problems	38
Test Interpretation	37
Achievement Tests	31
Educational Assessment	28
Elementary Secondary Education	26
Psychometrics	26
Equated Scores	25
Test Use	24
Comparative Analysis	23
Item Response Theory	23
Test Construction	23
Evaluation Methods	22
Standard Setting (Scoring)	22
Academic Achievement	21
Scoring	21
Test Validity	21
Test Results	19
Validity	19
College Entrance Examinations	18
Models	18
Standardized Tests	16
Standards	16

Source

Educational Measurement:…	210

Publication Type

Journal Articles	210
Reports - Research	83
Reports - Evaluative	57
Reports - Descriptive	44
Opinion Papers	23
Tests/Questionnaires	9
Speeches/Meeting Papers	7
Information Analyses	6
Guides - Non-Classroom	5
Book/Product Reviews	1
Guides - Classroom - Teacher	1
Historical Materials	1

Education Level

Higher Education	19
Secondary Education	17
Postsecondary Education	15
Elementary Secondary Education	14
High Schools	11
Elementary Education	9
Middle Schools	6
Junior High Schools	5
Grade 3	4
Grade 4	4
Early Childhood Education	3
Grade 5	3
Adult Education	2
Grade 6	2
Intermediate Grades	2
Primary Education	2
Grade 1	1
Grade 10	1
Grade 7	1
Grade 8	1
Grade 9	1
High School Equivalency…	1
Two Year Colleges	1

Audience

Teachers	2
Counselors	1
Practitioners	1

Location

Canada	3
Idaho	2
United States	2
Arizona	1
California	1
Florida	1
Germany	1
Israel	1
Kansas	1
Maryland	1
Nebraska	1
Netherlands	1
New Hampshire	1
South Carolina	1
United Kingdom	1
Wisconsin	1

Laws, Policies, & Programs

No Child Left Behind Act 2001	7
Education Consolidation…	1
Every Student Succeeds Act…	1

Assessments and Surveys

SAT (College Admission Test)	11
ACT Assessment	7
National Assessment of…	4
Comprehensive Tests of Basic…	3
Iowa Tests of Basic Skills	3
Graduate Record Examinations	2
Program for International…	2
California Achievement Tests	1
College Board Achievement…	1
Iowa Tests of Educational…	1
Preliminary Scholastic…	1
Program for the International…	1
Sequential Tests of…	1
Test of English as a Foreign…	1

Showing 1 to 15 of 210 results

Evaluating Population Invariance of Test Equating during the COVID-19 Pandemic

Peer reviewed

Li, Dongmei; Kapoor, Shalini – Educational Measurement: Issues and Practice, 2022

Population invariance is a desirable property of test equating which might not hold when significant changes occur in the test population, such as those brought about by the COVID-19 pandemic. This research aims to investigate whether equating functions are reasonably invariant when the test population is impacted by the pandemic. Based on…

Descriptors: Test Items, Equated Scores, COVID-19, Pandemics

2023 NCME Presidential Address: Some Musings on Comparable Scores

Peer reviewed

Deborah J. Harris – Educational Measurement: Issues and Practice, 2024

This article is based on my 2023 NCME Presidential Address, where I talked a bit about my journey into the profession, and more substantively about comparable scores. Specifically, I discussed some of the different ways 'comparable scores' are defined, highlighted some areas I think we as a profession need to pay more attention to when considering…

Descriptors: Scores, Comparative Analysis, Speeches, Career Development

Digital Module 32: Understanding and Mitigating the Impact of Low Effort on Common Uses of Test and Survey Scores

Peer reviewed

Soland, James – Educational Measurement: Issues and Practice, 2023

Most individuals who take, interpret, design, or score tests are aware that examinees do not always provide full effort when responding to items. However, many such individuals are not aware of how pervasive the issue is, what its consequences are, and how to address it. In this digital ITEMS module, Dr. James Soland will help fill these gaps in…

Descriptors: Student Behavior, Tests, Scores, Incidence

Generalizability Theory Approach to Analyzing Automated-Item Generated Test Forms

Peer reviewed

Stella Y. Kim; Sungyeun Kim – Educational Measurement: Issues and Practice, 2025

This study presents several multivariate Generalizability theory designs for analyzing automatic item-generated (AIG) based test forms. The study used real data to illustrate the analysis procedure and discuss practical considerations. We collected the data from two groups of students, each group receiving a different form generated by AIG. A…

Descriptors: Generalizability Theory, Automation, Test Items, Students

A Critical Look into the Beuk Standard-Setting Method

Peer reviewed

Wyse, Adam E. – Educational Measurement: Issues and Practice, 2020

One commonly used compromise standard-setting method is the Beuk (1984) method. A key assumption of the Beuk method is that the emphasis given to the pass rate and the percent correct ratings should be proportional to the extent that the panelists agree on their ratings. However, whether the slope of the Beuk line reflects the emphasis that panelists…

Descriptors: Standard Setting (Scoring), Cutting Scores, Weighted Scores, Evaluation Methods

Digital Module 29: Multidimensional Item Response Theory Equating

Peer reviewed

Kim, Stella Y. – Educational Measurement: Issues and Practice, 2022

In this digital ITEMS module, Dr. Stella Kim provides an overview of multidimensional item response theory (MIRT) equating. Traditional unidimensional item response theory (IRT) equating methods impose the sometimes untenable restriction on data that only a single ability is assessed. This module discusses potential sources of multidimensionality…

Descriptors: Item Response Theory, Models, Equated Scores, Evaluation Methods

Defining Test-Score Interpretation, Use, and Claims: Delphi Study for the Validity Argument

Peer reviewed

Folger, Timothy D.; Bostic, Jonathan; Krupa, Erin E. – Educational Measurement: Issues and Practice, 2023

Validity is a fundamental consideration of test development and test evaluation. The purpose of this study is to define and reify three key aspects of validity and validation, namely test-score interpretation, test-score use, and the claims supporting interpretation and use. This study employed a Delphi methodology to explore how experts in…

Descriptors: Test Interpretation, Scores, Test Use, Test Validity

Reconceptualization of Coefficient Alpha Reliability for Test Summed and Scaled Scores

Peer reviewed

Almehrizi, Rashid S. – Educational Measurement: Issues and Practice, 2022

Coefficient alpha reliability persists as the most common reliability coefficient reported in research. The assumptions for its use are, however, not well-understood. The current paper challenges the commonly used expressions of coefficient alpha and argues that while these expressions are correct when estimating reliability for summed scores,…

Descriptors: Reliability, Scores, Scaling, Statistical Analysis

Digital Module 14: Planning and Conducting Standard Setting

Peer reviewed

Bunch, Michael B. – Educational Measurement: Issues and Practice, 2020

In this digital ITEMS module, Dr. Michael Bunch provides an in-depth, step-by-step look at how standard setting is done. It does not focus on any specific procedure or methodology (e.g., modified Angoff, bookmark, and body of work) but on the practical tasks that must be completed for any standard setting activity. Dr. Bunch carries the…

Descriptors: Standard Setting, Cutting Scores, Scores, Reports

Using Classification Tree Models to Determine Course Placement

Peer reviewed

Lee, Chansoon – Educational Measurement: Issues and Practice, 2022

Appropriate placement into courses at postsecondary institutions is critical for the success of students in terms of retention and graduation rates. To reduce the number of students who are misplaced, using multiple measures in placing students is encouraged. However, in practice most postsecondary schools utilize only a few measures to determine…

Descriptors: Classification, Models, Student Placement, College Students

Applying a Mixture Rasch Model-Based Approach to Standard Setting

Peer reviewed

Peabody, Michael R.; Muckle, Timothy J.; Meng, Yu – Educational Measurement: Issues and Practice, 2023

The subjective aspect of standard-setting is often criticized, yet data-driven standard-setting methods are rarely applied. Therefore, we applied a mixture Rasch model approach to setting performance standards across several testing programs of various sizes and compared the results to existing passing standards derived from traditional…

Descriptors: Item Response Theory, Standard Setting, Testing, Sampling

Combining Process Information and Item Response Modeling to Estimate Problem-Solving Ability

Peer reviewed

Xiao, Yue; Veldkamp, Bernard; Liu, Hongyun – Educational Measurement: Issues and Practice, 2022

The action sequences of respondents in problem-solving tasks reflect rich and detailed information about their performance, including differences in problem-solving ability, even if item scores are equal. It is therefore not sufficient to infer individual problem-solving skills based solely on item scores. This study is a preliminary attempt to…

Descriptors: Problem Solving, Item Response Theory, Scores, Item Analysis

Reporting Pass-Fail Decisions to Examinees with Incomplete Data: A Commentary on Feinberg (2021)

Peer reviewed

Sinharay, Sandip – Educational Measurement: Issues and Practice, 2022

Administrative problems such as computer malfunction and power outage occasionally lead to missing item scores, and hence to incomplete data, on credentialing tests such as the United States Medical Licensing Examination. Feinberg compared four approaches for reporting pass-fail decisions to the examinees with incomplete data on credentialing…

Descriptors: Testing Problems, High Stakes Tests, Credentials, Test Items

Adjusting for Ability Differences of Equating Samples When Randomization Is Suboptimal

Peer reviewed

Kim, Sooyeon; Walker, Michael E. – Educational Measurement: Issues and Practice, 2022

Test equating requires collecting data to link the scores from different forms of a test. Problems arise when equating samples are not equivalent and the test forms to be linked share no common items by which to measure or adjust for the group nonequivalence. Using data from five operational test forms, we created five pairs of research forms for…

Descriptors: Ability, Tests, Equated Scores, Testing Problems

What Are the Conditions Associated with Subscore Added Value Noninvariance? Implications for Improving Subscore Interpretation Fairness

Peer reviewed

Rios, Joseph A.; Miranda, Alejandra A. – Educational Measurement: Issues and Practice, 2021

Subscore added value analyses assume invariance across test taking populations; however, this assumption may be untenable in practice as differential subdomain relationships may be present among subgroups. The purpose of this simulation study was to understand the conditions associated with subscore added value noninvariance when manipulating: (1)…

Descriptors: Scores, Test Length, Ability, Correlation

Author

Sinharay, Sandip	9
Hills, John R.	6
Kolen, Michael J.	5
Feinberg, Richard A.	4
Ho, Andrew D.	4
Mehrens, William A.	4
Sireci, Stephen G.	4
Wainer, Howard	4
Wyse, Adam E.	4
Clauser, Brian E.	3
Dorans, Neil J.	3
Frisbie, David A.	3
Hoover, H. D.	3
Kuncel, Nathan R.	3
Linn, Robert L.	3
Margolis, Melissa J.	3
Mattern, Krista	3
Puhan, Gautam	3
Baldwin, Peter	2
Brennan, Robert L.	2
Cannell, John Jacob	2
Childs, Ruth A.	2
Cizek, Gregory J.	2
Clarizio, Harvey F.	2