Publication Date
| Date Range | Count |
| --- | --- |
| In 2026 | 0 |
| Since 2025 | 2 |
| Since 2022 (last 5 years) | 6 |
| Since 2017 (last 10 years) | 17 |
| Since 2007 (last 20 years) | 33 |
Descriptor
| Descriptor | Count |
| --- | --- |
| Scoring | 65 |
| Test Items | 65 |
| Testing | 65 |
| Test Construction | 30 |
| Test Validity | 24 |
| Test Reliability | 20 |
| Item Analysis | 15 |
| Comparative Analysis | 13 |
| Psychometrics | 13 |
| Scores | 13 |
| Item Response Theory | 11 |
Author
| Author | Count |
| --- | --- |
| De Avila, Edward A. | 2 |
| Duncan, Sharon E. | 2 |
| Hambleton, Ronald K. | 2 |
| Puhan, Gautam | 2 |
| Ahmed, S. | 1 |
| Alderson, J. Charles | 1 |
| Anderson, Dan | 1 |
| Ann Arthur | 1 |
| Baldwin, Peter | 1 |
| Baxter, G. P. | 1 |
| Bennett, Randy Elliot | 1 |
Education Level
| Education Level | Count |
| --- | --- |
| Elementary Education | 3 |
| Elementary Secondary Education | 3 |
| Grade 4 | 3 |
| Grade 5 | 3 |
| Grade 6 | 3 |
| Grade 7 | 3 |
| High Schools | 3 |
| Junior High Schools | 3 |
| Middle Schools | 3 |
| Secondary Education | 3 |
| Early Childhood Education | 2 |
Audience
| Audience | Count |
| --- | --- |
| Practitioners | 5 |
| Administrators | 3 |
| Teachers | 3 |
Location
| Location | Count |
| --- | --- |
| Canada | 2 |
| Arizona | 1 |
| California | 1 |
| Netherlands | 1 |
| North Carolina | 1 |
| Puerto Rico | 1 |
| United Kingdom (England) | 1 |
| United Kingdom (London) | 1 |
| United Kingdom (Scotland) | 1 |
| Virginia | 1 |
Venessa F. Manna; Shuhong Li; Spiros Papageorgiou; Lixiong Gu – ETS Research Report Series, 2025
This technical manual describes the purpose and intended uses of the TOEFL iBT test, its target test-taker population, and relevant language use domains. The test design and scoring procedures are presented first, followed by a research agenda intended to support the interpretation and use of test scores. Given the updates to the test starting…
Descriptors: Second Language Learning, English (Second Language), Language Tests, Test Construction
Baldwin, Peter; Clauser, Brian E. – Journal of Educational Measurement, 2022
While score comparability across test forms typically relies on common (or randomly equivalent) examinees or items, innovations in item formats, test delivery, and efforts to extend the range of score interpretation may require a special data collection before examinees or items can be used in this way--or may be incompatible with common examinee…
Descriptors: Scoring, Testing, Test Items, Test Format
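The conventional common-item link that the abstract contrasts with can be made concrete with a small sketch. Below is a hypothetical chained linear equating through an anchor test in Python, using simulated scores throughout; it illustrates the standard design the authors take as their starting point, not the special data collections the article investigates.

```python
# A hedged sketch of common-item score linking: chained linear equating
# through an anchor test V. All scores are simulated; operational programs
# would use far larger samples and more elaborate methods.
import numpy as np

rng = np.random.default_rng(0)
theta1 = rng.normal(0.0, 1, 500)               # group 1 ability
theta2 = rng.normal(0.2, 1, 500)               # group 2 ability (slightly higher)
x  = 30 + 5 * theta1 + rng.normal(0, 1, 500)   # group 1: Form X scores
v1 = 15 + 3 * theta1 + rng.normal(0, 1, 500)   # group 1: anchor scores
y  = 32 + 5 * theta2 + rng.normal(0, 1, 500)   # group 2: Form Y scores
v2 = 15 + 3 * theta2 + rng.normal(0, 1, 500)   # group 2: anchor scores

def linear_link(a, b):
    """Mean-sigma line mapping the scale of a onto the scale of b."""
    slope = b.std(ddof=1) / a.std(ddof=1)
    return slope, b.mean() - slope * a.mean()

a1, b1 = linear_link(x, v1)    # Form X -> anchor (estimated in group 1)
a2, b2 = linear_link(v2, y)    # anchor -> Form Y (estimated in group 2)

raw_x = 35                     # a raw score earned on Form X
print(f"Form X {raw_x} is approximately {a2 * (a1 * raw_x + b1) + b2:.1f} on the Form Y scale")
```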
Dongmei Li; Shalini Kapoor; Ann Arthur; Chi-Yu Huang; YoungWoo Cho; Chen Qiu; Hongling Wang – ACT Education Corp., 2025
Starting in April 2025, ACT will introduce enhanced forms of the ACT® test for national online testing, with a full rollout to all paper and online test takers in national, state and district, and international test administrations by Spring 2026. ACT introduced major updates by changing the test lengths and testing times, providing more time per…
Descriptors: College Entrance Examinations, Testing, Change, Scoring
Puhan, Gautam; Kim, Sooyeon – Journal of Educational Measurement, 2022
As a result of the COVID-19 pandemic, at-home testing has become a popular delivery mode in many testing programs. When programs offer at-home testing to expand their service, the score comparability between test takers testing remotely and those testing in a test center is critical. This article summarizes statistical procedures that could be…
Descriptors: Scores, Scoring, Comparative Analysis, Testing
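As one hedged illustration of what a basic comparability check can involve (the truncated abstract does not spell out the authors' actual procedures), the sketch below compares simulated remote and test-center score distributions using a standardized mean difference and a Welch t-test. All scores are hypothetical.

```python
# Illustrative only: compare at-home and test-center score distributions
# on one simple comparability signal, the standardized mean difference.
import numpy as np
from scipy import stats

rng = np.random.default_rng(0)
remote = rng.normal(500, 100, 400)   # hypothetical scaled scores, at-home
center = rng.normal(505, 100, 400)   # hypothetical scaled scores, test center

# Cohen's d with a pooled standard deviation
pooled_sd = np.sqrt((remote.var(ddof=1) + center.var(ddof=1)) / 2)
d = (remote.mean() - center.mean()) / pooled_sd

t, p = stats.ttest_ind(remote, center, equal_var=False)  # Welch's t-test
print(f"standardized mean difference d = {d:.3f}, Welch t = {t:.2f}, p = {p:.3f}")
```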
New Meridian Corporation, 2020
New Meridian Corporation has developed the "Quality Testing Standards and Criteria for Comparability Claims" (QTS). The goal of the QTS is to provide guidance to states that are interested in including content from the New Meridian item bank and intend to make comparability claims with "other assessments" that include New…
Descriptors: Testing, Standards, Comparative Analysis, Guidelines
Zwick, Rebecca; Ye, Lei; Isham, Steven – Journal of Educational Measurement, 2018
In typical differential item functioning (DIF) assessments, an item's DIF status is not influenced by its status in previous test administrations. An item that has shown DIF at multiple administrations may be treated the same way as an item that has shown DIF in only the most recent administration. Therefore, much useful information about the…
Descriptors: Test Bias, Testing, Test Items, Bayesian Statistics
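For readers unfamiliar with per-administration DIF statistics, the sketch below computes the standard Mantel-Haenszel odds ratio and ETS D-DIF value for a single item from hypothetical stratified counts. This is the kind of single-administration result the authors propose to pool across administrations; their Bayesian pooling itself is not reproduced here.

```python
# Mantel-Haenszel DIF for one item, computed from hypothetical data.
import numpy as np

# One 2x2 table per ability stratum:
# rows = (reference, focal) group, cols = (correct, incorrect)
strata = [
    np.array([[80, 20], [70, 30]]),
    np.array([[60, 40], [45, 55]]),
    np.array([[30, 70], [20, 80]]),
]

num = sum(t[0, 0] * t[1, 1] / t.sum() for t in strata)
den = sum(t[0, 1] * t[1, 0] / t.sum() for t in strata)
alpha_mh = num / den                  # common odds ratio across strata
delta_mh = -2.35 * np.log(alpha_mh)   # ETS delta scale; large |delta| flags DIF
print(f"MH odds ratio = {alpha_mh:.2f}, MH D-DIF = {delta_mh:.2f}")
```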
Palermo, Corey; Bunch, Michael B.; Ridge, Kirk – Journal of Educational Measurement, 2019
Although much attention has been given to rater effects in rater-mediated assessment contexts, little research has examined the overall stability of leniency and severity effects over time. This study examined longitudinal scoring data collected during three consecutive administrations of a large-scale, multi-state summative assessment program.…
Descriptors: Scoring, Interrater Reliability, Measurement, Summative Evaluation
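A minimal sketch of what tracking leniency and severity over time can look like: each rater's mean deviation from the panel average on the same responses, computed separately for each administration. The data and the severity index below are hypothetical illustrations, not the study's actual models.

```python
# Crude severity-drift check across administrations, with simulated ratings.
import numpy as np

rng = np.random.default_rng(1)
# ratings[admin] has shape (n_raters, n_responses); hypothetical 0-4 rubric
ratings = [np.clip(rng.normal(2.5, 0.8, (3, 50)).round(), 0, 4) for _ in range(3)]

for t, r in enumerate(ratings, start=1):
    panel_mean = r.mean(axis=0)               # consensus score per response
    severity = (r - panel_mean).mean(axis=1)  # positive = lenient, negative = severe
    print(f"administration {t}: severity index per rater = {np.round(severity, 2)}")
```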
Lynch, Sarah – Practical Assessment, Research & Evaluation, 2022
In today's digital age, tests are increasingly being delivered on computers. Many of these computer-based tests (CBTs) have been adapted from paper-based tests (PBTs). However, this change in mode of test administration has the potential to introduce construct-irrelevant variance, affecting the validity of score interpretations. Because of this,…
Descriptors: Computer Assisted Testing, Tests, Scores, Scoring
International Journal of Testing, 2018
The second edition of the International Test Commission Guidelines for Translating and Adapting Tests was prepared between 2005 and 2015 to improve upon the first edition, and to respond to advances in testing technology and practices. The 18 guidelines are organized into six categories to facilitate their use: pre-condition (3), test development…
Descriptors: Translation, Test Construction, Testing, Scoring
National Academies Press, 2022
The National Assessment of Educational Progress (NAEP) -- often called "The Nation's Report Card" -- is the largest nationally representative and continuing assessment of what students in public and private schools in the United States know and can do in various subjects and has provided policy makers and the public with invaluable…
Descriptors: Costs, Futures (of Society), National Competency Tests, Educational Trends
Eckerly, Carol; Smith, Russell; Sowles, John – Practical Assessment, Research & Evaluation, 2018
The Discrete Option Multiple Choice (DOMC) item format was introduced by Foster and Miller (2009) with the intent of improving the security of test content. However, by changing the amount and order of the content presented, the test taking experience varies by test taker, thereby introducing potential fairness issues. In this paper we…
Descriptors: Culture Fair Tests, Multiple Choice Tests, Testing, Test Items
New Meridian Corporation, 2020
New Meridian Corporation has developed the "Quality Testing Standards and Criteria for Comparability Claims" (QTS) to provide guidance to states that are interested in including New Meridian content and would like to either keep reporting scores on the New Meridian Scale or use the New Meridian performance levels; that is, the state…
Descriptors: Testing, Standards, Comparative Analysis, Test Content
Reynolds, Matthew R.; Niileksela, Christopher R. – Journal of Psychoeducational Assessment, 2015
"The Woodcock-Johnson IV Tests of Cognitive Abilities" (WJ IV COG) is an individually administered measure of psychometric intellectual abilities designed for ages 2 to 90+. The measure was published by Houghton Mifflin Harcourt-Riverside in 2014. Frederick Shrank, Kevin McGrew, and Nancy Mather are the authors. Richard Woodcock, the…
Descriptors: Cognitive Tests, Testing, Scoring, Test Interpretation

