ERIC - Search Results

Publication Date

In 2026	0
Since 2025	1
Since 2022 (last 5 years)	9
Since 2017 (last 10 years)	20
Since 2007 (last 20 years)	56

Descriptor

Test Items	106
Test Validity	106
Test Construction	71
Test Reliability	44
Item Analysis	20
Scoring	19
Item Response Theory	17
Psychometrics	16
Scores	16
Student Evaluation	16
Testing	16
Evaluation Methods	15
Achievement Tests	14
Higher Education	14
Language Tests	14
Foreign Countries	13
Standardized Tests	13
Difficulty Level	12
Test Format	12
Test Use	11
English (Second Language)	10
Language Proficiency	10
Multiple Choice Tests	9
Reading Tests	9
Test Bias	9

Publication Type

Reports - Descriptive	106
Journal Articles	58
Numerical/Quantitative Data	10
Speeches/Meeting Papers	7
Tests/Questionnaires	7
Opinion Papers	4
Guides - Non-Classroom	3
Reports - Research	2
Collected Works - Serials	1
Guides - Classroom - Teacher	1
Information Analyses	1
Reports - Evaluative	1

Education Level

Elementary Education	8
Grade 5	7
Higher Education	7
Secondary Education	7
Elementary Secondary Education	6
Middle Schools	6
Grade 4	5
Grade 6	5
Grade 7	5
High Schools	5
Junior High Schools	5
Postsecondary Education	5
Grade 3	4
Early Childhood Education	3
Grade 8	3
Grade 9	3
Intermediate Grades	3
Primary Education	3
Grade 1	1
Grade 10	1
Grade 2	1
Kindergarten	1

Audience

Practitioners	7
Teachers	6
Administrators	5
Researchers	4
Community	1
Parents	1

Location

Australia	2
Florida	2
Japan	2
Massachusetts	2
Missouri	2
New Mexico	2
Tennessee	2
Georgia	1
Idaho	1
India	1
Kuwait	1
Mexico	1
Nebraska	1
New York	1
Oregon	1
Philippines	1
Puerto Rico	1
United Kingdom	1
Washington	1

Laws, Policies, & Programs

Comprehensive Education…	2
No Child Left Behind Act 2001	2

Assessments and Surveys

Program for International…	3
SAT (College Admission Test)	2
Test of English as a Foreign…	2
Measures of Academic Progress	1
National Assessment of…	1
National Teacher Examinations	1
New York State Regents…	1
North Carolina End of Course…	1
Stanford Achievement Tests	1
Test of English for…	1

Showing 1 to 15 of 106 results

Using Content Relevance and Representativeness Indices in Instrument Revision

Peer reviewed

Anne Traynor; Sara C. Christopherson – Applied Measurement in Education, 2024

Combining methods from earlier content validity and more contemporary content alignment studies may allow a more complete evaluation of the meaning of test scores than if either set of methods is used on its own. This article distinguishes item relevance indices in the content validity literature from test representativeness indices in the…

Descriptors: Test Validity, Test Items, Achievement Tests, Test Construction

TOEFL iBT® Technical Manual. TOEFL® Research Series. RR-106. ETS Research Report. RR-25-12

Peer reviewed
PDF on ERIC

Venessa F. Manna; Shuhong Li; Spiros Papageorgiou; Lixiong Gu – ETS Research Report Series, 2025

This technical manual describes the purpose and intended uses of the TOEFL iBT test, its target test-taker population, and relevant language use domains. The test design and scoring procedures are presented first, followed by a research agenda intended to support the interpretation and use of test scores. Given the updates to the test starting…

Descriptors: Second Language Learning, English (Second Language), Language Tests, Test Construction

Supporting the Interpretive Validity of Student-Level Claims in Science Assessment with Tiered Claim Structures

Peer reviewed

Student, Sanford R.; Gong, Brian – Educational Measurement: Issues and Practice, 2022

We address two persistent challenges in large-scale assessments of the Next Generation Science Standards: (a) the validity of score interpretations that target the standards broadly and (b) how to structure claims for assessments of this complex domain. The NGSS pose a particular challenge for specifying claims about students that evidence from…

Descriptors: Science Tests, Test Validity, Test Items, Test Construction

Item Pool Quality Control in Educational Testing: Change Point Model, Compound Risk, and Sequential Detection

Peer reviewed

Chen, Yunxiao; Lee, Yi-Hsuan; Li, Xiaoou – Journal of Educational and Behavioral Statistics, 2022

In standardized educational testing, test items are reused in multiple test administrations. To ensure the validity of test scores, the psychometric properties of items should remain unchanged over time. In this article, we consider the sequential monitoring of test items, in particular, the detection of abrupt changes to their psychometric…

Descriptors: Standardized Tests, Test Items, Test Validity, Scores

Argument-Based Validation in Practice: Examples from Mathematics Education

Peer reviewed

Krupa, Erin Elizabeth; Carney, Michele; Bostic, Jonathan – Applied Measurement in Education, 2019

This article provides a brief introduction to the set of four articles in the special issue. To provide a foundation for the issue, key terms are defined, a brief historical overview of validity is provided, and a description of several different validation approaches used in the issue are explained. Finally, the contribution of the articles to…

Descriptors: Test Items, Program Validation, Test Validity, Mathematics Education

Establishing Survey Validity: A Practical Guide

Peer reviewed
PDF on ERIC

Cobern, William W.; Adams, Betty A. J. – International Journal of Assessment Tools in Education, 2020

What follows is a practical guide for establishing the validity of a survey for research purposes. The motivation for providing this guide is our observation that researchers, not necessarily being survey researchers per se, but wanting to use a survey method, lack a concise resource on validity. There is far more to know about surveys and survey…

Descriptors: Surveys, Test Validity, Test Construction, Test Items

The Uses of Process Data in Large-Scale Educational Assessments. OECD Education Working Papers. No. 286

Maddox, Bryan – OECD Publishing, 2023

The digital transition in educational testing has introduced many new opportunities for technology to enhance large-scale assessments. These include the potential to collect and use log data on test-taker response processes routinely, and on a large scale. Process data has long been recognised as a valuable source of validation evidence in…

Descriptors: Measurement, Inferences, Test Reliability, Computer Assisted Testing

Diagnostic Test Construction: Insights from Cognitive Diagnostic Modeling

Peer reviewed
PDF on ERIC

Ketabi, Somaye; Alavi, Seyyed Mohammed; Ravand, Hamdollah – International Journal of Language Testing, 2021

Although Diagnostic Classification Models (DCMs) were introduced to education system decades ago, it seems that these models were not employed for the original aims upon which they had been designed. Using DCMs has been mostly common in analyzing large-scale non-diagnostic tests and these models have been rarely used in developing Cognitive…

Descriptors: Diagnostic Tests, Test Construction, Goodness of Fit, Classification

Revising the BioMedical Admissions Test (BMAT) to Improve Impact and Washback for Candidates and Support Fair Access to Test Preparation

Peer reviewed

McElwee, Sarah; Cheung, Kevin Y. F.; Cromie, Stephen R. T.; Shannon, Mark; Gallacher, Tom – Assessment in Education: Principles, Policy & Practice, 2021

The BioMedical Admissions Test (BMAT) has been used to select students for healthcare courses for 15 years. Recently, the candidature has included an increasing number of test takers who did not complete their schooling in the UK. In line with responsibilities to promote widening participation, a revision of the Section 2 Scientific Knowledge and…

Descriptors: Foreign Countries, Medical Education, College Admission, Medical Schools

English MAP Reading Fluency Technical Report: Based on Assessments Administered during the 2020-2021 School Year

NWEA, 2022

This technical report documents the processes and procedures employed by NWEA® to build and support the English MAP® Reading Fluency™ assessments administered during the 2020-2021 school year. It is written for measurement professionals and administrators to help evaluate the quality of MAP Reading Fluency. The seven sections of this report: (1)…

Descriptors: Achievement Tests, Reading Tests, Reading Achievement, Reading Fluency

Response Process Validity Studies of the Scale Literacy Skills Test

Peer reviewed

Trate, Jaclyn M.; Fisher, Victoria; Blecking, Anja; Geissinger, Peter; Murphy, Kristen L. – Journal of Chemical Education, 2019

Assessment and evaluation tools and instruments are developed to measure many things from content knowledge to misconceptions to student affect. The standard validation processes for these are regularly conducted and provide strong evidence for the validity of the measurements that are made. As part of the suite of validation tools available to…

Descriptors: Test Validity, Multiple Choice Tests, Chemistry, Science Tests

Adapting Paper-Based Tests for Computer Administration: Lessons Learned from 30 Years of Mode Effects Studies in Education

Peer reviewed
PDF on ERIC

Lynch, Sarah – Practical Assessment, Research & Evaluation, 2022

In today's digital age, tests are increasingly being delivered on computers. Many of these computer-based tests (CBTs) have been adapted from paper-based tests (PBTs). However, this change in mode of test administration has the potential to introduce construct-irrelevant variance, affecting the validity of score interpretations. Because of this,…

Descriptors: Computer Assisted Testing, Tests, Scores, Scoring

Distributed Item Review: Administrator User Guide. Technical Report #1603

Irvin, P. Shawn – Behavioral Research and Teaching, 2016

The Distributed Item Review (DIR) is a secure and flexible, web-based system designed to present test items to expert reviewers across a broad geographic area for evaluation of important dimensions of quality (e.g., alignment with standards, bias, sensitivity, and student accessibility). The DIR is comprised of essential features that allow system…

Descriptors: Test Items, Test Reviews, Test Validity, Guides

Cognitive Interviewing for Item Development: Validity Evidence Based on Content and Response Processes

Peer reviewed

Peterson, Christina Hamme; Peterson, N. Andrew; Powell, Kristen Gilmore – Measurement and Evaluation in Counseling and Development, 2017

Cognitive interviewing (CI) is a method to identify sources of confusion in assessment items and to assess validity evidence on the basis of content and response processes. We introduce readers to CI and describe a process for conducting such interviews and analyzing the results. Recommendations for best practice are provided.

Descriptors: Test Items, Test Construction, Interviews, Test Validity

Digital SAT® Research Summary

College Board, 2023

Over the past several years, content experts, psychometricians, and researchers have been hard at work developing, refining, and studying the digital SAT. The work is grounded in foundational best practices and advances in measurement and assessment design, with fairness for students informing all of the work done. This paper shares learnings from…

Descriptors: College Entrance Examinations, Psychometrics, Computer Assisted Testing, Best Practices


Source

Educational Measurement:…	4
Behavioral Research and…	3
Online Submission	3
Practical Assessment,…	3
Applied Measurement in…	2
Journal of Applied Testing…	2
Journal of Educational and…	2
Measurement and Evaluation in…	2
New Meridian Corporation	2
New Mexico Public Education…	2
OECD Publishing	2
AMATYC Review	1
Alberta Journal of…	1
American Journal of Sexuality…	1
Assessment in Education:…	1
Assessment in Higher Education	1
Astronomy Education Review	1
British Journal of…	1
Career and Technical…	1
Center for Assessment and…	1
College Board	1
College Board Review	1
Communication Disorders…	1
Communique	1
ETS Research Report Series	1

Author

Stansfield, Charles W.	5
Liu, Kimy	3
Ketterlin-Geller, Leanne R.	2
Lee, Yi-Hsuan	2
Petscher, Yaacov	2
Tindal, Gerald	2
Truckenmiller, Adrea	2
Abedi, Jamal	1
Adams, Betty A. J.	1
Ahmed, S.	1
Alavi, Seyyed Mohammed	1
Alonzo, Julie	1
Andersson, Luanne	1
Anne Traynor	1
Arth, Thomas O.	1
Baghaei, Purya	1
Baker, E. L.	1
Bardar, Erin M.	1
Barghaus, Katherine M.	1
Barry, G. Michael	1
Baxter, G. P.	1
Beddow, Peter A.	1
Benderson, Albert, Ed.	1
Bergstrom, Betty	1