ERIC - Search Results

Publication Date

In 2026	0
Since 2025	7
Since 2022 (last 5 years)	33
Since 2017 (last 10 years)	81
Since 2007 (last 20 years)	164

Descriptor

Test Reliability	175
Testing	175
Test Validity	128
Test Construction	61
Scoring	60
Foreign Countries	42
Language Tests	37
Scores	33
Psychometrics	31
Item Response Theory	26
Test Items	26
Evaluation Methods	23
Test Bias	23
Student Evaluation	22
English (Second Language)	21
Second Language Learning	20
Academic Achievement	19
Testing Accommodations	19
Computer Assisted Testing	18
Test Interpretation	18
Language Arts	16
Mathematics Tests	16
Standardized Tests	16
Achievement Tests	15
Error of Measurement	15
More ▼

Publication Type

Journal Articles	128
Reports - Research	68
Reports - Evaluative	57
Reports - Descriptive	31
Numerical/Quantitative Data	18
Tests/Questionnaires	8
Information Analyses	6
Guides - Non-Classroom	5
Guides - Classroom - Teacher	4
Opinion Papers	4
Books	3
Collected Works - General	2
Dissertations/Theses -…	2
Guides - General	2
Speeches/Meeting Papers	2
Collected Works - Proceedings	1
More ▼

Education Level

Higher Education	30
Elementary Education	29
Secondary Education	26
Postsecondary Education	25
Early Childhood Education	23
Junior High Schools	19
Middle Schools	18
Primary Education	18
Elementary Secondary Education	17
Grade 7	16
Grade 6	15
Intermediate Grades	15
Grade 5	14
Grade 3	13
Grade 4	13
Grade 8	12
High Schools	9
Kindergarten	9
Grade 9	5
Preschool Education	5
Adult Education	3
Grade 1	3
Grade 10	2
Grade 11	2
Adult Basic Education	1
More ▼

Audience

Teachers	5
Administrators	2
Policymakers	1
Practitioners	1
Students	1

Location

New York	9
China	5
Illinois	5
Turkey	4
Australia	3
Canada	3
Indonesia	3
Maryland	3
Nebraska	3
Russia	3
United Kingdom	3
Bangladesh	2
Europe	2
Iran	2
Malaysia	2
Pennsylvania	2
South Africa	2
Texas	2
United Kingdom (England)	2
Washington	2
California	1
Colombia	1
Cyprus	1
Delaware	1
Florida	1
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001	3
Elementary and Secondary…	1
Every Student Succeeds Act…	1
Individuals with Disabilities…	1
Individuals with Disabilities…	1
Individuals with Disabilities…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 175 results Save | Export

Modeling the Intraindividual Relation of Ability and Speed within a Test

Peer reviewed

Direct link

Augustin Mutak; Robert Krause; Esther Ulitzsch; Sören Much; Jochen Ranger; Steffi Pohl – Journal of Educational Measurement, 2024

Understanding the intraindividual relation between an individual's speed and ability in testing scenarios is essential to assure a fair assessment. Different approaches exist for estimating this relationship, that either rely on specific study designs or on specific assumptions. This paper aims to add to the toolbox of approaches for estimating…

Descriptors: Testing, Academic Ability, Time on Task, Correlation

A Theoretical Suggestion on Testing Measurement Invariance in Adapting Parametric Measurement Tools

Peer reviewed
PDF on ERIC

Download full text

Gökhan Iskifoglu – Turkish Online Journal of Educational Technology - TOJET, 2024

This research paper investigated the importance of conducting measurement invariance analysis in developing measurement tools for assessing differences between and among study variables. Most of the studies, which tended to develop an inventory to assess the existence of an attitude, behavior, belief, IQ, or an intuition in a person's…

Descriptors: Testing, Testing Problems, Error of Measurement, Attitude Measures

A Practical Comparison of Decision Consistency Estimates

Peer reviewed
PDF on ERIC

Download full text

Amanda A. Wolkowitz; Russell Smith – Practical Assessment, Research & Evaluation, 2024

A decision consistency (DC) index is an estimate of the consistency of a classification decision on an exam. More specifically, DC estimates the percentage of examinees that would have the same classification decision on an exam if they were to retake the same or a parallel form of the exam again without memory of taking the exam the first time.…

Descriptors: Testing, Test Reliability, Replication (Evaluation), Decision Making

The Sensitivity of Value-Added Estimates to Test Scoring Decisions. EdWorkingPaper No. 25-1226

Download full text

Joshua B. Gilbert; James G. Soland; Benjamin W. Domingue – Annenberg Institute for School Reform at Brown University, 2025

Value-Added Models (VAMs) are both common and controversial in education policy and accountability research. While the sensitivity of VAMs to model specification and covariate selection is well documented, the extent to which test scoring methods (e.g., mean scores vs. IRT-based scores) may affect VA estimates is less studied. We examine the…

Descriptors: Value Added Models, Tests, Testing, Scoring

Test Review: Raven's 2 Progressive Matrices, Clinical Edition (Raven's 2)

Peer reviewed

Direct link

McLeod, Justin W.H.; McCrimmon, Adam W. – Journal of Psychoeducational Assessment, 2021

The "Raven's 2 Progressive Matrices Clinical Edition" (Raven's 2; Raven, Rust, Chan, & Zhou, 2018), published by NCS Pearson, is an individually administered nonverbal assessment of general cognitive ability developed to measure "educative abilities," defined as the ability to think clearly and solve complex problems in…

Descriptors: Test Reviews, Intelligence Tests, Testing, Test Reliability

Selecting Technically Adequate Tests

Peer reviewed

Direct link

Susan K. Johnsen – Gifted Child Today, 2024

The author provides a checklist for educators who are selecting technically adequate tests for identifying and referring students for gifted education services and programs. The checklist includes questions related to how the test was normed, reliability and validity studies as well as questions related to types of scores, administration, and…

Descriptors: Test Selection, Academically Gifted, Gifted Education, Test Validity

TOEFL iBT® Technical Manual. TOEFL® Research Series. RR-106. ETS Research Report. RR-25-12

Peer reviewed
PDF on ERIC

Download full text

Venessa F. Manna; Shuhong Li; Spiros Papageorgiou; Lixiong Gu – ETS Research Report Series, 2025

This technical manual describes the purpose and intended uses of the TOEFL iBT test, its target test-taker population, and relevant language use domains. The test design and scoring procedures are presented first, followed by a research agenda intended to support the interpretation and use of test scores. Given the updates to the test starting…

Descriptors: Second Language Learning, English (Second Language), Language Tests, Test Construction

Assessments Play an Important Role in Serving Students. What's Next: Policy Recommendations from the George W. Bush Institute

Download full text

Anne Wicks; Robin Berkley – George W. Bush Institute, 2025

Assessments are one of the most important--and often misunderstood--elements of education. In most cases, tests are administered by the state as well as by districts and schools. Assessments at each of these levels have distinct purposes, yield different information, and are part of a powerful, coordinated approach to improving student outcomes.…

Descriptors: Student Evaluation, Testing, Tests, Standardized Tests

Parents Can Accurately and Reliably Administer an Online Dyslexia Evaluation Tool

Peer reviewed

Direct link

Hurford, David P.; Wines, Autumn – Australian Journal of Learning Difficulties, 2022

The purpose of the present study was to examine the potential that parents could effectively administer an online dyslexia evaluation tool (ODET) to their children. To this end, four groups consisting of parents and trained staff were compared. Sixty-three children (36 females and 27 males) participated. The children in each group were assessed…

Descriptors: Test Reliability, Computer Assisted Testing, Dyslexia, Screening Tests

The Use of ChatGPT in Assessment

Peer reviewed
PDF on ERIC

Download full text

Mehmet Kanik – International Journal of Assessment Tools in Education, 2024

ChatGPT has surged interest to cause people to look for its use in different tasks. However, before allowing it to replace humans, its capabilities should be investigated. As ChatGPT has potential for use in testing and assessment, this study aims to investigate the questions generated by ChatGPT by comparing them to those written by a course…

Descriptors: Artificial Intelligence, Testing, Multiple Choice Tests, Test Construction

Examining the Relationship between Randomization Strategies and Control Group Crossover in Higher Education Interventions. EdWorkingPaper No. 24-1083

Download full text

Catherine Mata; Katharine Meyer; Lindsay Page – Annenberg Institute for School Reform at Brown University, 2024

This article examines the risk of crossover contamination in individual-level randomization, a common concern in experimental research, in the context of a large-enrollment college course. While individual-level randomization is more efficient for assessing program effectiveness, it also increases the potential for control group students to cross…

Descriptors: Chemistry, Science Instruction, Undergraduate Students, Large Group Instruction

Can the Oral Proficiency Interview -- Computer (ACTFL OPIc) Be Used Instead of the Oral Proficiency Interview (ACTFL OPI)? An Aligned Rank Transform (ART) Analysis

Peer reviewed

Direct link

Troy L. Cox; Gregory L. Thompson; Steven S. Stokes – Foreign Language Annals, 2025

This study investigated the differences between the ACTFL Oral Proficiency Interview (OPI) and the ACTFL Oral Proficiency Interview - Computer (OPIc) among Spanish learners at a U.S. university. Participants (N = 154) were randomly assigned to take both tests in a counterbalanced order to mitigate test order effects. Data were analyzed using an…

Descriptors: Oral Language, Language Proficiency, Interviews, Computer Uses in Education

Initial Evidence Supporting Interpretations of Scores from the Enhanced ACT Test. ACT Research. Research Report. R2425

Download full text

Jeff Allen; Ty Cruce – ACT Education Corp., 2025

This report summarizes some of the evidence supporting interpretations of scores from the enhanced ACT, focusing on reliability, concurrent validity, predictive validity, and score comparability. The authors argue that the evidence presented in this report supports the interpretation of scores from the enhanced ACT as measures of high school…

Descriptors: College Entrance Examinations, Testing, Change, Scores

Using Multilabel Neural Network to Score High-Dimensional Assessments for Different Use Foci: An Example with College Major Preference Assessment

Peer reviewed

Direct link

Shun-Fu Hu; Amery D. Wu; Jake Stone – Journal of Educational Measurement, 2025

Scoring high-dimensional assessments (e.g., > 15 traits) can be a challenging task. This paper introduces the multilabel neural network (MNN) as a scoring method for high-dimensional assessments. Additionally, it demonstrates how MNN can score the same test responses to maximize different performance metrics, such as accuracy, recall, or…

Descriptors: Tests, Testing, Scores, Test Construction

The Effects of Attentional Focus on Test-Retest Reliability of Jumping Tasks

Peer reviewed

Direct link

Makaruk, Hubert; Porter, Jared M.; Cieslinski, Igor – Measurement in Physical Education and Exercise Science, 2021

This study examined the test-retest reliability of the standing long jump (SLJ) and the countermovement jump (CMJ) following consistent and non-consistent attentional focus cuing instructions in physically active young adults (n = 30). The systematic error (as standardize change in mean), random error (as typical error), the Bland and Altman…

Descriptors: Attention Control, Test Reliability, Performance Tests, Physical Activities

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12

Journal of Psychoeducational…	21
Language Testing	8
New York State Education…	8
Canadian Journal of School…	7
ETS Research Report Series	4
International Journal of…	4
Journal of Educational…	4
Online Submission	4
Partnership for Assessment of…	4
Regional Educational…	4
Educational Measurement:…	3
Language Assessment Quarterly	3
Measurement in Physical…	3
Nebraska Department of…	3
Practical Assessment,…	3
Advances in Language and…	2
Annenberg Institute for…	2
Child Abuse & Neglect: The…	2
Early Child Development and…	2
New Meridian Corporation	2
ProQuest LLC	2
Research Quarterly for…	2
ACT Education Corp.	1
ACT, Inc.	1
Administration and Policy in…	1
More ▼

McCrimmon, Adam W.	6
Ackerman, Debra J.	2
Dickens, Rachel H.	2
Dorans, Neil J.	2
Dunne, Michael P.	2
Hamid, M. Obaidul	2
Hurford, David P.	2
Meisinger, Elizabeth B.	2
Mislevy, Robert J.	2
Nordstokke, David W.	2
Pinder, Patrice Juliet	2
Runyan, Desmond K.	2
Salmani-Nodoushan, Mohammad…	2
Tarar, Jessica M.	2
Zolotor, Adam J.	2
Abdullah, Saifuddin Kumar	1
Ahmed, Md. Kawser	1
Ajjawi, Rola	1
Al Hajri, Fatma	1
Al-Tamimi, Mohammad	1
Albers, Craig A.	1
Alderson, J. Charles	1
Alfonso, Vincent C.	1
Ali, Md. Maksud	1
Alper Gülay	1
More ▼

ACT Assessment	3
Measures of Academic Progress	3
Raven Progressive Matrices	3
Wechsler Intelligence Scale…	3
Battelle Developmental…	2
Bayley Scales of Infant…	2
Clinical Evaluation of…	2
National Assessment of…	2
Wechsler Adult Intelligence…	2
Woodcock Johnson Tests of…	2
ACTFL Oral Proficiency…	1
Autism Diagnostic Observation…	1
Beck Anxiety Inventory	1
Beery Developmental Test of…	1
Block Design Test	1
Center for Epidemiologic…	1
Classroom Assessment Scoring…	1
Denver Developmental…	1
Developmental Indicators for…	1
Florida Comprehensive…	1
Gates MacGinitie Reading Tests	1
General Educational…	1
Graduate Record Examinations	1
Gray Oral Reading Test	1
Infant Toddler Environment…	1
More ▼