ERIC - Search Results

Publication Date

In 2026	0
Since 2025	7
Since 2022 (last 5 years)	33
Since 2017 (last 10 years)	81
Since 2007 (last 20 years)	164

Descriptor

Test Reliability	778
Testing	778
Test Validity	492
Test Construction	236
Test Interpretation	146
Scoring	138
Language Tests	103
Standardized Tests	96
Evaluation Methods	90
Higher Education	87
Foreign Countries	83
Elementary Secondary Education	82
Achievement Tests	78
Student Evaluation	77
Testing Problems	76
Measurement Techniques	73
Scores	71
English (Second Language)	69
Tests	66
Test Format	65
Comparative Analysis	61
Item Analysis	61
Test Items	61
Statistical Analysis	60
Second Language Learning	59
More ▼

Education Level

Higher Education	31
Elementary Education	29
Postsecondary Education	26
Secondary Education	26
Early Childhood Education	23
Junior High Schools	19
Middle Schools	18
Primary Education	18
Elementary Secondary Education	17
Grade 7	16
Grade 6	15
Intermediate Grades	15
Grade 5	14
Grade 3	13
Grade 4	13
Grade 8	12
High Schools	10
Kindergarten	9
Adult Education	5
Grade 9	5
Preschool Education	5
Grade 1	3
Grade 10	2
Grade 11	2
Adult Basic Education	1
More ▼

Audience

Practitioners	34
Teachers	22
Administrators	8
Researchers	6
Counselors	2
Policymakers	2
Students	2

Location

Canada	10
New York	10
Australia	6
United Kingdom	6
United Kingdom (England)	6
China	5
Illinois	5
Japan	5
United Kingdom (Great Britain)	5
Iran	4
Ohio	4
Pennsylvania	4
Turkey	4
Indonesia	3
Malaysia	3
Maryland	3
Nebraska	3
Russia	3
Bangladesh	2
California	2
Europe	2
India	2
New York (New York)	2
New Zealand	2
South Africa	2
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001	4
Elementary and Secondary…	3
Bilingual Education Act 1968	1
Education for All Handicapped…	1
Elementary and Secondary…	1
Elementary and Secondary…	1
Elementary and Secondary…	1
Every Student Succeeds Act…	1
Individuals with Disabilities…	1
Individuals with Disabilities…	1
Individuals with Disabilities…	1
More ▼

What Works Clearinghouse Rating

Showing 1 to 15 of 778 results Save | Export

Modeling the Intraindividual Relation of Ability and Speed within a Test

Peer reviewed

Direct link

Augustin Mutak; Robert Krause; Esther Ulitzsch; Sören Much; Jochen Ranger; Steffi Pohl – Journal of Educational Measurement, 2024

Understanding the intraindividual relation between an individual's speed and ability in testing scenarios is essential to assure a fair assessment. Different approaches exist for estimating this relationship, that either rely on specific study designs or on specific assumptions. This paper aims to add to the toolbox of approaches for estimating…

Descriptors: Testing, Academic Ability, Time on Task, Correlation

A Theoretical Suggestion on Testing Measurement Invariance in Adapting Parametric Measurement Tools

Peer reviewed
PDF on ERIC

Download full text

Gökhan Iskifoglu – Turkish Online Journal of Educational Technology - TOJET, 2024

This research paper investigated the importance of conducting measurement invariance analysis in developing measurement tools for assessing differences between and among study variables. Most of the studies, which tended to develop an inventory to assess the existence of an attitude, behavior, belief, IQ, or an intuition in a person's…

Descriptors: Testing, Testing Problems, Error of Measurement, Attitude Measures

A Practical Comparison of Decision Consistency Estimates

Peer reviewed
PDF on ERIC

Download full text

Amanda A. Wolkowitz; Russell Smith – Practical Assessment, Research & Evaluation, 2024

A decision consistency (DC) index is an estimate of the consistency of a classification decision on an exam. More specifically, DC estimates the percentage of examinees that would have the same classification decision on an exam if they were to retake the same or a parallel form of the exam again without memory of taking the exam the first time.…

Descriptors: Testing, Test Reliability, Replication (Evaluation), Decision Making

The Sensitivity of Value-Added Estimates to Test Scoring Decisions. EdWorkingPaper No. 25-1226

Download full text

Joshua B. Gilbert; James G. Soland; Benjamin W. Domingue – Annenberg Institute for School Reform at Brown University, 2025

Value-Added Models (VAMs) are both common and controversial in education policy and accountability research. While the sensitivity of VAMs to model specification and covariate selection is well documented, the extent to which test scoring methods (e.g., mean scores vs. IRT-based scores) may affect VA estimates is less studied. We examine the…

Descriptors: Value Added Models, Tests, Testing, Scoring

Test Review: Raven's 2 Progressive Matrices, Clinical Edition (Raven's 2)

Peer reviewed

Direct link

McLeod, Justin W.H.; McCrimmon, Adam W. – Journal of Psychoeducational Assessment, 2021

The "Raven's 2 Progressive Matrices Clinical Edition" (Raven's 2; Raven, Rust, Chan, & Zhou, 2018), published by NCS Pearson, is an individually administered nonverbal assessment of general cognitive ability developed to measure "educative abilities," defined as the ability to think clearly and solve complex problems in…

Descriptors: Test Reviews, Intelligence Tests, Testing, Test Reliability

Selecting Technically Adequate Tests

Peer reviewed

Direct link

Susan K. Johnsen – Gifted Child Today, 2024

The author provides a checklist for educators who are selecting technically adequate tests for identifying and referring students for gifted education services and programs. The checklist includes questions related to how the test was normed, reliability and validity studies as well as questions related to types of scores, administration, and…

Descriptors: Test Selection, Academically Gifted, Gifted Education, Test Validity

TOEFL iBT® Technical Manual. TOEFL® Research Series. RR-106. ETS Research Report. RR-25-12

Peer reviewed
PDF on ERIC

Download full text

Venessa F. Manna; Shuhong Li; Spiros Papageorgiou; Lixiong Gu – ETS Research Report Series, 2025

This technical manual describes the purpose and intended uses of the TOEFL iBT test, its target test-taker population, and relevant language use domains. The test design and scoring procedures are presented first, followed by a research agenda intended to support the interpretation and use of test scores. Given the updates to the test starting…

Descriptors: Second Language Learning, English (Second Language), Language Tests, Test Construction

Assessments Play an Important Role in Serving Students. What's Next: Policy Recommendations from the George W. Bush Institute

Download full text

Anne Wicks; Robin Berkley – George W. Bush Institute, 2025

Assessments are one of the most important--and often misunderstood--elements of education. In most cases, tests are administered by the state as well as by districts and schools. Assessments at each of these levels have distinct purposes, yield different information, and are part of a powerful, coordinated approach to improving student outcomes.…

Descriptors: Student Evaluation, Testing, Tests, Standardized Tests

Parents Can Accurately and Reliably Administer an Online Dyslexia Evaluation Tool

Peer reviewed

Direct link

Hurford, David P.; Wines, Autumn – Australian Journal of Learning Difficulties, 2022

The purpose of the present study was to examine the potential that parents could effectively administer an online dyslexia evaluation tool (ODET) to their children. To this end, four groups consisting of parents and trained staff were compared. Sixty-three children (36 females and 27 males) participated. The children in each group were assessed…

Descriptors: Test Reliability, Computer Assisted Testing, Dyslexia, Screening Tests

The Use of ChatGPT in Assessment

Peer reviewed
PDF on ERIC

Download full text

Mehmet Kanik – International Journal of Assessment Tools in Education, 2024

ChatGPT has surged interest to cause people to look for its use in different tasks. However, before allowing it to replace humans, its capabilities should be investigated. As ChatGPT has potential for use in testing and assessment, this study aims to investigate the questions generated by ChatGPT by comparing them to those written by a course…

Descriptors: Artificial Intelligence, Testing, Multiple Choice Tests, Test Construction

Examining the Relationship between Randomization Strategies and Control Group Crossover in Higher Education Interventions. EdWorkingPaper No. 24-1083

Download full text

Catherine Mata; Katharine Meyer; Lindsay Page – Annenberg Institute for School Reform at Brown University, 2024

This article examines the risk of crossover contamination in individual-level randomization, a common concern in experimental research, in the context of a large-enrollment college course. While individual-level randomization is more efficient for assessing program effectiveness, it also increases the potential for control group students to cross…

Descriptors: Chemistry, Science Instruction, Undergraduate Students, Large Group Instruction

Can the Oral Proficiency Interview -- Computer (ACTFL OPIc) Be Used Instead of the Oral Proficiency Interview (ACTFL OPI)? An Aligned Rank Transform (ART) Analysis

Peer reviewed

Direct link

Troy L. Cox; Gregory L. Thompson; Steven S. Stokes – Foreign Language Annals, 2025

This study investigated the differences between the ACTFL Oral Proficiency Interview (OPI) and the ACTFL Oral Proficiency Interview - Computer (OPIc) among Spanish learners at a U.S. university. Participants (N = 154) were randomly assigned to take both tests in a counterbalanced order to mitigate test order effects. Data were analyzed using an…

Descriptors: Oral Language, Language Proficiency, Interviews, Computer Uses in Education

Initial Evidence Supporting Interpretations of Scores from the Enhanced ACT Test. ACT Research. Research Report. R2425

Download full text

Jeff Allen; Ty Cruce – ACT Education Corp., 2025

This report summarizes some of the evidence supporting interpretations of scores from the enhanced ACT, focusing on reliability, concurrent validity, predictive validity, and score comparability. The authors argue that the evidence presented in this report supports the interpretation of scores from the enhanced ACT as measures of high school…

Descriptors: College Entrance Examinations, Testing, Change, Scores

Using Multilabel Neural Network to Score High-Dimensional Assessments for Different Use Foci: An Example with College Major Preference Assessment

Peer reviewed

Direct link

Shun-Fu Hu; Amery D. Wu; Jake Stone – Journal of Educational Measurement, 2025

Scoring high-dimensional assessments (e.g., > 15 traits) can be a challenging task. This paper introduces the multilabel neural network (MNN) as a scoring method for high-dimensional assessments. Additionally, it demonstrates how MNN can score the same test responses to maximize different performance metrics, such as accuracy, recall, or…

Descriptors: Tests, Testing, Scores, Test Construction

The Effects of Attentional Focus on Test-Retest Reliability of Jumping Tasks

Peer reviewed

Direct link

Makaruk, Hubert; Porter, Jared M.; Cieslinski, Igor – Measurement in Physical Education and Exercise Science, 2021

This study examined the test-retest reliability of the standing long jump (SLJ) and the countermovement jump (CMJ) following consistent and non-consistent attentional focus cuing instructions in physically active young adults (n = 30). The systematic error (as standardize change in mean), random error (as typical error), the Bland and Altman…

Descriptors: Attention Control, Test Reliability, Performance Tests, Physical Activities

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 52

Diagnostique	27
Journal of Psychoeducational…	21
Journal of Educational…	15
Language Testing	13
Educational and Psychological…	12
New York State Education…	8
Canadian Journal of School…	7
System	6
American Journal of Mental…	5
Language Learning	5
ETS Research Report Series	4
Exceptional Children	4
International Journal of…	4
Journal of Experimental…	4
Journal of School Psychology	4
Language Assessment Quarterly	4
Online Submission	4
Partnership for Assessment of…	4
Psychology in the Schools	4
Psychometrika	4
Regional Educational…	4
Review of Educational Research	4
Academic Medicine	3
Canadian Modern Language…	3
Early Child Development and…	3
More ▼

McCrimmon, Adam W.	6
Weiss, David J.	6
Kapes, Jerome T.	4
Alderson, J. Charles	3
Bennett, Randy Elliot	3
Gallas, Edwin J.	3
Guthrie, P. D.	3
Hambleton, Ronald K.	3
Renzulli, Joseph S.	3
Rippey, Robert M.	3
Stansfield, Charles W.	3
Ackerman, Debra J.	2
Allen, Thomas E.	2
Baker, Eva L.	2
Betz, Nancy E.	2
Brown, James Dean	2
Dickens, Rachel H.	2
Dorans, Neil J.	2
Dunne, Michael P.	2
Ebel, Robert L.	2
Feldt, Leonard S.	2
Fernandes, Kathleen	2
Hakstian, A. Ralph	2
Hamid, M. Obaidul	2
More ▼

Journal Articles	291
Reports - Research	248
Reports - Evaluative	93
Reports - Descriptive	88
Speeches/Meeting Papers	55
Guides - Non-Classroom	45
Tests/Questionnaires	43
Opinion Papers	30
Information Analyses	28
Numerical/Quantitative Data	20
Guides - Classroom - Teacher	11
Books	8
Guides - General	8
Collected Works - General	6
Reference Materials -…	6
Collected Works - Proceedings	3
Collected Works - Serials	3
Reference Materials - General	3
Dissertations/Theses -…	2
ERIC Digests in Full Text	2
ERIC Publications	2
Dissertations/Theses -…	1
Legal/Legislative/Regulatory…	1
More ▼

Peabody Picture Vocabulary…	6
Wechsler Intelligence Scale…	6
Illinois Test of…	5
Bayley Scales of Infant…	4
Raven Progressive Matrices	4
Self Directed Search	4
Stanford Achievement Tests	4
Test of English as a Foreign…	4
Wechsler Adult Intelligence…	4
ACT Assessment	3
Developmental Indicators for…	3
General Aptitude Test Battery	3
General Educational…	3
Measures of Academic Progress	3
Minnesota Multiphasic…	3
National Assessment of…	3
SAT (College Admission Test)	3
Stanford Binet Intelligence…	3
Vineland Adaptive Behavior…	3
Wechsler Preschool and…	3
Woodcock Johnson Tests of…	3
ACTFL Oral Proficiency…	2
Battelle Developmental…	2
Clinical Evaluation of…	2
Coopersmith Self Esteem…	2
More ▼