ERIC - Search Results

Publication Date

In 2026	1
Since 2025	584
Since 2022 (last 5 years)	2521
Since 2017 (last 10 years)	5556
Since 2007 (last 20 years)	9152

Descriptor

Test Validity	21728
Test Reliability	9984
Test Construction	5869
Foreign Countries	4932
Psychometrics	2950
Factor Analysis	2937
Measures (Individuals)	2369
Higher Education	2247
Evaluation Methods	2083
College Students	1809
Correlation	1719
Questionnaires	1631
Elementary Secondary Education	1613
Scores	1568
Factor Structure	1548
Student Evaluation	1489
Test Items	1415
Student Attitudes	1336
Language Tests	1232
Screening Tests	1198
Measurement Techniques	1185
Comparative Analysis	1181
Elementary School Students	1171
Academic Achievement	1120
Standardized Tests	1104
More ▼

Publication Type

Journal Articles	14073
Reports - Research	13346
Reports - Evaluative	2392
Speeches/Meeting Papers	1717
Tests/Questionnaires	1347
Reports - Descriptive	1014
Information Analyses	819
Opinion Papers	801
Dissertations/Theses -…	377
Guides - Non-Classroom	261
Numerical/Quantitative Data	193
Books	103
Reports - General	90
Guides - Classroom - Teacher	71
Reference Materials -…	66
Collected Works - Proceedings	57
Guides - General	52
Collected Works - General	49
Collected Works - Serials	39
Book/Product Reviews	33
Dissertations/Theses	32
ERIC Publications	30
Legal/Legislative/Regulatory…	29
Non-Print Media	21
ERIC Digests in Full Text	19
More ▼

Education Level

Higher Education	2802
Postsecondary Education	2286
Secondary Education	1637
Elementary Education	1466
High Schools	746
Middle Schools	704
Early Childhood Education	598
Elementary Secondary Education	566
Junior High Schools	521
Primary Education	296
Intermediate Grades	292
Grade 5	254
Preschool Education	254
Grade 4	250
Grade 8	248
Grade 6	239
Grade 7	208
Grade 3	191
Kindergarten	185
Grade 1	143
Grade 2	117
Grade 9	116
Adult Education	93
Grade 10	91
Grade 11	73
More ▼

Audience

Researchers	728
Practitioners	429
Teachers	142
Administrators	96
Policymakers	57
Counselors	36
Students	20
Parents	13
Community	7
Support Staff	6
Media Staff	2
More ▼

Location

Turkey	799
Australia	347
Canada	324
China	300
United States	188
Indonesia	170
Spain	168
United Kingdom	160
Netherlands	158
California	155
Germany	153
Taiwan	143
Iran	123
United Kingdom (England)	113
Florida	107
Hong Kong	104
Texas	98
Japan	95
Malaysia	93
South Korea	88
India	85
Israel	85
New York	82
Italy	79
Pennsylvania	74
More ▼

What Works Clearinghouse Rating

Meets WWC Standards without Reservations	1
Meets WWC Standards with or without Reservations	3
Does not meet standards	1

Showing 1 to 15 of 21,728 results Save | Export

A Review of Automatic Item Generation Techniques Leveraging Large Language Models

Peer reviewed
PDF on ERIC

Download full text

Bin Tan; Nour Armoush; Elisabetta Mazzullo; Okan Bulut; Mark J. Gierl – International Journal of Assessment Tools in Education, 2025

This study reviews existing research on the use of large language models (LLMs) for automatic item generation (AIG). We performed a comprehensive literature search across seven research databases, selected studies based on predefined criteria, and summarized 60 relevant studies that employed LLMs in the AIG process. We identified the most commonly…

Descriptors: Artificial Intelligence, Test Items, Automation, Test Format

Development and Validation of Kindergarten Dynamic Assessments of Early Reading and Language

Peer reviewed

Direct link

Eunsoo Cho; Mina Son; Sarah Reiley; Eun Ha Kim – Language, Speech, and Hearing Services in Schools, 2025

Purpose: The purpose of this study was to develop and evaluate the initial reliability and validity evidence of the dynamic assessment (DA) of early reading and language as a second-stage screener in kindergarten, the first year of formal schooling. The DA comprises three subtests that capture students' ability to learn letter sounds and blending…

Descriptors: Alternative Assessment, Test Construction, Test Validity, Kindergarten

Development of an Achievement Test on Organic Substances within the Scope of the 9th-Grade Biology Course

Peer reviewed
PDF on ERIC

Download full text

Meryem Konu Kadirhanogullari; Esra Özay Köse – Science Insights Education Frontiers, 2025

This study aims to develop a valid and reliable achievement test in accordance with the content framework of the 9th-grade Biology Course Curriculum published within the scope of the Turkish Century Maarif Model on the subject of "Organic Matter". The screening method was used for this purpose. The sample of the study consists of 258…

Descriptors: Science Tests, Test Construction, Grade 9, Biology

Test Review: Computer-Based English Listening and Speaking Test (CELST) of National Matriculation English Test (NMET) Guangdong Version in China

Peer reviewed

Direct link

Ying Xu; Xiaodong Li; Jin Chen – Language Testing, 2025

This article provides a detailed review of the Computer-based English Listening Speaking Test (CELST) used in Guangdong, China, as part of the National Matriculation English Test (NMET) to assess students' English proficiency. The CELST measures listening and speaking skills as outlined in the "English Curriculum for Senior Middle…

Descriptors: Computer Assisted Testing, English (Second Language), Language Tests, Listening Comprehension Tests

TOEFL iBT® Technical Manual. TOEFL® Research Series. RR-106. ETS Research Report. RR-25-12

Peer reviewed
PDF on ERIC

Download full text

Venessa F. Manna; Shuhong Li; Spiros Papageorgiou; Lixiong Gu – ETS Research Report Series, 2025

This technical manual describes the purpose and intended uses of the TOEFL iBT test, its target test-taker population, and relevant language use domains. The test design and scoring procedures are presented first, followed by a research agenda intended to support the interpretation and use of test scores. Given the updates to the test starting…

Descriptors: Second Language Learning, English (Second Language), Language Tests, Test Construction

Ratings of Students' Stress: Initial Reliability and Validity Evidence for a Brief Stress and Resilience Assessment

Peer reviewed

Direct link

Christopher J. Anthony; Stephen N. Elliott – School Mental Health, 2025

Stress is a complex construct that is related to resilience and general health starting in childhood. Despite its importance for student health and well-being, there are few measures of stress designed for school-based applications. In this study, we developed and initially validated a Stress Indicators Scale using five samples of teachers,…

Descriptors: Test Construction, Stress Variables, Test Validity, Test Items

Development of a Four-Tier Diagnostic Test for Misconceptions in Natural Science of Primary School Pupils

Peer reviewed
PDF on ERIC

Download full text

Anatri Desstya; Ika Candra Sayekti; Muhammad Abduh; Sukartono – Journal of Turkish Science Education, 2025

This study aimed to develop a standardised instrument for diagnosing science misconceptions in primary school children. Following a developmental research approach using the 4-D model (Define, Design, Develop, Disseminate), 100 four-tier multiple choice items were constructed. Content validity was established through expert evaluation by six…

Descriptors: Test Construction, Science Tests, Science Instruction, Diagnostic Tests

Using Content Relevance and Representativeness Indices in Instrument Revision

Peer reviewed

Direct link

Anne Traynor; Sara C. Christopherson – Applied Measurement in Education, 2024

Combining methods from earlier content validity and more contemporary content alignment studies may allow a more complete evaluation of the meaning of test scores than if either set of methods is used on its own. This article distinguishes item relevance indices in the content validity literature from test representativeness indices in the…

Descriptors: Test Validity, Test Items, Achievement Tests, Test Construction

A High-Stakes Reading Test as the White Listening Subject: Applying an Antiracist Validation Lens

Peer reviewed

Direct link

Jeanne Sinclair – Critical Inquiry in Language Studies, 2025

In this paper, the White listening subject takes the form of a standardized high-stakes reading test, the State of Texas Assessment of Academic Readiness (STAAR). Although the test does not actually listen, it 'hears' and evaluates children's responses to its questions. I present the results of the 2017 Grade 8 reading exams, from the March, May,…

Descriptors: High Stakes Tests, Standardized Tests, Reading Tests, Achievement Tests

A Systematic Review of Differential Item Functioning in Second Language Assessment

Peer reviewed

Direct link

Xueliang Chen; Vahid Aryadoust; Wenxin Zhang – Language Testing, 2025

The growing diversity among test takers in second or foreign language (L2) assessments makes the importance of fairness front and center. This systematic review aimed to examine how fairness in L2 assessments was evaluated through differential item functioning (DIF) analysis. A total of 83 articles from 27 journals were included in a systematic…

Descriptors: Second Language Learning, Language Tests, Test Items, Item Analysis

Inventory of Galilean Transformation of Uniform Linear Motion in Position-Time Graphs

Peer reviewed

Direct link

E.?B. Merki; S.?I. Hofer; A. Vaterlaus; A. Lichtenberger – Physical Review Physics Education Research, 2025

When describing motion in physics, the selection of a frame of reference is crucial. The graph of a moving object can look quite different based on the frame of reference. In recent years, various tests have been developed to assess the interpretation of kinematic graphs, but none of these tests have specifically addressed differences in reference…

Descriptors: Graphs, Motion, Physics, Secondary School Students

Developing and Validating a Biological System Thinking Test for Middle School Students

Peer reviewed

Direct link

Ruying Li; Gaofeng Li – International Journal of Science and Mathematics Education, 2025

Systems thinking (ST) is an essential competence for future life and biology learning. Appropriate assessment is critical for collecting sufficient information to develop ST in biology education. This research offers an ST framework based on a comprehensive understanding of biological systems, encompassing four skills across three complexity…

Descriptors: Test Construction, Test Validity, Science Tests, Cognitive Tests

Development and Initial Validation of the Computer-Based Orthographic Processing Assessment Short Form: An Application of Cognitive Diagnostic Modeling

Peer reviewed

Direct link

Yi-Jui I. Chen; Yi-Jhen Wu; Yi-Hsin Chen; Robin Irey – Journal of Psychoeducational Assessment, 2025

A short form of the 60-item computer-based orthographic processing assessment (long-form COPA or COPA-LF) was developed. The COPA-LF consists of five skills, including rapid perception, access, differentiation, correction, and arrangement. Thirty items from the COPA-LF were selected for the short-form COPA (COPA-SF) based on cognitive diagnostic…

Descriptors: Computer Assisted Testing, Test Length, Test Validity, Orthographic Symbols

NAEP Achievement Levels Validity Argument Report

Download full text

Anne H. Davidson – National Assessment Governing Board, 2025

The purpose of this National Assessment of Educational Progress (NAEP) Achievement Levels Validity Argument Report is to synthesize evidence currently available to address the validity of the interpretations and uses of the NAEP Achievement Levels. Validity is the extent to which theory and evidence supports or refutes proposed and enacted test…

Descriptors: National Competency Tests, Academic Achievement, Test Validity, College Entrance Examinations

Real-Life Applications of Competence-Based Test Development to the Construction, Improvement, and Shortening of Tests

Peer reviewed

Direct link

Pasquale Anselmi; Jürgen Heller; Luca Stefanutti; Egidio Robusto; Giulia Barillari – Education and Information Technologies, 2025

Competence-based test development (CbTD) is a novel method for constructing tests that are as informative as possible about the competence state (the set of skills an individual masters) underlying the item responses. If desired, the tests can also be minimal, meaning that no item can be eliminated without reducing their informativeness. To…

Descriptors: Competency Based Education, Test Construction, Test Length, Usability

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 1449

Educational and Psychological…	795
Journal of Psychoeducational…	381
ProQuest LLC	363
Psychology in the Schools	303
Journal of Autism and…	273
Measurement and Evaluation in…	211
Journal of Clinical Psychology	210
Journal of Consulting and…	195
Online Submission	179
Journal of Educational…	166
Psychological Assessment	162
Language Testing	156
Grantee Submission	140
Diagnostique	135
Journal of Vocational Behavior	125
Assessment for Effective…	122
Assessment	121
Journal of Counseling…	120
Journal of Learning…	117
Measurement in Physical…	111
Educational Measurement:…	107
International Journal of…	99
Journal of School Psychology	90
Autism: The International…	87
Measurement and Evaluation in…	86
More ▼

No Child Left Behind Act 2001	67
Individuals with Disabilities…	26
Elementary and Secondary…	25
Every Student Succeeds Act…	13
Debra P v Turlington	11
Civil Rights Act 1964 Title…	10
Elementary and Secondary…	8
Rehabilitation Act 1973…	7
Individuals with Disabilities…	6
Larry P v Riles	6
Race to the Top	6
Americans with Disabilities…	5
Education Consolidation…	4
Education for All Handicapped…	4
Elementary and Secondary…	4
Fourteenth Amendment	4
Bakke v Regents of University…	3
Comprehensive Education…	3
Individuals with Disabilities…	3
Individuals with Disabilities…	3
Lau v Nichols	3
Goals 2000	2
Head Start	2
United States Constitution	2
Aid to Families with…	1
More ▼

General Aptitude Test Battery	492
Wechsler Intelligence Scale…	344
SAT (College Admission Test)	197
Minnesota Multiphasic…	145
Test of English as a Foreign…	118
Peabody Picture Vocabulary…	116
Wechsler Adult Intelligence…	108
Stanford Binet Intelligence…	98
National Assessment of…	87
Graduate Record Examinations	77
Kaufman Assessment Battery…	76
ACT Assessment	74
Autism Diagnostic Observation…	65
Stanford Achievement Tests	61
Vineland Adaptive Behavior…	61
Child Behavior Checklist	58
Strengths and Difficulties…	57
National Teacher Examinations	55
Program for International…	54
Raven Progressive Matrices	52
Beck Depression Inventory	48
Self Directed Search	48
Iowa Tests of Basic Skills	47
Wide Range Achievement Test	44
Armed Services Vocational…	43
More ▼

Fraser, Barry J.	40
Michael, William B.	36
Thompson, Bruce	33
Marsh, Herbert W.	32
Tindal, Gerald	28
Hambleton, Ronald K.	27
Matson, Johnny L.	27
Reynolds, Cecil R.	27
Erford, Bradley T.	26
Stansfield, Charles W.	25
Epstein, Michael H.	23
Prediger, Dale J.	23
Popham, W. James	22
Ebel, Robert L.	21
Furlong, Michael J.	21
Linn, Robert L.	21
Baker, Eva L.	20
Elliott, Stephen N.	20
Hanna, Gerald S.	20
Watkins, Marley W.	20
Byrne, Barbara M.	18
Carver, Ronald P.	18
Kilgus, Stephen P.	18
More ▼