ERIC - Search Results

Publication Date

In 2026	0
Since 2025	200
Since 2022 (last 5 years)	1070
Since 2017 (last 10 years)	2580
Since 2007 (last 20 years)	4941

Descriptor

Test Items	9533
Test Construction	2717
Foreign Countries	2181
Item Response Theory	1868
Difficulty Level	1620
Item Analysis	1501
Test Validity	1415
Test Reliability	1186
Multiple Choice Tests	1156
Scores	1136
Computer Assisted Testing	1057
Comparative Analysis	1024
Test Format	956
Higher Education	877
Achievement Tests	854
Statistical Analysis	850
Mathematics Tests	845
Psychometrics	832
Test Bias	770
Models	753
Student Evaluation	736
Language Tests	699
Correlation	695
Evaluation Methods	674
Scoring	633

Author

van der Linden, Wim J.	69
Tindal, Gerald	50
Hambleton, Ronald K.	45
Alonzo, Julie	41
Chang, Hua-Hua	40
Plake, Barbara S.	40
Sinharay, Sandip	37
Reckase, Mark D.	36
Wainer, Howard	33
Dorans, Neil J.	32
Gierl, Mark J.	30
Sireci, Stephen G.	28
Wang, Wen-Chung	26
Cohen, Allan S.	25
Meijer, Rob R.	25
Samejima, Fumiko	24
Stocking, Martha L.	24
Anderson, Daniel	23
Zwick, Rebecca	23
Veldkamp, Bernard P.	22
Haladyna, Thomas M.	21
Kim, Seock-Ho	21
Wise, Steven L.	21
Kim, Sooyeon	20

Education Level

Higher Education	1310
Postsecondary Education	1060
Secondary Education	925
Elementary Education	715
Middle Schools	419
High Schools	362
Elementary Secondary Education	358
Junior High Schools	319
Grade 8	255
Intermediate Grades	209
Grade 4	183
Early Childhood Education	177
Grade 5	134
Primary Education	126
Grade 7	113
Grade 3	111
Grade 6	107
Grade 9	68
Grade 2	56
Grade 10	52
Grade 12	52
Kindergarten	50
Adult Education	39
Grade 11	37
Grade 1	36

Audience

Practitioners	653
Teachers	563
Researchers	250
Students	201
Administrators	81
Policymakers	22
Parents	17
Counselors	8
Community	7
Support Staff	3
Media Staff	1

Location

Turkey	225
Canada	223
Australia	155
Germany	116
United States	99
China	90
Florida	86
Indonesia	82
Taiwan	78
United Kingdom	73
California	65
Japan	65
Netherlands	64
Iran	62
United Kingdom (England)	57
South Africa	48
Missouri	45
New York	45
Oklahoma	44
South Korea	44
Malaysia	42
Texas	42
Israel	37
Singapore	37
Sweden	37

What Works Clearinghouse Rating

Meets WWC Standards without Reservations	4
Meets WWC Standards with or without Reservations	4
Does not meet standards	1

Showing 331 to 345 of 9,533 results

Assessment of Large Language Models' Performances and Hallucinations for Chinese Postgraduate Medical Entrance Examination

Peer reviewed

Hongfei Ye; Jian Xu; Danqing Huang; Meng Xie; Jinming Guo; Junrui Yang; Haiwei Bao; Mingzhi Zhang; Ce Zheng – Discover Education, 2025

This study evaluates the performance of large language models (LLMs) on the Chinese Postgraduate Medical Entrance Examination (CPGMEE), as well as the hallucinations they produce, and investigates their implications for medical education. We curated 10 trials of mock CPGMEE to evaluate the performances of 4 LLMs (GPT-4.0, ChatGPT, QWen 2.1 and Ernie 4.0).…

Descriptors: College Entrance Examinations, Foreign Countries, Computational Linguistics, Graduate Medical Education

Before (E)Valuating: Student Testing in History and Engineering

Peer reviewed

Herbert Kalthoff; Fabian Koelsch – British Journal of Sociology of Education, 2025

University examinations categorise students according to their individual achievements determined by teaching staff. This procedure serves the elicitation and certification of student knowledge and thus reproduces academic hierarchies. Drawing on empirical evidence from ethnographic fieldwork in Engineering and History departments, this article…

Descriptors: College Students, Student Evaluation, Testing, History Instruction

Instruction-Tuned Large-Language Models for Quality Control in Automatic Item Generation: A Feasibility Study

Peer reviewed

Guher Gorgun; Okan Bulut – Educational Measurement: Issues and Practice, 2025

Automatic item generation may supply many items instantly and efficiently to assessment and learning environments. Yet the evaluation of item quality remains a bottleneck for deploying generated items in learning and assessment settings. In this study, we investigated the utility of using large-language models, specifically Llama 3-8B, for…

Descriptors: Artificial Intelligence, Quality Control, Technology Uses in Education, Automation

Argument-Based Validation of Chulalongkorn University Language Institute (CULI) Test: A Rasch-Based Evidence Investigation

Peer reviewed

Apichat Khamboonruang – Language Testing in Asia, 2025

The Chulalongkorn University Language Institute (CULI) test was developed as a local standardised test of English for professional and international communication. To ensure that the CULI test fulfils its intended purposes, this study employed Kane's argument-based validation and Rasch measurement approaches to construct the validity argument for the…

Descriptors: Universities, Second Language Learning, Second Language Instruction, Language Tests

Empirical Evidence of Students' Systems Thinking Skills in ESD-Oriented: A Rasch Analysis Approach

Peer reviewed
PDF on ERIC

Ikmanisa Khairati; L. Lufri; Muhyiatul Fadilah – Journal of Biological Education Indonesia (Jurnal Pendidikan Biologi Indonesia), 2025

Education for Sustainable Development (ESD) serves as a key accelerator for achieving the Sustainable Development Goals (SDGs), emphasizing systems thinking as an essential competency that must be cultivated in the learning process. This study investigates students' systems thinking skills within the ESD framework through assessments on…

Descriptors: Systems Approach, Thinking Skills, Sustainable Development, Biology

The Effects of Open-Ended Probes on Closed Survey Questions in Web Surveys

Peer reviewed

Patricia Hadler – Sociological Methods & Research, 2025

Probes are follow-ups to survey questions used to gain insights on respondents' understanding of and responses to these questions. They are usually administered as open-ended questions, primarily in the context of questionnaire pretesting. Due to the decreased cost of data collection for open-ended questions in web surveys, researchers have argued…

Descriptors: Online Surveys, Discovery Processes, Test Items, Data Collection

Design Framework for the ACT® Enhancements. ACT Research. Research Report. R2519

Jeff Allen; Jay Thomas; Stacy Dreyer; Scott Johanningmeier; Dana Murano; Ty Cruce; Xin Li; Edgar Sanchez – ACT Education Corp., 2025

This report describes the process of developing and validating the enhanced ACT. The report describes the changes made to the test content and the processes by which these design decisions were implemented. The authors describe how they shared the overall scope of the enhancements, including the initial blueprints, with external expert panels,…

Descriptors: College Entrance Examinations, Testing, Change, Test Construction

Structural Accommodations in Middle School Mathematics: Implications for Lessening the Achievement Gap for English Learners

Peer reviewed
PDF on ERIC

Albert M. Jimenez; Nicholas Clegorne; Sheryl Croft; David G. Buckman – Educational Planning, 2025

This quantitative study was designed to determine whether the use of graphical aids in standardized mathematics testing is effective in lessening the achievement gap between English Language Learner (ELL) students and their non-ELL counterparts for middle-grade aged students. The data used for this study include data from 2,659 students and come…

Descriptors: Middle School Students, Mathematics Instruction, Mathematics Achievement, English Learners

Factor Analysis and Item Reduction of the Tromsø Social Intelligence Scale (TSIS) in a Sample Peruvian

Peer reviewed

José Ventura-León; Cristopher Lino-Cruz; Shirley Tocto-Muñoz; Andy Rick Sánchez-Villena – Journal of Psychoeducational Assessment, 2025

Academic and occupational success requires social intelligence, the ability to comprehend and manage interpersonal connections. This research aims to assess and improve the Tromsø Social Intelligence Scale (TSIS) for Peruvian university students, focusing on cultural adaptability, reliability, and validity. Participants included 973 university…

Descriptors: Factor Analysis, Intelligence Tests, Test Items, Test Length

Item Block Position and Format Effects in e-TIMSS among the Low- and High-Achieving Countries

Peer reviewed
PDF on ERIC

Nese Öztürk Gübes – International Journal of Assessment Tools in Education, 2025

The Trends in International Mathematics and Science Study (TIMSS) was administered via computer, eTIMSS, for the first time in 2019. The purpose of this study was to investigate item block position and item format effect on eighth grade mathematics item easiness in low- and high-achieving countries of eTIMSS 2019. Item responses from Chile, Qatar,…

Descriptors: Foreign Countries, International Assessment, Achievement Tests, Mathematics Achievement

A Highly Adaptive Testing Design for PISA

Peer reviewed

Andreas Frey; Christoph König; Aron Fink – Journal of Educational Measurement, 2025

The highly adaptive testing (HAT) design is introduced as an alternative test design for the Programme for International Student Assessment (PISA). The principle of HAT is to be as adaptive as possible when selecting items while accounting for PISA's nonstatistical constraints and addressing issues concerning PISA such as item position effects.…

Descriptors: Adaptive Testing, Test Construction, Alternative Assessment, Achievement Tests

Robustness of Item Response Theory Models under the PISA Multistage Adaptive Testing Designs

Peer reviewed

Hyo Jeong Shin; Christoph König; Frederic Robin; Andreas Frey; Kentaro Yamamoto – Journal of Educational Measurement, 2025

Many international large-scale assessments (ILSAs) have switched to multistage adaptive testing (MST) designs to improve measurement efficiency in measuring the skills of the heterogeneous populations around the world. In this context, previous literature has reported the acceptable level of model parameter recovery under the MST designs when the…

Descriptors: Robustness (Statistics), Item Response Theory, Adaptive Testing, Test Construction

Utilizing Response Time for Item Selection in On-the-Fly Multistage Adaptive Testing for PISA Assessment

Peer reviewed

Xiuxiu Tang; Yi Zheng; Tong Wu; Kit-Tai Hau; Hua-Hua Chang – Journal of Educational Measurement, 2025

Multistage adaptive testing (MST) has recently been adopted for international large-scale assessments such as the Programme for International Student Assessment (PISA). MST offers improved measurement efficiency over traditional nonadaptive tests and improved practical convenience over single-item-adaptive computerized adaptive testing (CAT). As a…

Descriptors: Reaction Time, Test Items, Achievement Tests, Foreign Countries

TOEFL iBT® Technical Manual. TOEFL® Research Series. RR-106. ETS Research Report. RR-25-12

Peer reviewed
PDF on ERIC

Venessa F. Manna; Shuhong Li; Spiros Papageorgiou; Lixiong Gu – ETS Research Report Series, 2025

This technical manual describes the purpose and intended uses of the TOEFL iBT test, its target test-taker population, and relevant language use domains. The test design and scoring procedures are presented first, followed by a research agenda intended to support the interpretation and use of test scores. Given the updates to the test starting…

Descriptors: Second Language Learning, English (Second Language), Language Tests, Test Construction

Comparison of Threshold Identification Methods for Response Time Effort on Computerized Test Items

Peer reviewed

Militsa G. Ivanova; Hanna Eklöf; Michalis P. Michaelides – Journal of Applied Testing Technology, 2025

Digital administration of assessments allows for the collection of process data indices, such as response time, which can serve as indicators of rapid-guessing and examinee test-taking effort. Setting a time threshold is essential to distinguish effortful from effortless behavior using item response times. Threshold identification methods may…

Descriptors: Test Items, Computer Assisted Testing, Reaction Time, Achievement Tests

Source

Educational and Psychological…	416
Journal of Educational…	359
ProQuest LLC	246
Applied Psychological…	234
Applied Measurement in…	231
ETS Research Report Series	146
Educational Measurement:…	128
Journal of Educational and…	122
Online Submission	115
International Journal of…	105
Grantee Submission	98
Language Testing	93
Psychometrika	93
International Journal of…	79
Journal of Psychoeducational…	72
Educational Assessment	70
Measurement:…	57
Practical Assessment,…	56
Language Assessment Quarterly	55
Journal of Chemical Education	54
Behavioral Research and…	50
Journal of Experimental…	45
Physical Review Physics…	38
Journal of Experimental…	36
International Journal of…	35

Publication Type

Journal Articles	5869
Reports - Research	5578
Reports - Evaluative	1556
Speeches/Meeting Papers	1168
Reports - Descriptive	796
Tests/Questionnaires	768
Guides - Classroom - Teacher	472
Guides - Non-Classroom	259
Dissertations/Theses -…	251
Numerical/Quantitative Data	185
Information Analyses	179
Opinion Papers	164
Guides - Classroom - Learner	162
Books	54
Collected Works - General	33
Multilingual/Bilingual…	32
Guides - General	31
Reports - General	21
Book/Product Reviews	20
ERIC Publications	20
Non-Print Media	16
ERIC Digests in Full Text	14
Collected Works - Proceedings	13
Reference Materials - General	13
Collected Works - Serials	12

Laws, Policies, & Programs

No Child Left Behind Act 2001	36
Individuals with Disabilities…	20
Every Student Succeeds Act…	5
Elementary and Secondary…	4
Race to the Top	4
Rehabilitation Act 1973…	4
Elementary and Secondary…	3
Head Start	3
Americans with Disabilities…	2
Comprehensive Education…	2
Higher Education Act…	2
Immigration Reform and…	2
Civil Rights Act 1964	1
Civil Rights Act 1964 Title…	1
Comprehensive Employment and…	1
Education Consolidation…	1
Education for All Handicapped…	1
Fair Labor Standards Act	1
Higher Education Act Title II	1
Higher Education Opportunity…	1
Improving Americas Schools…	1
Individuals with Disabilities…	1
Jeanne Clery Disclosure of…	1
Job Training Partnership Act…	1
Kentucky Education Reform Act…	1

Assessments and Surveys

National Assessment of…	182
Program for International…	178
SAT (College Admission Test)	137
Trends in International…	114
Test of English as a Foreign…	85
Graduate Record Examinations	74
ACT Assessment	44
Advanced Placement…	34
Texas Educational Assessment…	32
Law School Admission Test	30
Wechsler Intelligence Scale…	26
Iowa Tests of Basic Skills	25
Progress in International…	25
Stanford Achievement Tests	24
Raven Progressive Matrices	22
Armed Services Vocational…	20
International English…	20
Peabody Picture Vocabulary…	20
California Achievement Tests	18
Comprehensive Tests of Basic…	18
Test of English for…	17
Metropolitan Achievement Tests	15
General Educational…	14
Graduate Management Admission…	14
Wechsler Adult Intelligence…	13