ERIC - Search Results

Publication Date

In 2025	10
Since 2024	33
Since 2021 (last 5 years)	142
Since 2016 (last 10 years)	349
Since 2006 (last 20 years)	683

Descriptor

Comparative Analysis	1021
Test Items	1021
Foreign Countries	281
Item Response Theory	264
Item Analysis	229
Difficulty Level	215
Scores	196
Test Construction	178
Statistical Analysis	167
Test Format	143
Multiple Choice Tests	138
Computer Assisted Testing	133
Test Bias	124
Correlation	118
Language Tests	117
Mathematics Tests	115
Simulation	112
English (Second Language)	109
Achievement Tests	107
Second Language Learning	100
Models	96
Test Reliability	89
College Students	81
Scoring	81
Test Validity	80
More ▼

Education Level

Higher Education	177
Postsecondary Education	148
Secondary Education	109
Elementary Education	80
Elementary Secondary Education	54
High Schools	47
Middle Schools	39
Grade 8	38
Junior High Schools	28
Grade 4	25
Intermediate Grades	22
Early Childhood Education	17
Grade 5	17
Grade 7	15
Grade 3	13
Grade 6	13
Primary Education	12
Grade 9	11
Grade 10	7
Grade 11	7
Grade 12	7
Adult Education	4
Kindergarten	3
Grade 2	2
High School Equivalency…	1
More ▼

Audience

Researchers	17
Practitioners	5
Teachers	4
Administrators	3
Parents	1
Policymakers	1
Students	1

Location

United States	21
Turkey	20
Germany	19
Japan	19
Canada	17
Australia	16
China	15
Taiwan	13
Indonesia	11
South Korea	9
United Kingdom (England)	9
Iran	8
Israel	7
Hong Kong	6
Norway	6
California	5
Czech Republic	5
Massachusetts	5
Netherlands	5
New Jersey	5
New York	5
North Carolina	5
Ohio	5
Russia	5
South Africa	5
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001	4
Race to the Top	1

What Works Clearinghouse Rating

Showing 1 to 15 of 1,021 results Save | Export

A Comparison of Yen's Q3 Coefficient and Rasch Testlet Modeling for Identifying Local Item Dependence: Evidence from Two Vocabulary Matching Tests

Peer reviewed

Direct link

Hung Tan Ha; Duyen Thi Bich Nguyen; Tim Stoeckel – Language Assessment Quarterly, 2025

This article compares two methods for detecting local item dependence (LID): residual correlation examination and Rasch testlet modeling (RTM), in a commonly used 3:6 matching format and an extended matching test (EMT) format. The two formats are hypothesized to facilitate different levels of item dependency due to differences in the number of…

Descriptors: Comparative Analysis, Language Tests, Test Items, Item Analysis

A New Scoring Method for Item Response Theory Analysis of C-Tests

Peer reviewed

Direct link

Farshad Effatpanah; Purya Baghaei; Mona Tabatabaee-Yazdi; Esmat Babaii – Language Testing, 2025

This study aimed to propose a new method for scoring C-Tests as measures of general language proficiency. In this approach, the unit of analysis is sentences rather than gaps or passages. That is, the gaps correctly reformulated in each sentence were aggregated as sentence score, and then each sentence was entered into the analysis as a polytomous…

Descriptors: Item Response Theory, Language Tests, Test Items, Test Construction

Design Framework for the ACT® Enhancements. ACT Research. Research Report. R2519

Download full text

Jeff Allen; Jay Thomas; Stacy Dreyer; Scott Johanningmeier; Dana Murano; Ty Cruce; Xin Li; Edgar Sanchez – ACT Education Corp., 2025

This report describes the process of developing and validating the enhanced ACT. The report describes the changes made to the test content and the processes by which these design decisions were implemented. The authors describe how they shared the overall scope of the enhancements, including the initial blueprints, with external expert panels,…

Descriptors: College Entrance Examinations, Testing, Change, Test Construction

Developing an MLA-Test for Young Learners -- Insights from Measurement Theory and Language Testing

Peer reviewed

Direct link

Kaja Haugen; Cecilie Hamnes Carlsen; Christine Möller-Omrani – Language Awareness, 2025

This article presents the process of constructing and validating a test of metalinguistic awareness (MLA) for young school children (age 8-10). The test was developed between 2021 and 2023 as part of the MetaLearn research project, financed by The Research Council of Norway. The research team defines MLA as using metalinguistic knowledge at a…

Descriptors: Language Tests, Test Construction, Elementary School Students, Metalinguistics

Comparing and Combining IRTree Models and Anchoring Vignettes in Addressing Response Styles

Peer reviewed

Direct link

Mingfeng Xue; Ping Chen – Journal of Educational Measurement, 2025

Response styles pose great threats to psychological measurements. This research compares IRTree models and anchoring vignettes in addressing response styles and estimating the target traits. It also explores the potential of combining them at the item level and total-score level (ratios of extreme and middle responses to vignettes). Four models…

Descriptors: Item Response Theory, Models, Comparative Analysis, Vignettes

Cognitive Diagnosis Testlet Model for Multiple-Choice Items

Peer reviewed

Direct link

Lei Guo; Wenjie Zhou; Xiao Li – Journal of Educational and Behavioral Statistics, 2024

The testlet design is very popular in educational and psychological assessments. This article proposes a new cognitive diagnosis model, the multiple-choice cognitive diagnostic testlet (MC-CDT) model for tests using testlets consisting of MC items. The MC-CDT model uses the original examinees' responses to MC items instead of dichotomously scored…

Descriptors: Multiple Choice Tests, Diagnostic Tests, Accuracy, Computer Software

Validity of Multiple-Choice Digital Formative Assessment for Assessing Students' (Mis)Conceptions: Evidence from a Mixed-Methods Study in Algebra

Peer reviewed

Direct link

Katrin Klingbeil; Fabian Rösken; Bärbel Barzel; Florian Schacht; Kaye Stacey; Vicki Steinle; Daniel Thurm – ZDM: Mathematics Education, 2024

Assessing students' (mis)conceptions is a challenging task for teachers as well as for researchers. While individual assessment, for example through interviews, can provide deep insights into students' thinking, this is very time-consuming and therefore not feasible for whole classes or even larger settings. For those settings, automatically…

Descriptors: Multiple Choice Tests, Formative Evaluation, Mathematics Tests, Misconceptions

Generating Social and Emotional Skill Items: Humans vs. ChatGPT. ACT Research. Issue Brief

Download full text

Kate E. Walton; Cristina Anguiano-Carrasco – ACT, Inc., 2024

Large language models (LLMs), such as ChatGPT, are becoming increasingly prominent. Their use is becoming more and more popular to assist with simple tasks, such as summarizing documents, translating languages, rephrasing sentences, or answering questions. Reports like McKinsey's (Chui, & Yee, 2023) estimate that by implementing LLMs,…

Descriptors: Artificial Intelligence, Man Machine Systems, Natural Language Processing, Test Construction

Gender Bias in Test Item Formats: Evidence from PISA 2009, 2012, and 2015 Math and Reading Tests

Peer reviewed

Direct link

Shear, Benjamin R. – Journal of Educational Measurement, 2023

Large-scale standardized tests are regularly used to measure student achievement overall and for student subgroups. These uses assume tests provide comparable measures of outcomes across student subgroups, but prior research suggests score comparisons across gender groups may be complicated by the type of test items used. This paper presents…

Descriptors: Gender Bias, Item Analysis, Test Items, Achievement Tests

Comparison of Item Response Theory Ability and Item Parameters According to Classical and Bayesian Estimation Methods

Peer reviewed
PDF on ERIC

Download full text

Eray Selçuk; Ergül Demir – International Journal of Assessment Tools in Education, 2024

This research aims to compare the ability and item parameter estimations of Item Response Theory according to Maximum likelihood and Bayesian approaches in different Monte Carlo simulation conditions. For this purpose, depending on the changes in the priori distribution type, sample size, test length, and logistics model, the ability and item…

Descriptors: Item Response Theory, Item Analysis, Test Items, Simulation

IRT Linking Methods for the Bifactor Model with Mixed Format Tests

Peer reviewed

Direct link

Sohee Kim; Ki Lynn Cole – International Journal of Testing, 2025

This study conducted a comprehensive comparison of Item Response Theory (IRT) linking methods applied to a bifactor model, examining their performance on both multiple choice (MC) and mixed format tests within the common item nonequivalent group design framework. Four distinct multidimensional IRT linking approaches were explored, consisting of…

Descriptors: Item Response Theory, Comparative Analysis, Models, Item Analysis

Developing the Diagnostic Test of Misconceptions of Fractions

Peer reviewed
PDF on ERIC

Download full text

Aleyna Altan; Zehra Taspinar Sener – Online Submission, 2023

This research aimed to develop a valid and reliable test to be used to detect sixth grade students' misconceptions and errors regarding the subject of fractions. A misconception diagnostic test has been developed that includes the concept of fractions, different representations of fractions, ordering and comparing fractions, equivalence of…

Descriptors: Diagnostic Tests, Mathematics Tests, Fractions, Misconceptions

The Enhanced ACT Linking Study Report. ACT Research. Research Paper. R2515

Download full text

Dongmei Li; Shalini Kapoor; Ann Arthur; Chi-Yu Huang; YoungWoo Cho; Chen Qiu; Hongling Wang – ACT Education Corp., 2025

Starting in April 2025, ACT will introduce enhanced forms of the ACT® test for national online testing, with a full rollout to all paper and online test takers in national, state and district, and international test administrations by Spring 2026. ACT introduced major updates by changing the test lengths and testing times, providing more time per…

Descriptors: College Entrance Examinations, Testing, Change, Scoring

Reliability and Validity of Methods to Assess Undergraduate Healthcare Student Performance in Pharmacology: Comparison of Open Book versus Time-Limited Closed Book Examinations

Peer reviewed
PDF on ERIC

Download full text

David Bell; Vikki O'Neill; Vivienne Crawford – Practitioner Research in Higher Education, 2023

We compared the influence of open-book extended duration versus closed book time-limited format on reliability and validity of written assessments of pharmacology learning outcomes within our medical and dental courses. Our dental cohort undertake a mid-year test (30xfree-response short answer to a question, SAQ) and end-of-year paper (4xSAQ,…

Descriptors: Undergraduate Students, Pharmacology, Pharmaceutical Education, Test Format

ChatGPT-4o, ChatGPT-4 and Google Gemini are Compared with Students: A Study in Higher Education

Peer reviewed
PDF on ERIC

Download full text

Harun Bayer; Fazilet Gül Ince Araci; Gülsah Gürkan – International Journal of Technology in Education and Science, 2024

The rapid advancement of artificial intelligence technologies, their pervasive use in every field, and the growing understanding of the benefits they bring have led actors in the education sector to pursue research in this field. In particular, the use of artificial intelligence tools has become more prevalent in the education sector due to the…

Descriptors: Artificial Intelligence, Computer Software, Computational Linguistics, Technology Uses in Education

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 69

Educational and Psychological…	60
Journal of Educational…	54
ProQuest LLC	49
ETS Research Report Series	43
Applied Psychological…	35
Applied Measurement in…	31
Language Testing	19
International Journal of…	18
Journal of Educational and…	11
Online Submission	10
Grantee Submission	9
Assessment & Evaluation in…	8
Educational Measurement:…	8
International Journal of…	8
Language Assessment Quarterly	8
Educational Research and…	7
College Entrance Examination…	6
Educational Assessment	6
Eurasian Journal of…	6
International Journal of…	6
Language Testing in Asia	6
Practical Assessment,…	6
College Board	5
International Association for…	5
Journal of Experimental…	5
More ▼

Chang, Hua-Hua	10
Kim, Sooyeon	10
Dodd, Barbara G.	8
Hambleton, Ronald K.	8
von Davier, Alina A.	8
Liu, Jinghua	7
Cohen, Allan S.	6
Holland, Paul W.	6
Kim, Seock-Ho	6
Sinharay, Sandip	6
Finch, Holmes	5
Lee, Won-Chan	5
McKinley, Robert L.	5
Reckase, Mark D.	5
Zhang, Jinming	5
von Davier, Matthias	5
Benson, Jeri	4
Bridgeman, Brent	4
DeMars, Christine E.	4
Ercikan, Kadriye	4
He, Wei	4
Hsu, Tse-Chi	4
Jin, Ying	4
McClellan, Catherine	4
More ▼

Reports - Research	714
Journal Articles	689
Reports - Evaluative	183
Speeches/Meeting Papers	147
Dissertations/Theses -…	50
Tests/Questionnaires	46
Reports - Descriptive	43
Numerical/Quantitative Data	24
Information Analyses	12
Guides - Non-Classroom	7
Books	6
Collected Works - General	6
Guides - General	4
Opinion Papers	3
Collected Works - Serials	2
Guides - Classroom - Learner	2
Non-Print Media	2
Reference Materials - General	2
Reports - General	2
Collected Works - Proceedings	1
Guides - Classroom - Teacher	1
Historical Materials	1
More ▼

Program for International…	40
Trends in International…	33
SAT (College Admission Test)	26
National Assessment of…	18
Test of English as a Foreign…	18
Progress in International…	9
ACT Assessment	7
Graduate Record Examinations	7
Advanced Placement…	5
Iowa Tests of Basic Skills	5
California Achievement Tests	4
Comprehensive Tests of Basic…	4
Stanford Achievement Tests	4
Test of English for…	4
International English…	3
Law School Admission Test	3
Measures of Academic Progress	3
Peabody Picture Vocabulary…	3
Raven Progressive Matrices	3
Wechsler Intelligence Scale…	3
Armed Services Vocational…	2
Beck Depression Inventory	2
Digit Span Test	2
Graduate Management Admission…	2
International Association for…	2
More ▼