ERIC - Search Results

Publication Date

In 2025	4
Since 2024	11
Since 2021 (last 5 years)	26
Since 2016 (last 10 years)	55
Since 2006 (last 20 years)	92

Descriptor

Comparative Analysis	178
Test Construction	178
Test Items	178
Item Analysis	49
Foreign Countries	42
Difficulty Level	37
Test Format	35
Item Response Theory	34
Multiple Choice Tests	30
Computer Assisted Testing	28
Scores	28
Mathematics Tests	26
Test Reliability	26
Test Validity	25
Language Tests	24
Higher Education	23
Achievement Tests	22
English (Second Language)	22
Scoring	20
Statistical Analysis	20
Psychometrics	18
Science Tests	18
Second Language Learning	15
Evaluation Methods	14
Mathematical Models	14
More ▼

Publication Type

Reports - Research	107
Journal Articles	95
Speeches/Meeting Papers	41
Reports - Evaluative	38
Reports - Descriptive	13
Tests/Questionnaires	10
Numerical/Quantitative Data	7
Dissertations/Theses -…	5
Books	4
Guides - General	4
Collected Works - General	3
Guides - Non-Classroom	3
Information Analyses	3
Non-Print Media	2
Reference Materials - General	2
Reports - General	2
Guides - Classroom - Teacher	1
Opinion Papers	1
More ▼

Education Level

Higher Education	25
Postsecondary Education	24
Secondary Education	16
Elementary Education	14
Elementary Secondary Education	8
Intermediate Grades	6
Grade 4	4
High Schools	4
Grade 6	2
Grade 8	2
Middle Schools	2
Adult Education	1
Early Childhood Education	1
Grade 10	1
Grade 5	1
More ▼

Audience

Researchers	4
Practitioners	3
Administrators	1
Parents	1
Policymakers	1
Teachers	1

Location

Germany	3
Japan	3
Australia	2
Indonesia	2
Iran	2
United Kingdom (England)	2
Asia	1
Belgium	1
Canada	1
China	1
China (Beijing)	1
Colombia	1
Colorado	1
Colorado (Boulder)	1
Czech Republic	1
District of Columbia	1
Europe	1
Florida	1
Georgia	1
Greece	1
Hong Kong	1
Idaho	1
Illinois	1
Ireland	1
Ireland (Dublin)	1
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001	1
Race to the Top	1

Assessments and Surveys

Program for International…	9
National Assessment of…	6
Trends in International…	6
SAT (College Admission Test)	4
Test of English as a Foreign…	3
Progress in International…	2
ACT Assessment	1
Eysenck Personality Inventory	1
Graduate Record Examinations	1
International Association for…	1
International English…	1
Iowa Tests of Basic Skills	1
Michigan Test of English…	1
Minnesota Multiphasic…	1
School and College Ability…	1
Wechsler Adult Intelligence…	1
Woodcock Johnson Tests of…	1
More ▼

What Works Clearinghouse Rating

Showing 1 to 15 of 178 results Save | Export

Developing an MLA-Test for Young Learners -- Insights from Measurement Theory and Language Testing

Peer reviewed

Direct link

Kaja Haugen; Cecilie Hamnes Carlsen; Christine Möller-Omrani – Language Awareness, 2025

This article presents the process of constructing and validating a test of metalinguistic awareness (MLA) for young school children (age 8-10). The test was developed between 2021 and 2023 as part of the MetaLearn research project, financed by The Research Council of Norway. The research team defines MLA as using metalinguistic knowledge at a…

Descriptors: Language Tests, Test Construction, Elementary School Students, Metalinguistics

Comparison of Item Response Theory Ability and Item Parameters According to Classical and Bayesian Estimation Methods

Peer reviewed
PDF on ERIC

Download full text

Eray Selçuk; Ergül Demir – International Journal of Assessment Tools in Education, 2024

This research aims to compare the ability and item parameter estimations of Item Response Theory according to Maximum likelihood and Bayesian approaches in different Monte Carlo simulation conditions. For this purpose, depending on the changes in the priori distribution type, sample size, test length, and logistics model, the ability and item…

Descriptors: Item Response Theory, Item Analysis, Test Items, Simulation

Design Framework for the ACT® Enhancements. ACT Research. Research Report. R2519

Download full text

Jeff Allen; Jay Thomas; Stacy Dreyer; Scott Johanningmeier; Dana Murano; Ty Cruce; Xin Li; Edgar Sanchez – ACT Education Corp., 2025

This report describes the process of developing and validating the enhanced ACT. The report describes the changes made to the test content and the processes by which these design decisions were implemented. The authors describe how they shared the overall scope of the enhancements, including the initial blueprints, with external expert panels,…

Descriptors: College Entrance Examinations, Testing, Change, Test Construction

Generating Social and Emotional Skill Items: Humans vs. ChatGPT. ACT Research. Issue Brief

Download full text

Kate E. Walton; Cristina Anguiano-Carrasco – ACT, Inc., 2024

Large language models (LLMs), such as ChatGPT, are becoming increasingly prominent. Their use is becoming more and more popular to assist with simple tasks, such as summarizing documents, translating languages, rephrasing sentences, or answering questions. Reports like McKinsey's (Chui, & Yee, 2023) estimate that by implementing LLMs,…

Descriptors: Artificial Intelligence, Man Machine Systems, Natural Language Processing, Test Construction

Historical Perspectives on Score Comparability Issues Raised by Innovations in Testing

Peer reviewed

Direct link

Baldwin, Peter; Clauser, Brian E. – Journal of Educational Measurement, 2022

While score comparability across test forms typically relies on common (or randomly equivalent) examinees or items, innovations in item formats, test delivery, and efforts to extend the range of score interpretation may require a special data collection before examinees or items can be used in this way--or may be incompatible with common examinee…

Descriptors: Scoring, Testing, Test Items, Test Format

The Effect of Ratio of Items Indicating Differential Item Functioning on Computer Adaptive and Multi-Stage Tests

Peer reviewed
PDF on ERIC

Download full text

Erdem-Kara, Basak; Dogan, Nuri – International Journal of Assessment Tools in Education, 2022

Recently, adaptive test approaches have become a viable alternative to traditional fixed-item tests. The main advantage of adaptive tests is that they reach desired measurement precision with fewer items. However, fewer items mean that each item has a more significant effect on ability estimation and therefore those tests are open to more…

Descriptors: Item Analysis, Computer Assisted Testing, Test Items, Test Construction

A Comparative Study of AI-Human-Made and Human-Made Test Forms for a University TESOL Theory Course

Peer reviewed

Direct link

Kyung-Mi O. – Language Testing in Asia, 2024

This study examines the efficacy of artificial intelligence (AI) in creating parallel test items compared to human-made ones. Two test forms were developed: one consisting of 20 existing human-made items and another with 20 new items generated with ChatGPT assistance. Expert reviews confirmed the content parallelism of the two test forms.…

Descriptors: Comparative Analysis, Artificial Intelligence, Computer Software, Test Items

Content and Item Response Theory Analysis of ChatGPT-4-Generated Multiple-Choice Items

Peer reviewed

Direct link

Roger Young; Emily Courtney; Alexander Kah; Mariah Wilkerson; Yi-Hsin Chen – Teaching of Psychology, 2025

Background: Multiple-choice item (MCI) assessments are burdensome for instructors to develop. Artificial intelligence (AI, e.g., ChatGPT) can streamline the process without sacrificing quality. The quality of AI-generated MCIs and human experts is comparable. However, whether the quality of AI-generated MCIs is equally good across various domain-…

Descriptors: Item Response Theory, Multiple Choice Tests, Psychology, Textbooks

Standard Processes. Version 6.17.2020

Download full text

New Meridian Corporation, 2020

New Meridian Corporation has developed the "Quality Testing Standards and Criteria for Comparability Claims" (QTS). The goal of the QTS is to provide guidance to states that are interested in including content from the New Meridian item bank and intend to make comparability claims with "other assessments" that include New…

Descriptors: Testing, Standards, Comparative Analysis, Guidelines

A New Scoring Method for Item Response Theory Analysis of C-Tests

Peer reviewed

Direct link

Farshad Effatpanah; Purya Baghaei; Mona Tabatabaee-Yazdi; Esmat Babaii – Language Testing, 2025

This study aimed to propose a new method for scoring C-Tests as measures of general language proficiency. In this approach, the unit of analysis is sentences rather than gaps or passages. That is, the gaps correctly reformulated in each sentence were aggregated as sentence score, and then each sentence was entered into the analysis as a polytomous…

Descriptors: Item Response Theory, Language Tests, Test Items, Test Construction

Validity of Multiple-Choice Digital Formative Assessment for Assessing Students' (Mis)Conceptions: Evidence from a Mixed-Methods Study in Algebra

Peer reviewed

Direct link

Katrin Klingbeil; Fabian Rösken; Bärbel Barzel; Florian Schacht; Kaye Stacey; Vicki Steinle; Daniel Thurm – ZDM: Mathematics Education, 2024

Assessing students' (mis)conceptions is a challenging task for teachers as well as for researchers. While individual assessment, for example through interviews, can provide deep insights into students' thinking, this is very time-consuming and therefore not feasible for whole classes or even larger settings. For those settings, automatically…

Descriptors: Multiple Choice Tests, Formative Evaluation, Mathematics Tests, Misconceptions

ChatGPT-4o, ChatGPT-4 and Google Gemini are Compared with Students: A Study in Higher Education

Peer reviewed
PDF on ERIC

Download full text

Harun Bayer; Fazilet Gül Ince Araci; Gülsah Gürkan – International Journal of Technology in Education and Science, 2024

The rapid advancement of artificial intelligence technologies, their pervasive use in every field, and the growing understanding of the benefits they bring have led actors in the education sector to pursue research in this field. In particular, the use of artificial intelligence tools has become more prevalent in the education sector due to the…

Descriptors: Artificial Intelligence, Computer Software, Computational Linguistics, Technology Uses in Education

The Importance of Increased Processing Demands in the Design of Elicited Imitation Tests

Peer reviewed

Direct link

Rosemary Erlam; Lan Wei – Language Teaching Research, 2024

This study is a conceptual replication of Ellis' 'Measuring implicit and explicit knowledge of a second language: A psychometric study', published in "Studies in Second Language Acquisition" (2005), aiming to establish the importance of including belief statements (hypothesized to increase processing demands) in the design of Elicited…

Descriptors: Language Processing, Language Tests, Second Language Learning, Psychometrics

Measuring Language Ability of Students with Compensatory Multidimensional CAT: A Post-Hoc Simulation Study

Peer reviewed

Direct link

Ozdemir, Burhanettin; Gelbal, Selahattin – Education and Information Technologies, 2022

The computerized adaptive tests (CAT) apply an adaptive process in which the items are tailored to individuals' ability scores. The multidimensional CAT (MCAT) designs differ in terms of different item selection, ability estimation, and termination methods being used. This study aims at investigating the performance of the MCAT designs used to…

Descriptors: Scores, Computer Assisted Testing, Test Items, Language Proficiency

Student Performance and Exam Quality in Student- versus Instructor-Created Exams in Human Physiology

Peer reviewed
PDF on ERIC

Download full text

Laura S. Kabiri; Catherine R. Barber; Thomas M. McCabe; Augusto X. Rodriguez – HAPS Educator, 2024

Multiple-choice questions (MCQs) are commonly used in undergraduate introductory science, technology, engineering, and mathematics (STEM) courses, and substantial evidence supports the use of student-created questions to promote learning. However, research on student-created MCQ exams as an assessment method is more limited, and no studies have…

Descriptors: Physiology, Science Tests, Student Developed Materials, Test Construction

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12

Journal of Educational…	11
Applied Psychological…	6
ETS Research Report Series	6
Educational and Psychological…	5
ProQuest LLC	5
Educational Measurement:…	4
New Meridian Corporation	4
Assessment & Evaluation in…	3
Language Assessment Quarterly	3
Language Testing	3
Applied Measurement in…	2
CBE - Life Sciences Education	2
College Board	2
International Journal of…	2
Journal of Technology,…	2
Online Submission	2
Teaching of Psychology	2
ACT Education Corp.	1
ACT, Inc.	1
American Journal of…	1
Assessment and Evaluation in…	1
Bilingual Research Journal	1
Cambridge Assessment	1
College Student Journal	1
Contemporary Educational…	1
More ▼

Benson, Jeri	4
Hambleton, Ronald K.	3
Subkoviak, Michael J.	3
Wu, Margaret	3
Douglass, James B.	2
Haladyna, Tom	2
Kenney, Patricia Ann	2
Kromrey, Jeffrey D.	2
McKinley, Robert L.	2
O'Neal, Marcia R.	2
Ponsoda, Vicente	2
Roid, Gale	2
Silver, Edward A.	2
Sireci, Stephen G.	2
von Davier, Matthias	2
Aaron Smith	1
Abad, Francisco Jose	1
Alacaci, Cengiz	1
Alexander Gerd Maier	1
Alexander Kah	1
Aleyna Altan	1
Ali, Usama S.	1
Allen, Nancy L.	1
Alqarni, Abdulelah Mohammed	1
More ▼