Showing 1 to 15 of 86 results
Peer reviewed
PDF on ERIC: Download full text
Eray Selçuk; Ergül Demir – International Journal of Assessment Tools in Education, 2024
This research aims to compare the ability and item parameter estimates of Item Response Theory under maximum-likelihood and Bayesian approaches in different Monte Carlo simulation conditions. For this purpose, depending on changes in the prior distribution type, sample size, test length, and logistic model, the ability and item…
Descriptors: Item Response Theory, Item Analysis, Test Items, Simulation
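A minimal sketch of the contrast the study examines, with hypothetical item parameters and simulated responses (none of the values come from the article): maximum-likelihood versus Bayesian (MAP, standard-normal prior) ability estimation under a 2PL model.

import numpy as np
from scipy.optimize import minimize_scalar

rng = np.random.default_rng(0)
a = rng.uniform(0.8, 2.0, 20)   # hypothetical discrimination parameters
b = rng.normal(0.0, 1.0, 20)    # hypothetical difficulty parameters
theta_true = 0.5
prob = 1 / (1 + np.exp(-a * (theta_true - b)))
x = rng.binomial(1, prob)       # simulated responses to 20 items

def neg_log_lik(theta):
    p = 1 / (1 + np.exp(-a * (theta - b)))
    return -np.sum(x * np.log(p) + (1 - x) * np.log(1 - p))

# ML: maximize the likelihood alone.
theta_ml = minimize_scalar(neg_log_lik, bounds=(-4, 4), method="bounded").x
# Bayesian (MAP): add a standard-normal prior on ability.
theta_map = minimize_scalar(lambda t: neg_log_lik(t) + 0.5 * t**2,
                            bounds=(-4, 4), method="bounded").x
print(theta_ml, theta_map)      # the MAP estimate is pulled toward the prior mean of 0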
Peer reviewed
PDF on ERIC: Download full text
Erdem-Kara, Basak; Dogan, Nuri – International Journal of Assessment Tools in Education, 2022
Recently, adaptive test approaches have become a viable alternative to traditional fixed-item tests. The main advantage of adaptive tests is that they reach the desired measurement precision with fewer items. However, fewer items mean that each item has a greater effect on ability estimation, and therefore such tests are open to more…
Descriptors: Item Analysis, Computer Assisted Testing, Test Items, Test Construction
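For concreteness, a minimal sketch (an assumption about the general mechanism, not this paper's procedure) of one adaptive-testing step: the next item administered is the unused item with maximum Fisher information at the current ability estimate, which is why each item carries more weight than in a fixed-form test.

import numpy as np

def fisher_info(theta, a, b):
    # 2PL item information: a^2 * p * (1 - p)
    p = 1 / (1 + np.exp(-a * (theta - b)))
    return a ** 2 * p * (1 - p)

a = np.array([1.2, 0.7, 1.8, 1.0])   # hypothetical discriminations
b = np.array([-1.0, 0.0, 0.3, 1.5])  # hypothetical difficulties
administered = {0}
theta_hat = 0.2                      # current ability estimate
candidates = [i for i in range(len(a)) if i not in administered]
next_item = max(candidates, key=lambda i: fisher_info(theta_hat, a[i], b[i]))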
Peer reviewed
PDF on ERIC: Download full text
Kolarec, Biserka; Nincevic, Marina – International Society for Technology, Education, and Science, 2022
The object of this research is a statistics exam that contains problem tasks. One examiner evaluated the exam repeatedly using two evaluation methods, with the goal of comparing the methods for objectivity. One of the two methods we call the serial evaluation method. The serial evaluation method assumes evaluation of all exam…
Descriptors: Statistics Education, Mathematics Tests, Evaluation Methods, Test Construction
Peer reviewed
Direct link
Roger Young; Emily Courtney; Alexander Kah; Mariah Wilkerson; Yi-Hsin Chen – Teaching of Psychology, 2025
Background: Multiple-choice item (MCI) assessments are burdensome for instructors to develop. Artificial intelligence (AI, e.g., ChatGPT) can streamline the process without sacrificing quality: the quality of AI-generated MCIs is comparable to that of MCIs written by human experts. However, whether the quality of AI-generated MCIs is equally good across various domain-…
Descriptors: Item Response Theory, Multiple Choice Tests, Psychology, Textbooks
Peer reviewed
Direct link
Farshad Effatpanah; Purya Baghaei; Mona Tabatabaee-Yazdi; Esmat Babaii – Language Testing, 2025
This study aimed to propose a new method for scoring C-Tests as measures of general language proficiency. In this approach, the unit of analysis is the sentence rather than the gap or the passage. That is, the gaps correctly reformulated in each sentence were aggregated into a sentence score, and then each sentence was entered into the analysis as a polytomous…
Descriptors: Item Response Theory, Language Tests, Test Items, Test Construction
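A minimal sketch of the scoring idea as described: gap scores are summed within each sentence, so each sentence becomes one polytomous item. The data layout below is invented for illustration.

from collections import defaultdict

# (sentence_id, gap_correct) pairs for one test taker -- hypothetical data
gap_results = [(0, 1), (0, 1), (0, 0), (1, 1), (1, 0), (2, 1)]
sentence_scores = defaultdict(int)
for sent_id, correct in gap_results:
    sentence_scores[sent_id] += correct  # polytomous score per sentence
print(dict(sentence_scores))             # {0: 2, 1: 1, 2: 1}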
Peer reviewed
Direct link
Katrin Klingbeil; Fabian Rösken; Bärbel Barzel; Florian Schacht; Kaye Stacey; Vicki Steinle; Daniel Thurm – ZDM: Mathematics Education, 2024
Assessing students' (mis)conceptions is a challenging task for teachers as well as for researchers. While individual assessment, for example through interviews, can provide deep insights into students' thinking, this is very time-consuming and therefore not feasible for whole classes or even larger settings. For those settings, automatically…
Descriptors: Multiple Choice Tests, Formative Evaluation, Mathematics Tests, Misconceptions
Peer reviewed
PDF on ERIC: Download full text
Aleyna Altan; Zehra Taspinar Sener – Online Submission, 2023
This research aimed to develop a valid and reliable test for detecting sixth-grade students' misconceptions and errors regarding fractions. A misconception diagnostic test was developed that covers the concept of fractions, different representations of fractions, ordering and comparing fractions, equivalence of…
Descriptors: Diagnostic Tests, Mathematics Tests, Fractions, Misconceptions
Peer reviewed
Direct link
Russell, Michael; Szendey, Olivia; Li, Zhushan – Educational Assessment, 2022
Recent research provides evidence that an intersectional approach to defining reference and focal groups results in a higher percentage of comparisons flagged for potential DIF. The study presented here examined the generalizability of this pattern across methods for examining DIF. While the level of DIF detection differed among the four methods…
Descriptors: Comparative Analysis, Item Analysis, Test Items, Test Construction
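The four DIF methods compared are not identified in this snippet; as one concrete example of a standard DIF procedure, here is a minimal Mantel-Haenszel sketch on invented counts, where each ability stratum contributes a 2x2 table of reference/focal group by correct/incorrect.

import numpy as np

# per-stratum counts: (ref correct, ref incorrect, focal correct, focal incorrect)
tables = np.array([[40, 10, 30, 20],
                   [35, 15, 25, 25],
                   [20, 30, 15, 35]], dtype=float)
A, B, C, D = tables.T
N = tables.sum(axis=1)
alpha_mh = np.sum(A * D / N) / np.sum(B * C / N)  # common odds ratio across strata
delta_mh = -2.35 * np.log(alpha_mh)               # ETS delta scale; large |delta| flags DIF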
Peer reviewed
Direct link
Zhang, Lishan; VanLehn, Kurt – Interactive Learning Environments, 2021
Despite their drawbacks, multiple-choice questions are an enduring feature of instruction because they can be answered more rapidly than open-response questions and are easily scored. However, it can be difficult to generate good incorrect choices (called "distractors"). We designed an algorithm to generate distractors from a…
Descriptors: Semantics, Networks, Multiple Choice Tests, Teaching Methods
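The article's algorithm is not reproduced here, but the general idea admits a small sketch: treat near neighbors of the correct answer in a semantic network as plausible-but-wrong distractors. The graph below is invented.

import networkx as nx

g = nx.Graph()
g.add_edges_from([("mitosis", "meiosis"), ("mitosis", "cell division"),
                  ("meiosis", "gamete"), ("cell division", "cytokinesis")])
answer = "mitosis"
# rank the other concepts by graph distance; the closest make plausible distractors
dist = nx.single_source_shortest_path_length(g, answer)
distractors = sorted((n for n in dist if n != answer), key=dist.get)[:3]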
National Academies Press, 2022
The National Assessment of Educational Progress (NAEP) -- often called "The Nation's Report Card" -- is the largest nationally representative, continuing assessment of what students in public and private schools in the United States know and can do in various subjects, and it has provided policy makers and the public with invaluable…
Descriptors: Costs, Futures (of Society), National Competency Tests, Educational Trends
Peer reviewed
Direct link
Choi, Inn-Chull; Moon, Youngsun – Language Assessment Quarterly, 2020
This study examines the relationships among various major factors that may affect the difficulty level of language tests in an attempt to enhance the robustness of item difficulty estimation, which constitutes a crucial factor ensuring the equivalency of high-stakes tests. The observed difficulties of the reading and listening sections of two EFL…
Descriptors: English (Second Language), Second Language Learning, Language Tests, Difficulty Level
Peer reviewed
Direct link
Kim, Ahyoung Alicia; Tywoniw, Rurik L.; Chapman, Mark – Language Assessment Quarterly, 2022
Technology-enhanced items (TEIs) are innovative, computer-delivered test items that allow test takers to interact with the test environment more fully than traditional multiple-choice items (MCIs). The interactive nature of TEIs offers improved construct coverage compared with MCIs, but little research exists regarding students' performance on…
Descriptors: Language Tests, Test Items, Computer Assisted Testing, English (Second Language)
Peer reviewed
PDF on ERIC: Download full text
Liao, Linyu – English Language Teaching, 2020
As a high-stakes standardized test, IELTS is expected to have comparable forms of test papers so that test takers from different test administrations on different dates receive comparable test scores. Therefore, this study examined the text difficulty and task characteristics of four parallel academic IELTS reading tests to reveal to what extent…
Descriptors: Second Language Learning, English (Second Language), Language Tests, High Stakes Tests
Peer reviewed
Direct link
Gu, Lin; Ling, Guangming; Liu, Ou Lydia; Yang, Zhitong; Li, Guirong; Kardanova, Elena; Loyalka, Prashant – Assessment & Evaluation in Higher Education, 2021
We examine the effects of computer-based versus paper-based assessment of critical thinking skills, adapted from English (in the U.S.) to Chinese. Using data collected based on a random assignment between the two modes in multiple Chinese colleges, we investigate mode effects from multiple perspectives: mean scores, measurement precision, item…
Descriptors: Critical Thinking, Tests, Test Format, Computer Assisted Testing
Yunxiao Chen; Xiaoou Li; Jingchen Liu; Gongjun Xu; Zhiliang Ying – Grantee Submission, 2017
Large-scale assessments are supported by a large item pool. An important task in test development is to assign items to scales that measure different characteristics of individuals, and a popular approach is cluster analysis of items. Classical methods in cluster analysis, such as hierarchical clustering, the K-means method, and latent-class…
Descriptors: Item Analysis, Classification, Graphs, Test Items
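A minimal sketch of one classical baseline named in the abstract (K-means on inter-item similarity features); the data are simulated, not the authors' method or dataset.

import numpy as np
from sklearn.cluster import KMeans

rng = np.random.default_rng(1)
responses = rng.binomial(1, 0.6, size=(500, 12))  # persons x items, simulated
item_features = np.corrcoef(responses.T)          # 12x12 inter-item correlation matrix
labels = KMeans(n_clusters=3, n_init=10, random_state=0).fit_predict(item_features)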