ERIC - Search Results

Publication Date

In 2025	1
Since 2024	4
Since 2021 (last 5 years)	13
Since 2016 (last 10 years)	37
Since 2006 (last 20 years)	90

Descriptor

Comparative Analysis	149
Test Items	149
Models	95
Item Response Theory	69
Mathematical Models	46
Item Analysis	43
Simulation	34
Difficulty Level	32
Statistical Analysis	32
Goodness of Fit	24
Scores	24
Factor Analysis	23
Foreign Countries	23
Test Bias	21
Test Construction	21
Computer Assisted Testing	20
Error of Measurement	18
Correlation	16
Achievement Tests	15
Adaptive Testing	15
Sample Size	15
Mathematics Tests	14
Maximum Likelihood Statistics	14
Monte Carlo Methods	14
Psychometrics	14
More ▼

Publication Type

Journal Articles	95
Reports - Research	95
Reports - Evaluative	36
Speeches/Meeting Papers	25
Dissertations/Theses -…	8
Reports - Descriptive	6
Books	2
Guides - Non-Classroom	2
Opinion Papers	2
Reports - General	2
Tests/Questionnaires	2
More ▼

Education Level

Higher Education	18
Postsecondary Education	11
Elementary Education	7
Elementary Secondary Education	5
Secondary Education	5
Grade 4	3
High Schools	3
Grade 3	2
Grade 7	2
Grade 8	2
Intermediate Grades	2
Junior High Schools	2
Middle Schools	2
Grade 12	1
More ▼

Audience

Researchers	3
Practitioners	1
Students	1

Location

United States	6
South Korea	3
Germany	2
Massachusetts	2
Netherlands	2
Africa	1
Argentina	1
Canada	1
China	1
Colorado (Boulder)	1
France	1
Indonesia	1
Iran	1
Israel (Jerusalem)	1
Japan	1
Minnesota	1
Ohio	1
Russia	1
Saudi Arabia	1
Senegal	1
South Carolina	1
Sweden	1
Taiwan	1
Thailand (Bangkok)	1
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001	1
Race to the Top	1

Assessments and Surveys

Trends in International…	6
National Assessment of…	3
SAT (College Admission Test)	3
Iowa Tests of Educational…	2
Law School Admission Test	2
Program for International…	2
Test of English as a Foreign…	2
ACT Assessment	1
Advanced Placement…	1
Comprehensive Tests of Basic…	1
Graduate Record Examinations	1
Home Observation for…	1
National Longitudinal Survey…	1
Peabody Picture Vocabulary…	1
Raven Advanced Progressive…	1
School and College Ability…	1
Test of English for…	1
More ▼

What Works Clearinghouse Rating

Showing 1 to 15 of 149 results Save | Export

IRT Linking Methods for the Bifactor Model with Mixed Format Tests

Peer reviewed

Direct link

Sohee Kim; Ki Lynn Cole – International Journal of Testing, 2025

This study conducted a comprehensive comparison of Item Response Theory (IRT) linking methods applied to a bifactor model, examining their performance on both multiple choice (MC) and mixed format tests within the common item nonequivalent group design framework. Four distinct multidimensional IRT linking approaches were explored, consisting of…

Descriptors: Item Response Theory, Comparative Analysis, Models, Item Analysis

Detecting Differential Item Functioning with Multiple Causes: A Comparison of Three Methods

Peer reviewed

Direct link

Xiaowen Liu – International Journal of Testing, 2024

Differential item functioning (DIF) often arises from multiple sources. Within the context of multidimensional item response theory, this study examined DIF items with varying secondary dimensions using the three DIF methods: SIBTEST, Mantel-Haenszel, and logistic regression. The effect of the number of secondary dimensions on DIF detection rates…

Descriptors: Item Analysis, Test Items, Item Response Theory, Correlation

Parameters and Models of Item Response Theory (IRT): A Review of Literature

Peer reviewed

Direct link

Gyamfi, Abraham; Acquaye, Rosemary – Acta Educationis Generalis, 2023

Introduction: Item response theory (IRT) has received much attention in validation of assessment instrument because it allows the estimation of students' ability from any set of the items. Item response theory allows the difficulty and discrimination levels of each item on the test to be estimated. In the framework of IRT, item characteristics are…

Descriptors: Item Response Theory, Models, Test Items, Difficulty Level

Analyzing Polytomous Test Data: A Comparison between an Information-Based IRT Model and the Generalized Partial Credit Model

Peer reviewed

Direct link

Joakim Wallmark; James O. Ramsay; Juan Li; Marie Wiberg – Journal of Educational and Behavioral Statistics, 2024

Item response theory (IRT) models the relationship between the possible scores on a test item against a test taker's attainment of the latent trait that the item is intended to measure. In this study, we compare two models for tests with polytomously scored items: the optimal scoring (OS) model, a nonparametric IRT model based on the principles of…

Descriptors: Item Response Theory, Test Items, Models, Scoring

Item Response Theory and Modeling with Stata

Peer reviewed

Direct link

Raykov, Tenko – Measurement: Interdisciplinary Research and Perspectives, 2023

This software review discusses the capabilities of Stata to conduct item response theory modeling. The commands needed for fitting the popular one-, two-, and three-parameter logistic models are initially discussed. The procedure for testing the discrimination parameter equality in the one-parameter model is then outlined. The commands for fitting…

Descriptors: Item Response Theory, Models, Comparative Analysis, Item Analysis

A Comparison of the Added Value of Subscores across Two Subscore Augmentation Methods

Peer reviewed
PDF on ERIC

Download full text

Afsharrad, Mohammad; Pishghadam, Reza; Baghaei, Purya – International Journal of Language Testing, 2023

Testing organizations are faced with increasing demand to provide subscores in addition to the total test score. However, psychometricians argue that most subscores do not have added value to be worth reporting. To have added value, subscores need to meet a number of criteria: they should be reliable, distinctive, and distinct from each other and…

Descriptors: Comparative Analysis, Scores, Value Added Models, Psychometrics

A Comparison of Latent Semantic Analysis and Latent Dirichlet Allocation in Educational Measurement

Peer reviewed

Direct link

Jordan M. Wheeler; Allan S. Cohen; Shiyu Wang – Journal of Educational and Behavioral Statistics, 2024

Topic models are mathematical and statistical models used to analyze textual data. The objective of topic models is to gain information about the latent semantic space of a set of related textual data. The semantic space of a set of textual data contains the relationship between documents and words and how they are used. Topic models are becoming…

Descriptors: Semantics, Educational Assessment, Evaluators, Reliability

Application of Item Response Tree (IRTree) Models on Testing Data: Comparing Its Performance with Binary and Polytomous Item Response Models

Direct link

Yixi Wang – ProQuest LLC, 2020

Binary item response theory (IRT) models are widely used in educational testing data. These models are not perfect because they simplify the individual item responding process, ignore the differences among different response patterns, cannot handle multidimensionality that lay behind options within a single item, and cannot manage missing response…

Descriptors: Item Response Theory, Educational Testing, Data, Models

Modeling and Analyzing Scorer Preferences in Short-Answer Math Questions

Peer reviewed
PDF on ERIC

Download full text

Zhang, Mengxue; Heffernan, Neil; Lan, Andrew – International Educational Data Mining Society, 2023

Automated scoring of student responses to open-ended questions, including short-answer questions, has great potential to scale to a large number of responses. Recent approaches for automated scoring rely on supervised learning, i.e., training classifiers or fine-tuning language models on a small number of responses with human-provided score…

Descriptors: Scoring, Computer Assisted Testing, Mathematics Instruction, Mathematics Tests

Comparison of R Packages for Automated Test Assembly with Mixed-Integer Linear Programming

Peer reviewed

Direct link

Peabody, Michael R. – Measurement: Interdisciplinary Research and Perspectives, 2023

Many organizations utilize some form of automation in the test assembly process; either fully algorithmic or heuristically constructed. However, one issue with heuristic models is that when the test assembly problem changes the entire model may need to be re-conceptualized and recoded. In contrast, mixed-integer programming (MIP) is a mathematical…

Descriptors: Programming Languages, Algorithms, Heuristics, Mathematical Models

Diagnostic Classification Model for Forced-Choice Items and Noncognitive Tests

Peer reviewed

Direct link

Huang, Hung-Yu – Educational and Psychological Measurement, 2023

The forced-choice (FC) item formats used for noncognitive tests typically develop a set of response options that measure different traits and instruct respondents to make judgments among these options in terms of their preference to control the response biases that are commonly observed in normative tests. Diagnostic classification models (DCMs)…

Descriptors: Test Items, Classification, Bayesian Statistics, Decision Making

A Log-Linear Modeling Approach for Differential Item Functioning Detection in Polytomously Scored Items

Peer reviewed

Direct link

Yesiltas, Gonca; Paek, Insu – Educational and Psychological Measurement, 2020

A log-linear model (LLM) is a well-known statistical method to examine the relationship among categorical variables. This study investigated the performance of LLM in detecting differential item functioning (DIF) for polytomously scored items via simulations where various sample sizes, ability mean differences (impact), and DIF types were…

Descriptors: Simulation, Sample Size, Item Analysis, Scores

Classical Test Theory and Item Response Theory Comparison of the Brief Electricity and Magnetism Assessment and the Conceptual Survey of Electricity and Magnetism

Peer reviewed

Direct link

Eaton, Philip; Johnson, Keith; Barrett, Frank; Willoughby, Shannon – Physical Review Physics Education Research, 2019

For proper assessment selection understanding the statistical similarities amongst assessments that measure the same, or very similar, topics is imperative. This study seeks to extend the comparative analysis between the brief electricity and magnetism assessment (BEMA) and the conceptual survey of electricity and magnetism (CSEM) presented by…

Descriptors: Test Theory, Item Response Theory, Comparative Analysis, Energy

A Comparison of Estimation Techniques for IRT Models with Small Samples

Peer reviewed

Direct link

Finch, Holmes; French, Brian F. – Applied Measurement in Education, 2019

The usefulness of item response theory (IRT) models depends, in large part, on the accuracy of item and person parameter estimates. For the standard 3 parameter logistic model, for example, these parameters include the item parameters of difficulty, discrimination, and pseudo-chance, as well as the person ability parameter. Several factors impact…

Descriptors: Item Response Theory, Accuracy, Test Items, Difficulty Level

Scoring Graphical Responses in TIMSS 2019 Using Artificial Neural Networks

Peer reviewed

Direct link

von Davier, Matthias; Tyack, Lillian; Khorramdel, Lale – Educational and Psychological Measurement, 2023

Automated scoring of free drawings or images as responses has yet to be used in large-scale assessments of student achievement. In this study, we propose artificial neural networks to classify these types of graphical responses from a TIMSS 2019 item. We are comparing classification accuracy of convolutional and feed-forward approaches. Our…

Descriptors: Scoring, Networks, Artificial Intelligence, Elementary Secondary Education

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10

Educational and Psychological…	13
Journal of Educational…	11
Applied Psychological…	10
ETS Research Report Series	9
ProQuest LLC	8
International Journal of…	6
Journal of Educational and…	6
Applied Measurement in…	3
Language Testing	3
Measurement:…	3
Journal of Memory and Language	2
ACT, Inc.	1
Acta Educationis Generalis	1
Advances in Health Sciences…	1
Assessment	1
Assessment in Education:…	1
British Journal of…	1
Developmental Psychology	1
EURASIA Journal of…	1
Educational Evaluation and…	1
Educational Measurement:…	1
Educational Research and…	1
Hacettepe University Journal…	1
Intelligence	1
International Educational…	1
More ▼

Reckase, Mark D.	5
DeMars, Christine E.	3
McKinley, Robert L.	3
von Davier, Matthias	3
Benson, Jeri	2
Berger, Martijn P. F.	2
Cohen, Allan S.	2
Douglass, James B.	2
Haladyna, Thomas M.	2
He, Wei	2
Jin, Ying	2
Kim, Seock-Ho	2
Kromrey, Jeffrey D.	2
Lee, Young-Sun	2
Nandakumar, Ratna	2
Paek, Insu	2
Park, Yoon Soo	2
Sinharay, Sandip	2
Stocking, Martha L.	2
Strobl, Carolin	2
Suh, Youngsuk	2
Wainer, Howard	2
Xu, Xueli	2
Zeileis, Achim	2
More ▼