ERIC - Search Results

Showing 1 to 15 of 29 results

IRT Linking Methods for the Bifactor Model with Mixed Format Tests

Peer reviewed

Sohee Kim; Ki Lynn Cole – International Journal of Testing, 2025

This study conducted a comprehensive comparison of Item Response Theory (IRT) linking methods applied to a bifactor model, examining their performance on both multiple choice (MC) and mixed format tests within the common item nonequivalent group design framework. Four distinct multidimensional IRT linking approaches were explored, consisting of…

Descriptors: Item Response Theory, Comparative Analysis, Models, Item Analysis

Information Functions of Rank-2PL Models for Forced-Choice Questionnaires

Peer reviewed

Jianbin Fu; Xuan Tan; Patrick C. Kyllonen – Journal of Educational Measurement, 2024

This paper presents the item and test information functions of the Rank two-parameter logistic models (Rank-2PLM) for items with two (pair) and three (triplet) statements in forced-choice questionnaires. The Rank-2PLM model for pairs is the MUPP-2PLM (Multi-Unidimensional Pairwise Preference) and, for triplets, is the Triplet-2PLM. Fisher's…

Descriptors: Questionnaires, Test Items, Item Response Theory, Models

The Accuracy of Estimating Parameters of Multiple-Choice Test Items, Following Item-Response Theory: A Simulation Study

Peer reviewed
PDF on ERIC

Aiman Mohammad Freihat; Omar Saleh Bani Yassin – Educational Process: International Journal, 2025

Background/purpose: This study aimed to reveal the accuracy of estimation of multiple-choice test items parameters following the models of the item-response theory in measurement. Materials/methods: The researchers depended on the measurement accuracy indicators, which express the absolute difference between the estimated and actual values of the…

Descriptors: Accuracy, Computation, Multiple Choice Tests, Test Items

A Method for Generating Course Test Questions Based on Natural Language Processing and Deep Learning

Peer reviewed

Hei-Chia Wang; Yu-Hung Chiang; I-Fan Chen – Education and Information Technologies, 2024

Assessment is viewed as an important means to understand learners' performance in the learning process. A good assessment method is based on high-quality examination questions. However, generating high-quality examination questions manually by teachers is a time-consuming task, and it is not easy for students to obtain question banks. To solve…

Descriptors: Natural Language Processing, Test Construction, Test Items, Models

Analysis of Mixed-Format Assessments Using Measurement Models and Topic Modeling

Peer reviewed

Jiawei Xiong; George Engelhard; Allan S. Cohen – Measurement: Interdisciplinary Research and Perspectives, 2025

It is common to find mixed-format data results from the use of both multiple-choice (MC) and constructed-response (CR) questions on assessments. Dealing with these mixed response types involves understanding what the assessment is measuring, and the use of suitable measurement models to estimate latent abilities. Past research in educational…

Descriptors: Responses, Test Items, Test Format, Grade 8

Detecting Differential Item Functioning with Multiple Causes: A Comparison of Three Methods

Peer reviewed

Xiaowen Liu – International Journal of Testing, 2024

Differential item functioning (DIF) often arises from multiple sources. Within the context of multidimensional item response theory, this study examined DIF items with varying secondary dimensions using the three DIF methods: SIBTEST, Mantel-Haenszel, and logistic regression. The effect of the number of secondary dimensions on DIF detection rates…

Descriptors: Item Analysis, Test Items, Item Response Theory, Correlation

Guesses and Slips as Proficiency-Related Phenomena and Impacts on Parameter Invariance

Peer reviewed

Xiangyi Liao; Daniel M Bolt – Educational Measurement: Issues and Practice, 2024

Traditional approaches to the modeling of multiple-choice item response data (e.g., 3PL, 4PL models) emphasize slips and guesses as random events. In this paper, an item response model is presented that characterizes both disjunctively interacting guessing and conjunctively interacting slipping processes as proficiency-related phenomena. We show…

Descriptors: Item Response Theory, Test Items, Error Correction, Guessing (Tests)

Modeling Directional Testlet Effects on Multiple Open-Ended Questions

Peer reviewed

Kuan-Yu Jin; Wai-Lok Siu – Journal of Educational Measurement, 2025

Educational tests often have a cluster of items linked by a common stimulus ("testlet"). In such a design, the dependencies caused between items are called "testlet effects." In particular, the directional testlet effect (DTE) refers to a recursive influence whereby responses to earlier items can positively or negatively affect…

Descriptors: Models, Test Items, Educational Assessment, Scores

Text-Based Question Difficulty Prediction: A Systematic Review of Automatic Approaches

Peer reviewed

Samah AlKhuzaey; Floriana Grasso; Terry R. Payne; Valentina Tamma – International Journal of Artificial Intelligence in Education, 2024

Designing and constructing pedagogical tests that contain items (i.e. questions) which measure various types of skills for different levels of students equitably is a challenging task. Teachers and item writers alike need to ensure that the quality of assessment materials is consistent, if student evaluations are to be objective and effective.…

Descriptors: Test Items, Test Construction, Difficulty Level, Prediction

Expanding the Lognormal Response Time Model Using Profile Similarity Metrics to Improve the Detection of Anomalous Testing Behavior

Peer reviewed

Gregory M. Hurtz; Regi Mucino – Journal of Educational Measurement, 2024

The Lognormal Response Time (LNRT) model measures the speed of test-takers relative to the normative time demands of items on a test. The resulting speed parameters and model residuals are often analyzed for evidence of anomalous test-taking behavior associated with fast and poorly fitting response time patterns. Extending this model, we…

Descriptors: Student Reaction, Reaction Time, Response Style (Tests), Test Items

A Multidimensional Partially Compensatory Response Time Model on Basis of the Log-Normal Distribution

Peer reviewed

Jochen Ranger; Christoph König; Benjamin W. Domingue; Jörg-Tobias Kuhn; Andreas Frey – Journal of Educational and Behavioral Statistics, 2024

In the existing multidimensional extensions of the log-normal response time (LNRT) model, the log response times are decomposed into a linear combination of several latent traits. These models are fully compensatory as low levels on traits can be counterbalanced by high levels on other traits. We propose an alternative multidimensional extension…

Descriptors: Models, Statistical Distributions, Item Response Theory, Response Rates (Questionnaires)

Correcting for Extreme Response Style: Model Choice Matters

Peer reviewed

Martijn Schoenmakers; Jesper Tijmstra; Jeroen Vermunt; Maria Bolsinova – Educational and Psychological Measurement, 2024

Extreme response style (ERS), the tendency of participants to select extreme item categories regardless of the item content, has frequently been found to decrease the validity of Likert-type questionnaire results. For this reason, various item response theory (IRT) models have been proposed to model ERS and correct for it. Comparisons of these…

Descriptors: Item Response Theory, Response Style (Tests), Models, Likert Scales

Detecting Multidimensional DIF in Polytomous Items with IRT Methods and Estimation Approaches

Peer reviewed

Güler Yavuz Temel – Journal of Educational Measurement, 2024

The purpose of this study was to investigate multidimensional DIF with a simple and nonsimple structure in the context of multidimensional Graded Response Model (MGRM). This study examined and compared the performance of the IRT-LR and Wald test using MML-EM and MHRM estimation approaches with different test factors and test structures in…

Descriptors: Computation, Multidimensional Scaling, Item Response Theory, Models

ChatGPT's Performance Evaluation in Spreadsheets Modelling to Inform Assessments Redesign

Peer reviewed

Michelle Cheong – Journal of Computer Assisted Learning, 2025

Background: Increasingly, students are using ChatGPT to assist them in learning and even completing their assessments, raising concerns of academic integrity and loss of critical thinking skills. Many articles suggested educators redesign assessments that are more 'Generative-AI-resistant' and to focus on assessing students on higher order…

Descriptors: Artificial Intelligence, Performance Based Assessment, Spreadsheets, Models

The Impact of Measurement Model Misspecification on Coefficient Omega Estimates of Composite Reliability

Peer reviewed

Stephanie M. Bell; R. Philip Chalmers; David B. Flora – Educational and Psychological Measurement, 2024

Coefficient omega indices are model-based composite reliability estimates that have become increasingly popular. A coefficient omega index estimates how reliably an observed composite score measures a target construct as represented by a factor in a factor-analysis model; as such, the accuracy of omega estimates is likely to depend on correct…

Descriptors: Influences, Models, Measurement Techniques, Reliability
