ERIC - Search Results

Publication Date

In 2025	9
Since 2024	36

Source

Educational and Psychological…	6
Journal of Educational and…	6
Journal of Educational…	5
Education and Information…	2
Grantee Submission	2
International Journal of…	2
Measurement:…	2
Annenberg Institute for…	1
Asia Pacific Education Review	1
Educational Measurement:…	1
Educational Process:…	1
Inclusion	1
International Journal of…	1
Journal of Computer Assisted…	1
Large-scale Assessments in…	1
ProQuest LLC	1
School Leadership Review	1
npj Science of Learning	1
More ▼

Publication Type

Journal Articles	32
Reports - Research	29
Reports - Evaluative	3
Reports - Descriptive	2
Dissertations/Theses -…	1
Information Analyses	1

Education Level

Secondary Education	5
Elementary Education	4
Elementary Secondary Education	4
Junior High Schools	4
Middle Schools	4
Early Childhood Education	2
Grade 2	2
Grade 8	2
Primary Education	2
Grade 4	1
Intermediate Grades	1
More ▼

Audience

Location

Denmark	1
Finland	1
Norway	1
Sweden	1

Laws, Policies, & Programs

Assessments and Surveys

Trends in International…	3
Big Five Inventory	1
Program for International…	1
Stages of Concern…	1
Teaching and Learning…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 36 results Save | Export

Influences of Carry-Over Effects across Scales on Mediation Analyses

Peer reviewed

Direct link

Kuan-Yu Jin; Yi-Jhen Wu; Ming Ming Chiu – Measurement: Interdisciplinary Research and Perspectives, 2025

Many education tests and psychological surveys elicit respondent views of similar constructs across scenarios (e.g., story followed by multiple choice questions) by repeating common statements across scales (one-statement-multiple-scale, OSMS). However, a respondent's earlier responses to the common statement can affect later responses to it…

Descriptors: Administrator Surveys, Teacher Surveys, Responses, Test Items

IRT Linking Methods for the Bifactor Model with Mixed Format Tests

Peer reviewed

Direct link

Sohee Kim; Ki Lynn Cole – International Journal of Testing, 2025

This study conducted a comprehensive comparison of Item Response Theory (IRT) linking methods applied to a bifactor model, examining their performance on both multiple choice (MC) and mixed format tests within the common item nonequivalent group design framework. Four distinct multidimensional IRT linking approaches were explored, consisting of…

Descriptors: Item Response Theory, Comparative Analysis, Models, Item Analysis

Information Functions of Rank-2PL Models for Forced-Choice Questionnaires

Peer reviewed

Direct link

Jianbin Fu; Xuan Tan; Patrick C. Kyllonen – Journal of Educational Measurement, 2024

This paper presents the item and test information functions of the Rank two-parameter logistic models (Rank-2PLM) for items with two (pair) and three (triplet) statements in forced-choice questionnaires. The Rank-2PLM model for pairs is the MUPP-2PLM (Multi-Unidimensional Pairwise Preference) and, for triplets, is the Triplet-2PLM. Fisher's…

Descriptors: Questionnaires, Test Items, Item Response Theory, Models

The Impact of Measurement Noninvariance across Time and Group in Longitudinal Item Response Modeling

Peer reviewed

Direct link

In-Hee Choi – Asia Pacific Education Review, 2024

Longitudinal item response data often exhibit two types of measurement noninvariance: the noninvariance of item parameters between subject groups and that of item parameters across multiple time points. This study proposes a comprehensive approach to the simultaneous modeling of both types of measurement noninvariance in terms of longitudinal item…

Descriptors: Longitudinal Studies, Item Response Theory, Growth Models, Error of Measurement

The Accuracy of Estimating Parameters of Multiple-Choice Test Items, Following Item-Response Theory: A Simulation Study

Peer reviewed
PDF on ERIC

Download full text

Aiman Mohammad Freihat; Omar Saleh Bani Yassin – Educational Process: International Journal, 2025

Background/purpose: This study aimed to reveal the accuracy of estimation of multiple-choice test items parameters following the models of the item-response theory in measurement. Materials/methods: The researchers depended on the measurement accuracy indicators, which express the absolute difference between the estimated and actual values of the…

Descriptors: Accuracy, Computation, Multiple Choice Tests, Test Items

A Method for Generating Course Test Questions Based on Natural Language Processing and Deep Learning

Peer reviewed

Direct link

Hei-Chia Wang; Yu-Hung Chiang; I-Fan Chen – Education and Information Technologies, 2024

Assessment is viewed as an important means to understand learners' performance in the learning process. A good assessment method is based on high-quality examination questions. However, generating high-quality examination questions manually by teachers is a time-consuming task, and it is not easy for students to obtain question banks. To solve…

Descriptors: Natural Language Processing, Test Construction, Test Items, Models

Added Value of Subscores for Tests with Polytomous Items

Peer reviewed

Direct link

Kylie Gorney; Sandip Sinharay – Educational and Psychological Measurement, 2025

Test-takers, policymakers, teachers, and institutions are increasingly demanding that testing programs provide more detailed feedback regarding test performance. As a result, there has been a growing interest in the reporting of subscores that potentially provide such detailed feedback. Haberman developed a method based on classical test theory…

Descriptors: Scores, Test Theory, Test Items, Testing

Analysis of Mixed-Format Assessments Using Measurement Models and Topic Modeling

Peer reviewed

Direct link

Jiawei Xiong; George Engelhard; Allan S. Cohen – Measurement: Interdisciplinary Research and Perspectives, 2025

It is common to find mixed-format data results from the use of both multiple-choice (MC) and constructed-response (CR) questions on assessments. Dealing with these mixed response types involves understanding what the assessment is measuring, and the use of suitable measurement models to estimate latent abilities. Past research in educational…

Descriptors: Responses, Test Items, Test Format, Grade 8

Detecting Differential Item Functioning with Multiple Causes: A Comparison of Three Methods

Peer reviewed

Direct link

Xiaowen Liu – International Journal of Testing, 2024

Differential item functioning (DIF) often arises from multiple sources. Within the context of multidimensional item response theory, this study examined DIF items with varying secondary dimensions using the three DIF methods: SIBTEST, Mantel-Haenszel, and logistic regression. The effect of the number of secondary dimensions on DIF detection rates…

Descriptors: Item Analysis, Test Items, Item Response Theory, Correlation

Guesses and Slips as Proficiency-Related Phenomena and Impacts on Parameter Invariance

Peer reviewed

Direct link

Xiangyi Liao; Daniel M Bolt – Educational Measurement: Issues and Practice, 2024

Traditional approaches to the modeling of multiple-choice item response data (e.g., 3PL, 4PL models) emphasize slips and guesses as random events. In this paper, an item response model is presented that characterizes both disjunctively interacting guessing and conjunctively interacting slipping processes as proficiency-related phenomena. We show…

Descriptors: Item Response Theory, Test Items, Error Correction, Guessing (Tests)

Modeling Directional Testlet Effects on Multiple Open-Ended Questions

Peer reviewed

Direct link

Kuan-Yu Jin; Wai-Lok Siu – Journal of Educational Measurement, 2025

Educational tests often have a cluster of items linked by a common stimulus ("testlet"). In such a design, the dependencies caused between items are called "testlet effects." In particular, the directional testlet effect (DTE) refers to a recursive influence whereby responses to earlier items can positively or negatively affect…

Descriptors: Models, Test Items, Educational Assessment, Scores

Text-Based Question Difficulty Prediction: A Systematic Review of Automatic Approaches

Peer reviewed

Direct link

Samah AlKhuzaey; Floriana Grasso; Terry R. Payne; Valentina Tamma – International Journal of Artificial Intelligence in Education, 2024

Designing and constructing pedagogical tests that contain items (i.e. questions) which measure various types of skills for different levels of students equitably is a challenging task. Teachers and item writers alike need to ensure that the quality of assessment materials is consistent, if student evaluations are to be objective and effective.…

Descriptors: Test Items, Test Construction, Difficulty Level, Prediction

Expanding the Lognormal Response Time Model Using Profile Similarity Metrics to Improve the Detection of Anomalous Testing Behavior

Peer reviewed

Direct link

Gregory M. Hurtz; Regi Mucino – Journal of Educational Measurement, 2024

The Lognormal Response Time (LNRT) model measures the speed of test-takers relative to the normative time demands of items on a test. The resulting speed parameters and model residuals are often analyzed for evidence of anomalous test-taking behavior associated with fast and poorly fitting response time patterns. Extending this model, we…

Descriptors: Student Reaction, Reaction Time, Response Style (Tests), Test Items

Feature versus Object in Interpreting Working Memory Capacity

Peer reviewed

Direct link

Wuji Lin; Chenxi Lv; Jiejie Liao; Yuan Hu; Yutong Liu; Jingyuan Lin – npj Science of Learning, 2024

The debate about whether the capacity of working memory (WM) varies with the complexity of memory items continues. This study employed novel experimental materials to investigate the role of complexity in WM capacity. Across seven experiments, we explored the relationship between complexity and WM capacity. The results indicated that the…

Descriptors: Short Term Memory, Difficulty Level, Retention (Psychology), Test Items

A Multidimensional Partially Compensatory Response Time Model on Basis of the Log-Normal Distribution

Peer reviewed

Direct link

Jochen Ranger; Christoph König; Benjamin W. Domingue; Jörg-Tobias Kuhn; Andreas Frey – Journal of Educational and Behavioral Statistics, 2024

In the existing multidimensional extensions of the log-normal response time (LNRT) model, the log response times are decomposed into a linear combination of several latent traits. These models are fully compensatory as low levels on traits can be counterbalanced by high levels on other traits. We propose an alternative multidimensional extension…

Descriptors: Models, Statistical Distributions, Item Response Theory, Response Rates (Questionnaires)

Previous Page | Next Page »

Pages: 1 | 2 | 3

Benjamin W. Domingue	3
Allan S. Cohen	2
Amanda Goodwin	2
Jesper Tijmstra	2
Joshua B. Gilbert	2
Kuan-Yu Jin	2
Luke W. Miratrix	2
Maria Bolsinova	2
Matthew Naveiras	2
Mridul Joshi	2
Paul De Boeck	2
Sun-Joo Cho	2
Aditya Shah	1
Aiman Mohammad Freihat	1
Ajay Devmane	1
Andreas Frey	1
Brad Linnenkamp	1
Chengyu Cui	1
Chenxi Lv	1
Christoph König	1
Chun Wang	1
Daniel M Bolt	1
Daniel M. Bolt	1
David B. Flora	1
David Rutkowski	1
More ▼

Test Items	36
Models	29
Item Response Theory	23
Item Analysis	11
Simulation	9
Computation	7
Difficulty Level	7
Accuracy	6
Error of Measurement	6
Measurement Techniques	6
Test Construction	6
Achievement Tests	5
Correlation	5
Foreign Countries	5
Psychometrics	5
Scores	5
Artificial Intelligence	4
Comparative Analysis	4
Data Analysis	4
Educational Assessment	4
Elementary Secondary Education	4
Goodness of Fit	4
International Assessment	4
Intervention	4
Maximum Likelihood Statistics	4
More ▼