ERIC - Search Results

Publication Date

In 2025	1
Since 2024	2
Since 2021 (last 5 years)	5
Since 2016 (last 10 years)	16
Since 2006 (last 20 years)	32

Descriptor

Computer Software	35
Models	35
Test Items	35
Item Response Theory	18
Computer Assisted Testing	11
Test Construction	9
Accuracy	8
Computation	8
Difficulty Level	8
Scoring	8
Foreign Countries	7
Item Analysis	7
Simulation	7
Statistical Analysis	7
Educational Assessment	6
Classification	5
Comparative Analysis	5
Correlation	5
Goodness of Fit	5
Markov Processes	5
Monte Carlo Methods	5
Automation	4
Bayesian Statistics	4
Mathematics Tests	4
Programming	4
More ▼

Source

Educational and Psychological…	5
Journal of Educational…	5
Applied Psychological…	3
ETS Research Report Series	3
International Educational…	3
International Journal of…	3
Journal of Educational and…	2
Computers & Education	1
Council of Chief State School…	1
Eurasian Journal of…	1
IEEE Transactions on Learning…	1
International Journal of…	1
International Working Group…	1
Journal of Applied Testing…	1
Journal of Technology,…	1
Routledge, Taylor & Francis…	1
More ▼

Publication Type

Journal Articles	27
Reports - Research	21
Reports - Descriptive	6
Reports - Evaluative	5
Collected Works - Proceedings	2
Speeches/Meeting Papers	2
Tests/Questionnaires	2
Books	1

Education Level

Higher Education	7
Postsecondary Education	5
Elementary Secondary Education	3
Secondary Education	3
Adult Education	1

Audience

Researchers	2
Practitioners	1
Students	1

Location

Canada	1
China	1
Netherlands	1
Saudi Arabia	1
United States	1

Laws, Policies, & Programs

Assessments and Surveys

Program for International…	2
Trends in International…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 35 results Save | Export

Modeling Directional Testlet Effects on Multiple Open-Ended Questions

Peer reviewed

Direct link

Kuan-Yu Jin; Wai-Lok Siu – Journal of Educational Measurement, 2025

Educational tests often have a cluster of items linked by a common stimulus ("testlet"). In such a design, the dependencies caused between items are called "testlet effects." In particular, the directional testlet effect (DTE) refers to a recursive influence whereby responses to earlier items can positively or negatively affect…

Descriptors: Models, Test Items, Educational Assessment, Scores

Investigating Heterogeneity in Response Strategies: A Mixture Multidimensional IRTree Approach

Peer reviewed

Direct link

Ö. Emre C. Alagöz; Thorsten Meiser – Educational and Psychological Measurement, 2024

To improve the validity of self-report measures, researchers should control for response style (RS) effects, which can be achieved with IRTree models. A traditional IRTree model considers a response as a combination of distinct decision-making processes, where the substantive trait affects the decision on response direction, while decisions about…

Descriptors: Item Response Theory, Validity, Self Evaluation (Individuals), Decision Making

Modeling and Analyzing Scorer Preferences in Short-Answer Math Questions

Peer reviewed
PDF on ERIC

Download full text

Zhang, Mengxue; Heffernan, Neil; Lan, Andrew – International Educational Data Mining Society, 2023

Automated scoring of student responses to open-ended questions, including short-answer questions, has great potential to scale to a large number of responses. Recent approaches for automated scoring rely on supervised learning, i.e., training classifiers or fine-tuning language models on a small number of responses with human-provided score…

Descriptors: Scoring, Computer Assisted Testing, Mathematics Instruction, Mathematics Tests

Automatic Multiple Choice Question Generation From Text: A Survey

Peer reviewed

Direct link

Rao, Dhawaleswar; Saha, Sujan Kumar – IEEE Transactions on Learning Technologies, 2020

Automatic multiple choice question (MCQ) generation from a text is a popular research area. MCQs are widely accepted for large-scale assessment in various domains and applications. However, manual generation of MCQs is expensive and time-consuming. Therefore, researchers have been attracted toward automatic MCQ generation since the late 90's.…

Descriptors: Multiple Choice Tests, Test Construction, Automation, Computer Software

A Short Note on Estimating the Testlet Model with Different Estimators in Mplus

Peer reviewed

Direct link

Luo, Yong – Educational and Psychological Measurement, 2018

Mplus is a powerful latent variable modeling software program that has become an increasingly popular choice for fitting complex item response theory models. In this short note, we demonstrate that the two-parameter logistic testlet model can be estimated as a constrained bifactor model in Mplus with three estimators encompassing limited- and…

Descriptors: Computer Software, Models, Statistical Analysis, Computation

Using JAGS for Bayesian Cognitive Diagnosis Modeling: A Tutorial

Peer reviewed

Direct link

Zhan, Peida; Jiao, Hong; Man, Kaiwen; Wang, Lijun – Journal of Educational and Behavioral Statistics, 2019

In this article, we systematically introduce the just another Gibbs sampler (JAGS) software program to fit common Bayesian cognitive diagnosis models (CDMs) including the deterministic inputs, noisy "and" gate model; the deterministic inputs, noisy "or" gate model; the linear logistic model; the reduced reparameterized unified…

Descriptors: Bayesian Statistics, Computer Software, Models, Test Items

Implementation of Cognitive Diagnosis Modeling Using the GDINA R Package

Peer reviewed
PDF on ERIC

Download full text

Torre, Jimmy de la; Akbay, Lokman – Eurasian Journal of Educational Research, 2019

Purpose: Well-designed assessment methodologies and various cognitive diagnosis models (CDMs) to extract diagnostic information about examinees' individual strengths and weaknesses have been developed. Due to this novelty, as well as educational specialists' lack of familiarity with CDMs, their applications are not widespread. This article aims at…

Descriptors: Cognitive Measurement, Models, Computer Software, Testing

Diagnostic Classification Models: Recent Developments, Practical Issues, and Prospects

Peer reviewed

Direct link

Ravand, Hamdollah; Baghaei, Purya – International Journal of Testing, 2020

More than three decades after their introduction, diagnostic classification models (DCM) do not seem to have been implemented in educational systems for the purposes they were devised. Most DCM research is either methodological for model development and refinement or retrofitting to existing nondiagnostic tests and, in the latter case, basically…

Descriptors: Classification, Models, Diagnostic Tests, Test Construction

Scoring Graphical Responses in TIMSS 2019 Using Artificial Neural Networks

Peer reviewed

Direct link

von Davier, Matthias; Tyack, Lillian; Khorramdel, Lale – Educational and Psychological Measurement, 2023

Automated scoring of free drawings or images as responses has yet to be used in large-scale assessments of student achievement. In this study, we propose artificial neural networks to classify these types of graphical responses from a TIMSS 2019 item. We are comparing classification accuracy of convolutional and feed-forward approaches. Our…

Descriptors: Scoring, Networks, Artificial Intelligence, Elementary Secondary Education

A Short Note on Obtaining Point Estimates of the IRT Ability Parameter with MCMC Estimation in Mplus: How Many Plausible Values Are Needed?

Peer reviewed

Direct link

Luo, Yong; Dimitrov, Dimiter M. – Educational and Psychological Measurement, 2019

Plausible values can be used to either estimate population-level statistics or compute point estimates of latent variables. While it is well known that five plausible values are usually sufficient for accurate estimation of population-level statistics in large-scale surveys, the minimum number of plausible values needed to obtain accurate latent…

Descriptors: Item Response Theory, Monte Carlo Methods, Markov Processes, Outcome Measures

SARM: A Computer Program for Estimating Speed-Accuracy Response Models for Dichotomous Items. Research Report. ETS RR-18-15

Peer reviewed
PDF on ERIC

Download full text

van Rijn, Peter W.; Ali, Usama S. – ETS Research Report Series, 2018

A computer program was developed to estimate speed-accuracy response models for dichotomous items. This report describes how the models are estimated and how to specify data and input files. An example using data from a listening section of an international language test is described to illustrate the modeling approach and features of the computer…

Descriptors: Computer Software, Computation, Reaction Time, Timed Tests

Towards a Model-Free Estimate of the Limits to Student Modeling Accuracy

Peer reviewed
PDF on ERIC

Download full text

Chen, Binglin; West, Matthew; Ziles, Craig – International Educational Data Mining Society, 2018

This paper attempts to quantify the accuracy limit of "nextitem-correct" prediction by using numerical optimization to estimate the student's probability of getting each question correct given a complete sequence of item responses. This optimization is performed without an explicit parameterized model of student behavior, but with the…

Descriptors: Accuracy, Probability, Student Behavior, Test Items

Computerized Adaptive Test (CAT) Applications and Item Response Theory Models for Polytomous Items

Peer reviewed
PDF on ERIC

Download full text

Aybek, Eren Can; Demirtasli, R. Nukhet – International Journal of Research in Education and Science, 2017

This article aims to provide a theoretical framework for computerized adaptive tests (CAT) and item response theory models for polytomous items. Besides that, it aims to introduce the simulation and live CAT software to the related researchers. Computerized adaptive test algorithm, assumptions of item response theory models, nominal response…

Descriptors: Computer Assisted Testing, Adaptive Testing, Item Response Theory, Test Items

Item Response Data Analysis Using Stata Item Response Theory Package

Peer reviewed

Direct link

Yang, Ji Seung; Zheng, Xiaying – Journal of Educational and Behavioral Statistics, 2018

The purpose of this article is to introduce and review the capability and performance of the Stata item response theory (IRT) package that is available from Stata v.14, 2015. Using a simulated data set and a publicly available item response data set extracted from Programme of International Student Assessment, we review the IRT package from…

Descriptors: Item Response Theory, Item Analysis, Computer Software, Statistical Analysis

Modeling Skipped and Not-Reached Items Using IRTrees

Peer reviewed

Direct link

Debeer, Dries; Janssen, Rianne; De Boeck, Paul – Journal of Educational Measurement, 2017

When dealing with missing responses, two types of omissions can be discerned: items can be skipped or not reached by the test taker. When the occurrence of these omissions is related to the proficiency process the missingness is nonignorable. The purpose of this article is to present a tree-based IRT framework for modeling responses and omissions…

Descriptors: Item Response Theory, Test Items, Responses, Testing Problems

Previous Page | Next Page »

Pages: 1 | 2 | 3

Jin, Kuan-Yu	3
Wang, Wen-Chung	3
Gierl, Mark J.	2
Jiao, Hong	2
Luo, Yong	2
Akbay, Lokman	1
Ali, Usama S.	1
Alves, Cecila	1
Ariew, Robert A.	1
Aybek, Eren Can	1
Baghaei, Purya	1
Bock, H. Darrell	1
Breyer, F. Jay	1
Calders, Toon	1
Chang, Chi	1
Chen, Binglin	1
Conati, Cristina	1
De Boeck, Paul	1
DeCarlo, Lawrence T.	1
Deane, Paul	1
Debeer, Dries	1
Demirtasli, R. Nukhet	1
Dimitrov, Dimiter M.	1
Dunkel, Patricia A.	1
Fan, Ya-Ching	1
More ▼