ERIC - Search Results

Publication Date

In 2026	0
Since 2025	3
Since 2022 (last 5 years)	14
Since 2017 (last 10 years)	32
Since 2007 (last 20 years)	52

Descriptor

Accuracy	53
Models	53
Test Items	53
Item Response Theory	34
Computation	16
Classification	15
Simulation	12
Comparative Analysis	11
Statistical Analysis	11
Bayesian Statistics	9
Computer Assisted Testing	9
Difficulty Level	9
Goodness of Fit	9
Maximum Likelihood Statistics	9
Monte Carlo Methods	9
Computer Software	8
Diagnostic Tests	8
Item Analysis	8
Psychometrics	8
Reaction Time	8
Correlation	7
Markov Processes	7
Mathematics Tests	7
Measurement	7
Sample Size	7
More ▼

Publication Type

Reports - Research	41
Journal Articles	39
Dissertations/Theses -…	6
Reports - Evaluative	4
Speeches/Meeting Papers	4
Reports - Descriptive	2

Education Level

Elementary Secondary Education	5
Higher Education	5
Postsecondary Education	5
Secondary Education	3
Junior High Schools	2
Middle Schools	2

Audience

Location

Iran

Laws, Policies, & Programs

Assessments and Surveys

Trends in International…	4
Big Five Inventory	1
Graduate Record Examinations	1
Program for International…	1
SAT (College Admission Test)	1

What Works Clearinghouse Rating

Showing 1 to 15 of 53 results Save | Export

The Accuracy of Estimating Parameters of Multiple-Choice Test Items, Following Item-Response Theory: A Simulation Study

Peer reviewed
PDF on ERIC

Download full text

Aiman Mohammad Freihat; Omar Saleh Bani Yassin – Educational Process: International Journal, 2025

Background/purpose: This study aimed to reveal the accuracy of estimation of multiple-choice test items parameters following the models of the item-response theory in measurement. Materials/methods: The researchers depended on the measurement accuracy indicators, which express the absolute difference between the estimated and actual values of the…

Descriptors: Accuracy, Computation, Multiple Choice Tests, Test Items

Comparing and Combining IRTree Models and Anchoring Vignettes in Addressing Response Styles

Peer reviewed

Direct link

Mingfeng Xue; Ping Chen – Journal of Educational Measurement, 2025

Response styles pose great threats to psychological measurements. This research compares IRTree models and anchoring vignettes in addressing response styles and estimating the target traits. It also explores the potential of combining them at the item level and total-score level (ratios of extreme and middle responses to vignettes). Four models…

Descriptors: Item Response Theory, Models, Comparative Analysis, Vignettes

ChatGPT's Performance Evaluation in Spreadsheets Modelling to Inform Assessments Redesign

Peer reviewed

Direct link

Michelle Cheong – Journal of Computer Assisted Learning, 2025

Background: Increasingly, students are using ChatGPT to assist them in learning and even completing their assessments, raising concerns of academic integrity and loss of critical thinking skills. Many articles suggested educators redesign assessments that are more 'Generative-AI-resistant' and to focus on assessing students on higher order…

Descriptors: Artificial Intelligence, Performance Based Assessment, Spreadsheets, Models

Investigating Heterogeneity in Response Strategies: A Mixture Multidimensional IRTree Approach

Peer reviewed

Direct link

Ö. Emre C. Alagöz; Thorsten Meiser – Educational and Psychological Measurement, 2024

To improve the validity of self-report measures, researchers should control for response style (RS) effects, which can be achieved with IRTree models. A traditional IRTree model considers a response as a combination of distinct decision-making processes, where the substantive trait affects the decision on response direction, while decisions about…

Descriptors: Item Response Theory, Validity, Self Evaluation (Individuals), Decision Making

Sample Size and Item Parameter Estimation Precision When Utilizing the Masters' Partial Credit Model

Download full text

Custer, Michael; Kim, Jongpil – Online Submission, 2023

This study utilizes an analysis of diminishing returns to examine the relationship between sample size and item parameter estimation precision when utilizing the Masters' Partial Credit Model for polytomous items. Item data from the standardization of the Batelle Developmental Inventory, 3rd Edition were used. Each item was scored with a…

Descriptors: Sample Size, Item Response Theory, Test Items, Computation

A Factored Regression Model for Composite Scores with Item-Level Missing Data

Peer reviewed
PDF on ERIC

Download full text

Direct link

Egamaria Alacam; Craig K. Enders; Han Du; Brian T. Keller – Grantee Submission, 2023

Composite scores are an exceptionally important psychometric tool for behavioral science research applications. A prototypical example occurs with self-report data, where researchers routinely use questionnaires with multiple items that tap into different features of a target construct. Item-level missing data are endemic to composite score…

Descriptors: Regression (Statistics), Scores, Psychometrics, Test Items

Modeling Nonlinear Effects of Person-by-Item Covariates in Explanatory Item Response Models: Exploratory Plots and Modeling Using Smooth Functions

Peer reviewed

Direct link

Sun-Joo Cho; Amanda Goodwin; Matthew Naveiras; Paul De Boeck – Grantee Submission, 2024

Explanatory item response models (EIRMs) have been applied to investigate the effects of person covariates, item covariates, and their interactions in the fields of reading education and psycholinguistics. In practice, it is often assumed that the relationships between the covariates and the logit transformation of item response probability are…

Descriptors: Item Response Theory, Test Items, Models, Maximum Likelihood Statistics

Modeling Nonlinear Effects of Person-by-Item Covariates in Explanatory Item Response Models: Exploratory Plots and Modeling Using Smooth Functions

Peer reviewed

Direct link

Sun-Joo Cho; Amanda Goodwin; Matthew Naveiras; Paul De Boeck – Journal of Educational Measurement, 2024

Descriptors: Item Response Theory, Test Items, Models, Maximum Likelihood Statistics

Using Machine Learning to Predict Bloom's Taxonomy Level for Certification Exam Items

Peer reviewed

Direct link

Mead, Alan D.; Zhou, Chenxuan – Journal of Applied Testing Technology, 2022

This study fit a Naïve Bayesian classifier to the words of exam items to predict the Bloom's taxonomy level of the items. We addressed five research questions, showing that reasonably good prediction of Bloom's level was possible, but accuracy varies across levels. In our study, performance for Level 2 was poor (Level 2 items were misclassified…

Descriptors: Artificial Intelligence, Prediction, Taxonomy, Natural Language Processing

Modeling and Analyzing Scorer Preferences in Short-Answer Math Questions

Peer reviewed
PDF on ERIC

Download full text

Zhang, Mengxue; Heffernan, Neil; Lan, Andrew – International Educational Data Mining Society, 2023

Automated scoring of student responses to open-ended questions, including short-answer questions, has great potential to scale to a large number of responses. Recent approaches for automated scoring rely on supervised learning, i.e., training classifiers or fine-tuning language models on a small number of responses with human-provided score…

Descriptors: Scoring, Computer Assisted Testing, Mathematics Instruction, Mathematics Tests

Reconsidering Cutoff Points in the General Method of Empirical Q-Matrix Validation

Peer reviewed

Direct link

Nájera, Pablo; Sorrel, Miguel A.; Abad, Francisco José – Educational and Psychological Measurement, 2019

Cognitive diagnosis models (CDMs) are latent class multidimensional statistical models that help classify people accurately by using a set of discrete latent variables, commonly referred to as attributes. These models require a Q-matrix that indicates the attributes involved in each item. A potential problem is that the Q-matrix construction…

Descriptors: Matrices, Statistical Analysis, Models, Classification

Diagnostic Classification Model for Forced-Choice Items and Noncognitive Tests

Peer reviewed

Direct link

Huang, Hung-Yu – Educational and Psychological Measurement, 2023

The forced-choice (FC) item formats used for noncognitive tests typically develop a set of response options that measure different traits and instruct respondents to make judgments among these options in terms of their preference to control the response biases that are commonly observed in normative tests. Diagnostic classification models (DCMs)…

Descriptors: Test Items, Classification, Bayesian Statistics, Decision Making

A Bayesian General Model to Account for Individual Differences in Operation-Specific Learning within a Test

Peer reviewed

Direct link

Lozano, José H.; Revuelta, Javier – Educational and Psychological Measurement, 2023

The present paper introduces a general multidimensional model to measure individual differences in learning within a single administration of a test. Learning is assumed to result from practicing the operations involved in solving the items. The model accounts for the possibility that the ability to learn may manifest differently for correct and…

Descriptors: Bayesian Statistics, Learning Processes, Test Items, Item Analysis

A Comparison of the Relative Performance of Four IRT Models on Equating Passage-Based Tests

Peer reviewed

Direct link

Kim, Kyung Yong; Lim, Euijin; Lee, Won-Chan – International Journal of Testing, 2019

For passage-based tests, items that belong to a common passage often violate the local independence assumption of unidimensional item response theory (UIRT). In this case, ignoring local item dependence (LID) and estimating item parameters using a UIRT model could be problematic because doing so might result in inaccurate parameter estimates,…

Descriptors: Item Response Theory, Equated Scores, Test Items, Models

An Item-Level Expected Classification Accuracy and Its Applications in Cognitive Diagnostic Assessment

Peer reviewed

Direct link

Wang, Wenyi; Song, Lihong; Chen, Ping; Ding, Shuliang – Journal of Educational Measurement, 2019

Most of the existing classification accuracy indices of attribute patterns lose effectiveness when the response data is absent in diagnostic testing. To handle this issue, this article proposes new indices to predict the correct classification rate of a diagnostic test before administering the test under the deterministic noise input…

Descriptors: Cognitive Tests, Classification, Accuracy, Diagnostic Tests

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4

Educational and Psychological…	10
Journal of Educational…	7
ProQuest LLC	6
Grantee Submission	4
ETS Research Report Series	3
Applied Measurement in…	2
Applied Psychological…	2
International Educational…	2
International Journal of…	2
Journal of Educational and…	2
Practical Assessment,…	2
Educational Assessment	1
Educational Process:…	1
Educational Sciences: Theory…	1
International Journal of…	1
Journal of Applied Testing…	1
Journal of Computer Assisted…	1
Measurement:…	1
Online Submission	1
Pearson	1
Psychometrika	1
SAGE Open	1
More ▼

Amanda Goodwin	2
Chang, Hua-Hua	2
He, Wei	2
Matthew Naveiras	2
Paul De Boeck	2
Sun-Joo Cho	2
Abad, Francisco José	1
Aiman Mohammad Freihat	1
Ali, Usama S.	1
Anil, Duygu	1
Arenson, Ethan A.	1
Baghaei, Purya	1
Brian T. Keller	1
Castellano, Katherine	1
Chen, Binglin	1
Chen, Jinsong	1
Chen, Ping	1
Chen, Shu-Ying	1
Chengyu Cui	1
Chien, Yuehmei	1
Chun Wang	1
Craig K. Enders	1
Culpepper, Steven	1
Custer, Michael	1
Dai, Shenghai	1
More ▼