Publication Date
In 2025: 1
Since 2024: 1
Since 2021 (last 5 years): 5
Since 2016 (last 10 years): 18
Since 2006 (last 20 years): 51
Descriptor
Computation: 54
Difficulty Level: 54
Item Response Theory: 54
Test Items: 49
Models: 18
Comparative Analysis: 14
Error of Measurement: 12
Accuracy: 11
Bayesian Statistics: 11
Sample Size: 11
Simulation: 11
Author
Finch, Holmes: 3
He, Wei: 3
Jiao, Hong: 2
Ketterlin-Geller, Leanne R.: 2
Kim, Sooyeon: 2
Liu, Kimy: 2
Matlock, Ki Lynn: 2
Michaelides, Michalis P.: 2
Moses, Tim: 2
Paek, Insu: 2
Revuelta, Javier: 2
Education Level
Elementary Education: 6
Middle Schools: 6
Secondary Education: 6
Grade 8: 5
Grade 5: 4
Higher Education: 4
Junior High Schools: 4
Postsecondary Education: 4
Elementary Secondary Education: 3
Grade 3: 3
Grade 4: 3
Assessments and Surveys
General Aptitude Test Battery: 1
Measures of Academic Progress: 1
Aiman Mohammad Freihat; Omar Saleh Bani Yassin – Educational Process: International Journal, 2025
Background/purpose: This study aimed to determine how accurately multiple-choice test item parameters are estimated under item response theory models. Materials/methods: The researchers relied on measurement accuracy indicators, which express the absolute difference between the estimated and actual values of the…
Descriptors: Accuracy, Computation, Multiple Choice Tests, Test Items
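As a rough illustration of the accuracy indicator described in the abstract above (the absolute difference between estimated and actual parameter values), the following Python sketch computes per-parameter mean absolute error for a handful of hypothetical 3PL items; the parameter names and all numeric values are invented for illustration and are not taken from the study.

```python
import numpy as np

# Hypothetical true and estimated 3PL item parameters (a = discrimination,
# b = difficulty, c = pseudo-guessing); values are made up for illustration only.
true_params = {
    "a": np.array([1.10, 0.85, 1.40, 0.95]),
    "b": np.array([-0.50, 0.20, 1.10, -1.30]),
    "c": np.array([0.20, 0.25, 0.15, 0.22]),
}
est_params = {
    "a": np.array([1.05, 0.90, 1.52, 0.88]),
    "b": np.array([-0.42, 0.31, 1.02, -1.41]),
    "c": np.array([0.23, 0.21, 0.18, 0.25]),
}

for name in true_params:
    abs_err = np.abs(est_params[name] - true_params[name])  # per-item absolute error
    print(f"{name}: mean absolute error = {abs_err.mean():.3f}, max = {abs_err.max():.3f}")
```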
Sample Size and Item Parameter Estimation Precision When Utilizing the Masters' Partial Credit Model
Custer, Michael; Kim, Jongpil – Online Submission, 2023
This study uses an analysis of diminishing returns to examine the relationship between sample size and item parameter estimation precision when applying Masters' Partial Credit Model to polytomous items. Item data from the standardization of the Battelle Developmental Inventory, 3rd Edition were used. Each item was scored with a…
Descriptors: Sample Size, Item Response Theory, Test Items, Computation
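The "diminishing returns" idea in the abstract above can be illustrated with a minimal sketch, under the common (assumed) approximation that an item parameter's standard error shrinks roughly in proportion to 1/sqrt(N); this is not the study's actual analysis or data.

```python
import numpy as np

# Illustrative only: assume the standard error of an item parameter estimate
# shrinks roughly as 1 / sqrt(N); the constant 1.0 is arbitrary.
sample_sizes = np.array([100, 200, 400, 800, 1600, 3200])
se = 1.0 / np.sqrt(sample_sizes)

prev = None
for n, s in zip(sample_sizes, se):
    gain = "" if prev is None else f", gain over previous = {prev - s:.4f}"
    print(f"N = {n:>5}: approx. SE = {s:.4f}{gain}")
    prev = s
```

Each doubling of the sample buys a smaller absolute reduction in the approximate standard error, which is the pattern an analysis of diminishing returns would quantify against real calibration data.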
Tang, Xiaodan; Karabatsos, George; Chen, Haiqin – Applied Measurement in Education, 2020
In applications of item response theory (IRT) models, it is known that empirical violations of the local independence (LI) assumption can significantly bias parameter estimates. To address this issue, we propose a threshold-autoregressive item response theory (TAR-IRT) model that additionally accounts for order dependence among the item responses…
Descriptors: Item Response Theory, Test Items, Models, Computation
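A minimal sketch of the order-dependence idea, under the simplifying assumption that the previous response shifts the current item's logit by a fixed amount gamma; the published TAR-IRT model is more elaborate, and this is only an illustration.

```python
import numpy as np

rng = np.random.default_rng(0)

def simulate_order_dependent(theta, b, gamma, rng):
    """Simulate one examinee's responses where the previous answer shifts the
    logit by gamma (a minimal stand-in for order dependence among responses)."""
    responses = np.zeros(len(b), dtype=int)
    prev = 0
    for j, bj in enumerate(b):
        logit = theta - bj + gamma * prev
        p = 1.0 / (1.0 + np.exp(-logit))
        responses[j] = rng.random() < p
        prev = responses[j]
    return responses

b = np.array([-1.0, -0.5, 0.0, 0.5, 1.0])  # hypothetical item difficulties
print(simulate_order_dependent(theta=0.3, b=b, gamma=0.4, rng=rng))
```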
Derek Sauder – ProQuest LLC, 2020
The Rasch model is commonly used to calibrate multiple choice items. However, the sample sizes needed to estimate the Rasch model can be difficult to attain (e.g., consider a small testing company trying to pretest new items). With small sample sizes, auxiliary information besides the item responses may improve estimation of the item parameters.…
Descriptors: Item Response Theory, Sample Size, Computation, Test Length
Lozano, José H.; Revuelta, Javier – Applied Measurement in Education, 2021
The present study proposes a Bayesian approach for estimating and testing the operation-specific learning model, a variant of the linear logistic test model that allows for the measurement of the learning that occurs during a test as a result of the repeated use of the operations involved in the items. The advantages of using a Bayesian framework…
Descriptors: Bayesian Statistics, Computation, Learning, Testing
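For readers unfamiliar with the linear logistic test model (LLTM) family, the sketch below decomposes item difficulties into operation difficulties and subtracts a learning term that grows with earlier uses of each operation; the Q-matrix, weights, and learning rates are invented for illustration and do not reproduce the operation-specific learning model estimated in the study.

```python
import numpy as np

# Hypothetical Q-matrix: rows = items, columns = operations used by the item.
Q = np.array([[1, 0],
              [1, 1],
              [0, 1],
              [1, 1]])
eta = np.array([0.8, 0.5])    # assumed basic difficulty of each operation
delta = np.array([0.2, 0.1])  # assumed difficulty drop per prior use (learning)

# LLTM-style difficulty: sum of operation difficulties, minus a learning term
# driven by how often each operation was already used on earlier items.
prior_uses = np.vstack([np.zeros(Q.shape[1]), np.cumsum(Q, axis=0)[:-1]])
b = Q @ eta - (Q * prior_uses) @ delta
print(np.round(b, 3))
```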
Bjermo, Jonas; Miller, Frank – Applied Measurement in Education, 2021
In recent years, interest in measuring growth in student ability in various subjects between different grades has increased, so good precision in the estimated growth is important. This paper compares estimation methods and test designs with respect to the precision and bias of the estimated growth of mean ability…
Descriptors: Scaling, Ability, Computation, Test Items
He, Wei – NWEA, 2021
New MAP® Growth™ assessments are being developed that administer items more closely matched to the grade level of the student. However, MAP Growth items are calibrated with samples that typically consist of students from a variety of grades, including the target grade to which an item is aligned. While this choice of calibration sample is…
Descriptors: Achievement Tests, Test Items, Instructional Program Divisions, Difficulty Level
Marcoulides, Katerina M. – Measurement: Interdisciplinary Research and Perspectives, 2018
This study examined the use of Bayesian analysis methods for the estimation of item parameters in a two-parameter logistic item response theory model. Using simulated data under various design conditions with both informative and non-informative priors, the parameter recovery of Bayesian analysis methods was examined. Overall results showed that…
Descriptors: Bayesian Statistics, Item Response Theory, Probability, Difficulty Level
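A minimal sketch of Bayesian item parameter estimation for a single 2PL item, contrasting an informative prior with a weakly informative one via a plain Metropolis sampler; the priors, data, and sampler settings are assumptions for illustration and do not reproduce the study's design conditions.

```python
import numpy as np

rng = np.random.default_rng(1)

# Synthetic data (assumed): 500 examinees with known abilities answer one 2PL item.
theta = rng.normal(size=500)
a_true, b_true = 1.2, 0.3
x = (rng.random(500) < 1.0 / (1.0 + np.exp(-a_true * (theta - b_true)))).astype(int)

def log_post(a, b, prior_sd):
    """Bernoulli log-likelihood plus normal priors on a (centered at 1) and b (at 0)."""
    if a <= 0:
        return -np.inf
    logits = a * (theta - b)
    ll = np.sum(x * logits - np.log1p(np.exp(logits)))
    return ll - 0.5 * ((a - 1.0) ** 2 + b ** 2) / prior_sd ** 2

def metropolis(prior_sd, n_iter=5000, step=0.05):
    a, b = 1.0, 0.0
    draws = []
    for _ in range(n_iter):
        a_new, b_new = a + step * rng.normal(), b + step * rng.normal()
        if np.log(rng.random()) < log_post(a_new, b_new, prior_sd) - log_post(a, b, prior_sd):
            a, b = a_new, b_new
        draws.append((a, b))
    return np.array(draws[n_iter // 2:])  # keep the second half as posterior draws

for sd in (0.5, 5.0):  # informative vs. weakly informative prior scale
    post = metropolis(sd)
    print(f"prior sd {sd}: posterior mean a = {post[:, 0].mean():.2f}, b = {post[:, 1].mean():.2f}")
```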
Ilhan, Mustafa – International Journal of Assessment Tools in Education, 2019
This study investigated the effectiveness of statistical adjustments applied to rater bias in many-facet Rasch analysis. Changes were first made to a dataset that did not contain "rater × examinee" bias in order to introduce such bias. Bias adjustment was then applied to the rater bias included in the data file,…
Descriptors: Statistical Analysis, Item Response Theory, Evaluators, Bias
Matlock, Ki Lynn; Turner, Ronna – Educational and Psychological Measurement, 2016
When multiple test forms are constructed, they are often equivalent in the number of items and in total test difficulty. However, not all test developers match the number of items and/or average item difficulty within subcontent areas. In this simulation study, six test forms were constructed with an equal number of items and equal average item difficulty overall.…
Descriptors: Item Response Theory, Computation, Test Items, Difficulty Level
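As a toy illustration of assembling forms matched on length and average difficulty (not the construction procedure used in the study), the sketch below pairs easy and hard items from a sorted difficulty bank and alternates the pairs between two forms; the bank itself is randomly generated.

```python
import numpy as np

# Hypothetical item difficulty bank; the goal is two forms with equal length
# and approximately equal average difficulty.
rng = np.random.default_rng(3)
bank = np.sort(rng.normal(size=20))

# Pairing heuristic: walk the sorted bank from both ends and alternate the
# assignment of each (easy, hard) pair between the two forms.
form_a, form_b = [], []
for k, (lo, hi) in enumerate(zip(bank[:10], bank[::-1][:10])):
    (form_a if k % 2 == 0 else form_b).extend([lo, hi])

print(f"form A: n = {len(form_a)}, mean b = {np.mean(form_a):+.3f}")
print(f"form B: n = {len(form_b)}, mean b = {np.mean(form_b):+.3f}")
```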
Kogar, Hakan – International Journal of Assessment Tools in Education, 2018
The aim of this simulation study was to determine the relationship between true latent scores and estimated latent scores by including various control variables and different statistical models. The study also aimed to compare the statistical models and to determine the effects of different distribution types, response formats, and sample sizes on latent…
Descriptors: Simulation, Context Effect, Computation, Statistical Analysis
Dimitrov, Dimiter M. – Educational and Psychological Measurement, 2016
This article describes an approach to test scoring, referred to as "delta scoring" (D-scoring), for tests with dichotomously scored items. The D-scoring uses information from item response theory (IRT) calibration to facilitate computations and interpretations in the context of large-scale assessments. The D-score is computed from the…
Descriptors: Scoring, Equated Scores, Test Items, Measurement
Martin-Fernandez, Manuel; Revuelta, Javier – Psicologica: International Journal of Methodology and Experimental Psychology, 2017
This study compares the performance of two recently introduced estimation algorithms, the Metropolis-Hastings Robbins-Monro (MHRM) and Hamiltonian MCMC (HMC), with two well-established algorithms in the psychometric literature, marginal maximum likelihood via the EM algorithm (MML-EM) and Markov chain Monte Carlo (MCMC), in the estimation of multidimensional…
Descriptors: Bayesian Statistics, Item Response Theory, Models, Comparative Analysis
Kim, Sooyeon; Moses, Tim; Yoo, Hanwook – Journal of Educational Measurement, 2015
This inquiry is an investigation of item response theory (IRT) proficiency estimators' accuracy under multistage testing (MST). We chose a two-stage MST design that includes four modules (one at Stage 1, three at Stage 2) and three difficulty paths (low, middle, high). We assembled various two-stage MST panels (i.e., forms) by manipulating two…
Descriptors: Comparative Analysis, Item Response Theory, Computation, Accuracy
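One common IRT proficiency estimator examined in work like this is the expected a posteriori (EAP) estimate; the sketch below computes it for a single response pattern under a 2PL model with a standard normal prior. The item parameters and responses are hypothetical, and the multistage routing design of the study is not reproduced.

```python
import numpy as np

def eap_theta(responses, a, b, grid=np.linspace(-4, 4, 81)):
    """Expected a posteriori (EAP) ability estimate under a 2PL model with a
    standard normal prior; a, b are item discriminations and difficulties."""
    p = 1.0 / (1.0 + np.exp(-a * (grid[:, None] - b)))        # grid points x items
    like = np.prod(np.where(responses, p, 1.0 - p), axis=1)   # likelihood of the pattern
    post = like * np.exp(-0.5 * grid ** 2)                    # unnormalized posterior
    return np.sum(grid * post) / np.sum(post)

# Hypothetical item parameters and one response pattern, for illustration only.
a = np.array([1.0, 1.2, 0.8, 1.5, 0.9])
b = np.array([-1.0, -0.3, 0.2, 0.8, 1.5])
x = np.array([1, 1, 1, 0, 0])
print(f"EAP theta = {eap_theta(x, a, b):.2f}")
```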
Guo, Hongwen; Rios, Joseph A.; Haberman, Shelby; Liu, Ou Lydia; Wang, Jing; Paek, Insu – Applied Measurement in Education, 2016
Unmotivated test takers who respond through rapid guessing can negatively affect validity studies and evaluations of teacher and institution performance, making it critical to identify them. The authors propose a new nonparametric method for finding response-time thresholds to flag item responses that result from rapid-guessing…
Descriptors: Guessing (Tests), Reaction Time, Nonparametric Statistics, Models
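A minimal sketch of response-time flagging, assuming the time threshold is already known; the study's contribution is a nonparametric procedure for locating such thresholds, which this illustration does not reproduce.

```python
import numpy as np

def flag_rapid_guesses(response_times, threshold_seconds):
    """Flag responses faster than a given time threshold. The threshold here is
    supplied directly rather than derived from the response-time distribution."""
    return response_times < threshold_seconds

rng = np.random.default_rng(2)
rt = rng.lognormal(mean=3.0, sigma=0.6, size=1000)  # simulated solution-behavior times (seconds)
rt[:50] = rng.uniform(0.5, 3.0, size=50)            # injected rapid guesses
flags = flag_rapid_guesses(rt, threshold_seconds=5.0)
print(f"flagged {flags.sum()} of {len(rt)} responses as rapid guesses")
```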