ERIC - Search Results

Showing 1 to 15 of 17 results

A Comparison of Estimation Techniques for IRT Models with Small Samples

Peer reviewed

Finch, Holmes; French, Brian F. – Applied Measurement in Education, 2019

The usefulness of item response theory (IRT) models depends, in large part, on the accuracy of item and person parameter estimates. For the standard 3 parameter logistic model, for example, these parameters include the item parameters of difficulty, discrimination, and pseudo-chance, as well as the person ability parameter. Several factors impact…

Descriptors: Item Response Theory, Accuracy, Test Items, Difficulty Level

Assessing the Performance of Classical Test Theory Item Discrimination Estimators in Monte Carlo Simulations

Peer reviewed

Bazaldua, Diego A. Luna; Lee, Young-Sun; Keller, Bryan; Fellers, Lauren – Asia Pacific Education Review, 2017

The performance of various classical test theory (CTT) item discrimination estimators has been compared in the literature using both empirical and simulated data, resulting in mixed results regarding the preference of some discrimination estimators over others. This study analyzes the performance of various item discrimination estimators in CTT:…

Descriptors: Test Items, Monte Carlo Methods, Item Response Theory, Correlation

Subjective Priors for Item Response Models: Application of Elicitation by Design

Peer reviewed

Ames, Allison; Smith, Elizabeth – Journal of Educational Measurement, 2018

Bayesian methods incorporate model parameter information prior to data collection. Eliciting information from content experts is an option, but has seen little implementation in Bayesian item response theory (IRT) modeling. This study aims to use ethical reasoning content experts to elicit prior information and incorporate this information into…

Descriptors: Item Response Theory, Bayesian Statistics, Ethics, Specialists

Bayesian Estimation of Multidimensional Item Response Models. A Comparison of Analytic and Simulation Algorithms

Peer reviewed

Martin-Fernandez, Manuel; Revuelta, Javier – Psicologica: International Journal of Methodology and Experimental Psychology, 2017

This study compares the performance of two estimation algorithms of new usage, the Metropolis-Hastings Robbins-Monro (MHRM) and the Hamiltonian MCMC (HMC), with two consolidated algorithms in the psychometric literature, the marginal likelihood via EM algorithm (MML-EM) and the Markov chain Monte Carlo (MCMC), in the estimation of multidimensional…

Descriptors: Bayesian Statistics, Item Response Theory, Models, Comparative Analysis

Rasch Mixture Models for DIF Detection: A Comparison of Old and New Score Specifications

Peer reviewed

Frick, Hannah; Strobl, Carolin; Zeileis, Achim – Educational and Psychological Measurement, 2015

Rasch mixture models can be a useful tool when checking the assumption of measurement invariance for a single Rasch model. They provide advantages compared to manifest differential item functioning (DIF) tests when the DIF groups are only weakly correlated with the manifest covariates available. Unlike in single Rasch models, estimation of Rasch…

Descriptors: Item Response Theory, Test Bias, Comparative Analysis, Scores

Accuracy and Variability of Item Parameter Estimates from Marginal Maximum a Posteriori Estimation and Bayesian Inference via Gibbs Samplers

Wu, Yi-Fang – ProQuest LLC, 2015

Item response theory (IRT) uses a family of statistical models for estimating stable characteristics of items and examinees and defining how these characteristics interact in describing item and test performance. With a focus on the three-parameter logistic IRT (Birnbaum, 1968; Lord, 1980) model, the current study examines the accuracy and…

Descriptors: Item Response Theory, Test Items, Accuracy, Computation

Parameter Recovery and Classification Accuracy under Conditions of Testlet Dependency: A Comparison of the Traditional 2PL, Testlet, and Bi-Factor Models

Peer reviewed

Koziol, Natalie A. – Applied Measurement in Education, 2016

Testlets, or groups of related items, are commonly included in educational assessments due to their many logistical and conceptual advantages. Despite their advantages, testlets introduce complications into the theory and practice of educational measurement. Responses to items within a testlet tend to be correlated even after controlling for…

Descriptors: Classification, Accuracy, Comparative Analysis, Models

A Comparison of Uniform DIF Effect Size Estimators under the MIMIC and Rasch Models

Peer reviewed

Jin, Ying; Myers, Nicholas D.; Ahn, Soyeon; Penfield, Randall D. – Educational and Psychological Measurement, 2013

The Rasch model, a member of a larger group of models within item response theory, is widely used in empirical studies. Detection of uniform differential item functioning (DIF) within the Rasch model typically employs null hypothesis testing with a concomitant consideration of effect size (e.g., signed area [SA]). Parametric equivalence between…

Descriptors: Test Bias, Effect Size, Item Response Theory, Comparative Analysis

Estimation Methods for One-Parameter Testlet Models

Peer reviewed

Jiao, Hong; Wang, Shudong; He, Wei – Journal of Educational Measurement, 2013

This study demonstrated the equivalence between the Rasch testlet model and the three-level one-parameter testlet model and explored the Markov Chain Monte Carlo (MCMC) method for model parameter estimation in WinBUGS. The estimation accuracy from the MCMC method was compared with those from the marginalized maximum likelihood estimation (MMLE)…

Descriptors: Computation, Item Response Theory, Models, Monte Carlo Methods

Termination Criteria for Computerized Classification Testing

Peer reviewed

Thompson, Nathan A. – Practical Assessment, Research & Evaluation, 2011

Computerized classification testing (CCT) is an approach to designing tests with intelligent algorithms, similar to adaptive testing, but specifically designed for the purpose of classifying examinees into categories such as "pass" and "fail." Like adaptive testing for point estimation of ability, the key component is the…

Descriptors: Adaptive Testing, Computer Assisted Testing, Classification, Probability

Comparison of IRT Likelihood Ratio Test and Logistic Regression DIF Detection Procedures

Peer reviewed

Atar, Burcu; Kamata, Akihito – Hacettepe University Journal of Education, 2011

The Type I error rates and the power of IRT likelihood ratio test and cumulative logit ordinal logistic regression procedures in detecting differential item functioning (DIF) for polytomously scored items were investigated in this Monte Carlo simulation study. For this purpose, 54 simulation conditions (combinations of 3 sample sizes, 2 sample…

Descriptors: Test Bias, Sample Size, Monte Carlo Methods, Item Response Theory

A Monte Carlo Comparison of Item and Person Statistics Based on Item Response Theory versus Classical Test Theory.

Peer reviewed

MacDonald, Paul; Paunonen, Sampo V. – Educational and Psychological Measurement, 2002

Examined the behavior of item and person statistics from item response theory and classical test theory frameworks through Monte Carlo methods with simulated test data. Findings suggest that item difficulty and person ability estimates are highly comparable for both approaches. (SLD)

Descriptors: Ability, Comparative Analysis, Difficulty Level, Item Response Theory

"Thin" versus "Thick" Matching in the Mantel-Haenszel Procedure for Detecting DIF.

Donoghue, John R.; Allen, Nancy L. – 1991

This Monte Carlo study examined strategies for forming the matching variable for the Mantel-Haenszel (MH) differential item functioning (DIF) procedure. Data were generated using a three-parameter logistic item response theory model, with common guessing parameters. The number of subjects and test length were manipulated, as were the difficulty,…

Descriptors: Comparative Analysis, Difficulty Level, Equations (Mathematics), Item Bias

A Bayesian Version of Rasch's Multiplicative Poisson Model for the Number of Errors of an Achievement Test.

Peer reviewed

Jansen, Margo G. H. – Journal of Educational Statistics, 1986

In this paper a Bayesian procedure is developed for the simultaneous estimation of the reading ability and difficulty parameters which are assumed to be factors in reading errors by the multiplicative Poisson Model. According to several criteria, the Bayesian estimates are better than comparable maximum likelihood estimates. (Author/JAZ)

Descriptors: Achievement Tests, Bayesian Statistics, Comparative Analysis, Difficulty Level

An Empirical Investigation of Lu's Method of Reliability Estimation.

Peer reviewed

Huck, Schuyler W.; And Others – Educational and Psychological Measurement, 1981

Believing that examinee-by-item interaction should be conceptualized as true score variability rather than as a result of errors of measurement, Lu proposed a modification of Hoyt's analysis of variance reliability procedure. Via a computer simulation study, it is shown that Lu's approach does not separate interaction from error. (Author/RL)

Descriptors: Analysis of Variance, Comparative Analysis, Computer Programs, Difficulty Level
