ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	10
Since 2006 (last 20 years)	26

Descriptor

Comparative Analysis	28
Markov Processes	28
Monte Carlo Methods	28
Item Response Theory	19
Computation	13
Bayesian Statistics	12
Models	10
Maximum Likelihood Statistics	9
Accuracy	6
Computer Software	6
Hierarchical Linear Modeling	6
Statistical Analysis	6
Correlation	5
Regression (Statistics)	5
Sample Size	5
Test Items	5
Difficulty Level	4
Foreign Countries	4
Measurement	4
Simulation	4
Test Length	4
Effect Size	3
Error of Measurement	3
Hypothesis Testing	3
Intervention	3
More ▼

Source

ProQuest LLC	5
Journal of Educational and…	4
ETS Research Report Series	3
Educational and Psychological…	3
Journal of Educational…	3
Society for Research on…	2
Applied Measurement in…	1
Applied Psychological…	1
Cognitive Science	1
Educational Psychology	1
International Journal of…	1
Psicologica: International…	1
Remedial and Special Education	1
School Effectiveness and…	1
More ▼

Publication Type

Journal Articles	21
Reports - Research	19
Dissertations/Theses -…	5
Reports - Evaluative	4

Education Level

Elementary Education	3
Middle Schools	3
Secondary Education	3
Grade 4	2
Grade 8	2
Intermediate Grades	2
Junior High Schools	2
Grade 5	1
Grade 6	1
Grade 7	1
Higher Education	1
Postsecondary Education	1
More ▼

Audience

Location

Hong Kong	2
Australia	1
Austria	1
Belgium	1
California	1
Colorado	1
El Salvador	1
Florida	1
Illinois	1
Kansas	1
New York	1
North Carolina	1
Qatar	1
Singapore	1
Slovakia	1
South Korea	1
Tennessee	1
Texas	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

National Assessment of…	2
Iowa Tests of Basic Skills	1
Program for International…	1
Test of English as a Foreign…	1
Trends in International…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 28 results Save | Export

Cognitive Diagnosis Testlet Model for Multiple-Choice Items

Peer reviewed

Direct link

Lei Guo; Wenjie Zhou; Xiao Li – Journal of Educational and Behavioral Statistics, 2024

The testlet design is very popular in educational and psychological assessments. This article proposes a new cognitive diagnosis model, the multiple-choice cognitive diagnostic testlet (MC-CDT) model for tests using testlets consisting of MC items. The MC-CDT model uses the original examinees' responses to MC items instead of dichotomously scored…

Descriptors: Multiple Choice Tests, Diagnostic Tests, Accuracy, Computer Software

Subjective Priors for Item Response Models: Application of Elicitation by Design

Peer reviewed

Direct link

Ames, Allison; Smith, Elizabeth – Journal of Educational Measurement, 2018

Bayesian methods incorporate model parameter information prior to data collection. Eliciting information from content experts is an option, but has seen little implementation in Bayesian item response theory (IRT) modeling. This study aims to use ethical reasoning content experts to elicit prior information and incorporate this information into…

Descriptors: Item Response Theory, Bayesian Statistics, Ethics, Specialists

Comparative Analyses of MIRT Models and Software (BMIRT and flexMIRT)

Peer reviewed

Direct link

Yavuz, Guler; Hambleton, Ronald K. – Educational and Psychological Measurement, 2017

Application of MIRT modeling procedures is dependent on the quality of parameter estimates provided by the estimation software and techniques used. This study investigated model parameter recovery of two popular MIRT packages, BMIRT and flexMIRT, under some common measurement conditions. These packages were specifically selected to investigate the…

Descriptors: Item Response Theory, Models, Comparative Analysis, Computer Software

Using the Stan Program for Bayesian Item Response Theory

Peer reviewed

Direct link

Luo, Yong; Jiao, Hong – Educational and Psychological Measurement, 2018

Stan is a new Bayesian statistical software program that implements the powerful and efficient Hamiltonian Monte Carlo (HMC) algorithm. To date there is not a source that systematically provides Stan code for various item response theory (IRT) models. This article provides Stan code for three representative IRT models, including the…

Descriptors: Bayesian Statistics, Item Response Theory, Probability, Computer Software

Bayesian Estimation of Multidimensional Item Response Models. A Comparison of Analytic and Simulation Algorithms

Peer reviewed
PDF on ERIC

Download full text

Martin-Fernandez, Manuel; Revuelta, Javier – Psicologica: International Journal of Methodology and Experimental Psychology, 2017

This study compares the performance of two estimation algorithms of new usage, the Metropolis-Hastings Robins-Monro (MHRM) and the Hamiltonian MCMC (HMC), with two consolidated algorithms in the psychometric literature, the marginal likelihood via EM algorithm (MML-EM) and the Markov chain Monte Carlo (MCMC), in the estimation of multidimensional…

Descriptors: Bayesian Statistics, Item Response Theory, Models, Comparative Analysis

Lord's Wald Test for Detecting Dif in Multidimensional Irt Models: A Comparison of Two Estimation Approaches

Peer reviewed

Direct link

Lee, Soo; Suh, Youngsuk – Journal of Educational Measurement, 2018

Lord's Wald test for differential item functioning (DIF) has not been studied extensively in the context of the multidimensional item response theory (MIRT) framework. In this article, Lord's Wald test was implemented using two estimation approaches, marginal maximum likelihood estimation and Bayesian Markov chain Monte Carlo estimation, to detect…

Descriptors: Item Response Theory, Sample Size, Models, Error of Measurement

Using Data-Dependent Priors to Mitigate Small Sample Bias in Latent Growth Models: A Discussion and Illustration Using M"plus"

Peer reviewed

Direct link

McNeish, Daniel M. – Journal of Educational and Behavioral Statistics, 2016

Mixed-effects models (MEMs) and latent growth models (LGMs) are often considered interchangeable save the discipline-specific nomenclature. Software implementations of these models, however, are not interchangeable, particularly with small sample sizes. Restricted maximum likelihood estimation that mitigates small sample bias in MEMs has not been…

Descriptors: Models, Statistical Analysis, Hierarchical Linear Modeling, Sample Size

Bayesian Estimation of Multi-Unidimensional Graded Response IRT Models

Direct link

Kuo, Tzu-Chun – ProQuest LLC, 2015

Item response theory (IRT) has gained an increasing popularity in large-scale educational and psychological testing situations because of its theoretical advantages over classical test theory. Unidimensional graded response models (GRMs) are useful when polytomous response items are designed to measure a unified latent trait. They are limited in…

Descriptors: Item Response Theory, Bayesian Statistics, Computation, Models

Comparing Three Estimation Methods for the Three-Parameter Logistic IRT Model

Direct link

Lamsal, Sunil – ProQuest LLC, 2015

Different estimation procedures have been developed for the unidimensional three-parameter item response theory (IRT) model. These techniques include the marginal maximum likelihood estimation, the fully Bayesian estimation using Markov chain Monte Carlo simulation techniques, and the Metropolis-Hastings Robbin-Monro estimation. With each…

Descriptors: Item Response Theory, Monte Carlo Methods, Maximum Likelihood Statistics, Markov Processes

Accuracy and Variability of Item Parameter Estimates from Marginal Maximum a Posteriori Estimation and Bayesian Inference via Gibbs Samplers

Direct link

Wu, Yi-Fang – ProQuest LLC, 2015

Item response theory (IRT) uses a family of statistical models for estimating stable characteristics of items and examinees and defining how these characteristics interact in describing item and test performance. With a focus on the three-parameter logistic IRT (Birnbaum, 1968; Lord, 1980) model, the current study examines the accuracy and…

Descriptors: Item Response Theory, Test Items, Accuracy, Computation

Differential Item Functioning Analysis Using a Mixture 3-Parameter Logistic Model with a Covariate on the TIMSS 2007 Mathematics Test

Peer reviewed

Direct link

Choi, Youn-Jeng; Alexeev, Natalia; Cohen, Allan S. – International Journal of Testing, 2015

The purpose of this study was to explore what may be contributing to differences in performance in mathematics on the Trends in International Mathematics and Science Study 2007. This was done by using a mixture item response theory modeling approach to first detect latent classes in the data and then to examine differences in performance on items…

Descriptors: Test Bias, Mathematics Achievement, Mathematics Tests, Item Response Theory

Regional Inequality in Reading Performance: An Exploration in Belgium

Peer reviewed

Direct link

Ning, Bo; Van Damme, Jan; Van Den Noortgate, Wim; Gielen, Sarah; Bellens, Kim; Dupriez, Vincent; Dumay, Xavier – School Effectiveness and School Improvement, 2016

In the 2009 Programme for International Student Assessment, the Flemish community of Belgium outscored its French community in reading, with low achievers accounting for a large proportion of the score gaps. In this study, between-community comparisons based on the Blinder-Oaxaca decomposition method showed that the Flemish community benefits…

Descriptors: Reading Instruction, Reading Strategies, Reading Skills, Foreign Countries

Testing the Efficiency of Markov Chain Monte Carlo with People Using Facial Affect Categories

Peer reviewed

Direct link

Martin, Jay B.; Griffiths, Thomas L.; Sanborn, Adam N. – Cognitive Science, 2012

Exploring how people represent natural categories is a key step toward developing a better understanding of how people learn, form memories, and make decisions. Much research on categorization has focused on artificial categories that are created in the laboratory, since studying natural categories defined on high-dimensional stimuli such as…

Descriptors: Markov Processes, Monte Carlo Methods, Correlation, Efficiency

Diagnosing University Students' Academic Writing in English: Is Cognitive Diagnostic Modelling the Way Forward?

Peer reviewed

Direct link

Xie, Qin – Educational Psychology, 2017

The study utilised a fine-grained diagnostic checklist to assess first-year undergraduates in Hong Kong and evaluated its validity and usefulness for diagnosing academic writing in English. Ten English language instructors marked 472 academic essays with the checklist. They also agreed on a Q-matrix, which specified the relationships among the…

Descriptors: Academic Discourse, College Students, College English, Foreign Countries

Effect Size Measure and Analysis of Single Subject Designs

Peer reviewed
PDF on ERIC

Download full text

Society for Research on Educational Effectiveness, 2013

One of the vexing problems in the analysis of SSD is in the assessment of the effect of intervention. Serial dependence notwithstanding, the linear model approach that has been advanced involves, in general, the fitting of regression lines (or curves) to the set of observations within each phase of the design and comparing the parameters of these…

Descriptors: Research Design, Effect Size, Intervention, Statistical Analysis

Previous Page | Next Page »

Pages: 1 | 2

Jiao, Hong	2
Alexeev, Natalia	1
Ames, Allison	1
Bellens, Kim	1
Beretvas, S. Natasha	1
Choi, Youn-Jeng	1
Cohen, Allan S.	1
Deping, Li	1
Draper, David	1
Dumay, Xavier	1
Dupriez, Vincent	1
Espelage, Dorothy L.	1
Gielen, Sarah	1
Granberg-Rademacker, J. Scott	1
Griffiths, Thomas L.	1
Hambleton, Ronald K.	1
He, Wei	1
Horner, Robert H.	1
Jenkins, Frank	1
Jeon, Minjeong	1
Johnson, Matthew S.	1
Kieftenbeld, Vincent	1
Kuo, Tzu-Chun	1
Lamsal, Sunil	1
Lee, Soo	1
More ▼