ERIC - Search Results

Showing all 10 results

Bayesian Estimation of Multidimensional Item Response Models. A Comparison of Analytic and Simulation Algorithms

Peer reviewed

Martin-Fernandez, Manuel; Revuelta, Javier – Psicologica: International Journal of Methodology and Experimental Psychology, 2017

This study compares the performance of two recently adopted estimation algorithms, the Metropolis-Hastings Robbins-Monro (MHRM) and the Hamiltonian MCMC (HMC), with two algorithms well established in the psychometric literature, the marginal likelihood via EM algorithm (MML-EM) and the Markov chain Monte Carlo (MCMC), in the estimation of multidimensional…

Descriptors: Bayesian Statistics, Item Response Theory, Models, Comparative Analysis

Reweighting Data in the Spirit of Tukey: Using Bayesian Posterior Probabilities as Rasch Residuals for Studying Misfit

Peer reviewed

Dardick, William R.; Mislevy, Robert J. – Educational and Psychological Measurement, 2016

A new variant of the iterative "data = fit + residual" data-analytical approach described by Mosteller and Tukey is proposed and implemented in the context of item response theory psychometric models. Posterior probabilities from a Bayesian mixture model of a Rasch item response theory model and an unscalable latent class are expressed…

Descriptors: Bayesian Statistics, Probability, Data Analysis, Item Response Theory

Parameter Recovery and Classification Accuracy under Conditions of Testlet Dependency: A Comparison of the Traditional 2PL, Testlet, and Bi-Factor Models

Peer reviewed

Koziol, Natalie A. – Applied Measurement in Education, 2016

Testlets, or groups of related items, are commonly included in educational assessments due to their many logistical and conceptual advantages. Despite their advantages, testlets introduce complications into the theory and practice of educational measurement. Responses to items within a testlet tend to be correlated even after controlling for…

Descriptors: Classification, Accuracy, Comparative Analysis, Models

Termination Criteria for Computerized Classification Testing

Peer reviewed

Thompson, Nathan A. – Practical Assessment, Research & Evaluation, 2011

Computerized classification testing (CCT) is an approach to designing tests with intelligent algorithms, similar to adaptive testing, but specifically designed for the purpose of classifying examinees into categories such as "pass" and "fail." Like adaptive testing for point estimation of ability, the key component is the…

Descriptors: Adaptive Testing, Computer Assisted Testing, Classification, Probability

Comparison of IRT Likelihood Ratio Test and Logistic Regression DIF Detection Procedures

Peer reviewed

Atar, Burcu; Kamata, Akihito – Hacettepe University Journal of Education, 2011

The Type I error rates and the power of IRT likelihood ratio test and cumulative logit ordinal logistic regression procedures in detecting differential item functioning (DIF) for polytomously scored items were investigated in this Monte Carlo simulation study. For this purpose, 54 simulation conditions (combinations of 3 sample sizes, 2 sample…

Descriptors: Test Bias, Sample Size, Monte Carlo Methods, Item Response Theory

Computerized Classification Testing under Practical Constraints with a Polytomous Model.

Lau, C. Allen; Wang, Tianyou – 1999

A study was conducted to extend the sequential probability ratio testing (SPRT) procedure with the polytomous model under some practical constraints in computerized classification testing (CCT), such as methods to control item exposure rate, and to study the effects of other variables, including item information algorithms, test difficulties, item…

Descriptors: Algorithms, Computer Assisted Testing, Difficulty Level, Item Banks

A Subdividing Method for Generalizability Theory: Precision of Measurement Errors and Patterns of Missing Data.

Chiu, Christopher W. T. – 2000

A procedure was developed to analyze data with missing observations by extracting data from a sparsely filled data matrix into analyzable smaller subsets of data. This subdividing method, based on the conceptual framework of meta-analysis, was accomplished by creating data sets that exhibit structural designs and then pooling variance components…

Descriptors: Difficulty Level, Error of Measurement, Generalizability Theory, Interrater Reliability

A Monte Carlo Comparison of Item and Person Statistics Based on Item Response Theory versus Classical Test Theory.

Peer reviewed

MacDonald, Paul; Paunonen, Sampo V. – Educational and Psychological Measurement, 2002

Examined the behavior of item and person statistics from item response theory and classical test theory frameworks through Monte Carlo methods with simulated test data. Findings suggest that item difficulty and person ability estimates are highly comparable for both approaches. (SLD)

Descriptors: Ability, Comparative Analysis, Difficulty Level, Item Response Theory

Impact of Changing Difficulty on Inferences from the National Assessment of Educational Progress.

Peer reviewed

Cohen, Jon; Snow, Stephanie – Journal of Educational Measurement, 2002

Studied the impact of changes in item difficulty on National Assessment of Educational Progress (NAEP) estimates over time through a Monte Carlo study that simulated the responses of 1990 NAEP mathematics respondents to 1990 and 1996 NAEP mathematics items. Results support the idea that these changes have not affected the NAEP trend line.…

Descriptors: Change, Difficulty Level, Estimation (Mathematics), Mathematics Tests

Dimensionality Assessment for Dichotomously Scored Items Using Multidimensional Scaling.

Jones, Patricia B.; And Others – 1987

In order to determine the effectiveness of multidimensional scaling (MDS) in recovering the dimensionality of a set of dichotomously-scored items, data were simulated in one, two, and three dimensions for a variety of correlations with the underlying latent trait. Similarity matrices were constructed from these data using three margin-sensitive…

Descriptors: Cluster Analysis, Correlation, Difficulty Level, Error of Measurement