ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	0
Since 2006 (last 20 years)	3

Descriptor

Bayesian Statistics	13
Test Length	13
Estimation (Mathematics)	7
Item Response Theory	7
Simulation	7
Ability	5
Adaptive Testing	5
Computation	4
Computer Assisted Testing	4
Mathematical Models	4
Maximum Likelihood Statistics	4
Probability	4
Monte Carlo Methods	3
Sample Size	3
Test Items	3
Classification	2
Comparative Analysis	2
Computer Simulation	2
Error of Measurement	2
Markov Processes	2
Mastery Tests	2
Statistical Distributions	2
True Scores	2
Academic Ability	1
Computer Software	1
More ▼

Source

Applied Psychological…	5
Educational and Psychological…	1
Psychometrika	1

Publication Type

Reports - Evaluative	13
Journal Articles	7
Speeches/Meeting Papers	3
Reports - Research	1

Education Level

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

COMPASS (Computer Assisted…

What Works Clearinghouse Rating

Showing all 13 results Save | Export

Recovery of Graded Response Model Parameters: A Comparison of Marginal Maximum Likelihood and Markov Chain Monte Carlo Estimation

Peer reviewed

Direct link

Kieftenbeld, Vincent; Natesan, Prathiba – Applied Psychological Measurement, 2012

Markov chain Monte Carlo (MCMC) methods enable a fully Bayesian approach to parameter estimation of item response models. In this simulation study, the authors compared the recovery of graded response model parameters using marginal maximum likelihood (MML) and Gibbs sampling (MCMC) under various latent trait distributions, test lengths, and…

Descriptors: Test Length, Markov Processes, Item Response Theory, Monte Carlo Methods

Variations on Stochastic Curtailment in Sequential Mastery Testing

Peer reviewed

Direct link

Finkelman, Matthew David – Applied Psychological Measurement, 2010

In sequential mastery testing (SMT), assessment via computer is used to classify examinees into one of two mutually exclusive categories. Unlike paper-and-pencil tests, SMT has the capability to use variable-length stopping rules. One approach to shortening variable-length tests is stochastic curtailment, which halts examination if the probability…

Descriptors: Mastery Tests, Computer Assisted Testing, Adaptive Testing, Test Length

Simultaneous Estimation of Overall and Domain Abilities: A Higher-Order IRT Model Approach

Peer reviewed

Direct link

de la Torre, Jimmy; Song, Hao – Applied Psychological Measurement, 2009

Assessments consisting of different domains (e.g., content areas, objectives) are typically multidimensional in nature but are commonly assumed to be unidimensional for estimation purposes. The different domains of these assessments are further treated as multi-unidimensional tests for the purpose of obtaining diagnostic information. However, when…

Descriptors: Ability, Tests, Item Response Theory, Data Analysis

A Consideration for Variable Length Adaptive Tests.

Download full text

Wingersky, Marilyn S. – 1989

In a variable-length adaptive test with a stopping rule that relied on the asymptotic standard error of measurement of the examinee's estimated true score, M. S. Stocking (1987) discovered that it was sufficient to know the examinee's true score and the number of items administered to predict with some accuracy whether an examinee's true score was…

Descriptors: Adaptive Testing, Bayesian Statistics, Error of Measurement, Estimation (Mathematics)

Determining Test Length to Control for False-Positive and False-Negative Error Rates on Criterion-Referenced Test.

Wilcox, Rand R. – 1980

Concern about passing those examinees who should pass, and retaining those who need remedial work, is one problem related to criterion-referenced testing. This paper deals with one aspect of that problem. When determining how many items to include on a criterion-referenced test, practitioners must resolve various non-statistical issues before a…

Descriptors: Bayesian Statistics, Criterion Referenced Tests, Latent Trait Theory, Mathematical Models

The Effect of Person Misfit on Classification Decisions

Peer reviewed

Direct link

Hendrawan, Irene; Glas, Cees A. W.; Meijer, Rob R. – Applied Psychological Measurement, 2005

The effect of person misfit to an item response theory model on a mastery/nonmastery decision was investigated. Furthermore, it was investigated whether the classification precision can be improved by identifying misfitting respondents using person-fit statistics. A simulation study was conducted to investigate the probability of a correct…

Descriptors: Probability, Statistics, Test Length, Simulation

An Investigation of Hierarchical Bayes Procedures in Item Response Theory.

Peer reviewed

Kim, Seock-Ho; And Others – Psychometrika, 1994

Hierarchical Bayes procedures for the two-parameter logistic item response model were compared for estimating item and ability parameters through two joint and two marginal Bayesian procedures. Marginal procedures yielded smaller root mean square differences for item and ability, but results for larger sample size and test length were similar.…

Descriptors: Ability, Bayesian Statistics, Computer Simulation, Estimation (Mathematics)

Reducing Bias in CAT Trait Estimation: A Comparison of Approaches.

Peer reviewed

Wang, Tianyou; Hanson, Bradley A.; Lau, Che-Ming A. – Applied Psychological Measurement, 1999

Extended the use of a beta prior in trait estimation to the maximum expected a posteriori (MAP) method of Bayesian estimation. This new method, essentially unbiased MAP, was compared with MAP, essentially unbiased expected a posteriori, weighted likelihood, and maximum-likelihood estimation methods. The new method significantly reduced bias in…

Descriptors: Adaptive Testing, Bayesian Statistics, Computer Assisted Testing, Estimation (Mathematics)

Comparing BILOG and LOGIST Estimates for Normal, Truncated Normal, and Beta Ability Distributions.

Download full text

Abdel-fattah, Abdel-fattah A. – 1994

The accuracy of estimation procedures in item response theory was studied using Monte Carlo methods and varying sample size, number of subjects, and distribution of ability parameters for: (1) joint maximum likelihood as implemented in the computer program LOGIST; (2) marginal maximum likelihood; and (3) marginal Bayesian procedures as implemented…

Descriptors: Ability, Bayesian Statistics, Estimation (Mathematics), Maximum Likelihood Statistics

A Bayesian Method for Evaluating Trainee Proficiency. Technical Paper 323.

Download full text

Epstein, Kenneth I.; Steinheiser, Frederick H., Jr. – 1978

A multiparameter, programmable model was developed to examine the interactive influence of certain parameters on the probability of deciding that an examinee had attained a specified degree of mastery. It was applied within the simulated context of performance testing of military trainees. These parameters included: (1) the number of assumed…

Descriptors: Academic Ability, Bayesian Statistics, Cutting Scores, Hypothesis Testing

An Investigation of Hierarchical Bayes Procedures in Item Response Theory.

Download full text

Kim, Seock-Ho; And Others – 1992

Hierarchical Bayes procedures were compared for estimating item and ability parameters in item response theory. Simulated data sets from the two-parameter logistic model were analyzed using three different hierarchical Bayes procedures: (1) the joint Bayesian with known hyperparameters (JB1); (2) the joint Bayesian with information hyperpriors…

Descriptors: Ability, Bayesian Statistics, Comparative Analysis, Equations (Mathematics)

The Influence of Dimensionality on CAT Ability Estimation.

Peer reviewed

De Ayala, R. J. – Educational and Psychological Measurement, 1992

Effects of dimensionality on ability estimation of an adaptive test were examined using generated data in Bayesian computerized adaptive testing (CAT) simulations. Generally, increasing interdimensional difficulty association produced a slight decrease in test length and an increase in accuracy of ability estimation as assessed by root mean square…

Descriptors: Adaptive Testing, Bayesian Statistics, Computer Assisted Testing, Computer Simulation

The Selection of Test Items for Decision Making with a Computer Adaptive Test.

Download full text

Spray, Judith A.; Reckase, Mark D. – 1994

The issue of test-item selection in support of decision making in adaptive testing is considered. The number of items needed to make a decision is compared for two approaches: selecting items from an item pool that are most informative at the decision point or selecting items that are most informative at the examinee's ability level. The first…

Descriptors: Ability, Adaptive Testing, Bayesian Statistics, Computer Assisted Testing

Kim, Seock-Ho	2
Abdel-fattah, Abdel-fattah A.	1
De Ayala, R. J.	1
Epstein, Kenneth I.	1
Finkelman, Matthew David	1
Glas, Cees A. W.	1
Hanson, Bradley A.	1
Hendrawan, Irene	1
Kieftenbeld, Vincent	1
Lau, Che-Ming A.	1
Meijer, Rob R.	1
Natesan, Prathiba	1
Reckase, Mark D.	1
Song, Hao	1
Spray, Judith A.	1
Steinheiser, Frederick H., Jr.	1
Wang, Tianyou	1
Wilcox, Rand R.	1
Wingersky, Marilyn S.	1
de la Torre, Jimmy	1
More ▼