Showing all 8 results
Peer reviewed
Finch, Holmes; French, Brian F. – Applied Measurement in Education, 2019
The usefulness of item response theory (IRT) models depends, in large part, on the accuracy of item and person parameter estimates. For the standard 3 parameter logistic model, for example, these parameters include the item parameters of difficulty, discrimination, and pseudo-chance, as well as the person ability parameter. Several factors impact…
Descriptors: Item Response Theory, Accuracy, Test Items, Difficulty Level
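The abstract's 3PL parameters can be illustrated with a minimal response-probability function. This is the standard 3PL formulation, not code from the article itself:

```python
import math

def p_3pl(theta, a, b, c):
    """Probability of a correct response under the 3-parameter logistic model:
    P(theta) = c + (1 - c) / (1 + exp(-a * (theta - b)))
    a: item discrimination, b: item difficulty,
    c: pseudo-chance (lower asymptote), theta: person ability.
    """
    return c + (1.0 - c) / (1.0 + math.exp(-a * (theta - b)))
```

For example, `p_3pl(0.0, 1.0, 0.0, 0.2)` returns 0.6: a person at the item's difficulty answers with probability halfway between the guessing floor c and 1.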
Peer reviewed
Finch, Holmes; Edwards, Julianne M. – Educational and Psychological Measurement, 2016
Standard approaches for estimating item response theory (IRT) model parameters generally work under the assumption that the latent trait being measured by a set of items follows the normal distribution. Estimation of IRT parameters in the presence of nonnormal latent traits has been shown to generate biased person and item parameter estimates. A…
Descriptors: Item Response Theory, Computation, Nonparametric Statistics, Bayesian Statistics
Peer reviewed
Koziol, Natalie A. – Applied Measurement in Education, 2016
Testlets, or groups of related items, are commonly included in educational assessments due to their many logistical and conceptual advantages. Despite their advantages, testlets introduce complications into the theory and practice of educational measurement. Responses to items within a testlet tend to be correlated even after controlling for…
Descriptors: Classification, Accuracy, Comparative Analysis, Models
Peer reviewed
Seo, Dong Gi; Weiss, David J. – Educational and Psychological Measurement, 2013
The usefulness of the l[subscript z] person-fit index was investigated with achievement test data from 20 exams given to more than 3,200 college students. Results for three methods of estimating theta showed that the distributions of l[subscript z] were not consistent with its theoretical distribution, resulting in general overfit to the item response…
Descriptors: Achievement Tests, College Students, Goodness of Fit, Item Response Theory
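The l_z index this abstract evaluates is conventionally computed as a standardized response-pattern log-likelihood (the Drasgow, Levine, and Williams formulation; a sketch, not the article's own code):

```python
import math

def lz_person_fit(responses, probs):
    """Standardized log-likelihood person-fit index:
    l_z = (l_0 - E[l_0]) / sqrt(Var[l_0]),
    where l_0 is the log-likelihood of the scored response pattern
    (responses: 0/1 list) given model-implied probabilities of a
    correct answer (probs) at the person's estimated theta.
    """
    l0 = sum(u * math.log(p) + (1 - u) * math.log(1 - p)
             for u, p in zip(responses, probs))
    expected = sum(p * math.log(p) + (1 - p) * math.log(1 - p) for p in probs)
    variance = sum(p * (1 - p) * math.log(p / (1 - p)) ** 2 for p in probs)
    return (l0 - expected) / math.sqrt(variance)
```

Large negative values flag aberrant (misfitting) response patterns; the abstract's point is that the statistic's empirical distribution deviated from this theoretical standardization.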
Peer reviewed
Jiao, Hong; Wang, Shudong; He, Wei – Journal of Educational Measurement, 2013
This study demonstrated the equivalence between the Rasch testlet model and the three-level one-parameter testlet model and explored the Markov Chain Monte Carlo (MCMC) method for model parameter estimation in WINBUGS. The estimation accuracy from the MCMC method was compared with those from the marginalized maximum likelihood estimation (MMLE)…
Descriptors: Computation, Item Response Theory, Models, Monte Carlo Methods
Peer reviewed
Jansen, Margo G. H. – Journal of Educational Statistics, 1986
In this paper a Bayesian procedure is developed for the simultaneous estimation of the reading ability and difficulty parameters which are assumed to be factors in reading errors by the multiplicative Poisson Model. According to several criteria, the Bayesian estimates are better than comparable maximum likelihood estimates. (Author/JAZ)
Descriptors: Achievement Tests, Bayesian Statistics, Comparative Analysis, Difficulty Level
Owston, Ronald D. – 1979
The development of a probabilistic model for validating Gagne's learning hierarchies is described. Learning hierarchies are defined as paired networks of intellectual tasks arranged so that a substantial amount of positive transfer occurs from tasks in a lower position to connected ones in a higher position. This probabilistic validation technique…
Descriptors: Associative Learning, Classification, Difficulty Level, Mathematical Models
Patience, Wayne M.; Reckase, Mark D. – 1979
An experiment was performed with computer-generated data to investigate some of the operational characteristics of tailored testing as they are related to various provisions of the computer program and item pool. With respect to the computer program, two characteristics were varied: the size of the step of increase or decrease in item difficulty…
Descriptors: Adaptive Testing, Computer Assisted Testing, Difficulty Level, Error of Measurement
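The fixed-step difficulty rule this abstract varies can be sketched as follows. All names are hypothetical and the rule is a generic up-and-down tailored-testing scheme, not the study's actual program:

```python
def tailored_test(item_bank, answer_fn, start_b=0.0, step=0.5, n_items=10):
    """Minimal fixed-step tailored-testing sketch.
    item_bank: list of distinct item difficulty values.
    answer_fn(b) -> bool: whether the examinee answers an item of
    difficulty b correctly. After a correct answer the target difficulty
    rises by `step`; after an incorrect answer it falls by `step` --
    the step-size characteristic manipulated in the study.
    """
    target = start_b
    administered = []
    for _ in range(n_items):
        # Administer the unused item whose difficulty is closest to the target.
        b = min((d for d in item_bank if d not in administered),
                key=lambda d: abs(d - target))
        administered.append(b)
        target += step if answer_fn(b) else -step
    return administered
```

With a dense item pool and an examinee who answers everything correctly, the administered difficulties climb by one step per item, illustrating how step size and pool coverage jointly bound how quickly the test can adapt.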