ERIC - Search Results

Showing all 9 results

Evaluating Six Approaches to Handling Zero-Frequency Scores under Equipercentile Equating

Peer reviewed

Sun, Ting; Kim, Stella Yun – Measurement: Interdisciplinary Research and Perspectives, 2021

In many large testing programs, equipercentile equating has been widely used under a random groups design to adjust test difficulty between forms. However, one thorny issue occurs with equipercentile equating when a particular score has no observed frequency. The purpose of this study is to suggest and evaluate six potential methods in…

Descriptors: Equated Scores, Test Length, Sample Size, Methods

There Are Many Greater Lower Bounds than Cronbach's α: A Monte Carlo Simulation Study

Peer reviewed

Novak, Josip; Rebernjak, Blaž – Measurement: Interdisciplinary Research and Perspectives, 2023

A Monte Carlo simulation study was conducted to examine the performance of α, λ2, λ_4, λ_2, ω_T, GLB_MRFA, and GLB_Algebraic coefficients. Population reliability, distribution shape, sample size, test length, and number of response categories were varied…

Descriptors: Monte Carlo Methods, Evaluation Methods, Reliability, Simulation

A Comparison of Common IRT Model-Selection Methods with Mixed-Format Tests

Peer reviewed

Luo, Yong – Measurement: Interdisciplinary Research and Perspectives, 2021

To date, only frequentist model-selection methods have been studied with mixed-format data in the context of IRT model-selection, and it is unknown how popular Bayesian model-selection methods such as DIC, WAIC, and LOO perform. In this study, we present the results of a comprehensive simulation study that compared the performances of eight…

Descriptors: Item Response Theory, Test Format, Selection, Methods

The Comparison of Estimation Methods for the Four-Parameter Logistic Item Response Theory Model

Peer reviewed

Kalkan, Ömür Kaya – Measurement: Interdisciplinary Research and Perspectives, 2022

The four-parameter logistic (4PL) Item Response Theory (IRT) model has recently been reconsidered in the literature due to the advances in the statistical modeling software and the recent developments in the estimation of the 4PL IRT model parameters. The current simulation study evaluated the performance of expectation-maximization (EM),…

Descriptors: Comparative Analysis, Sample Size, Test Length, Algorithms

Handling Extreme Scores in Vertically Scaled Fixed-Length Computerized Adaptive Tests

Peer reviewed

Wyse, Adam E.; McBride, James R. – Measurement: Interdisciplinary Research and Perspectives, 2022

A common practical challenge is how to assign ability estimates to all incorrect and all correct response patterns when using item response theory (IRT) models and maximum likelihood estimation (MLE) since ability estimates for these types of responses equal -∞ or +∞. This article uses a simulation study and data from an operational K-12…

Descriptors: Scores, Adaptive Testing, Computer Assisted Testing, Test Length

Monte Carlo Simulation in Item Response Theory Applications Using SAS

Peer reviewed

Ames, Allison J.; Leventhal, Brian C.; Ezike, Nnamdi C. – Measurement: Interdisciplinary Research and Perspectives, 2020

Data simulation and Monte Carlo simulation studies are important skills for researchers and practitioners of educational and psychological measurement, but there are few resources on the topic specific to item response theory. Even fewer resources exist on the statistical software techniques to implement simulation studies. This article presents…

Descriptors: Monte Carlo Methods, Item Response Theory, Simulation, Computer Software

Estimation of Mixture Rasch Models from Skewed Latent Ability Distributions

Peer reviewed

Karadavut, Tugba; Cohen, Allan S.; Kim, Seock-Ho – Measurement: Interdisciplinary Research and Perspectives, 2020

Mixture Rasch (MixRasch) models conventionally assume normal distributions for latent ability. Previous research has shown that the assumption of normality is often unmet in educational and psychological measurement. When normality is assumed, asymmetry in the actual latent ability distribution has been shown to result in extraction of spurious…

Descriptors: Item Response Theory, Ability, Statistical Distributions, Sample Size

Attribute-Level Item Selection Method for DCM-CAT

Peer reviewed

Bao, Yu; Bradshaw, Laine – Measurement: Interdisciplinary Research and Perspectives, 2018

Diagnostic classification models (DCMs) can provide multidimensional diagnostic feedback about students' mastery levels of knowledge components or attributes. One advantage of using DCMs is the ability to accurately and reliably classify students into mastery levels with a relatively small number of items per attribute. Combining DCMs with…

Descriptors: Test Items, Selection, Adaptive Testing, Computer Assisted Testing

The Second Century of Ability Testing: Some Predictions and Speculations

Peer reviewed

Embretson, Susan E. – Measurement: Interdisciplinary Research and Perspectives, 2004

The last century was marked by dazzling changes in many areas, such as technology and communications. Predictions into the second century of testing are seemingly difficult in such a context. Yet, looking back to the turn of the last century, Kirkpatrick (1900), in his American Psychological Association presidential address, presented fundamental…

Descriptors: Ability, Testing, Futures (of Society), Psychometrics