ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	8
Since 2006 (last 20 years)	11

Descriptor

Item Response Theory	15
Sample Size	15
Statistical Distributions	15
Bayesian Statistics	6
Statistical Bias	6
Test Items	6
Test Length	6
Ability	5
Models	5
Simulation	5
Error of Measurement	4
Accuracy	3
Computation	3
Estimation (Mathematics)	3
Mathematical Models	3
Mathematics Tests	3
Sampling	3
Statistical Analysis	3
Achievement Tests	2
Comparative Analysis	2
Computer Simulation	2
Difficulty Level	2
Foreign Countries	2
Goodness of Fit	2
Grade 8	2
More ▼

Source

Educational and Psychological…	3
Measurement:…	2
ProQuest LLC	2
ACT, Inc.	1
Applied Psychological…	1
Educational Sciences: Theory…	1
International Journal of…	1
International Journal of…	1

Publication Type

Journal Articles	9
Reports - Research	8
Reports - Evaluative	5
Speeches/Meeting Papers	3
Dissertations/Theses -…	2

Education Level

Grade 8	2
Secondary Education	2
Elementary Education	1
Elementary Secondary Education	1
Junior High Schools	1
Middle Schools	1

Audience

Location

Colombia	1
Indonesia	1
Jordan	1
Peru	1
Qatar	1

Laws, Policies, & Programs

Assessments and Surveys

Program for International…	1
Trends in International…	1

What Works Clearinghouse Rating

Showing all 15 results Save | Export

Item Parameter Recovery: Sensitivity to Prior Distribution

Peer reviewed

Direct link

Christine E. DeMars; Paulius Satkus – Educational and Psychological Measurement, 2024

Marginal maximum likelihood, a common estimation method for item response theory models, is not inherently a Bayesian procedure. However, due to estimation difficulties, Bayesian priors are often applied to the likelihood when estimating 3PL models, especially with small samples. Little focus has been placed on choosing the priors for marginal…

Descriptors: Item Response Theory, Statistical Distributions, Error of Measurement, Bayesian Statistics

Investigating Confidence Intervals of Item Parameters When Some Item Parameters Take Priors in the 2PL and 3PL Models

Peer reviewed

Direct link

Paek, Insu; Lin, Zhongtian; Chalmers, Robert Philip – Educational and Psychological Measurement, 2023

To reduce the chance of Heywood cases or nonconvergence in estimating the 2PL or the 3PL model in the marginal maximum likelihood with the expectation-maximization (MML-EM) estimation method, priors for the item slope parameter in the 2PL model or for the pseudo-guessing parameter in the 3PL model can be used and the marginal maximum a posteriori…

Descriptors: Models, Item Response Theory, Test Items, Intervals

Parameter Estimation Bias of Dichotomous Logistic Item Response Theory Models Using Different Variables

Peer reviewed
PDF on ERIC

Download full text

Köse, Alper; Dogan, C. Deha – International Journal of Evaluation and Research in Education, 2019

The aim of this study was to examine the precision of item parameter estimation in different sample sizes and test lengths under three parameter logistic model (3PL) item response theory (IRT) model, where the trait measured by a test was not normally distributed or had a skewed distribution. In the study, number of categories (1-0), and item…

Descriptors: Statistical Bias, Item Response Theory, Simulation, Accuracy

Estimation of Mixture Rasch Models from Skewed Latent Ability Distributions

Peer reviewed

Direct link

Karadavut, Tugba; Cohen, Allan S.; Kim, Seock-Ho – Measurement: Interdisciplinary Research and Perspectives, 2020

Mixture Rasch (MixRasch) models conventionally assume normal distributions for latent ability. Previous research has shown that the assumption of normality is often unmet in educational and psychological measurement. When normality is assumed, asymmetry in the actual latent ability distribution has been shown to result in extraction of spurious…

Descriptors: Item Response Theory, Ability, Statistical Distributions, Sample Size

An Improved Estimation Using Polya-Gamma Augmentation for Bayesian Structural Equation Models with Dichotomous Variables

Peer reviewed

Direct link

Kim, Seohyun; Lu, Zhenqiu; Cohen, Allan S. – Measurement: Interdisciplinary Research and Perspectives, 2018

Bayesian algorithms have been used successfully in the social and behavioral sciences to analyze dichotomous data particularly with complex structural equation models. In this study, we investigate the use of the Polya-Gamma data augmentation method with Gibbs sampling to improve estimation of structural equation models with dichotomous variables.…

Descriptors: Bayesian Statistics, Structural Equation Models, Computation, Social Science Research

Examination of Polytomous Items' Psychometric Properties According to Nonparametric Item Response Theory Models in Different Test Conditions

Peer reviewed
PDF on ERIC

Download full text

Sengul Avsar, Asiye; Tavsancil, Ezel – Educational Sciences: Theory and Practice, 2017

This study analysed polytomous items' psychometric properties according to nonparametric item response theory (NIRT) models. Thus, simulated datasets--three different test lengths (10, 20 and 30 items), three sample distributions (normal, right and left skewed) and three samples sizes (100, 250 and 500)--were generated by conducting 20…

Descriptors: Test Items, Psychometrics, Nonparametric Statistics, Item Response Theory

Estimating the Nominal Response Model under Nonnormal Conditions

Peer reviewed

Direct link

Preston, Kathleen Suzanne Johnson; Reise, Steven Paul – Educational and Psychological Measurement, 2014

The nominal response model (NRM), a much understudied polytomous item response theory (IRT) model, provides researchers the unique opportunity to evaluate within-item category distinctions. Polytomous IRT models, such as the NRM, are frequently applied to psychological assessments representing constructs that are unlikely to be normally…

Descriptors: Item Response Theory, Computation, Models, Accuracy

Differential Item Functioning for Accommodated Students with Disabilities: Effect of Differences in Proficiency Distributions

Direct link

Quesen, Sarah – ProQuest LLC, 2016

When studying differential item functioning (DIF) with students with disabilities (SWD) focal groups typically suffer from small sample size, whereas the reference group population is usually large. This makes it possible for a researcher to select a sample from the reference population to be similar to the focal group on the ability scale. Doing…

Descriptors: Test Items, Academic Accommodations (Disabilities), Testing Accommodations, Disabilities

Grain Size and Parameter Recovery with TIMSS and the General Diagnostic Model

Peer reviewed

Direct link

Skaggs, Gary; Wilkins, Jesse L. M.; Hein, Serge F. – International Journal of Testing, 2016

The purpose of this study was to explore the degree of grain size of the attributes and the sample sizes that can support accurate parameter recovery with the General Diagnostic Model (GDM) for a large-scale international assessment. In this resampling study, bootstrap samples were obtained from the 2003 Grade 8 TIMSS in Mathematics at varying…

Descriptors: Achievement Tests, Foreign Countries, Elementary Secondary Education, Science Achievement

The Performance of the Linear Logistic Test Model When the Q-Matrix Is Misspecified: A Simulation Study

Direct link

MacDonald, George T. – ProQuest LLC, 2014

A simulation study was conducted to explore the performance of the linear logistic test model (LLTM) when the relationships between items and cognitive components were misspecified. Factors manipulated included percent of misspecification (0%, 1%, 5%, 10%, and 15%), form of misspecification (under-specification, balanced misspecification, and…

Descriptors: Simulation, Item Response Theory, Models, Test Items

Linking Item Parameters to a Base Scale. ACT Research Report Series, 2009-2

Download full text

Kang, Taehoon; Petersen, Nancy S. – ACT, Inc., 2009

This paper compares three methods of item calibration--concurrent calibration, separate calibration with linking, and fixed item parameter calibration--that are frequently used for linking item parameters to a base scale. Concurrent and separate calibrations were implemented using BILOG-MG. The Stocking and Lord (1983) characteristic curve method…

Descriptors: Standards, Testing Programs, Test Items, Statistical Distributions

An Empirical Study of the Effects of Small Datasets and Varying Prior Variances on Item Parameter Estimation in BILOG.

Peer reviewed

Harwell, Michael R.; Janosky, Janine E. – Applied Psychological Measurement, 1991

Investigates the BILOG computer program's ability to recover known item parameters for different numbers of items, examinees, and variances of the prior distributions of discrimination parameters for the two-parameter logistic item-response theory model. For samples of at least 250 examinees and 15 items, simulation results support using BILOG.…

Descriptors: Bayesian Statistics, Computer Simulation, Estimation (Mathematics), Item Response Theory

Quick Norms with Rasch Measurement.

PDF pending restoration

Bush, M. Joan; Schumacker, Randall E. – 1993

The feasibility of quick norms derived by the procedure described by B. D. Wright and M. H. Stone (1979) was investigated. Norming differences between traditionally calculated means and Rasch "quick" means were examined for simulated data sets of varying sample size, test length, and type of distribution. A 5 by 5 by 2 design with a…

Descriptors: Computer Simulation, Item Response Theory, Norm Referenced Tests, Sample Size

An Investigation of Hierarchical Bayes Procedures in Item Response Theory.

Download full text

Kim, Seock-Ho; And Others – 1992

Hierarchical Bayes procedures were compared for estimating item and ability parameters in item response theory. Simulated data sets from the two-parameter logistic model were analyzed using three different hierarchical Bayes procedures: (1) the joint Bayesian with known hyperparameters (JB1); (2) the joint Bayesian with information hyperpriors…

Descriptors: Ability, Bayesian Statistics, Comparative Analysis, Equations (Mathematics)

The Effect of Restricting Ability Distributions in the Estimation of Item Difficulties: Implications for a CAT Implementation.

Download full text

Ito, Kyoko; Sykes, Robert C. – 1994

Responses to previously calibrated items administered in a computerized adaptive testing (CAT) mode may be used to recalibrate the items. This live-data simulation study investigated the possibility, and limitations, of on-line adaptive recalibration of precalibrated items. Responses to items of a Rasch-based paper-and-pencil licensure examination…

Descriptors: Ability, Adaptive Testing, Computer Assisted Testing, Difficulty Level

Cohen, Allan S.	2
Kim, Seock-Ho	2
Bush, M. Joan	1
Chalmers, Robert Philip	1
Christine E. DeMars	1
Dogan, C. Deha	1
Harwell, Michael R.	1
Hein, Serge F.	1
Ito, Kyoko	1
Janosky, Janine E.	1
Kang, Taehoon	1
Karadavut, Tugba	1
Kim, Seohyun	1
Köse, Alper	1
Lin, Zhongtian	1
Lu, Zhenqiu	1
MacDonald, George T.	1
Paek, Insu	1
Paulius Satkus	1
Petersen, Nancy S.	1
Preston, Kathleen Suzanne…	1
Quesen, Sarah	1
Reise, Steven Paul	1
Schumacker, Randall E.	1
More ▼