ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	3
Since 2016 (last 10 years)	5
Since 2006 (last 20 years)	8

Descriptor

Statistical Distributions	12
Test Items	12
Item Response Theory	9
Computation	5
Goodness of Fit	5
Monte Carlo Methods	4
Statistical Analysis	4
Difficulty Level	3
Mathematical Models	3
Models	3
Power (Statistics)	3
Ability	2
Achievement Tests	2
Bayesian Statistics	2
Computer Simulation	2
Equations (Mathematics)	2
Error of Measurement	2
Item Analysis	2
Item Bias	2
Maximum Likelihood Statistics	2
Sample Size	2
Simulation	2
Statistical Bias	2
Adaptive Testing	1
Change	1
More ▼

Source

Educational and Psychological…

Publication Type

Journal Articles	12
Reports - Research	8
Reports - Evaluative	4
Speeches/Meeting Papers	2

Education Level

Elementary Education	1
Grade 7	1
Higher Education	1
Junior High Schools	1
Middle Schools	1
Postsecondary Education	1
Secondary Education	1

Audience

Location

South Korea

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 12 results Save | Export

Use of the Lagrange Multiplier Test for Assessing Measurement Invariance under Model Misspecification

Peer reviewed

Direct link

Guastadisegni, Lucia; Cagnone, Silvia; Moustaki, Irini; Vasdekis, Vassilis – Educational and Psychological Measurement, 2022

This article studies the Type I error, false positive rates, and power of four versions of the Lagrange multiplier test to detect measurement noninvariance in item response theory (IRT) models for binary data under model misspecification. The tests considered are the Lagrange multiplier test computed with the Hessian and cross-product approach,…

Descriptors: Measurement, Statistical Analysis, Item Response Theory, Test Items

A Robust Method for Detecting Item Misfit in Large-Scale Assessments

Peer reviewed

Direct link

von Davier, Matthias; Bezirhan, Ummugul – Educational and Psychological Measurement, 2023

Viable methods for the identification of item misfit or Differential Item Functioning (DIF) are central to scale construction and sound measurement. Many approaches rely on the derivation of a limiting distribution under the assumption that a certain model fits the data perfectly. Typical DIF assumptions such as the monotonicity and population…

Descriptors: Robustness (Statistics), Test Items, Item Analysis, Goodness of Fit

Investigating Confidence Intervals of Item Parameters When Some Item Parameters Take Priors in the 2PL and 3PL Models

Peer reviewed

Direct link

Paek, Insu; Lin, Zhongtian; Chalmers, Robert Philip – Educational and Psychological Measurement, 2023

To reduce the chance of Heywood cases or nonconvergence in estimating the 2PL or the 3PL model in the marginal maximum likelihood with the expectation-maximization (MML-EM) estimation method, priors for the item slope parameter in the 2PL model or for the pseudo-guessing parameter in the 3PL model can be used and the marginal maximum a posteriori…

Descriptors: Models, Item Response Theory, Test Items, Intervals

The Role of Item Distributions on Reliability Estimation: The Case of Cronbach's Coefficient Alpha

Peer reviewed

Direct link

Olvera Astivia, Oscar Lorenzo; Kroc, Edward; Zumbo, Bruno D. – Educational and Psychological Measurement, 2020

Simulations concerning the distributional assumptions of coefficient alpha are contradictory. To provide a more principled theoretical framework, this article relies on the Fréchet-Hoeffding bounds, in order to showcase that the distribution of the items play a role on the estimation of correlations and covariances. More specifically, these bounds…

Descriptors: Test Items, Test Reliability, Computation, Correlation

Rasch Model Parameter Estimation in the Presence of a Nonnormal Latent Trait Using a Nonparametric Bayesian Approach

Peer reviewed

Direct link

Finch, Holmes; Edwards, Julianne M. – Educational and Psychological Measurement, 2016

Standard approaches for estimating item response theory (IRT) model parameters generally work under the assumption that the latent trait being measured by a set of items follows the normal distribution. Estimation of IRT parameters in the presence of nonnormal latent traits has been shown to generate biased person and item parameter estimates. A…

Descriptors: Item Response Theory, Computation, Nonparametric Statistics, Bayesian Statistics

Coefficient Omega Bootstrap Confidence Intervals: Nonnormal Distributions

Peer reviewed

Direct link

Padilla, Miguel A.; Divers, Jasmin – Educational and Psychological Measurement, 2013

The performance of the normal theory bootstrap (NTB), the percentile bootstrap (PB), and the bias-corrected and accelerated (BCa) bootstrap confidence intervals (CIs) for coefficient omega was assessed through a Monte Carlo simulation under conditions not previously investigated. Of particular interests were nonnormal Likert-type and binary items.…

Descriptors: Sampling, Statistical Inference, Computation, Statistical Analysis

A Comparison of Three IRT Approaches to Examinee Ability Change Modeling in a Single-Group Anchor Test Design

Peer reviewed

Direct link

Paek, Insu; Park, Hyun-Jeong; Cai, Li; Chi, Eunlim – Educational and Psychological Measurement, 2014

Typically a longitudinal growth modeling based on item response theory (IRT) requires repeated measures data from a single group with the same test design. If operational or item exposure problems are present, the same test may not be employed to collect data for longitudinal analyses and tests at multiple time points are constructed with unique…

Descriptors: Item Response Theory, Comparative Analysis, Test Items, Equated Scores

l[subscript z] Person-Fit Index to Identify Misfit Students with Achievement Test Data

Peer reviewed

Direct link

Seo, Dong Gi; Weiss, David J. – Educational and Psychological Measurement, 2013

The usefulness of the l[subscript z] person-fit index was investigated with achievement test data from 20 exams given to more than 3,200 college students. Results for three methods of estimating ? showed that the distributions of l[subscript z] were not consistent with its theoretical distribution, resulting in general overfit to the item response…

Descriptors: Achievement Tests, College Students, Goodness of Fit, Item Response Theory

The Distributional Properties of Rasch Item Fit Statistics.

Peer reviewed

Smith, Richard M. – Educational and Psychological Measurement, 1991

This study reports results of an investigation based on simulated data of the distributional properties of the item fit statistics that are commonly used in the Rasch model calibration programs as indices of the fit of responses to individual items to the measurement model. (SLD)

Descriptors: Computer Simulation, Equations (Mathematics), Goodness of Fit, Item Response Theory

Detecting Item Bias in the Rasch Rating Scale Model.

Peer reviewed

Smith, Richard M. – Educational and Psychological Measurement, 1994

Simulated data are used to assess the appropriateness of using separate calibration and between-fit approaches to detecting item bias in the Rasch rating scale model. Results indicate that Type I error rates for the null distribution hold even when there are different ability levels for reference and focal groups. (SLD)

Descriptors: Ability, Goodness of Fit, Identification, Item Bias

Computerized Adaptive Testing Using the Partial Credit Model: Effects of Item Pool Characteristics and Different Stopping Rules.

Peer reviewed

Dodd, Barbara G.; And Others – Educational and Psychological Measurement, 1993

Effects of the following variables on performance of computerized adaptive testing (CAT) procedures for the partial credit model (PCM) were studied: (1) stopping rule for terminating CAT; (2) item pool size; and (3) distribution of item difficulties. Implications of findings for CAT systems based on the PCM are discussed. (SLD)

Descriptors: Adaptive Testing, Computer Assisted Testing, Computer Simulation, Difficulty Level

Identifying Negatively Discriminating Items When Test Scores Are Not Normally Distributed.

Peer reviewed

Fowler, Robert L.; Clingman, Joy M. – Educational and Psychological Measurement, 1992

Monte Carlo techniques are used to examine the power of the "B" statistic of R. L. Brennan (1972) to detect negatively discriminating items drawn from a variety of nonnormal population distributions. A simplified procedure is offered for conducting an item-discrimination analysis on typical classroom objective tests. (SLD)

Descriptors: Classroom Techniques, Elementary Secondary Education, Equations (Mathematics), Item Analysis

Paek, Insu	2
Smith, Richard M.	2
Bezirhan, Ummugul	1
Cagnone, Silvia	1
Cai, Li	1
Chalmers, Robert Philip	1
Chi, Eunlim	1
Clingman, Joy M.	1
Divers, Jasmin	1
Dodd, Barbara G.	1
Edwards, Julianne M.	1
Finch, Holmes	1
Fowler, Robert L.	1
Guastadisegni, Lucia	1
Kroc, Edward	1
Lin, Zhongtian	1
Moustaki, Irini	1
Olvera Astivia, Oscar Lorenzo	1
Padilla, Miguel A.	1
Park, Hyun-Jeong	1
Seo, Dong Gi	1
Vasdekis, Vassilis	1
Weiss, David J.	1
Zumbo, Bruno D.	1
More ▼