ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	4
Since 2016 (last 10 years)	6
Since 2006 (last 20 years)	10

Descriptor

Error of Measurement	13
Item Response Theory	13
Statistical Distributions	13
Models	6
Computation	5
Goodness of Fit	5
Probability	4
Sample Size	4
Test Items	4
Ability	3
Bayesian Statistics	3
Simulation	3
Accuracy	2
Adaptive Testing	2
Classification	2
Comparative Analysis	2
Equated Scores	2
Mathematics Tests	2
Nonparametric Statistics	2
Psychometrics	2
Robustness (Statistics)	2
Scores	2
Statistical Bias	2
Test Length	2
Academic Accommodations…	1
More ▼

Source

Educational and Psychological…	3
Journal of Educational and…	2
Applied Psychological…	1
ETS Research Report Series	1
Educational Sciences: Theory…	1
Measurement:…	1
National Center for Research…	1
Practical Assessment,…	1
ProQuest LLC	1

Publication Type

Journal Articles	10
Reports - Research	7
Reports - Evaluative	4
Dissertations/Theses -…	1
Reports - Descriptive	1
Speeches/Meeting Papers	1

Education Level

Elementary Education	2
Early Childhood Education	1
Grade 3	1
Grade 7	1
Grade 8	1
Junior High Schools	1
Middle Schools	1
Primary Education	1
Secondary Education	1

Audience

Location

South Korea

Laws, Policies, & Programs

Assessments and Surveys

Work Keys (ACT)

What Works Clearinghouse Rating

Showing all 13 results Save | Export

Item Parameter Recovery: Sensitivity to Prior Distribution

Peer reviewed

Direct link

Christine E. DeMars; Paulius Satkus – Educational and Psychological Measurement, 2024

Marginal maximum likelihood, a common estimation method for item response theory models, is not inherently a Bayesian procedure. However, due to estimation difficulties, Bayesian priors are often applied to the likelihood when estimating 3PL models, especially with small samples. Little focus has been placed on choosing the priors for marginal…

Descriptors: Item Response Theory, Statistical Distributions, Error of Measurement, Bayesian Statistics

Model Misspecification and Robustness of Observed-Score Test Equating Using Propensity Scores

Peer reviewed

Direct link

Wallin, Gabriel; Wiberg, Marie – Journal of Educational and Behavioral Statistics, 2023

This study explores the usefulness of covariates on equating test scores from nonequivalent test groups. The covariates are captured by an estimated propensity score, which is used as a proxy for latent ability to balance the test groups. The objective is to assess the sensitivity of the equated scores to various misspecifications in the…

Descriptors: Models, Error of Measurement, Robustness (Statistics), Equated Scores

Investigating Confidence Intervals of Item Parameters When Some Item Parameters Take Priors in the 2PL and 3PL Models

Peer reviewed

Direct link

Paek, Insu; Lin, Zhongtian; Chalmers, Robert Philip – Educational and Psychological Measurement, 2023

To reduce the chance of Heywood cases or nonconvergence in estimating the 2PL or the 3PL model in the marginal maximum likelihood with the expectation-maximization (MML-EM) estimation method, priors for the item slope parameter in the 2PL model or for the pseudo-guessing parameter in the 3PL model can be used and the marginal maximum a posteriori…

Descriptors: Models, Item Response Theory, Test Items, Intervals

Functional Data Analysis and Person Response Functions

Peer reviewed

Direct link

Turner, Kyle T.; Engelhard, George, Jr. – Measurement: Interdisciplinary Research and Perspectives, 2023

The purpose of this study is to illustrate the use of functional data analysis (FDA) as a general methodology for analyzing person response functions (PRFs). Applications of FDA to psychometrics have included the estimation of item response functions and latent distributions, as well as differential item functioning. Although FDA has been…

Descriptors: Data Analysis, Item Response Theory, Psychometrics, Statistical Distributions

Examination of Polytomous Items' Psychometric Properties According to Nonparametric Item Response Theory Models in Different Test Conditions

Peer reviewed
PDF on ERIC

Download full text

Sengul Avsar, Asiye; Tavsancil, Ezel – Educational Sciences: Theory and Practice, 2017

This study analysed polytomous items' psychometric properties according to nonparametric item response theory (NIRT) models. Thus, simulated datasets--three different test lengths (10, 20 and 30 items), three sample distributions (normal, right and left skewed) and three samples sizes (100, 250 and 500)--were generated by conducting 20…

Descriptors: Test Items, Psychometrics, Nonparametric Statistics, Item Response Theory

Effectiveness of Item Response Theory (IRT) Proficiency Estimation Methods under Adaptive Multistage Testing. Research Report. ETS RR-15-11

Peer reviewed
PDF on ERIC

Download full text

Kim, Sooyeon; Moses, Tim; Yoo, Hanwook Henry – ETS Research Report Series, 2015

The purpose of this inquiry was to investigate the effectiveness of item response theory (IRT) proficiency estimators in terms of estimation bias and error under multistage testing (MST). We chose a 2-stage MST design in which 1 adaptation to the examinees' ability levels takes place. It includes 4 modules (1 at Stage 1, 3 at Stage 2) and 3 paths…

Descriptors: Item Response Theory, Computation, Statistical Bias, Error of Measurement

Assessment of Person Fit for Mixed-Format Tests

Peer reviewed

Direct link

Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2015

Person-fit assessment may help the researcher to obtain additional information regarding the answering behavior of persons. Although several researchers examined person fit, there is a lack of research on person-fit assessment for mixed-format tests. In this article, the lz statistic and the ?2 statistic, both of which have been used for tests…

Descriptors: Test Format, Goodness of Fit, Item Response Theory, Bayesian Statistics

Differential Item Functioning for Accommodated Students with Disabilities: Effect of Differences in Proficiency Distributions

Direct link

Quesen, Sarah – ProQuest LLC, 2016

When studying differential item functioning (DIF) with students with disabilities (SWD) focal groups typically suffer from small sample size, whereas the reference group population is usually large. This makes it possible for a researcher to select a sample from the reference population to be similar to the focal group on the ability scale. Doing…

Descriptors: Test Items, Academic Accommodations (Disabilities), Testing Accommodations, Disabilities

A Comparison of Three IRT Approaches to Examinee Ability Change Modeling in a Single-Group Anchor Test Design

Peer reviewed

Direct link

Paek, Insu; Park, Hyun-Jeong; Cai, Li; Chi, Eunlim – Educational and Psychological Measurement, 2014

Typically a longitudinal growth modeling based on item response theory (IRT) requires repeated measures data from a single group with the same test design. If operational or item exposure problems are present, the same test may not be employed to collect data for longitudinal analyses and tests at multiple time points are constructed with unique…

Descriptors: Item Response Theory, Comparative Analysis, Test Items, Equated Scores

A New Statistic for Evaluating Item Response Theory Models for Ordinal Data. CRESST Report 839

Download full text

Cai, Li; Monroe, Scott – National Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2014

We propose a new limited-information goodness of fit test statistic C[subscript 2] for ordinal IRT models. The construction of the new statistic lies formally between the M[subscript 2] statistic of Maydeu-Olivares and Joe (2006), which utilizes first and second order marginal probabilities, and the M*[subscript 2] statistic of Cai and Hansen…

Descriptors: Item Response Theory, Models, Goodness of Fit, Probability

Computing the Expected Proportions of Misclassified Examinees.

Peer reviewed

Rudner, Lawrence M. – Practical Assessment, Research & Evaluation, 2001

Provides and illustrates a method to compute the expected number of misclassifications of examinees using three-parameter item response theory and two state classifications (mastery or nonmastery). The method uses the standard error and the expected examinee ability distribution. (SLD)

Descriptors: Ability, Classification, Computation, Error of Measurement

The Distribution of Indexes of Person Fit within the Computerized Adaptive Testing Environment.

Peer reviewed

Nering, Michael L. – Applied Psychological Measurement, 1997

Evaluated the distribution of person fit within the computerized-adaptive testing (CAT) environment through simulation. Found that, within the CAT environment, these indexes tend not to follow a standard normal distribution. Person fit indexes had means and standard deviations that were quite different from the expected. (SLD)

Descriptors: Adaptive Testing, Computer Assisted Testing, Error of Measurement, Item Response Theory

Conditional Standard Errors, Reliability and Decision Consistency of Performance Levels Using Polytomous IRT.

Wang, Tianyou; And Others – 1996

M. J. Kolen, B. A. Hanson, and R. L. Brennan (1992) presented a procedure for assessing the conditional standard error of measurement (CSEM) of scale scores using a strong true-score model. They also investigated the ways of using nonlinear transformation from number-correct raw score to scale score to equalize the conditional standard error along…

Descriptors: Ability, Classification, Error of Measurement, Goodness of Fit

Cai, Li	2
Paek, Insu	2
Chalmers, Robert Philip	1
Chi, Eunlim	1
Christine E. DeMars	1
Engelhard, George, Jr.	1
Kim, Sooyeon	1
Lin, Zhongtian	1
Monroe, Scott	1
Moses, Tim	1
Nering, Michael L.	1
Park, Hyun-Jeong	1
Paulius Satkus	1
Quesen, Sarah	1
Rudner, Lawrence M.	1
Sengul Avsar, Asiye	1
Sinharay, Sandip	1
Tavsancil, Ezel	1
Turner, Kyle T.	1
Wallin, Gabriel	1
Wang, Tianyou	1
Wiberg, Marie	1
Yoo, Hanwook Henry	1
More ▼