Publication Date
In 2025: 0
Since 2024: 1
Since 2021 (last 5 years): 3
Since 2016 (last 10 years): 6
Since 2006 (last 20 years): 12
Descriptor
Goodness of Fit: 13
Simulation: 13
Item Response Theory: 9
Computation: 7
Models: 7
Statistical Analysis: 6
Maximum Likelihood Statistics: 5
Test Items: 5
Comparative Analysis: 4
Bayesian Statistics: 3
Achievement Tests: 2
Source
Journal of Educational and…: 13
Author
Sinharay, Sandip: 3
Ranger, Jochen: 2
Browne, Michael W.: 1
Eckerly, Carol: 1
Gorney, Kylie: 1
Harel, Daphna: 1
Hedges, Larry V.: 1
Kuhn, Jörg-Tobias: 1
Li, Juan: 1
Ramsay, James O.: 1
Wallmark, Joakim: 1
Publication Type
Journal Articles: 13
Reports - Research: 12
Reports - Evaluative: 1
Education Level
Secondary Education: 2
Elementary Secondary Education: 1
Grade 12: 1
High Schools: 1
Assessments and Surveys
Program for International…: 1
Trends in International…: 1
Joakim Wallmark; James O. Ramsay; Juan Li; Marie Wiberg – Journal of Educational and Behavioral Statistics, 2024
Item response theory (IRT) models the relationship between the possible scores on a test item and a test taker's level of the latent trait that the item is intended to measure. In this study, we compare two models for tests with polytomously scored items: the optimal scoring (OS) model, a nonparametric IRT model based on the principles of…
Descriptors: Item Response Theory, Test Items, Models, Scoring
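As background for the comparison above, a standard parametric model for polytomously scored items is Samejima's graded response model, where each cumulative probability P(X ≥ k) follows a two-parameter logistic curve and category probabilities are differences of adjacent curves. A minimal sketch with illustrative (not study-specific) parameter values:

```python
import math

def grm_category_probs(theta, a, thresholds):
    """Graded response model: P(X >= k | theta) is a 2PL curve for each
    threshold b_k; category probabilities are differences of adjacent
    cumulative curves. `thresholds` must be sorted ascending."""
    # Cumulative probabilities, padded with P(X >= 0) = 1 and P(X > max) = 0.
    cum = ([1.0]
           + [1.0 / (1.0 + math.exp(-a * (theta - b))) for b in thresholds]
           + [0.0])
    return [cum[k] - cum[k + 1] for k in range(len(thresholds) + 1)]

# Illustrative item: discrimination 1.2, three thresholds -> four categories.
probs = grm_category_probs(theta=0.0, a=1.2, thresholds=[-1.0, 0.5, 1.5])
```

The returned probabilities are positive and sum to one for any trait value, which is the basic adequacy check for a polytomous IRF of this kind.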
Gorney, Kylie; Wollack, James A.; Sinharay, Sandip; Eckerly, Carol – Journal of Educational and Behavioral Statistics, 2023
Any time examinees have had access to items and/or answers prior to taking a test, the fairness of the test and validity of test score interpretations are threatened. Therefore, there is a high demand for procedures to detect both compromised items (CI) and examinees with preknowledge (EWP). In this article, we develop a procedure that uses item…
Descriptors: Scores, Test Validity, Test Items, Prior Learning
Harel, Daphna; Steele, Russell J. – Journal of Educational and Behavioral Statistics, 2018
Collapsing categories is a commonly used data reduction technique; however, to date there do not exist principled methods to determine whether collapsing categories is appropriate in practice. With ordinal responses under the partial credit model, when collapsing categories, the true model for the collapsed data is no longer a partial credit…
Descriptors: Matrices, Models, Item Response Theory, Research Methodology
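The data-reduction step the abstract refers to, collapsing adjacent ordinal categories, is mechanically simple even though (as the article argues) its statistical consequences under the partial credit model are not. A generic sketch, with a hypothetical recoding map:

```python
def collapse_categories(responses, mapping):
    """Apply a many-to-one recoding of ordinal response categories.
    `mapping` sends each original category to its collapsed category."""
    return [mapping[r] for r in responses]

# Hypothetical recoding: merge the middle categories 1 and 2
# of a four-category (0-3) item into a single category.
merge_middle = {0: 0, 1: 1, 2: 1, 3: 2}
collapsed = collapse_categories([0, 1, 2, 3, 2], merge_middle)
```

Note the mapping must be order-preserving for the collapsed responses to remain ordinal; the article's point is that even then, the collapsed data no longer follow a partial credit model.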
Monroe, Scott – Journal of Educational and Behavioral Statistics, 2021
This research proposes a new statistic for testing latent variable distribution fit for unidimensional item response theory (IRT) models. If the typical assumption of normality is violated, then item parameter estimates will be biased, and dependent quantities such as IRT score estimates will be adversely affected. The proposed statistic compares…
Descriptors: Item Response Theory, Simulation, Scores, Comparative Analysis
Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2016
Meijer and van Krimpen-Stoop noted that the number of person-fit statistics (PFSs) that have been designed for computerized adaptive tests (CATs) is relatively modest. This article partially addresses that concern by suggesting three new PFSs for CATs. The statistics are based on tests for a change point and can be used to detect an abrupt change…
Descriptors: Computer Assisted Testing, Adaptive Testing, Item Response Theory, Goodness of Fit
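To give the flavor of a change-point test of the kind the abstract describes (this is a generic CUSUM-style illustration, not the article's statistics): scan every split point in a sequence of response residuals and take the largest standardized difference in mean residual before versus after the split.

```python
import math

def change_point_stat(residuals):
    """Largest standardized mean-shift over all candidate change points.
    A large value suggests an abrupt change in answering behavior."""
    n = len(residuals)
    best = 0.0
    for k in range(1, n):
        m1 = sum(residuals[:k]) / k          # mean before the split
        m2 = sum(residuals[k:]) / (n - k)    # mean after the split
        # Weight by sqrt(k(n-k)/n) so splits near the ends are not favored.
        best = max(best, abs(m1 - m2) * math.sqrt(k * (n - k) / n))
    return best
```

Stable behavior yields a statistic near zero, while an abrupt shift midway through the test produces a clearly larger value, the signal such person-fit statistics are designed to detect.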
Yang, Ji Seung; Zheng, Xiaying – Journal of Educational and Behavioral Statistics, 2018
The purpose of this article is to introduce and review the capability and performance of the Stata item response theory (IRT) package available since Stata v.14 (2015). Using a simulated data set and a publicly available item response data set extracted from the Programme for International Student Assessment, we review the IRT package from…
Descriptors: Item Response Theory, Item Analysis, Computer Software, Statistical Analysis
Liang, Longjuan; Browne, Michael W. – Journal of Educational and Behavioral Statistics, 2015
If standard two-parameter item response functions are employed in the analysis of a test with some newly constructed items, it can be expected that, for some items, the item response function (IRF) will not fit the data well. This lack of fit can also occur when standard IRFs are fitted to personality or psychopathology items. When investigating…
Descriptors: Item Response Theory, Statistical Analysis, Goodness of Fit, Bayesian Statistics
Ranger, Jochen; Kuhn, Jörg-Tobias – Journal of Educational and Behavioral Statistics, 2015
In this article, a latent trait model is proposed for the response times in psychological tests. The latent trait model is based on the linear transformation model and subsumes popular models from survival analysis, like the proportional hazards model and the proportional odds model. Core of the model is the assumption that an unspecified monotone…
Descriptors: Psychological Testing, Reaction Time, Statistical Analysis, Models
Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2015
Person-fit assessment may help the researcher to obtain additional information regarding the answering behavior of persons. Although several researchers examined person fit, there is a lack of research on person-fit assessment for mixed-format tests. In this article, the l_z statistic and the χ² statistic, both of which have been used for tests…
Descriptors: Test Format, Goodness of Fit, Item Response Theory, Bayesian Statistics
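For reference, the l_z statistic mentioned above is the standardized log-likelihood of a response pattern under the fitted IRT model (Drasgow, Levine, and Williams, 1985); its mixed-format extension is the article's contribution. A sketch for the dichotomous case:

```python
import math

def lz_statistic(responses, probs):
    """Standardized log-likelihood person-fit statistic
    l_z = (l - E[l]) / sqrt(Var[l]) for 0/1 responses, where `probs`
    are the model-implied probabilities of a correct response."""
    l = sum(u * math.log(p) + (1 - u) * math.log(1 - p)
            for u, p in zip(responses, probs))
    expected = sum(p * math.log(p) + (1 - p) * math.log(1 - p) for p in probs)
    variance = sum(p * (1 - p) * math.log(p / (1 - p)) ** 2 for p in probs)
    return (l - expected) / math.sqrt(variance)
```

Response patterns consistent with the model give l_z near or above zero; aberrant patterns (e.g., missing easy items while answering hard ones correctly) drive l_z strongly negative.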
Pustejovsky, James E.; Hedges, Larry V.; Shadish, William R. – Journal of Educational and Behavioral Statistics, 2014
In single-case research, the multiple baseline design is a widely used approach for evaluating the effects of interventions on individuals. Multiple baseline designs involve repeated measurement of outcomes over time and the controlled introduction of a treatment at different times for different individuals. This article outlines a general…
Descriptors: Hierarchical Linear Modeling, Effect Size, Maximum Likelihood Statistics, Computation
Ranger, Jochen; Kuhn, Jörg-Tobias – Journal of Educational and Behavioral Statistics, 2013
It is common practice to log-transform response times before analyzing them with standard factor analytical methods. However, sometimes the log-transformation is not capable of linearizing the relation between the response times and the latent traits. Therefore, a more general approach to response time analysis is proposed in the current…
Descriptors: Item Response Theory, Simulation, Reaction Time, Least Squares Statistics
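One familiar way to generalize beyond the log-transform discussed above is the Box-Cox family, which contains the log as a limiting case; this is offered only as an illustration of "more general monotone transformation," not as the article's model.

```python
import math

def box_cox(t, lam):
    """Box-Cox transform of a positive response time t.
    lam = 0 recovers the log-transform; other values of lam give
    alternative monotone transformations."""
    if lam == 0:
        return math.log(t)
    return (t ** lam - 1.0) / lam
```

As lam approaches zero the transform converges to log(t), so the standard log-transformed analysis is nested inside the family.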
Verkuilen, Jay; Smithson, Michael – Journal of Educational and Behavioral Statistics, 2012
Doubly bounded continuous data are common in the social and behavioral sciences. Examples include judged probabilities, confidence ratings, derived proportions such as percent time on task, and bounded scale scores. Dependent variables of this kind are often difficult to analyze using normal theory models because their distributions may be quite…
Descriptors: Responses, Regression (Statistics), Statistical Analysis, Models
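A common approach for such doubly bounded outcomes is beta regression, where the conditional mean on (0,1) is modeled through a logit link; boundary values are first compressed into the open interval using the adjustment popularized by Smithson and Verkuilen (2006). A minimal sketch with illustrative coefficients:

```python
import math

def beta_regression_mean(x, beta0, beta1):
    """Beta regression mean structure: mu = logistic(beta0 + beta1 * x),
    guaranteeing predictions stay strictly inside (0, 1)."""
    eta = beta0 + beta1 * x
    return 1.0 / (1.0 + math.exp(-eta))

def rescale_to_open_unit(y, lo, hi, n):
    """Map a score bounded on [lo, hi] into the open interval (0, 1)
    via y'' = (y' * (n - 1) + 0.5) / n, where n is the sample size,
    so boundary observations become usable under a beta likelihood."""
    p = (y - lo) / (hi - lo)
    return (p * (n - 1) + 0.5) / n
```

The logit link is what lets the model respect both bounds, precisely the property normal-theory models lack for this kind of dependent variable.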
Maydeu-Olivares, Albert – Journal of Educational and Behavioral Statistics, 2001
Provides asymptotic formulas for the standard errors of parameter estimates from the NOHARM computer program for restricted and unrestricted rotated models, using large-sample theory, and a goodness-of-fit test of the model. Used simulation to show that results from NOHARM are comparable to the three-stage estimator of B. Muthén (1993). (SLD)
Descriptors: Estimation (Mathematics), Goodness of Fit, Item Response Theory, Mathematical Models