ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	3
Since 2016 (last 10 years)	5
Since 2006 (last 20 years)	8

Descriptor

Nonparametric Statistics	12
Item Response Theory	8
Comparative Analysis	6
Simulation	6
Test Items	6
Goodness of Fit	3
Monte Carlo Methods	3
Scores	3
Achievement Tests	2
Classification	2
Computation	2
Difficulty Level	2
Evaluation Methods	2
Guessing (Tests)	2
Identification	2
International Assessment	2
Responses	2
Sample Size	2
Test Length	2
Testing Problems	2
Accuracy	1
College Freshmen	1
Cross Cultural Studies	1
Cutting Scores	1
Data Analysis	1
More ▼

Source

Applied Measurement in…

Publication Type

Journal Articles	12
Reports - Research	9
Reports - Evaluative	4

Education Level

Secondary Education	2
High Schools	1
Higher Education	1
Postsecondary Education	1

Audience

Location

California

Laws, Policies, & Programs

Assessments and Surveys

Program for International…

What Works Clearinghouse Rating

Showing all 12 results Save | Export

Combining Nonparametric and Parametric Item Response Theory to Explore Data Quality: Illustrations and a Simulation Study

Peer reviewed

Direct link

Stefanie A. Wind; Benjamin Lugu – Applied Measurement in Education, 2024

Researchers who use measurement models for evaluation purposes often select models with stringent requirements, such as Rasch models, which are parametric. Mokken Scale Analysis (MSA) offers a theory-driven nonparametric modeling approach that may be more appropriate for some measurement applications. Researchers have discussed using MSA as a…

Descriptors: Item Response Theory, Data Analysis, Simulation, Nonparametric Statistics

Development and Use of Anchoring Vignettes: Psychometric Investigations and Recommendations for a Nonparametric Approach

Peer reviewed

Direct link

Lee, HyeSun; Smith, Weldon; Martinez, Angel; Ferris, Heather; Bova, Joe – Applied Measurement in Education, 2021

The aim of the current research was to provide recommendations to facilitate the development and use of anchoring vignettes (AVs) for cross-cultural comparisons in education. Study 1 identified six factors leading to order violations and ties in AV responses based on cognitive interviews with 15-year-old students. The factors were categorized into…

Descriptors: Vignettes, Test Items, Equated Scores, Nonparametric Statistics

Comparing the Robustness of Three Nonparametric DIF Procedures to Differential Rapid Guessing

Peer reviewed

Direct link

Abulela, Mohammed A. A.; Rios, Joseph A. – Applied Measurement in Education, 2022

When there are no personal consequences associated with test performance for examinees, rapid guessing (RG) is a concern and can differ between subgroups. To date, the impact of differential RG on item-level measurement invariance has received minimal attention. To that end, a simulation study was conducted to examine the robustness of the…

Descriptors: Comparative Analysis, Robustness (Statistics), Nonparametric Statistics, Item Analysis

Are the Nonparametric Person-Fit Statistics More Powerful than Their Parametric Counterparts? Revisiting the Simulations in Karabatsos (2003)

Peer reviewed

Direct link

Sinharay, Sandip – Applied Measurement in Education, 2017

Karabatsos compared the power of 36 person-fit statistics using receiver operating characteristics curves and found the "H[superscript T]" statistic to be the most powerful in identifying aberrant examinees. He found three statistics, "C", "MCI", and "U3", to be the next most powerful. These four statistics,…

Descriptors: Nonparametric Statistics, Goodness of Fit, Simulation, Comparative Analysis

A Nonparametric Approach for Assessing Goodness-of-Fit of IRT Models in a Mixed Format Test

Peer reviewed

Direct link

Liang, Tie; Wells, Craig S. – Applied Measurement in Education, 2015

Investigating the fit of a parametric model plays a vital role in validating an item response theory (IRT) model. An area that has received little attention is the assessment of multiple IRT models used in a mixed-format test. The present study extends the nonparametric approach, proposed by Douglas and Cohen (2001), to assess model fit of three…

Descriptors: Nonparametric Statistics, Goodness of Fit, Item Response Theory, Test Format

A New Procedure for Detection of Students' Rapid Guessing Responses Using Response Time

Peer reviewed

Direct link

Guo, Hongwen; Rios, Joseph A.; Haberman, Shelby; Liu, Ou Lydia; Wang, Jing; Paek, Insu – Applied Measurement in Education, 2016

Unmotivated test takers using rapid guessing in item responses can affect validity studies and teacher and institution performance evaluation negatively, making it critical to identify these test takers. The authors propose a new nonparametric method for finding response-time thresholds for flagging item responses that result from rapid-guessing…

Descriptors: Guessing (Tests), Reaction Time, Nonparametric Statistics, Models

Investigation of a Nonparametric Procedure for Assessing Goodness-of-Fit in Item Response Theory

Peer reviewed

Direct link

Wells, Craig S.; Bolt, Daniel M. – Applied Measurement in Education, 2008

Tests of model misfit are often performed to validate the use of a particular model in item response theory. Douglas and Cohen (2001) introduced a general nonparametric approach for detecting misfit under the two-parameter logistic model. However, the statistical properties of their approach, and empirical comparisons to other methods, have not…

Descriptors: Test Length, Test Items, Monte Carlo Methods, Nonparametric Statistics

A Monte Carlo Comparison of Parametric and Nonparametric Polytomous DIF Detection Methods.

Peer reviewed

Bolt, Daniel M. – Applied Measurement in Education, 2002

Compared two parametric procedures for detecting differential item functioning (DIF) using the graded response model (GRM), the GRM-likelihood ratio test and the GRM-differential functioning of items and tests, with a nonparametric DIF detection procedure, Poly-SIBTEST. Monte Carlo simulation results show that Poly-SIBTEST showed the least amount…

Descriptors: Comparative Analysis, Item Bias, Monte Carlo Methods, Nonparametric Statistics

Detection of Aberrant Item Score Patterns: A Review of Recent Developments.

Peer reviewed

Meijer, Rob R.; Sijtsma, Klaas – Applied Measurement in Education, 1995

Methods for detecting item score patterns that are unlikely, given that a parametric item response theory model gives an adequate description of the data or given the responses of other persons in the group, are discussed. The use of person-fit statistics in empirical data analysis is briefly discussed. (SLD)

Descriptors: Identification, Item Response Theory, Nonparametric Statistics, Patterns in Mathematics

A Simulation Comparison of Parametric and Nonparametric Dimensionality Detection Procedures

Peer reviewed

Direct link

Mroch, Andrew A.; Bolt, Daniel M. – Applied Measurement in Education, 2006

Recently, nonparametric methods have been proposed that provide a dimensionally based description of test structure for tests with dichotomous items. Because such methods are based on different notions of dimensionality than are assumed when using a psychometric model, it remains unclear whether these procedures might lead to a different…

Descriptors: Simulation, Comparative Analysis, Psychometrics, Methods Research

Mokken Scale Analysis: Theoretical Considerations and an Application to Transivity Tasks.

Peer reviewed

Sijtsma, Klaas, Verweij, Anton C. – Applied Measurement in Education, 1992

Empirical data analysis using the Mokken models of monotone homogeneity and double monotonicity is discussed. Results from the Mokken approach with 3 data sets (for a total of 425 elementary school students) pertaining to transitive interference items are compared to Rasch analysis. (SLD)

Descriptors: Comparative Analysis, Elementary Education, Elementary School Students, Item Response Theory

Nonparametric Person-Fit Research: Some Theoretical Issues and an Empirical Example.

Peer reviewed

Meijer, Rob R.; And Others – Applied Measurement in Education, 1996

Several existing group-based statistics to detect improbable item score patterns are discussed, along with the cut scores proposed in the literature to classify an item score pattern as aberrant. A simulation study and an empirical study are used to compare the statistics and their use and to investigate the practical use of cut scores. (SLD)

Descriptors: Achievement Tests, Classification, Cutting Scores, Identification

Bolt, Daniel M.	3
Meijer, Rob R.	2
Rios, Joseph A.	2
Wells, Craig S.	2
Abulela, Mohammed A. A.	1
Benjamin Lugu	1
Bova, Joe	1
Ferris, Heather	1
Guo, Hongwen	1
Haberman, Shelby	1
Lee, HyeSun	1
Liang, Tie	1
Liu, Ou Lydia	1
Martinez, Angel	1
Mroch, Andrew A.	1
Paek, Insu	1
Sijtsma, Klaas	1
Sijtsma, Klaas, Verweij,…	1
Sinharay, Sandip	1
Smith, Weldon	1
Stefanie A. Wind	1
Wang, Jing	1
More ▼