ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	2
Since 2017 (last 10 years)	2
Since 2007 (last 20 years)	10

Descriptor

Nonparametric Statistics	16
Simulation	16
Test Items	16
Item Response Theory	10
Sample Size	7
Test Length	6
Correlation	4
Scores	4
Achievement Tests	3
Classification	3
Comparative Analysis	3
Computation	3
Error of Measurement	3
Goodness of Fit	3
Item Analysis	3
Regression (Statistics)	3
Cutting Scores	2
Data Analysis	2
Difficulty Level	2
Error Patterns	2
Evaluation Methods	2
Foreign Countries	2
Group Membership	2
Identification	2
Models	2
More ▼

Source

Applied Psychological…	5
Applied Measurement in…	3
Educational and Psychological…	3
Journal of Educational…	2
Society for Research on…	1

Publication Type

Journal Articles	13
Reports - Evaluative	9
Reports - Research	7
Speeches/Meeting Papers	1

Education Level

Secondary Education

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

Program for International…

What Works Clearinghouse Rating

Showing 1 to 15 of 16 results Save | Export

Identifying Problematic Item Characteristics with Small Samples Using Mokken Scale Analysis

Peer reviewed

Direct link

Wind, Stefanie A. – Educational and Psychological Measurement, 2022

Researchers frequently use Mokken scale analysis (MSA), which is a nonparametric approach to item response theory, when they have relatively small samples of examinees. Researchers have provided some guidance regarding the minimum sample size for applications of MSA under various conditions. However, these studies have not focused on item-level…

Descriptors: Nonparametric Statistics, Item Response Theory, Sample Size, Test Items

Comparing the Robustness of Three Nonparametric DIF Procedures to Differential Rapid Guessing

Peer reviewed

Direct link

Abulela, Mohammed A. A.; Rios, Joseph A. – Applied Measurement in Education, 2022

When there are no personal consequences associated with test performance for examinees, rapid guessing (RG) is a concern and can differ between subgroups. To date, the impact of differential RG on item-level measurement invariance has received minimal attention. To that end, a simulation study was conducted to examine the robustness of the…

Descriptors: Comparative Analysis, Robustness (Statistics), Nonparametric Statistics, Item Analysis

On the Bias-Amplifying Effect of Near Instruments in Observational Studies

Peer reviewed
PDF on ERIC

Download full text

Steiner, Peter M.; Kim, Yongnam – Society for Research on Educational Effectiveness, 2014

In contrast to randomized experiments, the estimation of unbiased treatment effects from observational data requires an analysis that conditions on all confounding covariates. Conditioning on covariates can be done via standard parametric regression techniques or nonparametric matching like propensity score (PS) matching. The regression or…

Descriptors: Observation, Research Methodology, Test Bias, Regression (Statistics)

Minimum Sample Size Requirements for Mokken Scale Analysis

Peer reviewed

Direct link

Straat, J. Hendrik; van der Ark, L. Andries; Sijtsma, Klaas – Educational and Psychological Measurement, 2014

An automated item selection procedure in Mokken scale analysis partitions a set of items into one or more Mokken scales, if the data allow. Two algorithms are available that pursue the same goal of selecting Mokken scales of maximum length: Mokken's original automated item selection procedure (AISP) and a genetic algorithm (GA). Minimum…

Descriptors: Sampling, Test Items, Effect Size, Scaling

On the Use of Nonparametric Item Characteristic Curve Estimation Techniques for Checking Parametric Model Fit

Peer reviewed

Direct link

Lee, Young-Sun; Wollack, James A.; Douglas, Jeffrey – Educational and Psychological Measurement, 2009

The purpose of this study was to assess the model fit of a 2PL through comparison with the nonparametric item characteristic curve (ICC) estimation procedures. Results indicate that three nonparametric procedures implemented produced ICCs that are similar to that of the 2PL for items simulated to fit the 2PL. However for misfitting items,…

Descriptors: Nonparametric Statistics, Item Response Theory, Test Items, Simulation

Three Classes of Nonparametric Differential Step Functioning Effect Estimators

Peer reviewed

Direct link

Penfield, Randall D. – Applied Psychological Measurement, 2008

The examination of measurement invariance in polytomous items is complicated by the possibility that the magnitude and sign of lack of invariance may vary across the steps underlying the set of polytomous response options, a concept referred to as differential step functioning (DSF). This article describes three classes of nonparametric DSF effect…

Descriptors: Simulation, Nonparametric Statistics, Item Response Theory, Computation

Comparison of Parametric and Nonparametric Bootstrap Methods for Estimating Random Error in Equipercentile Equating

Peer reviewed

Direct link

Cui, Zhongmin; Kolen, Michael J. – Applied Psychological Measurement, 2008

This article considers two methods of estimating standard errors of equipercentile equating: the parametric bootstrap method and the nonparametric bootstrap method. Using a simulation study, these two methods are compared under three sample sizes (300, 1,000, and 3,000), for two test content areas (the Iowa Tests of Basic Skills Maps and Diagrams…

Descriptors: Test Length, Test Content, Simulation, Computation

Nonparametric Person-Fit Analysis of Polytomous Item Scores

Peer reviewed

Direct link

Emons, Wilco H. M. – Applied Psychological Measurement, 2008

Person-fit methods are used to uncover atypical test performance as reflected in the pattern of scores on individual items in a test. Unlike parametric person-fit statistics, nonparametric person-fit statistics do not require fitting a parametric test theory model. This study investigates the effectiveness of generalizations of nonparametric…

Descriptors: Simulation, Nonparametric Statistics, Item Response Theory, Goodness of Fit

Consistent Estimation of Rasch Item Parameters and Their Standard Errors under Complex Sample Designs

Peer reviewed

Direct link

Cohen, Jon; Chan, Tsze; Jiang, Tao; Seburn, Mary – Applied Psychological Measurement, 2008

U.S. state educational testing programs administer tests to track student progress and hold schools accountable for educational outcomes. Methods from item response theory, especially Rasch models, are usually used to equate different forms of a test. The most popular method for estimating Rasch models yields inconsistent estimates and relies on…

Descriptors: Testing Programs, Educational Testing, Item Response Theory, Computation

Investigation of a Nonparametric Procedure for Assessing Goodness-of-Fit in Item Response Theory

Peer reviewed

Direct link

Wells, Craig S.; Bolt, Daniel M. – Applied Measurement in Education, 2008

Tests of model misfit are often performed to validate the use of a particular model in item response theory. Douglas and Cohen (2001) introduced a general nonparametric approach for detecting misfit under the two-parameter logistic model. However, the statistical properties of their approach, and empirical comparisons to other methods, have not…

Descriptors: Test Length, Test Items, Monte Carlo Methods, Nonparametric Statistics

Testing Features of Graphical DIF: Application of a Regression Correction to Three Nonparametric Statistical Tests

Peer reviewed

Direct link

Bolt, Daniel M.; Gierl, Mark J. – Journal of Educational Measurement, 2006

Inspection of differential item functioning (DIF) in translated test items can be informed by graphical comparisons of item response functions (IRFs) across translated forms. Due to the many forms of DIF that can emerge in such analyses, it is important to develop statistical tests that can confirm various characteristics of DIF when present.…

Descriptors: Regression (Statistics), Tests, Test Bias, Test Items

Influence of Test and Person Characteristics on Nonparametric Appropriateness Measurement.

Peer reviewed

Meijer, Rob R.; And Others – Applied Psychological Measurement, 1994

The power of the nonparametric person-fit statistic, U3, is investigated through simulations as a function of item characteristics, test characteristics, person characteristics, and the group to which examinees belong. Results suggest conditions under which relatively short tests can be used for person-fit analysis. (SLD)

Descriptors: Difficulty Level, Group Membership, Item Response Theory, Nonparametric Statistics

Nonparametric Person-Fit Research: Some Theoretical Issues and an Empirical Example.

Peer reviewed

Meijer, Rob R.; And Others – Applied Measurement in Education, 1996

Several existing group-based statistics to detect improbable item score patterns are discussed, along with the cut scores proposed in the literature to classify an item score pattern as aberrant. A simulation study and an empirical study are used to compare the statistics and their use and to investigate the practical use of cut scores. (SLD)

Descriptors: Achievement Tests, Classification, Cutting Scores, Identification

Evaluating DETECT Classification Accuracy and Consistency when Data Display Complex Structure

Peer reviewed

Direct link

Gierl, Mark J.; Leighton, Jacqueline P.; Tan, Xuan – Journal of Educational Measurement, 2006

DETECT, the acronym for Dimensionality Evaluation To Enumerate Contributing Traits, is an innovative and relatively new nonparametric dimensionality assessment procedure used to identify mutually exclusive, dimensionally homogeneous clusters of items using a genetic algorithm ( Zhang & Stout, 1999). Because the clusters of items are mutually…

Descriptors: Program Evaluation, Cluster Grouping, Evaluation Methods, Multivariate Analysis

Testing the Robustness of DIMTEST on Nonnormal Ability Distributions.

Download full text

Nandakumar, Ratna; Yu, Feng – 1994

DIMTEST is a statistical test procedure for assessing essential unidimensionality of binary test item responses. The test statistic T used for testing the null hypothesis of essential unidimensionality is a nonparametric statistic. That is, there is no particular parametric distribution assumed for the underlying ability distribution or for the…

Descriptors: Ability, Content Validity, Correlation, Nonparametric Statistics

Previous Page | Next Page »

Pages: 1 | 2

Meijer, Rob R.	3
Bolt, Daniel M.	2
Gierl, Mark J.	2
Abulela, Mohammed A. A.	1
Chan, Tsze	1
Cohen, Jon	1
Cui, Zhongmin	1
Douglas, Jeffrey	1
Emons, Wilco H. M.	1
Jiang, Tao	1
Kim, Yongnam	1
Kolen, Michael J.	1
Lee, Young-Sun	1
Leighton, Jacqueline P.	1
Nandakumar, Ratna	1
Penfield, Randall D.	1
Rios, Joseph A.	1
Seburn, Mary	1
Sijtsma, Klaas	1
Steiner, Peter M.	1
Straat, J. Hendrik	1
Tan, Xuan	1
Wells, Craig S.	1
Wind, Stefanie A.	1
More ▼