Showing 1 to 15 of 27 results
Peer reviewed
Direct link
Sengül Avsar, Asiye – Measurement: Interdisciplinary Research and Perspectives, 2020
To obtain valid and reliable test scores, various test theories have been developed; one of them is nonparametric item response theory (NIRT). Mokken models are the most widely known NIRT models and are useful for small samples and short tests. The Mokken package supports Mokken scale analysis. An important issue about validity is…
Descriptors: Response Style (Tests), Nonparametric Statistics, Item Response Theory, Test Validity
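The Mokken package named in the abstract is not reproduced here; as a rough Python sketch of the central quantity in Mokken scaling, the following computes the total scalability coefficient H for dichotomous items as the ratio of summed observed to summed maximum inter-item covariances. The simulated data and the 0.3 rule of thumb in the final comment are illustrative conventions, not results from the study.

```python
# A minimal sketch, assuming dichotomous (0/1) item scores: the Loevinger/Mokken
# total scalability coefficient H, the quantity Mokken scale analysis uses to
# judge whether items form a scale. Not the Mokken package itself.
import numpy as np

def scale_H(X):
    """X: (n_persons, n_items) 0/1 matrix. Returns the total scalability coefficient H."""
    X = np.asarray(X, dtype=float)
    p = X.mean(axis=0)                          # item proportions correct
    cov = np.cov(X, rowvar=False, bias=True)    # observed inter-item covariances
    num = den = 0.0
    n_items = X.shape[1]
    for i in range(n_items):
        for j in range(i + 1, n_items):
            num += cov[i, j]
            den += min(p[i], p[j]) - p[i] * p[j]   # maximum covariance given the marginals
    return num / den

# toy data: 1,000 simulated examinees, 5 items of increasing difficulty
rng = np.random.default_rng(0)
theta = rng.normal(size=(1000, 1))
X = (theta + rng.normal(size=(1000, 5)) > np.array([-1.0, -0.5, 0.0, 0.5, 1.0])).astype(int)
print(round(scale_H(X), 3))   # H >= 0.3 is the conventional lower bound for a (weak) Mokken scale
```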
Peer reviewed
Direct link
Wind, Stefanie A. – Language Testing, 2019
Differences in rater judgments that are systematically related to construct-irrelevant characteristics threaten the fairness of rater-mediated writing assessments. Accordingly, it is essential that researchers and practitioners examine the degree to which the psychometric quality of rater judgments is comparable across test-taker subgroups.…
Descriptors: Nonparametric Statistics, Interrater Reliability, Differences, Writing Tests
Peer reviewed
Direct link
Wind, Stefanie A. – Measurement: Interdisciplinary Research and Perspectives, 2020
A major challenge in the widespread application of Mokken scale analysis (MSA) to educational performance assessments is the requirement of complete data, where every rater rates every student. In this study, simulated and real data are used to demonstrate a method by which researchers and practitioners can apply MSA to incomplete rating designs.…
Descriptors: Item Response Theory, Scaling, Nonparametric Statistics, Performance Based Assessment
Peer reviewed
Direct link
Qiu, Yuxi; Huggins-Manley, Anne Corinne – Educational and Psychological Measurement, 2019
This study assessed the accuracy of the empirical item characteristic curve (EICC) preequating method in the presence of test speededness. The simulation design considered the proportion of speededness, the speededness point, the speededness rate, the proportion of missing responses on speeded items, sample size, and test length. After crossing…
Descriptors: Accuracy, Equated Scores, Test Items, Nonparametric Statistics
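As a hedged illustration of the basic object behind EICC preequating (not the authors' simulation design), the sketch below estimates an item's characteristic curve empirically as the proportion correct at each level of a matching score; the data, the variable name `anchor`, and all parameter values are assumptions of the sketch.

```python
# A hedged sketch of an empirical item characteristic curve: the proportion of
# examinees answering an item correctly at each level of a matching score.
# All data are simulated; "anchor" here is simply the score on the remaining items.
import numpy as np

rng = np.random.default_rng(6)
theta = rng.normal(size=4000)
difficulties = np.linspace(-1.0, 1.0, 15)
P = 1.0 / (1.0 + np.exp(-(theta[:, None] - difficulties)))   # 1PL response probabilities
X = (rng.random((4000, 15)) < P).astype(int)

item = X[:, 0]                      # the item whose curve we estimate
anchor = X[:, 1:].sum(axis=1)       # matching score from the other items
for s in range(int(anchor.max()) + 1):
    k = anchor == s
    if k.any():
        print(f"anchor score {s:2d}: empirical P(correct) = {item[k].mean():.2f}")
```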
Jinjin Huang – ProQuest LLC, 2020
Measurement invariance is crucial for an effective and valid measure of a construct. Invariance holds when the measure functions consistently across subgroups; in other words, mean differences among subgroups are due only to true differences in latent ability. Differential item functioning (DIF) occurs when measurement invariance is violated.…
Descriptors: Robustness (Statistics), Item Response Theory, Test Items, Item Analysis
Peer reviewed
Direct link
Wind, Stefanie A.; Engelhard, George, Jr. – Educational and Psychological Measurement, 2016
Mokken scale analysis is a probabilistic nonparametric approach that offers statistical and graphical tools for evaluating the quality of social science measurement without placing potentially inappropriate restrictions on the structure of a data set. In particular, Mokken scaling provides a useful method for evaluating important measurement…
Descriptors: Nonparametric Statistics, Statistical Analysis, Measurement, Psychometrics
Peer reviewed
Direct link
Svetina, Dubravka; Levy, Roy – Journal of Experimental Education, 2016
This study investigated the effect of complex structure on dimensionality assessment in compensatory multidimensional item response models using DETECT- and NOHARM-based methods. The performance was evaluated via the accuracy of identifying the correct number of dimensions and the ability to accurately recover item groupings using a simple…
Descriptors: Item Response Theory, Accuracy, Correlation, Sample Size
Peer reviewed
PDF on ERIC (full text)
Guo, Hongwen; Zu, Jiyun; Kyllonen, Patrick; Schmitt, Neal – ETS Research Report Series, 2016
In this report, systematic applications of statistical and psychometric methods are used to develop and evaluate scoring rules in terms of test reliability. Data collected from a situational judgment test are used to facilitate the comparison. For a well-developed item with appropriate keys (i.e., the correct answers), agreement among various…
Descriptors: Scoring, Test Reliability, Statistical Analysis, Psychometrics
Peer reviewed
Direct link
Liang, Longjuan; Browne, Michael W. – Journal of Educational and Behavioral Statistics, 2015
If standard two-parameter item response functions are employed in the analysis of a test with some newly constructed items, it can be expected that, for some items, the item response function (IRF) will not fit the data well. This lack of fit can also occur when standard IRFs are fitted to personality or psychopathology items. When investigating…
Descriptors: Item Response Theory, Statistical Analysis, Goodness of Fit, Bayesian Statistics
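For readers unfamiliar with the model the abstract names, here is a minimal sketch, not the authors' Bayesian approach, of the standard two-parameter logistic item response function compared against a crude binned empirical curve; the data are simulated from a deliberately non-2PL process so the lack of fit is visible.

```python
# A minimal sketch, assuming simulated data: the standard two-parameter logistic
# (2PL) item response function versus a crude binned empirical curve. The
# responses are generated from a non-2PL (asymmetric) process so the 2PL misfits.
import numpy as np

def irf_2pl(theta, a, b):
    """P(correct | theta) under the 2PL model with discrimination a and difficulty b."""
    return 1.0 / (1.0 + np.exp(-a * (theta - b)))

rng = np.random.default_rng(1)
theta = rng.normal(size=5000)
p_true = irf_2pl(theta, a=1.2, b=0.0) ** 2          # squared: asymmetric, not a 2PL curve
x = rng.binomial(1, p_true)

bins = np.linspace(-3, 3, 13)
for lo, hi in zip(bins[:-1], bins[1:]):
    k = (theta >= lo) & (theta < hi)
    if k.any():
        c = 0.5 * (lo + hi)
        print(f"theta={c:+.2f}  empirical={x[k].mean():.2f}  2PL(a=1.2,b=0)={irf_2pl(c, 1.2, 0.0):.2f}")
```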
Peer reviewed
Direct link
Finch, Holmes; Edwards, Julianne M. – Educational and Psychological Measurement, 2016
Standard approaches for estimating item response theory (IRT) model parameters generally work under the assumption that the latent trait being measured by a set of items follows the normal distribution. Estimation of IRT parameters in the presence of nonnormal latent traits has been shown to generate biased person and item parameter estimates. A…
Descriptors: Item Response Theory, Computation, Nonparametric Statistics, Bayesian Statistics
Peer reviewed
Direct link
Golino, Hudson F.; Gomes, Cristiano M. A. – International Journal of Research & Method in Education, 2016
This paper presents a non-parametric imputation technique, named random forest, from the machine learning field. The random forest procedure has two main tuning parameters: the number of trees grown in the prediction and the number of predictors used. Fifty experimental conditions were created in the imputation procedure, with different…
Descriptors: Item Response Theory, Regression (Statistics), Difficulty Level, Goodness of Fit
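The abstract does not say which implementation the authors used; as a hedged Python approximation of random-forest imputation, the sketch below uses scikit-learn's IterativeImputer with a random-forest estimator, where n_estimators and max_features stand in for the two tuning parameters the abstract mentions. The data and missingness rate are invented for illustration.

```python
# A hedged sketch of random-forest imputation using scikit-learn's
# IterativeImputer with a RandomForestRegressor; the data, the 10% missingness
# rate, and the parameter values below are illustrative assumptions.
import numpy as np
from sklearn.experimental import enable_iterative_imputer  # noqa: F401 (activates IterativeImputer)
from sklearn.impute import IterativeImputer
from sklearn.ensemble import RandomForestRegressor

rng = np.random.default_rng(2)
X = rng.normal(size=(200, 6))
X[:, 3] += 0.8 * X[:, 0]                       # correlated columns give the forest something to learn
mask = rng.random(X.shape) < 0.10              # ~10% of values missing completely at random
X_missing = np.where(mask, np.nan, X)

imputer = IterativeImputer(
    estimator=RandomForestRegressor(n_estimators=100,   # number of trees grown
                                    max_features=3,     # predictors tried at each split
                                    random_state=0),
    max_iter=10, random_state=0)
X_imputed = imputer.fit_transform(X_missing)
print(round(np.abs(X_imputed[mask] - X[mask]).mean(), 3))   # mean absolute imputation error
```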
Peer reviewed
Direct link
Socha, Alan; DeMars, Christine E.; Zilberberg, Anna; Phan, Ha – International Journal of Testing, 2015
The Mantel-Haenszel (MH) procedure is commonly used to detect items that function differentially for groups of examinees from various demographic and linguistic backgrounds--for example, in international assessments. As in some other DIF methods, the total score is used to match examinees on ability. In thin matching, each of the total score…
Descriptors: Test Items, Educational Testing, Evaluation Methods, Ability Grouping
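Below is a minimal sketch of the Mantel-Haenszel procedure described in the abstract, with thin matching in the sense that every total-score level forms its own stratum; the simulated data contain no DIF, and the studied item, sample size, and ETS delta transformation are illustrative choices of the sketch.

```python
# A minimal sketch of the Mantel-Haenszel DIF statistic with thin matching:
# each total-score level is its own stratum. The simulated data contain no DIF,
# so the reported delta value should be close to zero.
import numpy as np

def mh_odds_ratio(item, total, group):
    """item: 0/1 responses to the studied item; total: matching score;
    group: 0 = reference, 1 = focal. Returns the MH common odds ratio."""
    num = den = 0.0
    for s in np.unique(total):
        k = total == s
        A = np.sum((group[k] == 0) & (item[k] == 1))   # reference, correct
        B = np.sum((group[k] == 0) & (item[k] == 0))   # reference, incorrect
        C = np.sum((group[k] == 1) & (item[k] == 1))   # focal, correct
        D = np.sum((group[k] == 1) & (item[k] == 0))   # focal, incorrect
        T = A + B + C + D
        if T > 0:
            num += A * D / T
            den += B * C / T
    return num / den

rng = np.random.default_rng(3)
n, n_items = 2000, 20
group = rng.integers(0, 2, size=n)
theta = rng.normal(size=n)
X = (rng.random((n, n_items)) < 1.0 / (1.0 + np.exp(-theta[:, None]))).astype(int)
alpha = mh_odds_ratio(X[:, 0], X.sum(axis=1), group)
print(round(-2.35 * np.log(alpha), 2))   # ETS delta metric; values near 0 indicate negligible DIF
```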
Peer reviewed
Direct link
Chiu, Chia-Yi – Applied Psychological Measurement, 2013
Most methods for fitting cognitive diagnosis models to educational test data and assigning examinees to proficiency classes require the Q-matrix that associates each item in a test with the cognitive skills (attributes) needed to answer it correctly. In most cases, the Q-matrix is not known but is constructed from the (fallible) judgments of…
Descriptors: Cognitive Tests, Diagnostic Tests, Models, Statistical Analysis
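To make the role of the Q-matrix concrete, here is a small illustrative sketch using the DINA model's ideal-response rule as the simplest case; the Q-matrix and attribute patterns below are invented examples, not data or methods from the study.

```python
# A hedged illustration of how a Q-matrix links items to attributes, using the
# DINA model's ideal response (no slips or guesses). The matrices are invented.
import numpy as np

Q = np.array([[1, 0, 0],       # item 1 requires attribute 1
              [0, 1, 0],       # item 2 requires attribute 2
              [1, 1, 0],       # item 3 requires attributes 1 and 2
              [0, 1, 1]])      # item 4 requires attributes 2 and 3

alpha = np.array([[1, 1, 0],   # examinee A has mastered attributes 1 and 2
                  [0, 1, 1]])  # examinee B has mastered attributes 2 and 3

# An examinee's ideal response to an item is correct only if every attribute
# the Q-matrix lists for that item has been mastered.
eta = (alpha @ Q.T == Q.sum(axis=1)).astype(int)
print(eta)   # rows = examinees, columns = items -> [[1 1 1 0], [0 1 0 1]]
```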
Peer reviewed
PDF on ERIC (full text)
Kalkan, Ömür Kaya; Kelecioglu, Hülya – Educational Sciences: Theory and Practice, 2016
Linear factor analysis models used to examine the constructs underlying responses are not well suited to dichotomous or polytomous response formats. The associated problems cannot be eliminated simply by substituting polychoric or tetrachoric correlations for the Pearson correlation. Therefore, we considered parameters obtained from the NOHARM and FACTOR…
Descriptors: Sample Size, Nonparametric Statistics, Factor Analysis, Correlation
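As a small illustrative aside (the bivariate-normal responses and thresholds are assumptions of the sketch, not of the study), the following shows one reason the Pearson correlation is awkward for dichotomous items: dichotomizing attenuates it, and more severely when the two items differ sharply in difficulty.

```python
# An illustrative sketch, assuming bivariate-normal underlying responses and
# arbitrary thresholds: the Pearson correlation between dichotomized items is
# attenuated relative to the underlying correlation, more so for unequal difficulties.
import numpy as np

rng = np.random.default_rng(4)
rho = 0.6
z = rng.multivariate_normal([0.0, 0.0], [[1.0, rho], [rho, 1.0]], size=100_000)

for t1, t2 in [(0.0, 0.0), (0.0, 1.5)]:            # equal vs. very unequal item difficulty
    x = (z[:, 0] > t1).astype(int)
    y = (z[:, 1] > t2).astype(int)
    print((t1, t2), round(float(np.corrcoef(x, y)[0, 1]), 3))   # both well below 0.6
```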
Peer reviewed
Direct link
Deng, Nina; Han, Kyung T.; Hambleton, Ronald K. – Applied Psychological Measurement, 2013
DIMPACK Version 1.0, software for assessing test dimensionality based on a nonparametric conditional covariance approach, is reviewed. The software was originally distributed by Assessment Systems Corporation and can now be freely accessed online. It consists of Windows-based interfaces for three components: DIMTEST, DETECT, and CCPROX/HAC, which…
Descriptors: Item Response Theory, Nonparametric Statistics, Statistical Analysis, Computer Software
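DIMPACK itself is not reproduced here; as a rough sketch of the basic quantity that DETECT and CCPROX/HAC aggregate, the following estimates the conditional covariance between two items given the rest score, which should be near zero when the data are unidimensional. The simulated data set is an illustrative assumption.

```python
# A rough sketch of the conditional-covariance idea behind DETECT and
# CCPROX/HAC: the covariance of two items given the rest score, averaged over
# strata. For the unidimensional simulated data below it should be near zero.
import numpy as np

def conditional_cov(X, i, j):
    """Size-weighted average of Cov(X_i, X_j | rest score)."""
    rest = np.delete(X, [i, j], axis=1).sum(axis=1)
    covs, weights = [], []
    for s in np.unique(rest):
        k = rest == s
        if k.sum() > 1:
            covs.append(np.cov(X[k, i], X[k, j], bias=True)[0, 1])
            weights.append(k.sum())
    return float(np.average(covs, weights=weights))

rng = np.random.default_rng(5)
theta = rng.normal(size=(3000, 1))
X = (theta + rng.normal(size=(3000, 12)) > 0).astype(int)   # a single latent dimension
print(round(conditional_cov(X, 0, 1), 4))                   # close to zero
```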