ERIC - Search Results

Showing 1 to 15 of 17 results

Generalized Discrimination Index

Peer reviewed

Metsämuuronen, Jari – International Journal of Educational Methodology, 2020

Kelley's Discrimination Index (DI) is a simple and robust, classical non-parametric short-cut to estimate the item discrimination power (IDP) in the practical educational settings. Unlike item-total correlation, DI can reach the ultimate values of +1 and -1, and it is stable against the outliers. Because of the computational easiness, DI is…

Descriptors: Test Items, Computation, Item Analysis, Nonparametric Statistics

Evaluating the Accuracy of the Empirical Item Characteristic Curve Preequating Method in the Presence of Test Speededness

Peer reviewed

Qiu, Yuxi; Huggins-Manley, Anne Corinne – Educational and Psychological Measurement, 2019

This study aimed to assess the accuracy of the empirical item characteristic curve (EICC) preequating method given the presence of test speededness. The simulation design of this study considered the proportion of speededness, speededness point, speededness rate, proportion of missing on speeded items, sample size, and test length. After crossing…

Descriptors: Accuracy, Equated Scores, Test Items, Nonparametric Statistics

Assess Robustness of the Rasch Mixture Model to Detect Differential Item Functioning -- A Monte Carlo Study

Huang, Jinjin – ProQuest LLC, 2020

Measurement invariance is crucial for an effective and valid measure of a construct. Invariance holds when the latent trait varies consistently across subgroups; in other words, the mean differences among subgroups are only due to true latent ability differences. Differential item functioning (DIF) occurs when measurement invariance is violated.…

Descriptors: Robustness (Statistics), Item Response Theory, Test Items, Item Analysis

Detection of Uniform and Nonuniform Differential Item Functioning by Item-Focused Trees

Peer reviewed

Berger, Moritz; Tutz, Gerhard – Journal of Educational and Behavioral Statistics, 2016

Detection of differential item functioning (DIF) by use of the logistic modeling approach has a long tradition. One big advantage of the approach is that it can be used to investigate nonuniform (NUDIF) as well as uniform DIF (UDIF). The classical approach allows one to detect DIF by distinguishing between multiple groups. We propose an…

Descriptors: Test Bias, Regression (Statistics), Nonparametric Statistics, Statistical Analysis

Evaluation of Different Scoring Rules for a Noncognitive Test in Development. Research Report. ETS RR-16-03

Peer reviewed

Guo, Hongwen; Zu, Jiyun; Kyllonen, Patrick; Schmitt, Neal – ETS Research Report Series, 2016

In this report, systematic applications of statistical and psychometric methods are used to develop and evaluate scoring rules in terms of test reliability. Data collected from a situational judgment test are used to facilitate the comparison. For a well-developed item with appropriate keys (i.e., the correct answers), agreement among various…

Descriptors: Scoring, Test Reliability, Statistical Analysis, Psychometrics

Rasch Model Parameter Estimation in the Presence of a Nonnormal Latent Trait Using a Nonparametric Bayesian Approach

Peer reviewed

Finch, Holmes; Edwards, Julianne M. – Educational and Psychological Measurement, 2016

Standard approaches for estimating item response theory (IRT) model parameters generally work under the assumption that the latent trait being measured by a set of items follows the normal distribution. Estimation of IRT parameters in the presence of nonnormal latent traits has been shown to generate biased person and item parameter estimates. A…

Descriptors: Item Response Theory, Computation, Nonparametric Statistics, Bayesian Statistics

Random Forest as an Imputation Method for Education and Psychology Research: Its Impact on Item Fit and Difficulty of the Rasch Model

Peer reviewed

Golino, Hudson F.; Gomes, Cristiano M. A. – International Journal of Research & Method in Education, 2016

This paper presents a non-parametric imputation technique, named random forest, from the machine learning field. The random forest procedure has two main tuning parameters: the number of trees grown in the prediction and the number of predictors used. Fifty experimental conditions were created in the imputation procedure, with different…

Descriptors: Item Response Theory, Regression (Statistics), Difficulty Level, Goodness of Fit

Differential Item Functioning Detection with the Mantel-Haenszel Procedure: The Effects of Matching Types and Other Factors

Peer reviewed

Socha, Alan; DeMars, Christine E.; Zilberberg, Anna; Phan, Ha – International Journal of Testing, 2015

The Mantel-Haenszel (MH) procedure is commonly used to detect items that function differentially for groups of examinees from various demographic and linguistic backgrounds--for example, in international assessments. As in some other DIF methods, the total score is used to match examinees on ability. In thin matching, each of the total score…

Descriptors: Test Items, Educational Testing, Evaluation Methods, Ability Grouping

Medical Students' vs. Family Physicians' Assessment of Practical and Logical Values of Pathophysiology Multiple-Choice Questions

Peer reviewed

Secic, Damir; Husremovic, Dzenana; Kapur, Eldan; Jatic, Zaim; Hadziahmetovic, Nina; Vojnikovic, Benjamin; Fajkic, Almir; Meholjic, Amir; Bradic, Lejla; Hadzic, Amila – Advances in Physiology Education, 2017

Testing strategies can either have a very positive or negative effect on the learning process. The aim of this study was to examine the degree of consistency in evaluating the practicality and logic of questions from a medical school pathophysiology test, between students and family medicine doctors. The study engaged 77 family medicine doctors…

Descriptors: Medical Students, Physicians, Medicine, Qualitative Research

Developing an Achievement Test for the Subject of Sound in Science Education

Peer reviewed

Sözen, Merve; Bolat, Mualla – Journal of Education and Learning, 2016

The purpose of this study is to develop an achievement test which includes the basic concepts about the subject of sound and its properties in middle school science lessons and which at the same time aims to reveal the alternative concepts that the students already have. During the process of the development of the test, studies in the field and…

Descriptors: Achievement Tests, Science Education, Acoustics, Test Construction

DIF Testing for Ordinal Items with Poly-SIBTEST, the Mantel and GMH Tests, and IRT-LR-DIF when the Latent Distribution Is Nonnormal for Both Groups

Peer reviewed

Woods, Carol M. – Applied Psychological Measurement, 2011

Differential item functioning (DIF) occurs when an item on a test, questionnaire, or interview has different measurement properties for one group of people versus another. One way to test items with ordinal response scales for DIF is likelihood ratio (LR) testing using item response theory (IRT), or IRT-LR-DIF. Despite the various advantages of…

Descriptors: Test Bias, Test Items, Item Response Theory, Nonparametric Statistics

DIF Trees: Using Classification Trees to Detect Differential Item Functioning

Peer reviewed

Vaughn, Brandon K.; Wang, Qiu – Educational and Psychological Measurement, 2010

A nonparametric tree classification procedure is used to detect differential item functioning for items that are dichotomously scored. Classification trees are shown to be an alternative procedure to detect differential item functioning other than the use of traditional Mantel-Haenszel and logistic regression analysis. A nonparametric…

Descriptors: Test Bias, Classification, Nonparametric Statistics, Regression (Statistics)

Nonparametric Person-Fit Analysis of Polytomous Item Scores

Peer reviewed

Emons, Wilco H. M. – Applied Psychological Measurement, 2008

Person-fit methods are used to uncover atypical test performance as reflected in the pattern of scores on individual items in a test. Unlike parametric person-fit statistics, nonparametric person-fit statistics do not require fitting a parametric test theory model. This study investigates the effectiveness of generalizations of nonparametric…

Descriptors: Simulation, Nonparametric Statistics, Item Response Theory, Goodness of Fit

Nonparametric Item Evaluation Index

Peer reviewed

Ivens, Stephen H. – Educational and Psychological Measurement, 1971

Descriptors: Difficulty Level, Item Analysis, Nonparametric Statistics, Statistical Analysis

Bootstrapping Selected Item Statistics from a Student-Made Test

Burroughs, Monte – 2002

This study applied nonparametric bootstrapping to test null hypotheses for selected statistics (KR-20, difficulty, and discrimination) derived from a student-made test. The test, administered to 21 students enrolled in a graduate-level educational assessment class, contained 42 items, 33 of which were analyzed. Random permutations of the data…

Descriptors: Graduate Students, Graduate Study, Higher Education, Hypothesis Testing
