Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 1 |
Since 2006 (last 20 years) | 4 |
Descriptor
Differences | 4 |
Error of Measurement | 4 |
Test Length | 4 |
Ability | 3 |
Comparative Analysis | 3 |
Sample Size | 3 |
Test Bias | 3 |
Item Response Theory | 2 |
Simulation | 2 |
Test Items | 2 |
True Scores | 2 |
More ▼ |
Source
Educational Sciences: Theory… | 1 |
International Journal of… | 1 |
Journal of Educational and… | 1 |
ProQuest LLC | 1 |
Author
Arsan, Nihan | 1 |
Atalay Kabasakal, Kübra | 1 |
DeMars, Christine E. | 1 |
Gök, Bilge | 1 |
Kelecioglu, Hülya | 1 |
Lee, Yi-Hsuan | 1 |
Wang, Wei | 1 |
Zhang, Jinming | 1 |
Publication Type
Journal Articles | 3 |
Reports - Research | 3 |
Dissertations/Theses -… | 1 |
Education Level
Audience
Location
Turkey | 1 |
Laws, Policies, & Programs
Assessments and Surveys
Advanced Placement… | 1 |
What Works Clearinghouse Rating
Lee, Yi-Hsuan; Zhang, Jinming – International Journal of Testing, 2017
Simulations were conducted to examine the effect of differential item functioning (DIF) on measurement consequences such as total scores, item response theory (IRT) ability estimates, and test reliability in terms of the ratio of true-score variance to observed-score variance and the standard error of estimation for the IRT ability parameter. The…
Descriptors: Test Bias, Test Reliability, Performance, Scores
Atalay Kabasakal, Kübra; Arsan, Nihan; Gök, Bilge; Kelecioglu, Hülya – Educational Sciences: Theory and Practice, 2014
This simulation study compared the performances (Type I error and power) of Mantel-Haenszel (MH), SIBTEST, and item response theory-likelihood ratio (IRT-LR) methods under certain conditions. Manipulated factors were sample size, ability differences between groups, test length, the percentage of differential item functioning (DIF), and underlying…
Descriptors: Comparative Analysis, Item Response Theory, Statistical Analysis, Test Bias
Wang, Wei – ProQuest LLC, 2013
Mixed-format tests containing both multiple-choice (MC) items and constructed-response (CR) items are now widely used in many testing programs. Mixed-format tests often are considered to be superior to tests containing only MC items although the use of multiple item formats leads to measurement challenges in the context of equating conducted under…
Descriptors: Equated Scores, Test Format, Test Items, Test Length
DeMars, Christine E. – Journal of Educational and Behavioral Statistics, 2009
The Mantel-Haenszel (MH) and logistic regression (LR) differential item functioning (DIF) procedures have inflated Type I error rates when there are large mean group differences, short tests, and large sample sizes.When there are large group differences in mean score, groups matched on the observed number-correct score differ on true score,…
Descriptors: Regression (Statistics), Test Bias, Error of Measurement, True Scores