ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	1
Since 2006 (last 20 years)	5

Descriptor

Reference Groups	8
Test Items	8
Sample Size	5
Simulation	5
Test Bias	4
Ability	3
Item Response Theory	3
Comparative Analysis	2
Goodness of Fit	2
Item Bias	2
Measurement	2
Models	2
Scores	2
Statistical Distributions	2
Test Length	2
Adaptive Testing	1
African Americans	1
Barriers	1
Cognitive Ability	1
Computer Assisted Testing	1
Computer Software	1
Culture Fair Tests	1
Difficulty Level	1
Educational Practices	1
Educational Testing	1
More ▼

Source

Journal of Educational…	3
Applied Psychological…	1
Educational and Psychological…	1
International Journal of…	1
ProQuest LLC	1

Author

Clauser, Brian	1
Dorans, Neil J.	1
Furlow, Carolyn F.	1
Gagne, Phill	1
Ito, Kyoko	1
Klockars, Alan J.	1
Koon, Sharon	1
Lee, Yoonsun	1
Moses, Timothy P.	1
Puhan, Gautam	1
Roschmann, Sarina	1
Ross, Terris Raiford	1
Smith, Richard M.	1
Sykes, Robert C.	1
Volker, Martin A.	1
Witmer, Sara E.	1
Yu, Lei	1
More ▼

Publication Type

Journal Articles	6
Reports - Evaluative	4
Reports - Research	3
Speeches/Meeting Papers	2
Dissertations/Theses -…	1

Education Level

Elementary Secondary Education

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 8 results Save | Export

Examining Provision and Sufficiency of Testing Accommodations for English Learners

Peer reviewed

Direct link

Roschmann, Sarina; Witmer, Sara E.; Volker, Martin A. – International Journal of Testing, 2021

Accommodations are commonly provided to address language-related barriers students may experience during testing. Research on the validity of scores from accommodated test administrations remains somewhat inconclusive. The current study investigated item response patterns to understand whether accommodations, as used in practice among English…

Descriptors: Testing Accommodations, English Language Learners, Scores, Item Response Theory

The Impact of Multidimensionality on the Detection of Differential Bundle Functioning Using Simultaneous Item Bias Test

Peer reviewed

Direct link

Furlow, Carolyn F.; Ross, Terris Raiford; Gagne, Phill – Applied Psychological Measurement, 2009

Douglas, Roussos, and Stout introduced the concept of differential bundle functioning (DBF) for identifying the underlying causes of differential item functioning (DIF). In this study, reference group was simulated to have higher mean ability than the focal group on a nuisance dimension, resulting in DIF for each of the multidimensional items…

Descriptors: Test Bias, Test Items, Reference Groups, Simulation

A Comparison of Methods for Detecting Differential Distractor Functioning

Direct link

Koon, Sharon – ProQuest LLC, 2010

This study examined the effectiveness of the odds-ratio method (Penfield, 2008) and the multinomial logistic regression method (Kato, Moen, & Thurlow, 2009) for measuring differential distractor functioning (DDF) effects in comparison to the standardized distractor analysis approach (Schmitt & Bleistein, 1987). Students classified as participating…

Descriptors: Test Bias, Test Items, Reference Groups, Lunch Programs

Using Log-Linear Smoothing to Improve Small-Sample DIF Estimation

Peer reviewed

Direct link

Puhan, Gautam; Moses, Timothy P.; Yu, Lei; Dorans, Neil J. – Journal of Educational Measurement, 2009

This study examined the extent to which log-linear smoothing could improve the accuracy of differential item functioning (DIF) estimates in small samples of examinees. Examinee responses from a certification test were analyzed using White examinees in the reference group and African American examinees in the focal group. Using a simulation…

Descriptors: Test Items, Reference Groups, Testing Programs, Raw Scores

Simulated Tests of Differential Item Functioning Using SIBTEST with and without Impact

Peer reviewed

Direct link

Klockars, Alan J.; Lee, Yoonsun – Journal of Educational Measurement, 2008

Monte Carlo simulations with 20,000 replications are reported to estimate the probability of rejecting the null hypothesis regarding DIF using SIBTEST when there is DIF present and/or when impact is present due to differences on the primary dimension to be measured. Sample sizes are varied from 250 to 2000 and test lengths from 10 to 40 items.…

Descriptors: Test Bias, Test Length, Reference Groups, Probability

The Effects of Score Group Width on the Mantel-Haenszel Procedure.

Peer reviewed

Clauser, Brian; And Others – Journal of Educational Measurement, 1994

The effect of reducing the number of score groups in the matching criterion of the Mantel-Haenszel procedure when screening for differential item functioning was investigated with a simulated data set. Results suggest that more than modest reductions cannot be recommended when ability distributions of reference and focal groups differ. (SLD)

Descriptors: Ability, Experimental Groups, Item Bias, Reference Groups

Detecting Item Bias in the Rasch Rating Scale Model.

Peer reviewed

Smith, Richard M. – Educational and Psychological Measurement, 1994

Simulated data are used to assess the appropriateness of using separate calibration and between-fit approaches to detecting item bias in the Rasch rating scale model. Results indicate that Type I error rates for the null distribution hold even when there are different ability levels for reference and focal groups. (SLD)

Descriptors: Ability, Goodness of Fit, Identification, Item Bias

The Effect of Restricting Ability Distributions in the Estimation of Item Difficulties: Implications for a CAT Implementation.

Download full text

Ito, Kyoko; Sykes, Robert C. – 1994

Responses to previously calibrated items administered in a computerized adaptive testing (CAT) mode may be used to recalibrate the items. This live-data simulation study investigated the possibility, and limitations, of on-line adaptive recalibration of precalibrated items. Responses to items of a Rasch-based paper-and-pencil licensure examination…

Descriptors: Ability, Adaptive Testing, Computer Assisted Testing, Difficulty Level