ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	0
Since 2006 (last 20 years)	3

Descriptor

Ability	7
Comparative Analysis	7
Test Length	7
Sample Size	5
Simulation	4
Error of Measurement	3
Item Response Theory	3
Statistical Bias	3
Statistical Distributions	3
Test Items	3
Differences	2
Difficulty Level	2
Estimation (Mathematics)	2
Foreign Countries	2
Item Bias	2
Maximum Likelihood Statistics	2
Methods	2
Achievement Tests	1
Analysis of Variance	1
Bayesian Statistics	1
Computation	1
Equated Scores	1
Equations (Mathematics)	1
Mathematical Models	1
Models	1
More ▼

Source

ETS Research Report Series	1
Educational Sciences: Theory…	1
Educational and Psychological…	1
ProQuest LLC	1

Author

Arsan, Nihan	1
Atalay Kabasakal, Kübra	1
Gök, Bilge	1
Kelecioglu, Hülya	1
Kim, Seock-Ho	1
Lee, Yi-Hsuan	1
Pommerich, Mary	1
Schumacker, Randall E.	1
Seong, Tae-Je	1
Wang, Wei	1
Whitmore, Marjorie L.	1
Zhang, Jinming	1
More ▼

Publication Type

Reports - Evaluative	4
Journal Articles	3
Speeches/Meeting Papers	3
Reports - Research	2
Dissertations/Theses -…	1

Education Level

Audience

Location

Turkey

Laws, Policies, & Programs

Assessments and Surveys

Advanced Placement…

What Works Clearinghouse Rating

Showing all 7 results Save | Export

Comparing Performances (Type I Error and Power) of IRT Likelihood Ratio SIBTEST and Mantel-Haenszel Methods in the Determination of Differential Item Functioning

Peer reviewed
PDF on ERIC

Download full text

Atalay Kabasakal, Kübra; Arsan, Nihan; Gök, Bilge; Kelecioglu, Hülya – Educational Sciences: Theory and Practice, 2014

This simulation study compared the performances (Type I error and power) of Mantel-Haenszel (MH), SIBTEST, and item response theory-likelihood ratio (IRT-LR) methods under certain conditions. Manipulated factors were sample size, ability differences between groups, test length, the percentage of differential item functioning (DIF), and underlying…

Descriptors: Comparative Analysis, Item Response Theory, Statistical Analysis, Test Bias

Mixed-Format Test Score Equating: Effect of Item-Type Multidimensionality, Length and Composition of Common-Item Set, and Group Ability Difference

Direct link

Wang, Wei – ProQuest LLC, 2013

Mixed-format tests containing both multiple-choice (MC) items and constructed-response (CR) items are now widely used in many testing programs. Mixed-format tests often are considered to be superior to tests containing only MC items although the use of multiple item formats leads to measurement challenges in the context of equating conducted under…

Descriptors: Equated Scores, Test Format, Test Items, Test Length

Comparing Different Approaches of Bias Correction for Ability Estimation in IRT Models. Research Report. ETS RR-08-13

Peer reviewed
PDF on ERIC

Download full text

Lee, Yi-Hsuan; Zhang, Jinming – ETS Research Report Series, 2008

The method of maximum-likelihood is typically applied to item response theory (IRT) models when the ability parameter is estimated while conditioning on the true item parameters. In practice, the item parameters are unknown and need to be estimated first from a calibration sample. Lewis (1985) and Zhang and Lu (2007) proposed the expected response…

Descriptors: Item Response Theory, Comparative Analysis, Computation, Ability

A Comparison of Logistic Regression and Analysis of Variance Differential Item Functioning Decision Methods.

Peer reviewed

Whitmore, Marjorie L.; Schumacker, Randall E. – Educational and Psychological Measurement, 1999

Compared differential item functioning detection rates for logistic regression and analysis of variance for dichotomously scored items using simulated data and varying test length, sample size, discrimination rate, and underlying ability. Explains why the logistic regression method is recommended for most applications. (SLD)

Descriptors: Ability, Analysis of Variance, Comparative Analysis, Item Bias

An Analytical Evaluation of Two Common-Odds Ratios as Population Indicators of DIF.

Download full text

Pommerich, Mary; And Others – 1995

The Mantel-Haenszel (MH) statistic for identifying differential item functioning (DIF) commonly conditions on the observed test score as a surrogate for conditioning on latent ability. When the comparison group distributions are not completely overlapping (i.e., are incongruent), the observed score represents different levels of latent ability…

Descriptors: Ability, Comparative Analysis, Difficulty Level, Item Bias

A Comparison of Procedures for Ability Estimation under the Graded Response Model.

Download full text

Seong, Tae-Je; And Others – 1997

This study was designed to compare the accuracy of three commonly used ability estimation procedures under the graded response model. The three methods, maximum likelihood (ML), expected a posteriori (EAP), and maximum a posteriori (MAP), were compared using a recovery study design for two sample sizes, two underlying ability distributions, and…

Descriptors: Ability, Comparative Analysis, Difficulty Level, Estimation (Mathematics)

An Investigation of Hierarchical Bayes Procedures in Item Response Theory.

Download full text

Kim, Seock-Ho; And Others – 1992

Hierarchical Bayes procedures were compared for estimating item and ability parameters in item response theory. Simulated data sets from the two-parameter logistic model were analyzed using three different hierarchical Bayes procedures: (1) the joint Bayesian with known hyperparameters (JB1); (2) the joint Bayesian with information hyperpriors…

Descriptors: Ability, Bayesian Statistics, Comparative Analysis, Equations (Mathematics)