Cho, Sun-Joo; Suh, Youngsuk; Lee, Woo-yeol – Educational Measurement: Issues and Practice, 2016
The purpose of this ITEMS module is to provide an introduction to differential item functioning (DIF) analysis using mixture item response models. The mixture item response models for DIF analysis involve comparing item profiles across latent groups, instead of manifest groups. First, an overview of DIF analysis based on latent groups, called…
Descriptors: Test Bias, Research Methodology, Evaluation Methods, Models
Braumoeller, Bear F. – Sociological Methods & Research, 2017
Fuzzy-set qualitative comparative analysis (fsQCA) has become one of the most prominent methods in the social sciences for capturing causal complexity, especially for scholars with small- and medium-"N" data sets. This research note explores two key assumptions in fsQCA's methodology for testing for necessary and sufficient…
Descriptors: Qualitative Research, Comparative Analysis, Social Science Research, Research Methodology
Kim, Eun Sook; Yoon, Myeongsun – Structural Equation Modeling: A Multidisciplinary Journal, 2011
This study investigated two major approaches in testing measurement invariance for ordinal measures: multiple-group categorical confirmatory factor analysis (MCCFA) and item response theory (IRT). Unlike the ordinary linear factor analysis, MCCFA can appropriately model the ordered-categorical measures with a threshold structure. A simulation…
Descriptors: Measurement, Factor Analysis, Item Response Theory, Comparative Analysis
DeMars, Christine E. – Applied Measurement in Education, 2011
Three types of effect sizes for DIF are described in this exposition: log of the odds-ratio (differences in log-odds), differences in probability-correct, and proportion of variance accounted for. Using these indices involves conceptualizing the degree of DIF in different ways. This integrative review discusses how these measures are impacted in…
Descriptors: Effect Size, Test Bias, Probability, Difficulty Level
Gandy, Sandra E. – Reading & Writing Quarterly, 2013
With the increasing amount of testing taking place in classrooms, teachers may question how appropriate those assessments are for the growing numbers of English language learners (ELLs) in the United States. One of the assessment options for classroom teachers is the informal reading inventory (IRI), which is the most frequently used assessment…
Descriptors: Informal Reading Inventories, English Language Learners, Student Evaluation, Standardized Tests
Moses, Tim; Miao, Jing; Dorans, Neil J. – Journal of Educational and Behavioral Statistics, 2010
In this study, the accuracies of four strategies were compared for estimating conditional differential item functioning (DIF), including raw data, logistic regression, log-linear models, and kernel smoothing. Real data simulations were used to evaluate the estimation strategies across six items, DIF and No DIF situations, and four sample size…
Descriptors: Test Bias, Statistical Analysis, Computation, Comparative Analysis
Babiar, Tasha Calvert – Journal of Applied Measurement, 2011
Traditionally, women and minorities have not been fully represented in science and engineering. Numerous studies have attributed these differences to gaps in science achievement as measured by various standardized tests. Rather than describe mean group differences in science achievement across multiple cultures, this study focused on an in-depth…
Descriptors: Test Bias, Science Achievement, Standardized Tests, Grade 8
Wiberg, Marie – International Journal of Testing, 2009
The aim of this study was to examine log linear modelling (LLM) compared with logistic regression (LR) and Mantel-Haenszel (MH) test for detecting Differential Item Functioning (DIF) in a mastery test. The three methods were chosen because they have similar components. The results showed fairly high matching percentages together with high…
Descriptors: Test Bias, Mastery Tests, Comparative Analysis, Regression (Statistics)
Rasch Analysis of the Assessment of Children's Hand Skills in Children with and without Disabilities
Chien, Chi-Wen; Brown, Ted; McDonald, Rachael – Research in Developmental Disabilities: A Multidisciplinary Journal, 2011
The Assessment of Children's Hand Skills (ACHS) is a new assessment tool that utilizes a naturalistic observational method to capture children's real-life hand skill performance when engaging in various types of activities. The ACHS also intends to be used with both typically developing children and those presenting with disabilities. The purpose…
Descriptors: Test Items, Construct Validity, Test Bias, Disabilities
Wetzel, Eunike; Hell, Benedikt; Passler, Katja – Journal of Career Assessment, 2012
Three test construction strategies are described and illustrated in the development of the Verb Interest Test (VIT), an inventory that assesses vocational interests using verbs. Verbs might be a promising alternative to the descriptions of occupational activities used in most vocational interest inventories because they are context-independent,…
Descriptors: Test Construction, Culture Fair Tests, Vocational Interests, Interest Inventories
Vaughn, Brandon K.; Wang, Qiu – Educational and Psychological Measurement, 2010
A nonparametric tree classification procedure is used to detect differential item functioning for items that are dichotomously scored. Classification trees are shown to be an alternative procedure to detect differential item functioning other than the use of traditional Mantel-Haenszel and logistic regression analysis. A nonparametric…
Descriptors: Test Bias, Classification, Nonparametric Statistics, Regression (Statistics)
Moses, Tim; Miao, Jing; Dorans, Neil – Educational Testing Service, 2010
This study compared the accuracies of four differential item functioning (DIF) estimation methods, where each method makes use of only one of the following: raw data, logistic regression, loglinear models, or kernel smoothing. The major focus was on the estimation strategies' potential for estimating score-level, conditional DIF. A secondary focus…
Descriptors: Test Bias, Statistical Analysis, Computation, Scores
Jiang, Bo; Xu, Xiaoying; Garcia, Alicia; Lewis, Jennifer E. – Journal of Chemical Education, 2010
The Test of Logical Thinking (TOLT) and the Group Assessment of Logical Thinking (GALT) are two of the instruments most widely used by science educators and researchers to measure students' formal reasoning abilities. Based on Piaget's cognitive development theory, formal thinking ability has been shown to be essential for student achievement in…
Descriptors: Test Bias, Test Reliability, Chemistry, Logical Thinking
Finch, Holmes; Barton, Karen; Meyer, Patrick – Educational Assessment, 2009
The No Child Left Behind Act resulted in an increased reliance on large-scale standardized tests to assess the progress of individual students as well as schools. In addition, emphasis was placed on including all students in the testing programs as well as those with disabilities. As a result, the role of testing accommodations has become more…
Descriptors: Test Bias, Testing Accommodations, Standardized Tests, Mathematics Tests
Wuang, Yee-Pay; Wang, Li-Chen; Su, Chwen-Yng – Research in Developmental Disabilities: A Multidisciplinary Journal, 2010
The aim of this study was to examine the validation of the Hooper Visual Organization Test (HVOT) for use in children by testing for item fit, unidimensionality, item hierarchy, reliability, and screening capacity. A modified scoring system was devised for the HVOT so that children received some credit for being able to describe the function of…
Descriptors: Test Bias, Down Syndrome, Scoring, Item Response Theory