Publication Date
In 2025 | 0 |
Since 2024 | 2 |
Since 2021 (last 5 years) | 6 |
Since 2016 (last 10 years) | 14 |
Since 2006 (last 20 years) | 38 |
Descriptor
Reading Tests | 57 |
Test Bias | 57 |
Test Items | 57 |
Foreign Countries | 15 |
Item Analysis | 15 |
Reading Comprehension | 14 |
Item Response Theory | 13 |
Mathematics Tests | 12 |
Test Validity | 12 |
English (Second Language) | 11 |
Comparative Analysis | 10 |
More ▼ |
Source
Author
Abedi, Jamal | 2 |
Baghaei, Purya | 2 |
Ercikan, Kadriye | 2 |
Hambleton, Ronald K. | 2 |
Kao, Jenny C. | 2 |
Lee, Yoonsun | 2 |
Leon, Seth | 2 |
Oliveri, Maria Elena | 2 |
Taylor, Catherine S. | 2 |
Thurlow, Martha L. | 2 |
Zumbo, Bruno D. | 2 |
More ▼ |
Publication Type
Education Level
Elementary Education | 11 |
Elementary Secondary Education | 8 |
Grade 3 | 7 |
Higher Education | 7 |
Secondary Education | 7 |
Grade 4 | 6 |
Grade 7 | 6 |
Middle Schools | 6 |
Grade 8 | 5 |
High Schools | 5 |
Junior High Schools | 5 |
More ▼ |
Audience
Researchers | 3 |
Location
Canada | 3 |
Florida | 3 |
Iran | 3 |
Taiwan | 3 |
California | 2 |
Germany | 2 |
Hong Kong | 2 |
Kuwait | 2 |
Qatar | 2 |
Australia | 1 |
Botswana | 1 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Paula Elosua – Language Assessment Quarterly, 2024
In sociolinguistic contexts where standardized languages coexist with regional dialects, the study of differential item functioning is a valuable tool for examining certain linguistic uses or varieties as threats to score validity. From an ecological perspective, this paper describes three stages in the study of differential item functioning…
Descriptors: Reading Tests, Reading Comprehension, Scores, Test Validity
Nikola Ebenbeck; Markus Gebhardt – Journal of Special Education Technology, 2024
Technologies that enable individualization for students have significant potential in special education. Computerized Adaptive Testing (CAT) refers to digital assessments that automatically adjust their difficulty level based on students' abilities, allowing for personalized, efficient, and accurate measurement. This article examines whether CAT…
Descriptors: Computer Assisted Testing, Students with Disabilities, Special Education, Grade 3
Allen, David; Nakamura, Keita – Language Testing, 2023
Although there is abundant evidence for the use of first-language (L1) knowledge by bilinguals when using a second language (L2), investigation into the impact of L1 knowledge in large-scale L2 language assessments and discussion of how such impact may be controlled has received little attention in the language assessment literature. This study…
Descriptors: Language Tests, Second Language Learning, Contrastive Linguistics, English (Second Language)
Moradi, Elahe; Ghabanchi, Zargham; Pishghadam, Reza – Language Testing in Asia, 2022
Given the significance of the test fairness, this study aimed to investigate a reading comprehension test for evidence of differential item functioning (DIF) based on English as a Foreign Language (EFL) learners' gender and their mode of learning (conventional vs. distance learning). To this end, 514 EFL learners were asked to take a 30-item…
Descriptors: Reading Comprehension, Test Bias, Test Items, Second Language Learning
Moghadam, M.; Nasirzadeh, F. – Language Testing in Asia, 2020
The present study tries to investigate the fairness of an English reading comprehension test employing Kunnan's (2004) test fairness framework (TFF) as the most comprehensive model available for test fairness. The participants of this study comprised 300 freshman students taking general English course chosen based on the availability sampling,…
Descriptors: Test Bias, Reading Tests, Reading Comprehension, Test Items
Lazarus, Sheryl S.; Johnstone, Christopher J.; Liu, Kristin K.; Thurlow, Martha L.; Hinkle, Andrew R.; Burden, Kathryn – National Center on Educational Outcomes, 2022
This "Guide" is an update to the State Guide to Universally Designed Assessments produced by the National Center on Educational Outcomes (NCEO) in 2006 (Johnstone et al.). It provides a brief overview of what a universally designed assessment is, followed by a set of steps for states to consider when designing and developing, or…
Descriptors: Alternative Assessment, Educational Assessment, Test Construction, Summative Evaluation
Toroujeni, Seyyed Morteza Hashemi – Education and Information Technologies, 2022
Score interchangeability of Computerized Fixed-Length Linear Testing (henceforth CFLT) and Paper-and-Pencil-Based Testing (henceforth PPBT) has become a controversial issue over the last decade when technology has meaningfully restructured methods of the educational assessment. Given this controversy, various testing guidelines published on…
Descriptors: Computer Assisted Testing, Reading Tests, Reading Comprehension, Scoring
Wedman, Jonathan – Scandinavian Journal of Educational Research, 2018
Gender fairness in testing can be impeded by the presence of differential item functioning (DIF), which potentially causes test bias. In this study, the presence and causes of gender-related DIF were investigated with real data from 800 items answered by 250,000 test takers. DIF was examined using the Mantel-Haenszel and logistic regression…
Descriptors: Gender Differences, College Entrance Examinations, Test Items, Vocabulary
Sheybani, Elias; Zeraatpishe, Mitra – International Journal of Language Testing, 2018
Test method is deemed to affect test scores along with examinee ability (Bachman, 1996). In this research the role of method facet in reading comprehension tests is studied. Bachman divided method facet into five categories, one category is the nature of input and the nature of expected response. This study examined the role of method effect in…
Descriptors: Reading Comprehension, Reading Tests, Test Items, Test Format
Zumbo, Bruno D.; Liu, Yan; Wu, Amery D.; Shear, Benjamin R.; Olvera Astivia, Oscar L.; Ark, Tavinder K. – Language Assessment Quarterly, 2015
Methods for detecting differential item functioning (DIF) and item bias are typically used in the process of item analysis when developing new measures; adapting existing measures for different populations, languages, or cultures; or more generally validating test score inferences. In 2007 in "Language Assessment Quarterly," Zumbo…
Descriptors: Test Bias, Test Items, Holistic Approach, Models
Baghaei, Purya; Kubinger, Klaus D. – Practical Assessment, Research & Evaluation, 2015
The present paper gives a general introduction to the linear logistic test model (Fischer, 1973), an extension of the Rasch model with linear constraints on item parameters, along with eRm (an R package to estimate different types of Rasch models; Mair, Hatzinger, & Mair, 2014) functions to estimate the model and interpret its parameters. The…
Descriptors: Item Response Theory, Models, Test Validity, Hypothesis Testing
Baghaei, Purya; Ravand, Hamdollah – SAGE Open, 2019
In many reading comprehension tests, different test formats are employed. Two commonly used test formats to measure reading comprehension are sustained passages followed by some questions and cloze items. Individual differences in handling test format peculiarities could constitute a source of score variance. In this study, a bifactor Rasch model…
Descriptors: Cloze Procedure, Test Bias, Individual Differences, Difficulty Level
Li, Sylvia; Meyer, Patrick – NWEA, 2019
This simulation study examines the measurement precision, item exposure rates, and the depth of the MAP® Growth™ item pools under various grade-level restrictions. Unlike most summative assessments, MAP Growth allows examinees to see items from any grade level, regardless of the examinee's actual grade level. It does not limit the test to items…
Descriptors: Achievement Tests, Item Banks, Test Items, Instructional Program Divisions
Dahlke, Katie; Yang, Rui; Martínez, Carmen; Chavez, Suzette; Martin, Alejandra; Hawkinson, Laura; Shields, Joseph; Garland, Marshall; Carle, Jill – Regional Educational Laboratory Southwest, 2017
The New Mexico Public Education Department developed the Kindergarten Observation Tool (KOT) as a multidimensional observational measure of students' knowledge and skills at kindergarten entry. The primary purpose of the KOT is to inform instruction, so that kindergarten teachers can use the information about their students' knowledge and skills…
Descriptors: Test Validity, Observation, Measures (Individuals), Kindergarten
Farrington, Amber L.; Lonigan, Christopher J. – Journal of Learning Disabilities, 2015
Children's emergent literacy skills are highly predictive of later reading abilities. To determine which children have weaker emergent literacy skills and are in need of intervention, it is necessary to assess emergent literacy skills accurately and reliably. In this study, 1,351 children were administered the "Revised Get Ready to…
Descriptors: Emergent Literacy, Preschool Children, Reading Tests, Item Response Theory