Publication Date
| In 2026 | 0 |
| Since 2025 | 34 |
| Since 2022 (last 5 years) | 221 |
| Since 2017 (last 10 years) | 566 |
| Since 2007 (last 20 years) | 1373 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Researchers | 110 |
| Practitioners | 107 |
| Teachers | 46 |
| Administrators | 25 |
| Policymakers | 24 |
| Counselors | 12 |
| Parents | 7 |
| Students | 7 |
| Support Staff | 4 |
| Community | 2 |
Location
| California | 61 |
| Canada | 60 |
| United States | 57 |
| Turkey | 47 |
| Australia | 43 |
| Florida | 34 |
| Germany | 26 |
| Texas | 26 |
| China | 25 |
| Netherlands | 25 |
| Iran | 22 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 1 |
| Meets WWC Standards with or without Reservations | 1 |
| Does not meet standards | 1 |
Ato Kwamina Arhin – Acta Educationis Generalis, 2024
Introduction: This article aimed at digging deep into distractors used for mathematics multiple-choice items. The quality of distractors may be more important than their number and the stem in a multiple-choice question. Little attention is given to this aspect of item writing especially, mathematics multiple-choice questions. This article…
Descriptors: Testing, Multiple Choice Tests, Test Items, Mathematics Tests
Angela Johnson; Elizabeth Barker; Marcos Viveros Cespedes – Educational Measurement: Issues and Practice, 2024
Educators and researchers strive to build policies and practices on data and evidence, especially on academic achievement scores. When assessment scores are inaccurate for specific student populations or when scores are inappropriately used, even data-driven decisions will be misinformed. To maximize the impact of the research-practice-policy…
Descriptors: Equal Education, Inclusion, Evaluation Methods, Error of Measurement
Ser Ming Mark Lee; Wei Cheng Liu – Asia Pacific Journal of Education, 2024
Programme evaluation has developed tremendously over the past 50 years, with a proliferation of evaluation research, an increase in the institutionalization of evaluation, and growth in the professionalization of evaluation. However, existing research and developments are still largely in North America, Europe, Australia, and New Zealand, with…
Descriptors: Foreign Countries, Evaluation Research, Evaluation Methods, Evaluation Criteria
Vahid Aryadoust – Applied Linguistics, 2024
I analyzed a corpus of the international English language testing system (IELTS) comprising 256 listening sections (1996-2021). The primary objective of the study was to gain insights into the assumptions made by test designers regarding the real-life contexts that test-takers will encounter. Overall, 15 superordinate topic areas and 300 subtopics…
Descriptors: Dialects, Pronunciation, Commercialization, Second Language Learning
Fajer Shamsaldeen; Jue Wang; Soyeon Ahn – Language Testing in Asia, 2024
The use of college entrance exams for facilitating admission decisions become controversial, and the central argument is around the fairness of test scores. The Kuwait University English Aptitude Test (KUEAT) is a high-stakes test, but very few studies have examined the psychometric quality of the scores for this national-level assessment. This…
Descriptors: Test Bias, High Stakes Tests, College Entrance Examinations, Foreign Countries
Liu, Vivienne Yi-Yu; Lim, Sok Mui – Asia Pacific Journal of Education, 2022
Although the Brief Resilience Scale (BRS) has been extensively adapted worldwide, work on the generalizability of the original English BRS to Asian populations remains limited. This research evaluated the psychometric properties of the English BRS through two studies with Singaporean undergraduate freshmen (Study 1 n = 839; Study 2 n = 1,068)…
Descriptors: Foreign Countries, Psychometrics, College Freshmen, Resilience (Psychology)
Yaghi, Esra; Ryan, Jonathon – TESOL Quarterly: A Journal for Teachers of English to Speakers of Other Languages and of Standard English as a Second Dialect, 2022
Applied linguists have increasingly focused on how the lives of English language teachers and learners are shaped by race and its intersections with other marginalized identities (e.g. Von Esch et al, 2020). Curiously overlooked, however, are the experiences of one particularly stigmatized group in English-majority countries: Muslim women veiled…
Descriptors: Muslims, Islam, Social Bias, Test Bias
Joo, Sean; Ali, Usama; Robin, Frederic; Shin, Hyo Jeong – Large-scale Assessments in Education, 2022
We investigated the potential impact of differential item functioning (DIF) on group-level mean and standard deviation estimates using empirical and simulated data in the context of large-scale assessment. For the empirical investigation, PISA 2018 cognitive domains (Reading, Mathematics, and Science) data were analyzed using Jackknife sampling to…
Descriptors: Test Items, Item Response Theory, Scores, Student Evaluation
Taroucha T. Williams – ProQuest LLC, 2023
A court decision in California, Larry P. v. Riles (1979) case, ruled in favor of African American students who were disproportionately and wrongly placed in special education (E.M.R. -- educable mentally retarded) classes. Standardized intelligence tests were biased, discriminatory and failed to identify the academic need to support African…
Descriptors: Court Litigation, Educational Legislation, African American Students, Disproportionate Representation
Daniel R. Isbell; Benjamin Kremmel; Jieun Kim – Language Assessment Quarterly, 2023
In the wake of the COVID-19 boom in remote administration of language tests, it appears likely that remote administration will be a permanent fixture in the language testing landscape. Accordingly, language test providers, stakeholders, and researchers must grapple with the implications of remote proctoring on valid, fair, and just uses of tests.…
Descriptors: Distance Education, Supervision, Language Tests, Culture Fair Tests
Sünbül, Seçil Ömür – International Journal of Progressive Education, 2019
In this study, it is aimed to investigate the effects of various factors on the performance of the methods used in the determination of differential item functioning (DIF) in the DINA model included in the Cognitive Diagnosis Models. The current study is limited with Logistic Regression and Wald test methods which were used to determine the…
Descriptors: Test Bias, Models, Correlation, Probability
Zhiqiang Yang; Chengyuan Yu – Asia Pacific Education Review, 2025
This study investigated the test fairness of the translation section of a large-scale English test in China by examining its Differential Test Functioning (DTF) and Differential Item Functioning (DIF) across gender and major. Regarding DTF, the entire translation section exhibits partial strong measurement invariance across female and male…
Descriptors: Multiple Choice Tests, Test Items, Scoring, Translation
Hepperlen, Renee A.; Rabaey, Paula; Hearst, Mary O. – Journal of Applied Research in Intellectual Disabilities, 2020
Background: Families of children with disabilities often face unique challenges. Developed in a U.S. context, the Beach Center Family Quality of Life measure assesses the effectiveness of supports and services that families receive. This study examines whether items from three sub-scales of the Beach Center instrument perform similarly for two…
Descriptors: Cross Cultural Studies, Test Validity, Family Life, Quality of Life
Altintas, Ozge; Wallin, Gabriel – International Journal of Assessment Tools in Education, 2021
Educational assessment tests are designed to measure the same psychological constructs over extended periods. This feature is important considering that test results are often used for admittance to university programs. To ensure fair assessments, especially for those whose results weigh heavily in selection decisions, it is necessary to collect…
Descriptors: College Admission, College Entrance Examinations, Test Bias, Equated Scores
Diaz, Emily; Brooks, Gordon; Johanson, George – International Journal of Assessment Tools in Education, 2021
This Monte Carlo study assessed Type I error in differential item functioning analyses using Lord's chi-square (LC), Likelihood Ratio Test (LRT), and Mantel-Haenszel (MH) procedure. Two research interests were investigated: item response theory (IRT) model specification in LC and the LRT and continuity correction in the MH procedure. This study…
Descriptors: Test Bias, Item Response Theory, Statistical Analysis, Comparative Analysis

Peer reviewed
Direct link
