Publication Date
In 2025 | 16 |
Since 2024 | 97 |
Since 2021 (last 5 years) | 273 |
Since 2016 (last 10 years) | 617 |
Since 2006 (last 20 years) | 1413 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
Researchers | 110 |
Practitioners | 107 |
Teachers | 46 |
Administrators | 25 |
Policymakers | 24 |
Counselors | 12 |
Parents | 7 |
Students | 7 |
Support Staff | 4 |
Community | 2 |
Location
California | 60 |
Canada | 60 |
United States | 56 |
Turkey | 47 |
Australia | 43 |
Florida | 34 |
Germany | 26 |
Texas | 26 |
Netherlands | 25 |
China | 24 |
Iran | 21 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Meets WWC Standards without Reservations | 1 |
Meets WWC Standards with or without Reservations | 1 |
Does not meet standards | 1 |
Carmen Köhler; Lale Khorramdel; Artur Pokropek; Johannes Hartig – Journal of Educational Measurement, 2024
For assessment scales applied to different groups (e.g., students from different states; patients in different countries), multigroup differential item functioning (MG-DIF) needs to be evaluated in order to ensure that respondents with the same trait level but from different groups have equal response probabilities on a particular item. The…
Descriptors: Measures (Individuals), Test Bias, Models, Item Response Theory
Catherine Mata; Katharine Meyer; Lindsay Page – Annenberg Institute for School Reform at Brown University, 2024
This article examines the risk of crossover contamination in individual-level randomization, a common concern in experimental research, in the context of a large-enrollment college course. While individual-level randomization is more efficient for assessing program effectiveness, it also increases the potential for control group students to cross…
Descriptors: Chemistry, Science Instruction, Undergraduate Students, Large Group Instruction
Jose Fabian Elizondo-Gonzalez – Research in Pedagogy, 2024
Studies across countries part of the Organization for Economic Cooperation and Development (OECD) suggest that learners' sense of school belonging is often influenced by their place of birth. However, large-scale assessment studies rarely explore whether differences in belonging scores between non-immigrant and immigrant learners are due to test…
Descriptors: Immigrants, Sense of Belonging, Foreign Countries, Evaluation
Muslihin, Heri Yusuf; Suryana, Dodi; Ahman; Suherman, Uman; Dahlan, Tina Hayati – International Journal of Instruction, 2022
Self-determination can affect students to have a positive way of thinking and acting, also to make realistic choices so they can make a decision responsibly. This study aimed to develop a questionnaire to measure student self-determination and validate it. This study was conducted in 2019, involved 406 university students as participants…
Descriptors: Test Validity, Test Reliability, Item Response Theory, Questionnaires
Phillips, Gregory, II; Felt, Dylan; Perez-Bill, Esrea; Ruprecht, Megan M.; Glenn, Erik Elías; Lindeman, Peter; Miller, Robin Lin – American Journal of Evaluation, 2023
Lesbian, gay, bisexual, transgender, queer, intersex, Two-Spirit, and other sexual and gender minority (LGBTQ+) individuals encounter numerous obstacles to equity across health and healthcare, education, housing, employment, and other domains. Such barriers are even greater for LGBTQ+ individuals who are also Black, Indigenous, and People of Color…
Descriptors: Student Evaluation, LGBTQ People, Test Bias, Barriers
Maïano, Christophe; Morin, Alexandre J. S.; Gagnon, Cynthia; Olivier, Elizabeth; Tracey, Danielle; Craven, Rhonda G.; Bouchard, Stéphane – Journal of Autism and Developmental Disorders, 2023
The objective of the study was to validate adapted versions of the Glasgow Anxiety Scale for people with Intellectual Disabilities (GAS-ID) simultaneously developed in English and French. A sample of 361 youth with mild to moderate intellectual disability (ID) (M = 15.78 years) from Australia (English-speaking) and Canada (French-speaking)…
Descriptors: Intellectual Disability, Anxiety, French, English
Basman, Munevver – International Journal of Assessment Tools in Education, 2023
To ensure the validity of the tests is to check that all items have similar results across different groups of individuals. However, differential item functioning (DIF) occurs when the results of individuals with equal ability levels from different groups differ from each other on the same test item. Based on Item Response Theory and Classic Test…
Descriptors: Test Bias, Test Items, Test Validity, Item Response Theory
Fatma Betül Kurnaz; Hüseyin Yildiz – International Journal of Assessment Tools in Education, 2023
Investigating the existence of items with differential item functioning (DIF) may provide more accurate comparisons of group differences in studies that aim to compare scores obtained in a test by groups with different characteristics. In the present study, a scale measuring critical thinking motivation that was adapted to the Turkish culture was…
Descriptors: Test Bias, Foreign Countries, High School Graduates, College Graduates
Wu, Tong – ProQuest LLC, 2023
This three-article dissertation aims to address three methodological challenges to ensure comparability in educational research, including scale linking, test equating, and propensity score (PS) weighting. The first study intends to improve test scale comparability by evaluating the effect of six missing data handling approaches, including…
Descriptors: Educational Research, Comparative Analysis, Equated Scores, Weighted Scores
Procope, Kathryn – ProQuest LLC, 2023
The zero-sum game of education is supported by the implementation standardized testing in a manner that disenfranchises Black students. While assessments are accountability tools for ensuring rigorous teaching and for measuring student learning, the impact of these tests on students' identity needs to be examined. The demands of a knowledge-based…
Descriptors: Middle School Students, African American Students, Males, Standardized Tests
van Rensburg, Clarisse; Mostert, Karina – Journal of Student Affairs in Africa, 2023
Student well-being has gradually become a topic of interest in higher education, and the accurate, valid, and reliable measure of well-being constructs is crucial in the South African context. This study examined item bias and configural, metric and scalar invariance of the Satisfaction with Life Scale (SWLS) for South African first-year…
Descriptors: Life Satisfaction, Measures (Individuals), Foreign Countries, College Freshmen
Gu, Zhengguo; Emons, Wilco H. M.; Sijtsma, Klaas – Journal of Educational and Behavioral Statistics, 2021
Clinical, medical, and health psychologists use difference scores obtained from pretest--posttest designs employing the same test to assess intraindividual change possibly caused by an intervention addressing, for example, anxiety, depression, eating disorder, or addiction. Reliability of difference scores is important for interpreting observed…
Descriptors: Test Reliability, Scores, Pretests Posttests, Computation
Jerrim, John – Review of Education, 2021
PISA is an influential international study of the achievement of 15-year-olds. It has a high profile across the devolved nations of the UK, with the results having a substantial impact upon education policy. Yet many of the technical details underpinning PISA remain poorly understood--particularly amongst non-specialists--including important…
Descriptors: Achievement Tests, Foreign Countries, Secondary School Students, International Assessment
Chairil Anwar Korompot; Iskandarsyah Siregar; Nurislom Iskandarovich Khursanov; Diyorjon Abdullaev; Khaled M. Mohamed – International Journal of Language Testing, 2024
Construct-irrelevant variance is considered as a major threat to validity which indicates the existence of additional unrelated variables that distort the meaning of test scores and cause the test to be biased. Differential item functioning (DIF) analysis is an important technique in examining the validity and fairness of educational tests.…
Descriptors: Test Bias, Test Validity, Culture Fair Tests, Reading Comprehension
Andrew P. Jaciw – American Journal of Evaluation, 2025
By design, randomized experiments (XPs) rule out bias from confounded selection of participants into conditions. Quasi-experiments (QEs) are often considered second-best because they do not share this benefit. However, when results from XPs are used to generalize causal impacts, the benefit from unconfounded selection into conditions may be offset…
Descriptors: Elementary School Students, Elementary School Teachers, Generalization, Test Bias