Ana Papiashvili – European Education, 2024
This article examines Georgia's integration into the Bologna Process, focusing on academic social responsibility (ASR). Utilizing qualitative methods, including document analysis and 30 in-depth interviews, the study assesses the evolution of higher education, alignment with Bologna standards, and external quality assurance. Findings reveal…
Descriptors: Foreign Countries, Higher Education, Educational Change, National Standards
Chandler Patton Miranda – International Journal of Qualitative Studies in Education (QSE), 2025
This critical ethnographic case study explores the impact of Performance-Based Assessment Tasks (PBATs) on high school dynamics and instructional practices, particularly in schools serving immigrant communities. PBATs, considered alternatives to standardized testing, have shown promise in enhancing student engagement, critical thinking, and…
Descriptors: High School Students, High School Teachers, Principals, Assistant Principals
Wind, Stefanie A. – Journal of Educational Measurement, 2019
Numerous researchers have proposed methods for evaluating the quality of rater-mediated assessments using nonparametric methods (e.g., kappa coefficients) and parametric methods (e.g., the many-facet Rasch model). Generally speaking, popular nonparametric methods for evaluating rating quality are not based on a particular measurement theory. On…
Descriptors: Nonparametric Statistics, Test Validity, Test Reliability, Item Response Theory
Debelak, Rudolf; Strobl, Carolin – Educational and Psychological Measurement, 2019
M-fluctuation tests are a recently proposed method for detecting differential item functioning in Rasch models. This article discusses a generalization of this method to two additional item response theory models: the two-parametric logistic model and the three-parametric logistic model with a common guessing parameter. The Type I error rate and…
Descriptors: Test Bias, Item Response Theory, Statistical Analysis, Maximum Likelihood Statistics
Luo, Yong; Liang, Xinya – Measurement: Interdisciplinary Research and Perspectives, 2019
Current methods that simultaneously model differential testlet functioning (DTLF) and differential item functioning (DIF) constrain the variances of latent ability and testlet effects to be equal between the focal and the reference groups. Such a constraint can be stringent and unrealistic with real data. In this study, we propose a multigroup…
Descriptors: Test Items, Item Response Theory, Test Bias, Models
Ames, Allison J. – Educational and Psychological Measurement, 2022
Individual response style behaviors, unrelated to the latent trait of interest, may influence responses to ordinal survey items. Response style can introduce bias in the total score with respect to the trait of interest, threatening valid interpretation of scores. Despite claims of response style stability across scales, there has been little…
Descriptors: Response Style (Tests), Individual Differences, Scores, Test Items
Ramadhani, Rahmi; Saragih, Sahat; Napitupulu, E. Elvis – Mathematics Teaching Research Journal, 2022
Statistical reasoning ability is one of the essential skills in developing competence, which is one of the Sustainable Development Goals (SDGs). This study aims to explore the statistical reasoning ability of junior high school students in descriptive statistics learning. The investigation directs students to determine their level of statistical…
Descriptors: Statistics, Thinking Skills, Statistics Education, Junior High School Students
von Zansen, Anna; Hilden, Raili; Laihanen, Emma – International Journal of Listening, 2022
In this study, we used the Rasch measurement to investigate the fairness of the listening section of a national computerized high-stakes English test for differential item functioning (DIF) across gender subgroups. The computerized test format inspired us to investigate whether the items measure listening comprehension differently for females and…
Descriptors: High Stakes Tests, Listening Comprehension Tests, Listening Comprehension, Gender Differences
Taylor, Catherine S. – Teachers College Press, 2022
This book addresses a problem that affects the work of all educators: how traditional methods of assessment undermine the capacity of schools to serve students with diverse cultural and social backgrounds and identities. Anchored in a commonsense notion of validity, this book explains how current K-12 assessment practices are grounded in the…
Descriptors: Student Diversity, Elementary Secondary Education, Cultural Differences, Student Evaluation
Alexandra Nicole Sparks – ProQuest LLC, 2022
The overidentification and misidentification of English Language Learners in special education has been a systemic issue since the 1960s. When students are misidentified for special education, they are denied access to Free Appropriate Public Education and the Least Restrictive Environment. Although there has been countless research depicting…
Descriptors: English Language Learners, Special Education, Disability Identification, Equal Education
Gladstone, Jessica R.; Morell, Monica; Yang, Ji Seung; Ponnock, Annette; Turci Faust, Lara; Wigfield, Allan – Journal of Experimental Education, 2023
Researchers developing questionnaire measures of personality, motivation, and self-regulation constructs related to students' achievement and persistence in STEM or other fields rarely have examined whether the items on the measures used are functioning differently across groups, which is necessary for accurate group comparison. The present study…
Descriptors: Test Bias, STEM Education, Test Items, Student Characteristics
Watson, Sandy – Science Teacher, 2021
In response to increasingly diverse student groups in U.S. schools, educational researchers have developed curricula that respond to students' unique needs, cultures, and experiences. Curricula that embrace a pedagogy of empowerment are known as Culturally Responsive Curricula (CRC), and such curricula specific to science are referred to as…
Descriptors: Student Diversity, Culturally Relevant Education, Lesson Plans, Evaluation Methods
Chris Ryan Nesbitt – ProQuest LLC, 2021
This study explored the best testing practices for the NC Basic Law Enforcement Training (BLET) to determine preparedness for entry-level law enforcement officers. The researcher implemented a mixed-methods approach to answer five research questions based on collected historical data from Rowan-Cabarrus Community College, the North Carolina…
Descriptors: Community College Students, Law Enforcement, Police Education, High Stakes Tests
Sarallah Jafaripour; Omid Tabatabaei; Hadi Salehi; Hossein Vahid Dastjerdi – International Journal of Language Testing, 2024
The purpose of this study was to examine gender and discipline-based Differential Item Functioning (DIF) and Differential Distractor Functioning (DDF) on the Islamic Azad University English Proficiency Test (IAUEPT). The study evaluated DIF and DDF across genders and disciplines using the Rasch model. To conduct DIF and DDF analysis, the examinees…
Descriptors: Item Response Theory, Test Items, Language Tests, Language Proficiency
French, Brian F.; Vo, Thao T. – Journal of Psychoeducational Assessment, 2020
The Washington Assessment of Risk and Needs of Students (WARNS) is a brief self-report measure designed for schools, courts, and youth service providers to identify student behaviors and contexts related to school truancy. Empirical support for WARNS item invariance between ethnic groups is lacking. This study examined differential item…
Descriptors: Truancy, Student Behavior, Test Bias, Measures (Individuals)

