Meyer, J. Patrick; Dahlin, Michael – NWEA, 2022
The MAP® Growth™ theory of action describes key features of MAP Growth and its position in a comprehensive assessment system. The basic premise of the theory of action is that all students learn when MAP Growth is situated in a comprehensive assessment system and used for its intended purposes to yield information about student learning and enable…
Descriptors: Achievement Tests, Academic Achievement, Achievement Gains, Student Evaluation
Ronkin, Emily; Tully, Erin C.; Branum-Martin, Lee; Cohen, Lindsey L.; Hall, Christine; Dilly, Laura; Tone, Erin B. – Autism: The International Journal of Research and Practice, 2022
The Autism Diagnostic Observation Schedule, 2nd-edition (ADOS-2) Toddler Module is the current gold-standard measure of autism spectrum disorder (ASD), a neurodevelopmental condition more frequently diagnosed in toddler boys than girls. Some evidence suggests that behaviors assessed by the Toddler Module may capture an ASD phenotype that is more…
Descriptors: Diagnostic Tests, Autism Spectrum Disorders, Gender Differences, Interpersonal Communication
Nishizawa, Hitoshi – Language Testing, 2023
In this study, I investigate the construct validity and fairness pertaining to the use of a variety of Englishes in listening test input. I obtained data from a post-entry English language placement test administered at a public university in the United States. In addition to expectedly familiar American English, the test features Hawai'i,…
Descriptors: Construct Validity, Listening Comprehension Tests, Language Tests, English (Second Language)
Representation, Race and Empire: A Postcolonial Analysis of the New York Global History Regents Exam
Shreya Sunderram – Journal of Curriculum Studies, 2023
Postcolonial studies have long identified history curriculum as a site of empire building. High stakes exams like the Global History Regents Exam in New York (NYGHR) undoubtedly impact curriculum but have yet to be examined through a postcolonial lens. This study evaluates to what extent, if at all, the NYGHR perpetuates eurocentrism as defined by…
Descriptors: Postcolonialism, Decolonization, History Instruction, High Stakes Tests
Sam Bamkin – Ethnography and Education, 2024
The iterative process of ethnography not only constructs theory, but its methodology should embody theory. Developing a theoretical framework often demands adjustments in methodology, to leverage previous work and to avoid assumptions compounding through the magnification of blind spots. New theory in policy-engaged ethnography has emphasised the…
Descriptors: Foreign Countries, Teachers, Ethnography, Sampling
Karina Mostert; Clarisse van Rensburg; Reitumetse Machaba – Journal of Applied Research in Higher Education, 2024
Purpose: This study examined the psychometric properties of intention to drop out and study satisfaction measures for first-year South African students. The factorial validity, item bias, measurement invariance and reliability were tested. Design/methodology/approach: A cross-sectional design was used. For the study on intention to drop out, 1,820…
Descriptors: Intention, Potential Dropouts, Student Satisfaction, Test Items
Xuelan Qiu; Jimmy de la Torre; You-Gan Wang; Jinran Wu – Educational Measurement: Issues and Practice, 2024
Multidimensional forced-choice (MFC) items have been found to be useful to reduce response biases in personality assessments. However, conventional scoring methods for the MFC items result in ipsative data, hindering the wider applications of the MFC format. In the last decade, a number of item response theory (IRT) models have been developed,…
Descriptors: Item Response Theory, Personality Traits, Personality Measures, Personality Assessment
Steven Lee; Matthew Schaelling – Society for Research on Educational Effectiveness, 2024
Background: Inequality along racial and economic dimensions is well-documented and widespread in educational contexts. Achievement gaps are observed among children as early as primary school and are especially notable in standardized testing (Fryer & Levitt, 2004; Fryer & Levitt, 2013; Bond & Lang, 2013). In response, some observers and…
Descriptors: Elementary School Students, Middle School Students, Standardized Tests, Achievement Gap
Using Differential Item Functioning to Test for Interrater Reliability in Constructed Response Items
Walker, Cindy M.; Göçer Sahin, Sakine – Educational and Psychological Measurement, 2020
The purpose of this study was to investigate a new way of evaluating interrater reliability that can allow one to determine if two raters differ with respect to their rating on a polytomous rating scale or constructed response item. Specifically, differential item functioning (DIF) analyses were used to assess interrater reliability and compared…
Descriptors: Test Bias, Interrater Reliability, Responses, Correlation
Xue, Kang; Huggins-Manley, Anne Corinne; Leite, Walter – Grantee Submission, 2020
In data collected from virtual learning environments (VLEs), item response theory (IRT) models can be used to guide the ongoing measurement of student ability. However, such applications of IRT rely on unbiased item parameter estimates associated with test items in the VLE. Without formal piloting of the items, one can expect a large amount of…
Descriptors: Virtual Classrooms, Item Response Theory, Test Bias, Test Items
An Intersectional Approach to Differential Item Functioning: Reflecting Configurations of Inequality
Russell, Michael; Kaplan, Larry – Practical Assessment, Research & Evaluation, 2021
Differential Item Functioning (DIF) is commonly employed to examine measurement bias of test scores. Current approaches to DIF compare item functioning separately for select demographic identities such as gender, racial stratification, and economic status. Examining potential item bias fails to recognize and capture the intersecting configurations…
Descriptors: Test Bias, Test Items, Demography, Identification
Schuster, Carolin; Narciss, Susanne; Bilz, Jessica – Social Psychology of Education: An International Journal, 2021
In three experiments (Ns = 327/137/210), we investigated whether test grades and elaborated feedback in a stereotypically male (Math) and a stereotypically female subject (German) are biased by the student's gender. For this purpose, pre-service teachers graded and provided written feedback on tests which were allegedly from boys or girls. In…
Descriptors: Tests, Test Bias, Gender Bias, Sex Stereotypes
Karp Gershon, Sa'ar; Ruipérez-Valiente, José A.; Alexandron, Giora – International Journal of Educational Technology in Higher Education, 2021
The emergence of Massive Open Online Courses (MOOCs) broadened the educational landscape by providing free access to quality learning materials for anyone with a device connected to the Internet. However, open access does not guarantee equal opportunities to learn, and research has repeatedly reported that learners from affluent countries…
Descriptors: Online Courses, Access to Education, Developing Nations, Academic Achievement
Charles Andrews Dahl Jr. – ProQuest LLC, 2021
As the accuracy of evaluations is important to teachers, it is essential to examine their perceptions about accuracy (Cho & Schunn, 2018; Plunkett & Dyson, 2018). Understanding teachers' perceptions of accuracy is significant because an evaluation system's effectiveness depends on the teachers' belief that it is accurate (Lewis, 2018;…
Descriptors: Teacher Evaluation, Public School Teachers, Teacher Attitudes, Accuracy
Sünbül, Seçil Ömür – International Journal of Progressive Education, 2019
This study aims to investigate the effects of various factors on the performance of the methods used to detect differential item functioning (DIF) in the DINA model, one of the Cognitive Diagnosis Models. The current study is limited to the Logistic Regression and Wald test methods, which were used to determine the…
Descriptors: Test Bias, Models, Correlation, Probability