Publication Date
In 2025 | 319 |
Since 2024 | 1250 |
Since 2021 (last 5 years) | 5060 |
Since 2016 (last 10 years) | 13571 |
Since 2006 (last 20 years) | 29500 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
Policymakers | 492 |
Practitioners | 488 |
Researchers | 348 |
Teachers | 332 |
Administrators | 187 |
Parents | 68 |
Community | 67 |
Students | 44 |
Counselors | 33 |
Media Staff | 7 |
Support Staff | 3 |
More ▼ |
Location
Turkey | 1153 |
Texas | 784 |
California | 733 |
Florida | 596 |
United States | 563 |
Canada | 510 |
Australia | 499 |
China | 475 |
North Carolina | 438 |
New York | 382 |
United Kingdom | 371 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Meets WWC Standards without Reservations | 65 |
Meets WWC Standards with or without Reservations | 112 |
Does not meet standards | 116 |
Purwoko Haryadi Santoso; Bayu Setiaji; Wahyudi; Johan Syahbrudin; Syamsul Bahri; Fathurrahman; A. Suci Rizky Ananda; Yusuf Sodhiqin – Physical Review Physics Education Research, 2024
The Force Concept Inventory (FCI) is one of the research-based assessments established by the physics education research community to measure students' understanding of Newtonian mechanics. Former works have often recorded the notion of gendered mean FCI scores favoring male students notably in the North American (NA) based studies. Nevertheless,…
Descriptors: Gender Differences, Physics, Science Instruction, Science Tests
Aparajita Jaiswal; Muna Sapkota; Kris Acheson – International Journal of STEM Education, 2024
Background: Working and interacting with people from diverse backgrounds have become common in Engineering. Research has indicated that engineering graduates face challenges while working with a diverse workforce. Therefore, it is vital for higher education institutions to help engineering students develop intercultural competence skills by…
Descriptors: Program Effectiveness, Study Abroad, Cultural Awareness, Competence
Ting Sun; Stella Yun Kim – Educational and Psychological Measurement, 2024
Equating is a statistical procedure used to adjust for the difference in form difficulty such that scores on those forms can be used and interpreted comparably. In practice, however, equating methods are often implemented without considering the extent to which two forms differ in difficulty. The study aims to examine the effect of the magnitude…
Descriptors: Difficulty Level, Data Interpretation, Equated Scores, High School Students
Sadao Otsuka; Toshiya Murai – Reading and Writing: An Interdisciplinary Journal, 2024
There is widespread concern about declining literacy skills in recent young Japanese. The present study investigated how higher-level reading and writing proficiencies are underpinned by basic literacy skills in Japanese adolescents. From a large database of the most popular literacy exams in Japan, we retrospectively analyzed word- and text-level…
Descriptors: Foreign Countries, Literacy, Test Score Decline, Data
Juwel Ahmed Sarker; Josh McGee; Gema Zamarro; Andrew Camp – Society for Research on Educational Effectiveness, 2024
Background: Teacher quality matters for student achievement (Coleman, 1968; Rivkin et al., 2005; Rockoff, 2004; Aaronson et al., 2007) and later career success (Chetty et al., 2014). States use licensure exams as a quality screen believing that they are predictive of teaching effectiveness (Council et al., 2001). However, the evidence on the…
Descriptors: Teacher Certification, Licensing Examinations (Professions), Scores, Employment
Richard Churches; Kate Wastie; Max Jones; Nina Dhillon – Education Development Trust, 2024
This report provides advice to policymakers and school leaders on the use of assessment centres as part of a teacher selection approach. It discusses the relationship between assessment centre scores prior to joining teaching, and teacher effectiveness over a six-year period. It draws from various stages of a wider research project, the overall…
Descriptors: Beginning Teachers, Classroom Techniques, Prediction, Teacher Selection
Under the Weather? The Effects of Temperature on Student Test Performance. EdWorkingPaper No. 24-910
Deven Carlson; Adam Shepardson – Annenberg Institute for School Reform at Brown University, 2024
As students are exposed to extreme temperatures with ever-increasing frequency, it is important to understand how such exposure affects student learning. In this paper we draw upon detailed student achievement data, combined with high-resolution weather records, to paint a clear portrait of the effect of temperature on student learning across a…
Descriptors: Weather, Climate, Heat, Academic Achievement
Orhan, Ali – Journal of Psychoeducational Assessment, 2022
The aims of this reliability generalization study were to provide the overall alpha values of the California critical thinking disposition inventory (CCTDI) total score and subscales scores and investigate the characteristics of the studies that may be associated with the variability in the reliability values of the CCTDI total score and subscales…
Descriptors: Critical Thinking, Measures (Individuals), Test Reliability, Generalization
Chang, Kuo-Feng – ProQuest LLC, 2022
This dissertation was designed to foster a deeper understanding of population invariance in the context of composite-score equating and provide practitioners with guidelines for addressing score equity concerns at the composite score level. The purpose of this dissertation was threefold. The first was to compare different composite equating…
Descriptors: Test Items, Equated Scores, Methods, Design
Kim, Stella Y. – Educational Measurement: Issues and Practice, 2022
In this digital ITEMS module, Dr. Stella Kim provides an overview of multidimensional item response theory (MIRT) equating. Traditional unidimensional item response theory (IRT) equating methods impose the sometimes untenable restriction on data that only a single ability is assessed. This module discusses potential sources of multidimensionality…
Descriptors: Item Response Theory, Models, Equated Scores, Evaluation Methods
Johnson, Matthew S.; Liu, Xiang; McCaffrey, Daniel F. – Journal of Educational Measurement, 2022
With the increasing use of automated scores in operational testing settings comes the need to understand the ways in which they can yield biased and unfair results. In this paper, we provide a brief survey of some of the ways in which the predictive methods used in automated scoring can lead to biased, and thus unfair automated scores. After…
Descriptors: Psychometrics, Measurement Techniques, Bias, Automation
Sadhwani, Anjali; Wheeler, Anne; Gwaltney, Angela; Peters, Sarika U.; Barbieri-Welge, Rene L.; Horowitz, Lucia T.; Noll, Lisa M.; Hundley, Rachel J.; Bird, Lynne M.; Tan, Wen-Hann – Journal of Autism and Developmental Disorders, 2023
We describe the development of 236 children with Angelman syndrome (AS) using the Bayley Scales of Infant and Toddler Development, Third Edition. Multilevel linear mixed modeling approaches were used to explore differences between molecular subtypes and over time. Individuals with AS continue to make slow gains in development through at least age…
Descriptors: Child Development, Developmental Disabilities, Psychomotor Skills, Infants
Moore, C. Missy; Mullen, Patrick R.; Hinchey, Kaitlin J.; Lambie, Glenn W. – Counselor Education and Supervision, 2023
Our study examines the differential item functioning of the Counselor Competencies Scale--Revised (CCS-R) scores due to respondents' gender, the type of evaluation, and a combination of these two variables using a large sample (N = 1614). Implications of the findings are offered to inform counselor educators and supervisors using the CCS-R and…
Descriptors: Item Analysis, Measures (Individuals), Counselors, Competence
Gorney, Kylie; Wollack, James A. – Journal of Educational Measurement, 2023
In order to detect a wide range of aberrant behaviors, it can be useful to incorporate information beyond the dichotomous item scores. In this paper, we extend the l[subscript z] and l*[subscript z] person-fit statistics so that unusual behavior in item scores and unusual behavior in item distractors can be used as indicators of aberrance. Through…
Descriptors: Test Items, Scores, Goodness of Fit, Statistics
Folger, Timothy D.; Bostic, Jonathan; Krupa, Erin E. – Educational Measurement: Issues and Practice, 2023
Validity is a fundamental consideration of test development and test evaluation. The purpose of this study is to define and reify three key aspects of validity and validation, namely test-score interpretation, test-score use, and the claims supporting interpretation and use. This study employed a Delphi methodology to explore how experts in…
Descriptors: Test Interpretation, Scores, Test Use, Test Validity