Publication Date
In 2025 | 29
Since 2024 | 93
Since 2021 (last 5 years) | 448
Since 2016 (last 10 years) | 1317
Since 2006 (last 20 years) | 3619
Descriptor
Comparative Analysis | 4048
Scores | 4048
Elementary School Students | 1340
Gender Differences | 1246
Statistical Analysis | 1235
Public Schools | 1124
Racial Differences | 1105
Educational Assessment | 941
Foreign Countries | 898
Achievement Gap | 889
National Competency Tests | 878
Author
Sinharay, Sandip | 9
Bridgeman, Brent | 7
Petscher, Yaacov | 7
Attali, Yigal | 6
Jerrim, John | 6
Wolf, Patrick J. | 6
Cho, Sun-Joo | 5
Chudowsky, Naomi | 5
Chudowsky, Victor | 5
Kim, Sooyeon | 5
Klecker, Beverly M. | 5
Audience
Policymakers | 63
Practitioners | 60
Researchers | 18
Teachers | 14
Administrators | 7
Parents | 4
Community | 2
Counselors | 1
Students | 1
Location
Texas | 133
California | 119
Florida | 115
Turkey | 82
Georgia | 72
North Carolina | 71
Iran | 66
Illinois | 62
Massachusetts | 56
United States | 56
New York | 55
What Works Clearinghouse Rating
Meets WWC Standards without Reservations | 11
Meets WWC Standards with or without Reservations | 20
Does not meet standards | 25
Farshad Effatpanah; Purya Baghaei; Mona Tabatabaee-Yazdi; Esmat Babaii – Language Testing, 2025
This study aimed to propose a new method for scoring C-Tests as measures of general language proficiency. In this approach, the unit of analysis is the sentence rather than the gap or the passage. That is, the gaps correctly reformulated in each sentence were aggregated as a sentence score, and then each sentence was entered into the analysis as a polytomous…
Descriptors: Item Response Theory, Language Tests, Test Items, Test Construction
Jeff Allen; Jay Thomas; Stacy Dreyer; Scott Johanningmeier; Dana Murano; Ty Cruce; Xin Li; Edgar Sanchez – ACT Education Corp., 2025
This report describes the process of developing and validating the enhanced ACT. The report describes the changes made to the test content and the processes by which these design decisions were implemented. The authors describe how they shared the overall scope of the enhancements, including the initial blueprints, with external expert panels,…
Descriptors: College Entrance Examinations, Testing, Change, Test Construction
R. Lanai Jennings; Megan Midkiff; Emily Nestor McCauley; Jeremy Lopuch; Sandra Stroebel; Rachel James; Mary Toler; Rebecca Wendell; Paula King; Mallory Frampton – Contemporary School Psychology, 2024
Reading comprehension is one of the most valuable academic skills taught in school. Selecting the appropriate assessment instrument to ensure early identification and intervention is important as there is an amalgam of cognitive abilities and academic skills involved in reading comprehension. The GORT-5 is the most recent edition of a test that…
Descriptors: Test Validity, Diagnostic Tests, Reading Comprehension, Early Intervention
Shear, Benjamin R. – Journal of Educational Measurement, 2023
Large-scale standardized tests are regularly used to measure student achievement overall and for student subgroups. These uses assume tests provide comparable measures of outcomes across student subgroups, but prior research suggests score comparisons across gender groups may be complicated by the type of test items used. This paper presents…
Descriptors: Gender Bias, Item Analysis, Test Items, Achievement Tests
Jeff Allen; Ty Cruce – ACT Education Corp., 2025
This report summarizes some of the evidence supporting interpretations of scores from the enhanced ACT, focusing on reliability, concurrent validity, predictive validity, and score comparability. The authors argue that the evidence presented in this report supports the interpretation of scores from the enhanced ACT as measures of high school…
Descriptors: College Entrance Examinations, Testing, Change, Scores
Harun Bayer; Fazilet Gül Ince Araci; Gülsah Gürkan – International Journal of Technology in Education and Science, 2024
The rapid advancement of artificial intelligence technologies, their pervasive use in every field, and the growing understanding of the benefits they bring have led actors in the education sector to pursue research in this field. In particular, the use of artificial intelligence tools has become more prevalent in the education sector due to the…
Descriptors: Artificial Intelligence, Computer Software, Computational Linguistics, Technology Uses in Education
Kimpo, Rhea R.; Puder, Barb – Anatomical Sciences Education, 2023
The traditional format for neuroanatomy lab practical exams involves stations with a time limit for each station and inability to revisit stations. Timed exams have been associated with anxiety, which can lead to poor performance. In alignment with the universal design for learning (UDL), "Timed Image Question" and "Untimed Image…
Descriptors: Anatomy, Neurosciences, Comparative Analysis, Laboratory Experiments
Jane Batamuliza; Gonzague Habinshuti; Jean Baptiste Nkurunziza – Journal of Technology and Science Education, 2024
This study presents the effects of interactive computer simulations on students' performance and concept retention in the unit on chemical reactions. Purposive sampling was used to select four schools with a sample of 320. The achievement test on chemical reactions was developed, validated, and checked for reliability. The…
Descriptors: Chemistry, Science Instruction, Teaching Methods, Comparative Analysis
K. Supriya; Christofer Bang; Jessica Ebie; Christopher Pagliarulo; Derek Tucker; Kaela Villegas; Christian Wright; Sara Brownell – CBE - Life Sciences Education, 2024
Use of high-stakes exams in a course has been associated with gender, racial, and socioeconomic inequities. We investigated whether offering students the opportunity to retake an exam makes high-stakes exams more equitable. Following the control value theory of achievement emotions, we hypothesized that exam retakes would increase students'…
Descriptors: Test Anxiety, High Stakes Tests, Academic Achievement, Self Concept
Ozdemir, Burhanettin; Gelbal, Selahattin – Education and Information Technologies, 2022
Computerized adaptive tests (CAT) apply an adaptive process in which the items are tailored to individuals' ability scores. Multidimensional CAT (MCAT) designs differ in the item selection, ability estimation, and termination methods they use. This study aims to investigate the performance of the MCAT designs used to…
Descriptors: Scores, Computer Assisted Testing, Test Items, Language Proficiency
Baldwin, Peter; Clauser, Brian E. – Journal of Educational Measurement, 2022
While score comparability across test forms typically relies on common (or randomly equivalent) examinees or items, innovations in item formats, test delivery, and efforts to extend the range of score interpretation may require a special data collection before examinees or items can be used in this way--or may be incompatible with common examinee…
Descriptors: Scoring, Testing, Test Items, Test Format
Leda Lampropoulou – Language Education & Assessment, 2023
Extensive oral tasks or monologues of different types (e.g., presentations, storytelling) are often used as second language acquisition tasks in the fields of language learning and language testing. Pre-task planning time is a common provision to test-takers who may use different strategies to prepare their response. High-stakes tests, such as the…
Descriptors: Language Tests, Speech Communication, Test Validity, Culture Fair Tests
Uminski, Crystal; Hubbard, Joanna K.; Couch, Brian A. – CBE - Life Sciences Education, 2023
Biology instructors use concept assessments in their courses to gauge student understanding of important disciplinary ideas. Instructors can choose to administer concept assessments based on participation (i.e., lower stakes) or the correctness of responses (i.e., higher stakes), and students can complete the assessment in an in-class or…
Descriptors: Biology, Science Tests, High Stakes Tests, Scores
Matt I. Brown; Patrick R. Heck; Christopher F. Chabris – Journal of Autism and Developmental Disorders, 2024
The Social Shapes Test (SST) is a measure of social intelligence which does not use human faces or rely on extensive verbal ability. The SST has shown promising validity among adults without autism spectrum disorder (ASD), but it is uncertain whether it is suitable for adults with ASD. We find measurement invariance between adults with (n = 229)…
Descriptors: Interpersonal Competence, Autism Spectrum Disorders, Emotional Intelligence, Verbal Ability
Tim Stoeckel; Liang Ye Tan; Hung Tan Ha; Nam Thi Phuong Ho; Tomoko Ishii; Young Ae Kim; Chunmei Huang; Stuart McLean – Vocabulary Learning and Instruction, 2024
Local item dependency (LID) occurs when test-takers' responses to one test item are affected by their responses to another. It can be problematic if it causes inflated reliability estimates or distorted person and item measures. The cued-recall reading comprehension test in Hu and Nation's (2000) well-known and influential coverage--comprehension…
Descriptors: Reading Comprehension, English (Second Language), Second Language Instruction, Second Language Learning