Deschênes, Marie-France; Dionne, Éric; Dorion, Michelle; Grondin, Julie – Practical Assessment, Research & Evaluation, 2023
The use of the aggregate scoring method for scoring concordance tests requires the weighting of test items to be derived from the performance of a group of experts who take the test under the same conditions as the examinees. However, the average score of experts constituting the reference panel remains a critical issue in the use of these tests.…
Descriptors: Scoring, Tests, Evaluation Methods, Test Items
Jiang, Zhehan; Han, Yuting; Xu, Lingling; Shi, Dexin; Liu, Ren; Ouyang, Jinying; Cai, Fen – Educational and Psychological Measurement, 2023
The part of responses that is absent in the nonequivalent groups with anchor test (NEAT) design can be managed to a planned missing scenario. In the context of small sample sizes, we present a machine learning (ML)-based imputation technique called chaining random forests (CRF) to perform equating tasks within the NEAT design. Specifically, seven…
Descriptors: Test Items, Equated Scores, Sample Size, Artificial Intelligence
Mosquera, Jose Miguel Llanos; Suarez, Carlos Giovanny Hidalgo; Guerrero, Victor Andres Bucheli – Education and Information Technologies, 2023
This paper proposes to evaluate learning efficiency by implementing the flipped classroom and automatic source code evaluation based on the Kirkpatrick evaluation model in students of CS1 programming course. The experimentation was conducted with 82 students from two CS1 courses; an experimental group (EG = 56) and a control group (CG = 26). Each…
Descriptors: Flipped Classroom, Coding, Programming, Evaluation Methods
Yanxuan Qu; Sandip Sinharay – ETS Research Report Series, 2023
Though a substantial amount of research exists on imputing missing scores in educational assessments, there is little research on cases where responses or scores to an item are missing for all test takers. In this paper, we tackled the problem of imputing missing scores for tests for which the responses to an item are missing for all test takers.…
Descriptors: Scores, Test Items, Accuracy, Psychometrics
Tresansky, Lindsay M. – ProQuest LLC, 2023
The Annual Professional Performance Review (APPR) system in New York State (NYS) has been called into question by educators since its adoption nearly 10 years ago, yet it remains the mandated evaluation system in NYS schools today. Much of the concern has been over changes such as assigning teachers final evaluation scores, as well as for the…
Descriptors: Foreign Countries, Comparative Education, Teacher Evaluation, Alternative Assessment
Ebrahim Azimi; Jane Friesen; Simon Woodcock – Education Finance and Policy, 2023
We investigate the effects of private schools on reading and numeracy scores using rich population data. Conditional on lagged test scores and narrowly defined neighborhood indicators, Catholic and non-Christian faith private schools on average raise test scores by 0.18 standard deviation or more relative to the average public school, while…
Descriptors: Private Schools, Academic Achievement, Catholic Schools, Scores
Wendy Chan; Jimin Oh; Chen Li; Jiexuan Huang; Yeran Tong – Society for Research on Educational Effectiveness, 2023
Background: The generalizability of a study's results continues to be at the forefront of concerns in evaluation research in education (Tipton & Olsen, 2018). Over the past decade, statisticians have developed methods, mainly based on propensity scores, to improve generalizations in the absence of random sampling (Stuart et al., 2011; Tipton,…
Descriptors: Generalizability Theory, Probability, Scores, Sampling
Hess, Jessica – ProQuest LLC, 2023
This study was conducted to further research into the impact of student-group item parameter drift (SIPD), referred to as subpopulation item parameter drift in previous research, on ability estimates and proficiency classification accuracy when occurring in the discrimination parameter of a 2-PL item response theory (IRT) model. Using Monte…
Descriptors: Test Items, Groups, Ability, Item Response Theory
John N. Friedman; Bruce Sacerdote; Douglas O. Staiger; Michele Tine – National Bureau of Economic Research, 2025
We analyze admissions and transcript records for students at multiple Ivy-Plus colleges to study the relationship between standardized (SAT/ACT) test scores, high school GPA, and first-year college grades. Standardized test scores predict academic outcomes with a normalized slope four times greater than that from high school GPA, all conditional…
Descriptors: Standardized Tests, Scores, Grade Point Average, College Entrance Examinations
Eugene Zheng Yao; Alexandra List – Journal of Media Literacy Education, 2025
This study investigated students' critical reasoning about commercials, as an aspect of advertising literacy. Critical reasoning was examined under two different experimental conditions. That is, students were tasked with watching four different commercials with 1) brand information provided or not, and 2) asked to engage in critical reasoning or…
Descriptors: Information Literacy, Media Literacy, Advertising, Critical Literacy
Ana Cláudia Lopes; Marisa Lousada – International Journal of Language & Communication Disorders, 2025
Background: Breastfeeding is the optimal method of infant feeding, particularly during the first 6 months after birth, and ideally continuing until the child is at least 2 years old. Speech-language therapists (SLTs) can improve the quality of care in this area, especially in vulnerable populations. Aims: This pilot study aimed to assess the…
Descriptors: Foreign Countries, Speech Language Pathology, Speech Therapy, Infants
Fahruddin; Merci Robbi Kurniawanti; T. Heru Nurgiansah; Dhiniaty Gularso – Journal of Education and Learning (EduLearn), 2025
This study aims to find out: firstly, the qualifications for developing teaching materials to evaluate observation-based history learning and secondly the level of students' critical thinking skills. The results of this research contribute to improving students' critical thinking skills through the development of teaching materials. This research…
Descriptors: Critical Thinking, Scores, History Instruction, Thinking Skills
Yesim Yurdakul; Utku Beyazit; Aynur Bütün Ayhan – Early Childhood Education Journal, 2025
The present study aimed to examine the effect of a dialogic book reading program on preschool children's perspective-taking skills. In line with this aim, a dialogic book reading program was designed, and its effects were tested in a quasi-experimental study involving both pre/post and follow-up tests. The study group consisted of 42 five-year-old…
Descriptors: Dialogs (Language), Reading Programs, Preschool Education, Preschool Children
The Developmental Autism Early Screening (DAES): A Novel Test for Screening Autism Spectrum Disorder
Lara Cirnigliaro; Maria Stella Valle; Antonino Casabona; Martina Randazzo; Francesca La Bruna; Fabio Pettinato; Antonio Narzisi; Renata Rizzo; Rita Barone – Journal of Autism and Developmental Disorders, 2025
This study was undertaken to set a novel developmental screening test for autism spectrum disorder (ASD) using the Griffiths Scales of Child Development (Griffiths III) (Green et al., 2016; Stroud et al., 2016), in order to intercept the early atypical developmental patterns indicating ASD risk in the first 3 years of age. An observational and…
Descriptors: Autism Spectrum Disorders, Test Construction, Screening Tests, Educational Diagnosis
Piyakon Suepbunma; Suthasinee Theerapan – International Journal of Education and Literacy Studies, 2025
This study aimed to evaluate the effectiveness of basic vocal training exercises for students at Yamaha Music School in Mahasarakham, Thailand, using a quantitative research approach. The research focused on collecting measurable numerical data and was divided into two main components: content validity assessment and effectiveness evaluation of…
Descriptors: Foreign Countries, Music Education, Training, Singing