Publication Date
| In 2026 | 6 |
| Since 2025 | 2195 |
| Since 2022 (last 5 years) | 12710 |
| Since 2017 (last 10 years) | 33835 |
| Since 2007 (last 20 years) | 68326 |
Descriptor
| Foreign Countries | 30532 |
| Test Validity | 21728 |
| Scores | 18248 |
| Academic Achievement | 16912 |
| Test Construction | 16738 |
| Test Reliability | 15015 |
| Achievement Tests | 14839 |
| Standardized Tests | 14712 |
| Comparative Analysis | 14429 |
| Elementary Secondary Education | 13038 |
| Language Tests | 12549 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 5034 |
| Teachers | 3391 |
| Researchers | 2630 |
| Policymakers | 1229 |
| Administrators | 976 |
| Students | 687 |
| Parents | 325 |
| Counselors | 216 |
| Community | 162 |
| Support Staff | 50 |
| Media Staff | 34 |
| More ▼ | |
Location
| Turkey | 2815 |
| Australia | 2426 |
| Canada | 2269 |
| California | 1853 |
| United States | 1725 |
| Texas | 1615 |
| China | 1578 |
| United Kingdom | 1315 |
| Florida | 1312 |
| United Kingdom (England) | 1202 |
| Germany | 1121 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 121 |
| Meets WWC Standards with or without Reservations | 189 |
| Does not meet standards | 174 |
Nils Myszkowski; Martin Storme – Journal of Creative Behavior, 2025
In the PISA 2022 creative thinking test, students provide a response to a prompt, which is then coded by human raters as no credit, partial credit, or full credit. Like many large-scale educational testing frameworks, PISA uses the generalized partial credit model (GPCM) as a response model for these ordinal ratings. In this paper, we show that…
Descriptors: Creative Thinking, Creativity Tests, Scores, Prompting
Jonathan Seiden – Annenberg Institute for School Reform at Brown University, 2025
Direct assessments of early childhood development (ECD) are a cornerstone of research in developmental psychology and are increasingly used to evaluate programs and policies in lower- and middle-income countries. Despite strong psychometric properties, these assessments are too expensive and time consuming for use in large-scale monitoring or…
Descriptors: Young Children, Child Development, Performance Based Assessment, Developmental Psychology
Juan E. Jiménez; Cristina Rodríguez; Jennifer Balade – Reading and Writing: An Interdisciplinary Journal, 2025
The main aim of this study was to evaluate the Early Grade Writing Assessment for Kindergarten (EGWA-K), which is grounded in foundational literacy skills, for its validity and diagnostic accuracy in identifying children at risk of developing early learning disabilities in writing (LDW). To the best of our knowledge, no such tool exists for…
Descriptors: Kindergarten, Writing Evaluation, Writing Skills, Functional Literacy
Lars Andersson Hult; Anders Persson – Journal of Social Science Education, 2025
Purpose: This article's purpose is to examine the manifestations of the evolving modern society and what we now identify as civics or other contemporary social issues in the final examination questions from 1914 to 1937 at four teacher education institutions in Uppsala, Falun, Lund, and Landskrona. Design/methodology/approach: The method can be…
Descriptors: Civics, Tests, Preservice Teacher Education, Test Items
Yusuf Oc; Hela Hassen – Marketing Education Review, 2025
Driven by technological innovations, continuous digital expansion has transformed fundamentally the landscape of modern higher education, leading to discussions about evaluation techniques. The emergence of generative artificial intelligence raises questions about reliability and academic honesty regarding multiple-choice assessments in online…
Descriptors: Higher Education, Multiple Choice Tests, Computer Assisted Testing, Electronic Learning
Diego Cortes; Dirk Hastedt; Sabine Meinck – Large-scale Assessments in Education, 2025
This paper informs users of data collected in international large-scale assessments (ILSA), by presenting argumentsunderlining the importance of considering two design features employed in these studies. We examine a commonmisconception stating that the uncertainty arising from the assessment design is negligible compared with that arisingfrom the…
Descriptors: Sampling, Research Design, Educational Assessment, Statistical Inference
Shaila Sharmin; Minh-Hao D. Tran; Weihua Fan; Consuelo Arbona; Allison Master; Yali Zou – Journal of Psychoeducational Assessment, 2025
The study examined the psychometric structure of Basic Needs Satisfaction in General Scale (BNSG-S) with a sample of 495 college students. It compared the original 21-item version (Gagné, 2003) with the shortened 16-item version (Johnston & Finney, 2010), both with and without method effects. A series of confirmatory factor analyses (CFA)…
Descriptors: Need Gratification, Measures (Individuals), Psychometrics, College Faculty
Gerd Kortemeyer; Julian Nöhl – Physical Review Physics Education Research, 2025
This study explores the use of artificial intelligence in grading high-stakes physics exams, emphasizing the application of psychometric methods, particularly item response theory, to evaluate the reliability of AI-assisted grading. We examine how grading rubrics can be iteratively refined and how threshold parameters can determine when…
Descriptors: Physics, Science Tests, Grading, Artificial Intelligence
Leo Evans; Emily A. Lund; Krystal L. Werfel – Language, Speech, and Hearing Services in Schools, 2025
Purpose: Vocabulary skills in children are typically measured with norm-referenced assessments of receptive and expressive vocabulary. Language sample analysis is an alternative method of examining vocabulary actually produced in communicative events and may be better suited to exposing subtle vocabulary weaknesses. Here, we examine the…
Descriptors: Vocabulary Development, Children, Deafness, Hard of Hearing
Lauren Prather; Nancy Creaghead; Jennifer Vannest; Lisa Hunter; Amy Hobek; Tamika Odum; Mekibib Altaye; Juanita Lackey – Perspectives of the ASHA Special Interest Groups, 2025
Purpose: The lack of appropriate assessments affects populations presumed to be most at risk for speech and language concerns, one of them being children with a history of preterm birth. This study aims to examine whether cultural bias is present in two currently available language tests for Black children under 3 years of age: the Communication…
Descriptors: African American Children, Premature Infants, Evaluation Methods, Language Tests
Mao Li; Muhamad Gina Nugraha – International Electronic Journal of Mathematics Education, 2025
This study presents the validation of a Technological Pedagogical Content Knowledge (TPACK) scale tailored explicitly for primary mathematics teachers in China. Recognizing the gap in context-specific TPACK assessment tools, the research aims to provide an instrument that aligns with the unique educational landscape of China. Utilizing established…
Descriptors: Foreign Countries, Technological Literacy, Pedagogical Content Knowledge, Test Validity
Ryan J. McGill; Stefan C. Dombrowski; Gary L. Canivez – Psychology in the Schools, 2025
The present study examined the posited structure of the Wechsler Intelligence Scale for Children--Fifth Edition (WISC-V) ancillary index scores with normative sample participants aged 6-16 years (N = 2200) using a series of confirmatory factor analyzes (CFA) with maximum likelihood estimation. CFA results supported the retention of auditory…
Descriptors: Children, Intelligence Tests, Test Validity, Scores
Liangliang Xia; Kexin Shen; Herui Sun; Xin An; Yan Dong – Education and Information Technologies, 2025
While generative artificial intelligence (AI) empowers students in their learning, it may also have the potential to undermine their learning agency. Understanding student learning agency in the generative AI-supported contexts (SLA-GAI) has become critical for educators. However, student learning agency is a domain-specific construct, current…
Descriptors: Artificial Intelligence, Technology Uses in Education, Test Construction, Test Validity
Mei-Ju Chen; Chao-Yu Guo; Li-Ting Tseng; Ming-Yi Chiu; Hsuan-Jui Weng – Psychology in the Schools, 2025
An interdisciplinary curriculum provides integration of students' subjects and lives, guiding students to understand and engage with changes in the world through real situations and problems, with teachers playing a critical role. However, there is no assessment tool to explore teachers' interdisciplinary role perceptions (TIRP). In this study, an…
Descriptors: Teacher Role, Role Perception, Interdisciplinary Approach, Attitude Measures
Huiming Ding; Matt Homer – Advances in Health Sciences Education, 2025
Summative assessments are often underused for feedback, despite them being rich with data of students' applied knowledge and clinical and professional skills. To better inform teaching and student support, this study aims to gain insights from summative assessments through profiling students' performance patterns and identify those students…
Descriptors: Summative Evaluation, Profiles, Statistical Analysis, Outcomes of Education

Peer reviewed
Direct link
