Publication Date
| In 2026 | 6 |
| Since 2025 | 2195 |
| Since 2022 (last 5 years) | 12710 |
| Since 2017 (last 10 years) | 33835 |
| Since 2007 (last 20 years) | 68326 |
Descriptor
| Foreign Countries | 30532 |
| Test Validity | 21728 |
| Scores | 18248 |
| Academic Achievement | 16912 |
| Test Construction | 16738 |
| Test Reliability | 15015 |
| Achievement Tests | 14839 |
| Standardized Tests | 14712 |
| Comparative Analysis | 14429 |
| Elementary Secondary Education | 13038 |
| Language Tests | 12549 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 5034 |
| Teachers | 3391 |
| Researchers | 2630 |
| Policymakers | 1229 |
| Administrators | 976 |
| Students | 687 |
| Parents | 325 |
| Counselors | 216 |
| Community | 162 |
| Support Staff | 50 |
| Media Staff | 34 |
| More ▼ | |
Location
| Turkey | 2815 |
| Australia | 2426 |
| Canada | 2269 |
| California | 1853 |
| United States | 1725 |
| Texas | 1615 |
| China | 1578 |
| United Kingdom | 1315 |
| Florida | 1312 |
| United Kingdom (England) | 1202 |
| Germany | 1121 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 121 |
| Meets WWC Standards with or without Reservations | 189 |
| Does not meet standards | 174 |
Alvie Faustino Diaz; Henrilyn Loñez – Journal of Science and Mathematics Education in Southeast Asia, 2023
Concurrent classrooms, also called hybrid classrooms, have become a well-liked teaching strategy. Research suggests that face-to-face and online set-ups are best integrated, combining their strengths to create a unique learning experience consistent with the context and intended educational purposes. However, more extensive research still needs to…
Descriptors: Test Construction, Test Validity, Teaching Methods, Educational Strategies
Christine M. White; Christopher Schatschneider – Grantee Submission, 2023
Universal screening to predict students' risk for reading problems is a foundational component of the Multi-Tiered Systems of Support framework and is required by law in many US states. School or district administrators are tasked with selecting screening assessments that are both technically adequate and feasible given the resources of their…
Descriptors: Screening Tests, Reading Tests, Reading Difficulties, Classification
Rossi, Olena; Brunfaut, Tineke – Language Assessment Quarterly, 2021
A long-standing debate in the testing of listening concerns the authenticity of the listening input. On the one hand, listening texts produced by item writers often lack spoken language characteristics. On the other hand, real-life recordings are often too context-specific to stand alone, or not suitable for item generation. In this study, we…
Descriptors: Listening Comprehension Tests, Test Items, Test Construction, Training
Haladyna, Thomas M.; Rodriguez, Michael C. – Educational Assessment, 2021
Full-information item analysis provides item developers and reviewers comprehensive empirical evidence of item quality, including option response frequency, point-biserial index (PBI) for distractors, mean-scores of respondents selecting each option, and option trace lines. The multi-serial index (MSI) is introduced as a more informative…
Descriptors: Test Items, Item Analysis, Reading Tests, Mathematics Tests
Kotecki, Jerome E.; Greene, Maurita A.; Khubchandani, Jagdish; Kandiah, Jayanthi – American Journal of Health Education, 2021
Background: Diet quality assessment in community health settings is critical to reduce the incidence and improve management of diet-related chronic disease. Unfortunately, understandable and actionable brief dietary screening tools that empower individuals are nearly absent. Purpose: The purpose of this article is to describe two rigorous…
Descriptors: Dietetics, Screening Tests, Counseling, Health Education
Ing, Marsha; Chinen, Starlie; Jackson, Kara; Smith, Thomas M. – Educational Measurement: Issues and Practice, 2021
Despite the ease of accessing a wide range of measures, little attention is given to validity arguments when considering whether to use the measure for a new purpose or in a different context. Making a validity argument has historically focused on the intended interpretation and use. There has been a press to consider both the intended and actual…
Descriptors: Instructional Improvement, Measures (Individuals), Test Validity, Test Interpretation
Sebastian Moncaleano – ProQuest LLC, 2021
The growth of computer-based testing over the last two decades has motivated the creation of innovative item formats. It is often argued that technology-enhanced items (TEIs) provide better measurement of test-takers' knowledge, skills, and abilities by increasing the authenticity of tasks presented to test-takers (Sireci & Zenisky, 2006).…
Descriptors: Computer Assisted Testing, Test Format, Test Items, Classification
Neha Biju; Nasser Said Gomaa Abdelrasheed; Khilola Bakiyeva; K. D. V. Prasad; Biruk Jember – Language Testing in Asia, 2024
In recent years, language practitioners have paid increasing attention to artificial intelligence (AI)'s role in language programs. This study investigated the impact of AI-assisted language assessment on L2 learners' foreign language anxiety (FLA), attitudes, motivation, and writing skills. The study adopted a sequential exploratory mixed-methods…
Descriptors: Artificial Intelligence, Computer Software, Computer Assisted Testing, Second Language Instruction
Ayako Aizawa – Vocabulary Learning and Instruction, 2024
The Vocabulary Size Test (VST) measures English learners' decontextualised receptive vocabulary knowledge of written English and has nine bilingual versions with multiple-choice options written in other languages. This study used the English-Japanese version of the VST to investigate the extent to which loanword items were answered correctly by…
Descriptors: Linguistic Borrowing, Second Language Learning, Native Language, English (Second Language)
Judit Kormos; Kathrin Eberharter; Elisa Guggenbichler; Simone Baumgartinger; Viktoria Ebner; Benjamin Kremmel – Language Assessment Quarterly, 2024
Authentic listening increasingly involves being able to pause or replay recordings as needed. In this study, we investigated differences in the reported use of listening strategies and listening anxiety between single-play and self-paced test administration. We also analyzed the interrelationships among first language (L1) literacy skills, second…
Descriptors: Native Language, Language Tests, Listening Comprehension Tests, Literacy
Crystal Spring; Andrea Ochoa – Journal of Psychoeducational Assessment, 2024
This study sought to develop an Academic School Climate Scale measuring students' perceptions of the learning environments at their schools. With a pilot sample of 1,265 students and validation sample of 14,773 students in Grades 4-12 in schools across the U.S., results of EFA and CFA supported a bifactor model with a general factor and three…
Descriptors: Educational Environment, Measures (Individuals), Student Attitudes, Grade 4
Nastasia Schreiner; Aleksandr Shneyderman – Office of Assessment, Research, and Data Analysis, Miami-Dade County Public Schools, 2024
To meet graduation requirements, public school students in Florida must participate in and pass any statewide, standardized assessments required for a standard diploma or earn identified concordant scores or comparative scores, as applicable, for the cohort year in which they entered in ninth grade (M-DCPS, 2024). One of the statewide assessments…
Descriptors: Scores, Graduation Requirements, Grade 10, Language Arts
Joshua B. Gilbert; Luke W. Miratrix; Mridul Joshi; Benjamin W. Domingue – Annenberg Institute for School Reform at Brown University, 2024
Analyzing heterogeneous treatment effects (HTE) plays a crucial role in understanding the impacts of educational interventions. A standard practice for HTE analysis is to examine interactions between treatment status and pre-intervention participant characteristics, such as pretest scores, to identify how different groups respond to treatment.…
Descriptors: Causal Models, Item Response Theory, Statistical Inference, Psychometrics
Maeve Dwan-O'Reilly; Laura Walsh; Ailbhe Booth; Caroline Heary; Eilis Hennessy – School Mental Health, 2024
Secondary school staff are often tasked with delivering mental health content to students, yet there has been little research on staff confidence to do so. Given the responsibility placed on staff to support student mental health, reliable and valid measures are needed to facilitate assessment of teacher confidence in the classroom and evaluation…
Descriptors: Mental Health, Teacher Responsibility, Psychometrics, Test Reliability
Zoe L. Handley; Haiping Wang – Language Assessment Quarterly, 2024
This paper explores what the measures of utterance fluency typically employed in Automatic Speech Evaluation (ASE), i.e. automated speaking assessments, tell us about oral proficiency. 60 Chinese learners of English completed the second part of the speaking section of IELTS and six tasks designed to measure the linguistic knowledge and processing…
Descriptors: Foreign Countries, Speech Evaluation, Graduate Students, Articulation (Speech)

Peer reviewed
Direct link
