Publication Date
| In 2026 | 10 |
| Since 2025 | 2328 |
| Since 2022 (last 5 years) | 12843 |
| Since 2017 (last 10 years) | 33968 |
| Since 2007 (last 20 years) | 68459 |
Descriptor
| Foreign Countries | 30579 |
| Test Validity | 21757 |
| Scores | 18263 |
| Academic Achievement | 16934 |
| Test Construction | 16763 |
| Test Reliability | 15036 |
| Achievement Tests | 14864 |
| Standardized Tests | 14724 |
| Comparative Analysis | 14431 |
| Elementary Secondary Education | 13046 |
| Language Tests | 12551 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 5034 |
| Teachers | 3394 |
| Researchers | 2630 |
| Policymakers | 1232 |
| Administrators | 979 |
| Students | 687 |
| Parents | 325 |
| Counselors | 216 |
| Community | 162 |
| Support Staff | 50 |
| Media Staff | 34 |
| More ▼ | |
Location
| Turkey | 2823 |
| Australia | 2430 |
| Canada | 2270 |
| California | 1854 |
| United States | 1727 |
| Texas | 1615 |
| China | 1579 |
| United Kingdom | 1315 |
| Florida | 1312 |
| United Kingdom (England) | 1203 |
| Germany | 1123 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 121 |
| Meets WWC Standards with or without Reservations | 189 |
| Does not meet standards | 174 |
Burton, J. Dylan – Language Assessment Quarterly, 2023
The effects of question or task complexity on second language speaking have traditionally been investigated using complexity, accuracy, and fluency measures. Response processes in speaking tests, however, may manifest in other ways, such as through nonverbal behavior. Eye behavior, in the form of averted gaze or blinking frequency, has been found…
Descriptors: Oral Language, Speech Communication, Language Tests, Eye Movements
Jonathan Trace – Language Teaching Research Quarterly, 2023
The role of context in cloze tests has long been seen as both a benefit as well as a complication in their usefulness as a measure of second language comprehension (Brown, 2013). Passage cohesion, in particular, would seem to have a relevant and important effect on the degree to which cloze items function and the interpretability of performances…
Descriptors: Language Tests, Cloze Procedure, Connected Discourse, Test Items
Nebraska Department of Education, 2021
This technical report documents the processes and procedures implemented to support the Spring 2021 Nebraska Student-Centered Assessment System (NSCAS) Phase I Pilot in English Language Arts (ELA), Mathematics, and Science assessments by NWEA® under the supervision of the Nebraska Department of Education (NDE). The technical report shows how the…
Descriptors: Psychometrics, Standard Setting, English, Language Arts
Yung, Kevin Wai-Ho – RELC Journal: A Journal of Language Teaching and Research, 2023
Literature has long been used as a tool for language teaching and learning. In the New Academic Structure in Hong Kong, it has become an important element in the senior secondary English language curriculum to promote communicative language teaching (CLT) with a process-oriented approach. However, as in many other English as a second or foreign…
Descriptors: Singing, Music Education, Test Preparation, Language Tests
Rao, Chaitra; T. A., Sumathi; Midha, Rashi; Oberoi, Geet; Kar, Bhoomika; Khan, Masarrat; Vaidya, Kshipra; Midya, Vishal; Raman, Nitya; Gajre, Mona; Singh, Nandini Chatterjee – Annals of Dyslexia, 2021
A majority of Indian schoolchildren are biliterate in that they acquire literacy in at least two language systems, necessitating dyslexia assessment in both. The DALI-DAB assesses risk for dyslexia by evaluating reading ability and literacy-learning potential through a battery including literacy tests (letter and word reading, spelling, nonword…
Descriptors: Foreign Countries, Dyslexia, Reading Tests, Diagnostic Tests
Romig, John Elwood; Miller, Alexandra A.; Therrien, William J.; Lloyd, John W. – Exceptionality, 2021
Researchers studying curriculum-based measurement of written expression have used a variety of writing prompt types and durations when establishing criterion validity of these tools. The purpose of this study was to determine through meta-analytic procedures whether any prompt type or duration was superior to others in terms of criterion validity.…
Descriptors: Curriculum Based Assessment, Writing Evaluation, Prompting, Meta Analysis
Sophie Litschwartz – Society for Research on Educational Effectiveness, 2021
Background/Context: Pass/fail standardized exams frequently selectively rescore failing exams and retest failing examinees. This practice distorts the test score distribution and can confuse those who do analysis on these distributions. In 2011, the Wall Street Journal showed large discontinuities in the New York City Regent test score…
Descriptors: Standardized Tests, Pass Fail Grading, Scoring Rubrics, Scoring Formulas
Soohye Yeom; Lorena Llosa – Language Testing in Asia, 2024
With the increased popularity of English-medium instruction (EMI) in higher education, many East Asian universities are using international English proficiency tests to make admissions and placement decisions. Since these tests were not originally designed for the EMI contexts, validity evidence is needed to support the use of these tests in this…
Descriptors: Language Proficiency, Language Tests, Student Placement, Language of Instruction
Emily R. Forcht; Ethan R. Van Norman – Psychology in the Schools, 2024
The present study compared the diagnostic accuracy of a single computer adaptive test (CAT), Star Reading or Star Math, and a combination of the two in a gated screening framework to predict end-of-year proficiency in reading and math. Participants included 13,009 students in Grades 3-8 who had at least one fall screening score and end-of-year…
Descriptors: Computer Assisted Testing, Adaptive Testing, Diagnostic Tests, Screening Tests
Editorial Projects in Education, 2024
Effective assessments can illuminate strengths, pinpoint learning gaps, and better guide education. This Spotlight will help readers evaluate effective ways to offer students feedback; examine how some states are transitioning to through-year testing models; hear from educators regarding pressure for student success on standardized tests; gain…
Descriptors: Assessment Literacy, Evaluation, Educational Assessment, Feedback (Response)
Srikanth Allamsetty; M. V. S. S. Chandra; Neelima Madugula; Byamakesh Nayak – IEEE Transactions on Learning Technologies, 2024
The present study is related to the problem associated with student assessment with online examinations at higher educational institutes (HEIs). With the current COVID-19 outbreak, the majority of educational institutes are conducting online examinations to assess their students, where there would always be a chance that the students go for…
Descriptors: Computer Assisted Testing, Accountability, Higher Education, Comparative Analysis
Mark Wilson – Journal of Educational and Behavioral Statistics, 2024
This article introduces a new framework for articulating how educational assessments can be related to teacher uses in the classroom. It articulates three levels of assessment: macro (use of standardized tests), meso (externally developed items), and micro (on-the-fly in the classroom). The first level is the usual context for educational…
Descriptors: Educational Assessment, Measurement, Standardized Tests, Test Items
Elizabeth B. Vaughan; A. Montoya-Cowan; Jack Barbera – Chemistry Education Research and Practice, 2024
The Meaningful Learning in the Laboratory Instrument (MLLI) was designed to measure students' expectations before and after their laboratory courses and experiences. Although the MLLI has been used in various studies and laboratory environments to investigate students' cognitive and affective laboratory expectations, the authors of the instrument…
Descriptors: Test Validity, Test Reliability, Expectation, Measures (Individuals)
Huiying Cai; Xun Yan – Language Testing, 2024
Rater comments tend to be qualitatively analyzed to indicate raters' application of rating scales. This study applied natural language processing (NLP) techniques to quantify meaningful, behavioral information from a corpus of rater comments and triangulated that information with a many-facet Rasch measurement (MFRM) analysis of rater scores. The…
Descriptors: Natural Language Processing, Item Response Theory, Rating Scales, Writing Evaluation
Nina Charlotte Johanna Welsandt; Fabio Fortunati; Esther Winther; Hermann Josef Abs – Empirical Research in Vocational Education and Training, 2024
Background: Authentic situations are considered a source of learning due to their real world relevance. This can encourage learners to acquire new knowledge. Increasing digitisation and associated resources, such as professional development opportunities for teachers, technology tools, or digital equipment for schools enable the development and…
Descriptors: Test Construction, Test Validity, Evaluation, Educational Technology

Peer reviewed
Direct link
