Lederman, Josh – Applied Measurement in Education, 2023
Given its centrality to assessment, until the concept of validity includes concern for racial justice, such matters will be seen as residing outside the "real" work of validation, rendering them powerless to count against the apparent scientific merit of the test. As the definition of validity has evolved, however, it holds great…
Descriptors: Educational Assessment, Validity, Social Justice, Race
Lauwaert, Pieter – Studies in Applied Linguistics & TESOL, 2023
The way in which validity has been conceptualized has changed throughout the years. The focus in validation studies shifted from evaluating distinct components of validity to developing a comprehensive argument for the use and interpretations of test scores. The argument-based approach to validity incorporates the distinct types of the…
Descriptors: Language Tests, Test Validity, Test Use, Construct Validity
Daniel Koretz – Journal of Educational and Behavioral Statistics, 2024
A critically important balance in educational measurement between practical concerns and matters of technique has atrophied in recent decades, and as a result, some important issues in the field have not been adequately addressed. I start with the work of E. F. Lindquist, who exemplified the balance that is now wanting. Lindquist was arguably the…
Descriptors: Educational Assessment, Evaluation Methods, Achievement Tests, Educational History
Tahir Taga – International Journal of Education and Literacy Studies, 2023
In the 21st century, international interaction in social, economic, cultural, and educational fields has increased. Consequently, international standards have become essential in national education policies, reforms, and practices. As an international assessment, PISA has started to function as a prominent tool in this regard. However, the impact…
Descriptors: Achievement Tests, Foreign Countries, International Assessment, Secondary School Students
Barnes, Amy C. – New Directions for Student Leadership, 2021
This article explores the ethical use of assessments in leadership training, education, and development. From the importance of having well-trained facilitators to the consideration of power and social identity in the interpretation of individual results, this article advocates for approaching the use of leadership assessments and inventories with…
Descriptors: Leadership, Measures (Individuals), Ethics, Test Use
Schmitt, Norbert; Nation, Paul; Kremmel, Benjamin – Language Teaching, 2020
Recently, a large number of vocabulary tests have been made available to language teachers, testers, and researchers. Unfortunately, most of them have been launched with inadequate validation evidence. The field of language testing has become increasingly more rigorous in the area of test validation, but developers of vocabulary tests have…
Descriptors: Test Construction, Test Validity, Language Tests, Test Use
Knoch, Ute; Deygers, Bart; Khamboonruang, Apichat – Language Testing, 2021
Rating scale development in the field of language assessment is often considered in dichotomous ways: It is assumed to be guided either by expert intuition or by drawing on performance data. Even though quite a few authors have argued that rating scale development is rarely so easily classifiable, this dyadic view has dominated language testing…
Descriptors: Rating Scales, Test Construction, Language Tests, Test Use
Jieun Kim; Daniel Richard Isbell – Language Assessment Quarterly, 2024
The ACTFL Assessment of Performance Toward Proficiency in Languages (AAPPL, https://www.actfl.org/assessments/k-12-assessments/aappl) assesses proficiency in 11 languages for students in grades 3 to 12 and is often used to award the Seal of Biliteracy. While arguments for the valid interpretation and uses of the AAPPL have previously been…
Descriptors: Language Tests, Second Language Learning, Second Language Instruction, Language Proficiency
Daniel R. Isbell; Benjamin Kremmel; Jieun Kim – Language Assessment Quarterly, 2023
In the wake of the COVID-19 boom in remote administration of language tests, it appears likely that remote administration will be a permanent fixture in the language testing landscape. Accordingly, language test providers, stakeholders, and researchers must grapple with the implications of remote proctoring on valid, fair, and just uses of tests.…
Descriptors: Distance Education, Supervision, Language Tests, Culture Fair Tests
Salmani Nodoushan, Mohammad Ali – Online Submission, 2021
This paper follows a line of logical argumentation to claim that what Samuel Messick conceptualized about construct validation has probably been misunderstood by some educational policy makers, practicing educators, and classroom teachers. It argues that, while Messick's unified theory of test validation aimed at (a) warning educational…
Descriptors: Construct Validity, Test Theory, Test Use, Affordances
Anne H. Davidson – National Assessment Governing Board, 2025
The purpose of this National Assessment of Educational Progress (NAEP) Achievement Levels Validity Argument Report is to synthesize evidence currently available to address the validity of the interpretations and uses of the NAEP Achievement Levels. Validity is the extent to which theory and evidence supports or refutes proposed and enacted test…
Descriptors: National Competency Tests, Academic Achievement, Test Validity, College Entrance Examinations
Rod Ellis; Carsten Roever; Natsuko Shintani; Yan Zhu – Multilingual Matters, 2024
Taking a psycholinguistic perspective, this book investigates how second language (L2) learners' pragmatic abilities in English can be measured. It complements and extends earlier work on the testing of implicit and explicit grammar. The authors present a set of tests they developed using both well-established methods of measuring pragmatic…
Descriptors: Language Proficiency, English (Second Language), Second Language Learning, Second Language Instruction
Assessing the Speaking Proficiency of L2 Chinese Learners: Review of the Hanyu Shuiping Kouyu Kaoshi
Li, Albert W. – Language Testing, 2023
The Hanyu Shuiping Kaoshi (HSK) is a multi-level, multi-purpose Chinese proficiency test developed by the Center for Language Education and Cooperation (previously the Office of Chinese Language Council International and, henceforth, referred to by its colloquial name "Hanban"). It assesses reading, writing, and listening skills of…
Descriptors: Language Tests, Language Proficiency, Chinese, Second Language Learning
Furuta, Jared – Sociology of Education, 2021
National high-stakes exams are a fundamental structural feature of education systems around the world. Despite their importance in shaping educational stratification, little is known about the social processes that influence how and why national high-stakes exams are used at early ages on a global basis. I argue that global trends in the use of…
Descriptors: Educational Trends, High Stakes Tests, Foreign Countries, Comparative Education
Peng, Yue; Yan, Wei; Cheng, Liying – Language Testing, 2021
This test review focuses on the current version (2009) of [Chinese characters omitted] (Hanyu Shuiping Kaoshi), literally translated as the Chinese Language Proficiency Test and abbreviated as HSK. Tailored to non-native speakers of the Chinese language, this test consists of six proficiency levels (Levels 1 and 2 as beginners, Levels 3 and 4 as…
Descriptors: Language Proficiency, Language Tests, Chinese, Decision Making