Publication Date
In 2025 | 2 |
Since 2024 | 2 |
Since 2021 (last 5 years) | 4 |
Since 2016 (last 10 years) | 8 |
Since 2006 (last 20 years) | 16 |
Descriptor
Construct Validity | 31 |
Testing Problems | 31 |
Test Validity | 14 |
Language Tests | 10 |
Test Construction | 10 |
Foreign Countries | 7 |
Measurement Techniques | 7 |
Second Language Learning | 7 |
Educational Assessment | 6 |
Evaluation Methods | 6 |
Psychometrics | 6 |
Author
Afflerbach, Peter P. | 1 |
Alonzo, Alicia C. | 1 |
Aryadoust, Vahid | 1 |
Bao, Lei | 1 |
Bernal, Ernesto M. | 1 |
Brown, Nathaniel J. | 1 |
Croninger, Robert G. | 1 |
Endler, Norman S. | 1 |
Facione, Peter A. | 1 |
Finn, Bridgid | 1 |
Fraboni, Maryann | 1 |
Education Level
Elementary Secondary Education | 4 |
Elementary Education | 3 |
Higher Education | 3 |
Postsecondary Education | 2 |
Grade 5 | 1 |
Grade 6 | 1 |
Intermediate Grades | 1 |
Middle Schools | 1 |
Secondary Education | 1 |
Laws, Policies, & Programs
Individuals with Disabilities… | 1 |
No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
Leader Behavior Description… | 1 |
Pearson Test of English… | 1 |
SAT (College Admission Test) | 1 |
Test of English as a Foreign… | 1 |
Texas Assessment of Academic… | 1 |
Wechsler Adult Intelligence… | 1 |
Yi Zou; Ying Zheng; Jingwen Wang – International Journal of Language Testing, 2025
The Pearson Test of English Academic (PTE-A), a widely used high-stakes language proficiency test for university admissions and migration purposes, underwent a notable change from a three-hour to a two-hour version in November 2021. The implementation of the new version has prompted inquiries into the washback effects on various stakeholders.…
Descriptors: Testing Problems, Test Preparation, High Stakes Tests, English (Second Language)
Aryadoust, Vahid – Language Testing, 2023
Construct validity and building validity arguments are some of the main challenges facing the language assessment community. The notion of construct validity and validity arguments arose from research in psychological assessment and developed into the gold standard of validation/validity research in language assessment. At a theoretical level,…
Descriptors: Testing Problems, Test Validity, Second Language Learning, Construct Validity
Bao, Lei; Xiao, Yang; Koenig, Kathleen; Han, Jing – Physical Review Physics Education Research, 2018
In science, technology, engineering, and mathematics education there has been increased emphasis on teaching goals that include not only the learning of content knowledge but also the development of scientific reasoning skills. The Lawson classroom test of scientific reasoning (LCTSR) is a popular assessment instrument for scientific reasoning.…
Descriptors: Science Tests, Science Process Skills, Logical Thinking, Test Validity
Salmani Nodoushan, Mohammad Ali – Online Submission, 2021
This paper follows a line of logical argumentation to claim that what Samuel Messick conceptualized about construct validation has probably been misunderstood by some educational policy makers, practicing educators, and classroom teachers. It argues that, while Messick's unified theory of test validation aimed at (a) warning educational…
Descriptors: Construct Validity, Test Theory, Test Use, Affordances
Peiyu Wang; Liying Cheng – Critical Inquiry in Language Studies, 2025
This study employed a multi-methods design to investigate the impact of preparation on Chinese test-takers' perceptions of the integrated TOEFL iBT speaking and writing design. Combining results from over 1700 surveys and 10 interviews, it was found that these Chinese test-takers, who are the most vulnerable group in the multimillion testing…
Descriptors: Foreign Countries, Second Language Learning, English (Second Language), Language Tests
Khamis, Abdelmoneim Hassan Adam – English Language Teaching, 2019
This case study of Al-Aqsa School attempts to show that Cambridge Young Learners English Tests (YLE) may hurt learners' motivation and teaching process. The paper aims to highlight the adoption of Cambridge YLE proficiency tests and explore learners and teachers' perception of those tests. Also, providing alternative yardstick measures learners'…
Descriptors: Case Studies, Language Tests, Language Proficiency, Second Language Learning
Giraldo, Frank – HOW, 2019
The purpose of this article of reflection is to raise awareness of how poor design of language assessments may have detrimental effects, if crucial qualities and technicalities of test design are not met. The article first discusses these central qualities for useful language assessments. Then, guidelines for creating listening assessments, as an…
Descriptors: Test Construction, Consciousness Raising, Language Tests, Second Language Learning
Kirkpatrick, Robert; Hlaing, Hmone Lian – Language Testing in Asia, 2013
This study examines the English section of the university entrance examination in Myanmar in terms of validity, reliability, practicality, and washback. The study highlights the significance of the matriculation examination, evaluates individual test items, and presents the opinions of teachers and students about the test. The results reveal that…
Descriptors: Foreign Countries, College Entrance Examinations, Test Reliability, Test Items
Yastibas, Ahmet Erdost; Takkaç, Mehmet – Journal of Language and Linguistic Studies, 2018
Language assessment literacy has become a critical competence for a language teacher to have. Accordingly, there are many studies in the literature which have researched different aspects of language assessment literacy (i.e., language assessment training, professional development and language assessment literacy levels of language teachers).…
Descriptors: Language Tests, Language Teachers, College Faculty, English (Second Language)
Finn, Bridgid – ETS Research Report Series, 2015
There is a growing concern that when scores from low-stakes assessments are reported without considering student motivation as a construct of interest, biased conclusions about how much students know will result. Low motivation is a problem particularly relevant to low-stakes testing scenarios, which may be low stakes for the test taker but have…
Descriptors: Research Reports, Student Motivation, Self Disclosure (Individuals), Construct Validity
Kettler, Ryan J. – Review of Research in Education, 2015
This chapter introduces theory that undergirds the role of testing adaptations in assessment, provides examples of item modifications and testing accommodations, reviews research relevant to each, and introduces a new paradigm that incorporates opportunity to learn (OTL), academic enablers, testing adaptations, and inferences that can be made from…
Descriptors: Meta Analysis, Literature Reviews, Testing, Testing Accommodations
Brown, Nathaniel J.; Afflerbach, Peter P.; Croninger, Robert G. – Educational Psychology Review, 2014
National policy and standards documents, including the National Assessment of Educational Progress frameworks, the "Common Core State Standards" and the "Next Generation Science Standards," assert the need to assess critical-analytic thinking (CAT) across subject areas. However, assessment of CAT poses several challenges for…
Descriptors: Critical Thinking, Thinking Skills, National Standards, National Competency Tests
Wiliam, Dylan – Review of Research in Education, 2010
The idea that validity should be considered a property of inferences, rather than of assessments, has developed slowly over the past century. In early writings about the validity of educational assessments, validity was defined as a property of an assessment. The most common definition was that an assessment was valid to the extent that it…
Descriptors: Educational Assessment, Validity, Inferences, Construct Validity
Alonzo, Alicia C. – Measurement: Interdisciplinary Research and Perspectives, 2007
Schilling et al. (this issue) have done a commendable job in illustrating a comprehensive process of validating assessments of teacher knowledge (and, more broadly, other types of tests as well). On one hand, the concrete illustration of a process that often remains murky and incomplete is profoundly heartening, as it provides a rigorous model for…
Descriptors: Mathematics Education, Teacher Characteristics, Mathematics Instruction, Knowledge Base for Teaching
Scholz, George E. – 1993
A discussion of language testing in the context of a program in English for Special Purposes (ESP) focuses on the lack of "fit" between the two areas and makes some recommendations for improvement. It begins with overviews of recent trends in testing and recent issues in ESP. Overlap is seen in two areas: construct and content validity. It is…
Descriptors: Construct Validity, Content Validity, Curriculum Design, English for Special Purposes