Andrew P. Jaciw – American Journal of Evaluation, 2025
By design, randomized experiments (XPs) rule out bias from confounded selection of participants into conditions. Quasi-experiments (QEs) are often considered second-best because they do not share this benefit. However, when results from XPs are used to generalize causal impacts, the benefit from unconfounded selection into conditions may be offset…
Descriptors: Elementary School Students, Elementary School Teachers, Generalization, Test Bias
Schmitt, Norbert; Nation, Paul; Kremmel, Benjamin – Language Teaching, 2020
Recently, a large number of vocabulary tests have been made available to language teachers, testers, and researchers. Unfortunately, most of them have been launched with inadequate validation evidence. The field of language testing has become increasingly more rigorous in the area of test validation, but developers of vocabulary tests have…
Descriptors: Test Construction, Test Validity, Language Tests, Test Use
Shun-Fu Hu; Amery D. Wu; Jake Stone – Journal of Educational Measurement, 2025
Scoring high-dimensional assessments (e.g., > 15 traits) can be a challenging task. This paper introduces the multilabel neural network (MNN) as a scoring method for high-dimensional assessments. Additionally, it demonstrates how MNN can score the same test responses to maximize different performance metrics, such as accuracy, recall, or…
Descriptors: Tests, Testing, Scores, Test Construction
Im, Gwan-Hyeok; Shin, Dongil; Park, Soohyeon – Current Issues in Language Planning, 2022
This study suggests a conceptual framework for policy-driven test development and validation, using the Test of Proficiency in Korean (TOPIK) as an example context. By linking the literature on policy analysis and argument structure in the validation of testing, the strong relationships between policy and testing are illustrated. This rationalizes…
Descriptors: Language Proficiency, Language Tests, Korean, Test Construction
Gopal Prasad Pandey – Journal of Practical Studies in Education, 2024
This paper explores the role of language testing in English education, focusing on its theoretical foundations, methodologies and practical applications. It analyzes how language tests fulfill various purposes, such as placement, progress monitoring, achievement evaluation and diagnostic feedback, underlining the importance of a critical…
Descriptors: Language Tests, English (Second Language), Second Language Learning, Second Language Instruction
Papageorgiou, Spiros; Davis, Larry; Norris, John M.; Garcia Gomez, Pablo; Manna, Venessa F.; Monfils, Lora – Educational Testing Service, 2021
The TOEFL® Essentials™ test is a new English language proficiency test in the TOEFL® family of assessments. It measures foundational language skills and communication abilities in academic and general (daily life) contexts. The test covers the four language skills of reading, listening, writing, and speaking and is intended…
Descriptors: Language Tests, English (Second Language), Second Language Learning, Language Proficiency
Salmani Nodoushan, Mohammad Ali – Online Submission, 2020
Language testing has witnessed three major trends in the 1990s: theoretical, methodological, and analytical. Theoretically, emphasis has been placed on the further understanding of the construct of language proficiency. Methodologically, there has been an outburst of interest in language performance testing and the promotion of the professional…
Descriptors: Test Construction, Test Use, Psychometrics, Item Response Theory
Patrick Kyllonen; Amit Sevak; Teresa Ober; Ikkyu Choi; Jesse Sparks; Daniel Fishtein – ETS Research Report Series, 2024
Assessment refers to a broad array of approaches for measuring or evaluating a person's (or group of persons') skills, behaviors, dispositions, or other attributes. Assessments range from standardized tests used in admissions, employee selection, licensure examinations, and domestic and international large-scale assessments of cognitive and…
Descriptors: Assessment Literacy, Testing, Test Bias, Test Construction
Gafni, Naomi – Assessment in Education: Principles, Policy & Practice, 2016
Naomi Gafni, director of Research and Development, National Institute for Testing and Evaluation, Jerusalem, Israel, has devoted a substantial part of her career to the development of admissions tests and other educational tests and to the investigation of their validity. As such she is keenly aware of the complexities involved in this process.…
Descriptors: Test Validity, Test Interpretation, Test Use, Test Construction
Haider, Muhammad Qadeer – ProQuest LLC, 2019
Inquiry-oriented teaching is a specific form of active learning gaining popularity in teaching communities. The goal of inquiry-oriented classes is to help students in gaining a conceptual understanding of the material. My research focus is to gauge students' performance and conceptual understanding in inquiry-oriented linear algebra classes. This…
Descriptors: Mathematics Tests, Test Construction, Test Validity, Test Reliability
College Board, 2023
Over the past several years, content experts, psychometricians, and researchers have been hard at work developing, refining, and studying the digital SAT. The work is grounded in foundational best practices and advances in measurement and assessment design, with fairness for students informing all of the work done. This paper shares learnings from…
Descriptors: College Entrance Examinations, Psychometrics, Computer Assisted Testing, Best Practices
Flett, Gordon L.; Nepon, Taryn; Hewitt, Paul L.; Zaki-Azat, Justeena; Rose, Alison L.; Swiderski, Kristina – Journal of Psychoeducational Assessment, 2020
In the current article, we describe the development and validation of the Mistake Rumination Scale as a supplement to existing trait and cognitive measures of perfectionism. The Mistake Rumination Scale is a seven-item inventory that taps the tendency to ruminate about a past personal mistake. Psychometric analyses confirmed that the Mistake…
Descriptors: Personality Traits, Cognitive Processes, Test Construction, Cognitive Tests
Lehane, Paula; Scully, Darina; O'Leary, Michael – Irish Educational Studies, 2022
In line with the widespread proliferation of digital technology in everyday life, many countries are now beginning to use computer-based exams (CBEs) in their post-primary education systems. To ensure that these CBEs are delivered in a manner that preserves their fairness, validity, utility and credibility, several factors pertaining to their…
Descriptors: Computer Assisted Testing, Secondary School Students, Culture Fair Tests, Test Validity
Torrance, Harry – British Journal of Educational Studies, 2018
There are sound educational and examining reasons for the use of coursework assessment and practical assessment of student work by teachers in schools for purposes of reporting examination grades. Coursework and practical work test a range of different curriculum goals to final papers and increase the validity and reliability of the result.…
Descriptors: Foreign Countries, National Curriculum, Achievement Tests, Accountability
Goldstein, Harvey – Assessment in Education: Principles, Policy & Practice, 2015
The term "validity" is one of the most important and one of the most debated concepts in educational measurement. In this paper, I argue that various different approaches can all be viewed from an associational perspective. I also argue that our understanding will be enhanced by adopting some basic ideas of scientific reasoning to the…
Descriptors: Educational Assessment, Test Validity, Scientific Principles, Test Construction