Publication Date
In 2025 | 1 |
Since 2024 | 2 |
Since 2021 (last 5 years) | 4 |
Since 2016 (last 10 years) | 19 |
Since 2006 (last 20 years) | 32 |
Descriptor
Test Reliability | 313 |
Test Use | 313 |
Test Validity | 295 |
Test Construction | 121 |
Elementary Secondary Education | 57 |
Psychometrics | 49 |
Higher Education | 48 |
Evaluation Methods | 45 |
Foreign Countries | 42 |
Student Evaluation | 38 |
Scoring | 37 |
Education Level
Higher Education | 9 |
Postsecondary Education | 9 |
Elementary Education | 7 |
Early Childhood Education | 5 |
Secondary Education | 5 |
Elementary Secondary Education | 4 |
Grade 3 | 4 |
Grade 4 | 4 |
Grade 5 | 4 |
Grade 6 | 4 |
Grade 7 | 4 |
Audience
Practitioners | 33 |
Teachers | 11 |
Administrators | 7 |
Researchers | 7 |
Students | 6 |
Parents | 5 |
Community | 2 |
Policymakers | 2 |
Counselors | 1 |
Support Staff | 1 |
Location
Australia | 10 |
Canada | 5 |
New York | 5 |
Georgia | 2 |
Hong Kong | 2 |
Israel | 2 |
Massachusetts | 2 |
Michigan | 2 |
New Jersey | 2 |
United Kingdom | 2 |
United Kingdom (Great Britain) | 2 |
Laws, Policies, & Programs
Education Consolidation… | 2 |
Elementary and Secondary… | 1 |
Every Student Succeeds Act… | 1 |
Individuals with Disabilities… | 1 |
Individuals with Disabilities… | 1 |
No Child Left Behind Act 2001 | 1 |
Alatli, Betül – International Journal of Curriculum and Instruction, 2022
This study was conducted to review the use of tests. For this purpose, 45 articles in which the Turkish form of the "Test Anxiety Inventory (TAI)," which is one of the tests frequently used in the field of education, was employed and that were published between 2000 and 2020 were examined in terms of factors that should be considered in…
Descriptors: Anxiety, Likert Scales, Test Anxiety, Test Reliability
Shun-Fu Hu; Amery D. Wu; Jake Stone – Journal of Educational Measurement, 2025
Scoring high-dimensional assessments (e.g., > 15 traits) can be a challenging task. This paper introduces the multilabel neural network (MNN) as a scoring method for high-dimensional assessments. Additionally, it demonstrates how MNN can score the same test responses to maximize different performance metrics, such as accuracy, recall, or…
Descriptors: Tests, Testing, Scores, Test Construction
Articulating and Evaluating Validity Arguments for the "TOEIC"® Tests. Research Report. ETS RR-17-51
Schmidgall, Jonathan E. – ETS Research Report Series, 2017
This report provides a brief overview of how the "TOEIC"® program has adopted an argument-based approach to validity in order to support the use of the TOEIC tests. This approach emphasizes the need to explicitly state claims about the measurement quality and intended use of a test and to support those claims with evidence. This report…
Descriptors: English (Second Language), Language Tests, Second Language Learning, Test Use
Kotowicz, Justyna; Woll, Bencie; Herman, Rosalind – Language Testing, 2021
The evaluation of sign language proficiency needs to be based on measures with well-established psychometric properties. To date, no valid and reliable test is available to assess Polish Sign Language ("Polski Język Migowy," PJM) skills in deaf children. Hence, our aim with this study was to adapt the British Sign Language Receptive…
Descriptors: Language Tests, Receptive Language, Sign Language, Language Proficiency
Rehfeld, David M.; Padgett, R. Noah – Journal of Psychoeducational Assessment, 2019
This article presents a review of the Comprehensive Assessment of Spoken Language--Second Edition (CASL-2), in which reliability, utility, and validity are analyzed and discussed. Some limited recommendations for practice are made based on a review of the information provided by the publisher for clinicians.
Descriptors: Oral Language, Language Tests, Receptive Language, Expressive Language
Patrick Kyllonen; Amit Sevak; Teresa Ober; Ikkyu Choi; Jesse Sparks; Daniel Fishtein – ETS Research Report Series, 2024
Assessment refers to a broad array of approaches for measuring or evaluating a person's (or group of persons') skills, behaviors, dispositions, or other attributes. Assessments range from standardized tests used in admissions, employee selection, licensure examinations, and domestic and international large-scale assessments of cognitive and…
Descriptors: Assessment Literacy, Testing, Test Bias, Test Construction
Al-Owidha, Amjed A. – Language Testing in Asia, 2018
Background: This study investigated the psychometric properties of the recently developed Qiyas for L1 Arabic language test using a Rasch measurement framework. Methods: Responses from 271 examinees were analyzed in this study. The test is hypothesized to involve one dominant factor that assesses four skills: reading comprehension, rhetorical…
Descriptors: Semitic Languages, Language Tests, Psychometrics, Reading Comprehension
Haider, Muhammad Qadeer – ProQuest LLC, 2019
Inquiry-oriented teaching is a specific form of active learning gaining popularity in teaching communities. The goal of inquiry-oriented classes is to help students in gaining a conceptual understanding of the material. My research focus is to gauge students' performance and conceptual understanding in inquiry-oriented linear algebra classes. This…
Descriptors: Mathematics Tests, Test Construction, Test Validity, Test Reliability
Flett, Gordon L.; Nepon, Taryn; Hewitt, Paul L.; Zaki-Azat, Justeena; Rose, Alison L.; Swiderski, Kristina – Journal of Psychoeducational Assessment, 2020
In the current article, we describe the development and validation of the Mistake Rumination Scale as a supplement to existing trait and cognitive measures of perfectionism. The Mistake Rumination Scale is a seven-item inventory that taps the tendency to ruminate about a past personal mistake. Psychometric analyses confirmed that the Mistake…
Descriptors: Personality Traits, Cognitive Processes, Test Construction, Cognitive Tests
Kruyen, Peter M.; Emons, Wilco H. M.; Sijtsma, Klaas – International Journal of Testing, 2013
To efficiently assess multiple psychological constructs and to minimize the burden on respondents, psychologists increasingly use shortened versions of existing tests. However, compared to the longer test, a shorter test version may have a substantial impact on the reliability and the validity of the test scores in psychological research and…
Descriptors: Test Length, Psychological Testing, Test Use, Test Validity
Sabol, F. Robert – National Art Education Association, 2018
This White Paper provides a selection of some general principles of assessment or overarching ideas that may guide educators in selecting, developing, and implementing assessments of students' learning at all instructional levels or educational settings in which they are used. These principles represent a framework for understanding the nature of…
Descriptors: Visual Arts, Art Education, Educational Principles, Student Evaluation
Allen, Jeff M.; Mattern, Krista – ACT, Inc., 2019
States and districts have expressed interest in administering the ACT® to 10th-grade students. Given that the ACT was designed to be administered in the spring of 11th grade or fall of 12th grade, the appropriateness of this use should be evaluated. As such, the focus of this paper is to summarize empirical evidence evaluating the use of the ACT…
Descriptors: Test Validity, College Entrance Examinations, High School Students, Grade 10
McClellan, Catherine; Snyder, Rebecca; Woods-Murphy, Maryann; Basset, Katherine – National Network of State Teachers of the Year, 2018
Great teachers recognize great assessments. As policy and education leaders work to make sure state tests are measuring the problem-solving, writing, and critical-thinking skills students need for success, they should convene and rely on teachers to review test quality and help answer the question: Do the questions on our state test reflect…
Descriptors: Student Evaluation, Educational Quality, Standardized Tests, Test Items
Hassan, Nurul Huda; Shih, Chih-Min – Language Assessment Quarterly, 2013
This article describes and reviews the Singapore-Cambridge General Certificate of Education Advanced Level General Paper (GP) examination. As a written test that is administered to preuniversity students, the GP examination is internationally recognised and accepted by universities and employers as proof of English competence. In this article, the…
Descriptors: Foreign Countries, College Entrance Examinations, English (Second Language), Writing Tests
Rahn, Rhonda N.; Pruitt, Buster; Goodson, Patricia – Journal of American College Health, 2016
Objective: To analyze the literature in which researchers have utilized the National College Health Assessment (NCHA) I or the NCHA II. Participants and Methods: The authors selected peer-reviewed articles published between 2004 and July 2013 utilizing a single search term: National College Health Assessment. Articles were assessed for instrument…
Descriptors: Literature Reviews, College Students, Health, National Surveys