Publication Date
| In 2026 | 0 |
| Since 2025 | 10 |
| Since 2022 (last 5 years) | 27 |
| Since 2017 (last 10 years) | 56 |
| Since 2007 (last 20 years) | 106 |
Descriptor
| Test Interpretation | 963 |
| Test Validity | 963 |
| Test Reliability | 375 |
| Test Construction | 252 |
| Testing | 169 |
| Scores | 158 |
| Testing Problems | 157 |
| Standardized Tests | 152 |
| Elementary Secondary Education | 148 |
| Achievement Tests | 124 |
| Test Results | 119 |
| More ▼ | |
Source
Author
Publication Type
Education Level
| Higher Education | 22 |
| Postsecondary Education | 20 |
| Elementary Secondary Education | 18 |
| Elementary Education | 8 |
| Secondary Education | 8 |
| High Schools | 3 |
| Middle Schools | 3 |
| Grade 12 | 2 |
| Grade 4 | 2 |
| Grade 5 | 2 |
| Grade 8 | 2 |
| More ▼ | |
Audience
| Practitioners | 60 |
| Researchers | 35 |
| Teachers | 21 |
| Administrators | 8 |
| Students | 7 |
| Counselors | 4 |
| Parents | 3 |
| Policymakers | 3 |
| Community | 1 |
Location
| Australia | 10 |
| Canada | 8 |
| California | 7 |
| United Kingdom | 7 |
| United States | 6 |
| New York | 5 |
| Germany | 4 |
| Israel | 4 |
| United Kingdom (England) | 4 |
| China | 3 |
| Illinois | 3 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Jacqueline Raymond; David Wei Dai; Sue McAllister – Advances in Health Sciences Education, 2025
There is increasing interest in health professions education (HPE) in applying argument-based validity approaches, such as Kane's, to assessment design. The critical first step in employing Kane's approach is to specify the interpretation-use argument (IUA). However, in the HPE literature, this step is often poorly articulated. This article…
Descriptors: Allied Health Occupations Education, Test Interpretation, Test Construction, Inferences
Folger, Timothy D.; Bostic, Jonathan; Krupa, Erin E. – Educational Measurement: Issues and Practice, 2023
Validity is a fundamental consideration of test development and test evaluation. The purpose of this study is to define and reify three key aspects of validity and validation, namely test-score interpretation, test-score use, and the claims supporting interpretation and use. This study employed a Delphi methodology to explore how experts in…
Descriptors: Test Interpretation, Scores, Test Use, Test Validity
Yuriko K. Sosa Paredes; Björn Andersson – Educational Assessment, Evaluation and Accountability, 2025
In international large-scale assessments, student performance comparisons across educational systems are frequently done to assess the state and development in different domains. These results often have a large impact on educational policy and on the perceptions of an educational system's performance. Early assessments, such as the First and…
Descriptors: Test Interpretation, International Assessment, Science Tests, Scores
Philipp Sterner; Kim De Roover; David Goretzko – Structural Equation Modeling: A Multidisciplinary Journal, 2025
When comparing relations and means of latent variables, it is important to establish measurement invariance (MI). Most methods to assess MI are based on confirmatory factor analysis (CFA). Recently, new methods have been developed based on exploratory factor analysis (EFA); most notably, as extensions of multi-group EFA, researchers introduced…
Descriptors: Error of Measurement, Measurement Techniques, Factor Analysis, Structural Equation Models
Cara Cahalan Laitusis; Meagan Karvonen – Educational Measurement: Issues and Practice, 2025
The 2014 "Standards for Educational and Psychological Testing" describe universal design as an approach that offers promise for improving the fairness of educational assessments. As the field reconsiders questions of fairness in assessments, we propose a new framework that addresses the entire assessment lifecycle: universal design of…
Descriptors: Educational Assessment, Access to Education, Systems Approach, Psychological Needs
Sunghee Choi – ProQuest LLC, 2022
Traditionally, most autism assessment instruments are based on medical models and designed to identify social communication deficits and behavioral abnormality in an individual. However, as more autistic narratives reveal the insider views of autists, some scholars and autistic activists support the neurodiversity model and assert the acceptance…
Descriptors: Test Validity, Autism Spectrum Disorders, Disability Identification, Adults
Kent Anderson Seidel – School Leadership Review, 2025
This paper examines one of three central diagnostic tools of the Concerns Based Adoption Model, the Stages of Concern Questionnaire (SoCQ). The SoCQ was developed with a focus on K12 education. It has been used widely since developed in 1973, in early childhood, higher education, medical, business, community, and military settings. The SoCQ…
Descriptors: Questionnaires, Educational Change, Educational Innovation, Intervention
Kuhn, Melissa Gayle – ProQuest LLC, 2022
Validity in psychometrics refers to the degree to which evidence and theory supports the interpretations drawn from a test, and Messick's Contemporary Validity Theory (1994) includes several facets with well-established evidence collection methods. However, there is a lack of consensus on appropriate methods of evaluating the facet of…
Descriptors: Test Validity, Psychometrics, Test Interpretation, Scores
Kylie Gorney; Sandip Sinharay – Journal of Educational Measurement, 2025
Although there exists an extensive amount of research on subscores and their properties, limited research has been conducted on categorical subscores and their interpretations. In this paper, we focus on the claim of Feinberg and von Davier that categorical subscores are useful for remediation and instructional purposes. We investigate this claim…
Descriptors: Tests, Scores, Test Interpretation, Alternative Assessment
Venessa F. Manna; Shuhong Li; Spiros Papageorgiou; Lixiong Gu – ETS Research Report Series, 2025
This technical manual describes the purpose and intended uses of the TOEFL iBT test, its target test-taker population, and relevant language use domains. The test design and scoring procedures are presented first, followed by a research agenda intended to support the interpretation and use of test scores. Given the updates to the test starting…
Descriptors: Second Language Learning, English (Second Language), Language Tests, Test Construction
Chao Han; Binghan Zheng; Mingqing Xie; Shirong Chen – Interpreter and Translator Trainer, 2024
Human raters' assessment of interpreting is a complex process. Previous researchers have mainly relied on verbal reports to examine this process. To advance our understanding, we conducted an empirical study, collecting raters' eye-movement and retrospection data in a computerised interpreting assessment in which three groups of raters (n = 35)…
Descriptors: Foreign Countries, College Students, College Graduates, Interrater Reliability
Gorney, Kylie – ProQuest LLC, 2023
Aberrant behavior refers to any type of unusual behavior that would not be expected under normal circumstances. In educational and psychological testing, such behaviors have the potential to severely bias the aberrant examinee's test score while also jeopardizing the test scores of countless others. It is therefore crucial that aberrant examinees…
Descriptors: Behavior Problems, Educational Testing, Psychological Testing, Test Bias
Ing, Marsha; Chinen, Starlie; Jackson, Kara; Smith, Thomas M. – Educational Measurement: Issues and Practice, 2021
Despite the ease of accessing a wide range of measures, little attention is given to validity arguments when considering whether to use the measure for a new purpose or in a different context. Making a validity argument has historically focused on the intended interpretation and use. There has been a press to consider both the intended and actual…
Descriptors: Instructional Improvement, Measures (Individuals), Test Validity, Test Interpretation
Marta Godoy-Giménez; Ángel García-Pérez; Fernando Cañadas; Angeles F. Estévez; Pablo Sayans-Jiménez – Autism: The International Journal of Research and Practice, 2024
The broad autism phenotype is the phenotypic expression of the primary characteristics of autism. However, currently available tests do not agree with the two-domain operationalization of broad autism phenotype or autism, and their internal structure has shown instability across applications. This study presents the Broad Autism…
Descriptors: Autism Spectrum Disorders, Genetics, Diagnostic Tests, Foreign Countries
Lyrica Lucas; Anum Khushal; Robert Mayes; Brian A. Couch; Joseph Dauer – International Journal of Science Education, 2025
Educational reform priorities such as emphasis on quantitative modelling (QM) have positioned undergraduate biology instructors as designers of QM experiences to engage students in authentic science practices that support the development of data-driven and evidence-based reasoning. Yet, little is known about how biology instructors adapt to the…
Descriptors: Undergraduate Students, College Science, Biology, Classroom Observation Techniques

Peer reviewed
Direct link
