Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 4 |
Since 2016 (last 10 years) | 10 |
Since 2006 (last 20 years) | 95 |
Descriptor
Construct Validity | 165 |
Test Validity | 165 |
Test Reliability | 63 |
Factor Analysis | 39 |
Psychometrics | 38 |
Test Construction | 37 |
Measures (Individuals) | 32 |
Predictive Validity | 24 |
Scores | 24 |
Factor Structure | 22 |
Foreign Countries | 21 |
More ▼ |
Source
Author
Eaves, Ronald C. | 3 |
Merrell, Kenneth W. | 3 |
Tindal, Gerald | 3 |
Williams, Thomas O., Jr. | 3 |
Alonzo, Julie | 2 |
Anderson, Daniel | 2 |
Brown, Ted | 2 |
Lai, Cheng-Fei | 2 |
Lowe, Patricia A. | 2 |
Maes, Bea | 2 |
Nese, Joseph F. T. | 2 |
More ▼ |
Publication Type
Reports - Evaluative | 165 |
Journal Articles | 140 |
Speeches/Meeting Papers | 10 |
Information Analyses | 8 |
Tests/Questionnaires | 4 |
Numerical/Quantitative Data | 3 |
Reports - Research | 3 |
Opinion Papers | 2 |
Dissertations/Theses -… | 1 |
Education Level
Audience
Researchers | 7 |
Practitioners | 1 |
Teachers | 1 |
Location
Netherlands | 4 |
Indiana | 3 |
South Africa | 3 |
China | 2 |
Germany | 2 |
Australia | 1 |
Brazil | 1 |
Burundi | 1 |
Canada | 1 |
Finland | 1 |
Hawaii | 1 |
More ▼ |
Laws, Policies, & Programs
Improving Americas Schools… | 1 |
No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Lauwaert, Pieter – Studies in Applied Linguistics & TESOL, 2023
The way in which validity has been conceptualized has changed throughout the years. The focus in validation studies shifted from evaluating distinct components of validity to developing a comprehensive argument for the use and interpretations of test scores. The argument-based approach to validity incorporates the distinct types of the…
Descriptors: Language Tests, Test Validity, Test Use, Construct Validity
Aryadoust, Vahid – Language Testing, 2023
Construct validity and building validity arguments are some of the main challenges facing the language assessment community. The notion of construct validity and validity arguments arose from research in psychological assessment and developed into the gold standard of validation/validity research in language assessment. At a theoretical level,…
Descriptors: Testing Problems, Test Validity, Second Language Learning, Construct Validity
Crisp, Victoria – London Review of Education, 2017
This article discusses how comparability relates to current mainstream conceptions of validity, in the context of educational assessment. Relevant literature was used to consider the relationship between these concepts. The article concludes that, depending on the exact claims being made about the appropriate interpretations and uses of the…
Descriptors: Educational Assessment, Test Validity, Comparative Analysis, Scores
Christensen, Rhonda; Knezek, Gerald – Journal of Technology Education, 2022
This article describes the development and validation of an Innovation Attitude Survey (IAS) composed of 16 Likert-type items selected to measure middle school students' attitudes toward innovation and leadership in the advancement of new ideas. The goal of developing the IAS was to identify desirable dispositions that may be related to future…
Descriptors: Attitude Measures, Likert Scales, Test Construction, Test Validity
Weideman, Albert – Language Assessment Quarterly, 2022
This paper will deal, firstly, with the South African context, that cries out for attention to responsible language assessment. The renewed interest in language testing in South Africa is well illustrated in assessments of language ability for educational purposes generally, and more specifically in the assessment of academic literacy. Secondly,…
Descriptors: Foreign Countries, Language Tests, Testing, Academic Language
Toker, Deniz – TESL-EJ, 2019
The central purpose of this paper is to examine validity problems arising from the multiple-choice items and technical passages in the Test of English as a Foreign Language Internet-based Test (TOEFL iBT) reading section, primarily concentrating on construct-irrelevant variance (Messick, 1989). My personal TOEFL iBT experience, along with my…
Descriptors: English (Second Language), Language Tests, Second Language Learning, Computer Assisted Testing
Brown, Ted; Peres, Lisa – Journal of Occupational Therapy, Schools & Early Intervention, 2018
The "Motor-Free Visual Perception Test-fourth edition" (MVPT-4) is a revised version of the "Motor-Free Visual Perception Test-third edition." The MVPT-4 is used to assess the visual-perceptual ability of individuals aged 4.0 through 80+ years via a series of visual-perceptual tasks that do not require a motor response. Test…
Descriptors: Visual Perception, Vision Tests, Test Validity, Culture Fair Tests
Markus, Keith A. – Measurement: Interdisciplinary Research and Perspectives, 2014
Keith Marcus congratulates Almond et al. on an interesting article bringing together two topics that are important to the field of testing. He states that some aspects of the exposition came across as not yet fully developed, as if the manuscript had been hurried to press. In this commentary, he attempts to expand aspects of the article, which he…
Descriptors: Test Validity, Theory Practice Relationship, Observation, Educational Assessment
Foghahaee, Zahra – Language Teaching Research Quarterly, 2019
Reverse engineering (RE) can play an important role in the re-designing tests in L2 English. It can also enrich the aim of teaching the same as raising children through academic achievement. In addition, it can play a key role in helping students understand how much their test is valid by using Standard reverse engineering (SRE). This paper is a…
Descriptors: Language Tests, Second Language Learning, Second Language Instruction, English (Second Language)
Climie, Emma A.; Cadogan, Sarah; Goukon, Rina – Journal of Psychoeducational Assessment, 2014
The "Comprehensive Executive Function Inventory" (CEFI; Naglieri & Goldstein, 2013), published by Multi-Health Systems Inc. (MHS), is a new executive function (EF) rating scale for children and youth ages 5 to 18 years. The CEFI strives to accurately assess EF abilities based on self, parent, and teacher reports, and provides…
Descriptors: Executive Function, Cognitive Tests, Testing, Scoring
Elicited Imitation as a Measure of Second Language Proficiency: A Narrative Review and Meta-Analysis
Yan, Xun; Maeda, Yukiko; Lv, Jing; Ginther, April – Language Testing, 2016
Elicited imitation (EI) has been widely used to examine second language (L2) proficiency and development and was an especially popular method in the 1970s and early 1980s. However, as the field embraced more communicative approaches to both instruction and assessment, the use of EI diminished, and the construct-related validity of EI scores as a…
Descriptors: Second Language Learning, Language Proficiency, Meta Analysis, Effect Size
McCrimmon, Adam; Rostad, Kristin – Journal of Psychoeducational Assessment, 2014
This article reviews the "Autism Diagnostic Observation Schedule, Second Edition" (ADOS-2; Lord, Luyster, Gotham, & Guthrie, 2012; Lord, Rutter et al., 2012), a newly updated, semistructured, standardized measure of communication, social interaction, play/imagination, and restricted and/or repetitive behaviors published by Western…
Descriptors: Diagnostic Tests, Autism, Pervasive Developmental Disorders, Testing
Yanosky, Daniel J.; Schwanenflugel, Paula J.; Kamphaus, Randy W. – Journal of Psychoeducational Assessment, 2013
A 25 item short form of the Behavioral Assessment System for Children (BASC) Teacher Rating Scale--Preschool (TRS-P) was developed by the BASC authors to serve as an emotional/behavioral indicator for an academic intervention study targeting preschool-aged students. The BASC screener is thought to fulfill a need for an abbreviated behavior rating…
Descriptors: Behavior Rating Scales, Psychometrics, Preschool Teachers, Preschool Children
Brown, Anna; Maydeu-Olivares, Alberto – Psychological Methods, 2013
In multidimensional forced-choice (MFC) questionnaires, items measuring different attributes are presented in blocks, and participants have to rank order the items within each block (fully or partially). Such comparative formats can reduce the impact of numerous response biases often affecting single-stimulus items (aka rating or Likert scales).…
Descriptors: Test Validity, Item Response Theory, Scoring, Questionnaires
Ercikan, Kadriye; Oliveri, María Elena – Applied Measurement in Education, 2016
Assessing complex constructs such as those discussed under the umbrella of 21st century constructs highlights the need for a principled assessment design and validation approach. In our discussion, we made a case for three considerations: (a) taking construct complexity into account across various stages of assessment development such as the…
Descriptors: Evaluation Methods, Test Construction, Design, Scaling