Publication Date
In 2025 | 0 |
Since 2024 | 3 |
Since 2021 (last 5 years) | 5 |
Since 2016 (last 10 years) | 7 |
Since 2006 (last 20 years) | 20 |
Descriptor
Construct Validity | 42 |
Test Interpretation | 42 |
Scores | 18 |
Test Validity | 13 |
Performance Based Assessment | 11 |
Test Construction | 11 |
Educational Assessment | 9 |
Foreign Countries | 8 |
Language Tests | 8 |
Test Items | 8 |
Test Use | 8 |
More ▼ |
Source
Author
Messick, Samuel | 6 |
Canivez, Gary L. | 2 |
Austin, James T. | 1 |
Bailey, Jennifer | 1 |
Banaji, Mahzarin R. | 1 |
Banta, Trudy W. | 1 |
Ben Seipel | 1 |
Beran, Tanya | 1 |
Brown, James Dean | 1 |
Carolin Hahnel | 1 |
Carter Grissom, Elizabeth | 1 |
More ▼ |
Publication Type
Education Level
Elementary Secondary Education | 5 |
Higher Education | 5 |
Postsecondary Education | 4 |
Early Childhood Education | 1 |
Elementary Education | 1 |
Preschool Education | 1 |
Secondary Education | 1 |
Audience
Researchers | 4 |
Practitioners | 2 |
Counselors | 1 |
Teachers | 1 |
Location
United Kingdom | 4 |
Taiwan | 2 |
Australia | 1 |
Connecticut | 1 |
Hawaii | 1 |
Illinois | 1 |
Iran (Tehran) | 1 |
Kentucky | 1 |
United States | 1 |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 2 |
Individuals with Disabilities… | 1 |
Assessments and Surveys
Test of English as a Foreign… | 2 |
ACTFL Oral Proficiency… | 1 |
Kaufman Assessment Battery… | 1 |
Minnesota Multiphasic… | 1 |
SAT (College Admission Test) | 1 |
Test of English for… | 1 |
Wechsler Intelligence Scale… | 1 |
What Works Clearinghouse Rating
Shadi Noroozi; Hossein Karami – Language Testing in Asia, 2024
Recently, psychometricians and researchers have voiced their concern over the exploration of language test items in light of Messick's validation framework. Validity has been central to test development and use; however, it has not received due attention in language tests having grave consequences for test takers. The present study sought to…
Descriptors: Foreign Countries, Doctoral Students, Graduate Students, Language Proficiency
Ching-Ni Hsieh – ETS Research Report Series, 2024
The TOEFL Junior® tests are designed to evaluate young language students' English reading, listening, speaking, and writing skills in an English-medium secondary instructional context. This paper articulates a validity argument constructed to support the use and interpretation of the TOEFL Junior test scores for the purpose of placement, progress…
Descriptors: English (Second Language), Language Tests, Second Language Learning, Scores
Frank Goldhammer; Ulf Kroehne; Carolin Hahnel; Johannes Naumann; Paul De Boeck – Journal of Educational Measurement, 2024
The efficiency of cognitive component skills is typically assessed with speeded performance tests. Interpreting only effective ability or effective speed as efficiency may be challenging because of the within-person dependency between both variables (speed-ability tradeoff, SAT). The present study measures efficiency as effective ability…
Descriptors: Timed Tests, Efficiency, Scores, Test Interpretation
Canivez, Gary L.; Watkins, Marley W.; McGill, Ryan J. – British Journal of Educational Psychology, 2019
Background: There is inadequate information regarding the factor structure of the Wechsler Intelligence Scale for Children -- Fifth UK Edition (WISC-V[superscript UK]; Wechsler, 2016a, Wechsler Intelligence Scale for Children-Fifth UK Edition, Harcourt Assessment, London, UK) to guide interpretation. Aims and methods: The WISC-V[superscript UK]…
Descriptors: Children, Intelligence Tests, Construct Validity, Factor Analysis
Schmidgall, Jonathan; Cid, Jaime; Carter Grissom, Elizabeth; Li, Lucy – ETS Research Report Series, 2021
The redesigned "TOEIC Bridge"® tests were designed to evaluate test takers' English listening, reading, speaking, and writing skills in the context of everyday adult life. In this paper, we summarize the initial validity argument that supports the use of test scores for the purpose of selection, placement, and evaluation of a test…
Descriptors: Language Tests, Second Language Learning, English (Second Language), Language Proficiency
Ben Seipel; Sarah E. Carlson; Virginia Clinton-Lisell; Mark L. Davison; Patrick C. Kennedy – Grantee Submission, 2022
Originally designed for students in Grades 3 through 5, MOCCA (formerly the Multiple-choice Online Causal Comprehension Assessment), identifies students who struggle with comprehension, and helps uncover why they struggle. There are many reasons why students might not comprehend what they read. They may struggle with decoding, or reading words…
Descriptors: Multiple Choice Tests, Computer Assisted Testing, Diagnostic Tests, Reading Tests
Kane, Michael T. – Journal of Educational Measurement, 2013
To validate an interpretation or use of test scores is to evaluate the plausibility of the claims based on the scores. An argument-based approach to validation suggests that the claims based on the test scores be outlined as an argument that specifies the inferences and supporting assumptions needed to get from test responses to score-based…
Descriptors: Test Interpretation, Validity, Scores, Test Use
Nolan, Meaghan M.; Beran, Tanya; Hecker, Kent G. – Statistics Education Research Journal, 2012
Students with positive attitudes toward statistics are likely to show strong academic performance in statistics courses. Multiple surveys measuring students' attitudes toward statistics exist; however, a comparison of the validity and reliability of interpretations based on their scores is needed. A systematic review of relevant electronic…
Descriptors: Student Attitudes, Statistics, Attitude Measures, Student Surveys
Rix, Samantha – Journal on English Language Teaching, 2012
This paper examines the utilization of construct validity in formative assessment for classroom-based purposes. Construct validity pertains to the notion that interpretations are made by educators who analyze test scores during formative assessment. The purpose of this paper is to note the challenges that educators face when interpreting these…
Descriptors: Construct Validity, Formative Evaluation, Scores, Tests
Sims, James M.; Kunnan, Antony John – Language Testing in Asia, 2016
Background: This study investigated the factor structure and factorial invariance of an English Placement Exam (EPE) from 1998 to 2011 to provide evidence for both the appropriateness of the test scores interpretations and for a validity argument. Methods: Test performance data collected from 38,632 freshmen non-English majors from a university in…
Descriptors: Language Tests, Student Placement, Second Language Learning, English (Second Language)
Canivez, Gary L.; Konold, Timothy R.; Collins, Jason M.; Wilson, Greg – School Psychology Quarterly, 2009
The Wechsler Abbreviated Scale of Intelligence (WASI; Psychological Corporation, 1999) and the Wide Range Intelligence Test (WRIT; Glutting, Adams, & Sheslow, 2000) are two well-normed brief measures of general intelligence with subtests purportedly assessing verbal-crystallized abilities and nonverbal-fluid-visual abilities. With a sample of…
Descriptors: Construct Validity, Test Validity, Factor Structure, Intelligence Tests
Murley, Lisa D.; Stobaugh, Rebecca; Jukes, Pamela; Tassell, Janet – Educational Renaissance, 2014
The purpose of this article is to provide an overview of the process used to examine the inter-rater reliability of the Teacher Work Sample (TWS) Scoring Rubric involved with the senior culminating experience for teacher candidates used at a large comprehensive university. The study compared holistic and analytic scores reported by Student Teacher…
Descriptors: Teacher Education, Interrater Reliability, Scoring Rubrics, Preservice Teachers
Hubley, Anita M.; Zumbo, Bruno D. – Social Indicators Research, 2011
The vast majority of measures have, at their core, a purpose of personal and social change. If test developers and users want measures to have personal and social consequences and impact, then it is critical to consider the consequences and side effects of measurement in the validation process itself. The consequential basis of test interpretation…
Descriptors: Construct Validity, Social Change, Measurement, Test Interpretation
Wiliam, Dylan – Review of Research in Education, 2010
The idea that validity should be considered a property of inferences, rather than of assessments, has developed slowly over the past century. In early writings about the validity of educational assessments, validity was defined as a property of an assessment. The most common definition was that an assessment was valid to the extent that it…
Descriptors: Educational Assessment, Validity, Inferences, Construct Validity
Bailey, Jennifer; Little, Chelsea; Rigney, Rex; Thaler, Anna; Weiderman, Ken; Yorkovich, Ben – Online Submission, 2010
This handbook is designed as a quick reference for first-year teachers who find themselves in an assessment driven environment with little experience to help make sense of the language, underlying philosophy, or organizational structure of the assessment system. The handbook begins with advice on developing and evaluating effective learning…
Descriptors: Student Evaluation, Portfolio Assessment, Elementary Secondary Education, Performance Based Assessment