Publication Date
In 2025 | 0 |
Since 2024 | 3 |
Since 2021 (last 5 years) | 5 |
Since 2016 (last 10 years) | 7 |
Since 2006 (last 20 years) | 20 |
Descriptor
Source
Author
Canivez, Gary L. | 2 |
Bailey, Jennifer | 1 |
Ben Seipel | 1 |
Beran, Tanya | 1 |
Brown, James Dean | 1 |
Carolin Hahnel | 1 |
Carter Grissom, Elizabeth | 1 |
Chen, Yi-Hsin | 1 |
Ching-Ni Hsieh | 1 |
Cid, Jaime | 1 |
Coe, Robert | 1 |
More ▼ |
Publication Type
Journal Articles | 18 |
Reports - Research | 8 |
Reports - Evaluative | 7 |
Reports - Descriptive | 5 |
Guides - Non-Classroom | 1 |
Numerical/Quantitative Data | 1 |
Education Level
Higher Education | 5 |
Elementary Secondary Education | 4 |
Postsecondary Education | 4 |
Early Childhood Education | 1 |
Elementary Education | 1 |
Preschool Education | 1 |
Secondary Education | 1 |
Audience
Teachers | 1 |
Laws, Policies, & Programs
Individuals with Disabilities… | 1 |
No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
ACTFL Oral Proficiency… | 1 |
SAT (College Admission Test) | 1 |
Test of English as a Foreign… | 1 |
Test of English for… | 1 |
Wechsler Intelligence Scale… | 1 |
What Works Clearinghouse Rating
Ching-Ni Hsieh – ETS Research Report Series, 2024
The TOEFL Junior® tests are designed to evaluate young language students' English reading, listening, speaking, and writing skills in an English-medium secondary instructional context. This paper articulates a validity argument constructed to support the use and interpretation of the TOEFL Junior test scores for the purpose of placement, progress…
Descriptors: English (Second Language), Language Tests, Second Language Learning, Scores
Frank Goldhammer; Ulf Kroehne; Carolin Hahnel; Johannes Naumann; Paul De Boeck – Journal of Educational Measurement, 2024
The efficiency of cognitive component skills is typically assessed with speeded performance tests. Interpreting only effective ability or effective speed as efficiency may be challenging because of the within-person dependency between both variables (speed-ability tradeoff, SAT). The present study measures efficiency as effective ability…
Descriptors: Timed Tests, Efficiency, Scores, Test Interpretation
Shadi Noroozi; Hossein Karami – Language Testing in Asia, 2024
Recently, psychometricians and researchers have voiced their concern over the exploration of language test items in light of Messick's validation framework. Validity has been central to test development and use; however, it has not received due attention in language tests having grave consequences for test takers. The present study sought to…
Descriptors: Foreign Countries, Doctoral Students, Graduate Students, Language Proficiency
Canivez, Gary L.; Watkins, Marley W.; McGill, Ryan J. – British Journal of Educational Psychology, 2019
Background: There is inadequate information regarding the factor structure of the Wechsler Intelligence Scale for Children -- Fifth UK Edition (WISC-V[superscript UK]; Wechsler, 2016a, Wechsler Intelligence Scale for Children-Fifth UK Edition, Harcourt Assessment, London, UK) to guide interpretation. Aims and methods: The WISC-V[superscript UK]…
Descriptors: Children, Intelligence Tests, Construct Validity, Factor Analysis
Schmidgall, Jonathan; Cid, Jaime; Carter Grissom, Elizabeth; Li, Lucy – ETS Research Report Series, 2021
The redesigned "TOEIC Bridge"® tests were designed to evaluate test takers' English listening, reading, speaking, and writing skills in the context of everyday adult life. In this paper, we summarize the initial validity argument that supports the use of test scores for the purpose of selection, placement, and evaluation of a test…
Descriptors: Language Tests, Second Language Learning, English (Second Language), Language Proficiency
Ben Seipel; Sarah E. Carlson; Virginia Clinton-Lisell; Mark L. Davison; Patrick C. Kennedy – Grantee Submission, 2022
Originally designed for students in Grades 3 through 5, MOCCA (formerly the Multiple-choice Online Causal Comprehension Assessment), identifies students who struggle with comprehension, and helps uncover why they struggle. There are many reasons why students might not comprehend what they read. They may struggle with decoding, or reading words…
Descriptors: Multiple Choice Tests, Computer Assisted Testing, Diagnostic Tests, Reading Tests
Sims, James M.; Kunnan, Antony John – Language Testing in Asia, 2016
Background: This study investigated the factor structure and factorial invariance of an English Placement Exam (EPE) from 1998 to 2011 to provide evidence for both the appropriateness of the test scores interpretations and for a validity argument. Methods: Test performance data collected from 38,632 freshmen non-English majors from a university in…
Descriptors: Language Tests, Student Placement, Second Language Learning, English (Second Language)
Kane, Michael T. – Journal of Educational Measurement, 2013
To validate an interpretation or use of test scores is to evaluate the plausibility of the claims based on the scores. An argument-based approach to validation suggests that the claims based on the test scores be outlined as an argument that specifies the inferences and supporting assumptions needed to get from test responses to score-based…
Descriptors: Test Interpretation, Validity, Scores, Test Use
Rix, Samantha – Journal on English Language Teaching, 2012
This paper examines the utilization of construct validity in formative assessment for classroom-based purposes. Construct validity pertains to the notion that interpretations are made by educators who analyze test scores during formative assessment. The purpose of this paper is to note the challenges that educators face when interpreting these…
Descriptors: Construct Validity, Formative Evaluation, Scores, Tests
Nolan, Meaghan M.; Beran, Tanya; Hecker, Kent G. – Statistics Education Research Journal, 2012
Students with positive attitudes toward statistics are likely to show strong academic performance in statistics courses. Multiple surveys measuring students' attitudes toward statistics exist; however, a comparison of the validity and reliability of interpretations based on their scores is needed. A systematic review of relevant electronic…
Descriptors: Student Attitudes, Statistics, Attitude Measures, Student Surveys
Murley, Lisa D.; Stobaugh, Rebecca; Jukes, Pamela; Tassell, Janet – Educational Renaissance, 2014
The purpose of this article is to provide an overview of the process used to examine the inter-rater reliability of the Teacher Work Sample (TWS) Scoring Rubric involved with the senior culminating experience for teacher candidates used at a large comprehensive university. The study compared holistic and analytic scores reported by Student Teacher…
Descriptors: Teacher Education, Interrater Reliability, Scoring Rubrics, Preservice Teachers
Hubley, Anita M.; Zumbo, Bruno D. – Social Indicators Research, 2011
The vast majority of measures have, at their core, a purpose of personal and social change. If test developers and users want measures to have personal and social consequences and impact, then it is critical to consider the consequences and side effects of measurement in the validation process itself. The consequential basis of test interpretation…
Descriptors: Construct Validity, Social Change, Measurement, Test Interpretation
Coe, Robert – Research Papers in Education, 2010
Much of the argument about comparability of examination standards is at cross-purposes; contradictory positions are in fact often both defensible, but they are using the same words to mean different things. To clarify this, two broad conceptualisations of standards can be identified. One sees the standard in the observed phenomena of performance…
Descriptors: Foreign Countries, Tests, Evaluation Methods, Standards
Canivez, Gary L.; Konold, Timothy R.; Collins, Jason M.; Wilson, Greg – School Psychology Quarterly, 2009
The Wechsler Abbreviated Scale of Intelligence (WASI; Psychological Corporation, 1999) and the Wide Range Intelligence Test (WRIT; Glutting, Adams, & Sheslow, 2000) are two well-normed brief measures of general intelligence with subtests purportedly assessing verbal-crystallized abilities and nonverbal-fluid-visual abilities. With a sample of…
Descriptors: Construct Validity, Test Validity, Factor Structure, Intelligence Tests
Newton, Paul E. – Research Papers in Education, 2010
Robert Coe has claimed that three broad conceptions of comparability can be identified from the literature: performance, statistical and conventional. Each of these he rejected, in favour of a single, integrated conception which relies upon the notion of a "linking construct" and which he termed "construct comparability".…
Descriptors: Psychometrics, Measurement Techniques, Foreign Countries, Tests
Previous Page | Next Page »
Pages: 1 | 2