Publication Date
In 2025 | 0 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 2 |
Since 2016 (last 10 years) | 4 |
Since 2006 (last 20 years) | 12 |
Descriptor
Construct Validity | 18 |
Test Content | 18 |
Test Construction | 11 |
Test Items | 8 |
Test Validity | 7 |
Content Validity | 5 |
Scores | 5 |
Test Reliability | 5 |
Testing | 5 |
Foreign Countries | 4 |
Scoring | 4 |
More ▼ |
Source
Author
Sireci, Stephen G. | 2 |
Bell, Karen N. | 1 |
Chang, Chung-Yen | 1 |
Cho, Daeyeon | 1 |
Choi, Bo Young | 1 |
Deng, Hui | 1 |
Freedle, Roy | 1 |
Fu, Hsieh-Hai | 1 |
Geisinger, Kurt F. | 1 |
Gorin, Joanna S. | 1 |
Gregoire, Jacques | 1 |
More ▼ |
Publication Type
Journal Articles | 13 |
Reports - Research | 12 |
Speeches/Meeting Papers | 5 |
Reports - Evaluative | 3 |
Tests/Questionnaires | 3 |
Dissertations/Theses -… | 1 |
Information Analyses | 1 |
Opinion Papers | 1 |
Reports - Descriptive | 1 |
Education Level
Higher Education | 5 |
Postsecondary Education | 4 |
Elementary Secondary Education | 2 |
High Schools | 2 |
Secondary Education | 2 |
Audience
Laws, Policies, & Programs
Individuals with Disabilities… | 1 |
No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
SAT (College Admission Test) | 2 |
General Social Survey | 1 |
Raven Progressive Matrices | 1 |
Test of English as a Foreign… | 1 |
Wechsler Adult Intelligence… | 1 |
Wechsler Intelligence Scale… | 1 |
Work Keys (ACT) | 1 |
What Works Clearinghouse Rating
Sarah K. Cowan; Michael Hout; Stuart Perrett – Sociological Methods & Research, 2024
Long-running surveys need a systematic way to reflect social change and to keep items relevant to respondents, especially when they ask about controversial subjects, or they threaten the items' validity. We propose a protocol for updating measures that preserves content and construct validity. First, substantive experts articulate the current and…
Descriptors: Surveys, Public Opinion, Social Attitudes, Pregnancy
Weiss, Lawrence G.; Gregoire, Jacques; Zhu, Jianjun – Journal of Psychoeducational Assessment, 2016
Many Flynn effect (FE) studies compare scores across different editions of Wechsler's IQ tests. When construct changes are introduced by the test developers in the new edition, however, the presumed generational effects are difficult to untangle from changes due to test content. To remove this confound, we use the same edition of Wechsler…
Descriptors: Generational Differences, Intelligence Tests, Comparative Analysis, Scores
Slater, Stephanie J. – Journal of Astronomy & Earth Sciences Education, 2014
The Test Of Astronomy STandards (TOAST) is a comprehensive assessment instrument designed to measure students' general astronomy content knowledge. Built upon the research embedded within a generation of astronomy assessments designed to measure single concepts, the TOAST is appropriate to measure across an entire astronomy course. The TOAST's…
Descriptors: Astronomy, Academic Standards, Science Tests, Test Content
Continual Improvement of a Student Evaluation of Teaching over Seven Semesters at a State University
Rates, Christopher; Liu, Xiufeng; Vanzile-Tamzen, Carol; Morreale, Cathleen – AERA Online Paper Repository, 2017
In the fall of 2014, the University at Buffalo created a new universal Student Evaluation of Teaching (SET). The purpose of the present study was to establish the construct validity of SET items. Rasch analyses of data from 7 semesters (N=203,194 students) revealed problems with item fit indices and threshold distances. Changes to items and…
Descriptors: Student Evaluation of Teacher Performance, State Universities, College Students, Teacher Effectiveness
Hsiao, Chien-Hua; Wu, Ying-Tien; Lin, Chung-Yen; Wong, Terrence William; Fu, Hsieh-Hai; Yeh, Ting-Kuang; Chang, Chung-Yen – Learning Environments Research, 2014
This study aimed to develop an instrument, named the inquiry-based laboratory classroom environment instrument (ILEI), for assessing senior high-school science students' preferred and perceived laboratory environment. A total of 262 second-year students, from a senior-high school in Taiwan, were recruited for this study. Four stages were included…
Descriptors: Test Construction, Science Laboratories, Inquiry, Science Instruction
Lee, Kwangmin; Ye, Yafei – Language Education & Assessment, 2021
The aim of this mixed methods study is to identify the underlying structure of the construct of Foreign Language Anxiety in integrated listening-to-speak tasks. First, the analysis of the qualitative interviews with six postsecondary ESL learners reveals that anxiety for integrated speaking stems from four different factors: "listening,"…
Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, Factor Analysis
Kettler, Ryan J. – Review of Research in Education, 2015
This chapter introduces theory that undergirds the role of testing adaptations in assessment, provides examples of item modifications and testing accommodations, reviews research relevant to each, and introduces a new paradigm that incorporates opportunity to learn (OTL), academic enablers, testing adaptations, and inferences that can be made from…
Descriptors: Meta Analysis, Literature Reviews, Testing, Testing Accommodations
Kumazawa, Takaaki – ProQuest LLC, 2011
Although classroom assessment is one of the most frequent practices carried out by teachers in all educational programs, limited research has been conducted to investigate the dependability and validity of criterion-referenced tests (CRTs). The main purpose of this study is to develop a criterion-referenced test for first-year Japanese university…
Descriptors: Criterion Referenced Tests, Test Construction, Test Validity, English (Second Language)
Choi, Bo Young; Park, Heerak; Nam, Suk Kyung; Lee, Jayoung; Cho, Daeyeon; Lee, Sang Min – Career Development Quarterly, 2011
The purpose of this study was to develop a Korean College Stress Inventory (KCSI), which is designed to measure Korean college students' experiences and symptoms of career stress. Even though there have been numerous scales related to career issues, few scales measure the career stress construct and its dimensions. Factor structure, internal…
Descriptors: College Students, Factor Structure, Psychometrics, Stress Variables
Wiliam, Dylan – Review of Research in Education, 2010
The idea that validity should be considered a property of inferences, rather than of assessments, has developed slowly over the past century. In early writings about the validity of educational assessments, validity was defined as a property of an assessment. The most common definition was that an assessment was valid to the extent that it…
Descriptors: Educational Assessment, Validity, Inferences, Construct Validity
Gorin, Joanna S. – Educational Researcher, 2007
Lissitz and Samuelsen (2007) propose a new framework for validity theory and terminology, emphasizing a shift in theory and practice toward issues of test content rather than constructs. The author of this article argues that several of Lissitz and Samuelsen's critiques of validity theory focus on previously considered, but subsequently discarded,…
Descriptors: Test Content, Test Validity, Construct Validity, Test Construction
Kobrin, Jennifer L.; Deng, Hui; Shaw, Emily J. – Journal of Applied Testing Technology, 2007
This study was designed to address two frequent criticisms of the SAT essay--that essay length is the best predictor of scores, and that there is an advantage in using more "sophisticated" examples as opposed to personal experience. The study was based on 2,820 essays from the first three administrations of the new SAT. Each essay was…
Descriptors: Testing Programs, Computer Assisted Testing, Construct Validity, Writing Skills
Sireci, Stephen G. – 1995
The purpose of this paper is to clarify the seemingly discrepant views of test theorists and test developers about terminology related to the evaluation of test content. The origin and evolution of the concept of content validity are traced, and the concept is reformulated in a way that emphasizes the notion that content domain definition,…
Descriptors: Construct Validity, Content Validity, Definitions, Item Analysis

Sireci, Stephen G.; Geisinger, Kurt F. – Applied Psychological Measurement, 1995
An expanded version of the method of content evaluation proposed by S. G. Sireci and K. F. Giesinger (1992) was evaluated with respect to a national licensure examination and a nationally standardized social studies achievement test. Two groups of 15 subject-matter experts rated the similarity and content relevance of the items. (SLD)
Descriptors: Achievement Tests, Cluster Analysis, Construct Validity, Content Validity

Ludlow, Larry H.; Bell, Karen N. – Educational and Psychological Measurement, 1996
Fifty education majors in two sections responded to an Attitudes toward Mathematics and Its Teaching (ATMAT) scale. Results with two psychometric models, classical true-score theory and the one-parameter Rasch model, supported the ATMAT's reliability, content and construct validity, and invariance over three time points. (SLD)
Descriptors: College Students, Construct Validity, Education Majors, Elementary Education
Previous Page | Next Page ยป
Pages: 1 | 2