ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	4
Since 2006 (last 20 years)	12

Descriptor

Construct Validity	18
Test Content	18
Test Construction	11
Test Items	8
Test Validity	7
Content Validity	5
Scores	5
Test Reliability	5
Testing	5
Foreign Countries	4
Scoring	4
Student Attitudes	4
Test Theory	4
College Students	3
Correlation	3
Definitions	3
English (Second Language)	3
Item Analysis	3
Psychometrics	3
Statistical Analysis	3
Student Evaluation	3
Achievement Tests	2
College Entrance Examinations	2
Educational Testing	2
Evaluation Research	2
More ▼

Source

Review of Research in…	2
AERA Online Paper Repository	1
Applied Psychological…	1
Career Development Quarterly	1
Educational Researcher	1
Educational and Psychological…	1
Journal of Applied Testing…	1
Journal of Astronomy & Earth…	1
Journal of Psychoeducational…	1
Language Education &…	1
Language Testing	1
Learning Environments Research	1
ProQuest LLC	1
Sociological Methods &…	1
More ▼

Publication Type

Journal Articles	13
Reports - Research	12
Speeches/Meeting Papers	5
Reports - Evaluative	3
Tests/Questionnaires	3
Dissertations/Theses -…	1
Information Analyses	1
Opinion Papers	1
Reports - Descriptive	1

Education Level

Higher Education	5
Postsecondary Education	4
Elementary Secondary Education	2
High Schools	2
Secondary Education	2

Audience

Location

Japan	1
New York (Buffalo)	1
South Korea	1
Taiwan	1
United Kingdom	1
United States	1

Laws, Policies, & Programs

Individuals with Disabilities…	1
No Child Left Behind Act 2001	1

Assessments and Surveys

SAT (College Admission Test)	2
General Social Survey	1
Raven Progressive Matrices	1
Test of English as a Foreign…	1
Wechsler Adult Intelligence…	1
Wechsler Intelligence Scale…	1
Work Keys (ACT)	1

What Works Clearinghouse Rating

Showing 1 to 15 of 18 results Save | Export

Updating a Time-Series of Survey Questions: The Case of Abortion Attitudes in the General Social Survey

Peer reviewed

Direct link

Sarah K. Cowan; Michael Hout; Stuart Perrett – Sociological Methods & Research, 2024

Long-running surveys need a systematic way to reflect social change and to keep items relevant to respondents, especially when they ask about controversial subjects, or they threaten the items' validity. We propose a protocol for updating measures that preserves content and construct validity. First, substantive experts articulate the current and…

Descriptors: Surveys, Public Opinion, Social Attitudes, Pregnancy

Flaws in Flynn Effect Research with the Wechsler Scales

Peer reviewed

Direct link

Weiss, Lawrence G.; Gregoire, Jacques; Zhu, Jianjun – Journal of Psychoeducational Assessment, 2016

Many Flynn effect (FE) studies compare scores across different editions of Wechsler's IQ tests. When construct changes are introduced by the test developers in the new edition, however, the presumed generational effects are difficult to untangle from changes due to test content. To remove this confound, we use the same edition of Wechsler…

Descriptors: Generational Differences, Intelligence Tests, Comparative Analysis, Scores

The Development and Validation of the Test Of Astronomy STandards (TOAST)

Peer reviewed
PDF on ERIC

Download full text

Slater, Stephanie J. – Journal of Astronomy & Earth Sciences Education, 2014

The Test Of Astronomy STandards (TOAST) is a comprehensive assessment instrument designed to measure students' general astronomy content knowledge. Built upon the research embedded within a generation of astronomy assessments designed to measure single concepts, the TOAST is appropriate to measure across an entire astronomy course. The TOAST's…

Descriptors: Astronomy, Academic Standards, Science Tests, Test Content

Continual Improvement of a Student Evaluation of Teaching over Seven Semesters at a State University

Peer reviewed

Direct link

Rates, Christopher; Liu, Xiufeng; Vanzile-Tamzen, Carol; Morreale, Cathleen – AERA Online Paper Repository, 2017

In the fall of 2014, the University at Buffalo created a new universal Student Evaluation of Teaching (SET). The purpose of the present study was to establish the construct validity of SET items. Rasch analyses of data from 7 semesters (N=203,194 students) revealed problems with item fit indices and threshold distances. Changes to items and…

Descriptors: Student Evaluation of Teacher Performance, State Universities, College Students, Teacher Effectiveness

Development of an Instrument for Assessing Senior High School Students' Preferred and Perceived Laboratory Classroom Environment

Peer reviewed

Direct link

Hsiao, Chien-Hua; Wu, Ying-Tien; Lin, Chung-Yen; Wong, Terrence William; Fu, Hsieh-Hai; Yeh, Ting-Kuang; Chang, Chung-Yen – Learning Environments Research, 2014

This study aimed to develop an instrument, named the inquiry-based laboratory classroom environment instrument (ILEI), for assessing senior high-school science students' preferred and perceived laboratory environment. A total of 262 second-year students, from a senior-high school in Taiwan, were recruited for this study. Four stages were included…

Descriptors: Test Construction, Science Laboratories, Inquiry, Science Instruction

The Underlying Structure of Foreign Language Anxiety in Integrated Speaking Assessment: A Mixed Methods Study

Peer reviewed
PDF on ERIC

Download full text

Lee, Kwangmin; Ye, Yafei – Language Education & Assessment, 2021

The aim of this mixed methods study is to identify the underlying structure of the construct of Foreign Language Anxiety in integrated listening-to-speak tasks. First, the analysis of the qualitative interviews with six postsecondary ESL learners reveals that anxiety for integrated speaking stems from four different factors: "listening,"…

Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, Factor Analysis

Adaptations and Access to Assessment of Common Core Content

Peer reviewed

Direct link

Kettler, Ryan J. – Review of Research in Education, 2015

This chapter introduces theory that undergirds the role of testing adaptations in assessment, provides examples of item modifications and testing accommodations, reviews research relevant to each, and introduces a new paradigm that incorporates opportunity to learn (OTL), academic enablers, testing adaptations, and inferences that can be made from…

Descriptors: Meta Analysis, Literature Reviews, Testing, Testing Accommodations

Systematic Criterion-Referenced Test Development in an English-Language Program

Direct link

Kumazawa, Takaaki – ProQuest LLC, 2011

Although classroom assessment is one of the most frequent practices carried out by teachers in all educational programs, limited research has been conducted to investigate the dependability and validity of criterion-referenced tests (CRTs). The main purpose of this study is to develop a criterion-referenced test for first-year Japanese university…

Descriptors: Criterion Referenced Tests, Test Construction, Test Validity, English (Second Language)

The Development and Initial Psychometric Evaluation of the Korean Career Stress Inventory for College Students

Peer reviewed

Direct link

Choi, Bo Young; Park, Heerak; Nam, Suk Kyung; Lee, Jayoung; Cho, Daeyeon; Lee, Sang Min – Career Development Quarterly, 2011

The purpose of this study was to develop a Korean College Stress Inventory (KCSI), which is designed to measure Korean college students' experiences and symptoms of career stress. Even though there have been numerous scales related to career issues, few scales measure the career stress construct and its dimensions. Factor structure, internal…

Descriptors: College Students, Factor Structure, Psychometrics, Stress Variables

What Counts as Evidence of Educational Achievement? The Role of Constructs in the Pursuit of Equity in Assessment

Peer reviewed

Direct link

Wiliam, Dylan – Review of Research in Education, 2010

The idea that validity should be considered a property of inferences, rather than of assessments, has developed slowly over the past century. In early writings about the validity of educational assessments, validity was defined as a property of an assessment. The most common definition was that an assessment was valid to the extent that it…

Descriptors: Educational Assessment, Validity, Inferences, Construct Validity

Reconsidering Issues in Validity Theory

Peer reviewed

Direct link

Gorin, Joanna S. – Educational Researcher, 2007

Lissitz and Samuelsen (2007) propose a new framework for validity theory and terminology, emphasizing a shift in theory and practice toward issues of test content rather than constructs. The author of this article argues that several of Lissitz and Samuelsen's critiques of validity theory focus on previously considered, but subsequently discarded,…

Descriptors: Test Content, Test Validity, Construct Validity, Test Construction

Does Quantity Equal Quality?: The Relationship between Length of Response and Scores on the SAT Essay

Peer reviewed

Direct link

Kobrin, Jennifer L.; Deng, Hui; Shaw, Emily J. – Journal of Applied Testing Technology, 2007

This study was designed to address two frequent criticisms of the SAT essay--that essay length is the best predictor of scores, and that there is an advantage in using more "sophisticated" examples as opposed to personal experience. The study was based on 2,820 essays from the first three administrations of the new SAT. Each essay was…

Descriptors: Testing Programs, Computer Assisted Testing, Construct Validity, Writing Skills

The Central Role of Content Representation in Test Validity.

Download full text

Sireci, Stephen G. – 1995

The purpose of this paper is to clarify the seemingly discrepant views of test theorists and test developers about terminology related to the evaluation of test content. The origin and evolution of the concept of content validity are traced, and the concept is reformulated in a way that emphasizes the notion that content domain definition,…

Descriptors: Construct Validity, Content Validity, Definitions, Item Analysis

Using Subject-Matter Experts to Assess Content Representation: An MDS Analysis.

Peer reviewed

Sireci, Stephen G.; Geisinger, Kurt F. – Applied Psychological Measurement, 1995

An expanded version of the method of content evaluation proposed by S. G. Sireci and K. F. Giesinger (1992) was evaluated with respect to a national licensure examination and a nationally standardized social studies achievement test. Two groups of 15 subject-matter experts rated the similarity and content relevance of the items. (SLD)

Descriptors: Achievement Tests, Cluster Analysis, Construct Validity, Content Validity

Psychometric Characteristics of the Attitudes toward Mathematics and Its Teaching (ATMAT) Scale.

Peer reviewed

Ludlow, Larry H.; Bell, Karen N. – Educational and Psychological Measurement, 1996

Fifty education majors in two sections responded to an Attitudes toward Mathematics and Its Teaching (ATMAT) scale. Results with two psychometric models, classical true-score theory and the one-parameter Rasch model, supported the ATMAT's reliability, content and construct validity, and invariance over three time points. (SLD)

Descriptors: College Students, Construct Validity, Education Majors, Elementary Education

Previous Page | Next Page »

Pages: 1 | 2

Sireci, Stephen G.	2
Bell, Karen N.	1
Chang, Chung-Yen	1
Cho, Daeyeon	1
Choi, Bo Young	1
Deng, Hui	1
Freedle, Roy	1
Fu, Hsieh-Hai	1
Geisinger, Kurt F.	1
Gorin, Joanna S.	1
Gregoire, Jacques	1
Hater, John J.	1
Hsiao, Chien-Hua	1
Kettler, Ryan J.	1
Kobrin, Jennifer L.	1
Kostin, Irene	1
Kumazawa, Takaaki	1
Lee, Jayoung	1
Lee, Kwangmin	1
Lee, Sang Min	1
Lin, Chung-Yen	1
Liu, Xiufeng	1
Ludlow, Larry H.	1
Michael Hout	1
Morreale, Cathleen	1
More ▼