Descriptor
Content Validity | 5 |
Multidimensional Scaling | 5 |
Construct Validity | 4 |
Cluster Analysis | 3 |
Evaluation Methods | 3 |
Licensing Examinations… | 3 |
Test Construction | 3 |
Test Content | 3 |
Test Format | 3 |
Test Items | 3 |
Achievement Tests | 2 |
More ▼ |
Author
Sireci, Stephen G. | 9 |
Bastari, B. | 1 |
Bhola, Dennison | 1 |
Foster, David F. | 1 |
Geisinger, Kurt | 1 |
Geisinger, Kurt F. | 1 |
Harter, James | 1 |
Huff, Kristen L. | 1 |
Olsen, James | 1 |
Robin, Frederic | 1 |
Yang, Yongwei | 1 |
More ▼ |
Publication Type
Speeches/Meeting Papers | 9 |
Reports - Evaluative | 4 |
Reports - Research | 4 |
Journal Articles | 2 |
Reports - Descriptive | 1 |
Education Level
Audience
Location
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Sireci, Stephen G. – 1995
The purpose of this paper is to clarify the seemingly discrepant views of test theorists and test developers about terminology related to the evaluation of test content. The origin and evolution of the concept of content validity are traced, and the concept is reformulated in a way that emphasizes the notion that content domain definition,…
Descriptors: Construct Validity, Content Validity, Definitions, Item Analysis

Sireci, Stephen G.; Geisinger, Kurt F. – Applied Psychological Measurement, 1995
An expanded version of the method of content evaluation proposed by S. G. Sireci and K. F. Giesinger (1992) was evaluated with respect to a national licensure examination and a nationally standardized social studies achievement test. Two groups of 15 subject-matter experts rated the similarity and content relevance of the items. (SLD)
Descriptors: Achievement Tests, Cluster Analysis, Construct Validity, Content Validity
Sireci, Stephen G. – 1998
Multidimensional scaling (MDS) is a versatile technique for understanding the structure of multivariate data. Recent studies have applied MDS to the problem of evaluating content validity. This paper describes the importance of evaluating test content and the logic of using MDS to analyze data gathered from subject matter experts employed in…
Descriptors: Content Validity, Evaluation Methods, Multidimensional Scaling, Research Methodology

Huff, Kristen L.; Sireci, Stephen G. – Educational Measurement: Issues and Practice, 2001
Discusses the potential positive and negative effects computer-based testing could have on validity, reviews the literature on validation perspectives in computer-based testing, and suggests ways to evaluate the contributions of computer-based testing to more valid measurement practices. (SLD)
Descriptors: Adaptive Testing, Computer Assisted Testing, Elementary Secondary Education, Validity
Sireci, Stephen G.; Bastari, B. – 1998
In many cross-cultural research studies, assessment instruments are translated or adapted for use in multiple languages. However, it cannot be assumed that different language versions of an assessment are equivalent across languages. A fundamental issue to be addressed is the comparability or equivalence of the construct measured by each language…
Descriptors: Construct Validity, Cross Cultural Studies, Evaluation Methods, Multidimensional Scaling
Sireci, Stephen G.; Harter, James; Yang, Yongwei; Bhola, Dennison – 2000
Assessing people who operate in different languages necessitates the use of multiple language versions of an assessment. However, different language versions of an assessment are not necessarily equivalent. In this paper, the psychometric properties of different language versions on an international employee attitude survey are evaluated. This…
Descriptors: Analysis of Covariance, Attitude Measures, Attitudes, Construct Validity
Sireci, Stephen G.; And Others – 1990
Although some researchers have argued against use of the term "content validity," the ability of a test item to adequately represent the domain of knowledge tested continues to be an issue of paramount importance in test construction. The present paper reviews previous analyses of test content and proposes a new empirical method for…
Descriptors: Cluster Analysis, Content Analysis, Content Validity, Evaluators
Sireci, Stephen G.; Foster, David F.; Robin, Frederic; Olsen, James – 1997
Evaluating the comparability of a test administered in different languages is a difficult, if not impossible, task. Comparisons are problematic because observed differences in test performance between groups who take different language versions of a test could be due to a difference in difficulty between the tests, to cultural differences in test…
Descriptors: Adaptive Testing, Adults, Certification, Comparative Analysis
Sireci, Stephen G.; Geisinger, Kurt – 1993
Various methods used to assess the content of a test are reviewed, and a new procedure designed to improve on these methods is presented. The two tests considered are a professional licensure examination, the auditing section of the Uniform Certified Public Accountant Examination, and an educational achievement test, a nationally standardized…
Descriptors: Achievement Tests, Certified Public Accountants, Cluster Analysis, Content Analysis