Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 0 |
Since 2006 (last 20 years) | 1 |
Descriptor
Multidimensional Scaling | 13 |
Test Items | 7 |
Content Validity | 6 |
Test Construction | 5 |
Test Format | 5 |
Cluster Analysis | 4 |
Evaluation Methods | 4 |
Licensing Examinations… | 4 |
Test Content | 4 |
Content Analysis | 3 |
Item Bias | 3 |
More ▼ |
Source
Applied Psychological… | 2 |
Educational Assessment | 1 |
Educational and Psychological… | 1 |
International Journal of… | 1 |
Multivariate Behavioral… | 1 |
Author
Sireci, Stephen G. | 13 |
Geisinger, Kurt F. | 2 |
Robin, Frederic | 2 |
Bastari, B. | 1 |
Fitzgerald, Cyndy | 1 |
Geisinger, Kurt | 1 |
Gonzalez, Eugenio J. | 1 |
Hambleton, Ronald K. | 1 |
Khaliq, Shameem Nyla | 1 |
Li, Xueming | 1 |
Meara, Kevin | 1 |
More ▼ |
Publication Type
Speeches/Meeting Papers | 9 |
Journal Articles | 6 |
Reports - Evaluative | 6 |
Reports - Research | 5 |
Reports - Descriptive | 2 |
Numerical/Quantitative Data | 1 |
Education Level
Audience
Location
California | 1 |
Delaware | 1 |
Florida | 1 |
Kentucky | 1 |
Maryland | 1 |
Ohio | 1 |
South Carolina | 1 |
Texas | 1 |
Virginia | 1 |
Washington | 1 |
Laws, Policies, & Programs
Assessments and Surveys
National Assessment of… | 1 |
Trends in International… | 1 |
What Works Clearinghouse Rating
Li, Xueming; Sireci, Stephen G. – Educational and Psychological Measurement, 2013
Validity evidence based on test content is of essential importance in educational testing. One source for such evidence is an alignment study, which helps evaluate the congruence between tested objectives and those specified in the curriculum. However, the results of an alignment study do not always sufficiently capture the degree to which a test…
Descriptors: Content Validity, Multidimensional Scaling, Data Analysis, Educational Testing

Meara, Kevin; Robin, Frederic; Sireci, Stephen G. – Multivariate Behavioral Research, 2000
Investigated the usefulness of multidimensional scaling (MDS) for assessing the dimensionality of dichotomous test data. Focused on two MDS proximity measures, one based on the PC statistic (T. Chen and M. Davidson, 1996) and other, on interitem Euclidean distances. Simulation results show that both MDS procedures correctly identify…
Descriptors: Correlation, Multidimensional Scaling, Simulation, Test Items
Sireci, Stephen G. – 1998
Multidimensional scaling (MDS) is a versatile technique for understanding the structure of multivariate data. Recent studies have applied MDS to the problem of evaluating content validity. This paper describes the importance of evaluating test content and the logic of using MDS to analyze data gathered from subject matter experts employed in…
Descriptors: Content Validity, Evaluation Methods, Multidimensional Scaling, Research Methodology

Robin, Frederic; Sireci, Stephen G.; Hambleton, Ronald K. – International Journal of Testing, 2003
Illustrates how multidimensional scaling (MDS) and differential item functioning (DIF) procedures can be used to evaluate the equivalence of different language versions of an examination. Presents examples of structural differences and DIF across languages. (SLD)
Descriptors: Item Bias, Licensing Examinations (Professions), Multidimensional Scaling, Multilingual Materials

Sireci, Stephen G. – Educational Assessment, 1998
Describes content-validity theory and illustrates new and traditional approaches for conducting content-validity studies. Newer approaches are based on multidimensional scaling analysis of item-similarity ratings, while traditional approaches are based on ratings of item-objective congruence and relevance. (Author/SLD)
Descriptors: Content Validity, Data Analysis, Evaluation Methods, Multidimensional Scaling
Sireci, Stephen G.; Bastari, B. – 1998
In many cross-cultural research studies, assessment instruments are translated or adapted for use in multiple languages. However, it cannot be assumed that different language versions of an assessment are equivalent across languages. A fundamental issue to be addressed is the comparability or equivalence of the construct measured by each language…
Descriptors: Construct Validity, Cross Cultural Studies, Evaluation Methods, Multidimensional Scaling

Sireci, Stephen G.; Geisinger, Kurt F. – Applied Psychological Measurement, 1992
A new method for evaluating the content representation of a test is illustrated. Item similarity ratings were obtained from three content domain experts to assess whether ratings corresponded to item groupings specified in the test blueprint. Multidimensional scaling and cluster analysis provided substantial information about the test's content…
Descriptors: Cluster Analysis, Content Analysis, Multidimensional Scaling, Multiple Choice Tests
Sireci, Stephen G.; Gonzalez, Eugenio J. – 2003
International comparative educational studies make use of test instruments originally developed in English by international panels of experts, but that are ultimately administered in the language of instruction of the students. The comparability of the different language versions of these assessments is a critical issue in validating the…
Descriptors: Academic Achievement, Comparative Analysis, Difficulty Level, International Education
Sireci, Stephen G.; Khaliq, Shameem Nyla – 2002
Many students in the United States who are required to take educational tests are not fully proficient in English. To address this problem, a state-mandated testing program created dual language English-Spanish versions of some of their tests. In this study, the psychometric properties of the English and dual language versions of a fourth-grade…
Descriptors: Item Bias, Language Proficiency, Limited English Speaking, Multidimensional Scaling
Sireci, Stephen G.; And Others – 1990
Although some researchers have argued against use of the term "content validity," the ability of a test item to adequately represent the domain of knowledge tested continues to be an issue of paramount importance in test construction. The present paper reviews previous analyses of test content and proposes a new empirical method for…
Descriptors: Cluster Analysis, Content Analysis, Content Validity, Evaluators

Sireci, Stephen G.; Geisinger, Kurt F. – Applied Psychological Measurement, 1995
An expanded version of the method of content evaluation proposed by S. G. Sireci and K. F. Giesinger (1992) was evaluated with respect to a national licensure examination and a nationally standardized social studies achievement test. Two groups of 15 subject-matter experts rated the similarity and content relevance of the items. (SLD)
Descriptors: Achievement Tests, Cluster Analysis, Construct Validity, Content Validity
Sireci, Stephen G.; Fitzgerald, Cyndy; Xing, Dehui – 1998
Adapting credentialing examinations for international uses involves translating tests for use in multiple languages. This paper explores methods for evaluating construct equivalence and item equivalence across different language versions of a test. These methods were applied to four different language versions (English, French, German, and…
Descriptors: Credentials, Engineers, Factor Analysis, Foreign Countries
Sireci, Stephen G.; Geisinger, Kurt – 1993
Various methods used to assess the content of a test are reviewed, and a new procedure designed to improve on these methods is presented. The two tests considered are a professional licensure examination, the auditing section of the Uniform Certified Public Accountant Examination, and an educational achievement test, a nationally standardized…
Descriptors: Achievement Tests, Certified Public Accountants, Cluster Analysis, Content Analysis