Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 1 |
Since 2006 (last 20 years) | 6 |
Descriptor
Evaluation Methods | 5 |
Content Validity | 4 |
Multidimensional Scaling | 4 |
Test Items | 4 |
Test Validity | 4 |
Validity | 4 |
Achievement Tests | 3 |
Educational Testing | 3 |
Psychological Testing | 3 |
Psychometrics | 3 |
Scores | 3 |
More ▼ |
Source
Author
Sireci, Stephen G. | 13 |
Bastari, B. | 1 |
Faulkner-Bond, Molly | 1 |
Geisinger, Kurt | 1 |
Geisinger, Kurt F. | 1 |
Hambleton, Ronald K. | 1 |
Han, Kyung T. | 1 |
Hauger, Jeffrey B. | 1 |
Huff, Kristen L. | 1 |
Martone, Andrea | 1 |
O'Neil, Timothy | 1 |
More ▼ |
Publication Type
Reports - Evaluative | 13 |
Journal Articles | 8 |
Speeches/Meeting Papers | 4 |
Information Analyses | 2 |
Education Level
Elementary Secondary Education | 2 |
Adult Education | 1 |
Grade 10 | 1 |
Grade 8 | 1 |
Audience
Location
Iran | 1 |
Singapore | 1 |
United States | 1 |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
Medical College Admission Test | 1 |
What Works Clearinghouse Rating
Sireci, Stephen G. – Assessment in Education: Principles, Policy & Practice, 2016
A misconception exists that validity may refer only to the "interpretation" of test scores and not to the "uses" of those scores. The development and evolution of validity theory illustrate test score interpretation was a primary focus in the earliest days of modern testing, and that validating interpretations derived from test…
Descriptors: Test Validity, Misconceptions, Evaluation Utilization, Data Interpretation
Sireci, Stephen G.; Faulkner-Bond, Molly – Review of Research in Education, 2015
Across the globe, educational tests are being used at a rapidly increasing rate. More recently, educational tests are being used to inform educational policy and for holding educators accountable for student learning. One reason educational assessments are used for these important purposes is that they are considered to provide reliable and…
Descriptors: English Language Learners, Accountability, Educational Testing, Student Evaluation
Hauger, Jeffrey B.; Sireci, Stephen G. – International Journal of Testing, 2008
In this study, we examined the presence of differential item functioning (DIF) among groups of students who were tested in their native language or in a different language when participating in the 1999 Trends in International Mathematics and Science Study. Data from 18,837 examinees from three countries (Singapore, United States, and Iran) were…
Descriptors: Test Bias, Language Dominance, Second Languages, Language Proficiency
Martone, Andrea; Sireci, Stephen G. – Review of Educational Research, 2009
The authors (a) discuss the importance of alignment for facilitating proper assessment and instruction, (b) describe the three most common methods for evaluating the alignment between state content standards and assessments, (c) discuss the relative strengths and limitations of these methods, and (d) discuss examples of applications of each…
Descriptors: Teaching Methods, Alignment (Education), Student Evaluation, Curriculum Development
Sireci, Stephen G.; Han, Kyung T.; Wells, Craig S. – Educational Assessment, 2008
In the United States, when English language learners (ELLs) are tested, they are usually tested in English and their limited English proficiency is a potential cause of construct-irrelevant variance. When such irrelevancies affect test scores, inaccurate interpretations of ELLs' knowledge, skills, and abilities may occur. In this article, we…
Descriptors: Test Use, Educational Assessment, Psychological Testing, Validity

Sireci, Stephen G.; Geisinger, Kurt F. – Applied Psychological Measurement, 1995
An expanded version of the method of content evaluation proposed by S. G. Sireci and K. F. Giesinger (1992) was evaluated with respect to a national licensure examination and a nationally standardized social studies achievement test. Two groups of 15 subject-matter experts rated the similarity and content relevance of the items. (SLD)
Descriptors: Achievement Tests, Cluster Analysis, Construct Validity, Content Validity
Sireci, Stephen G. – 1998
Multidimensional scaling (MDS) is a versatile technique for understanding the structure of multivariate data. Recent studies have applied MDS to the problem of evaluating content validity. This paper describes the importance of evaluating test content and the logic of using MDS to analyze data gathered from subject matter experts employed in…
Descriptors: Content Validity, Evaluation Methods, Multidimensional Scaling, Research Methodology
Sireci, Stephen G.; Parker, Polly – Educational Measurement: Issues and Practice, 2006
The psychometric literature is replete with comprehensive discussions of test validity, test validation, and the characteristics of quality assessment programs. The most authoritative source for guidance regarding sound test development and evaluation practices is the Standards for Educational and Psychological Testing. However, the Standards are…
Descriptors: Psychometrics, Test Validity, Educational Testing, Psychological Testing
Sireci, Stephen G.; Bastari, B. – 1998
In many cross-cultural research studies, assessment instruments are translated or adapted for use in multiple languages. However, it cannot be assumed that different language versions of an assessment are equivalent across languages. A fundamental issue to be addressed is the comparability or equivalence of the construct measured by each language…
Descriptors: Construct Validity, Cross Cultural Studies, Evaluation Methods, Multidimensional Scaling
Zenisky, April L.; Hambleton, Ronald K.; Sireci, Stephen G. – 2001
Measurement specialists routinely assume examinee responses to test items are independent of one another. However, previous research has shown that many contemporary tests contain item dependencies and not accounting for these dependencies leads to misleading estimates of item, test, and ability parameters. In this study, methods for detecting…
Descriptors: Ability, College Applicants, College Entrance Examinations, Higher Education
Sireci, Stephen G. – National Assessment Governing Board, 2004
The National Assessment of Educational Progress (NAEP) seeks to include all students in the United States in the sampling frame from which students are selected to participate in the assessment. However, some students with disabilities (SWD) are either unable to take NAEP tests under standard testing conditions or are unable to perform at their…
Descriptors: Testing Accommodations, Validity, National Competency Tests, Reading Tests
O'Neil, Timothy; Sireci, Stephen G.; Huff, Kristen L. – Educational Assessment, 2004
Educational tests used for accountability purposes must represent the content domains they purport to measure. When such tests are used to monitor progress over time, the consistency of the test content across years is important for ensuring that observed changes in test scores are due to student achievement rather than to changes in what the test…
Descriptors: Test Items, Cognitive Ability, Test Content, Science Teachers
Sireci, Stephen G.; Geisinger, Kurt – 1993
Various methods used to assess the content of a test are reviewed, and a new procedure designed to improve on these methods is presented. The two tests considered are a professional licensure examination, the auditing section of the Uniform Certified Public Accountant Examination, and an educational achievement test, a nationally standardized…
Descriptors: Achievement Tests, Certified Public Accountants, Cluster Analysis, Content Analysis