Lewis, Jennifer; Lim, Hwanggyu; Padellaro, Frank; Sireci, Stephen G.; Zenisky, April L. – Educational Measurement: Issues and Practice, 2022
Setting cut scores on multistage adaptive tests (MSTs) is difficult, particularly when the test spans several grade levels, and the selection of items from MST panels must reflect the operational test specifications. In this study, we describe, illustrate, and evaluate three methods for mapping panelists' Angoff ratings into cut scores on the scale underlying an MST. The…
Descriptors: Cutting Scores, Adaptive Testing, Test Items, Item Analysis
Noble, Tracy; Sireci, Stephen G.; Wells, Craig S.; Kachchaf, Rachel R.; Rosebery, Ann S.; Wang, Yang Caroline – American Educational Research Journal, 2020
In this experimental study, 20 multiple-choice test items from the Massachusetts Grade 5 science test were linguistically simplified, and original and simplified test items were administered to 310 English learners (ELs) and 1,580 non-ELs in four Massachusetts school districts. This study tested the hypothesis that specific linguistic features of…
Descriptors: Science Tests, Language Usage, English Language Learners, School Districts
Gökçe, Semirhan; Berberoglu, Giray; Wells, Craig S.; Sireci, Stephen G. – Journal of Psychoeducational Assessment, 2021
The 2015 Trends in International Mathematics and Science Study (TIMSS) involved 57 countries and 43 different languages to assess students' achievement in mathematics and science. The purpose of this study is to evaluate whether items and test scores are affected as the differences between language families and cultures increase. Using…
Descriptors: Language Classification, Elementary Secondary Education, Mathematics Achievement, Mathematics Tests
Sireci, Stephen G.; Rios, Joseph A. – Educational Research and Evaluation, 2013
There are numerous statistical procedures for detecting items that function differently across subgroups of examinees that take a test or survey. However, in endeavouring to detect items that may function differentially, selection of the statistical method is only one of many important decisions. In this article, we discuss the important decisions…
Descriptors: Effect Size, Test Bias, Item Analysis, Statistical Analysis
Sireci, Stephen G.; Faulkner-Bond, Molly – Review of Research in Education, 2015
Across the globe, educational tests are being used at a rapidly increasing rate. More recently, educational tests are being used to inform educational policy and for holding educators accountable for student learning. One reason educational assessments are used for these important purposes is that they are considered to provide reliable and…
Descriptors: English Language Learners, Accountability, Educational Testing, Student Evaluation
Sireci, Stephen G. – Educational Researcher, 2007
Lissitz and Samuelsen (2007) propose a new framework for conceptualizing test validity that separates analysis of test properties from analysis of the construct measured. In response, the author of this article reviews fundamental characteristics of test validity, drawing largely from seminal writings as well as from the accepted standards. He…
Descriptors: Test Content, Test Validity, Guidelines, Test Items
Sireci, Stephen G. – 1995
The purpose of this paper is to clarify the seemingly discrepant views of test theorists and test developers about terminology related to the evaluation of test content. The origin and evolution of the concept of content validity are traced, and the concept is reformulated in a way that emphasizes the notion that content domain definition,…
Descriptors: Construct Validity, Content Validity, Definitions, Item Analysis
Sireci, Stephen G.; Geisinger, Kurt – 1993
Various methods used to assess the content of a test are reviewed, and a new procedure designed to improve on these methods is presented. The two tests considered are a professional licensure examination, the auditing section of the Uniform Certified Public Accountant Examination, and an educational achievement test, a nationally standardized…
Descriptors: Achievement Tests, Certified Public Accountants, Cluster Analysis, Content Analysis