Cuhadar, Ismail; Binici, Salih – Educational Measurement: Issues and Practice, 2022
This study employs the 4-parameter logistic item response theory model to account for unexpected incorrect responses, or slipping effects, observed in a large-scale Algebra 1 End-of-Course assessment that includes several innovative item formats. It investigates whether modeling the misfit at the upper asymptote has any practical impact on the…
Descriptors: Item Response Theory, Measurement, Student Evaluation, Algebra
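
For readers unfamiliar with the model named in this abstract, the following is a minimal sketch of the standard 4-parameter logistic item response function, in which an upper asymptote below 1 represents slipping. The function name and the parameter values are illustrative assumptions and are not taken from the study.

```python
import math


def p_correct_4pl(theta, a, b, c, d):
    """Probability of a correct response under the 4PL model.

    theta: examinee ability
    a: item discrimination
    b: item difficulty
    c: lower asymptote (guessing)
    d: upper asymptote (1 - d is the slipping probability)
    """
    return c + (d - c) / (1.0 + math.exp(-a * (theta - b)))


# Hypothetical item parameters, chosen only for illustration.
a, b, c, d = 1.2, 0.0, 0.2, 0.95

for theta in (-3.0, 0.0, 3.0, 6.0):
    print(f"theta={theta:+.1f}  P(correct)={p_correct_4pl(theta, a, b, c, d):.4f}")
```

With d fixed at 1 the expression reduces to the 3-parameter logistic model. With d below 1, even very able examinees keep a nonzero chance of an incorrect response, which is the slipping effect the abstract describes.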
Frey, Andreas; Hartig, Johannes; Rupp, Andre A. – Educational Measurement: Issues and Practice, 2009
In most large-scale assessments of student achievement, several broad content domains are tested. Because more items are needed to cover the content domains than can be presented in the limited testing time to each individual student, multiple test forms or booklets are utilized to distribute the items to the students. The construction of an…
Descriptors: Measures (Individuals), Test Construction, Theory Practice Relationship, Design

Sireci, Stephen G. – Educational Measurement: Issues and Practice, 1997
Different methodologies for linking tests across languages are reviewed and evaluated, focusing on monolingual item response theory, bilingual group designs, and matched monolingual group designs. These methods, although not without weaknesses, promote score comparability better than methods that rely on translation or expert judgment…
Descriptors: Bilingualism, Comparative Analysis, Cross Cultural Studies, Educational Assessment

Zwick, Rebecca – Educational Measurement: Issues and Practice, 1991
Item parameter estimates derived through item response theory methods have been considered relatively robust to changes in item position and context, but the anomaly in reading scores from the 1986 National Assessment of Educational Progress (NAEP) illustrates problems with common population equating procedures when there are test form changes.…
Descriptors: Achievement Tests, Context Effect, Equated Scores, Estimation (Mathematics)