ERIC - Search Results

Showing all 4 results

Modeling Slipping Effects in a Large-Scale Assessment with Innovative Item Formats

Peer reviewed

Cuhadar, Ismail; Binici, Salih – Educational Measurement: Issues and Practice, 2022

This study employs the 4-parameter logistic item response theory model to account for the unexpected incorrect responses or slipping effects observed in a large-scale Algebra 1 End-of-Course assessment, including several innovative item formats. It investigates whether modeling the misfit at the upper asymptote has any practical impact on the…

Descriptors: Item Response Theory, Measurement, Student Evaluation, Algebra

An NCME Instructional Module on Booklet Designs in Large-Scale Assessments of Student Achievement: Theory and Practice

Peer reviewed

Frey, Andreas; Hartig, Johannes; Rupp, Andre A. – Educational Measurement: Issues and Practice, 2009

In most large-scale assessments of student achievement, several broad content domains are tested. Because more items are needed to cover the content domains than can be presented in the limited testing time to each individual student, multiple test forms or booklets are utilized to distribute the items to the students. The construction of an…

Descriptors: Measures (Individuals), Test Construction, Theory Practice Relationship, Design

Problems and Issues in Linking Assessments across Languages.

Peer reviewed

Sireci, Stephen G. – Educational Measurement: Issues and Practice, 1997

Different methodologies for linking tests across languages are reviewed and evaluated, focusing on monolingual item response theory, bilingual group designs, and matched monolingual group designs. These methods, although not without weaknesses, are better at promoting score comparability than methods that rely on translation or expert judgment…

Descriptors: Bilingualism, Comparative Analysis, Cross Cultural Studies, Educational Assessment

Effects of Item Order and Context on Estimation of NAEP Reading Proficiency.

Peer reviewed

Zwick, Rebecca – Educational Measurement: Issues and Practice, 1991

Item parameter estimates derived through item response theory methods have been considered relatively robust to changes in item position and context, but the anomaly in reading scores from the 1986 National Assessment of Educational Progress (NAEP) illustrates problems with common population equating procedures when there are test form changes.…

Descriptors: Achievement Tests, Context Effect, Equated Scores, Estimation (Mathematics)