ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	1
Since 2006 (last 20 years)	4

Descriptor

Test Items	5
Testing Programs	5
Item Response Theory	4
Evaluation Methods	2
Item Analysis	2
Mathematics Tests	2
Psychometrics	2
Reading Tests	2
Testing	2
Adaptive Testing	1
Alternative Assessment	1
Computer Assisted Testing	1
Computer Simulation	1
Constructed Response	1
Cutting Scores	1
Data Analysis	1
Data Use	1
Difficulty Level	1
Educational Change	1
Educational Practices	1
Error Patterns	1
Grade 11	1
Grade 3	1
Grade 4	1
Grade 5	1

Source

Applied Measurement in…

Author

Albano, Anthony D.	1
Brian F. French	1
Meyers, Jason L.	1
Miller, G. Edward	1
Pomplun, Mark	1
Puhan, Gautam	1
Sundbye, Nita	1
Thao Thu Vo	1
Tony Albano	1
Way, Walter D.	1
Wyse, Adam E.	1

Publication Type

Journal Articles	5
Reports - Research	3
Reports - Evaluative	2

Education Level

Secondary Education	2
Early Childhood Education	1
Elementary Education	1
Grade 11	1
Grade 3	1
Grade 4	1
Grade 5	1
Grade 6	1
Grade 7	1
High Schools	1
Intermediate Grades	1
Junior High Schools	1
Middle Schools	1
Primary Education	1

Showing all 5 results

Traditional vs Intersectional DIF Analysis: Considerations and a Comparison Using State Testing Data

Peer reviewed

Tony Albano; Brian F. French; Thao Thu Vo – Applied Measurement in Education, 2024

Recent research has demonstrated an intersectional approach to the study of differential item functioning (DIF). This approach expands DIF to account for the interactions between what have traditionally been treated as separate grouping variables. In this paper, we compare traditional and intersectional DIF analyses using data from a state testing…

Descriptors: Test Items, Item Analysis, Data Use, Standardized Tests

Considering the Use of General and Modified Assessment Items in Computerized Adaptive Testing

Peer reviewed

Wyse, Adam E.; Albano, Anthony D. – Applied Measurement in Education, 2015

This article used several data sets from a large-scale state testing program to examine the feasibility of combining general and modified assessment items in computerized adaptive testing (CAT) for different groups of students. Results suggested that several of the assumptions made when employing this type of mixed-item CAT may not be met for…

Descriptors: Adaptive Testing, Computer Assisted Testing, Test Items, Testing Programs

Item Position and Item Difficulty Change in an IRT-Based Common Item Equating Design

Peer reviewed

Meyers, Jason L.; Miller, G. Edward; Way, Walter D. – Applied Measurement in Education, 2009

In operational testing programs using item response theory (IRT), item parameter invariance is threatened when an item appears in a different location on the live test than it did when it was field tested. This study utilizes data from a large state's assessments to model change in Rasch item difficulty (RID) as a function of item position change,…

Descriptors: Test Items, Test Content, Testing Programs, Simulation

Detecting and Correcting Scale Drift in Test Equating: An Illustration from a Large Scale Testing Program

Peer reviewed

Puhan, Gautam – Applied Measurement in Education, 2009

The purpose of this study is to determine the extent of scale drift on a test that employs cut scores. It was essential to examine scale drift for this testing program because new forms in this testing program are often put on scale through a series of intermediate equatings (known as equating chains). This process may cause equating error to…

Descriptors: Testing Programs, Testing, Measurement Techniques, Item Response Theory

Gender Differences in Constructed Response Reading Items.

Peer reviewed

Pomplun, Mark; Sundbye, Nita – Applied Measurement in Education, 1999

Gender differences in answers to constructed-response reading items from a state assessment program were studied with four raters rating approximately 500 papers at two grade levels. Results indicate that number of words written and number of unrelated responses show significant gender differences and are related to holistic scores. (SLD)

Descriptors: Constructed Response, Holistic Evaluation, Reading Tests, Secondary Education