ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	0
Since 2006 (last 20 years)	1

Descriptor

Comparative Testing	4
Mathematics Tests	4
Test Reliability	4
Item Response Theory	2
Reading Tests	2
Robustness (Statistics)	2
Basic Skills	1
Black Students	1
Classification	1
Compensatory Education	1
Computer Assisted Testing	1
Context Effect	1
Early Childhood Education	1
Educational Diagnosis	1
Elementary Education	1
Elementary School Students	1
Equated Scores	1
Estimation (Mathematics)	1
Evaluation Criteria	1
Grade 3	1
Grade 6	1
Grade 8	1
Item Analysis	1
Item Bias	1
Junior High School Students	1
More ▼

Source

Applied Measurement in…	2
Journal of Educational…	1

Author

Lane, Suzanne	1
Lee, Yoonsun	1
Ryan, Katherine E.	1
Stone, Clement A.	1
Taylor, Catherine S.	1
Terrasi, Salvatore	1

Publication Type

Journal Articles	3
Reports - Research	3
Reports - Evaluative	1

Education Level

Grade 10	1
Grade 4	1
Grade 7	1

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 4 results Save | Export

Stability of Rasch Scales over Time

Peer reviewed

Direct link

Taylor, Catherine S.; Lee, Yoonsun – Applied Measurement in Education, 2010

Item response theory (IRT) methods are generally used to create score scales for large-scale tests. Research has shown that IRT scales are stable across groups and over time. Most studies have focused on items that are dichotomously scored. Now Rasch and other IRT models are used to create scales for tests that include polytomously scored items.…

Descriptors: Measures (Individuals), Item Response Theory, Robustness (Statistics), Item Analysis

Use of Restricted Item Response Theory Models for Examining the Stability of Item Parameter Estimates over Time.

Peer reviewed

Stone, Clement A.; Lane, Suzanne – Applied Measurement in Education, 1991

A model-testing approach for evaluating the stability of item response theory item parameter estimates (IPEs) in a pretest-posttest design is illustrated. Nineteen items from the Head Start Measures Battery were used. A moderately high degree of stability in the IPEs for 5,510 children assessed on 2 occasions was found. (TJH)

Descriptors: Comparative Testing, Compensatory Education, Computer Assisted Testing, Early Childhood Education

Examining the Reliability of a State-Mandated Basic Skills Test for a Sample of Special Needs Students.

Terrasi, Salvatore – 1989

This study examined the consistency of classification for a sample of special needs students on the state-mandated Massachusetts Basic Skills Inventory (BSI). The study sample consisted of 172 special education students (114 males and 58 females) from 15 elementary schools in a large urban school district in Massachusetts, who took the…

Descriptors: Basic Skills, Classification, Comparative Testing, Educational Diagnosis

The Performance of the Mantel-Haenszel Procedure across Samples and Matching Criteria.

Peer reviewed

Ryan, Katherine E. – Journal of Educational Measurement, 1991

The reliability of Mantel-Haenszel (MH) indexes across samples of examinees and sample sizes and their robustness to item context effects were investigated with data for 670 African-American and 5,015 white students from the Second International Mathematics Study. MH procedures can be used to detect differential item functioning. (SLD)

Descriptors: Black Students, Comparative Testing, Context Effect, Evaluation Criteria