ERIC - Search Results

Showing all 8 results

Visibility and Differentiation: Systemic Testing in a Developing Country Context

Peer reviewed

Hoadley, Ursula; Muller, Johan – Curriculum Journal, 2016

Why has large-scale standardised testing attracted such a bad press? Why has the pedagogic benefit to be derived from test results been downplayed? The paper investigates this question by first surveying the pros and cons of testing in the literature, and goes on to examine educators' responses to standardised, large-scale tests in a sample of low…

Descriptors: Foreign Countries, Standardized Tests, Developing Nations, Visual Discrimination

Not Read, but Nevertheless Solved? Three Experiments on PIRLS Multiple Choice Reading Comprehension Test Items

Peer reviewed

Sparfeldt, Jorn R.; Kimmel, Rumena; Lowenkamp, Lena; Steingraber, Antje; Rost, Detlef H. – Educational Assessment, 2012

Multiple-choice (MC) reading comprehension test items comprise three components: text passage, questions about the text, and MC answers. The construct validity of this format has been repeatedly criticized. In three between-subjects experiments, fourth graders (N₁ = 230, N₂ = 340, N₃ = 194) worked on three…

Descriptors: Test Items, Reading Comprehension, Construct Validity, Grade 4

Differentials of a State Reading Assessment: Item Functioning, Distractor Functioning, and Omission Frequency for Disability Categories

Peer reviewed

Kato, Kentaro; Moen, Ross E.; Thurlow, Martha L. – Educational Measurement: Issues and Practice, 2009

Large data sets from a state reading assessment for third and fifth graders were analyzed to examine differential item functioning (DIF), differential distractor functioning (DDF), and differential omission frequency (DOF) between students with particular categories of disabilities (speech/language impairments, learning disabilities, and emotional…

Descriptors: Learning Disabilities, Language Impairments, Behavior Disorders, Affective Behavior

Item Selection Strategy for Reducing the Number of Items Rated in an Angoff Standard Setting Study

Peer reviewed

Ferdous, Abdullah A.; Plake, Barbara S. – Educational and Psychological Measurement, 2007

In an Angoff standard setting procedure, judges estimate the probability that a hypothetical, randomly selected, minimally competent candidate will answer each item in the test correctly. In many cases, these item performance estimates are made twice, with information shared with the panelists between estimates. Especially for long tests, this…

Descriptors: Test Items, Probability, Item Analysis, Standard Setting (Scoring)

Applications of Latent Class Modeling to Investigate the Structure Underlying Reading Comprehension Items.

Peer reviewed

Davey, Beth; Macready, George B. – Applied Measurement in Education, 1990

The usefulness of latent class modeling in addressing several measurement issues is demonstrated via a study of 74 good and 74 poor readers in grades 5 and 6. Procedures were particularly useful for assessing the hierarchical relation among skills and for exploring issues related to item domains. (SLD)

Descriptors: Comparative Testing, Elementary School Students, Grade 5, Grade 6

A Comparison of Traditional Approaches and Item Response Approaches to the Problem of Item Selection for Criterion-Referenced Measurement.

Silva, Sharron J. – 1985

Test item selection techniques based on traditional item analysis methods were compared to techniques based on item response theory. The consistency of mastery classifications in criterion referenced reading tests was examined. Pretest and posttest data were available for 945 first and second grade students and for 1796 fourth to sixth grade…

Descriptors: Analysis of Variance, Comparative Testing, Criterion Referenced Tests, Elementary Education

A Descriptive Comparison of Test Item Statistics from Items Utilized in an Item Pilot, a Form Pilot, and Live Administrations of the Alabama High School Graduation Examination: The 1991 Update.

Steele, D. Joyce – 1991

This paper compares descriptive information based on analyses of the pilot and live administrations of the Alabama High School Graduation Examination (AHSGE). The AHSGE, a product of decisions made in 1977 and 1984 by the Alabama State Board of Education, is composed of subject tests in reading, mathematics, and language. The pass score for each…

Descriptors: Comparative Testing, Difficulty Level, Grade 11, Graduation Requirements

Examining the Relationship between Differential Item Functioning and Item Difficulty. College Board Report No. 89-5.

Kulick, Edward; Hu, P. Gillian – 1989

The relationship of differential item functioning (DIF) to item difficulty on the Scholastic Aptitude Test (SAT) was examined, based on data from nine recent administrations of the test from June 1986 through December 1987. This pool of information includes item statistics on 765 verbal and 540 mathematical items computed for subgroups of White,…

Descriptors: Asian Americans, Black Students, College Bound Students, College Entrance Examinations