Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 1 |
Since 2006 (last 20 years) | 4 |
Descriptor
Comparative Testing | 8 |
Reading Tests | 8 |
Test Items | 8 |
Mathematics Tests | 4 |
Item Analysis | 3 |
Reading Achievement | 3 |
Difficulty Level | 2 |
Foreign Countries | 2 |
Grade 3 | 2 |
Grade 4 | 2 |
Grade 5 | 2 |
More ▼ |
Source
Applied Measurement in… | 1 |
Curriculum Journal | 1 |
Educational Assessment | 1 |
Educational Measurement:… | 1 |
Educational and Psychological… | 1 |
Author
Davey, Beth | 1 |
Ferdous, Abdullah A. | 1 |
Hoadley, Ursula | 1 |
Hu, P. Gillian | 1 |
Kato, Kentaro | 1 |
Kimmel, Rumena | 1 |
Kulick, Edward | 1 |
Lowenkamp, Lena | 1 |
Macready, George B. | 1 |
Moen, Ross E. | 1 |
Muller, Johan | 1 |
More ▼ |
Publication Type
Journal Articles | 5 |
Reports - Research | 5 |
Reports - Evaluative | 3 |
Speeches/Meeting Papers | 2 |
Numerical/Quantitative Data | 1 |
Tests/Questionnaires | 1 |
Education Level
Elementary Education | 4 |
Elementary Secondary Education | 2 |
Grade 3 | 2 |
Grade 4 | 2 |
Early Childhood Education | 1 |
Grade 5 | 1 |
Grade 8 | 1 |
Intermediate Grades | 1 |
Primary Education | 1 |
Audience
Researchers | 1 |
Location
Alabama | 1 |
Germany | 1 |
South Africa | 1 |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
Alabama High School… | 1 |
Progress in International… | 1 |
SAT (College Admission Test) | 1 |
Trends in International… | 1 |
What Works Clearinghouse Rating
Hoadley, Ursula; Muller, Johan – Curriculum Journal, 2016
Why has large-scale standardised testing attracted such a bad press? Why has pedagogic benefit to be derived from test results been downplayed? The paper investigates this question by first surveying the pros and cons of testing in the literature, and goes on to examine educators' responses to standardised, large-scale tests in a sample of low…
Descriptors: Foreign Countries, Standardized Tests, Developing Nations, Visual Discrimination
Sparfeldt, Jorn R.; Kimmel, Rumena; Lowenkamp, Lena; Steingraber, Antje; Rost, Detlef H. – Educational Assessment, 2012
Multiple-choice (MC) reading comprehension test items comprise three components: text passage, questions about the text, and MC answers. The construct validity of this format has been repeatedly criticized. In three between-subjects experiments, fourth graders (N[subscript 1] = 230, N[subscript 2] = 340, N[subscript 3] = 194) worked on three…
Descriptors: Test Items, Reading Comprehension, Construct Validity, Grade 4
Kato, Kentaro; Moen, Ross E.; Thurlow, Martha L. – Educational Measurement: Issues and Practice, 2009
Large data sets from a state reading assessment for third and fifth graders were analyzed to examine differential item functioning (DIF), differential distractor functioning (DDF), and differential omission frequency (DOF) between students with particular categories of disabilities (speech/language impairments, learning disabilities, and emotional…
Descriptors: Learning Disabilities, Language Impairments, Behavior Disorders, Affective Behavior
Ferdous, Abdullah A.; Plake, Barbara S. – Educational and Psychological Measurement, 2007
In an Angoff standard setting procedure, judges estimate the probability that a hypothetical randomly selected minimally competent candidate will answer correctly each item in the test. In many cases, these item performance estimates are made twice, with information shared with the panelists between estimates. Especially for long tests, this…
Descriptors: Test Items, Probability, Item Analysis, Standard Setting (Scoring)

Davey, Beth; Macready, George B. – Applied Measurement in Education, 1990
The usefulness of latent class modeling in addressing several measurement issues is demonstrated via a study of 74 good and 74 poor readers in grades 5 and 6. Procedures were particularly useful for assessing the hierarchical relation among skills and for exploring issues related to item domains. (SLD)
Descriptors: Comparative Testing, Elementary School Students, Grade 5, Grade 6
Silva, Sharron J. – 1985
Test item selection techniques based on traditional item analysis methods were compared to techniques based on item response theory. The consistency of mastery classifications in criterion referenced reading tests was examined. Pretest and posttest data were available for 945 first and second grade students and for 1796 fourth to sixth grade…
Descriptors: Analysis of Variance, Comparative Testing, Criterion Referenced Tests, Elementary Education
Steele, D. Joyce – 1991
This paper compares descriptive information based on analyses of the pilot and live administrations of the Alabama High School Graduation Examination (AHSGE). The AHSGE, a product of decisions made in 1977 and 1984 by the Alabama State Board of Education, is composed of subject tests in reading, mathematics, and language. The pass score for each…
Descriptors: Comparative Testing, Difficulty Level, Grade 11, Graduation Requirements
Kulick, Edward; Hu, P. Gillian – 1989
The relationship of differential item functioning (DIF) to item difficulty on the Scholastic Aptitude Test (SAT) was examined, based on data from nine recent administrations of the test from June 1986 through December 1987. This pool of information includes item statistics on 765 verbal and 540 mathematical items computed for subgroups of White,…
Descriptors: Asian Americans, Black Students, College Bound Students, College Entrance Examinations