NotesFAQContact Us
Collection
Advanced
Search Tips
Showing all 7 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Yiling Cheng; I-Chien Chen; Barbara Schneider; Mark Reckase; Joseph Krajcik – Applied Measurement in Education, 2024
The current study expands on previous research on gender differences and similarities in science test scores. Using three different approaches -- differential item functioning, differential distractor functioning, and decision tree analysis -- we examine a high school science assessment administered to 3,849 10th-12th graders, of whom 2,021 are…
Descriptors: Gender Differences, Science Achievement, Responses, Testing
Peer reviewed Peer reviewed
Direct linkDirect link
Abbakumov, Dmitry; Desmet, Piet; Van den Noortgate, Wim – Applied Measurement in Education, 2020
Formative assessments are an important component of massive open online courses (MOOCs), online courses with open access and unlimited student participation. Accurate conclusions on students' proficiency via formative, however, face several challenges: (a) students are typically allowed to make several attempts; and (b) student performance might…
Descriptors: Item Response Theory, Formative Evaluation, Online Courses, Response Style (Tests)
Peer reviewed Peer reviewed
Direct linkDirect link
Miller, G. Edward; Yoes, Michael E.; Twing, Jon S. – Applied Measurement in Education, 2004
Two models are presented in this article for estimating the proportion of students who would pass all of three or more content area tests given that none have actually been tested in more than two of the content areas. The first model allows one to estimate the proportion of students who would pass all of three or more content area tests from the…
Descriptors: Scores, Standardized Tests, Student Evaluation, Testing Programs
Peer reviewed Peer reviewed
Direct linkDirect link
Goodman, Dean P.; Hambleton, Ronald K. – Applied Measurement in Education, 2004
A critical, but often neglected, component of any large-scale assessment program is the reporting of test results. In the past decade, a body of evidence has been compiled that raises concerns over the ways in which these results are reported to and understood by their intended audiences. In this study, current approaches for reporting…
Descriptors: Test Results, Student Evaluation, Scores, Testing Programs
Peer reviewed Peer reviewed
Direct linkDirect link
Ferrara, Steve; Johnson, Eugene; Chen, Wen-Hung – Applied Measurement in Education, 2005
Psychometricians continue to develop and evaluate methods for linking test scores, both horizontally and vertically. This article describes a social moderation process for articulating (i.e., linking) performance standards across grade levels for an operational state assessment program. The researchers used generated data to evaluate the likely…
Descriptors: Grade 2, Grade 3, Scores, Error of Measurement
Peer reviewed Peer reviewed
Gabrielson, Stephen; And Others – Applied Measurement in Education, 1995
The effects of presenting a choice of writing tasks on the quality of essays produced by eleventh graders were studied with 34,200 students in Georgia. The choice condition had no substantive effect on the quality of essays, but race, gender, and the writing task variable did. (SLD)
Descriptors: Essay Tests, Grade 11, High School Students, High Schools
Peer reviewed Peer reviewed
Aschbacher, Pamela R. – Applied Measurement in Education, 1991
The University of California's (Los Angeles) Center for Research on Evaluation, Standards, and Student Testing survey of state assessment directors reveals that about 25 states currently study or develop performance assessments. Obstacles to statewide use of performance assessments were expressed. The new Student Assessment Exchange should…
Descriptors: Accountability, Cost Effectiveness, Educational Assessment, Educational Improvement