NotesFAQContact Us
Collection
Advanced
Search Tips
Showing 1 to 15 of 20 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Myers, Aaron J.; Ames, Allison J.; Leventhal, Brian C.; Holzman, Madison A. – Applied Measurement in Education, 2020
When rating performance assessments, raters may ascribe different scores for the same performance when rubric application does not align with the intended application of the scoring criteria. Given performance assessment score interpretation assumes raters apply rubrics as rubric developers intended, misalignment between raters' scoring processes…
Descriptors: Scoring Rubrics, Validity, Item Response Theory, Interrater Reliability
Peer reviewed Peer reviewed
Direct linkDirect link
Gotwals, Amelia Wenk – Applied Measurement in Education, 2018
In this commentary, I consider the three empirical studies in this special issue based on two main aspects: (a) the nature of the learning progressions and (b) what formative assessment practice(s) were investigated. Specifically, I describe differences among the learning progressions in terms of scope and grain size. I also identify three…
Descriptors: Skill Development, Behavioral Objectives, Formative Evaluation, Evaluation Methods
Peer reviewed Peer reviewed
Direct linkDirect link
Alonzo, Alicia C. – Applied Measurement in Education, 2018
Learning progressions--particularly as defined and operationalized in science education--have significant potential to inform teachers' formative assessment practices. In this overview article, I lay out an argument for this potential, starting from definitions for "formative assessment practices" and "learning progressions"…
Descriptors: Skill Development, Behavioral Objectives, Science Education, Formative Evaluation
Peer reviewed Peer reviewed
Direct linkDirect link
Zhang, Bo; Ohland, Matthew W. – Applied Measurement in Education, 2009
One major challenge in using group projects to assess student learning is accounting for the differences of contribution among group members so that the mark assigned to each individual actually reflects their performance. This research addresses the validity of grading group projects by evaluating different methods that derive individualized…
Descriptors: Monte Carlo Methods, Validity, Student Evaluation, Evaluation Methods
Peer reviewed Peer reviewed
Direct linkDirect link
Wang, Wen-Chung; Su, Ya-Hui – Applied Measurement in Education, 2004
In this study we investigated the effects of the average signed area (ASA) between the item characteristic curves of the reference and focal groups and three test purification procedures on the uniform differential item functioning (DIF) detection via the Mantel-Haenszel (M-H) method through Monte Carlo simulations. The results showed that ASA,…
Descriptors: Test Bias, Student Evaluation, Evaluation Methods, Test Items
Peer reviewed Peer reviewed
Direct linkDirect link
Ayala, Carlos C.; Shavelson, Richard J.; Ruiz-Primo, Maria Araceli; Brandon, Paul R.; Yin, Yue; Furtak, Erin Marie; Young, Donald B.; Tomita, Miki K. – Applied Measurement in Education, 2008
The idea that formative assessments embedded in a curriculum could help guide teachers toward better instructional practices that lead to greater student learning has taken center stage in science assessment research. In order to embed formative assessments in a curriculum, curriculum developers and assessment specialists should collaborate to…
Descriptors: Student Evaluation, Formative Evaluation, Teaching Methods, Alignment (Education)
Peer reviewed Peer reviewed
Direct linkDirect link
D'Agostino, Jerome V.; Welsh, Megan E.; Cimetta, Adriana D.; Falco, Lia D.; Smith, Shannon; VanWinkle, Waverely Hester; Powers, Sonya J. – Applied Measurement in Education, 2008
Central to the standards-based assessment validation process is an examination of the alignment between state standards and test items. Several alignment analysis systems have emerged recently, but most rely on either traditional rating or matching techniques. Little, if any, analyses have been reported on the degree of consistency between the two…
Descriptors: Test Items, Student Evaluation, State Standards, Evaluation Methods
Peer reviewed Peer reviewed
Wilson, Mark; Sloane, Kathryn – Applied Measurement in Education, 2000
Describes the principles that guided the creation and implementation of a system of embedded assessments, the Berkeley Evaluation and Assessment Research System (BEAR). The assessment system builds on methodological advances in alternative assessment. Discusses how the application of the principles generates the component parts of the system. (SLD)
Descriptors: Educational Practices, Evaluation Methods, Research, Student Evaluation
Peer reviewed Peer reviewed
Trevisan, Michael S. – Applied Measurement in Education, 1999
State-level administrator certification requirements were studied with respect to student-assessment expectations, using responses of state certification offices. Only 18 states require some form of student-assessment knowledge and skills, and only Washington state requires the assessment competencies promulgated by the National Policy Board for…
Descriptors: Administrators, Certification, Competence, Evaluation Methods
Peer reviewed Peer reviewed
Direct linkDirect link
Penfield, Randall D.; Miller, Jeffrey M. – Applied Measurement in Education, 2004
As automated scoring of complex constructed-response examinations reaches operational status, the process of evaluating the quality of resultant scores, particularly in contrast to scores of expert human graders, becomes as complex as the data itself. Using a vignette from the Architectural Registration Examination (ARE), this article explores the…
Descriptors: Student Evaluation, Evaluation Methods, Content Validity, Scoring
Peer reviewed Peer reviewed
Direct linkDirect link
Gierl, Mark J.; Gotzmann, Andrea; Boughton, Keith A. – Applied Measurement in Education, 2004
Differential item functioning (DIF) analyses are used to identify items that operate differently between two groups, after controlling for ability. The Simultaneous Item Bias Test (SIBTEST) is a popular DIF detection method that matches examinees on a true score estimate of ability. However in some testing situations, like test translation and…
Descriptors: True Scores, Simulation, Test Bias, Student Evaluation
Peer reviewed Peer reviewed
Direct linkDirect link
Rodriguez, Michael C. – Applied Measurement in Education, 2004
This project evaluated the relationship between assessment practices and achievement and the mediating roles of student self-efficacy and effort. In part, this was based on a framework proposed by Brookhart (1997). The United States portion of the Third International Math and Science Study was used to estimate these relationships. Several student…
Descriptors: Student Characteristics, Self Efficacy, Mathematics Achievement, Student Evaluation
Peer reviewed Peer reviewed
Direct linkDirect link
Goodman, Dean P.; Hambleton, Ronald K. – Applied Measurement in Education, 2004
A critical, but often neglected, component of any large-scale assessment program is the reporting of test results. In the past decade, a body of evidence has been compiled that raises concerns over the ways in which these results are reported to and understood by their intended audiences. In this study, current approaches for reporting…
Descriptors: Test Results, Student Evaluation, Scores, Testing Programs
Peer reviewed Peer reviewed
Direct linkDirect link
Johnson, Robert L.; Penny, Jim; Fisher, Steve; Kuhs, Therese – Applied Measurement in Education, 2003
When raters assign different scores to a performance task, a method for resolving rating differences is required to report a single score to the examinee. Recent studies indicate that decisions about examinees, such as pass/fail decisions, differ across resolution methods. Previous studies also investigated the interrater reliability of…
Descriptors: Test Reliability, Test Validity, Scores, Interrater Reliability
Peer reviewed Peer reviewed
Brookhart, Susan M. – Applied Measurement in Education, 1994
This article is organized into two parts: (1) a review of literature on teachers' grading practices, and (2) a discussion of the findings about teachers' grading practices in light of evaluation and motivation theory. The discussion considers both research implications and practical recommendations. (Author)
Descriptors: Educational Assessment, Educational Practices, Elementary Secondary Education, Evaluation Methods
Previous Page | Next Page ยป
Pages: 1  |  2