ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	3
Since 2006 (last 20 years)	6

Descriptor

Student Evaluation	20
Evaluation Methods	19
Educational Assessment	5
Scores	5
Test Construction	5
Elementary Secondary Education	4
Performance Based Assessment	4
Formative Evaluation	3
Measurement Techniques	3
Scoring	3
Simulation	3
Test Items	3
Test Use	3
Validity	3
Accountability	2
Behavioral Objectives	2
Educational Improvement	2
Educational Practices	2
Educational Research	2
Grades (Scholastic)	2
Grading	2
Interrater Reliability	2
Monte Carlo Methods	2
Outcomes of Education	2
Program Development	2
More ▼

Source

Applied Measurement in…

Publication Type

Journal Articles	20
Reports - Research	8
Reports - Evaluative	6
Reports - Descriptive	4
Guides - Non-Classroom	1
Information Analyses	1
Opinion Papers	1
Reports - General	1
Speeches/Meeting Papers	1

Education Level

Elementary Secondary Education	1
High Schools	1

Audience

Location

Arizona

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing 1 to 15 of 20 results Save | Export

Validating Rubric Scoring Processes: An Application of an Item Response Tree Model

Peer reviewed

Direct link

Myers, Aaron J.; Ames, Allison J.; Leventhal, Brian C.; Holzman, Madison A. – Applied Measurement in Education, 2020

When rating performance assessments, raters may ascribe different scores for the same performance when rubric application does not align with the intended application of the scoring criteria. Given performance assessment score interpretation assumes raters apply rubrics as rubric developers intended, misalignment between raters' scoring processes…

Descriptors: Scoring Rubrics, Validity, Item Response Theory, Interrater Reliability

Where Are We Now? Learning Progressions and Formative Assessment

Peer reviewed

Direct link

Gotwals, Amelia Wenk – Applied Measurement in Education, 2018

In this commentary, I consider the three empirical studies in this special issue based on two main aspects: (a) the nature of the learning progressions and (b) what formative assessment practice(s) were investigated. Specifically, I describe differences among the learning progressions in terms of scope and grain size. I also identify three…

Descriptors: Skill Development, Behavioral Objectives, Formative Evaluation, Evaluation Methods

An Argument for Formative Assessment with Science Learning Progressions

Peer reviewed

Direct link

Alonzo, Alicia C. – Applied Measurement in Education, 2018

Learning progressions--particularly as defined and operationalized in science education--have significant potential to inform teachers' formative assessment practices. In this overview article, I lay out an argument for this potential, starting from definitions for "formative assessment practices" and "learning progressions"…

Descriptors: Skill Development, Behavioral Objectives, Science Education, Formative Evaluation

How to Assign Individualized Scores on a Group Project: An Empirical Evaluation

Peer reviewed

Direct link

Zhang, Bo; Ohland, Matthew W. – Applied Measurement in Education, 2009

One major challenge in using group projects to assess student learning is accounting for the differences of contribution among group members so that the mark assigned to each individual actually reflects their performance. This research addresses the validity of grading group projects by evaluating different methods that derive individualized…

Descriptors: Monte Carlo Methods, Validity, Student Evaluation, Evaluation Methods

Effects of Average Signed Area Between Two Item Characteristic Curves and Test Purification Procedures on the DIF Detection via the Mantel-Haenszel Method

Peer reviewed

Direct link

Wang, Wen-Chung; Su, Ya-Hui – Applied Measurement in Education, 2004

In this study we investigated the effects of the average signed area (ASA) between the item characteristic curves of the reference and focal groups and three test purification procedures on the uniform differential item functioning (DIF) detection via the Mantel-Haenszel (M-H) method through Monte Carlo simulations. The results showed that ASA,…

Descriptors: Test Bias, Student Evaluation, Evaluation Methods, Test Items

From Formal Embedded Assessments to Reflective Lessons: The Development of Formative Assessment Studies

Peer reviewed

Direct link

Ayala, Carlos C.; Shavelson, Richard J.; Ruiz-Primo, Maria Araceli; Brandon, Paul R.; Yin, Yue; Furtak, Erin Marie; Young, Donald B.; Tomita, Miki K. – Applied Measurement in Education, 2008

The idea that formative assessments embedded in a curriculum could help guide teachers toward better instructional practices that lead to greater student learning has taken center stage in science assessment research. In order to embed formative assessments in a curriculum, curriculum developers and assessment specialists should collaborate to…

Descriptors: Student Evaluation, Formative Evaluation, Teaching Methods, Alignment (Education)

The Rating and Matching Item-Objective Alignment Methods

Peer reviewed

Direct link

D'Agostino, Jerome V.; Welsh, Megan E.; Cimetta, Adriana D.; Falco, Lia D.; Smith, Shannon; VanWinkle, Waverely Hester; Powers, Sonya J. – Applied Measurement in Education, 2008

Central to the standards-based assessment validation process is an examination of the alignment between state standards and test items. Several alignment analysis systems have emerged recently, but most rely on either traditional rating or matching techniques. Little, if any, analyses have been reported on the degree of consistency between the two…

Descriptors: Test Items, Student Evaluation, State Standards, Evaluation Methods

From Principles to Practice: An Embedded Assessment System.

Peer reviewed

Wilson, Mark; Sloane, Kathryn – Applied Measurement in Education, 2000

Describes the principles that guided the creation and implementation of a system of embedded assessments, the Berkeley Evaluation and Assessment Research System (BEAR). The assessment system builds on methodological advances in alternative assessment. Discusses how the application of the principles generates the component parts of the system. (SLD)

Descriptors: Educational Practices, Evaluation Methods, Research, Student Evaluation

Administrator Certification Requirements for Student Assessment Competence.

Peer reviewed

Trevisan, Michael S. – Applied Measurement in Education, 1999

State-level administrator certification requirements were studied with respect to student-assessment expectations, using responses of state certification offices. Only 18 states require some form of student-assessment knowledge and skills, and only Washington state requires the assessment competencies promulgated by the National Policy Board for…

Descriptors: Administrators, Certification, Competence, Evaluation Methods

Improving Content Validation Studies Using an Asymmetric Confidence Interval for the Mean of Expert Ratings

Peer reviewed

Direct link

Penfield, Randall D.; Miller, Jeffrey M. – Applied Measurement in Education, 2004

As automated scoring of complex constructed-response examinations reaches operational status, the process of evaluating the quality of resultant scores, particularly in contrast to scores of expert human graders, becomes as complex as the data itself. Using a vignette from the Architectural Registration Examination (ARE), this article explores the…

Descriptors: Student Evaluation, Evaluation Methods, Content Validity, Scoring

Performance of SIBTEST When the Percentage of DIF Items Is Large

Peer reviewed

Direct link

Gierl, Mark J.; Gotzmann, Andrea; Boughton, Keith A. – Applied Measurement in Education, 2004

Differential item functioning (DIF) analyses are used to identify items that operate differently between two groups, after controlling for ability. The Simultaneous Item Bias Test (SIBTEST) is a popular DIF detection method that matches examinees on a true score estimate of ability. However in some testing situations, like test translation and…

Descriptors: True Scores, Simulation, Test Bias, Student Evaluation

The Role of Classroom Assessment in Student Performance on TIMSS

Peer reviewed

Direct link

Rodriguez, Michael C. – Applied Measurement in Education, 2004

This project evaluated the relationship between assessment practices and achievement and the mediating roles of student self-efficacy and effort. In part, this was based on a framework proposed by Brookhart (1997). The United States portion of the Third International Math and Science Study was used to estimate these relationships. Several student…

Descriptors: Student Characteristics, Self Efficacy, Mathematics Achievement, Student Evaluation

Student Test Score Reports and Interpretive Guides: Review of Current Practices and Suggestions for Future Research

Peer reviewed

Direct link

Goodman, Dean P.; Hambleton, Ronald K. – Applied Measurement in Education, 2004

A critical, but often neglected, component of any large-scale assessment program is the reporting of test results. In the past decade, a body of evidence has been compiled that raises concerns over the ways in which these results are reported to and understood by their intended audiences. In this study, current approaches for reporting…

Descriptors: Test Results, Student Evaluation, Scores, Testing Programs

Score Resolution: An Investigation of the Reliability and Validity of Resolved Scores

Peer reviewed

Direct link

Johnson, Robert L.; Penny, Jim; Fisher, Steve; Kuhs, Therese – Applied Measurement in Education, 2003

When raters assign different scores to a performance task, a method for resolving rating differences is required to report a single score to the examinee. Recent studies indicate that decisions about examinees, such as pass/fail decisions, differ across resolution methods. Previous studies also investigated the interrater reliability of…

Descriptors: Test Reliability, Test Validity, Scores, Interrater Reliability

Teachers' Grading: Practice and Theory.

Peer reviewed

Brookhart, Susan M. – Applied Measurement in Education, 1994

This article is organized into two parts: (1) a review of literature on teachers' grading practices, and (2) a discussion of the findings about teachers' grading practices in light of evaluation and motivation theory. The discussion considers both research implications and practical recommendations. (Author)

Descriptors: Educational Assessment, Educational Practices, Elementary Secondary Education, Evaluation Methods

Previous Page | Next Page »

Pages: 1 | 2

Alonzo, Alicia C.	1
Ames, Allison J.	1
Aschbacher, Pamela R.	1
Ayala, Carlos C.	1
Baron, Joan Boykoff	1
Boughton, Keith A.	1
Brandon, Paul R.	1
Brookhart, Susan M.	1
Burry-Stock, Judith A.	1
Calfee, Robert	1
Cimetta, Adriana D.	1
D'Agostino, Jerome V.	1
Falco, Lia D.	1
Fisher, Steve	1
Furtak, Erin Marie	1
Gierl, Mark J.	1
Goodman, Dean P.	1
Gotwals, Amelia Wenk	1
Gotzmann, Andrea	1
Hambleton, Ronald K.	1
Holzman, Madison A.	1
Johnson, Robert L.	1
Kuhs, Therese	1
Leventhal, Brian C.	1
Miller, Jeffrey M.	1
More ▼