NotesFAQContact Us
Collection
Advanced
Search Tips
Laws, Policies, & Programs
Individuals with Disabilities…1
What Works Clearinghouse Rating
Showing 1 to 15 of 29 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Wallace N. Pinto Jr.; Jinnie Shin – Journal of Educational Measurement, 2025
In recent years, the application of explainability techniques to automated essay scoring and automated short-answer grading (ASAG) models, particularly those based on transformer architectures, has gained significant attention. However, the reliability and consistency of these techniques remain underexplored. This study systematically investigates…
Descriptors: Automation, Grading, Computer Assisted Testing, Scoring
Peer reviewed Peer reviewed
Direct linkDirect link
Rebecca Sickinger; Tineke Brunfaut; John Pill – Language Testing, 2025
Comparative Judgement (CJ) is an evaluation method, typically conducted online, whereby a rank order is constructed, and scores calculated, from judges' pairwise comparisons of performances. CJ has been researched in various educational contexts, though only rarely in English as a Foreign Language (EFL) writing settings, and is generally agreed to…
Descriptors: Writing Evaluation, English (Second Language), Second Language Learning, Second Language Instruction
Peer reviewed Peer reviewed
Direct linkDirect link
Aray, Henry; Pedauga, Luis – Educational Measurement: Issues and Practice, 2019
This article presents a novel experimental methodology in which groups of students were offered the option to choose between two equivalent scoring rules to assess a multiple-choice test. The effect of choosing the scoring rule on marks is tested. Two major contributions arise from this research. First, it contributes to the literature on the…
Descriptors: Multiple Choice Tests, Scoring, Student Attitudes, Decision Making
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Thai, Thuy; Sheehan, Susan – Language Education & Assessment, 2022
In language performance tests, raters are important as their scoring decisions determine which aspects of performance the scores represent; however, raters are considered as one of the potential sources contributing to unwanted variability in scores (Davis, 2012). Although a great number of studies have been conducted to unpack how rater…
Descriptors: Rating Scales, Speech Communication, Second Language Learning, Second Language Instruction
Peer reviewed Peer reviewed
Direct linkDirect link
Han, Chao; Xiao, Xiaoyan – Language Testing, 2022
The quality of sign language interpreting (SLI) is a gripping construct among practitioners, educators and researchers, calling for reliable and valid assessment. There has been a diverse array of methods in the extant literature to measure SLI quality, ranging from traditional error analysis to recent rubric scoring. In this study, we want to…
Descriptors: Comparative Analysis, Sign Language, Deaf Interpreting, Evaluators
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Çekiç, Ahmet; Bakla, Arif – International Online Journal of Education and Teaching, 2021
The Internet and the software stores for mobile devices come with a huge number of digital tools for any task, and those intended for digital formative assessment (DFA) have burgeoned exponentially in the last decade. These tools vary in terms of their functionality, pedagogical quality, cost, operating systems and so forth. Teachers and learners…
Descriptors: Formative Evaluation, Futures (of Society), Computer Assisted Testing, Guidance
Ward, C.; Metz, A.; Louison, L.; Loper, A.; Cusumano, D. – National Implementation Research Network, 2019
The purpose of the "Drivers Best Practices Assessment" ("DBPA") is to assist organizations in assessing their current supports and resources for quality use of selected programs or practices. Specifically, organizations can use it to: (1) Identify strengths and opportunities for improvement in their current supports and…
Descriptors: Best Practices, Organizational Effectiveness, Program Effectiveness, Program Evaluation
Peer reviewed Peer reviewed
Direct linkDirect link
Feranchak, Bret; Deiger, Megan – AERA Online Paper Repository, 2017
Increasingly content area projects and programs at the K-12 level, such as in mathematics, involve a programmatic component or project emphasis on developing "teacher leadership". However, there is no consistent definition or framework for this construct and even fewer validated tools for measuring it. This paper describes our efforts in…
Descriptors: Teacher Leadership, Mathematics Instruction, Guidelines, Elementary Secondary Education
Rhode Island Department of Education, 2019
Rhode Island is committed to ensuring that all educators receive fair, accurate, and meaningful educator evaluations that provide information that can help all teachers improve and refine their practice. Currently, districts in Rhode Island may submit a district-designed model for approval that complies with the Educator Evaluation System…
Descriptors: Guides, Evaluation Methods, Public Schools, Educational Objectives
Peer reviewed Peer reviewed
Direct linkDirect link
Ercikan, Kadriye; Oliveri, María Elena – Applied Measurement in Education, 2016
Assessing complex constructs such as those discussed under the umbrella of 21st century constructs highlights the need for a principled assessment design and validation approach. In our discussion, we made a case for three considerations: (a) taking construct complexity into account across various stages of assessment development such as the…
Descriptors: Evaluation Methods, Test Construction, Design, Scaling
Rhode Island Department of Education, 2015
Rhode Island educators believe that implementing a fair, accurate, and meaningful educator evaluation and support system will help improve teaching and learning. The primary purpose of the Rhode Island Model Teacher Evaluation and Support System (Rhode Island Model) is to help all teachers improve. Through the Model, the goal is to help create a…
Descriptors: Guides, Student Evaluation, Evaluation Methods, Public Schools
Peer reviewed Peer reviewed
Direct linkDirect link
Oakes, Wendy Peia; Lane, Kathleen Lynne; Cox, Meredith Lucille; Messenger, Mallory – Preventing School Failure, 2014
In this article, the authors provide an overview of behavior screening tools available, including free and commercially available options. Next, the authors offer step-by-step procedures for (a) selecting, (b) scheduling, (c) preparing, (d) administering, and (e) scoring and interpreting behaviors screening tools. The authors conclude with…
Descriptors: Screening Tests, Behavior Problems, Decision Making, Models
Rhode Island Department of Education, 2014
The purpose of this Guidebook is to describe the process and basic requirements for the student learning measures that are used as part of the building administrator evaluation and support process. For aspects of the process that have room for flexibility and school/district-level discretion, the different options have been clearly separated and…
Descriptors: Guides, Student Evaluation, Evaluation Methods, Public Schools
Rhode Island Department of Education, 2014
The purpose of this Guidebook is to describe the process and basic requirements for the student learning measures that are used as part of the support professional evaluation and support process. For aspects of the process that have room for flexibility and school/district-level discretion, the different options have been clearly separated and…
Descriptors: Guides, Student Evaluation, Evaluation Methods, Public Schools
Rhode Island Department of Education, 2015
Rhode Island is committed to ensuring that all educators receive fair, accurate, and meaningful educator evaluations that provide information that can help all teachers improve and refine their practice. This commitment is an outgrowth of the state's recognition of the influence teachers have on student growth and achievement. Currently, districts…
Descriptors: Guides, Evaluation Methods, Public Schools, Educational Objectives
Previous Page | Next Page »
Pages: 1  |  2