NotesFAQContact Us
Collection
Advanced
Search Tips
Laws, Policies, & Programs
What Works Clearinghouse Rating
Showing 1 to 15 of 34 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Guy B. deBrun – Journal of Outdoor Recreation, Education, and Leadership, 2025
Discussions of what it means to be an effective outdoor leader are common in outdoor education literature (Martin et al., 2025; Smith, 2021). Research has identified core competencies (Martin et al., 2025), conceptual frameworks (Pomfret et al., 2023), and course curricula/qualifications for effective leadership (Baker & O'Brien, 2019; Seaman…
Descriptors: Outdoor Leadership, Leadership Effectiveness, Evaluation Methods, Scoring Rubrics
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Saito, Daisuke; Yajima, Risei; Washizaki, Hironori; Fukazawa, Yoshiaki – Education Sciences, 2021
In evaluating the learning achievement of programming-thinking skills, the method of using a rubric that describes evaluation items and evaluation stages is widely employed. However, few studies have evaluated the reliability, validity, and consistency of the rubrics themselves. In this study, we introduced a statistical method for evaluating the…
Descriptors: Scoring Rubrics, Computer Science Education, Programming, Reliability
Peer reviewed Peer reviewed
Direct linkDirect link
Morgan, Grant B.; Moore, Courtney A.; Floyd, Harlee S. – Journal of Psychoeducational Assessment, 2018
Although content validity--how well each item of an instrument represents the construct being measured--is foundational in the development of an instrument, statistical validity is also important to the decisions that are made based on the instrument. The primary purpose of this study is to demonstrate how simulation studies can be used to assist…
Descriptors: Simulation, Decision Making, Test Construction, Validity
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Reilly, Erin Dawna; Stafford, Rose Eleanore; Williams, Kyle Marie; Corliss, Stephanie Brooks – International Review of Research in Open and Distance Learning, 2014
The use of massive open online courses (MOOCs) to expand students' access to higher education has raised questions regarding the extent to which this course model can provide and assess authentic, higher level student learning. In response to this need, MOOC platforms have begun utilizing automated essay scoring (AES) systems that allow…
Descriptors: Online Courses, Essays, Scoring, Automation
Peer reviewed Peer reviewed
Direct linkDirect link
Britton, Emily; Simper, Natalie; Leger, Andrew; Stephenson, Jenn – Assessment & Evaluation in Higher Education, 2017
Effective teamwork skills are essential for success in an increasingly team-based workplace. However, research suggests that there is often confusion concerning how teamwork is measured and assessed, making it difficult to develop these skills in undergraduate curricula. The goal of the present study was to develop a sustainable tool for assessing…
Descriptors: Teamwork, Undergraduate Students, Skills, Student Evaluation
Peer reviewed Peer reviewed
Direct linkDirect link
Menéndez-Varela, José-Luis; Gregori-Giralt, Eva – Assessment & Evaluation in Higher Education, 2016
Rubrics have attained considerable importance in the authentic and sustainable assessment paradigm; nevertheless, few studies have examined their contribution to validity, especially outside the domain of educational studies. This empirical study used a quantitative approach to analyse the validity of a rubrics-based performance assessment. Raters…
Descriptors: Scoring Rubrics, Validity, Performance Based Assessment, College Freshmen
Peer reviewed Peer reviewed
Direct linkDirect link
Steedle, Jeffrey T.; Ferrara, Steve – Applied Measurement in Education, 2016
As an alternative to rubric scoring, comparative judgment generates essay scores by aggregating decisions about the relative quality of the essays. Comparative judgment eliminates certain scorer biases and potentially reduces training requirements, thereby allowing a large number of judges, including teachers, to participate in essay evaluation.…
Descriptors: Essays, Scoring, Comparative Analysis, Evaluators
Zeng, Songtian – ProQuest LLC, 2017
Over 30 states have adopted the Early Childhood Environmental Rating Scale-Revised (ECERS-R) as a component of their program quality assessment systems, but the use of ECERS-R on such a large scale has raised important questions about implementation. One of the most pressing question centers upon decisions users must make between two scoring…
Descriptors: Rating Scales, Scoring, Validity, Comparative Analysis
Peer reviewed Peer reviewed
Direct linkDirect link
Conoyer, Sarah J.; Lembke, Erica S.; Hosp, John L.; Espin, Christine A.; Hosp, Michelle K.; Poch, Apryl L. – Reading & Writing Quarterly, 2017
The present study examined the technical adequacy of maze-selection tasks constructed in 2 different ways: typical versus novel. We selected distractors for each measure systematically based on rules related to the content of the passage and the part of speech of the correct choice. Participants included 262 middle school students who were…
Descriptors: Cloze Procedure, Multiple Choice Tests, Reading Tests, Reading Comprehension
Peer reviewed Peer reviewed
Direct linkDirect link
Shin, Sun-Young; Lidster, Ryan – Language Testing, 2017
In language programs, it is crucial to place incoming students into appropriate levels to ensure that course curriculum and materials are well targeted to their learning needs. Deciding how and where to set cutscores on placement tests is thus of central importance to programs, but previous studies in educational measurement disagree as to which…
Descriptors: Language Tests, English (Second Language), Standard Setting (Scoring), Student Placement
Peer reviewed Peer reviewed
Direct linkDirect link
Peterman, Karen; Withy, Kelley; Boulay, Rachel – CBE - Life Sciences Education, 2018
A common challenge in the evaluation of K-12 science education is identifying valid scales that are an appropriate fit for both a student's age and the educational outcomes of interest. Though many new scales have been validated in recent years, there is much to learn about the appropriate educational contexts and audiences for these measures.…
Descriptors: Self Efficacy, Career Choice, Vocational Interests, Correlation
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Mokhtari, Kouider; Dimitrov, Dimiter M.; Reichard, Carla A. – Studies in Second Language Learning and Teaching, 2018
In this study, we revised the "Metacognitive Awareness of Reading Strategies Inventory" (MARSI), a self-report instrument designed to assess students' awareness of reading strategies when reading school-related materials. We collected evidence of structural, generalizability, and external aspects of validity for the revised inventory…
Descriptors: Metacognition, Reading Strategies, Measures (Individuals), Factor Analysis
Peer reviewed Peer reviewed
Direct linkDirect link
Brancaccio-Taras, Loretta; Pape-Lindstrom, Pamela; Peteroy-Kelly, Marcy; Aguirre, Karen; Awong-Taylor, Judy; Balser, Teri; Cahill, Michael J.; Frey, Regina F.; Jack, Thomas; Kelrick, Michael; Marley, Kate; Miller, Kathryn G.; Osgood, Marcy; Romano, Sandra; Uzman, J. Akif; Zhao, Jiuqing – CBE - Life Sciences Education, 2016
The PULSE Vision & Change Rubrics, version 1.0, assess life sciences departments' progress toward implementation of the principles of the "Vision and Change report." This paper reports on the development of the rubrics, their validation, and their reliability in measuring departmental change aligned with the "Vision and…
Descriptors: Scoring Rubrics, Biological Sciences, Validity, Reliability
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Chen, Jin; Lin, Jianghao; Li, Xinguang – English Language Teaching, 2015
This article aims to find out the validity of rhythm measurements to capture the rhythmic features of Chinese English. Besides, the reliability of the valid rhythm measurements applied in automatically scoring the English rhythm proficiency of Chinese EFL learners is also explored. Thus, two experiments were carried out. First, thirty students of…
Descriptors: Foreign Countries, Language Rhythm, Measurement, Validity
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Breyer, F. Jay; Attali, Yigal; Williamson, David M.; Ridolfi-McCulla, Laura; Ramineni, Chaitanya; Duchnowski, Matthew; Harris, April – ETS Research Report Series, 2014
In this research, we investigated the feasibility of implementing the "e-rater"® scoring engine as a check score in place of all-human scoring for the "Graduate Record Examinations"® ("GRE"®) revised General Test (rGRE) Analytical Writing measure. This report provides the scientific basis for the use of e-rater as a…
Descriptors: Computer Software, Computer Assisted Testing, Scoring, College Entrance Examinations
Previous Page | Next Page »
Pages: 1  |  2  |  3