Showing 181 to 195 of 3,162 results
Peer reviewed
Zhang, Mengxue; Heffernan, Neil; Lan, Andrew – International Educational Data Mining Society, 2023
Automated scoring of student responses to open-ended questions, including short-answer questions, has great potential to scale to a large number of responses. Recent approaches for automated scoring rely on supervised learning, i.e., training classifiers or fine-tuning language models on a small number of responses with human-provided score…
Descriptors: Scoring, Computer Assisted Testing, Mathematics Instruction, Mathematics Tests
Peer reviewed
Wong, Wai Yee Amy; Thistlethwaite, Jill; Moni, Karen; Roberts, Chris – Advances in Health Sciences Education, 2023
Examiners' judgements play a critical role in competency-based assessments such as objective structured clinical examinations (OSCEs). The standardised nature of OSCEs and their alignment with regulatory accountability assure their wide use as high-stakes assessment in medical education. Research into examiner behaviours has predominantly explored…
Descriptors: Sociocultural Patterns, Evaluators, Performance Based Assessment, Accountability
Peer reviewed
Dunaway, Krystall; Gardner, Kristine; Grieve, Karly – American Journal of Evaluation, 2023
As part of its "Guiding Principles for Evaluators," the American Evaluation Association (AEA) requires that evaluators develop cultural competencies. Using a successive-independent-samples design, the researchers sought to compare perceptions of cultural competence across a duration of 10 years. Qualitative data were collected via online…
Descriptors: Cultural Awareness, Program Evaluation, Evaluators, Preferences
Peer reviewed
Jia, Wenfeng; Zhang, Peixin – Language Testing in Asia, 2023
It is widely believed that raters' cognition is an important aspect of writing assessment, as it has both logical and temporal priority over scores. Based on a critical review of previous research in this area, it is found that raters' cognition can be boiled down to two fundamental issues: building text images and strategies for articulating scores.…
Descriptors: Problem Solving, Cognitive Processes, Writing Evaluation, Evaluators
Peer reviewed
Anderson, Emily E.; Hurley, Elisa A.; Serpico, Kimberley; Johnson, Ann; Rowe, Jessica; Singleton, Megan; Bierer, Barbara E.; Cholka, Brooke; Chaudhari, Swapnali; Fernandez Lynch, Holly – Research Ethics, 2023
The primary purpose of Institutional Review Boards (IRBs) is to protect the rights and welfare of human research participants. Evaluation and measurement of how IRBs satisfy this purpose and other important goals are open questions that demand empirical research. Research on IRBs, and the Human Research Protection Programs (HRPPs) of which they…
Descriptors: Research, Ethics, Stakeholders, Barriers
Peer reviewed
McDonough, Kim; Lindberg, Rachael; Trofimovich, Pavel; Tekin, Oguzhan – Language Teaching, 2023
This replication study seeks to extend the generalizability of an exploratory study (McDonough et al., 2019) that identified holds (i.e., temporary cessation of dynamic movement by the listener) as a reliable visual cue of non-understanding. Conversations between second language (L2) English speakers in the Corpus of English as a Lingua Franca…
Descriptors: Second Language Learning, Second Language Instruction, English (Second Language), Computational Linguistics
James DiFranco – ProQuest LLC, 2023
The recent Next Generation Science Standards recommend that students have more opportunities to engage in the modeling work scientists do. Science classroom tasks include simple inquiry compared to more complex authentic inquiry that closely resembles authentic science. Science fair participation presents students with the opportunity to engage in…
Descriptors: Science Instruction, Teaching Methods, Authentic Learning, Inquiry
Peer reviewed
Wind, Stefanie A. – Measurement: Interdisciplinary Research and Perspectives, 2020
Rater fit analyses provide insight into the degree to which rater judgments correspond to expected properties, as defined within a measurement framework. Parametric models such as the Rasch model provide a useful framework for evaluating rating quality; however, these models are not appropriate for all assessment contexts. The purpose of this…
Descriptors: Evaluators, Goodness of Fit, Simulation, Psychometrics
Peer reviewed
Lubienski, Sarah Theule – Educational Researcher, 2020
This essay provides advice for effectively reviewing conference proposals, including how to write comments that are helpful to proposal authors, how to use the "Comments to Program Chair" box, and issues to consider when assigning proposal ratings and recommending acceptance or rejection. Several benefits of reviewing proposals are…
Descriptors: Conferences (Gatherings), Conference Papers, Evaluation Methods, Evaluation Criteria
Peer reviewed
Miriti, Justine Majau; Kirima, Lucy K.; Nzivo, Mirriam M.; Thuranira, Simon; Budambula, Nancy L. M. – International Journal of Educational Administration and Policy Studies, 2021
Human Resource (HR) practices like performance appraisal (PA) training are meant to ensure that employees are equipped with the knowledge and skills needed for the attainment of organisational goals. However, gaps still exist on the relationship between PA employees' training and employees' performance. This study aimed to establish the…
Descriptors: Foreign Countries, Schools of Education, Personnel Evaluation, Evaluators
Peer reviewed
Sanosi, Abdulaziz; Abdalla, Mohamed – Australian Journal of Applied Linguistics, 2021
This study aimed to examine the potentials of the NLP approach in detecting discourse markers (DMs), namely okay, in transcribed spoken data. One hundred thirty-eight concordance lines were presented to human referees to judge the functions of okay in them as a DM or Non-DM. After that, the researchers used a Python script written according to the…
Descriptors: Natural Language Processing, Computational Linguistics, Programming Languages, Accuracy
Peer reviewed
Nagle, Charles L.; Rehman, Ivana – Studies in Second Language Acquisition, 2021
Listener-based ratings have become a prominent means of defining second language (L2) users' global speaking ability. In most cases, local listeners are recruited to evaluate speech samples in person. However, in many teaching and research contexts, recruiting local listeners may not be possible or advisable. The goal of this study was to hone a…
Descriptors: Second Language Learning, Intercultural Communication, Speech, Language Research
Peer reviewed
Ben Smith; Stephen Morris; Harry Armitage – Education Endowment Foundation, 2021
This guidance is intended for the planning stage of trials -- in particular, trials for which the primary outcome measure being considered is attainment at GCSE (and in particular, GCSE English Language and Mathematics). However, it is generalisable to any other graded qualification or assessment used as an outcome measure, e.g., A-levels. It is…
Descriptors: Foreign Countries, National Competency Tests, Sample Size, Pilot Projects
Peer reviewed
Reagan Mozer; Luke Miratrix; Jackie Eunjung Relyea; James S. Kim – Journal of Educational and Behavioral Statistics, 2024
In a randomized trial that collects text as an outcome, traditional approaches for assessing treatment impact require that each document first be manually coded for constructs of interest by human raters. An impact analysis can then be conducted to compare treatment and control groups, using the hand-coded scores as a measured outcome. This…
Descriptors: Scoring, Evaluation Methods, Writing Evaluation, Comparative Analysis
Peer reviewed
Whalen, Kate; Paez, Antonio – Journal of Geography, 2022
Experiential education partnered with guided reflection is thought to support students with higher-order thinking skills. In this study, 44 reflections from two university-level sustainability courses were compared. In both courses students were asked to write a reflection, but only one course used the Reflective Learning Framework (RLF). Tests of…
Descriptors: Geography Instruction, Thinking Skills, Experiential Learning, Sustainability