NotesFAQContact Us
Collection
Advanced
Search Tips
Showing 46 to 60 of 10,088 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Janika Saretzki; Rosalie Andrae; Boris Forthmann; Mathias Benedek – Journal of Creative Behavior, 2025
Divergent thinking (DT) ability is widely regarded as a central cognitive capacity underlying creativity, but its assessment is challenged by the fact that DT tasks yield a variable number of responses. Various approaches for the scoring of DT tasks have been proposed, which differ in how responses are evaluated and aggregated within a task. The…
Descriptors: Creative Thinking, Creativity Tests, Scoring, Metacognition
Michael O. Martin, Editor; Julian Fraillon, Editor; Heiko Sibberns, Editor; Betina Borisova, Contributor; Ekaterina Buzkich, Contributor; David Ebbs, Contributor; Eugenio Gonzalez, Contributor; Seamus Hegarty, Contributor; Sabine Meinck, Contributor; Sebastian Meyer, Contributor; Irini Moustaki, Contributor; Lauren Musu, Contributor; Keith Rust, Contributor; Ulrich Sievers, Contributor; Matthias von Davier, Contributor; Kentaro Yamamoto, Contributor – International Association for the Evaluation of Educational Achievement, 2025
This publication presents "IEA's Technical Standards for International Large-Scale Assessment." The initial standards, published in 1999, aimed to consolidate the best practices and methodological rigor in IEA's approach to educational assessment, addressing the unique needs of international studies. The standards presented in this…
Descriptors: International Assessment, Standards, Test Construction, Data Collection
Peer reviewed Peer reviewed
Direct linkDirect link
Rosaline Tandiono; Amelia Limijaya – Asia-Pacific Education Researcher, 2025
Self and peer assessment in group work offers numerous benefits but is also susceptible to bias. Yet, research examining bias in self and peer assessments often overlooks cultural perspectives and predominantly favors Western contexts. This study aims to address this gap by examining how culture influences rater bias in self and peer assessments…
Descriptors: Evaluators, Bias, Self Evaluation (Individuals), Peer Evaluation
Frank Morley; Emma Walland – Research Matters, 2025
The recent development of Large Language Models (LLMs) such as Claude, Gemini, and GPT has led to widespread attention on potential applications of these models. Marking exams is a domain which requires the ability to interpret and evaluate student responses (often consisting of written text), and the potential for artificial intelligence (AI)…
Descriptors: Ethics, Artificial Intelligence, Automation, Scoring
Peer reviewed Peer reviewed
Direct linkDirect link
Guy B. deBrun – Journal of Outdoor Recreation, Education, and Leadership, 2025
Discussions of what it means to be an effective outdoor leader are common in outdoor education literature (Martin et al., 2025; Smith, 2021). Research has identified core competencies (Martin et al., 2025), conceptual frameworks (Pomfret et al., 2023), and course curricula/qualifications for effective leadership (Baker & O'Brien, 2019; Seaman…
Descriptors: Outdoor Leadership, Leadership Effectiveness, Evaluation Methods, Scoring Rubrics
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Steven Holtzman; Jonathan Steinberg; Jonathan Weeks; Christopher Robertson; Jessica Findley; David Klieger – ETS Research Report Series, 2024
At a time when institutions of higher education are exploring alternatives to traditional admissions testing, institutions are also seeking to better support students and prepare them for academic success. Under such an engaged model, one may seek to measure not just the accumulated knowledge and skills that students would bring to a new academic…
Descriptors: Law Schools, College Applicants, Legal Education (Professions), College Entrance Examinations
Peer reviewed Peer reviewed
Direct linkDirect link
Stefanie A. Wind; Yuan Ge – Measurement: Interdisciplinary Research and Perspectives, 2024
Mixed-format assessments made up of multiple-choice (MC) items and constructed response (CR) items that are scored using rater judgments include unique psychometric considerations. When these item types are combined to estimate examinee achievement, information about the psychometric quality of each component can depend on that of the other. For…
Descriptors: Interrater Reliability, Test Bias, Multiple Choice Tests, Responses
Peer reviewed Peer reviewed
Direct linkDirect link
Benjamin Goecke; Paul V. DiStefano; Wolfgang Aschauer; Kurt Haim; Roger Beaty; Boris Forthmann – Journal of Creative Behavior, 2024
Automated scoring is a current hot topic in creativity research. However, most research has focused on the English language and popular verbal creative thinking tasks, such as the alternate uses task. Therefore, in this study, we present a large language model approach for automated scoring of a scientific creative thinking task that assesses…
Descriptors: Creativity, Creative Thinking, Scoring, Automation
Peer reviewed Peer reviewed
Direct linkDirect link
Daniel Lewis; Melanie Graw; Michael Baker – Journal of Applied Testing Technology, 2024
Embedded Standard Setting (ESS; Lewis & Cook, 2020) transforms standard setting from a standalone workshop to an active part of the assessment development lifecycle. ESS purports to lower costs by eliminating the standard-setting workshop and enhance the validity argument by maintaining a consistent focus on the evidentiary relationship…
Descriptors: Standard Setting (Scoring), Test Items, Test Construction, Food Service
Peer reviewed Peer reviewed
Direct linkDirect link
Culpepper, Dawn; White-Lewis, Damani; O'Meara, KerryAnn; Templeton, Lindsey; Anderson, Julia – Journal of Higher Education, 2023
Many colleges and universities now require faculty search committees to use rubrics when evaluating faculty job candidates, as proponents believe these "decision-support tools" can reduce the impact of bias in candidate evaluation. That is, rubrics are intended to ensure that candidates are evaluated more fairly, which is then thought to…
Descriptors: Scoring Rubrics, Bias, Personnel Selection, College Faculty
Peer reviewed Peer reviewed
Direct linkDirect link
A. R. Georgeson – Structural Equation Modeling: A Multidisciplinary Journal, 2025
There is increasing interest in using factor scores in structural equation models and there have been numerous methodological papers on the topic. Nevertheless, sum scores, which are computed from adding up item responses, continue to be ubiquitous in practice. It is therefore important to compare simulation results involving factor scores to…
Descriptors: Structural Equation Models, Scores, Factor Analysis, Statistical Bias
Peer reviewed Peer reviewed
Direct linkDirect link
Stephen M. Kosslyn; Elizabeth P. Callaghan; David P. Green – Learning: Research and Practice, 2025
This article addresses the transformative potential of generative Artificial Intelligence (AI) to optimize human potential by making education more efficient and effective. We describe a new teaching method called "Dynamic Personalized Learning." In this method, AI dynamically provides feedback and adjusts the level and pace of…
Descriptors: Artificial Intelligence, Feedback (Response), Individualized Instruction, Learning Objectives
Peer reviewed Peer reviewed
Direct linkDirect link
Scott A. Crossley; Minkyung Kim; Quian Wan; Laura K. Allen; Rurik Tywoniw; Danielle S. McNamara – Grantee Submission, 2025
This study examines the potential to use non-expert, crowd-sourced raters to score essays by comparing expert raters' and crowd-sourced raters' assessments of writing quality. Expert raters and crowd-sourced raters scored 400 essays using a standardised holistic rubric and comparative judgement (pairwise ratings) scoring techniques, respectively.…
Descriptors: Writing Evaluation, Essays, Novices, Knowledge Level
Peer reviewed Peer reviewed
Direct linkDirect link
Scott A. Crossley; Minkyung Kim; Qian Wan; Laura K. Allen; Rurik Tywoniw; Danielle McNamara – Assessment in Education: Principles, Policy & Practice, 2025
This study examines the potential to use non-expert, crowd-sourced raters to score essays by comparing expert raters' and crowd-sourced raters' assessments of writing quality. Expert raters and crowd-sourced raters scored 400 essays using a standardised holistic rubric and comparative judgement (pairwise ratings) scoring techniques, respectively.…
Descriptors: Writing Evaluation, Essays, Novices, Knowledge Level
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Abdul-Waris Mustapha; Mohammed Gunu Ibrahim – International Journal of Contemporary Educational Research, 2025
This study examined the adherence of Senior High School teachers to the principles of test construction, administration, and scoring. Achievement tests play a critical role in assessing student learning and guiding instructional decisions, yet challenges in their effective implementation persist. Using a descriptive research design, data were…
Descriptors: Foreign Countries, High School Teachers, Achievement Tests, Testing
Pages: 1  |  2  |  3  |  4  |  5  |  6  |  7  |  8  |  9  |  10  |  11  |  ...  |  673