Publication Date
| In 2026 | 2 |
| Since 2025 | 188 |
| Since 2022 (last 5 years) | 1067 |
| Since 2017 (last 10 years) | 2889 |
| Since 2007 (last 20 years) | 6174 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Teachers | 480 |
| Practitioners | 358 |
| Researchers | 152 |
| Administrators | 122 |
| Policymakers | 51 |
| Students | 44 |
| Parents | 32 |
| Counselors | 25 |
| Community | 15 |
| Media Staff | 5 |
| Support Staff | 3 |
| More ▼ | |
Location
| Australia | 183 |
| Turkey | 157 |
| California | 133 |
| Canada | 124 |
| New York | 118 |
| United States | 112 |
| Florida | 107 |
| China | 103 |
| Texas | 72 |
| United Kingdom | 72 |
| Japan | 70 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 5 |
| Meets WWC Standards with or without Reservations | 11 |
| Does not meet standards | 8 |
Georgios Zacharis; Stamatios Papadakis – Educational Process: International Journal, 2025
Background/purpose: Generative artificial intelligence (GenAI) is often promoted as a transformative tool for assessment, yet evidence of its validity compared to human raters remains limited. This study examined whether an AI-based rater could be used interchangeably with trained faculty in scoring complex coursework. Materials/methods:…
Descriptors: Artificial Intelligence, Technology Uses in Education, Computer Assisted Testing, Grading
Wyse, Adam E. – Practical Assessment, Research & Evaluation, 2018
One common modification to the Angoff standard-setting method is to have panelists round their ratings to the nearest 0.05 or 0.10 instead of 0.01. Several reasons have been offered as to why it may make sense to have panelists round their ratings to the nearest 0.05 or 0.10. In this article, we examine one reason that has been suggested, which is…
Descriptors: Interrater Reliability, Evaluation Criteria, Scoring Formulas, Achievement Rating
Russell, Michael; Moncaleano, Sebastian – Practical Assessment, Research & Evaluation, 2020
Although both content alignment and standard-setting procedures rely on content-expert panel judgements, only the latter employs discussion among panel members. This study employed a modified form of the Webb methodology to examine content alignment for twelve tests administered as part of the Massachusetts Comprehensive Assessment System (MCAS).…
Descriptors: Test Content, Test Items, Discussion, Test Validity
Harrison, Frank – Journal of Competency-Based Education, 2020
The use of the 1-4 grading scale is gaining popularity as schools transition to competency-based education. However, independent interpretations and percentage conversions have led to general confusion and sometimes outright rejection of competency-based assessment. The first step to establish clear communication is to anchor the existing…
Descriptors: Grading, Competency Based Education, Grade Point Average, Scoring Rubrics
Thompson, Jennifer L. W.; Richmond, Aaron S.; Barboza, Barika; Bradley, Jennifer; White, J. Noland; Landrum, R. Eric – Teaching of Psychology, 2020
Although many psychology departments and instructors are aware of the "American Psychological Association Guidelines for the Undergraduate Psychology Major Version 2.0," they are often less aware of the means by which to assess student mastery of the recommended goals. Our purpose is to discuss general principles for assessment, offer a…
Descriptors: Student Evaluation, Psychology, Undergraduate Students, Taxonomy
Forthmann, Boris; Paek, Sue Hyeon; Dumas, Denis; Barbot, Baptiste; Holling, Heinz – British Journal of Educational Psychology, 2020
Background: The originality of divergent thinking (DT) production is one of the most critical indicators of creative potential. It is commonly scored using the statistical infrequency of responses relative to all responses provided in a given sample. Aims: Response frequency estimates vary in terms of measurement precision. This issue has been…
Descriptors: Creative Thinking, Creativity Tests, Item Response Theory, Scores
Lubienski, Sarah Theule – Educational Researcher, 2020
This essay provides advice for effectively reviewing conference proposals, including how to write comments that are helpful to proposal authors, how to use the "Comments to Program Chair" box, and issues to consider when assigning proposal ratings and recommending acceptance or rejection. Several benefits of reviewing proposals are…
Descriptors: Conferences (Gatherings), Conference Papers, Evaluation Methods, Evaluation Criteria
Abdelkareem Ali Abdelnaeim Mehany; Asmaa Ghanem Gheith – Online Submission, 2024
The present study attempted to examine the effect of using the connectivist approach on developing secondary-stage students' cross-cultural awareness and translation performance. The study comprised thirty-two first-year secondary stage students enrolled in El-Jalawea Institute, Sohag Governorate. The study adopted the quasi-experimental design.…
Descriptors: Cultural Awareness, Translation, Second Language Learning, Second Language Instruction
Kimberly Vo; Mahbub Sarkar; Paul J. White; Elizabeth Yuriev – Chemistry Education Research and Practice, 2024
Despite problem solving being a core skill in chemistry, students often struggle to solve chemistry problems. This difficulty may arise from students trying to solve problems through memorising algorithms. Goldilocks Help serves as a problem-solving scaffold that supports students through structured problem solving and its elements, such as…
Descriptors: Metacognition, Scaffolding (Teaching Technique), Chemistry, Science Instruction
Jordan King; Katja Brundiers; Daniel Fischer – Assessment & Evaluation in Higher Education, 2024
Evolving conceptions of the purposes of higher education suggest the need for assessment practices that contribute to preparing students to navigate complex social-ecological challenges. Though shifts in assessment discourse have begun to respond to this need, further examination of the role of students in assessment processes is required. One…
Descriptors: Personal Autonomy, Sustainability, Student Evaluation, Learning Theories
Mehdi Mehranirad; Nahid Basafa; Reza Zabihi – Early Child Development and Care, 2024
The present study aimed to examine the effect of activity engagement, age, language proficiency, and time elapse on children's response accuracy to adult's questions. A total of 70, 3- to 6-year-old children participated in the study, engaging in a story-telling activity, a proficiency test, and two interviews. Additionally, 57 of these children…
Descriptors: Accuracy, Language Proficiency, Age Differences, Reaction Time
Mashael Salem Alsalem – Cogent Education, 2024
This study investigated English as a Foreign Language (EFL) teachers' beliefs concerning the use of an AI grading tool (CoGrader) for essay scoring and feedback. The study also explored the factors which contributed to those beliefs. EFL teachers (n = 10) from public universities (n = 3) in Saudi Arabia participated in this study. The study…
Descriptors: Foreign Countries, Language Teachers, Teacher Attitudes, English (Second Language)
Christopher Harris; Joe Krajcik; James Pellegrino – NSTA Press, 2024
Imagine handing out assessments that spark enthusiasm rather than dread. In six easy-to-follow steps, this book empowers science teachers to create tasks that guide students to use their knowledge, not just memorize facts. The NGSA design process transforms assessments into valuable classroom tools that teachers can use to chart how students'…
Descriptors: Standards, Science Education, Teaching Methods, Teaching Guides
Alicia A. Stoltenberg – ProQuest LLC, 2024
Multiple-select multiple-choice items, or multiple-choice items with more than one correct answer, are used to quickly assess content on standardized assessments. Because there are multiple keys to these item types, there are also multiple ways to score student responses to these items. The purpose of this study was to investigate how changing the…
Descriptors: Scoring, Evaluation Methods, Multiple Choice Tests, Standardized Tests
Reagan Mozer; Luke Miratrix; Jackie Eunjung Relyea; James S. Kim – Journal of Educational and Behavioral Statistics, 2024
In a randomized trial that collects text as an outcome, traditional approaches for assessing treatment impact require that each document first be manually coded for constructs of interest by human raters. An impact analysis can then be conducted to compare treatment and control groups, using the hand-coded scores as a measured outcome. This…
Descriptors: Scoring, Evaluation Methods, Writing Evaluation, Comparative Analysis

Peer reviewed
Direct link
