ERIC - Search Results

Publication Date

In 2025	9
Since 2024	36
Since 2021 (last 5 years)	100
Since 2016 (last 10 years)	218
Since 2006 (last 20 years)	404

Descriptor

Interrater Reliability	408
Foreign Countries	165
College Students	98
Undergraduate Students	95
Scoring Rubrics	80
Student Evaluation	74
Correlation	66
Comparative Analysis	64
Scores	61
Statistical Analysis	61
English (Second Language)	56
Teaching Methods	56
Second Language Learning	55
Test Reliability	55
Evaluation Methods	54
College Faculty	51
Evaluators	51
Preservice Teachers	49
Student Attitudes	48
Scoring	47
Graduate Students	45
Validity	41
Reliability	39
Second Language Instruction	39
Test Validity	37
More ▼

Publication Type

Journal Articles	368
Reports - Research	338
Tests/Questionnaires	53
Reports - Evaluative	27
Dissertations/Theses -…	22
Information Analyses	15
Reports - Descriptive	15
Speeches/Meeting Papers	7
Numerical/Quantitative Data	2
Collected Works - General	1
Collected Works - Proceedings	1
Guides - Classroom - Teacher	1
Guides - Non-Classroom	1
Non-Print Media	1
Opinion Papers	1
Reference Materials - General	1
Reports - General	1
More ▼

Education Level

Postsecondary Education	408
Higher Education	401
Secondary Education	31
Elementary Education	20
Elementary Secondary Education	12
High Schools	11
Early Childhood Education	8
Two Year Colleges	7
Middle Schools	6
Junior High Schools	5
Adult Education	4
Grade 10	3
Primary Education	3
Grade 2	2
Kindergarten	2
Preschool Education	2
Grade 3	1
Grade 4	1
Grade 5	1
Intermediate Grades	1
More ▼

Audience

Practitioners	1
Teachers	1

Location

Turkey	26
China	19
Australia	13
Japan	11
United Kingdom	11
Canada	10
Germany	8
Taiwan	8
California	7
Netherlands	7
United States	7
New Zealand	6
Florida	5
Iran	5
Pennsylvania	5
Saudi Arabia	5
South Korea	5
Texas	5
Belgium	4
Malaysia	4
Singapore	4
Washington	4
Indiana	3
Maryland	3
Philippines	3
More ▼

Laws, Policies, & Programs

Individuals with Disabilities…	2
Americans with Disabilities…	1
Pell Grant Program	1
Rehabilitation Act 1973…	1
Temporary Assistance for…	1

Assessments and Surveys

Graduate Record Examinations	6
Test of English as a Foreign…	5
SAT (College Admission Test)	3
ACT Assessment	2
Draw a Person Test	2
edTPA (Teacher Performance…	2
COMPASS (Computer Assisted…	1
Classroom Assessment Scoring…	1
Flesch Kincaid Grade Level…	1
Graduate Management Admission…	1
International English…	1
Modern Language Aptitude Test	1
Program for International…	1
Study Process Questionnaire	1
Trends in International…	1
United States Medical…	1
Wechsler Individual…	1
Woodcock Reading Mastery Test	1
More ▼

What Works Clearinghouse Rating

Does not meet standards

Showing 1 to 15 of 408 results Save | Export

Inconsistencies in Rater-Based Assessments Mainly Affect Borderline Candidates: But Using Simple Heuristics Might Improve Pass-Fail Decisions

Peer reviewed

Direct link

Stefan K. Schauber; Anne O. Olsen; Erik L. Werner; Morten Magelssen – Advances in Health Sciences Education, 2024

Introduction: Research in various areas indicates that expert judgment can be highly inconsistent. However, expert judgment is indispensable in many contexts. In medical education, experts often function as examiners in rater-based assessments. Here, disagreement between examiners can have far-reaching consequences. The literature suggests that…

Descriptors: Medical Students, Performance Based Assessment, Expertise, Interrater Reliability

Agree to Disagree: Multiple Methods to Assess Rater Agreement during Student Teaching

Peer reviewed

Direct link

Elayne P. Colón; Lori M. Dassa; Thomas M. Dana; Nathan P. Hanson – Action in Teacher Education, 2024

To meet accreditation expectations, teacher preparation programs must demonstrate their candidates are evaluated using summative assessment tools that yield sound, reliable, and valid data. These tools are primarily used by the clinical experience team -- university supervisors and mentor teachers. Institutional beliefs regarding best practices…

Descriptors: Student Teachers, Teacher Interns, Evaluation Methods, Interrater Reliability

Citation Metrics and Boyer's Model of Scholarship: How Do Bibliometrics and Altmetrics Respond to Research Impact?

Peer reviewed

Direct link

Gilstrap, Donald L.; Whitver, Sara Maurice; Scalfani, Vincent F.; Bray, Nathaniel J. – Innovative Higher Education, 2023

This article explores how well bibliometrics and altmetrics reflect research impact in relation to Boyer's Model of the Scholarship. Indices used for both types of metrics are explored and discussed while including an analysis on primary methodological works performed on each in the literature to date. As confirmatory in nature, we chose as our…

Descriptors: Bibliometrics, Models, Scholarship, Research

Do Mathematicians and Undergraduates Agree about Explanation Quality?

Peer reviewed

Direct link

Evans, Tanya; Mejía-Ramos, Juan Pablo; Inglis, Matthew – Educational Studies in Mathematics, 2022

Offering explanations is a central part of teaching mathematics, and understanding those explanations is a vital activity for learners. Given this, it is natural to ask what makes a good mathematical explanation. This question has received surprisingly little attention in the mathematics education literature, perhaps because the field has no…

Descriptors: Mathematics, Professional Personnel, Undergraduate Students, Mathematics Activities

Grading Exams Using Large Language Models: A Comparison between Human and AI Grading of Exams in Higher Education Using ChatGPT

Peer reviewed

Direct link

Jonas Flodén – British Educational Research Journal, 2025

This study compares how the generative AI (GenAI) large language model (LLM) ChatGPT performs in grading university exams compared to human teachers. Aspects investigated include consistency, large discrepancies and length of answer. Implications for higher education, including the role of teachers and ethics, are also discussed. Three…

Descriptors: College Faculty, Artificial Intelligence, Comparative Testing, Scoring

Validity and Reliability of Cognitive Constructivism-Oriented Teaching Conception Questionnaire

Peer reviewed

Direct link

Duong Thi Ngoc Ngan; Maria Hercz – Asia-Pacific Education Researcher, 2024

As there is a paucity of instrument investigating a hybrid teaching conception, the current study is seen as part of attempt to fill this gap. The subjects in the study were 310 University participants--instructors in Socialist Republic of Viet Nam (Vietnam). The survey was implemented with the use of Cognitive Constructivism-oriented Teaching…

Descriptors: Blended Learning, Faculty, Teaching Methods, Foreign Countries

Inter-Rater Reliability in Comprehensive Examination Scoring: The Case for Consistent and Collaborative Rater Training and Calibration

Download full text

Saenz, David Arron – Online Submission, 2023

There is a vast body of literature documenting the positive impacts that rater training and calibration sessions have on inter-rater reliability as research indicates several factors including frequency and timing play crucial roles towards ensuring inter-rater reliability. Additionally, increasing amounts research indicate possible links in…

Descriptors: Interrater Reliability, Scoring, Training, Scoring Rubrics

Using Automated Procedures to Score Educational Essays Written in Three Languages

Peer reviewed

Direct link

Tahereh Firoozi; Hamid Mohammadi; Mark J. Gierl – Journal of Educational Measurement, 2025

The purpose of this study is to describe and evaluate a multilingual automated essay scoring (AES) system for grading essays in three languages. Two different sentence embedding models were evaluated within the AES system, multilingual BERT (mBERT) and language-agnostic BERT sentence embedding (LaBSE). German, Italian, and Czech essays were…

Descriptors: College Students, Slavic Languages, German, Italian

Engaging Classroom Observation: A Brief Measure of Active Learning in the College Classroom

Peer reviewed

Direct link

Chase Young; Benjamin Mitchell-Yellin; George Kevin Randall – Active Learning in Higher Education, 2025

The purpose of this study was to develop a valid, reliable, and brief measure of active learning in college classrooms that is cheap and easy to complete and yields results that faculty can easily use to inform their development as instructors. Initial construct and face validity was achieved by modifying existing instruments and creating a draft…

Descriptors: College Faculty, College Students, Active Learning, Classroom Observation Techniques

Same Grade for Different Reasons, Different Grades for the Same Reason?

Peer reviewed

Direct link

Ilona Rinne – Assessment & Evaluation in Higher Education, 2024

It is widely acknowledged in research that common criteria and aligned standards do not result in consistent assessment of such a complex performance as the final undergraduate thesis. Assessment is determined by examiners' understanding of rubrics and their views on thesis quality. There is still a gap in the research literature about how…

Descriptors: Foreign Countries, Undergraduate Students, Teacher Education Programs, Evaluation Criteria

The Reliability of Using ChatGPT in Rating EFL Writings

Peer reviewed
PDF on ERIC

Download full text

Yang Yang – Shanlax International Journal of Education, 2024

This paper explores the reliability of using ChatGPT in evaluating EFL writing by assessing its intra- and inter-rater reliability. Eighty-two compositions were randomly sampled from the Written English Corpus of Chinese Learners. These compositions were rated by three experienced raters with regard to 'language', 'content', and 'organization'.…

Descriptors: English (Second Language), Second Language Instruction, Writing (Composition), Evaluation Methods

All Types of Experience Are Equal, but Some Are More Equal: The Effect of Different Types of Experience on Rater Severity and Rater Consistency

Peer reviewed

Direct link

Reeta Neittaanmäki; Iasonas Lamprianou – Language Testing, 2024

This article focuses on rater severity and consistency and their relation to different types of rater experience over a long period of time. The article is based on longitudinal data collected from 2009 to 2019 from the second language Finnish speaking subtest in the National Certificates of Language Proficiency in Finland. The study investigated…

Descriptors: Foreign Countries, Interrater Reliability, Error of Measurement, Experience

Agency, Advocacy, Positionality: Bringing an Equity Mindset to Higher Education Assessment

Peer reviewed

Direct link

Beth K. Janetski; Mary K. Thompson – Assessment Update, 2025

The Grand Challenges Project supports global collaborations that inform equitable practices for assessment practitioners in higher education while identifying evidence-informed solutions. One goal named in the project is to leverage assessment findings to increase equity in higher education. This article focuses on the results from the Equity…

Descriptors: Higher Education, Educational Assessment, Global Approach, International Cooperation

The Bank Robbery: A Behavioral Observation Exercise for Enhancing Understanding of Reliability

Peer reviewed

Direct link

Strelan, Peter – Teaching of Psychology, 2022

Background: The concept of reliability is central to conducting--and understanding--research in Psychology. Students' understanding of concepts are strengthened when they learn by applying concepts. Objective: This article describes initial evidence of an activity for teaching reliability. Method: Students watched a short video of a staged bank…

Descriptors: Learning Activities, Psychology, Recall (Psychology), Crime

Development of Assessment Instrument Business Proposal for Students' Systems Thinking Skills on Business Model Canvas in Bioentrepreneurship Course

Peer reviewed
PDF on ERIC

Download full text

Lulu Desia Mutiani Rahmayuni; Siti Sriyatib; Diah Kusumawaty – Journal of Biological Education Indonesia (Jurnal Pendidikan Biologi Indonesia), 2024

Business Model Canvas (BMC) is a business model that must be mastered by students in the Bioentrepreneurship course as an initial provision for entering the entrepreneurial world, while in compiling Business Model Canvas (BMC) systematic thinking skills are needed. This study aims to provide an assessment instrument to measure students' system…

Descriptors: Systems Approach, Thinking Skills, Models, Business Administration

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 28

ProQuest LLC	21
Assessment & Evaluation in…	15
Advances in Health Sciences…	11
ETS Research Report Series	11
Online Submission	8
Language Assessment Quarterly	7
Creativity Research Journal	6
English Language Teaching	6
Language Testing	5
Research & Practice in…	5
Assessment in Education:…	4
CBE - Life Sciences Education	4
Eurasian Journal of…	4
Grantee Submission	4
International Education…	4
Journal of Baltic Science…	4
Journal of Education for…	4
Journal of Educational…	4
Physical Review Physics…	4
Studies in Higher Education	4
Teaching in Higher Education	4
Advances in Physiology…	3
American Journal of Distance…	3
Assessment Update	3
International Journal of…	3
More ▼

Attali, Yigal	3
Sata, Mehmet	3
Unal, Zafer	3
Ahmadi, Alireza	2
Bodur, Yasar	2
Clariana, Roy B.	2
Crossley, Scott A.	2
Elder, Catherine	2
Elliot, Norbert	2
Güler, Nese	2
Helms, Marilyn M.	2
Incikabi, Lutfi	2
Karakaya, Ismail	2
McNamara, Danielle S.	2
Oriogun, Peter K.	2
Park, Yoon Soo	2
Prevost, Luanna B.	2
Scharf, Davida	2
Steier, Michael	2
Tsai, Chin-Chung	2
Unal, Aslihan	2
Wendler, Cathy	2
Yudkowsky, Rachel	2
Zayac, Ryan M.	2
A. C., John	1
More ▼