ERIC - Search Results

Showing 256 to 270 of 3,206 results

"How Do Raters Learn to Rate?" Many-Facet Rasch Modeling of Rater Performance over the Course of a Rater Certification Program

Peer reviewed

Yan, Xun; Chuang, Ping-Lin – Language Testing, 2023

This study employed a mixed-methods approach to examine how rater performance develops during a semester-long rater certification program for an English as a Second Language (ESL) writing placement test at a large US university. From 2016 to 2018, we tracked three groups of novice raters (n = 30) across four rounds in the certification program.…

Descriptors: Evaluators, Interrater Reliability, Item Response Theory, Certification

Application of an Automated Essay Scoring Engine to English Writing Assessment Using Many-Facet Rasch Measurement

Peer reviewed

Chan, Kinnie Kin Yee; Bond, Trevor; Yan, Zi – Language Testing, 2023

We investigated the relationship between the scores assigned by an Automated Essay Scoring (AES) system, the Intelligent Essay Assessor (IEA), and grades allocated by trained, professional human raters to English essay writing by instigating two procedures novel to written-language assessment: the logistic transformation of AES raw scores into…

Descriptors: Computer Assisted Testing, Essays, Scoring, Scores

How Many Raters Can Be Enough: G Theory Applied to Assessment and Measurement of L2 Speech Perception

Peer reviewed

Kevin Hirschi; Okim Kang – Language Teaching Research Quarterly, 2023

This paper extends the use of Generalizability Theory to the measurement of extemporaneous L2 speech through the lens of speech perception. Using six datasets of previous studies, it reports on "G studies"--a method of breaking down measurement variance--and "D studies"--a predictive study of the impact on reliability when…

Descriptors: Evaluators, Generalization, Evaluation Methods, Speech Communication

When Accent Does Not Match Expectations: A Dynamic Perspective of L2 Speaker Evaluations in a French Interview Context

Peer reviewed

Rachael Lindberg; Pavel Trofimovich – Canadian Journal of Applied Linguistics / Revue canadienne de linguistique appliquée, 2023

According to expectation violation theory, job applicants can be upgraded or downgraded during an interview when their accent does not match employers' speech expectations. Focusing on the employment of second language French job candidates in Québec, this study explored this issue dynamically in terms of how expectations may impact the trajectory…

Descriptors: French, Pronunciation, Second Language Learning, Service Occupations

Speaking Race or Racialized Speaking: Evidence from Perceptions of Lateral Variants by Puerto Rican Listeners

Peer reviewed

Ramos, Jorge E.; Shea, Christine – Hispania, 2023

In this study we show that the perception of lateral variants by Puerto Rican listeners changes according to who the listener believes is speaking. Puerto Rican listeners heard sentences with target words featuring either rhotic [ɾ] or lateral [l] (amo[ɾ] -- amo[l]) codas, a sociophonetic…

Descriptors: Race, Racism, Puerto Ricans, Language Variation

Exploring Potential Biases in GPT-4o's Ratings of English Language Learners' Essays

Peer reviewed

Taichi Yamashita – Language Testing, 2025

With the rapid development of generative artificial intelligence (AI) frameworks (e.g., the generative pre-trained transformer [GPT]), a growing number of researchers have started to explore its potential as an automated essay scoring (AES) system. While previous studies have investigated the alignment between human ratings and GPT ratings, few…

Descriptors: Artificial Intelligence, English (Second Language), Second Language Learning, Second Language Instruction

Liars Are Perceived as More Credible than Truth-Tellers Who Recall a Repeated Event

Peer reviewed

Deck, Sarah L.; Paterson, Helen M. – Applied Cognitive Psychology, 2020

Recurring forms of abuse like domestic violence are unfortunately common. When an individual makes an allegation about their experience, however, there is rarely additional evidence to corroborate their claim. The veracity of the allegation is thus likely to be a central concern in subsequent proceedings. This experiment explored evaluators'…

Descriptors: Recall (Psychology), Ethics, Family Violence, Disclosure

Predictive Modeling of Rater Behavior: Implications for Quality Assurance in Essay Scoring

Peer reviewed

Bejar, Isaac I.; Li, Chen; McCaffrey, Daniel – Applied Measurement in Education, 2020

We evaluate the feasibility of developing predictive models of rater behavior, that is, "rater-specific" models for predicting the scores produced by a rater under operational conditions. In the present study, the dependent variable is the score assigned to essays by a rater, and the predictors are linguistic attributes of the essays…

Descriptors: Scoring, Essays, Behavior, Predictive Measurement

Collective Intelligence in Fingerprint Analysis

Peer reviewed

Tangen, Jason M.; Kent, Kirsty M.; Searston, Rachel A. – Cognitive Research: Principles and Implications, 2020

When a fingerprint is located at a crime scene, a human examiner is counted upon to manually compare this print to those stored in a database. Several experiments have now shown that these professional analysts are highly accurate, but not infallible, much like other fields that involve high-stakes decision-making. One method to offset mistakes in…

Descriptors: Crime, Identification, Human Body, Evaluators

How Do Judges in Comparative Judgement Exercises Make Their Judgements?

Leech, Tony; Chambers, Lucy – Research Matters, 2022

Two of the central issues in comparative judgement (CJ), which are perhaps underexplored compared to questions of the method's reliability and technical quality, are "what processes do judges use to make their decisions" and "what features do they focus on when making their decisions?" This article discusses both, in the…

Descriptors: Comparative Analysis, Decision Making, Evaluators, Reliability

Critical Evaluation Capital (CEC): A New Tool for Applying Critical Race Theory to the Evaluand

Peer reviewed

Ginsberg, Alice E. – American Journal of Evaluation, 2022

This article presents a new tool called Critical Evaluation Capital (CEC) designed to address issues of equity and social justice in program evaluation. CEC is grounded in the tenets of critical race theory and inspired by Yosso's work on community cultural wealth which raises critical issues of positionality and access. CEC is a system for…

Descriptors: Critical Race Theory, Social Justice, Program Evaluation, Evaluation Methods

Using Rasch Analysis to Examine Raters' Expertise Turkish Teacher Candidates' Competency Levels in Writing Different Types of Test Items

Peer reviewed

Sayin, Ayfer; Sata, Mehmet – International Journal of Assessment Tools in Education, 2022

The aim of the present study was to examine Turkish teacher candidates' competency levels in writing different types of test items by utilizing Rasch analysis. In addition, the effect of the expertise of the raters scoring the items written by the teacher candidates was examined within the scope of the study. 84 Turkish teacher candidates…

Descriptors: Foreign Countries, Item Response Theory, Evaluators, Expertise

Enhancing the Generalizability of Impact Studies in Education. Toolkit. NCEE 2022-003

Peer reviewed

Tipton, Elizabeth; Olsen, Robert B. – National Center for Education Evaluation and Regional Assistance, 2022

This guide will help researchers design and implement impact studies in education so that the findings are more generalizable to the study's target population. Guidance is provided on key steps that researchers can take, including defining the target population, selecting a sample of schools--and replacement schools, when needed--managing school…

Descriptors: Outcome Measures, Evaluators, Educational Researchers, Educational Research

The Intersection of AI and Language Assessment: A Study on the Reliability of ChatGPT in Grading IELTS Writing Task 2

Peer reviewed

Osama Koraishi – Language Teaching Research Quarterly, 2024

This study conducts a comprehensive quantitative evaluation of OpenAI's language model, ChatGPT 4, for grading Task 2 writing of the IELTS exam. The objective is to assess the alignment between ChatGPT's grading and that of official human raters. The analysis encompassed a multifaceted approach, including a comparison of means and reliability…

Descriptors: Second Language Learning, English (Second Language), Language Tests, Artificial Intelligence

The Need for Analysts in Social Impact Measurement: How Evaluators Can Help

Peer reviewed

Ruff, Kate; Olsen, Sara – American Journal of Evaluation, 2018

The authors of this article suggest three features of a common approach to impact measurement: harness operational data, use constructs with bounded flexibility, and develop a cadre of analysts who are skilled at interpreting reports. The analysts are the most crucial of these. Evaluators are well suited to step into these roles, but it will…

Descriptors: Measurement, Evaluators, Investment, Financial Services
