Publication Date
| In 2026 | 0 |
| Since 2025 | 63 |
| Since 2022 (last 5 years) | 329 |
| Since 2017 (last 10 years) | 827 |
| Since 2007 (last 20 years) | 1777 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Researchers | 86 |
| Practitioners | 63 |
| Administrators | 34 |
| Teachers | 25 |
| Policymakers | 23 |
| Community | 5 |
| Media Staff | 5 |
| Support Staff | 5 |
| Counselors | 2 |
| Parents | 2 |
| Students | 2 |
| More ▼ | |
Location
| Australia | 64 |
| United Kingdom | 59 |
| Canada | 54 |
| China | 40 |
| United States | 39 |
| California | 37 |
| United Kingdom (England) | 36 |
| Texas | 32 |
| Turkey | 28 |
| Japan | 26 |
| Israel | 23 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Yan, Xun; Chuang, Ping-Lin – Language Testing, 2023
This study employed a mixed-methods approach to examine how rater performance develops during a semester-long rater certification program for an English as a Second Language (ESL) writing placement test at a large US university. From 2016 to 2018, we tracked three groups of novice raters (n = 30) across four rounds in the certification program.…
Descriptors: Evaluators, Interrater Reliability, Item Response Theory, Certification
Chan, Kinnie Kin Yee; Bond, Trevor; Yan, Zi – Language Testing, 2023
We investigated the relationship between the scores assigned by an Automated Essay Scoring (AES) system, the Intelligent Essay Assessor (IEA), and grades allocated by trained, professional human raters to English essay writing by instigating two procedures novel to written-language assessment: the logistic transformation of AES raw scores into…
Descriptors: Computer Assisted Testing, Essays, Scoring, Scores
Kevin Hirschi; Okim Kang – Language Teaching Research Quarterly, 2023
This paper extends the use of Generalizability Theory to the measurement of extemporaneous L2 speech through the lens of speech perception. Using six datasets of previous studies, it reports on "G studies"--a method of breaking down measurement variance--and "D studies"--a predictive study of the impact on reliability when…
Descriptors: Evaluators, Generalization, Evaluation Methods, Speech Communication
Rachael Lindberg; Pavel Trofimovich – Canadian Journal of Applied Linguistics / Revue canadienne de linguistique appliquée, 2023
According to expectation violation theory, job applicants can be upgraded or downgraded during an interview when their accent does not match employers' speech expectations. Focusing on the employment of second language French job candidates in Québec, this study explored this issue dynamically in terms of how expectations may impact the trajectory…
Descriptors: French, Pronunciation, Second Language Learning, Service Occupations
Ramos, Jorge E.; Shea, Christine – Hispania, 2023
In this study we show that the perception of lateral variants by Puerto Rican listeners changes according to who the listener believes is speaking. Puerto Rican listeners heard sentences with target words featuring either rhotic [voiced alveolar tap or flap] or lateral [l] (amo[voiced alveolar tap or flap] -- amo[l]) codas, a sociophonetic…
Descriptors: Race, Racism, Puerto Ricans, Language Variation
Taichi Yamashita – Language Testing, 2025
With the rapid development of generative artificial intelligence (AI) frameworks (e.g., the generative pre-trained transformer [GPT]), a growing number of researchers have started to explore its potential as an automated essay scoring (AES) system. While previous studies have investigated the alignment between human ratings and GPT ratings, few…
Descriptors: Artificial Intelligence, English (Second Language), Second Language Learning, Second Language Instruction
Deck, Sarah L.; Paterson, Helen M. – Applied Cognitive Psychology, 2020
Recurring forms of abuse like domestic violence are unfortunately common. When an individual makes an allegation about their experience, however, there is rarely additional evidence to corroborate their claim. The veracity of the allegation is thus likely to be a central concern in subsequent proceedings. This experiment explored evaluator's…
Descriptors: Recall (Psychology), Ethics, Family Violence, Disclosure
Bejar, Isaac I.; Li, Chen; McCaffrey, Daniel – Applied Measurement in Education, 2020
We evaluate the feasibility of developing predictive models of rater behavior, that is, "rater-specific" models for predicting the scores produced by a rater under operational conditions. In the present study, the dependent variable is the score assigned to essays by a rater, and the predictors are linguistic attributes of the essays…
Descriptors: Scoring, Essays, Behavior, Predictive Measurement
Tangen, Jason M.; Kent, Kirsty M.; Searston, Rachel A. – Cognitive Research: Principles and Implications, 2020
When a fingerprint is located at a crime scene, a human examiner is counted upon to manually compare this print to those stored in a database. Several experiments have now shown that these professional analysts are highly accurate, but not infallible, much like other fields that involve high-stakes decision-making. One method to offset mistakes in…
Descriptors: Crime, Identification, Human Body, Evaluators
Leech, Tony; Chambers, Lucy – Research Matters, 2022
Two of the central issues in comparative judgement (CJ), which are perhaps underexplored compared to questions of the method's reliability and technical quality, are "what processes do judges use to make their decisions" and "what features do they focus on when making their decisions?" This article discusses both, in the…
Descriptors: Comparative Analysis, Decision Making, Evaluators, Reliability
Ginsberg, Alice E. – American Journal of Evaluation, 2022
This article presents a new tool called Critical Evaluation Capital (CEC) designed to address issues of equity and social justice in program evaluation. CEC is grounded in the tenants of critical race theory and inspired by Yosso's work on community cultural wealth which raises critical issues of positionality and access. CEC is a system for…
Descriptors: Critical Race Theory, Social Justice, Program Evaluation, Evaluation Methods
Sayin, Ayfer; Sata, Mehmet – International Journal of Assessment Tools in Education, 2022
The aim of the present study was to examine Turkish teacher candidates' competency levels in writing different types of test items by utilizing Rasch analysis. In addition, the effect of the expertise of the raters scoring the items written by the teacher candidates was examined within the scope of the study. 84 Turkish teacher candidates…
Descriptors: Foreign Countries, Item Response Theory, Evaluators, Expertise
Tipton, Elizabeth; Olsen, Robert B. – National Center for Education Evaluation and Regional Assistance, 2022
This guide will help researchers design and implement impact studies in education so that the findings are more generalizable to the study's target population. Guidance is provided on key steps that researchers can take, including defining the target population, selecting a sample of schools--and replacement schools, when needed--managing school…
Descriptors: Outcome Measures, Evaluators, Educational Researchers, Educational Research
Osama Koraishi – Language Teaching Research Quarterly, 2024
This study conducts a comprehensive quantitative evaluation of OpenAI's language model, ChatGPT 4, for grading Task 2 writing of the IELTS exam. The objective is to assess the alignment between ChatGPT's grading and that of official human raters. The analysis encompassed a multifaceted approach, including a comparison of means and reliability…
Descriptors: Second Language Learning, English (Second Language), Language Tests, Artificial Intelligence
Ruff, Kate; Olsen, Sara – American Journal of Evaluation, 2018
The authors of this article suggest three features of a common approach to impact measurement: harness operational data, use constructs with bounded flexibility, and develop a cadre of analysts who are skilled at interpreting reports. The analysts are the most crucial of these. Evaluators are well suited to step into these roles, but it will…
Descriptors: Measurement, Evaluators, Investment, Financial Services

Peer reviewed
Direct link
