Publication Date
In 2025 | 24 |
Since 2024 | 96 |
Since 2021 (last 5 years) | 377 |
Since 2016 (last 10 years) | 878 |
Since 2006 (last 20 years) | 1799 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
Researchers | 86 |
Practitioners | 63 |
Administrators | 34 |
Teachers | 24 |
Policymakers | 23 |
Community | 5 |
Media Staff | 5 |
Support Staff | 5 |
Counselors | 2 |
Parents | 2 |
Students | 2 |
More ▼ |
Location
Australia | 64 |
United Kingdom | 57 |
Canada | 53 |
China | 40 |
United States | 39 |
California | 37 |
United Kingdom (England) | 34 |
Texas | 32 |
Turkey | 27 |
Japan | 26 |
Florida | 22 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
McGhee, Emily; Masterson, Jackie – Support for Learning, 2022
The purpose of this study was to investigate Access Arrangements (AA) which allow students with specific learning difficulties (SpLDs), disabilities or medical needs to access assessments by making "reasonable" adjustments, as established by the Equality Act 2010. A review of the literature revealed some problematic issues relating to…
Descriptors: Access to Education, Students with Disabilities, Learning Disabilities, Health Needs
Wind, Stefanie A.; Guo, Wenjing – Educational and Psychological Measurement, 2019
Rater effects, or raters' tendencies to assign ratings to performances that are different from the ratings that the performances warranted, are well documented in rater-mediated assessments across a variety of disciplines. In many real-data studies of rater effects, researchers have reported that raters exhibit more than one effect, such as a…
Descriptors: Evaluators, Bias, Scoring, Data Collection
Yuichiro Yokouchi – Language Testing in Asia, 2025
The performance decision tree (PDT; Fulcher et al., 2011) is a rubric style that is applicable to performance assessment, with origins in Upshur and Turner's (1995) empirically derived binary-choice, boundary-definition (EBB) scale. It is easier for raters to assess performance by evaluating multiple binary-choice descriptors. Additionally,…
Descriptors: Scoring Rubrics, Second Language Learning, Second Language Instruction, Language Teachers
Brogan L. Barr; Virginia V. W. McIntosh; Eileen F. Britt; Jennifer Jordan; Janet D. Carter – Measurement: Interdisciplinary Research and Perspectives, 2024
Even when raters demonstrate agreement in the use of a measure, limited score variability or violation of often-ignored statistical assumptions can result in lower reliability estimates than intuitively expected. This article uses data drawn from two randomized controlled trials of schema therapy and cognitive behavioral therapy for the treatment…
Descriptors: Evaluators, Interrater Reliability, Reliability, Measurement Techniques
Kettil Nordesjö – American Journal of Evaluation, 2024
The relationship between "stability" and "change" is a central paradox of administration that pervades all forms of organizing. Evaluation is not unfamiliar with paradoxical objectives and roles, which can result in tensions for evaluators and stakeholders. In this article, paradoxes between stability and change in the…
Descriptors: Logical Thinking, Philosophy, Evaluation, Social Capital
Jordon J. Beasley – Professional School Counseling, 2024
Given the ever-evolving role of school counselors and increasing demands placed on them in today's sociopolitical climate, proficiency in program evaluation is more important than ever as school counselors advocate for their role and for comprehensive school counseling program delivery in schools. This article presents a reexamination of the…
Descriptors: Professional Identity, Counselor Training, Decision Making, Self Efficacy
Deck, Sarah L.; Paterson, Helen M. – Applied Cognitive Psychology, 2020
Recurring forms of abuse like domestic violence are unfortunately common. When an individual makes an allegation about their experience, however, there is rarely additional evidence to corroborate their claim. The veracity of the allegation is thus likely to be a central concern in subsequent proceedings. This experiment explored evaluator's…
Descriptors: Recall (Psychology), Ethics, Family Violence, Disclosure
Bejar, Isaac I.; Li, Chen; McCaffrey, Daniel – Applied Measurement in Education, 2020
We evaluate the feasibility of developing predictive models of rater behavior, that is, "rater-specific" models for predicting the scores produced by a rater under operational conditions. In the present study, the dependent variable is the score assigned to essays by a rater, and the predictors are linguistic attributes of the essays…
Descriptors: Scoring, Essays, Behavior, Predictive Measurement
Tangen, Jason M.; Kent, Kirsty M.; Searston, Rachel A. – Cognitive Research: Principles and Implications, 2020
When a fingerprint is located at a crime scene, a human examiner is counted upon to manually compare this print to those stored in a database. Several experiments have now shown that these professional analysts are highly accurate, but not infallible, much like other fields that involve high-stakes decision-making. One method to offset mistakes in…
Descriptors: Crime, Identification, Human Body, Evaluators
Crossley, Scott; Wan, Qian; Allen, Laura; McNamara, Danielle – Reading and Writing: An Interdisciplinary Journal, 2023
Synthesis writing is widely taught across domains and serves as an important means of assessing writing ability, text comprehension, and content learning. Synthesis writing differs from other types of writing in terms of both cognitive and task demands because it requires writers to integrate information across source materials. However, little is…
Descriptors: Writing Skills, Cognitive Processes, Essays, Cues
Yan, Xun; Chuang, Ping-Lin – Language Testing, 2023
This study employed a mixed-methods approach to examine how rater performance develops during a semester-long rater certification program for an English as a Second Language (ESL) writing placement test at a large US university. From 2016 to 2018, we tracked three groups of novice raters (n = 30) across four rounds in the certification program.…
Descriptors: Evaluators, Interrater Reliability, Item Response Theory, Certification
Chan, Kinnie Kin Yee; Bond, Trevor; Yan, Zi – Language Testing, 2023
We investigated the relationship between the scores assigned by an Automated Essay Scoring (AES) system, the Intelligent Essay Assessor (IEA), and grades allocated by trained, professional human raters to English essay writing by instigating two procedures novel to written-language assessment: the logistic transformation of AES raw scores into…
Descriptors: Computer Assisted Testing, Essays, Scoring, Scores
Kevin Hirschi; Okim Kang – Language Teaching Research Quarterly, 2023
This paper extends the use of Generalizability Theory to the measurement of extemporaneous L2 speech through the lens of speech perception. Using six datasets of previous studies, it reports on "G studies"--a method of breaking down measurement variance--and "D studies"--a predictive study of the impact on reliability when…
Descriptors: Evaluators, Generalization, Evaluation Methods, Speech Communication
Rachael Lindberg; Pavel Trofimovich – Canadian Journal of Applied Linguistics / Revue canadienne de linguistique appliquée, 2023
According to expectation violation theory, job applicants can be upgraded or downgraded during an interview when their accent does not match employers' speech expectations. Focusing on the employment of second language French job candidates in Québec, this study explored this issue dynamically in terms of how expectations may impact the trajectory…
Descriptors: French, Pronunciation, Second Language Learning, Service Occupations
Ramos, Jorge E.; Shea, Christine – Hispania, 2023
In this study we show that the perception of lateral variants by Puerto Rican listeners changes according to who the listener believes is speaking. Puerto Rican listeners heard sentences with target words featuring either rhotic [voiced alveolar tap or flap] or lateral [l] (amo[voiced alveolar tap or flap] -- amo[l]) codas, a sociophonetic…
Descriptors: Race, Racism, Puerto Ricans, Language Variation