Cristina Menescardi; Aida Carballo-Fazanes; Núria Ortega-Benavent; Isaac Estevan – Journal of Motor Learning and Development, 2024
The Canadian Agility and Movement Skill Assessment (CAMSA) is a valid and reliable circuit-based test of motor competence which can be used to assess children's skills in a live or recorded performance and then coded. We aimed to analyze the intrarater reliability of the CAMSA scores (total, time, and skill score) and time measured, by comparing…
Descriptors: Interrater Reliability, Evaluators, Scoring, Psychomotor Skills
Wind, Stefanie A. – Measurement: Interdisciplinary Research and Perspectives, 2022
In many performance assessments, one or two raters from the complete rater pool score each performance, resulting in a sparse rating design, where there are limited observations of each rater relative to the complete sample of students. Although sparse rating designs can be constructed to facilitate estimation of student achievement, the…
Descriptors: Evaluators, Bias, Identification, Performance Based Assessment
Whalen, Kate; Paez, Antonio – Journal of Geography, 2022
Experiential education partnered with guided reflection is thought to support students with higher-order thinking skills. In this study, 44 reflections from two university-level sustainability courses were compared. In both courses students were asked to write a reflection, but only one course used the Reflective Learning Framework (RLF). Tests of…
Descriptors: Geography Instruction, Thinking Skills, Experiential Learning, Sustainability
Ahmadi, Alireza – Taiwan Journal of TESOL, 2020
Rater subjectivity has long been an intriguing topic. The use of discussion as a resolution method is a practical way to reduce this subjectivity. However, the efficacy of discussion depends on whether different raters get equally engaged in it or one rater tends to dominate others. This study investigated whether and how rater dominance occurs in…
Descriptors: Evaluators, Interrater Reliability, Discussion, Discourse Analysis
Kovalkov, Anastasia; Paassen, Benjamin; Segal, Avi; Gal, Kobi; Pinkwart, Niels – International Educational Data Mining Society, 2021
Promoting creativity is considered an important goal of education, but creativity is notoriously hard to define and measure. In this paper, we make the journey from defining a formal measure of creativity to applying the measure in a practical domain. The measure relies on core theoretical concepts in creativity theory, namely fluency, flexibility, and…
Descriptors: Creativity, Theory Practice Relationship, Evaluators, Specialists
Han, Qie – Working Papers in TESOL & Applied Linguistics, 2016
This literature review attempts to survey representative studies within the context of L2 speaking assessment that have contributed to the conceptualization of rater cognition. Two types of studies are looked at: 1) studies that examine "how" raters differ (and sometimes agree) in their cognitive processes and rating behaviors, in terms…
Descriptors: Second Language Learning, Student Evaluation, Evaluators, Speech Tests
Tam, Cheung On – International Journal of Art & Design Education, 2018
This article reports on the development and validation of a rubric for assessing students' written responses to artworks. Since the implementation of the Hong Kong New Senior Secondary Curriculum in 2009, art educators have seen responding to artworks as increasingly important. In this context, the Art Criticism Assessment Rubric (ACAR) was…
Descriptors: Foreign Countries, Art Education, Art Appreciation, Student Evaluation
Robins, Claire – International Journal of Art & Design Education, 2016
This article draws on recent research from the Pre-Degree Summative Assessment in Art Design and Media Study, conducted at UCL Institute of Education, which found that pre-degree art and design qualifications at levels 3 and 4 vary greatly in their appropriateness as a preparation for degree level study in art subjects. Central to the article are…
Descriptors: Sustainability, Art Education, Design, Student Evaluation
Huang, Lan-fen; Kubelec, Simon; Keng, Nicole; Hsu, Lung-hsun – Language Testing in Asia, 2018
Background: Although teachers of English are required to assess students' speaking proficiency in the Common European Framework of Reference for Languages (CEFR), their ability to rate is seldom evaluated. The application of descriptors in the assessment of English speaking on CEFR in the context of English as a foreign language has not often been…
Descriptors: Evaluators, Second Language Learning, Second Language Instruction, English (Second Language)
Negishi, Junko – Journal of Pan-Pacific Association of Applied Linguistics, 2015
The study considers the assessment of L2 English learners by trained raters in paired and group oral assessments in comparison to an individual, monologue assessment, to determine 1) the degree to which raters assign pairs/groups shared (the same) scores and the degree to which raters give individual members of pairs/groups higher or lower as…
Descriptors: Evaluators, English (Second Language), Second Language Learning, Scores
Wind, Stefanie A.; Engelhard, George, Jr.; Wesolowski, Brian – Educational Assessment, 2016
When good model-data fit is observed, the Many-Facet Rasch (MFR) model acts as a linking and equating model that can be used to estimate student achievement, item difficulties, and rater severity on the same linear continuum. Given sufficient connectivity among the facets, the MFR model provides estimates of student achievement that are equated to…
Descriptors: Evaluators, Interrater Reliability, Academic Achievement, Music Education
Lehan, Tara; Hussey, Heather; Mika, Eva – Journal of University Teaching and Learning Practice, 2016
Throughout the dissertation process, the chair and committee members provide feedback regarding quality to help the doctoral candidate to produce the highest-quality document and become an independent scholar. Nevertheless, results of previous research suggest that overall dissertation quality generally is poor. Because much of the feedback about…
Descriptors: Graduate Students, Doctoral Dissertations, Student Evaluation, Feedback (Response)
Gustafsson, Jan-Eric; Erickson, Gudrun – Educational Assessment, Evaluation and Accountability, 2013
In the Swedish educational system, teachers have the dual responsibility of assigning final grades and marking their own students' national tests. The Government has mandated the Swedish Schools Inspectorate to re-mark samples of the national tests to see if teacher marking can be trusted. Reports from this project have concluded that intermarker…
Descriptors: Logical Thinking, Student Evaluation, Inferences, Trust (Psychology)
Hijikata-Someya, Yuko; Ono, Masumi; Yamanishi, Hiroyuki – English Language Teaching, 2015
Although the importance of summary writing is well documented in prior studies, few have investigated the evaluation of written summaries. Due to the complex nature of L2 summary writing, which requires one to read the original material and summarize its content in the L2, raters often emphasize different features when judging the quality of L2…
Descriptors: Foreign Countries, English (Second Language), Second Language Instruction, Second Language Learning
Matsugu, Sawako – ProQuest LLC, 2013
Understanding the sources of variance in speaking assessment is important in Japan where society's high demand for English speaking skills is growing. Three challenges threaten fair assessment of speaking. First, in Japanese university speaking courses, teachers are typically the only raters, but teachers' knowledge of their students may unfairly…
Descriptors: Foreign Countries, Oral Language, English (Second Language), Second Language Learning