NotesFAQContact Us
Collection
Advanced
Search Tips
Audience
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Showing all 11 results Save | Export
Jiyeo Yun – English Teaching, 2023
Studies on automatic scoring systems in writing assessments have also evaluated the relationship between human and machine scores for the reliability of automated essay scoring systems. This study investigated the magnitudes of indices for inter-rater agreement and discrepancy, especially regarding human and machine scoring, in writing assessment.…
Descriptors: Meta Analysis, Interrater Reliability, Essays, Scoring
Peer reviewed Peer reviewed
Direct linkDirect link
Ramlall, Suvira; Singaram, V. S.; Sommerville, T. E. – Perspectives in Education, 2019
National and institutional policies to escalate the production of doctorates have raised concerns about the quality of PhDs in South Africa. This study evaluates examiner reports of doctorates by thesis and publication in clinical medicine to ascertain the criteria that examiners used to define a successful doctoral thesis. A qualitative…
Descriptors: Doctoral Dissertations, Educational Policy, Medical Research, Foreign Countries
Peer reviewed Peer reviewed
Direct linkDirect link
Banerjee, Rashida; Movahedazarhouligh, Sara; Millen, Kaitlyn; Luckner, John L. – Topics in Early Childhood Special Education, 2018
Valid and evidence-informed practices are critical to help young children with disabilities and their families with highly effective interventions and instruction to reach their potentials. Replication research is critical for appraising research and identifying evidence-based practices. The purpose of this study was to replicate the methods used…
Descriptors: Evidence, Early Childhood Education, Special Education, Replication (Evaluation)
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Han, Qie – Working Papers in TESOL & Applied Linguistics, 2016
This literature review attempts to survey representative studies within the context of L2 speaking assessment that have contributed to the conceptualization of rater cognition. Two types of studies are looked at: 1) studies that examine "how" raters differ (and sometimes agree) in their cognitive processes and rating behaviors, in terms…
Descriptors: Second Language Learning, Student Evaluation, Evaluators, Speech Tests
Peer reviewed Peer reviewed
Direct linkDirect link
Robertson, Clare; Ramsay, Craig; Gurung, Tara; Mowatt, Graham; Pickard, Robert; Sharma, Pawana – Research Synthesis Methods, 2014
We describe our experience of using a modified version of the Cochrane risk of bias (RoB) tool for randomised and non-randomised comparative studies. Objectives: (1) To assess time to complete RoB assessment; (2) To assess inter-rater agreement; and (3) To explore the association between RoB and treatment effect size. Methods: Cochrane risk of…
Descriptors: Risk, Randomized Controlled Trials, Research Design, Comparative Analysis
Peer reviewed Peer reviewed
Direct linkDirect link
Jang, Hyewon – Journal of Science Education and Technology, 2016
Gaps between science, technology, engineering, and mathematics (STEM) education and required workplace skills have been identified in industry, academia, and government. Educators acknowledge the need to reform STEM education to better prepare students for their future careers. We pursue this growing interest in the skills needed for STEM…
Descriptors: STEM Education, Work Environment, Interrater Reliability, Engineering Education
Peer reviewed Peer reviewed
Direct linkDirect link
Van de Grift, Wim – Educational Research, 2007
Background: From 2002 onwards, initiatives and first steps for the project International Comparative Analysis of Learning and Teaching (ICALT) have been taken by the inspectorates of education in England, Flanders (Belgium), Lower Saxony (Germany) and The Netherlands. The inspectorates of education in these European countries reviewed the results…
Descriptors: Foreign Countries, Comparative Analysis, Observation, Teacher Effectiveness
Peer reviewed Peer reviewed
Janes, Joseph W.; McKinney, Renee – Library Quarterly, 1992
This study examined judgments of document relevance made by library science graduate students who were not the originators of the queries for which the documents were retrieved. Although the secondary judgments compared well with those of the original users, it was found that secondary judges used document record fields differently and had a…
Descriptors: Comparative Analysis, Higher Education, Interrater Reliability, Online Searching
Dudczak, Craig; Day, Donald – 1991
Philosophy statements have been used in the National Debate Tournament (NDT) since the mid-1970s and the Cross Examination Debate Association (CEDA) National Tournament since its 1986 inception. The statements should help debaters adapt to critics' expressed preferences. Moreover, philosophy statements can guide the study of argumentation theory…
Descriptors: Comparative Analysis, Content Analysis, Debate, Higher Education
Peer reviewed Peer reviewed
Harrison, Patti L. – Journal of Special Education, 1987
Part of a special issue on adaptive behavior, the article reviews adaptive behavior research in areas which include the relationship between adaptive behavior and intelligence and school achievement, relationship between different measures of adaptive behavior, predictive aspects, declassification, group differences in adaptive behavior,…
Descriptors: Academic Achievement, Adaptive Behavior (of Disabled), Behavior Rating Scales, Comparative Analysis
Takala, Sauli – 1998
This paper discusses recent developments in language testing. It begins with a review of the traditional criteria that are applied to all measurement and outlines recent emphases that derive from the expanding range of stakeholders. Drawing on Alderson's seminal work, criteria are presented for evaluating communicative language tests. Developments…
Descriptors: Alternative Assessment, Communicative Competence (Languages), Comparative Analysis, Evaluation Criteria