Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 20 |
Since 2006 (last 20 years) | 49 |
Descriptor
Comparative Analysis | 53 |
Interrater Reliability | 53 |
Statistical Analysis | 53 |
Foreign Countries | 21 |
Correlation | 19 |
Second Language Learning | 11 |
Pretests Posttests | 10 |
Measures (Individuals) | 9 |
Second Language Instruction | 9 |
English (Second Language) | 8 |
Teaching Methods | 8 |
More ▼ |
Source
Author
Beach, Kristen D. | 2 |
Bocian, Kathleen M. | 2 |
Coniam, David | 2 |
O'Connor, Rollanda E. | 2 |
Abbott, Robert | 1 |
Adamson, Katie Anne | 1 |
Ahmadi, Alireza | 1 |
Ahour, Touran | 1 |
Alhaisoni, Eid | 1 |
Alkahtani, Saif F. | 1 |
Alsma, Jelmer | 1 |
More ▼ |
Publication Type
Journal Articles | 47 |
Reports - Research | 44 |
Tests/Questionnaires | 6 |
Reports - Evaluative | 5 |
Dissertations/Theses -… | 4 |
Information Analyses | 1 |
Education Level
Higher Education | 22 |
Postsecondary Education | 20 |
Secondary Education | 8 |
Elementary Education | 3 |
Elementary Secondary Education | 3 |
High Schools | 3 |
Middle Schools | 3 |
Grade 8 | 2 |
Junior High Schools | 2 |
Grade 1 | 1 |
Grade 10 | 1 |
More ▼ |
Audience
Location
Iran | 4 |
Netherlands | 3 |
Germany | 2 |
Saudi Arabia | 2 |
Asia | 1 |
Belgium | 1 |
Finland | 1 |
Greece | 1 |
Hong Kong | 1 |
Japan | 1 |
Jordan | 1 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
Woodcock Johnson Tests of… | 2 |
Autism Diagnostic Observation… | 1 |
Dynamic Indicators of Basic… | 1 |
Multifactor Leadership… | 1 |
Obsessive Compulsive Scale | 1 |
Test of English as a Foreign… | 1 |
What Works Clearinghouse Rating
Does not meet standards | 1 |
De Raadt, Alexandra; Warrens, Matthijs J.; Bosker, Roel J.; Kiers, Henk A. L. – Educational and Psychological Measurement, 2019
Cohen's kappa coefficient is commonly used for assessing agreement between classifications of two raters on a nominal scale. Three variants of Cohen's kappa that can handle missing data are presented. Data are considered missing if one or both ratings of a unit are missing. We study how well the variants estimate the kappa value for complete data…
Descriptors: Interrater Reliability, Data, Statistical Analysis, Statistical Bias
Benton, Tom; Leech, Tony; Hughes, Sarah – Cambridge Assessment, 2020
In the context of examinations, the phrase "maintaining standards" usually refers to any activity designed to ensure that it is no easier (or harder) to achieve a given grade in one year than in another. Specifically, it tends to mean activities associated with setting examination grade boundaries. Benton et al (2020) describes a method…
Descriptors: Mathematics Tests, Equated Scores, Comparative Analysis, Difficulty Level
Saluja, Ronak; Cheng, Sierra; delos Santos, Keemo Althea; Chan, Kelvin K. W. – Research Synthesis Methods, 2019
Objective: Various statistical methods have been developed to estimate hazard ratios (HRs) from published Kaplan-Meier (KM) curves for the purpose of performing meta-analyses. The objective of this study was to determine the reliability, accuracy, and precision of four commonly used methods by Guyot, Williamson, Parmar, and Hoyle and Henley.…
Descriptors: Meta Analysis, Reliability, Accuracy, Randomized Controlled Trials
Kieftenbeld, Vincent; Boyer, Michelle – Applied Measurement in Education, 2017
Automated scoring systems are typically evaluated by comparing the performance of a single automated rater item-by-item to human raters. This presents a challenge when the performance of multiple raters needs to be compared across multiple items. Rankings could depend on specifics of the ranking procedure; observed differences could be due to…
Descriptors: Automation, Scoring, Comparative Analysis, Test Items
Yun, Jiyeo – ProQuest LLC, 2017
Since researchers investigated automatic scoring systems in writing assessments, they have dealt with relationships between human and machine scoring, and then have suggested evaluation criteria for inter-rater agreement. The main purpose of my study is to investigate the magnitudes of and relationships among indices for inter-rater agreement used…
Descriptors: Interrater Reliability, Essays, Scoring, Evaluators
Morris, Darrell; Pennell, Ashley M.; Perney, Jan; Trathen, Woodrow – Reading Psychology, 2018
This study compared reading rate to reading fluency (as measured by a rating scale). After listening to first graders read short passages, we assigned an overall fluency rating (low, average, or high) to each reading. We then used predictive discriminant analyses to determine which of five measures--accuracy, rate (objective); accuracy, phrasing,…
Descriptors: Reading Fluency, Prediction, Grade 1, Elementary School Students
Thawabieh, Ahmad M. – Journal of Curriculum and Teaching, 2017
This study aimed to compare between the students' self-assessment and teachers' assessment. The study sample consisted of 71 students at Tafila Technical University studying Introduction to Psychology course. The researcher used 2 students' self-assessment tools and 2 tests. The results indicated that students can assess themselves accurately if…
Descriptors: Comparative Analysis, Self Evaluation (Individuals), Student Evaluation, Psychology
Cook, Bryan G.; Buysse, Virginia; Klingner, Janette; Landrum, Timothy J.; McWilliam, R. A.; Tankersley, Melody; Test, David W. – Remedial and Special Education, 2015
As an initial step toward improving the outcomes of learners with disabilities, special educators have formulated guidelines for identifying evidence-based practices. We describe the Council of Exceptional Children's new set of standards for identifying evidence-based practices in special education and how they (a) were systematically vetted by…
Descriptors: Classification, Special Education, Educational Practices, Educational Researchers
Wang, Ning; Wilhite, Stephen; Martino, Daniel – Educational Management Administration & Leadership, 2016
This study examined the possible relationship between emotional competence and transformational leadership in K-12 school leaders as a function of self-other agreement. The study found that, for those school leaders whose self-assessment of their leadership agreed with that of their subordinates, the self-ratings of emotional competence were…
Descriptors: Interpersonal Competence, Emotional Intelligence, Transformational Leadership, Elementary Secondary Education
Lehan, Tara; Hussey, Heather; Mika, Eva – Journal of University Teaching and Learning Practice, 2016
Throughout the dissertation process, the chair and committee members provide feedback regarding quality to help the doctoral candidate to produce the highest-quality document and become an independent scholar. Nevertheless, results of previous research suggest that overall dissertation quality generally is poor. Because much of the feedback about…
Descriptors: Graduate Students, Doctoral Dissertations, Student Evaluation, Feedback (Response)
Miller, M. Elizabeth; Kwon, Sockju – Journal of Child Nutrition & Management, 2015
Purpose/Objectives: The purpose of this study was to explore milk and yogurt selection among students participating in a School Breakfast Program. Methods: Researchers observed breakfast selection of milk, juice and yogurt in six elementary and four secondary schools. Data were analyzed using descriptive statistics and logistic regression to…
Descriptors: Breakfast Programs, Food, Decision Making, Secondary Schools
Kokkinaki, Theano; Pratikaki, Anastasia – Early Child Development and Care, 2014
Primary objective: Research has provided evidence of the intersubjective function of imitation in grandparent-infant interaction based on the basic aspects of imitation. This lacks the systematic investigation of behaviour dynamics framing spontaneous imitation. The aim of this study was to compare the dyadic expressive behaviours (vocal, kinetic…
Descriptors: Grandparents, Video Technology, Infants, Imitation
Sadaf, Ayesha; Olesova, Larisa – American Journal of Distance Education, 2017
The researchers in this study examined the influence of questions designed with the Practical Inquiry Model (PIM), compared with the regular (playground) questions, on students' levels of cognitive presence in online discussions. Students' discussion postings were collected and categorized according to the four levels of cognitive presence:…
Descriptors: Graduate Students, Masters Programs, Cognitive Processes, Web Based Instruction
Ahmadi, Alireza; Sadeghi, Elham – Language Assessment Quarterly, 2016
In the present study we investigated the effect of test format on oral performance in terms of test scores and discourse features (accuracy, fluency, and complexity). Moreover, we explored how the scores obtained on different test formats relate to such features. To this end, 23 Iranian EFL learners participated in three test formats of monologue,…
Descriptors: Oral Language, Comparative Analysis, Language Fluency, Accuracy
Robertson, Clare; Ramsay, Craig; Gurung, Tara; Mowatt, Graham; Pickard, Robert; Sharma, Pawana – Research Synthesis Methods, 2014
We describe our experience of using a modified version of the Cochrane risk of bias (RoB) tool for randomised and non-randomised comparative studies. Objectives: (1) To assess time to complete RoB assessment; (2) To assess inter-rater agreement; and (3) To explore the association between RoB and treatment effect size. Methods: Cochrane risk of…
Descriptors: Risk, Randomized Controlled Trials, Research Design, Comparative Analysis