Kinnear, George; Bennett, Max; Binnie, Rachel; Bolt, Róisín; Zheng, Yinglan – Teaching Mathematics and Its Applications, 2020
The MATH taxonomy classifies questions according to the mathematical skills required to answer them. It was created to aid the development of more balanced assessments in undergraduate mathematics and has since been used to compare different assessment regimes across school and university. To date, there has been no systematic investigation of the…
Descriptors: Taxonomy, Mathematics Instruction, Teaching Methods, Reliability
Swapna Haresh Teckwani; Amanda Huee-Ping Wong; Nathasha Vihangi Luke; Ivan Cherh Chiet Low – Advances in Physiology Education, 2024
The advent of artificial intelligence (AI), particularly large language models (LLMs) like ChatGPT and Gemini, has significantly impacted the educational landscape, offering unique opportunities for learning and assessment. In the realm of written assessment grading, traditionally viewed as a laborious and subjective process, this study sought to…
Descriptors: Accuracy, Reliability, Computational Linguistics, Standards
Jönsson, Anders; Balan, Andreia – Practical Assessment, Research & Evaluation, 2018
Research on teachers' grading has shown that there is great variability among teachers regarding both the process and product of grading, resulting in low comparability and issues of inequality when using grades for selection purposes. Despite this situation, not much is known about the merits or disadvantages of different models for grading. In…
Descriptors: Grading, Models, Reliability, Validity
The AI Teacher Test: Measuring the Pedagogical Ability of Blender and GPT-3 in Educational Dialogues
Tack, Anaïs; Piech, Chris – International Educational Data Mining Society, 2022
How can we test whether state-of-the-art generative models, such as Blender and GPT-3, are good AI teachers, capable of replying to a student in an educational dialogue? Designing an AI teacher test is challenging: although evaluation methods are much-needed, there is no off-the-shelf solution to measuring pedagogical ability. This paper reports…
Descriptors: Artificial Intelligence, Dialogs (Language), Bayesian Statistics, Decision Making
Lanah Stafford; Erin Cousins; Linda Bol; Megan Mize – Research & Practice in Assessment, 2023
Integrative learning is an important outcome for graduates of higher education. Therefore, it should be well-defined and assessed reliably. The American Association of Colleges & Universities has developed a rubric to define and assess integrative learning, but it has low reliability. This pilot study examines whether this rubric's reliability…
Descriptors: Scoring Rubrics, Reliability, Evaluation Methods, Faculty Development
Saluja, Ronak; Cheng, Sierra; delos Santos, Keemo Althea; Chan, Kelvin K. W. – Research Synthesis Methods, 2019
Objective: Various statistical methods have been developed to estimate hazard ratios (HRs) from published Kaplan-Meier (KM) curves for the purpose of performing meta-analyses. The objective of this study was to determine the reliability, accuracy, and precision of four commonly used methods by Guyot, Williamson, Parmar, and Hoyle and Henley.…
Descriptors: Meta Analysis, Reliability, Accuracy, Randomized Controlled Trials
van der Scheer, Emmelien A.; Bijlsma, Hannah J. E.; Glas, Cees A. W. – School Effectiveness and School Improvement, 2019
A Bayesian IRT-model approach was used to investigate the validity and reliability of student perceptions of teaching quality. Furthermore, the student perceptions were compared with ratings of teaching quality by external observers. Grade 4 students (n = 675) filled out a questionnaire that was used to measure their opinions about the lessons of…
Descriptors: Student Attitudes, Validity, Interrater Reliability, Correlation
Hestenes, Linda L.; Rucker, Lia; Wang, Yudan Chen; Mims, Sharon U.; Hestenes, Stephen E.; Cassidy, Deborah J. – Early Education and Development, 2019
Research Findings: The present study provides an initial descriptive comparison of the Early Childhood Environment Rating Scale-Revised (ECERS-R) and the Early Childhood Environment Rating Scale-Third Edition (ECERS-3) in a relatively large sample in 1 state that uses the Environment Rating Scales within its Quality Rating and Improvement System…
Descriptors: Comparative Analysis, Educational Quality, Rating Scales, Early Childhood Education
Lehan, Tara; Hussey, Heather; Mika, Eva – Journal of University Teaching and Learning Practice, 2016
Throughout the dissertation process, the chair and committee members provide feedback regarding quality to help the doctoral candidate to produce the highest-quality document and become an independent scholar. Nevertheless, results of previous research suggest that overall dissertation quality generally is poor. Because much of the feedback about…
Descriptors: Graduate Students, Doctoral Dissertations, Student Evaluation, Feedback (Response)
Ho, Andrew D.; Kane, Thomas J. – Bill & Melinda Gates Foundation, 2013
For many teachers, the classroom observation has been the only opportunity to receive direct feedback from another school professional. As such, it is an indispensable part of every teacher evaluation system. Yet it also requires a major time commitment from teachers, principals, and peer observers. To justify the investment of time and resources,…
Descriptors: Observation, Teacher Evaluation, Accuracy, Reliability
Shubert, Christopher W.; Meredith, Dawn C. – Physical Review Special Topics - Physics Education Research, 2015
Students' epistemologies affect how and what they learn: do they believe physics is a list of equations, or a coherent and sensible description of the physical world? In order to study these epistemologies as part of curricular assessment, we adopt the resources framework, which posits that students have many productive epistemological resources…
Descriptors: Epistemology, Recall (Psychology), Physics, Educational Environment
Rapp, John T.; Carroll, Regina A.; Stangeland, Lindsay; Swanson, Greg; Higgins, William J. – Behavior Modification, 2011
The authors evaluated the extent to which interobserver agreement (IOA) scores, using the block-by-block method for events scored with continuous duration recording (CDR), were higher when the data from the same sessions were converted to discontinuous methods. Sessions with IOA scores of 89% or less with CDR were rescored using 10-s partial…
Descriptors: Intervals, Sampling, Comparative Analysis, Measures (Individuals)
Kurt, Mehmet; Bilginer, Hayriye – Online Submission, 2016
In the globalizing world economy, the realization of international trade is increasing the need for foreign language learning. The TR63 region (Kahramanmaras, Osmaniye and Hatay) is increasing its exports every day. Alongside these advances, interest in foreign language education is awakening in the region. In preparing a syllabus for this kind of…
Descriptors: Cognitive Style, Second Language Learning, Foreign Countries, Geographic Regions
Gustilo, Leah E. – Online Submission, 2016
The present study aimed at characterizing what skilled or more proficient ESL college writing is in the Philippine setting through a contrastive analysis of three groups of variables identified from previous studies: resources, processes, and performance of ESL writers. Based on Chenoweth and Hayes' (2001; 2003) framework, the resource level…
Descriptors: Language Proficiency, English (Second Language), Second Language Learning, Foreign Countries
Marinus, Eva; Kohnen, Saskia; McArthur, Genevieve – Australian Journal of Learning Difficulties, 2013
This paper reports provisional Australian comparison data and scoring instructions for the "Test of Word Reading Efficiency" (TOWRE). The TOWRE is a popular reading fluency test used in reading research, classroom assessment and clinical practice. Approximate "norms" were obtained from children attending four primary schools in…
Descriptors: Foreign Countries, Reading Fluency, Comparative Analysis, Data