Publication Date
In 2025 | 5 |
Since 2024 | 12 |
Since 2021 (last 5 years) | 40 |
Since 2016 (last 10 years) | 85 |
Since 2006 (last 20 years) | 196 |
Descriptor
Interrater Reliability | 303 |
Student Evaluation | 303 |
Evaluation Methods | 90 |
Foreign Countries | 77 |
Higher Education | 60 |
Scoring | 52 |
Test Reliability | 52 |
Scoring Rubrics | 48 |
Comparative Analysis | 41 |
Grading | 38 |
Test Validity | 38 |
Author
Johnson, Martin | 3 |
Littlefield, John H. | 3 |
Avery, Marybell | 2 |
Canivez, Gary L. | 2 |
Cason, Gerald J. | 2 |
Dyson, Ben | 2 |
Fisette, Jennifer L. | 2 |
Fox, Connie | 2 |
Franck, Marian | 2 |
Gilbert, Sharon L. | 2 |
Graber, Kim C. | 2 |
Audience
Researchers | 14 |
Practitioners | 7 |
Teachers | 7 |
Administrators | 4 |
Location
Australia | 12 |
United Kingdom | 11 |
Turkey | 7 |
Canada | 5 |
China | 4 |
Japan | 4 |
Netherlands | 4 |
Spain | 4 |
California | 3 |
Israel | 3 |
New York | 3 |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 2 |
Individuals with Disabilities… | 1 |
Race to the Top | 1 |
Susan K. Johnsen – Gifted Child Today, 2025
The author provides information about reliability and areas that educators should examine in determining if an assessment is consistent and trustworthy for use, and how it should be interpreted in making decisions about students. Reliability areas that are discussed in the column include internal consistency, test-retest or stability, inter-scorer…
Descriptors: Test Reliability, Academically Gifted, Student Evaluation, Error of Measurement
Constructing a Roadmap to Measure the Quality of Business Assessments Aimed at Curriculum Management
Silva, Thanuci; Santos, Regiane dos; Mallet, Débora – Journal of Education for Business, 2023
Assuring the quality of education is a concern of learning institutions. To do so, it is necessary to have assertive learning management, with consistent data on students' outcomes. This research provides associate deans and researchers with a roadmap for gathering evidence to improve the quality of open-ended assessments. Based on statistical…
Descriptors: Student Evaluation, Evaluation Methods, Business Education, Higher Education
Swapneel Thite; Jayashri Ravishankar; Inmaculada Tomeo-Reyes; Araceli Martinez Ortiz – European Journal of Engineering Education, 2024
Effectively working in an engineering workplace requires strong teamwork skills, yet the existing literature within various disciplines reveals discrepancies in evaluating these skills. This complicates the design of a generic teamwork peer evaluation tool for engineering students. This study aims to address this gap by introducing the DRIVE…
Descriptors: Scoring Rubrics, Evaluation Methods, Peer Evaluation, Teamwork
Shasha Chen; Shaohui Chi; Zuhao Wang – Journal of Baltic Science Education, 2025
Interdisciplinary thinking is critical for equipping students to apply scientific knowledge and tackle societal challenges across various disciplines, which has been recognized as a key objective of twenty-first century science education. However, research on effective interdisciplinary assessment in secondary school science education is still…
Descriptors: Thinking Skills, Interdisciplinary Approach, Science Instruction, Grade 7
Emma Healy – ProQuest LLC, 2024
The shortage of autism specialists and lack of culturally sensitive autism assessment tools are helping to perpetuate racial and ethnic disparities in autism identification and treatment. Using DisCrit as a framework, this quantitative study examined the utility of one autism assessment tool, the Social Responsiveness Scale, second edition (SRS-2)…
Descriptors: Autism Spectrum Disorders, Student Evaluation, Diagnostic Tests, Disability Identification
Alexandra Jackson; Cheryl Bodnar; Elise Barrella; Juan Cruz; Krista Kecskemety – Journal of STEM Education: Innovations and Research, 2025
Recent curricular interventions in engineering education have focused on encouraging students to develop an entrepreneurial mindset (EM) to equip them with the skills needed to generate innovative ideas and address complex global problems upon entering the workforce. Methods to evaluate these interventions have been inconsistent due to the lack of…
Descriptors: Engineering Education, Entrepreneurship, Concept Mapping, Student Evaluation
Practices in Instrument Use and Development in "Chemistry Education Research and Practice" 2010-2021
Lazenby, Katherine; Tenney, Kristin; Marcroft, Tina A.; Komperda, Regis – Chemistry Education Research and Practice, 2023
Assessment instruments that generate quantitative data on attributes (cognitive, affective, behavioral, etc.) of participants are commonly used in the chemistry education community to draw conclusions in research studies or inform practice. Recently, articles and editorials have stressed the importance of providing evidence for the…
Descriptors: Chemistry, Periodicals, Journal Articles, Science Education
Gitomer, Drew H.; Martínez, José Felipe; Battey, Dan; Hyland, Nora E. – American Educational Research Journal, 2021
The Educative Teacher Performance Assessment (edTPA) is a system of standardized portfolio assessments of teaching performance mandated for use by educator preparation programs in 18 states, and approved in 21 others, as part of initial certification for preservice teachers. Because of the high stakes involved for examinees, it is critical that…
Descriptors: Evaluation, Performance Based Assessment, Test Reliability, Test Validity
Saenz, David Arron – Online Submission, 2023
There is a vast body of literature documenting the positive impacts that rater training and calibration sessions have on inter-rater reliability, and research indicates that several factors, including frequency and timing, play crucial roles in ensuring inter-rater reliability. Additionally, an increasing amount of research indicates possible links in…
Descriptors: Interrater Reliability, Scoring, Training, Scoring Rubrics
Jaburek, Michal; Tápal, Adam; Portešová, Šárka; Pfeiffer, Steven I. – Journal of Psychoeducational Assessment, 2021
The factor structure, the concurrent validity, and test-retest reliability of the Czech translation of the Gifted Rating Scales-School Form [GRS-S; Pfeiffer, S. I., & Jarosewich, T. (2003). "GRS (gifted rating scales) - manual." Pearson] were evaluated. Ten alternative models were tested. Four models were found to exhibit acceptable…
Descriptors: Test Validity, Test Reliability, Gifted, Foreign Countries
Wenjing Guo – ProQuest LLC, 2021
Constructed response (CR) items are widely used in large-scale testing programs, including the National Assessment of Educational Progress (NAEP) and many district and state-level assessments in the United States. One unique feature of CR items is that they depend on human raters to assess the quality of examinees' work. The judgment of human…
Descriptors: National Competency Tests, Responses, Interrater Reliability, Error of Measurement
Kübra Karakaya Özyer – Journal of Educators Online, 2025
This meta-analytic study investigates the impact of online peer assessment on academic achievement in higher education. By synthesizing 20 effect sizes, we provide a comprehensive understanding of how online peer assessment influences student learning outcomes. The findings reveal a statistically significant positive effect (Hedges's g = 0.672),…
Descriptors: Electronic Learning, Peer Evaluation, Higher Education, Meta Analysis
Michelle Herridge – ProQuest LLC, 2021
Evaluation of student written work during summative assessments is an important and critical task for instructors at all educational levels. Nevertheless, few research studies exist that provide insights into how different instructors approach this task. Chemistry faculty (FIs) and graduate student instructors (GSIs) regularly engage in the…
Descriptors: Science Instruction, Chemistry, College Faculty, Teaching Assistants
Atilgan, Hakan – Eurasian Journal of Educational Research, 2019
Purpose: This study examined the generalizability and reliability of essay ratings within the framework of generalizability (G) theory. Specifically, the effect of raters on the generalizability and reliability of students' essay ratings was examined. Furthermore, variations of the generalizability and reliability coefficients with…
Descriptors: Foreign Countries, Essay Tests, Test Reliability, Interrater Reliability
Anna Kay Steadman – ProQuest LLC, 2023
The Performance Assessment and Evaluation System (PAES) is used by all major universities in the state of Utah to measure the effective teaching skills of preservice candidates as they progress through their teaching preparation program. The resulting ratings are used to make high-stakes decisions relating to course completion as well as…
Descriptors: Preservice Teachers, Student Evaluation, Teaching Skills, Elementary School Teachers