Publication Date
| In 2026 | 0 |
| Since 2025 | 60 |
| Since 2022 (last 5 years) | 286 |
| Since 2017 (last 10 years) | 782 |
| Since 2007 (last 20 years) | 2044 |
Descriptor
| Interrater Reliability | 3126 |
| Foreign Countries | 655 |
| Test Reliability | 504 |
| Evaluation Methods | 503 |
| Test Validity | 411 |
| Correlation | 401 |
| Scoring | 347 |
| Comparative Analysis | 327 |
| Scores | 324 |
| Validity | 310 |
| Student Evaluation | 308 |
Audience
| Researchers | 130 |
| Practitioners | 42 |
| Teachers | 22 |
| Administrators | 11 |
| Counselors | 3 |
| Policymakers | 2 |
Location
| Australia | 56 |
| Turkey | 53 |
| United Kingdom | 46 |
| Canada | 45 |
| Netherlands | 40 |
| China | 38 |
| California | 37 |
| United States | 30 |
| United Kingdom (England) | 25 |
| Taiwan | 23 |
| Germany | 22 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 3 |
| Meets WWC Standards with or without Reservations | 3 |
| Does not meet standards | 3 |
Wagner, Kyle; Smith, Alex; Allen, Abigail; McMaster, Kristen; Poch, Apryl; Lembke, Erica – Assessment for Effective Intervention, 2019
Researchers and practitioners have questioned whether scoring procedures used with curriculum-based measures of writing (CBM-W) capture growth in complexity of writing. We analyzed data from six independent samples to examine two potential scoring metrics for picture word CBM-W (PW), a sentence-level CBM task. Correct word sequences per response…
Descriptors: Curriculum Based Assessment, Writing Evaluation, Comparative Analysis, Scoring
Jayashankar, Shailaja; Sridaran, R. – Education and Information Technologies, 2017
Teachers face an abundance of free-text answers that are daunting to read and evaluate. Automatic assessment of open-ended answers has been attempted in the past, but no approach guarantees 100% accuracy. To deal with the overload involved in this manual evaluation, a new tool becomes necessary. The unique superlative model…
Descriptors: Word Frequency, Models, Electronic Learning, Student Evaluation
Szafran, Robert F. – Practical Assessment, Research & Evaluation, 2017
Institutional assessment of student learning objectives has become a fact-of-life in American higher education and the Association of American Colleges and Universities' (AAC&U) VALUE Rubrics have become a widely adopted evaluation and scoring tool for student work. As faculty from a variety of disciplines, some less familiar with the…
Descriptors: Interrater Reliability, Case Studies, Scoring Rubrics, Behavioral Objectives
McIntosh, Kent; Massar, Michelle M.; Algozzine, Robert F.; George, Heather Peshak; Horner, Robert H.; Lewis, Timothy J.; Swain-Bradway, Jessica – Journal of Positive Behavior Interventions, 2017
Full and durable implementation of school-based interventions is supported by regular evaluation of fidelity of implementation. Multiple assessments have been developed to evaluate the extent to which schools are applying the core features of school-wide positive behavioral interventions and supports (SWPBIS). The "SWPBIS Tiered Fidelity…
Descriptors: Positive Behavior Supports, Fidelity, Program Implementation, Program Evaluation
Kladouchou, Vasiliki; Papathanasiou, Ilias; Efstratiadou, Eva A.; Christaki, Vasiliki; Hilari, Katerina – International Journal of Language & Communication Disorders, 2017
Background & Aims: This study ran within the framework of the Thales Aphasia Project that investigated the efficacy of elaborated semantic feature analysis (ESFA). We evaluated the treatment integrity (TI) of ESFA, i.e., the degree to which therapists implemented treatment as intended by the treatment protocol, in two different formats:…
Descriptors: Aphasia, Semantics, Speech Therapy, Group Therapy
Reinisch, Bianca; Krell, Moritz; Hergert, Susann; Gogolin, Sarah; Krüger, Dirk – International Journal of Science Education, 2017
Students' and pre-service teachers' conceptions of scientists have been assessed in a variety of studies. One of the most commonly used instruments is the Draw-A-Scientist Test (DAST) which offers the advantage that no verbal skills are needed by the participants. In some studies, methodical challenges related to the DAST have been discussed; for…
Descriptors: Foreign Countries, Cognitive Tests, Freehand Drawing, Personality Measures
Bartelink, Cora; de Kwaadsteniet, Leontien; ten Berge, Ingrid J.; Witteman, Cilia L. M. – Child & Youth Care Forum, 2017
Background: The LIRIK, an instrument for the assessment of child safety and risk, is designed to improve assessments by guiding professionals through a structured evaluation of relevant signs, risk factors, and protective factors. Objective: We aimed to assess the interrater agreement and the predictive validity of professionals' judgments made…
Descriptors: Child Safety, Test Validity, Test Reliability, Risk
Britton, Emily; Simper, Natalie; Leger, Andrew; Stephenson, Jenn – Assessment & Evaluation in Higher Education, 2017
Effective teamwork skills are essential for success in an increasingly team-based workplace. However, research suggests that there is often confusion concerning how teamwork is measured and assessed, making it difficult to develop these skills in undergraduate curricula. The goal of the present study was to develop a sustainable tool for assessing…
Descriptors: Teamwork, Undergraduate Students, Skills, Student Evaluation
Lakes, Kimberley D.; Guo, Yuqing; Taylor Lucas, Candice; Cooper, Dan – Infants and Young Children, 2017
One of the most important considerations in designing clinical infant research studies is the selection of reliable and valid measurement procedures. Few measures of caregiver-child interactions have been studied with newborns, particularly premature infants. The main objective of this study was to examine psychometric properties of the National…
Descriptors: Mothers, Neonates, Parent Child Relationship, Hospitalized Children
Lambie, Glenn W.; Mullen, Patrick R.; Swank, Jacqueline M.; Blount, Ashley – Measurement and Evaluation in Counseling and Development, 2018
Supervisors evaluated counselors-in-training at multiple points during their practicum experience using the Counseling Competencies Scale (CCS; N = 1,070). The CCS evaluations were randomly split to conduct exploratory factor analysis and confirmatory factor analysis, resulting in a 2-factor model (61.5% of the variance explained).
Descriptors: Counselor Training, Counseling, Measures (Individuals), Competence
Amin, Sarah A.; Panzarella, Carolyn; Lehnerd, Megan; Cash, Sean B.; Economos, Christina D.; Sacheck, Jennifer M. – Health Education & Behavior, 2018
Background: Recent efforts supporting children's dietary behaviors suggest the importance of food literacy (FL), which is a multidimensional concept that encompasses food-related knowledge, skills, and behaviors. To date, FL has been largely informed by adult and adolescent research. Aims: To assess the FL experiences, perceived skills, and…
Descriptors: Educational Opportunities, Food, Safety, Foods Instruction
Uzun, N. Bilge; Aktas, Mehtap; Asiret, Semih; Yormaz, Seha – Asian Journal of Education and Training, 2018
The goal of this study is to determine the reliability of dentistry students' performance scores on communication skills and to examine scoring reliability using generalizability theory with balanced random and fixed facet (mixed design) data, also considering the interactions of student, rater, and duty. The study group of the research…
Descriptors: Foreign Countries, Generalizability Theory, Scores, Test Reliability
Benton, Stephen L.; Li, Dan – IDEA Center, Inc., 2018
This technical report describes the results of analyses performed on data collected from 2013 to 2017, using the IDEA Feedback System for Administrators (FSA). The FSA is used to gather impressions from core constituents about an administrator's performance of relevant administrative roles, as well as her/his leadership style, interpersonal…
Descriptors: Feedback (Response), Administrators, Administrator Attitudes, Administrator Role
Hidalgo, María Ángeles; Lázaro-Ibarrola, Amparo – Studies in Second Language Learning and Teaching, 2020
Research into the potential of collaborative writing is relatively new. Similarly, task repetition (TR), which has been claimed to be a valuable tool for language learning, has been rarely explored in the context of writing. Therefore, little is known about the potential of combining TR and collaborative writing, and even less if we focus on young…
Descriptors: Task Analysis, Second Language Learning, Second Language Instruction, Accuracy
Linlin, Cao – English Language Teaching, 2020
Through Many-Facet Rasch analysis, this study explores the rating differences between one automated computer rater and five expert teacher raters in scoring 119 students on a computerized English listening-speaking test. Results indicate that both the automatic rater and the teacher raters demonstrate good inter-rater reliability, though the automatic rater…
Descriptors: Language Tests, Computer Assisted Testing, English (Second Language), Second Language Learning

