ERIC - Search Results

Publication Date

In 2026	0
Since 2025	60
Since 2022 (last 5 years)	286
Since 2017 (last 10 years)	782
Since 2007 (last 20 years)	2044

Descriptor

Interrater Reliability	3126
Foreign Countries	655
Test Reliability	504
Evaluation Methods	503
Test Validity	411
Correlation	401
Scoring	347
Comparative Analysis	327
Scores	324
Validity	310
Student Evaluation	308
Measures (Individuals)	298
Evaluators	295
Rating Scales	282
Statistical Analysis	268
Higher Education	264
Psychometrics	242
Reliability	231
Observation	229
Scoring Rubrics	217
Test Construction	213
English (Second Language)	211
Teaching Methods	208
Writing Evaluation	206
Intervention	200
More ▼

Education Level

Higher Education	574
Postsecondary Education	420
Elementary Education	282
Secondary Education	180
Early Childhood Education	145
Elementary Secondary Education	120
Middle Schools	109
High Schools	86
Preschool Education	72
Junior High Schools	65
Adult Education	59
Primary Education	57
Kindergarten	45
Grade 4	41
Grade 5	40
Intermediate Grades	40
Grade 1	37
Grade 6	35
Grade 8	32
Grade 3	31
Grade 2	27
Grade 7	27
Grade 10	13
Grade 9	11
Two Year Colleges	8
More ▼

Audience

Researchers	130
Practitioners	42
Teachers	22
Administrators	11
Counselors	3
Policymakers	2

Location

Australia	56
Turkey	53
United Kingdom	46
Canada	45
Netherlands	40
China	38
California	37
United States	30
United Kingdom (England)	25
Taiwan	23
Germany	22
Japan	22
Pennsylvania	22
Florida	21
Sweden	21
Iran	19
North Carolina	19
Hong Kong	17
South Korea	17
Texas	17
Georgia	16
Israel	15
New Zealand	14
South Africa	14
Washington	14
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001	13
Individuals with Disabilities…	7
Elementary and Secondary…	3
Race to the Top	3
Elementary and Secondary…	2
American Recovery and…	1
Americans with Disabilities…	1
Education Consolidation…	1
Education for All Handicapped…	1
Every Student Succeeds Act…	1
Improving Americas Schools…	1
Individuals with Disabilities…	1
Individuals with Disabilities…	1
Pell Grant Program	1
Rehabilitation Act 1973…	1
Stewart B McKinney Homeless…	1
Temporary Assistance for…	1
More ▼

What Works Clearinghouse Rating

Meets WWC Standards without Reservations	3
Meets WWC Standards with or without Reservations	3
Does not meet standards	3

Showing 226 to 240 of 3,126 results Save | Export

The Underlying Cognitive Processes of Thin Slices Judgments on Teaching Quality

Peer reviewed
PDF on ERIC

Download full text

Konstantin Vinokic; Lukas Begrich; Mareike Kunter; Susanne Kuger – Frontline Learning Research, 2024

Thin slices ratings (i.e., ratings based on first impressions) have yielded intriguingly accurate results in various domains. Among other, researcher have applied the thin slices technique to assess instructional quality, showing that teacher-student interactions can be reliably inferred by just very short snippets of classroom instruction. The…

Descriptors: Teacher Effectiveness, Teacher Student Relationship, Foreign Countries, Classroom Observation Techniques

Primary School Students' Ratings of Teaching -- Do They Differentiate between Subjects and Teachers?

Peer reviewed

Direct link

Svenja Rieser; Alexander Naumann – School Effectiveness and School Improvement, 2024

Our study aims to provide empirical evidence for and against the valid use of primary school students' ratings of three generic dimensions of teaching quality (classroom management, supportive climate, cognitive activation). We examine whether students discriminate between corresponding dimensions in different subjects, taking into account whether…

Descriptors: Foreign Countries, Elementary School Students, Elementary School Teachers, Student Evaluation of Teacher Performance

Beyond Percent Correct: Measuring Change in Individual Picture Naming Ability

Peer reviewed

Direct link

Walker, Grant M.; Basilakos, Alexandra; Fridriksson, Julius; Hickok, Gregory – Journal of Speech, Language, and Hearing Research, 2022

Purpose: Meaningful changes in picture naming responses may be obscured when measuring accuracy instead of quality. A statistic that incorporates information about the severity and nature of impairments may be more sensitive to the effects of treatment. Method: We analyzed data from repeated administrations of a naming test to 72 participants with…

Descriptors: Naming, Change, Aphasia, Severity (of Disability)

Real-World Executive Functioning for Autistic Children in School and Home Settings

Peer reviewed

Direct link

Tschida, Jessica E.; Yerys, Benjamin E. – Autism: The International Journal of Research and Practice, 2022

Executive function challenges are commonly reported in the home setting for children with an autism spectrum disorder diagnosis (hereafter, autism), but little is known about these challenges in the school setting. A total of 337 youth (autism, N = 241 and typically developing, N = 96) were assessed using Behavior Rating Inventory of Executive…

Descriptors: Executive Function, Students with Disabilities, Age Differences, Behavior Problems

Comparing Evidence on the Effectiveness of Reading Resources from Expert Ratings, Practitioner Judgements, and Research Repositories

Peer reviewed

Direct link

Hollands, Fiona M.; Pan, Yilin; Kieffer, Michael J.; Holmes, Venita R.; Wang, Yixin; Escueta, Maya; Head, Laura; Muroga, Atsuko – Evidence & Policy: A Journal of Research, Debate and Practice, 2022

Background: Education decision makers are increasingly expected to use evidence to inform their actions. However, the majority of educational interventions have not yet been studied and it is challenging to produce high quality research evidence quickly enough to influence policy questions. Aims and objectives: We set out to gather evidence on the…

Descriptors: Elementary Schools, Urban Schools, Reading Instruction, Instructional Effectiveness

Assessing Measurement Equivalence of PSC-17 across Teacher and Parent Respondents

Peer reviewed
PDF on ERIC

Download full text

Direct link

Gao, Ruiqin; Raygoza, Alyssa; Distefano, Christine; Greer, Fred; Dowdy, Erin – School Psychology International, 2022

The Pediatric Symptom Checklist-17 (PSC-17) is a popular screening instrument used by parents and clinicians to assess children's behavioral functioning. However, more schools are examining the potential of the PSC-17 as part of a Multi-Tier System of Support framework. To investigate the potential of the PSC-17 in the schools, a sample of 1,779…

Descriptors: Check Lists, Measures (Individuals), Screening Tests, Child Behavior

Using Inter-Rater Discourse to Trace the Origins of Disagreement: Towards Collective Reflective Practice in L2 Assessment

Peer reviewed

Direct link

Matthews, Joshua – RELC Journal: A Journal of Language Teaching and Research, 2023

This article explores how the analysis of inter-rater discourse can be used to support collective reflective practice in second language (L2) assessment. To demonstrate, a focused case of the discourse between two experienced language teachers as they negotiate assessment decisions on L2 written texts is presented. Of particular interest was the…

Descriptors: Interrater Reliability, Discourse Analysis, Student Evaluation, Second Language Learning

Continuous Improvement of Inter-Rater Reliability in Transition Compliance at a State Agency

Direct link

Heather Raithel – ProQuest LLC, 2023

A mixed methods action research study was designed to answer three research questions based on inter-rater reliability (IRR) in compliance calls for transition at a state education agency, perceived confidence levels in making and discussing compliance calls, and perceived confidence in sharing transition resources. An innovation based on…

Descriptors: Public Agencies, Interrater Reliability, Compliance (Legal), Comparative Analysis

Pedagogical Considerations for Examining Rater Variability in Rater-Mediated Assessments: A Three-Model Framework

Peer reviewed

Direct link

Wesolowski, Brian C.; Wind, Stefanie A. – Journal of Educational Measurement, 2019

Rater-mediated assessments are a common methodology for measuring persons, investigating rater behavior, and/or defining latent constructs. The purpose of this article is to provide a pedagogical framework for examining rater variability in the context of rater-mediated assessments using three distinct models. The first model is the observation…

Descriptors: Interrater Reliability, Models, Observation, Measurement

Modeling Rater Response Processes in Evaluating Score Meaning

Peer reviewed

Direct link

Lane, Suzanne – Journal of Educational Measurement, 2019

Rater-mediated assessments require the evaluation of the accuracy and consistency of the inferences made by the raters to ensure the validity of score interpretations and uses. Modeling rater response processes allows for a better understanding of how raters map their representations of the examinee performance to their representation of the…

Descriptors: Responses, Accuracy, Validity, Interrater Reliability

Examining Rater Reliability When Using an Analytical Rubric for Oral Presentation Assessments

Peer reviewed
PDF on ERIC

Download full text

Sasithorn Limgomolvilas; Patsawut Sukserm – LEARN Journal: Language Education and Acquisition Research Network, 2025

The assessment of English speaking in EFL environments can be inherently subjective and influenced by various factors beyond linguistic ability, including choice of assessment criteria, and even the rubric type. In classroom assessment, the type of rubric recommended for English speaking tasks is the analytical rubric. Driven by three aims, this…

Descriptors: Oral Language, Speech Communication, English (Second Language), Second Language Learning

Can AI Grade Like a Human? Validity, Reliability, and Fairness in University Coursework Assessment

Peer reviewed
PDF on ERIC

Download full text

Georgios Zacharis; Stamatios Papadakis – Educational Process: International Journal, 2025

Background/purpose: Generative artificial intelligence (GenAI) is often promoted as a transformative tool for assessment, yet evidence of its validity compared to human raters remains limited. This study examined whether an AI-based rater could be used interchangeably with trained faculty in scoring complex coursework. Materials/methods:…

Descriptors: Artificial Intelligence, Technology Uses in Education, Computer Assisted Testing, Grading

On the Superior Statistical Properties of Frequency Scales in Job Analyses

Peer reviewed

Direct link

Babcock, Ben; Risk, Nicole M.; Wyse, Adam E. – Educational Measurement: Issues and Practice, 2020

This study compared the statistical properties of four job analysis task survey response scale types: criticality, difficulty in learning, importance, and frequency. We used nine job analysis studies spanning two fields, medical imaging and allied health professionals, to compare the job analysis scales in terms of variability and interrater…

Descriptors: Job Analysis, Radiology, Allied Health Personnel, Surveys

Inter-Rater Reliability of Washington State's Kindergarten Entry Assessment

Peer reviewed

Direct link

Joseph, Gail; Soderberg, Janet S.; Stull, Sara; Cummings, Kevin; McCutchen, Deborah; Han, Rachel J. – Early Education and Development, 2020

Research Findings: This study explores the inter-rater reliability of WaKIDS, Washington State's kindergarten entry assessment (KEA). Specifically, we analyze (1) the extent to which teachers' assessments are in agreement with a master code, (2) how often inaccurate assessment decisions lead to misidentification of school readiness, and (3)…

Descriptors: Interrater Reliability, School Readiness, Kindergarten, Evaluation Problems

Evidence on the Dimensionality and Reliability of Professional References' Ratings of Teacher Applicants. Working Paper No. 237-0620

Download full text

Goldhaber, Dan; Grout, Cyrus; Wolf, Malcom; Martinkova, Patricia – National Center for Analysis of Longitudinal Data in Education Research (CALDER), 2020

There is growing interest in using measures of teacher applicant quality to improve hiring decisions, but the statistical properties of such measures are poorly understood. We present evidence on structured ratings solicited from teacher applicants' references. We find that the reference ratings capture only one underlying dimension of applicant…

Descriptors: Job Applicants, Teacher Selection, Interrater Reliability, Decision Making

« Previous Page | Next Page »

Pages: 1 | ... | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | ... | 209

ProQuest LLC	86
Journal of Speech, Language,…	62
Educational and Psychological…	61
Journal of Autism and…	57
Grantee Submission	40
Language Testing	39
Online Submission	35
International Journal of…	34
Assessment & Evaluation in…	33
Research in Developmental…	31
Applied Measurement in…	28
Advances in Health Sciences…	26
Assessment for Effective…	26
ETS Research Report Series	25
Journal of Educational…	25
Educational Measurement:…	23
Measurement in Physical…	20
Language Assessment Quarterly	19
Psychology in the Schools	19
Topics in Early Childhood…	19
Psychological Assessment	18
Educational Assessment	16
Autism: The International…	15
Journal of Consulting and…	15
Personnel Psychology	15
More ▼

Lunz, Mary E.	10
Wind, Stefanie A.	10
Engelhard, George, Jr.	8
Epstein, Michael H.	8
Ingham, Roger J.	8
Johnson, Evelyn S.	8
Matson, Johnny L.	7
McLeod, Bryce D.	7
Moylan, Laura A.	7
Cason, Carolyn L.	6
Cordes, Anne K.	6
Jaeger, Richard M.	6
Johnson, Robert L.	6
Lecavalier, Luc	6
Plake, Barbara S.	6
Tasse, Marc J.	6
Wyse, Adam E.	6
Zheng, Yuzhu	6
Aman, Michael G.	5
Barton, Erin E.	5
Cason, Gerald J.	5
Coniam, David	5
Conroy, Maureen A.	5
Crawford, Angela R.	5
More ▼

Journal Articles	2557
Reports - Research	2245
Reports - Evaluative	515
Speeches/Meeting Papers	272
Reports - Descriptive	163
Tests/Questionnaires	163
Information Analyses	130
Dissertations/Theses -…	89
Opinion Papers	61
Numerical/Quantitative Data	31
Guides - Non-Classroom	11
Books	7
Collected Works - General	3
Guides - Classroom - Teacher	3
Non-Print Media	3
Book/Product Reviews	2
Collected Works - Serials	2
Dissertations/Theses	2
ERIC Digests in Full Text	2
ERIC Publications	2
Guides - General	2
Reports - General	2
Collected Works - Proceedings	1
Reference Materials -…	1
Reference Materials - General	1
More ▼

Test of English as a Foreign…	30
Child Behavior Checklist	18
National Assessment of…	14
Vineland Adaptive Behavior…	14
Autism Diagnostic Observation…	13
Strengths and Difficulties…	11
Woodcock Johnson Tests of…	10
Peabody Picture Vocabulary…	9
SAT (College Admission Test)	9
Wechsler Intelligence Scale…	9
Behavior Assessment System…	8
Dynamic Indicators of Basic…	8
Early Childhood Environment…	8
Graduate Record Examinations	8
International English…	7
Teacher Performance…	6
ACT Assessment	5
Advanced Placement…	5
Behavioral and Emotional…	5
Childhood Autism Rating Scale	5
Classroom Assessment Scoring…	5
Conners Teacher Rating Scale	5
Draw a Person Test	5
Raven Progressive Matrices	5
ACTFL Oral Proficiency…	4
More ▼