ERIC - Search Results

Publication Date

In 2025	27
Since 2024	95
Since 2021 (last 5 years)	356
Since 2016 (last 10 years)	878
Since 2006 (last 20 years)	2091

Descriptor

Interrater Reliability	3093
Foreign Countries	642
Evaluation Methods	501
Test Reliability	498
Test Validity	406
Correlation	401
Scoring	336
Comparative Analysis	327
Scores	321
Validity	309
Student Evaluation	301
Measures (Individuals)	298
Evaluators	291
Rating Scales	282
Statistical Analysis	268
Higher Education	263
Psychometrics	238
Observation	228
Reliability	228
Scoring Rubrics	214
Test Construction	212
Teaching Methods	208
English (Second Language)	203
Writing Evaluation	202
Intervention	200

Education Level

Higher Education	562
Postsecondary Education	408
Elementary Education	280
Secondary Education	177
Early Childhood Education	142
Elementary Secondary Education	119
Middle Schools	108
High Schools	84
Preschool Education	72
Junior High Schools	64
Adult Education	58
Primary Education	55
Kindergarten	45
Grade 4	41
Grade 5	40
Intermediate Grades	40
Grade 1	36
Grade 6	35
Grade 8	32
Grade 3	30
Grade 7	27
Grade 2	25
Grade 10	13
Grade 9	11
Two Year Colleges	8

Audience

Researchers	130
Practitioners	42
Teachers	22
Administrators	11
Counselors	3
Policymakers	2

Location

Australia	56
Turkey	52
United Kingdom	46
Canada	45
Netherlands	40
California	37
China	37
United States	30
United Kingdom (England)	24
Taiwan	23
Japan	22
Pennsylvania	22
Florida	21
Germany	21
Sweden	21
Iran	19
North Carolina	19
Hong Kong	17
Texas	17
Georgia	16
South Korea	16
Israel	15
New Zealand	14
Washington	14
South Africa	13

Laws, Policies, & Programs

No Child Left Behind Act 2001	13
Individuals with Disabilities…	7
Race to the Top	3
Elementary and Secondary…	2
American Recovery and…	1
Americans with Disabilities…	1
Education Consolidation…	1
Education for All Handicapped…	1
Elementary and Secondary…	1
Improving Americas Schools…	1
Individuals with Disabilities…	1
Individuals with Disabilities…	1
Pell Grant Program	1
Rehabilitation Act 1973…	1
Stewart B McKinney Homeless…	1
Temporary Assistance for…	1

What Works Clearinghouse Rating

Meets WWC Standards without Reservations	3
Meets WWC Standards with or without Reservations	3
Does not meet standards	3

Filter: Interrater Reliability

Showing 46 to 60 of 3,093 results

Informant Discrepancies in Universal Screening as a Function of Student and Teacher Characteristics

Peer reviewed

Brittany N. Zakszeski; Heather E. Ormiston; Malena A. Nygaard; Kane Carlock – School Psychology Review, 2025

Despite the widespread use of school-based universal screening systems for social, emotional, and behavioral risk, limited research has examined discrepancies in ratings provided by teachers and their secondary students. Using the Social, Academic, and Emotional Behavior Risk Screener (SAEBRS; teacher report) and mySAEBRS (student report) scores…

Descriptors: Middle School Students, Middle School Teachers, Screening Tests, Affective Behavior

Assessing Social Communication and Measuring Changes in Chinese Autistic Preschoolers: A Preliminary Study Using the Social Communication Scale

Peer reviewed

Li Wang; Xin Qi; Ziyan Meng; Meiyu Xiang; Zhuoqing Li; Sitong Zhang; Longyun Hu; Hoyee W. Hirai; Carol K. S. To; Patrick C. M. Wong – Journal of Speech, Language, and Hearing Research, 2025

Purpose: Assessing social communication and measuring its changes among young autistic children presents significant challenges, particularly when tracking intervention effects within short timeframes. Existing measures, mostly validated in Western contexts, may not be suitable for culturally diverse populations. Addressing this gap, the Social…

Descriptors: Autism Spectrum Disorders, Preschool Children, Interpersonal Communication, Communication Skills

Inter-Evaluator Reliability of Sagittal and Rotational Spinal Measurements from 3D Ultrasound Imaging of Healthy Females in Standing with Varying Arm Positions

Peer reviewed

Aislinn Ganci; Miran Qazizada; Brianna Fehr; Ana Vucenovic; Edmond Lou; Eric Parent – Measurement in Physical Education and Exercise Science, 2024

Spinal alignment can be assessed without radiation using three-dimensional ultrasound imaging (3DUS). Reliable measurements could inform the ideal arm position for scoliosis radiographs. This study determined the inter-evaluator reliability of axial vertebral rotation (AVR) measurements and sagittal curve angles in healthy females from 3DUS spinal…

Descriptors: Foreign Countries, Young Adults, Adults, Adolescents

The Development of a Novel, Standardized, Norm-Referenced Arabic Discourse Assessment Tool (ADAT), Including an Examination of Psychometric Properties of Discourse Measures in Aphasia

Peer reviewed

Reem S. W. Alyahya – International Journal of Language & Communication Disorders, 2024

Background: People with aphasia (PWA) typically exhibit deficits in spoken discourse. Discourse analysis is the gold standard approach to assess language deficits beyond sentence level. However, the available discourse assessment tools are biased towards English and European languages and Western culture. Additionally, there is a lack of consensus…

Descriptors: Arabic, Aphasia, Psychometrics, Test Construction

Interrater Reliability of the Test of Gross Motor Development--Third Edition following Raters' Agreement on Measurement Criteria

Peer reviewed

Carballo-Fazanes, Aida; Rey, Ezequiel; Valentini, Nadia C.; Varela-Casal, Cristina; Abelairas-Gómez, Cristian – Journal of Motor Learning and Development, 2023

We aimed to calculate interrater reliability of the Test of Gross Motor Development--Third Edition (TGMD-3) after raters reached a consensus regarding measurement criteria. Three raters measured the fundamental movement skills of 25 children on the TGMD-3 at two different times: (a) once when simply following the measurement criteria in the TGMD-3…

Descriptors: Motor Development, Children, Norm Referenced Tests, Interrater Reliability

Examining Inter-Rater Reliability of Evaluators Judging Teacher Performance: Proposing an Alternative to Cohen's Kappa. CEME Technical Report. CEMETR-2022-02

Lambert, Richard G.; Holcomb, T. Scott; Bottoms, Bryndle – Center for Educational Measurement and Evaluation, 2022

The validity of the Kappa coefficient of chance-corrected agreement has been questioned when the prevalence of specific rating scale categories is low and agreement between raters is high. The researchers proposed the Lambda Coefficient of Rater-Mediated Agreement as an alternative to Kappa to address these concerns. Lambda corrects for chance…

Descriptors: Interrater Reliability, Evaluators, Rating Scales, Teacher Evaluation

Using Machine Learning to Score Multi-Dimensional Assessments of Chemistry and Physics

Peer reviewed

Maestrales, Sarah; Zhai, Xiaoming; Touitou, Israel; Baker, Quinton; Schneider, Barbara; Krajcik, Joseph – Journal of Science Education and Technology, 2021

In response to the call for promoting three-dimensional science learning (NRC, 2012), researchers argue for developing assessment items that go beyond rote memorization tasks to ones that require deeper understanding and the use of reasoning that can improve science literacy. Such assessment items are usually performance-based constructed…

Descriptors: Artificial Intelligence, Scoring, Evaluation Methods, Chemistry

Large-Sample Variance of Fleiss Generalized Kappa

Peer reviewed

Gwet, Kilem L. – Educational and Psychological Measurement, 2021

Cohen's kappa coefficient was originally proposed for two raters only, and it was later extended to an arbitrarily large number of raters to become what is known as Fleiss' generalized kappa. Fleiss' generalized kappa and its large-sample variance are still widely used by researchers and were implemented in several software packages, including, among…

Descriptors: Sample Size, Statistical Analysis, Interrater Reliability, Computation

New Tests of Rater Drift in Trend Scoring

Peer reviewed

John R. Donoghue; Carol Eckerly – Applied Measurement in Education, 2024

Trend scoring constructed response items (i.e. rescoring Time A responses at Time B) gives rise to two-way data that follow a product multinomial distribution rather than the multinomial distribution that is usually assumed. Recent work has shown that the difference in sampling model can have profound negative effects on statistics usually used to…

Descriptors: Scoring, Error of Measurement, Reliability, Scoring Rubrics

The Politics of Reading Textbooks: Intergenerational and International Reflections on China

Peer reviewed

Liz Jackson; Michael W. Apple; Fei Yan; Jason Cong Lin; Chenxi Jiang; Tongzhou Li; Edward Vickers – Educational Philosophy and Theory, 2024

In this collective essay the authors consider the nature and consequences of reading and researching across difference in an international and intergenerational team, whose core members are focused on understanding how curriculum operates and the nature of textbook representation of diversity in Mainland China, Hong Kong, Taiwan, and Macau.…

Descriptors: Foreign Countries, Textbooks, Reading Research, Educational Research

Naive Listener Ratings of Speech Intelligibility over the Course of Motor-Based Intervention in Children with Childhood Apraxia of Speech

Peer reviewed

Emily W. Wang; Maria I. Grigos – Journal of Speech, Language, and Hearing Research, 2024

Purpose: The aim of this study was to describe changes in speech intelligibility and interrater and intrarater reliability of naive listeners' ratings of words produced by young children diagnosed with childhood apraxia of speech (CAS) over a period of motor-based intervention (dynamic temporal and tactile cueing [DTTC]). Method: A total of 120…

Descriptors: Speech Communication, Intelligibility, Speech Impairments, Perceptual Motor Learning

Development of a Scoring Key to Evaluate the Creative Story Writing Levels of Secondary School Seventh Grade Students

Peer reviewed

Ebru Öztürk; Erol Duran – Educational Policy Analysis and Strategic Research, 2024

This study aimed to develop a rubric to evaluate the creative story writing skill levels of seventh grade secondary school students. The research used a quantitative method and a survey model. Convenience sampling was used, and 270 students studying at the seventh grade level of secondary school…

Descriptors: Scoring Rubrics, Writing Evaluation, Creative Writing, Middle School Students

Statistically Guided Grading Judgements: Contextualisation or Contamination?

Peer reviewed

Louise Badham – Oxford Review of Education, 2025

Different sources of assessment evidence are reviewed during International Baccalaureate (IB) grade awarding to convert marks into grades and ensure fair results for students. Qualitative and quantitative evidence are analysed to determine grade boundaries, with statistical evidence weighed against examiner judgement and teachers' feedback on…

Descriptors: Advanced Placement Programs, Grading, Interrater Reliability, Evaluative Thinking

Using Automated Procedures to Score Educational Essays Written in Three Languages

Peer reviewed

Tahereh Firoozi; Hamid Mohammadi; Mark J. Gierl – Journal of Educational Measurement, 2025

The purpose of this study is to describe and evaluate a multilingual automated essay scoring (AES) system for grading essays in three languages. Two different sentence embedding models were evaluated within the AES system, multilingual BERT (mBERT) and language-agnostic BERT sentence embedding (LaBSE). German, Italian, and Czech essays were…

Descriptors: College Students, Slavic Languages, German, Italian

Engaging Classroom Observation: A Brief Measure of Active Learning in the College Classroom

Peer reviewed

Chase Young; Benjamin Mitchell-Yellin; George Kevin Randall – Active Learning in Higher Education, 2025

The purpose of this study was to develop a valid, reliable, and brief measure of active learning in college classrooms that is cheap and easy to complete and yields results that faculty can easily use to inform their development as instructors. Initial construct and face validity was achieved by modifying existing instruments and creating a draft…

Descriptors: College Faculty, College Students, Active Learning, Classroom Observation Techniques


Source

ProQuest LLC	86
Educational and Psychological…	61
Journal of Speech, Language,…	61
Journal of Autism and…	56
Grantee Submission	40
Language Testing	37
Online Submission	35
Assessment & Evaluation in…	33
International Journal of…	33
Research in Developmental…	31
Applied Measurement in…	28
Assessment for Effective…	26
Advances in Health Sciences…	25
ETS Research Report Series	25
Journal of Educational…	24
Educational Measurement:…	22
Measurement in Physical…	20
Language Assessment Quarterly	19
Psychology in the Schools	19
Topics in Early Childhood…	19
Psychological Assessment	18
Educational Assessment	16
Autism: The International…	15
Journal of Consulting and…	15
Personnel Psychology	15

Author

Lunz, Mary E.	10
Wind, Stefanie A.	10
Engelhard, George, Jr.	8
Epstein, Michael H.	8
Ingham, Roger J.	8
Johnson, Evelyn S.	8
Matson, Johnny L.	7
McLeod, Bryce D.	7
Moylan, Laura A.	7
Cason, Carolyn L.	6
Cordes, Anne K.	6
Jaeger, Richard M.	6
Johnson, Robert L.	6
Lecavalier, Luc	6
Plake, Barbara S.	6
Tasse, Marc J.	6
Wyse, Adam E.	6
Zheng, Yuzhu	6
Aman, Michael G.	5
Barton, Erin E.	5
Cason, Gerald J.	5
Coniam, David	5
Conroy, Maureen A.	5
Crawford, Angela R.	5

Publication Type

Journal Articles	2526
Reports - Research	2212
Reports - Evaluative	515
Speeches/Meeting Papers	272
Reports - Descriptive	163
Tests/Questionnaires	162
Information Analyses	129
Dissertations/Theses -…	89
Opinion Papers	61
Numerical/Quantitative Data	31
Guides - Non-Classroom	11
Books	7
Collected Works - General	3
Guides - Classroom - Teacher	3
Non-Print Media	3
Book/Product Reviews	2
Collected Works - Serials	2
Dissertations/Theses	2
ERIC Digests in Full Text	2
ERIC Publications	2
Guides - General	2
Reports - General	2
Collected Works - Proceedings	1
Reference Materials -…	1
Reference Materials - General	1

Assessments and Surveys

Test of English as a Foreign…	29
Child Behavior Checklist	18
National Assessment of…	14
Vineland Adaptive Behavior…	14
Autism Diagnostic Observation…	13
Strengths and Difficulties…	10
Woodcock Johnson Tests of…	10
Peabody Picture Vocabulary…	9
Wechsler Intelligence Scale…	9
Behavior Assessment System…	8
Dynamic Indicators of Basic…	8
Early Childhood Environment…	8
Graduate Record Examinations	8
SAT (College Admission Test)	8
International English…	6
Teacher Performance…	6
Advanced Placement…	5
Behavioral and Emotional…	5
Childhood Autism Rating Scale	5
Conners Teacher Rating Scale	5
Draw a Person Test	5
Raven Progressive Matrices	5
ACT Assessment	4
ACTFL Oral Proficiency…	4
Battelle Developmental…	4