ERIC - Search Results

Publication Date

In 2025	3
Since 2024	5
Since 2021 (last 5 years)	36
Since 2016 (last 10 years)	121
Since 2006 (last 20 years)	343

Descriptor

Correlation	401
Interrater Reliability	401
Foreign Countries	110
Scores	87
Measures (Individuals)	76
Statistical Analysis	69
Validity	66
Comparative Analysis	65
Test Reliability	62
Test Validity	60
Evaluation Methods	59
Rating Scales	59
Psychometrics	48
Reliability	47
Scoring	45
Questionnaires	44
Evaluators	43
Measurement Techniques	39
Second Language Learning	38
Observation	37
Children	34
Factor Analysis	33
Student Evaluation	32
English (Second Language)	30
Autism	29
More ▼

Publication Type

Journal Articles	349
Reports - Research	312
Reports - Evaluative	61
Tests/Questionnaires	32
Speeches/Meeting Papers	16
Dissertations/Theses -…	12
Information Analyses	12
Reports - Descriptive	11
Numerical/Quantitative Data	5
Collected Works - Proceedings	1
Collected Works - Serials	1
Opinion Papers	1
Reference Materials -…	1
More ▼

Education Level

Higher Education	87
Postsecondary Education	66
Elementary Education	34
Early Childhood Education	31
Secondary Education	28
Elementary Secondary Education	19
Preschool Education	17
High Schools	16
Middle Schools	15
Grade 3	9
Adult Education	7
Grade 5	7
Grade 6	7
Primary Education	7
Grade 1	6
Grade 4	6
Grade 7	6
Kindergarten	6
Junior High Schools	5
Grade 2	4
Grade 8	4
Intermediate Grades	4
Grade 11	3
Grade 10	2
Grade 9	1
More ▼

Audience

Researchers	10
Administrators	1
Practitioners	1
Teachers	1

Location

Netherlands	14
China	11
California	9
Canada	9
Turkey	8
United Kingdom	8
Japan	7
United States	7
Florida	6
Germany	6
Hong Kong	5
Sweden	5
Taiwan	5
Texas	5
Australia	4
Italy	4
Ohio	4
Pennsylvania	4
South Korea	4
Washington	4
Estonia	3
India	3
Ireland	3
Massachusetts	3
Philippines	3
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001	2
Individuals with Disabilities…	1

What Works Clearinghouse Rating

Meets WWC Standards without Reservations	1
Meets WWC Standards with or without Reservations	1

Showing 1 to 15 of 401 results Save | Export

Profiling Communication Ability in Dementia: Validation of a New Cognitive-Communication Assessment Tool

Peer reviewed

Direct link

Suzanna Dooley; Tammy Hopper; Rachael Doyle; Orla Gilheaney; Margaret Walshe – International Journal of Language & Communication Disorders, 2025

Background: Individuals with dementia have communication limitations resulting from cognitive impairments that define the syndrome. Whereas there are numerous cognitive assessments for individuals with dementia, there are far fewer communication assessments. The Profiling Communication Ability in Dementia (P-CAD) was developed to address this gap.…

Descriptors: Communication Skills, Communication Problems, Dementia, Intellectual Disability

Resolving and Re-Scoring Constructed Response Items in Mixed-Format Assessments: An Exploration of Three Approaches

Peer reviewed

Direct link

Stefanie A. Wind; Yangmeng Xu – Educational Assessment, 2024

We explored three approaches to resolving or re-scoring constructed-response items in mixed-format assessments: rater agreement, person fit, and targeted double scoring (TDS). We used a simulation study to consider how the three approaches impact the psychometric properties of student achievement estimates, with an emphasis on person fit. We found…

Descriptors: Interrater Reliability, Error of Measurement, Evaluation Methods, Examiners

To What Extent Are Item Discrimination Values Realistic? A New Index for Two-Dimensional Structures

Peer reviewed
PDF on ERIC

Download full text

Kilic, Abdullah Faruk; Uysal, Ibrahim – International Journal of Assessment Tools in Education, 2022

Most researchers investigate the corrected item-total correlation of items when analyzing item discrimination in multi-dimensional structures under the Classical Test Theory, which might lead to underestimating item discrimination, thereby removing items from the test. Researchers might investigate the corrected item-total correlation with the…

Descriptors: Item Analysis, Correlation, Item Response Theory, Test Items

Graders of the Future: Comparing the Consistency and Accuracy of GPT4 and Pre-Service Teachers in Physics Essay Question Assessments

Peer reviewed
PDF on ERIC

Download full text

Yubin Xu; Lin Liu; Jianwen Xiong; Guangtian Zhu – Journal of Baltic Science Education, 2025

As the development and application of large language models (LLMs) in physics education progress, the well-known AI-based chatbot ChatGPT4 has presented numerous opportunities for educational assessment. Investigating the potential of AI tools in practical educational assessment carries profound significance. This study explored the comparative…

Descriptors: Physics, Artificial Intelligence, Computer Software, Accuracy

The Whole Is More than the Sum of Its Parts -- Assessing Writing Using the Consensual Assessment Technique

Peer reviewed

Direct link

Zahn, Daniela; Canton, Ursula; Boyd, Victoria; Hamilton, Laura; Mamo, Josianne; McKay, Jane; Proudfoot, Linda; Telfer, Dickson; Williams, Kim; Wilson, Colin – Studies in Higher Education, 2021

Evaluating the impact of Academic Literacies teaching (Lea and Street [1998. "Student Writing in Higher Education: An Academic Literacies Approach." "Studies in Higher Education" 23 (2): 157-72. doi:10.1080/03075079812331380364]) is difficult, as it involves gauging whether writers: (1) gain better understanding of what…

Descriptors: Writing Evaluation, Evaluation Methods, Undergraduate Students, Foreign Countries

A Unified Approach to Estimating the Intraclass Correlation Coefficient and Its Bias: An Exploratory Study

Direct link

Kelvin Terrell Pompey – ProQuest LLC, 2021

Many methods are used to measure interrater reliability for studies where each target receives ratings by a different set of judges. The purpose of this study is to explore the use of hierarchical modeling for estimating interrater reliability using the intraclass correlation coefficient. This study provides a description of how the ICC can be…

Descriptors: Interrater Reliability, Evaluation Methods, Test Reliability, Correlation

Rater Connections and the Detection of Bias in Performance Assessment

Peer reviewed

Direct link

Wind, Stefanie A. – Measurement: Interdisciplinary Research and Perspectives, 2022

In many performance assessments, one or two raters from the complete rater pool scores each performance, resulting in a sparse rating design, where there are limited observations of each rater relative to the complete sample of students. Although sparse rating designs can be constructed to facilitate estimation of student achievement, the…

Descriptors: Evaluators, Bias, Identification, Performance Based Assessment

Using Differential Item Functioning to Test for Interrater Reliability in Constructed Response Items

Peer reviewed

Direct link

Walker, Cindy M.; Göçer Sahin, Sakine – Educational and Psychological Measurement, 2020

The purpose of this study was to investigate a new way of evaluating interrater reliability that can allow one to determine if two raters differ with respect to their rating on a polytomous rating scale or constructed response item. Specifically, differential item functioning (DIF) analyses were used to assess interrater reliability and compared…

Descriptors: Test Bias, Interrater Reliability, Responses, Correlation

Development of the Social Motor Function Classification System for Children with Autism Spectrum Disorders: A Psychometric Study

Peer reviewed

Direct link

Pin, Tamis W.; So, Vincent K. K.; Siu, Cynthia S. H.; Yip, Sheila S. N.; Cheung, Stella See-wing; Kan, Jenny Yim-mui – Journal of Autism and Developmental Disorders, 2021

To examine reliability and validity of the new Social Motor Function Classification System for Children with Autism Spectrum Disorders (SMFCS-ASD). The SMFCS-ASD reliability was examined on 25 children (62.4 months SD 7.8) with ASD among six physical therapists. The validity study involved 1001 children (57.0 months, SD 9.9) with ASD using the…

Descriptors: Autism, Pervasive Developmental Disorders, Children, Classification

Examining Rater Reliability When Using an Analytical Rubric for Oral Presentation Assessments

Peer reviewed
PDF on ERIC

Download full text

Sasithorn Limgomolvilas; Patsawut Sukserm – LEARN Journal: Language Education and Acquisition Research Network, 2025

The assessment of English speaking in EFL environments can be inherently subjective and influenced by various factors beyond linguistic ability, including choice of assessment criteria, and even the rubric type. In classroom assessment, the type of rubric recommended for English speaking tasks is the analytical rubric. Driven by three aims, this…

Descriptors: Oral Language, Speech Communication, English (Second Language), Second Language Learning

Development and Validation of Sentences in Tamil for Psychoacoustic Evaluation of Voice Using the Consensus Auditory-Perceptual Evaluation of Voice

Peer reviewed

Direct link

Venkatraman, Yamini; Mahalingam, Shenbagavalli; Boominathan, Prakash – Journal of Speech, Language, and Hearing Research, 2022

Purpose: The Consensus Auditory-Perceptual Evaluation of Voice (CAPE-V) is a standardized instrument used in voice assessment to assess voice quality. It has been translated and culturally adapted in several languages. This study aimed at developing and validating a Tamil version of CAPE-V through auditory perceptual evaluation of remotely…

Descriptors: Sentences, Dravidian Languages, Acoustics, Auditory Perception

Correlating What We Know: A Mixed Methods Study of Reflection and Writing in First-Year Writing Assessment

Peer reviewed

Direct link

Pruchnic, Jeff; Barton, Ellen; Primeau, Sarah; Trimble, Thomas; Varty, Nicole; Foster, Tanina – Composition Forum, 2021

Over the past two decades, reflective writing has occupied an increasingly prominent position in composition theory, pedagogy, and assessment as researchers have described the value of reflection and reflective writing in college students' development of higher-order writing skills, such as genre conventions (Yancey, "Reflection";…

Descriptors: Reflection, Correlation, Essays, Freshman Composition

Exploring the Reliability and Its Influencing Factors of Peer Assessment in Massive Open Online Courses

Peer reviewed

Direct link

Li, Hongxia; Zhao, ChengLing; Long, Taotao; Huang, Yan; Shu, Fengfang – British Journal of Educational Technology, 2021

As an innovative evaluation tool, peer assessment is essential in Massive Open Online Courses (MOOCs). In both formative and summative peer assessments in MOOCs, providing reliable feedback is crucial in enhancing learning outcomes. Peer assessment has been highlighted as a reliable tool in both traditional classrooms and small-scale online…

Descriptors: Peer Evaluation, Online Courses, Open Education, Feedback (Response)

Intra- and Inter-Rater Reliability of the Behaviour Mapping Schedule: A Direct Observational Tool for Classifying Children's Play Behaviour

Peer reviewed

Direct link

Dankiw, Kylie A.; Baldock, Katherine L.; Kumar, Saravana; Tsiros, Margarita D. – Australasian Journal of Early Childhood, 2021

Identifying and describing children's play behaviours is an important component of evaluating child development. The Behaviour Mapping Schedule is a direct observational tool which aims to describe and quantify children's play behaviours but is yet to undergo reliability testing. This study aimed to determine the intra- and inter-rater reliability…

Descriptors: Interrater Reliability, Classification, Child Behavior, Play

Meta-Analysis of Inter-Rater Agreement and Discrepancy Between Human and Automated English Essay Scoring

Peer reviewed
PDF on ERIC

Download full text

Direct link

Jiyeo Yun – English Teaching, 2023

Studies on automatic scoring systems in writing assessments have also evaluated the relationship between human and machine scores for the reliability of automated essay scoring systems. This study investigated the magnitudes of indices for inter-rater agreement and discrepancy, especially regarding human and machine scoring, in writing assessment.…

Descriptors: Meta Analysis, Interrater Reliability, Essays, Scoring

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 27

ProQuest LLC	12
Educational and Psychological…	10
Language Testing	10
ETS Research Report Series	9
Journal of Speech, Language,…	8
Research in Developmental…	8
International Journal of…	7
Journal of Autism and…	7
Online Submission	7
Advances in Health Sciences…	5
Applied Measurement in…	5
Autism: The International…	5
Physical & Occupational…	5
Creativity Research Journal	4
Developmental Medicine &…	4
Educational Assessment	4
Grantee Submission	4
International Journal of…	4
Journal of Educational…	4
Journal of Emotional and…	4
Journal of Psychoeducational…	4
Measurement in Physical…	4
Psychological Assessment	4
Assessment	3
Educational Research and…	3
More ▼

Coniam, David	4
Attali, Yigal	3
Scahill, Lawrence	3
Zhang, Mo	3
Abrams, Lisa M.	2
Aman, Michael G.	2
Anna-Maria Fall	2
Benton, Stephen L.	2
Beula M. Magimairaj	2
Bolton, Patrick	2
Botting, Nicola	2
Buitelaar, Jan K.	2
Conroy, Maureen A.	2
Davis, Larry	2
Epstein, Michael H.	2
Goldhaber, Dan	2
Greg Roberts	2
Hagiwara, Taku	2
Ichikawa, Hironobu	2
Inoue, Masahiko	2
Johnston, Charlotte	2
Kamio, Yoko	2
Kaufman, James C.	2
Konold, Timothy R.	2
More ▼

Child Behavior Checklist	7
Strengths and Difficulties…	6
Test of English as a Foreign…	6
Autism Diagnostic Observation…	5
Graduate Record Examinations	4
Peabody Developmental Motor…	3
SAT (College Admission Test)	3
Vineland Adaptive Behavior…	3
Battelle Developmental…	2
Behavior Assessment System…	2
Behavioral and Emotional…	2
Dynamic Indicators of Basic…	2
Early Childhood Environment…	2
MacArthur Communicative…	2
Mullen Scales of Early…	2
National Assessment of…	2
Obsessive Compulsive Scale	2
Peabody Picture Vocabulary…	2
Praxis Series	2
Preschool Language Scale	2
Program for International…	2
Raven Progressive Matrices	2
Student Teacher Relationship…	2
Wechsler Intelligence Scale…	2
ACT Assessment	1
More ▼