ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	5
Since 2016 (last 10 years)	14
Since 2006 (last 20 years)	46

Descriptor

Correlation	59
Evaluation Methods	59
Interrater Reliability	59
Foreign Countries	12
Test Reliability	12
Rating Scales	11
Statistical Analysis	11
Measures (Individuals)	10
Student Evaluation	10
Comparative Analysis	9
Evaluators	9
Measurement Techniques	9
Scores	9
Scoring	8
Test Validity	8
Psychometrics	7
Data Analysis	6
Evaluation Research	6
Validity	6
Computer Software	5
Educational Technology	5
Evaluation Criteria	5
Intervention	5
Observation	5
Second Language Learning	5
More ▼

Publication Type

Journal Articles	48
Reports - Research	38
Reports - Evaluative	18
Tests/Questionnaires	4
Dissertations/Theses -…	2
Speeches/Meeting Papers	2
Collected Works - Proceedings	1
Collected Works - Serials	1
Information Analyses	1
Numerical/Quantitative Data	1
Opinion Papers	1
Reports - Descriptive	1
More ▼

Education Level

Higher Education	12
Postsecondary Education	9
Elementary Secondary Education	8
Secondary Education	4
Adult Education	3
Elementary Education	2
High Schools	2
Middle Schools	2
Grade 1	1
Grade 6	1

Audience

Researchers	3
Practitioners	1
Teachers	1

Location

Florida	3
United Kingdom	3
Asia	2
China	2
Italy	2
Netherlands	2
Ohio	2
Pennsylvania	2
Portugal	2
South Korea	2
Turkey	2
United States	2
Australia	1
Brazil	1
California	1
Canada	1
Canada (Ottawa)	1
Colombia	1
Colombia (Bogota)	1
Connecticut	1
Denmark	1
Egypt	1
Estonia	1
Finland (Helsinki)	1
Germany	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

Child Behavior Checklist	2
Graduate Record Examinations	2
Autism Diagnostic Observation…	1
Behavior Assessment System…	1
Developmental Behavior…	1
MacArthur Communicative…	1
Mullen Scales of Early…	1
Praxis Series	1
SAT (College Admission Test)	1
Test of English as a Foreign…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 59 results Save | Export

Resolving and Re-Scoring Constructed Response Items in Mixed-Format Assessments: An Exploration of Three Approaches

Peer reviewed

Direct link

Stefanie A. Wind; Yangmeng Xu – Educational Assessment, 2024

We explored three approaches to resolving or re-scoring constructed-response items in mixed-format assessments: rater agreement, person fit, and targeted double scoring (TDS). We used a simulation study to consider how the three approaches impact the psychometric properties of student achievement estimates, with an emphasis on person fit. We found…

Descriptors: Interrater Reliability, Error of Measurement, Evaluation Methods, Examiners

The Whole Is More than the Sum of Its Parts -- Assessing Writing Using the Consensual Assessment Technique

Peer reviewed

Direct link

Zahn, Daniela; Canton, Ursula; Boyd, Victoria; Hamilton, Laura; Mamo, Josianne; McKay, Jane; Proudfoot, Linda; Telfer, Dickson; Williams, Kim; Wilson, Colin – Studies in Higher Education, 2021

Evaluating the impact of Academic Literacies teaching (Lea and Street [1998. "Student Writing in Higher Education: An Academic Literacies Approach." "Studies in Higher Education" 23 (2): 157-72. doi:10.1080/03075079812331380364]) is difficult, as it involves gauging whether writers: (1) gain better understanding of what…

Descriptors: Writing Evaluation, Evaluation Methods, Undergraduate Students, Foreign Countries

A Unified Approach to Estimating the Intraclass Correlation Coefficient and Its Bias: An Exploratory Study

Direct link

Kelvin Terrell Pompey – ProQuest LLC, 2021

Many methods are used to measure interrater reliability for studies where each target receives ratings by a different set of judges. The purpose of this study is to explore the use of hierarchical modeling for estimating interrater reliability using the intraclass correlation coefficient. This study provides a description of how the ICC can be…

Descriptors: Interrater Reliability, Evaluation Methods, Test Reliability, Correlation

Automated Assessment of Second Language Comprehensibility: Review, Training, Validation, and Generalization Studies

Peer reviewed

Direct link

Saito, Kazuya; Macmillan, Konstantinos; Kachlicka, Magdalena; Kunihara, Takuya; Minematsu, Nobuaki – Studies in Second Language Acquisition, 2023

Whereas many scholars have emphasized the relative importance of "comprehensibility" as an ecologically valid goal for L2 speech training, testing, and development, eliciting listeners' judgments is time-consuming. Following calls for research on more efficient L2 speech rating methods in applied linguistics, and growing attention toward…

Descriptors: Second Language Learning, Second Language Instruction, Interrater Reliability, Speech Communication

Assessment of Interrater and Intermethod Agreement in the Kinesiology Literature

Peer reviewed

Direct link

Looney, Marilyn A. – Measurement in Physical Education and Exercise Science, 2018

The purpose of this article was two-fold (1) provide an overview of the commonly reported and under-reported absolute agreement indices in the kinesiology literature for continuous data; and (2) present examples of these indices for hypothetical data along with recommendations for future use. It is recommended that three types of information be…

Descriptors: Interrater Reliability, Evaluation Methods, Kinetics, Indexes

Assessing Language in Unstructured Conversation in People with Aphasia: Methods, Psychometric Integrity, Normative Data, and Comparison to a Structured Narrative Task

Peer reviewed

Direct link

Leaman, Marion C.; Edmonds, Lisa A. – Journal of Speech, Language, and Hearing Research, 2021

Purpose: This study evaluated interrater reliability (IRR) and test-retest stability (TRTS) of seven linguistic measures (percent correct information units, relevance, subject-verb-[object], complete utterance, grammaticality, referential cohesion, global coherence), and communicative success in unstructured conversation and in a story narrative…

Descriptors: Aphasia, Psychometrics, Correlation, Speech Language Pathology

Development of a Novel Tool for Assessing Coverage of Implementation Factors in Health Promotion Program Resources

Peer reviewed
PDF on ERIC

Download full text

Direct link

Bejarano, Carolina M.; Snow, Kelli; Lane, Hannah; Calvert, Hannah; Hoppe, Kate; Alfonsin, Nicole; Turner, Lindsey; Carlson, Jordan A. – Grantee Submission, 2019

Purpose: This study presents a novel methodology/process for assessing inclusion of theoretically-based implementation factors within available adoption-ready health promotion programs. Methods: Classroom-based physical activity (CBPA) programs were used as an example to describe the process. Our team selected an implementation science framework…

Descriptors: Evaluation Methods, Program Evaluation, Health Promotion, Physical Activity Level

Applying Kane's Validity Framework to a Simulation Based Assessment of Clinical Competence

Peer reviewed

Direct link

Tavares, Walter; Brydges, Ryan; Myre, Paul; Prpic, Jason; Turner, Linda; Yelle, Richard; Huiskamp, Maud – Advances in Health Sciences Education, 2018

Assessment of clinical competence is complex and inference based. Trustworthy and defensible assessment processes must have favourable evidence of validity, particularly where decisions are considered high stakes. We aimed to organize, collect and interpret validity evidence for a high stakes simulation based assessment strategy for certifying…

Descriptors: Competence, Simulation, Allied Health Personnel, Certification

Effect of Quality Characteristics of Peer Raters on Rating Errors in Peer Assessment

Peer reviewed

Direct link

Guo, Xiuyan; Lei, Pui-Wa – International Journal of Testing, 2020

Little research has been done on the effects of peer raters' quality characteristics on peer rating qualities. This study aims to address this gap and investigate the effects of key variables related to peer raters' qualities, including content knowledge, previous rating experience, training on rating tasks, and rating motivation. In an experiment…

Descriptors: Peer Evaluation, Error Patterns, Correlation, Knowledge Level

The Counseling Competencies Scale: Validation and Refinement

Peer reviewed

Direct link

Lambie, Glenn W.; Mullen, Patrick R.; Swank, Jacqueline M.; Blount, Ashley – Measurement and Evaluation in Counseling and Development, 2018

Supervisors evaluated counselors-in-training at multiple points during their practicum experience using the Counseling Competencies Scale (CCS; N = 1,070). The CCS evaluations were randomly split to conduct exploratory factor analysis and confirmatory factor analysis, resulting in a 2-factor model (61.5% of the variance explained).

Descriptors: Counselor Training, Counseling, Measures (Individuals), Competence

Assessing Students' Social and Emotional Skills through Triangulation of Assessment Methods. OECD Education Working Papers, No. 208

Direct link

Kankaraš, Miloš; Feron, Eva; Renbarger, Rachel – OECD Publishing, 2019

Triangulation -- a combined use of different assessment methods or sources to evaluate psychological constructs -- is still a rarely used assessment approach in spite of its potential in overcoming inherent constraints of individual assessment methods. This paper uses field test data from a new OECD Study on Social and Emotional Skills to examine…

Descriptors: Interpersonal Competence, Emotional Intelligence, Evaluation Methods, Student Evaluation

Functional Adequacy in L2 Writing: Towards a New Rating Scale

Peer reviewed

Direct link

Kuiken, Folkert; Vedder, Ineke – Language Testing, 2017

The importance of functional adequacy as an essential component of L2 proficiency has been observed by several authors (Pallotti, 2009; De Jong, Steinel, Florijn, Schoonen, & Hulstijn, 2012a, b). The rationale underlying the present study is that the assessment of writing proficiency in L2 is not fully possible without taking into account the…

Descriptors: Second Language Learning, Rating Scales, Computational Linguistics, Persuasive Discourse

Discrepancies between Students' and Teachers' Ratings of Instructional Practice: A Way to Measure Classroom Intuneness and Evaluate Teaching Quality

Direct link

Dockterman, Daniel Milo – ProQuest LLC, 2017

Student surveys have gained prominence in recent years as a way to give students a voice in their learning process, and teacher self-reports have always been an effective instrument for revealing the planning, intentions, and expectations behind a given lesson. Though student and teacher surveys are widely used, extant research in education has…

Descriptors: Outcome Measures, Teacher Evaluation, Student Evaluation of Teacher Performance, Evaluation Methods

Examining the Reliability of Scores from the Consensual Assessment Technique in the Measurement of Individual and Small Group Creativity

Peer reviewed

Direct link

Stefanic, Nicholas; Randles, Clint – Music Education Research, 2015

The purpose of this study was to explore the reliability of measures of both individual and group creative work using the consensual assessment technique (CAT). CAT was used to measure individual and group creativity among a population of pre-service music teachers enrolled in a secondary general music class (n = 23) and was evaluated from…

Descriptors: Music Education, Creativity, Preservice Teachers, Music Teachers

Does the Brief Observation of Social Communication Change Help Moving Forward in Measuring Change in Early Autism Intervention Studies?

Peer reviewed

Direct link

Pijl, Mirjam K. J.; Rommelse, Nanda N. J.; Hendriks, Monica; De Korte, Manon W. P.; Buitelaar, Jan K.; Oosterling, Iris J. – Autism: The International Journal of Research and Practice, 2018

The field of early autism research is in dire need of outcome measures that adequately reflect subtle changes in core autistic behaviors. This article compares the ability of a newly developed measure, the Brief Observation of Social Communication Change (BOSCC), and the Autism Diagnostic Observation Schedule (ADOS) to detect changes in core…

Descriptors: Intervention, Autism, Interpersonal Communication, Interrater Reliability

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4

Advances in Health Sciences…	2
Educational and Psychological…	2
ProQuest LLC	2
Action in Teacher Education	1
Applied Measurement in…	1
Assessing Writing	1
Assessment for Effective…	1
Autism: The International…	1
Behavioral Disorders	1
Canadian Modern Language…	1
Computers & Education	1
Counselor Education and…	1
Developmental Medicine &…	1
ETS Research Report Series	1
Education Canada	1
Education and Information…	1
Educational Assessment	1
Educational Research and…	1
Gerontologist	1
Grantee Submission	1
International Association for…	1
International Journal of…	1
Journal of Applied Research…	1
Journal of Asynchronous…	1
Journal of Autism and…	1
More ▼

Lambie, Glenn W.	2
Swank, Jacqueline M.	2
A. C., John	1
Alfonsin, Nicole	1
Bavier, Richard	1
Beech, Anthony	1
Bejarano, Carolina M.	1
Bergstrom, H.	1
Bertelli, Marco	1
Bianco, Annamaria	1
Bielefeldt, Talbot	1
Blount, Ashley	1
Boyd, Victoria	1
Bradshaw, Helen	1
Bridgeman, Brent	1
Brunosson, A.	1
Brydges, Ryan	1
Buitelaar, Jan K.	1
Cahill, Louise M.	1
Calvert, Hannah	1
Canton, Ursula	1
Carlson, Jordan A.	1
Carlson, Sybil B.	1
Carran, Deborah	1
More ▼