ERIC - Search Results

Showing 706 to 720 of 3,126 results

Autism at a Glance: A Pilot Study Optimizing Thin-Slice Observations

Peer reviewed

Hampton, Lauren H.; Curtis, Philip R.; Roberts, Megan Y. – Autism: The International Journal of Research and Practice, 2019

Borrowing from a clinical psychology observational methodology, thin-slice observations were used to assess autism characteristics in toddlers. Thin-slices are short observations taken from a longer behavior stream which are assigned ratings by multiple raters using a 5-point scale. The raters' observations are averaged together to assign a…

Descriptors: Autism, Pervasive Developmental Disorders, Observation, Toddlers

A Comparison of Rubrics and Graded Category Rating Scales with Various Methods Regarding Raters' Reliability

Peer reviewed

Dogan, C. Deha; Uluman, Müge – Educational Sciences: Theory and Practice, 2017

The aim of this study was to determine the extent to which graded-category rating scales and rubrics contribute to inter-rater reliability. The research was designed as a correlational study. The study group consisted of 82 students attending sixth grade and three writing course teachers in a private elementary school. A performance task was…

Descriptors: Comparative Analysis, Scoring Rubrics, Rating Scales, Interrater Reliability

Developing a Machine-Supported Coding System for Constructed-Response Items in PISA. Research Report. ETS RR-17-47

Peer reviewed

Yamamoto, Kentaro; He, Qiwei; Shin, Hyo Jeong; von Davier, Mattias – ETS Research Report Series, 2017

Approximately a third of the Programme for International Student Assessment (PISA) items in the core domains (math, reading, and science) are constructed-response items and require human coding (scoring). This process is time-consuming, expensive, and prone to error as often (a) humans code inconsistently, and (b) coding reliability in…

Descriptors: Foreign Countries, Achievement Tests, International Assessment, Secondary School Students

Reaching a Conclusion--Procedures and Processes of Judgement Formation in School Inspection Teams

Peer reviewed

Dedering, Kathrin; Sowada, Moritz G. – Educational Assessment, Evaluation and Accountability, 2017

School inspections have become an important instrument of quality assurance and quality development in many European countries. So far, the focus of empirical research on school inspections has been on the acceptance of the procedure among the school-internal actors, its influence on internal quality development and its effects on student…

Descriptors: Inspection, Administrative Policy, Administrative Principles, Teamwork

An Analytic Creativity Assessment Scale for Digital Game Story Design: Construct Validity, Internal Consistency and Interrater Reliability

Peer reviewed

Chuang, Tsung-Yen; Huang, Yun-Hsuan – Creativity Research Journal, 2015

Mobile technology has rapidly made digital games a popular form of entertainment for this digital generation, and thus digital game design has received considerable attention in both the game industry and design education. Digital game design involves diverse dimensions, among which digital game story design (DGSD) particularly attracts our interest, as the…

Descriptors: Creativity, Interrater Reliability, Construct Validity, Creativity Tests

Comparing the Effectiveness of Self-Paced and Collaborative Frame-of-Reference Training on Rater Accuracy in a Large-Scale Writing Assessment

Peer reviewed

Raczynski, Kevin R.; Cohen, Allan S.; Engelhard, George, Jr.; Lu, Zhenqiu – Journal of Educational Measurement, 2015

There is a large body of research on the effectiveness of rater training methods in the industrial and organizational psychology literature. Less has been reported in the measurement literature on large-scale writing assessments. This study compared the effectiveness of two widely used rater training methods--self-paced and collaborative…

Descriptors: Interrater Reliability, Writing Evaluation, Training Methods, Pacing

Gauging Item Alignment through Online Systems While Controlling for Rater Effects

Peer reviewed

Anderson, Daniel; Irvin, Shawn; Alonzo, Julie; Tindal, Gerald A. – Educational Measurement: Issues and Practice, 2015

The alignment of test items to content standards is critical to the validity of decisions made from standards-based tests. Generally, alignment is determined based on judgments made by a panel of content experts with either ratings averaged or via a consensus reached through discussion. When the pool of items to be reviewed is large, or the…

Descriptors: Test Items, Alignment (Education), Standards, Online Systems

Comparison of Intelligibility Measures for Adults with Parkinson's Disease, Adults with Multiple Sclerosis, and Healthy Controls

Peer reviewed

Stipancic, Kaila L.; Tjaden, Kris; Wilding, Gregory – Journal of Speech, Language, and Hearing Research, 2016

Purpose: This study obtained judgments of sentence intelligibility using orthographic transcription for comparison with previously reported intelligibility judgments obtained using a visual analog scale (VAS) for individuals with Parkinson's disease and multiple sclerosis and healthy controls (K. Tjaden, J. E. Sussman, & G. E. Wilding, 2014).…

Descriptors: Diseases, Neurological Impairments, Sentences, Measures (Individuals)

Instrument Reporting Practices in Second Language Research

Peer reviewed

Derrick, Deirdre J. – TESOL Quarterly: A Journal for Teachers of English to Speakers of Other Languages and of Standard English as a Second Dialect, 2016

Second language (L2) researchers often have to develop or change the instruments they use to measure numerous constructs (Norris & Ortega, 2012). Given the prevalence of researcher-developed and -adapted data collection instruments, and given the profound effect instrumentation can have on results, thorough reporting of instrumentation is…

Descriptors: Second Language Learning, Language Research, Research Methodology, Interrater Reliability

Applying a Thurstonian, Two-Stage Method in the Standardized Assessment of Writing

Peer reviewed

McGrane, Joshua Aaron; Humphry, Stephen Mark; Heldsinger, Sandra – Applied Measurement in Education, 2018

National standardized assessment programs have increasingly included extended written performances, amplifying the need for reliable, valid, and efficient methods of assessment. This article examines a two-stage method using comparative judgments and calibrated exemplars as a complement and alternative to existing methods of assessing writing.…

Descriptors: Standardized Tests, Foreign Countries, Writing Tests, Writing Evaluation

Generalizability Theory Research on Developing a Scoring Rubric to Assess Primary School Students' Problem Posing Skills

Peer reviewed

Cankoy, Osman; Özder, Hasan – EURASIA Journal of Mathematics, Science & Technology Education, 2017

The aim of this study is to develop a scoring rubric to assess primary school students' problem posing skills. A rubric with five dimensions, namely solvability, reasonability, mathematical structure, context, and language, was used. The raters scored the students' problem posing skills both with and without the scoring rubric to test the…

Descriptors: Generalizability Theory, Elementary School Students, Foreign Countries, Problem Solving

American Progressive Education and the Schooling of Poor Children: A Brief History of a Philosophy in Practice

Peer reviewed

Garte, Rebecca – International Journal of Progressive Education, 2017

This paper provides a historical analysis of the past century of progressive education, within the general socio-political context of schooling within the US. The purpose of this review is to create a social, historical and philosophical context for understanding the current narrative of progressive education that exists in educational policy…

Descriptors: Progressive Education, Educational History, Educational Practices, Philosophy

Psychometrics and Validation of a Brief Rating Measure of Parent-Infant Interaction: Manchester Assessment of Caregiver-Infant Interaction

Peer reviewed

Wan, Ming Wai; Brooks, Ami; Green, Jonathan; Abel, Kathryn; Elmadih, Alya – International Journal of Behavioral Development, 2017

This study investigated the psychometrics of a recently developed global rating measure of videotaped parent-infant interaction, the "Manchester Assessment of Caregiver-Infant Interaction" (MACI), in a normative sample. Inter-rater reliability, stability over time, and convergent and discriminant validity were tested. Six-minute play…

Descriptors: Rating Scales, Parent Child Relationship, Infants, Interaction

The Complexity of Teacher Questions in Chemistry Classrooms: An Empirical Analysis on the Basis of Two Competence Models

Peer reviewed

Nehring, Andreas; Päßler, Andreas; Tiemann, Rüdiger – International Journal of Science and Mathematics Education, 2017

With regard to the moderate performance of German students in international large-scale assessments, one branch of German science education research is concerned with the construction and evaluation of competence models. Based on the theory-driven definition of competence levels, these models imply a correlation between the complexity of a…

Descriptors: Foreign Countries, Science Education, Chemistry, Science Teachers

Comparison Study of Judged Clinical Skills Competence from Standard Setting Ratings Generated under Different Administration Conditions

Peer reviewed

Roberts, William L.; Boulet, John; Sandella, Jeanne – Advances in Health Sciences Education, 2017

When the safety of the public is at stake, it is particularly relevant for licensing and credentialing exam agencies to use defensible standard setting methods to categorize candidates into competence categories (e.g., pass/fail). The aim of this study was to gather evidence to support change to the Comprehensive Osteopathic Medical Licensing-USA…

Descriptors: Standard Setting, Comparative Analysis, Clinical Experience, Skill Analysis
