ERIC - Search Results

Publication Date

In 2026	0
Since 2025	60
Since 2022 (last 5 years)	286
Since 2017 (last 10 years)	782
Since 2007 (last 20 years)	2044

Descriptor

Interrater Reliability	3126
Foreign Countries	655
Test Reliability	504
Evaluation Methods	503
Test Validity	411
Correlation	401
Scoring	347
Comparative Analysis	327
Scores	324
Validity	310
Student Evaluation	308
Measures (Individuals)	298
Evaluators	295
Rating Scales	282
Statistical Analysis	268
Higher Education	264
Psychometrics	242
Reliability	231
Observation	229
Scoring Rubrics	217
Test Construction	213
English (Second Language)	211
Teaching Methods	208
Writing Evaluation	206
Intervention	200

Education Level

Higher Education	574
Postsecondary Education	420
Elementary Education	282
Secondary Education	180
Early Childhood Education	145
Elementary Secondary Education	120
Middle Schools	109
High Schools	86
Preschool Education	72
Junior High Schools	65
Adult Education	59
Primary Education	57
Kindergarten	45
Grade 4	41
Grade 5	40
Intermediate Grades	40
Grade 1	37
Grade 6	35
Grade 8	32
Grade 3	31
Grade 2	27
Grade 7	27
Grade 10	13
Grade 9	11
Two Year Colleges	8

Audience

Researchers	130
Practitioners	42
Teachers	22
Administrators	11
Counselors	3
Policymakers	2

Location

Australia	56
Turkey	53
United Kingdom	46
Canada	45
Netherlands	40
China	38
California	37
United States	30
United Kingdom (England)	25
Taiwan	23
Germany	22
Japan	22
Pennsylvania	22
Florida	21
Sweden	21
Iran	19
North Carolina	19
Hong Kong	17
South Korea	17
Texas	17
Georgia	16
Israel	15
New Zealand	14
South Africa	14
Washington	14

Laws, Policies, & Programs

No Child Left Behind Act 2001	13
Individuals with Disabilities…	7
Elementary and Secondary…	3
Race to the Top	3
Elementary and Secondary…	2
American Recovery and…	1
Americans with Disabilities…	1
Education Consolidation…	1
Education for All Handicapped…	1
Every Student Succeeds Act…	1
Improving Americas Schools…	1
Individuals with Disabilities…	1
Individuals with Disabilities…	1
Pell Grant Program	1
Rehabilitation Act 1973…	1
Stewart B McKinney Homeless…	1
Temporary Assistance for…	1

What Works Clearinghouse Rating

Meets WWC Standards without Reservations	3
Meets WWC Standards with or without Reservations	3
Does not meet standards	3

Showing 511 to 525 of 3,126 results

Inter-Rater Agreement in Assigning Cognitive Demand to Life Sciences Examination Questions

Peer reviewed

Dempster, Edith R.; Kirby, Nicola F. – Perspectives in Education, 2018

Taxonomies of cognitive demand are frequently used to ensure that assessment tasks include questions ranging from low to high cognitive demand. This paper investigates inter-rater agreement among four evaluators on the cognitive demand of the South African National Senior Certificate Life Sciences examinations after training, practice and…

Descriptors: Interrater Reliability, Biological Sciences, Cognitive Processes, Test Items

Investigating the Validity of Oral Assessment Rater Training Program: A Mixed-Methods Study of Raters' Perceptions and Attitudes before and after Training

Peer reviewed

Bijani, Houman – Cogent Education, 2018

Rater variability has always been identified as an important source of measurement error in performance assessment, especially for oral proficiency tests. Rater training is commonly used as a means for compensating various sources of rater variability and adjusting their assessment quality. However, there is little research regarding the nature of…

Descriptors: Evaluators, Training, Verbal Tests, Interrater Reliability

Rounding in Angoff Ratings

Peer reviewed

Wyse, Adam E. – Practical Assessment, Research & Evaluation, 2018

One common modification to the Angoff standard-setting method is to have panelists round their ratings to the nearest 0.05 or 0.10 instead of 0.01. Several reasons have been offered as to why it may make sense to have panelists round their ratings to the nearest 0.05 or 0.10. In this article, we examine one reason that has been suggested, which is…

Descriptors: Interrater Reliability, Evaluation Criteria, Scoring Formulas, Achievement Rating

Evaluating Special Educators' Classroom Performance: Does Rater "Type" Matter?

Peer reviewed

Lawson, Janelle E.; Cruz, Rebecca A. – Assessment for Effective Intervention, 2018

Classroom observations remain the predominant data source used in teacher evaluations, but little is known about how rater characteristics may affect teachers' scores. For special educators, whose instructional practice requires specialized knowledge and skills, school administrators (i.e., the raters) without experience in special education…

Descriptors: Special Education Teachers, Teacher Evaluation, Interrater Reliability, Administrators

Development and Initial Evaluation of the Measure of Active Supervision and Interaction

Peer reviewed

Collier-Meek, Melissa A.; Johnson, Austin H.; Farrell, Anne F. – Assessment for Effective Intervention, 2018

Implementation of research-based, Tier 1 behavior management strategies can be monitored to provide data-driven feedback and in support of integrity. The "Measure of Active Supervision and Interaction" (MASI) was developed to measure four behavior management practices (i.e., Praise, Correction, References to Behavior Expectations, Active…

Descriptors: Behavior Modification, Test Reliability, Test Validity, Interrater Reliability

A Generalizability Theory Study to Examine Sources of Score Variance in Third-Party Evaluations Used in Decision-Making for Graduate School Admissions. ETS GRE® Board Research Report. ETS GRE®-18-03. ETS RR-18-37

Peer reviewed

McCaffrey, Daniel F.; Oliveri, Maria Elena; Holtzman, Steven – ETS Research Report Series, 2018

Scores from noncognitive measures are increasingly valued for their utility in helping to inform postsecondary admissions decisions. However, their use has presented challenges because of faking, response biases, or subjectivity, which standardized third-party evaluations (TPEs) can help minimize. Analysts and researchers using TPEs, however, need…

Descriptors: Generalizability Theory, Scores, College Admission, Admission Criteria

Applying the Classroom Assessment Scoring System in Classrooms Serving Students with Emotional and Behavioural Disorders

Peer reviewed

Cipriano, Christina; Barnes, Tia N.; Bertoli, Michelle C.; Rivers, Susan E. – Emotional & Behavioural Difficulties, 2018

Students with Emotional and Behavioural Disorders (EBD) have the poorest academic and social outcomes across the general and special education student populations, and are among the most likely to receive instruction in self-contained special education classrooms hallmarked by small teacher-student ratios, frequent transitions, extreme student…

Descriptors: Emotional Disturbances, Behavior Disorders, Self Contained Classrooms, Classroom Environment

Sources of Variance in the Accuracy of Interviewer Observations

Peer reviewed

West, Brady T.; Li, Dan – Sociological Methods & Research, 2019

In face-to-face surveys, interviewer observations are a cost-effective source of paradata for nonresponse adjustment of survey estimates and responsive survey designs. Unfortunately, recent studies have suggested that the accuracy of these observations can vary substantially among interviewers, even after controlling for household-, area-, and…

Descriptors: Observation, Interviews, Error of Measurement, Accuracy

Selecting High Quality Dual Language Texts for Young Children in Multicultural Contexts: A UAE Case

Peer reviewed

Hojeij, Zeina; Dillon, Anne Marie; Perkins, Alecia; Grey, Ian – Issues in Educational Research, 2019

Bilingual literature for children is valuable in encouraging literacy in second language learners. Stories can enhance vocabulary and language abilities, learning encounters, subject content, social aptitude, and other skills in the early reader through text as well as illustrations. This paper explores issues in selecting quality dual language…

Descriptors: Foreign Countries, Multilingual Materials, Childrens Literature, Picture Books

The Validity and Reliability of the "MyJump2" Application to Assess Vertical Jumps in Trained Junior Athletes

Peer reviewed

Rogers, Simon A.; Hassmén, Peter; Hunter, Adam; Alcock, Alison; Crewe, Stewart T.; Strauts, Janina A.; Gilleard, Wendy L.; Weissensteiner, Juanita R. – Measurement in Physical Education and Exercise Science, 2019

This study aimed to assess the validity and reliability of jump assessments using the "MyJump2" application. Eleven junior athletes (15 ± 1.4 years) performed five countermovement (CMJ) and drop jumps (DJ) measured simultaneously by a force platform and "MyJump2." Additionally, intra- and inter-day reliability was assessed over…

Descriptors: Adolescents, Athletes, Measurement Equipment, Handheld Devices

Development of a Novel Tool for Assessing Coverage of Implementation Factors in Health Promotion Program Resources

Peer reviewed

Bejarano, Carolina M.; Snow, Kelli; Lane, Hannah; Calvert, Hannah; Hoppe, Kate; Alfonsin, Nicole; Turner, Lindsey; Carlson, Jordan A. – Grantee Submission, 2019

Purpose: This study presents a novel methodology/process for assessing inclusion of theoretically-based implementation factors within available adoption-ready health promotion programs. Methods: Classroom-based physical activity (CBPA) programs were used as an example to describe the process. Our team selected an implementation science framework…

Descriptors: Evaluation Methods, Program Evaluation, Health Promotion, Physical Activity Level

A Multilevel Factor Analytic Investigation of the Learning-to-Learn Scales: A More Child-Centered Look at Dimensionality

Brumley, Benjamin Pratt – ProQuest LLC, 2019

Children from low-income households are at risk for entering school behind their more economically advantaged peers across major domains of school readiness. The Head Start program represents the federal government's response to these achievement gaps by mandating the use of scientifically based assessments and curricula to provide children with…

Descriptors: School Readiness, Learning Processes, Preschool Children, Measures (Individuals)

Development and Initial Psychometrics of a Generic Treatment Integrity Measure Designed to Assess Practice Elements Targeting Social, Emotional, and Behavioral Outcomes in Early Childhood Settings

Peer reviewed

McLeod, Bryce D.; Sutherland, Kevin S.; Broda, Michael; Granger, Kristen L.; Martinez, Ruben G.; Conroy, Maureen A.; Snyder, Patricia A.; Southam-Gerow, Michael A. – Grantee Submission, 2021

Though treatment integrity measurement is important for research intended to promote social and behavioral outcomes of children at risk for emotional and behavioral disorders (EBDs) in early childhood settings, measurement gaps exist in the field. This paper reports on the development and preliminary psychometric assessment of the treatment…

Descriptors: Psychometrics, Measures (Individuals), Fidelity, Integrity

Kappa and Rater Accuracy: Paradigms and Parameters

Peer reviewed

Conger, Anthony J. – Educational and Psychological Measurement, 2017

Drawing parallels to classical test theory, this article clarifies the difference between rater accuracy and reliability and demonstrates how category marginal frequencies affect rater agreement and Cohen's kappa. Category assignment paradigms are developed: comparing raters to a standard (index) versus comparing two raters to one another…

Descriptors: Interrater Reliability, Evaluators, Accuracy, Statistical Analysis

An Unbiased Estimate of Global Interrater Agreement

Peer reviewed

Cousineau, Denis; Laurencelle, Louis – Educational and Psychological Measurement, 2017

Assessing global interrater agreement is difficult as most published indices are affected by the presence of mixtures of agreements and disagreements. A previously proposed method was shown to be specifically sensitive to global agreement, excluding mixtures, but also negatively biased. Here, we propose two alternatives in an attempt to find what…

Descriptors: Interrater Reliability, Evaluation Methods, Statistical Bias, Accuracy

Author

Lunz, Mary E.	10
Wind, Stefanie A.	10
Engelhard, George, Jr.	8
Epstein, Michael H.	8
Ingham, Roger J.	8
Johnson, Evelyn S.	8
Matson, Johnny L.	7
McLeod, Bryce D.	7
Moylan, Laura A.	7
Cason, Carolyn L.	6
Cordes, Anne K.	6
Jaeger, Richard M.	6
Johnson, Robert L.	6
Lecavalier, Luc	6
Plake, Barbara S.	6
Tasse, Marc J.	6
Wyse, Adam E.	6
Zheng, Yuzhu	6
Aman, Michael G.	5
Barton, Erin E.	5
Cason, Gerald J.	5
Coniam, David	5
Conroy, Maureen A.	5
Crawford, Angela R.	5

Publication Type

Journal Articles	2557
Reports - Research	2245
Reports - Evaluative	515
Speeches/Meeting Papers	272
Reports - Descriptive	163
Tests/Questionnaires	163
Information Analyses	130
Dissertations/Theses -…	89
Opinion Papers	61
Numerical/Quantitative Data	31
Guides - Non-Classroom	11
Books	7
Collected Works - General	3
Guides - Classroom - Teacher	3
Non-Print Media	3
Book/Product Reviews	2
Collected Works - Serials	2
Dissertations/Theses	2
ERIC Digests in Full Text	2
ERIC Publications	2
Guides - General	2
Reports - General	2
Collected Works - Proceedings	1
Reference Materials -…	1
Reference Materials - General	1

Assessments and Surveys

Test of English as a Foreign…	30
Child Behavior Checklist	18
National Assessment of…	14
Vineland Adaptive Behavior…	14
Autism Diagnostic Observation…	13
Strengths and Difficulties…	11
Woodcock Johnson Tests of…	10
Peabody Picture Vocabulary…	9
SAT (College Admission Test)	9
Wechsler Intelligence Scale…	9
Behavior Assessment System…	8
Dynamic Indicators of Basic…	8
Early Childhood Environment…	8
Graduate Record Examinations	8
International English…	7
Teacher Performance…	6
ACT Assessment	5
Advanced Placement…	5
Behavioral and Emotional…	5
Childhood Autism Rating Scale	5
Classroom Assessment Scoring…	5
Conners Teacher Rating Scale	5
Draw a Person Test	5
Raven Progressive Matrices	5
ACTFL Oral Proficiency…	4

Source

ProQuest LLC	86
Journal of Speech, Language,…	62
Educational and Psychological…	61
Journal of Autism and…	57
Grantee Submission	40
Language Testing	39
Online Submission	35
International Journal of…	34
Assessment & Evaluation in…	33
Research in Developmental…	31
Applied Measurement in…	28
Advances in Health Sciences…	26
Assessment for Effective…	26
ETS Research Report Series	25
Journal of Educational…	25
Educational Measurement:…	23
Measurement in Physical…	20
Language Assessment Quarterly	19
Psychology in the Schools	19
Topics in Early Childhood…	19
Psychological Assessment	18
Educational Assessment	16
Autism: The International…	15
Journal of Consulting and…	15
Personnel Psychology	15