ERIC - Search Results

Publication Date

In 2026	0
Since 2025	58
Since 2022 (last 5 years)	284
Since 2017 (last 10 years)	780
Since 2007 (last 20 years)	2042

Descriptor

Interrater Reliability	3124
Foreign Countries	655
Test Reliability	503
Evaluation Methods	502
Test Validity	410
Correlation	401
Scoring	347
Comparative Analysis	327
Scores	324
Validity	310
Student Evaluation	308
Measures (Individuals)	298
Evaluators	295
Rating Scales	282
Statistical Analysis	268
Higher Education	264
Psychometrics	241
Reliability	231
Observation	229
Scoring Rubrics	216
Test Construction	212
English (Second Language)	211
Teaching Methods	208
Writing Evaluation	206
Intervention	200
More ▼

Education Level

Higher Education	574
Postsecondary Education	420
Elementary Education	282
Secondary Education	180
Early Childhood Education	145
Elementary Secondary Education	120
Middle Schools	109
High Schools	86
Preschool Education	72
Junior High Schools	65
Adult Education	59
Primary Education	57
Kindergarten	45
Grade 4	41
Grade 5	40
Intermediate Grades	40
Grade 1	37
Grade 6	35
Grade 8	32
Grade 3	31
Grade 2	27
Grade 7	27
Grade 10	13
Grade 9	11
Two Year Colleges	8
More ▼

Audience

Researchers	130
Practitioners	42
Teachers	22
Administrators	11
Counselors	3
Policymakers	2

Location

Australia	56
Turkey	53
United Kingdom	46
Canada	45
Netherlands	40
China	38
California	37
United States	30
United Kingdom (England)	25
Taiwan	23
Germany	22
Japan	22
Pennsylvania	22
Florida	21
Sweden	21
Iran	19
North Carolina	19
Hong Kong	17
South Korea	17
Texas	17
Georgia	16
Israel	15
New Zealand	14
South Africa	14
Washington	14
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001	13
Individuals with Disabilities…	7
Elementary and Secondary…	3
Race to the Top	3
Elementary and Secondary…	2
American Recovery and…	1
Americans with Disabilities…	1
Education Consolidation…	1
Education for All Handicapped…	1
Every Student Succeeds Act…	1
Improving Americas Schools…	1
Individuals with Disabilities…	1
Individuals with Disabilities…	1
Pell Grant Program	1
Rehabilitation Act 1973…	1
Stewart B McKinney Homeless…	1
Temporary Assistance for…	1
More ▼

What Works Clearinghouse Rating

Meets WWC Standards without Reservations	3
Meets WWC Standards with or without Reservations	3
Does not meet standards	3

Showing 2,206 to 2,220 of 3,124 results Save | Export

On the Reliability of Meta-Analytic Reviews: The Role of Intercoder Agreement.

Peer reviewed

Yeaton, William H.; Wortman, Paul M. – Evaluation Review, 1993

Current practices of reporting a single mean intercoder agreement in meta-analysis leads to systematic bias and overestimates reliability. An alternative is recommended in which average intercoder agreement statistics are calculated within clusters of coded variables. Two studies of intercoder agreement illustrate the model. (SLD)

Descriptors: Coding, Decision Making, Estimation (Mathematics), Interrater Reliability

Teachers' Evaluation of Correlational Reasoning Skills.

Peer reviewed

Cousins, J. Bradley; And Others – Alberta Journal of Educational Research, 1993

Two experiments studied teachers' proficiency in assessing students' higher order thinking skills. After training alone or after training plus implementation of an instructional unit on correlational thinking, teacher ratings of student samples did not correspond highly with an expert's assessment although they showed sensitivity to student age…

Descriptors: Elementary Secondary Education, Evaluation Problems, Evaluators, Interrater Reliability

Behavior States and a Half-Full Glass: A Response to Mudford, Hogg, and Roberts.

Arthur, Michael – American Journal on Mental Retardation, 2000

In this response to critiques (Mudford, Hogg and Roberts 1997, 1999) of the use of behavior states in research involving individuals with mental retardation, it is argued that the work on behavioral state analysis by Robert D. Guess has contributed to the field at the practical, empirical, and theoretical levels. (Contains references.) (CR)

Descriptors: Adults, Behavior Patterns, Children, Evaluation Methods

Investigating Rater/Prompt Interactions in Writing Assessment: Quantitative and Qualitative Approaches.

Peer reviewed

Weigle, Sara Cushing – Assessing Writing, 1999

Investigates how experienced and inexperienced raters score essays written by English as a Second Language (ESL) students on two different prompts. Shows that the inexperienced raters were more severe than the experienced raters on one prompt but not on the other prompt, and that differences between the two groups of raters were eliminated…

Descriptors: Elementary Secondary Education, English (Second Language), Evaluation Research, Evaluators

Reliability of Essay Rating and Score Adjustment.

Peer reviewed

Longford, N. T. – Journal of Educational and Behavioral Statistics, 1994

Presents a model-based approach to rater reliability for essays read by multiple raters. The approach is motivated by generalizability theory, and variation of rater severity and rater inconsistency is considered in the presence of between-examinee variations. Illustrates methods with data from standardized educational tests. (Author/SLD)

Descriptors: Educational Testing, Essay Tests, Generalizability Theory, Interrater Reliability

Are Caregivers Reports of Motivation Valid? Reliability and Validity of the Reiss Profile MRDD

Peer reviewed

Direct link

Lecavalier, L.; Havercamp, S. M. – Journal of Intellectual Disability Research, 2004

Sensitivity theory proposes that there are wide individual differences in what motivates people with intellectual disability. The Reiss Profile MRDD is a rating scale that measures 15 fundamental motives. This study examined the internal consistency and interrater reliability of the 15 subscales as well as the validity of motivational profiles.…

Descriptors: Profiles, Caregivers, Validity, Rating Scales

Postscript: Differences between the Causal Powers Theory and the Power PC Theory

Peer reviewed

Direct link

White, Peter A. – Psychological Review, 2005

Comments on the response offered by Cheng and Novick to White's initial comments on Cheng's and Cheng and Novick's previous articles. White asks if regularity information necessary for causal learning. He and Cheng and Novick agree that the causal relation is understood as a generative relation, but disagree on how this understanding comes about.…

Descriptors: Differences, Review (Reexamination), Interrater Reliability, Error Correction

Making Change Visible: The Possibilities in Assessing Mental Health Counseling Outcomes

Peer reviewed

Direct link

Leibert, Todd W. – Journal of Counseling & Development, 2006

The product of mental health counseling, unlike that of most professions, remains invisible to most people, leaving counselors vulnerable in a competitive market. The author argues that clinicians should recognize the value of, understand, and begin using outcome measures in their work. Research focusing on critical problems in psychotherapy…

Descriptors: Mental Health, Outcomes of Treatment, Counseling, Measures (Individuals)

In the Eye of the Beholder: Reply to Wilson and Shadish (2006) and Radin, Nelson, Dobyns, and Houtkooper (2006)

Peer reviewed

Direct link

Bosch, Holger; Steinkamp, Fiona; Boller, Emil – Psychological Bulletin, 2006

H. Bosch, F. Steinkamp, and E. Boller's (see record 2006-08436-001) meta-analysis, which demonstrated (a) a small but highly significant overall effect, (b) a small-study effect, and (c) extreme heterogeneity, has provoked widely differing responses. After considering D. B. Wilson and W. R. Shadish's (see record 2006-08436-002) and D. Radin, R.…

Descriptors: Meta Analysis, Publications, Bias, Models

A Reliability Study of BDAE-3 Discourse Coding

Peer reviewed

Direct link

Powell, Thomas W. – Clinical Linguistics & Phonetics, 2006

The third edition of the "Boston Diagnostic Aphasia Examination" (Goodglass, Kaplan, and Barresi) introduced standardized procedures for coding discourse samples elicited using the well known Cookie Theft illustration. To evaluate the reliability of this discourse coding procedure, a transcribed sample was coded by 14 novice examiners…

Descriptors: Examiners, Interrater Reliability, Test Reliability, Aphasia

The Assessment of Information Literacy: A Case Study. Research Report. ETS RR-08-33

Peer reviewed
PDF on ERIC

Download full text

Katz, Irvin R.; Elliot, Norbert; Attali, Yigal; Scharf, Davida; Powers, Donald; Huey, Heather; Joshi, Kamal; Briller, Vladimir – ETS Research Report Series, 2008

This study presents an investigation of information literacy as defined by the ETS iSkills™ assessment and by the New Jersey Institute of Technology (NJIT) Information Literacy Scale (ILS). As two related but distinct measures, both iSkills and the ILS were used with undergraduate students at NJIT during the spring 2006 semester. Undergraduate…

Descriptors: Information Literacy, Information Skills, Skill Analysis, Case Studies

The Impact of Two Professional Development Interventions on Early Reading Instruction and Achievement. NCEE 2008-4030

Peer reviewed
PDF on ERIC

Download full text

Garet, Michael S.; Cronen, Stephanie; Eaton, Marian; Kurki, Anja; Ludwig, Meredith; Jones, Wehmah; Uekawa, Kazuaki; Falk, Audrey; Bloom, Howard S.; Doolittle, Fred; Zhu, Pei; Sztejnberg, Laura – National Center for Education Evaluation and Regional Assistance, 2008

To help states and districts make informed decisions about the professional development (PD) they implement to improve reading instruction, the U.S. Department of Education commissioned the Early Reading PD Interventions Study to examine the impact of two research-based PD interventions for reading instruction: (1) a content-focused teacher…

Descriptors: Early Reading, Reading Instruction, Professional Development, Intervention

Objective Standard Setting for Judge-Mediated Examinations

Peer reviewed

Direct link

Stone, Gregory Ethan; Beltyukova, Svetlana; Fox, Christine M. – International Journal of Testing, 2008

Judge-mediated examinations are defined as those for which expert evaluation (using rubrics) is required to determine correctness, completeness, and reasonability of test-taker responses. The use of multifaceted Rasch modeling has led to improvements in the reliability of scoring such examinations. The establishment of criterion-referenced…

Descriptors: Interrater Reliability, High Stakes Tests, Standard Setting, Minimum Competencies

The Validity and Reliability of a Performance Assessment Procedure in Ice Hockey

Peer reviewed

Direct link

Nadeau, Luc; Richard, Jean-Francois; Godbout, Paul – Physical Education and Sport Pedagogy, 2008

Background: Coaches and physical educators must obtain valid data relating to the contribution of each of their players in order to assess their level of performance in team sport competition. This information must also be collected and used in real game situations to be more valid. Developed initially for a physical education class context, the…

Descriptors: Physical Education, Team Sports, Observation, Performance Based Assessment

Misunderstandings, Agreements, and Disagreements: Toward a Cumulative Science of Reproducibly Superior Aspects of Giftedness

Peer reviewed

Direct link

Ericsson, K. Anders; Roring, Roy W.; Nandagopal, Kiruthiga – High Ability Studies, 2007

The authors are pleased with commentators' willingness to respond to their target article's challenge to identify observable reproducible phenomena that could be widely accepted as strong scientific evidence for innate talent. In this reply, the authors have organized the ideas in the commentaries into three general categories, namely the…

Descriptors: Interrater Reliability, Reader Response, Rote Learning, Creative Thinking

« Previous Page | Next Page »

Pages: 1 | ... | 144 | 145 | 146 | 147 | 148 | 149 | 150 | 151 | 152 | ... | 209

ProQuest LLC	86
Journal of Speech, Language,…	62
Educational and Psychological…	61
Journal of Autism and…	56
Grantee Submission	40
Language Testing	39
Online Submission	35
International Journal of…	34
Assessment & Evaluation in…	33
Research in Developmental…	31
Applied Measurement in…	28
Advances in Health Sciences…	26
Assessment for Effective…	26
ETS Research Report Series	25
Journal of Educational…	25
Educational Measurement:…	23
Measurement in Physical…	20
Language Assessment Quarterly	19
Psychology in the Schools	19
Topics in Early Childhood…	19
Psychological Assessment	18
Educational Assessment	16
Autism: The International…	15
Journal of Consulting and…	15
Personnel Psychology	15
More ▼

Lunz, Mary E.	10
Wind, Stefanie A.	10
Engelhard, George, Jr.	8
Epstein, Michael H.	8
Ingham, Roger J.	8
Johnson, Evelyn S.	8
Matson, Johnny L.	7
McLeod, Bryce D.	7
Moylan, Laura A.	7
Cason, Carolyn L.	6
Cordes, Anne K.	6
Jaeger, Richard M.	6
Johnson, Robert L.	6
Lecavalier, Luc	6
Plake, Barbara S.	6
Tasse, Marc J.	6
Wyse, Adam E.	6
Zheng, Yuzhu	6
Aman, Michael G.	5
Barton, Erin E.	5
Cason, Gerald J.	5
Coniam, David	5
Conroy, Maureen A.	5
Crawford, Angela R.	5
More ▼

Journal Articles	2555
Reports - Research	2243
Reports - Evaluative	515
Speeches/Meeting Papers	272
Reports - Descriptive	163
Tests/Questionnaires	162
Information Analyses	130
Dissertations/Theses -…	89
Opinion Papers	61
Numerical/Quantitative Data	31
Guides - Non-Classroom	11
Books	7
Collected Works - General	3
Guides - Classroom - Teacher	3
Non-Print Media	3
Book/Product Reviews	2
Collected Works - Serials	2
Dissertations/Theses	2
ERIC Digests in Full Text	2
ERIC Publications	2
Guides - General	2
Reports - General	2
Collected Works - Proceedings	1
Reference Materials -…	1
Reference Materials - General	1
More ▼

Test of English as a Foreign…	30
Child Behavior Checklist	18
National Assessment of…	14
Vineland Adaptive Behavior…	14
Autism Diagnostic Observation…	13
Strengths and Difficulties…	11
Woodcock Johnson Tests of…	10
Peabody Picture Vocabulary…	9
SAT (College Admission Test)	9
Wechsler Intelligence Scale…	9
Behavior Assessment System…	8
Dynamic Indicators of Basic…	8
Early Childhood Environment…	8
Graduate Record Examinations	8
International English…	7
Teacher Performance…	6
ACT Assessment	5
Advanced Placement…	5
Behavioral and Emotional…	5
Childhood Autism Rating Scale	5
Classroom Assessment Scoring…	5
Conners Teacher Rating Scale	5
Draw a Person Test	5
Raven Progressive Matrices	5
ACTFL Oral Proficiency…	4
More ▼