ERIC - Search Results

Showing 1 to 15 of 45 results

Towards the Automatic Risk of Bias Assessment on Randomized Controlled Trials: A Comparison of RobotReviewer and Humans

Peer reviewed

Yuan Tian; Xi Yang; Suhail A. Doi; Luis Furuya-Kanamori; Lifeng Lin; Joey S. W. Kwong; Chang Xu – Research Synthesis Methods, 2024

RobotReviewer is a tool for automatically assessing the risk of bias in randomized controlled trials, but there is limited evidence of its reliability. We evaluated the agreement between RobotReviewer and humans regarding the risk of bias assessment based on 1955 randomized controlled trials. The risk of bias in these trials was assessed via two…

Descriptors: Risk, Randomized Controlled Trials, Classification, Robotics

Building the Plane in Flight: Establishing Post Hoc Inter-Rater Reliability Coefficients in an Educational Context. Sage Research Methods Cases Part 2

Albert M. Jimenez; Sally J. Zepeda – Sage Research Methods Cases, 2017

The work presented in this case study results from a study conducted in 2012-2014 examining a newly created teacher evaluation system to determine the inter-rater reliability of the classroom observation instrument. The teacher evaluation system was the result of a partnership between the school district and the university in the same city…

Descriptors: Case Studies, Interrater Reliability, Teacher Evaluation, Observation

The Retrospective Pretest-Posttest Design Redux: On Its Validity as an Alternative to Traditional Pretest-Posttest Measurement

Peer reviewed

Little, Todd D.; Chang, Rong; Gorrall, Britt K.; Waggenspack, Luke; Fukuda, Eriko; Allen, Patricia J.; Noam, Gil G. – International Journal of Behavioral Development, 2020

We revisit the merits of the retrospective pretest-posttest (RPP) design for repeated-measures research. The underutilized RPP method asks respondents to rate survey items twice during the same posttest measurement occasion from two specific frames of reference: "now" and "then." Individuals first report their current attitudes…

Descriptors: Pretesting, Alternative Assessment, Program Evaluation, Evaluation Methods

Motivation and Engagement in the United States, Canada, United Kingdom, Australia, and China: Testing a Multi-Dimensional Framework

Peer reviewed

Martin, Andrew J.; Yu, Kai; Papworth, Brad; Ginns, Paul; Collie, Rebecca J. – Journal of Psychoeducational Assessment, 2015

This study explored motivation and engagement among North American (the United States and Canada; n = 1,540), U.K. (n = 1,558), Australian (n = 2,283), and Chinese (n = 3,753) secondary school students. Motivation and engagement were assessed via students' responses to the Motivation and Engagement Scale-High School (MES-HS). Confirmatory factor…

Descriptors: Foreign Countries, Motivation, Learner Engagement, Secondary School Students

A Comparison of Affect Ratings Obtained with Ecological Momentary Assessment and the Day Reconstruction Method

Peer reviewed

Dockray, Samantha; Grant, Nina; Stone, Arthur A.; Kahneman, Daniel; Wardle, Jane; Steptoe, Andrew – Social Indicators Research, 2010

Measurement of affective states in everyday life is of fundamental importance in many types of quality of life, health, and psychological research. Ecological momentary assessment (EMA) is the recognized method of choice, but the respondent burden can be high. The day reconstruction method (DRM) was developed by Kahneman and colleagues ("Science,"…

Descriptors: Employed Women, Quality of Life, Evaluation Methods, Psychological Patterns

Modern Robust Statistical Methods: An Easy Way to Maximize the Accuracy and Power of Your Research

Peer reviewed

Erceg-Hurn, David M.; Mirosevich, Vikki M. – American Psychologist, 2008

Classic parametric statistical significance tests, such as analysis of variance and least squares regression, are widely used by researchers in many disciplines, including psychology. For classic parametric tests to produce accurate results, the assumptions underlying them (e.g., normality and homoscedasticity) must be satisfied. These assumptions…

Descriptors: Statistical Significance, Least Squares Statistics, Effect Size, Statistical Studies

Toward Developing a Science of Treatment Integrity: Introduction to the Special Series

Peer reviewed

Hagermoser Sanetti, Lisa M.; Kratochwill, Thomas R. – School Psychology Review, 2009

Treatment integrity (also referred to as "treatment fidelity," "intervention integrity," and "procedural reliability") is an important methodological concern in both research and practice because treatment integrity data are essential to making valid conclusions regarding treatment outcomes. Despite its relationship to validity, treatment…

Descriptors: Intervention, Research Methodology, Models, Validity

Individual Differences in Voice Quality Perception.

Peer reviewed

Kreiman, Jody; And Others – Journal of Speech and Hearing Research, 1992

Sixteen listeners (10 expert, 6 naive) judged the dissimilarity of pairs of voices drawn from pathological and normal populations. Only parameters that showed substantial variability were perceptually salient across listeners. Results suggest that traditional means of assessing listener reliability in voice perception tasks may not be appropriate.…

Descriptors: Evaluation Methods, Individual Differences, Interrater Reliability, Perception

Conceptual and Methodological Issues in Treatment Integrity Measurement

Peer reviewed

McLeod, Bryce D.; Southam-Gerow, Michael A.; Weisz, John R. – School Psychology Review, 2009

This special series focused on treatment integrity in the child mental health and education field is timely. The articles do a laudable job of reviewing (a) the current status of treatment integrity research and measurement, (b) existing conceptual models of treatment integrity, and (c) the limitations of prior research. Overall, this thoughtful…

Descriptors: Evaluation Research, Children, Intervention, Research Methodology

The Missing Middle in Validation Research

Peer reviewed

Taylor, Erwin K.; Griess, Thomas – Personnel Psychology, 1976

In most selection validation research, only the upper and lower tails of the criterion distribution are used, often yielding misleading or incorrect results. Provides formulas and tables which enable the researcher to account more accurately for the distribution of criterion within the middle range of population. (Author/RW)

Descriptors: Evaluation Methods, Measurement Techniques, Predictive Validity, Reliability

Sample Size Determinations for the Two Rater Kappa Statistic.

Peer reviewed

Flack, Virginia F.; And Others – Psychometrika, 1988

A method is presented for determining sample size that will achieve a pre-specified bound on confidence interval width for the interrater agreement measure "kappa." The same results can be used when a pre-specified power is desired for testing hypotheses about the value of kappa. (Author/SLD)

Descriptors: Evaluation Methods, Interrater Reliability, Research Methodology, Research Problems

Can Appraisers Rate Work Performance Accurately?

Hedge, Jerry W.; Laue, Frances J. – 1988

The ability of individuals to make accurate judgments about others is examined and literature on this subject is reviewed. A wide variety of situational factors affects the appraisal of performance. It is generally accepted that the purpose of the appraisal influences the accuracy of the appraiser. The instrumentation, or tools, available to the…

Descriptors: Evaluation Criteria, Evaluation Methods, Evaluation Problems, Performance Factors

Visual Analysis of Single-Subject Studies by School Psychologists.

Peer reviewed

Furlong, Michael J.; Wampold, Bruce E. – Psychology in the Schools, 1981

To guide the unbiased process of visual inference, a four-step model is presented for the assessment of reliability, intervention effect, meaningfulness, and generalizability. A Visual Inference Checklist (VIC) systematizes this assessment process. (Author)

Descriptors: Bias, Data Analysis, Evaluation Methods, Identification

Evaluating Family Therapy: Divergent Methods, Divergent Findings.

Peer reviewed

Kolevzon, Michael S.; And Others – Journal of Marital and Family Therapy, 1988

Employed triangulation strategy for assessing family interaction, involving family members, therapist, and coders independently viewing videotapes. Found weak agreement between paired assessments within family triad, and within therapist-coder dyad. Findings suggest that methodological and/or scaling strategies designed to maximize agreement may…

Descriptors: Counselor Attitudes, Evaluation Criteria, Evaluation Methods, Evaluation Problems

Methodological Issues and Problems in the Assessment of Substance Use.

Peer reviewed

Carroll, Kathleen M. – Psychological Assessment, 1995

Three types of methodological issues particularly salient in research involving the assessment of substance use or abuse are discussed with strategies for avoiding problems: (1) the reliability and validity of methods; (2) the variability and episodic course of substance use; and (3) the heterogeneity of individuals with substance use disorders.…

Descriptors: Clinical Diagnosis, Evaluation Methods, Psychological Studies, Reliability
