ERIC - Search Results

Publication Date

In 2025	1
Since 2024	6
Since 2021 (last 5 years)	12
Since 2016 (last 10 years)	19
Since 2006 (last 20 years)	37

Descriptor

Evaluators	80
Measurement Techniques	80
Evaluation Methods	43
Program Evaluation	20
Elementary Secondary Education	13
Interrater Reliability	12
Scores	11
Educational Assessment	10
Scoring	10
Academic Achievement	9
Evaluation Criteria	9
Foreign Countries	9
Higher Education	9
Models	9
Rating Scales	9
Reliability	9
Comparative Analysis	8
Psychometrics	8
Program Implementation	7
Test Construction	7
Correlation	6
Data Collection	6
Research Methodology	6
School Districts	6
Standards	6

Publication Type

Journal Articles	41
Reports - Research	34
Reports - Evaluative	18
Speeches/Meeting Papers	12
Reports - Descriptive	11
Opinion Papers	6
Guides - General	4
Information Analyses	4
Tests/Questionnaires	4
Collected Works - Proceedings	3
Dissertations/Theses -…	2
Guides - Non-Classroom	2
ERIC Publications	1
Numerical/Quantitative Data	1
Reference Materials -…	1

Education Level

Higher Education	8
Elementary Education	5
Postsecondary Education	5
Secondary Education	4
High Schools	3
Middle Schools	3
Elementary Secondary Education	2
Junior High Schools	2
Adult Education	1
Early Childhood Education	1
Grade 2	1
Grade 5	1
Grade 7	1
Intermediate Grades	1
Primary Education	1

Audience

Researchers	3
Practitioners	2
Policymakers	1

Location

California	2
New Hampshire	2
Africa	1
Argentina (Buenos Aires)	1
Australia	1
Canada	1
Ethiopia	1
Florida	1
Germany	1
Lesotho	1
Michigan	1
New Zealand	1
Somalia	1
South Korea	1
Sudan	1
Swaziland	1
Switzerland	1
Tanzania	1
Tennessee	1
Thailand	1
United Kingdom (England)	1
United States	1

Laws, Policies, & Programs

Stewart B McKinney Homeless…

Assessments and Surveys

National Assessment of…	3
Flesch Kincaid Grade Level…	1
Fry Readability Formula	1
Motivated Strategies for…	1
National Adult Literacy…	1
National Survey of Student…	1
Rorschach Test	1
Test of English as a Foreign…	1
Torrance Tests of Creative…	1

Showing 1 to 15 of 80 results

Planning Missing Data Designs for Human Ratings in Creativity Research: A Practical Guide

Peer reviewed

Boris Forthmann; Benjamin Goecke; Roger E. Beaty – Creativity Research Journal, 2025

Human ratings are ubiquitous in creativity research. Yet, the process of rating responses to creativity tasks -- typically several hundred or thousands of responses, per rater -- is often time-consuming and expensive. Planned missing data designs, where raters only rate a subset of the total number of responses, have been recently proposed as one…

Descriptors: Creativity, Research, Researchers, Research Methodology

Thinking Outside the Self-Report: Using Evaluation Plans to Assess Evaluation Capacity Building

Peer reviewed

Wingate, Lori A.; Robertson, Kelly; FitzGerald, Michael; Rucks, Lana; Tsuzaki, Takara; Clasen, Carla; Schwob, Jeremy – American Journal of Evaluation, 2022

In this study, we investigated the impact of the evaluation capacity building (ECB) efforts of an organization by examining the evaluation plans included in funding proposals over a 14-year period. Specifically, we sought to determine the degree to which and how evaluation plans in proposals to one National Science Foundation (NSF) program changed…

Descriptors: Measurement Techniques, Evaluation Methods, Capacity Building, Program Evaluation

A Comparison of Latent Semantic Analysis and Latent Dirichlet Allocation in Educational Measurement

Peer reviewed

Jordan M. Wheeler; Allan S. Cohen; Shiyu Wang – Journal of Educational and Behavioral Statistics, 2024

Topic models are mathematical and statistical models used to analyze textual data. The objective of topic models is to gain information about the latent semantic space of a set of related textual data. The semantic space of a set of textual data contains the relationship between documents and words and how they are used. Topic models are becoming…

Descriptors: Semantics, Educational Assessment, Evaluators, Reliability

Evaluation Is Creation: Self and Social Judgments of Creativity across the Four-C Model

Peer reviewed

Denis Dumas; James C. Kaufman – Educational Psychology Review, 2024

Who should evaluate the originality and task-appropriateness of a given idea has been a perennial debate among psychologists of creativity. Here, we argue that the most relevant evaluator of a given idea depends crucially on the level of expertise of the person who generated it. To build this argument, we draw on two complementary theoretical…

Descriptors: Decision Making, Creativity, Task Analysis, Psychologists

Exploring NSF-Funded Evaluators' and Principal Investigators' Definitions and Measurement of Diversity, Equity, and Inclusion

Peer reviewed

Boyce, Ayesha S.; Tovey, Tiffany L.S.; Onwuka, Onyinyechukwu; Moller, J.R.; Clark, Tyler; Smith, Aundrea – American Journal of Evaluation, 2023

More evaluators have anchored their work in equity-focused, culturally responsive, and social justice ideals. Although we have a sense of approaches that guide evaluators as to how they should attend to culture, diversity, equity, and inclusion (DEI), we have not yet established an empirical understanding of how evaluators measure DEI. In this…

Descriptors: Definitions, Inclusion, Equal Education, Social Justice

Method-of-Moment Corrected Maximum Likelihood (ML) Structural-after-Measurement (SAM) Estimator for n-Level Structural Equation Models

Peer reviewed

Fangxing Bai; Ben Kelcey – Society for Research on Educational Effectiveness, 2024

Purpose and Background: Despite the flexibility of multilevel structural equation modeling (MLSEM), a practical limitation many researchers encounter is how to effectively estimate model parameters with typical sample sizes when there are many levels of (potentially disparate) nesting. We develop a method-of-moment corrected maximum likelihood…

Descriptors: Maximum Likelihood Statistics, Structural Equation Models, Sample Size, Faculty Development

Visualizing Agreement: Bland-Altman Plots as a Supplement to Inter-Rater Reliability Indices

Peer reviewed

Brogan L. Barr; Virginia V. W. McIntosh; Eileen F. Britt; Jennifer Jordan; Janet D. Carter – Measurement: Interdisciplinary Research and Perspectives, 2024

Even when raters demonstrate agreement in the use of a measure, limited score variability or violation of often-ignored statistical assumptions can result in lower reliability estimates than intuitively expected. This article uses data drawn from two randomized controlled trials of schema therapy and cognitive behavioral therapy for the treatment…

Descriptors: Evaluators, Interrater Reliability, Reliability, Measurement Techniques

Comparison of Inter-Rater Reliability Techniques in Performance-Based Assessment

Peer reviewed

Arslan Mancar, Sinem; Gulleroglu, H. Deniz – International Journal of Assessment Tools in Education, 2022

The aim of this study is to analyse the importance of the number of raters and compare the results obtained by techniques based on Classical Test Theory (CTT) and Generalizability (G) Theory. The Kappa and Krippendorff alpha techniques based on CTT were used to determine the inter-rater reliability. In this descriptive research data consists of…

Descriptors: Comparative Analysis, Interrater Reliability, Advanced Placement, Scoring Rubrics

Measuring Original Thinking in Elementary School: Development and Validation of a Computational Psychometric Approach

Peer reviewed

Selcuk Acar; Denis Dumas; Peter Organisciak; Kelly Berthiaume – Grantee Submission, 2024

Creativity is highly valued in both education and the workforce, but assessing and developing creativity can be difficult without psychometrically robust and affordable tools. The open-ended nature of creativity assessments has made them difficult to score, expensive, often imprecise, and therefore impractical for school- or district-wide use. To…

Descriptors: Thinking Skills, Elementary School Students, Artificial Intelligence, Measurement Techniques

Presenting the Meta-Performance Test, a Metacognitive Battery Based on Performance

Peer reviewed

Castillo Diaz, Marcio Alexander; Gomes, Cristiano Mauro Assis – International Journal of Educational Methodology, 2021

The self-report and think-aloud approaches are the two dominant methodologies to measure metacognition. This is problematic, since they generate respondent and confirmation biases, respectively. The Meta-Performance Test is an innovative battery, which evaluates metacognition based on the respondent's performance, mitigating the aforementioned…

Descriptors: Metacognition, Measurement Techniques, Reading Comprehension, Arithmetic

Discourse of STEM Education Evaluation: Current and Future Perspectives

Adetogun, Adeyemo Adekanmi – ProQuest LLC, 2023

Science, Technology, Engineering, and Mathematics (STEM) education has become increasingly important in the US due to its influence on the nation's educational needs, the creation of a skilled labor force, and opportunities for more tech-savvy workers. However, the evaluation approaches and methodologies used in STEM education programs have come…

Descriptors: STEM Education, Evaluation Methods, Evaluators, Educational Philosophy

Investigating Human Essay Rating Quality in a Large-Scale Assessment Using Many-Facet Rasch Measurement

Peer reviewed

Zhang, Xiuyuan – AERA Online Paper Repository, 2019

The main purpose of the study is to evaluate the qualities of human essay ratings for a large-scale assessment using Rasch measurement theory. Specifically, Many-Facet Rasch Measurement (MFRM) was utilized to examine the rating scale category structure and provide important information about interpretations of ratings in the large-scale…

Descriptors: Essays, Evaluators, Writing Evaluation, Reliability

Effects of Second Language Pronunciation Teaching Revisited: A Proposed Measurement Framework and Meta-Analysis

Peer reviewed

Saito, Kazuya; Plonsky, Luke – Language Learning, 2019

We propose a new framework for conceptualizing measures of instructed second language (L2) pronunciation performance according to three sets of parameters: (a) the constructs (focused on global vs. specific aspects of pronunciation), (b) the scoring method (human raters vs. acoustic analyses), and (c) the type of knowledge elicited (controlled vs.…

Descriptors: Second Language Learning, Second Language Instruction, Scoring, Pronunciation Instruction

Reliability and Construct Validity of the TBI-QOL Communication Short Form as a Parent-Proxy Report Instrument for Children with Traumatic Brain Injury

Peer reviewed

Cohen, Matthew L.; Tulsky, David S.; Boulton, Aaron J.; Kisala, Pamela A.; Bertisch, Hilary; Yeates, Keith Owen; Zonfrillo, Mark R.; Durbin, Dennis R.; Jaffe, Kenneth M.; Temkin, Nancy; Wang, Jin; Rivara, Frederick P. – Journal of Speech, Language, and Hearing Research, 2019

Purpose: The purpose of this study was to evaluate the internal consistency and construct validity of the Traumatic Brain Injury Quality of Life Communication Item Bank (TBI-QOL COM) short form as a parent-proxy report measure. The TBI-QOL COM is a patient-reported outcome measure of functional communication originally developed as a self-report…

Descriptors: Brain, Head Injuries, Quality of Life, Pediatrics

Building an Initial Validity Argument for Binary and Analytic Rating Scales for an EFL Classroom Writing Assessment: Evidence from Many-Facets Rasch Measurement

Peer reviewed

Khamboonruang, Apichat – rEFLections, 2022

Although much research has compared the functioning between analytic and holistic rating scales, little research has compared the functioning of binary rating scales with other types of rating scales. This quantitative study set out to preliminarily and comparatively validate binary and analytic rating scales intended for use in formative…

Descriptors: Writing Evaluation, Evaluation Methods, Second Language Learning, Second Language Instruction

Source

Educational Measurement:…	6
American Journal of Evaluation	5
Applied Measurement in…	3
New Directions for Evaluation	3
New Directions for Program…	2
ProQuest LLC	2
Regional Educational…	2
AERA Online Paper Repository	1
Cambridge Assessment	1
College Student Journal	1
Contemporary Education Review	1
Creativity Research Journal	1
ETS Research Report Series	1
Education Trust	1
Educational Psychology Review	1
Evaluation Practice	1
Grantee Submission	1
International Educational…	1
International Journal of…	1
International Journal of…	1
Journal of Applied Psychology	1
Journal of College Student…	1
Journal of Educational…	1
Journal of Educational and…	1
Journal of MultiDisciplinary…	1

Author

Bocala, Candice	2
Bronson, William H.	2
Chang, Quincy	2
Denis Dumas	2
Lacireno-Paquet, Natalie	2
Law, Alexander I.	2
Riordan, Julie	2
Shakman, Karen	2
Adetogun, Adeyemo Adekanmi	1
Allan S. Cohen	1
Almy, Sarah	1
Arnold, Mary E.	1
Arslan Mancar, Sinem	1
Backlund, Phil	1
Baer, Donald M.	1
Ball, Samuel	1
Batchelder, William H.	1
Ben Kelcey	1
Benjamin Goecke	1
Bertisch, Hilary	1
Black, Don	1
Bombel, George	1
Boris Forthmann	1
Borman, Walter C.	1