ERIC - Search Results

Publication Date

In 2026	0
Since 2025	2
Since 2022 (last 5 years)	14
Since 2017 (last 10 years)	51
Since 2007 (last 20 years)	158

Descriptor

Correlation	220
Evaluation Methods	220
Reliability	94
Test Reliability	81
Interrater Reliability	59
Test Validity	52
Foreign Countries	46
Validity	46
Scores	39
Statistical Analysis	37
Measurement Techniques	34
Measures (Individuals)	34
Student Evaluation	33
Psychometrics	32
Factor Analysis	31
Rating Scales	29
Comparative Analysis	27
College Students	21
Questionnaires	19
Student Attitudes	19
Models	16
Test Construction	16
Construct Validity	14
Evaluators	14
Item Analysis	14

Publication Type

Journal Articles	172
Reports - Research	144
Reports - Evaluative	44
Tests/Questionnaires	14
Speeches/Meeting Papers	12
Dissertations/Theses -…	10
Information Analyses	9
Reports - Descriptive	8
Numerical/Quantitative Data	3
Guides - Non-Classroom	2
Opinion Papers	2
Books	1
Collected Works - Proceedings	1
Collected Works - Serials	1
Reference Materials -…	1

Education Level

Higher Education	46
Postsecondary Education	40
Elementary Secondary Education	22
Elementary Education	21
Secondary Education	14
High Schools	10
Middle Schools	8
Early Childhood Education	5
Adult Education	4
Grade 8	4
Junior High Schools	4
Primary Education	4
Grade 1	3
Grade 2	3
Grade 3	3
Grade 6	3
Grade 7	3
Grade 4	2
Grade 5	2
Preschool Education	2
Grade 10	1
Grade 11	1
Grade 12	1
Kindergarten	1

Audience

Researchers	6
Practitioners	3
Counselors	1
Teachers	1

Location

China	7
Florida	6
United Kingdom	6
Netherlands	5
Spain	5
Turkey	5
Australia	4
California	4
Arizona	3
Illinois	3
Japan	3
Pennsylvania	3
Portugal	3
Singapore	3
South Korea	3
United States	3
Asia	2
Canada	2
Greece	2
Hong Kong	2
Italy	2
New Jersey	2
New York (New York)	2
Ohio	2
Sweden	2

Showing 1 to 15 of 220 results

A Unified Approach to Estimating the Intraclass Correlation Coefficient and Its Bias: An Exploratory Study

Kelvin Terrell Pompey – ProQuest LLC, 2021

Many methods are used to measure interrater reliability for studies where each target receives ratings by a different set of judges. The purpose of this study is to explore the use of hierarchical modeling for estimating interrater reliability using the intraclass correlation coefficient. This study provides a description of how the ICC can be…

Descriptors: Interrater Reliability, Evaluation Methods, Test Reliability, Correlation

PBL Student Assessment: Consistency of Different Evaluation Methods in a Computing Faculty

Peer reviewed

Henrique Mohallem Paiva; Flávia Maria Santoro; Victor Takashi Hayashi; Bianca Cassemiro Lima – IEEE Transactions on Education, 2025

Contribution: This article analyzes student assessment within a computing faculty employing a full project-based learning (PBL) approach. Examining 2078 final grades across 60 classes and periods, the study reveals a significant correlation between graded self-studies, exams, and projects. This result contributes to understanding the reliability…

Descriptors: Student Evaluation, Computer Science Education, College Faculty, Correlation

There Are Many Greater Lower Bounds than Cronbach's [alpha]: A Monte Carlo Simulation Study

Peer reviewed

Novak, Josip; Rebernjak, Blaž – Measurement: Interdisciplinary Research and Perspectives, 2023

A Monte Carlo simulation study was conducted to examine the performance of [alpha], [lambda]2, [lambda][subscript 4], [lambda][subscript 2], [omega][subscript T], GLB[subscript MRFA], and GLB[subscript Algebraic] coefficients. Population reliability, distribution shape, sample size, test length, and number of response categories were varied…

Descriptors: Monte Carlo Methods, Evaluation Methods, Reliability, Simulation

Resolving and Re-Scoring Constructed Response Items in Mixed-Format Assessments: An Exploration of Three Approaches

Peer reviewed

Stefanie A. Wind; Yangmeng Xu – Educational Assessment, 2024

We explored three approaches to resolving or re-scoring constructed-response items in mixed-format assessments: rater agreement, person fit, and targeted double scoring (TDS). We used a simulation study to consider how the three approaches impact the psychometric properties of student achievement estimates, with an emphasis on person fit. We found…

Descriptors: Interrater Reliability, Error of Measurement, Evaluation Methods, Examiners

Linear and Nonlinear Indices of Score Accuracy and Item Effectiveness for Measures That Contain Locally Dependent Items

Peer reviewed

Pere J. Ferrando; David Navarro-González; Fabia Morales-Vives – Educational and Psychological Measurement, 2025

The problem of local item dependencies (LIDs) is very common in personality and attitude measures, particularly in those that measure narrow-bandwidth dimensions. At the structural level, these dependencies can be modeled by using extended factor analytic (FA) solutions that include correlated residuals. However, the effects that LIDs have on the…

Descriptors: Scores, Accuracy, Evaluation Methods, Factor Analysis

Assessing Clinical Competence: A Multitrait-Multimethod Matrix Construct Validity Study

Peer reviewed

Andrea Vallevand; David E. Manthey; Kim Askew; Nicholas D. Hartman; Cynthia Burns; Lindsay C. Strowd; Claudio Violato – Advances in Health Sciences Education, 2024

Education in Doctor of Medicine programs has moved towards an emphasis on clinical competency, with entrustable professional activities providing a framework of learning objectives and outcomes to be assessed within the clinical environment. While the identification and structured definition of objectives and outcomes have evolved, many methods…

Descriptors: Clinical Experience, Graduate Medical Education, Validity, Evaluation Methods

The Whole Is More than the Sum of Its Parts -- Assessing Writing Using the Consensual Assessment Technique

Peer reviewed

Zahn, Daniela; Canton, Ursula; Boyd, Victoria; Hamilton, Laura; Mamo, Josianne; McKay, Jane; Proudfoot, Linda; Telfer, Dickson; Williams, Kim; Wilson, Colin – Studies in Higher Education, 2021

Evaluating the impact of Academic Literacies teaching (Lea and Street [1998. "Student Writing in Higher Education: An Academic Literacies Approach." "Studies in Higher Education" 23 (2): 157-72. doi:10.1080/03075079812331380364]) is difficult, as it involves gauging whether writers: (1) gain better understanding of what…

Descriptors: Writing Evaluation, Evaluation Methods, Undergraduate Students, Foreign Countries

Brief Report: Specificity of Interpersonal Synchrony Deficits to Autism Spectrum Disorder and Its Potential for Digitally Assisted Diagnostics

Peer reviewed

Koehler, Jana Christina; Georgescu, Alexandra Livia; Weiske, Johanna; Spangemacher, Moritz; Burghof, Lana; Falkai, Peter; Koutsouleris, Nikolaos; Tschacher, Wolfgang; Vogeley, Kai; Falter-Wagner, Christine M. – Journal of Autism and Developmental Disorders, 2022

Reliably diagnosing autism spectrum disorders (ASD) in adulthood poses a challenge to clinicians due to the absence of specific diagnostic markers. This study investigated the potential of interpersonal synchrony (IPS), which has been found to be reduced in ASD, to augment the diagnostic process. IPS was objectively assessed in videos…

Descriptors: Autism, Pervasive Developmental Disorders, Clinical Diagnosis, Reliability

The Concurrent Validity of Comparative Judgement Outcomes Compared with Marks

Gill, Tim – Research Matters, 2022

In Comparative Judgement (CJ) exercises, examiners are asked to look at a selection of candidate scripts (with marks removed) and order them in terms of which they believe display the best quality. By including scripts from different examination sessions, the results of these exercises can be used to help with maintaining standards. Results from…

Descriptors: Comparative Analysis, Decision Making, Scripts, Standards

Uniform Sampling of One Factor Correlation Matrices with Applications to Psychometric Research

Peer reviewed

Olvera Astivia, Oscar L.; Zumbo, Bruno D. – Measurement: Interdisciplinary Research and Perspectives, 2019

Methods to generate random correlation matrices have been proposed in the literature, but very few instances exist where these correlation matrices are structured or where the statistical properties of the algorithms are known. By relying on the tetrad relation discovered by Spearman and the properties of the beta distribution, an algorithm is…

Descriptors: Correlation, Psychometrics, Benchmarking, Evaluation Methods

The Counseling Competencies Scale: Validation and Refinement

Peer reviewed

Lambie, Glenn W.; Mullen, Patrick R.; Swank, Jacqueline M.; Blount, Ashley – Measurement and Evaluation in Counseling and Development, 2018

Supervisors evaluated counselors-in-training at multiple points during their practicum experience using the Counseling Competencies Scale (CCS; N = 1,070). The CCS evaluations were randomly split to conduct exploratory factor analysis and confirmatory factor analysis, resulting in a 2-factor model (61.5% of the variance explained).

Descriptors: Counselor Training, Counseling, Measures (Individuals), Competence

Structural Variable Validation of an Online Learning Response Behavior (OLRB) Instrument: A Comparison Analysis of Three Extraction Methods of Exploratory Factor Analysis

Peer reviewed

Azman Ong, Mohd Hanafi; Mohd Yasin, Norazlina; Ibrahim, Nur Syafikah – Asian Association of Open Universities Journal, 2022

Purpose: Measuring internal response of online learning is seen as fundamental to absorptive capacity which stimulates knowledge assimilation. However, the evaluation of practice and research of validated instruments that could effectively measure online learning response behavior is limited. Thus, in this study, a new instrument was designed…

Descriptors: Online Courses, Student Surveys, Student Attitudes, Factor Analysis

Automated Assessment of Second Language Comprehensibility: Review, Training, Validation, and Generalization Studies

Peer reviewed

Saito, Kazuya; Macmillan, Konstantinos; Kachlicka, Magdalena; Kunihara, Takuya; Minematsu, Nobuaki – Studies in Second Language Acquisition, 2023

Whereas many scholars have emphasized the relative importance of "comprehensibility" as an ecologically valid goal for L2 speech training, testing, and development, eliciting listeners' judgments is time-consuming. Following calls for research on more efficient L2 speech rating methods in applied linguistics, and growing attention toward…

Descriptors: Second Language Learning, Second Language Instruction, Interrater Reliability, Speech Communication

Assessment of Interrater and Intermethod Agreement in the Kinesiology Literature

Peer reviewed

Looney, Marilyn A. – Measurement in Physical Education and Exercise Science, 2018

The purpose of this article was two-fold: (1) to provide an overview of the commonly reported and under-reported absolute agreement indices in the kinesiology literature for continuous data; and (2) to present examples of these indices for hypothetical data along with recommendations for future use. It is recommended that three types of information be…

Descriptors: Interrater Reliability, Evaluation Methods, Kinetics, Indexes

Properties of a Combined Measure of Reading and Writing: The Assessment of Writing, Self-Monitoring, and Reading (AWSM Reader)

Peer reviewed

Gioia, Anthony R.; Ahmed, Yusra; Woods, Steven P.; Cirino, Paul T. – Reading and Writing: An Interdisciplinary Journal, 2023

There is significant overlap between reading and writing, but no known standardized measure assesses these jointly. The goal of the present study is to evaluate the properties of a novel measure, the Assessment of Writing, Self-Monitoring, and Reading (AWSM Reader), that simultaneously evaluates both reading comprehension and writing. In doing so,…

Descriptors: Reading Writing Relationship, Writing Evaluation, Self Evaluation (Individuals), Executive Function

Source

ProQuest LLC	10
Educational and Psychological…	5
Journal of Autism and…	5
Journal of Psychoeducational…	5
Regional Educational…	5
Advances in Health Sciences…	4
Assessment for Effective…	4
Measurement and Evaluation in…	4
Online Submission	4
Research on Social Work…	4
Social Indicators Research	4
Journal of Applied Research…	3
Research in Developmental…	3
Assessment & Evaluation in…	2
Autism: The International…	2
Behavioral Disorders	2
Educational Assessment	2
Electronic Journal of…	2
Gerontologist	2
Grantee Submission	2
Journal of Consulting and…	2
Journal of Nutrition…	2
Journal of Speech, Language,…	2
Measurement:…	2
Research in Autism Spectrum…	2

Author

Gill, Brian	4
Booker, Kevin	2
Bruch, Julie	2
Dudley-Marling, Curt	2
Elliott, Stephen N.	2
Gresham, Frank M.	2
Lambie, Glenn W.	2
Matson, Johnny L.	2
Onwuegbuzie, Anthony J.	2
Owston, Ronald D.	2
Smith, Erica	2
Swank, Jacqueline M.	2
Thompson, Bruce	2
A. C., John	1
Adunyarittigun, Dumrong	1
Ahmed, Yusra	1
Akin, Ahmet	1
Aklin, Will	1
Alfonsin, Nicole	1
Allen, Adelaide M.	1
Ambegaonkar, Jatin P.	1
Andersson, Marie	1
Andrea Vallevand	1
Angus, Megan Hague	1

Assessments and Surveys

ACT Assessment	3
Dynamic Indicators of Basic…	3
Social Skills Rating System	3
Stanford Achievement Tests	3
Wechsler Intelligence Scale…	3
Aberrant Behavior Checklist	2
Behavior Assessment System…	2
Child Behavior Checklist	2
Graduate Record Examinations	2
Iowa Tests of Basic Skills	2
Preliminary Scholastic…	2
Program for International…	2
Academic Motivation Scale	1
Autism Diagnostic Observation…	1
Center for Epidemiologic…	1
Clinical Evaluation of…	1
Conners Rating Scales	1
Developmental Behavior…	1
Diagnostic Assessment for the…	1
Kaufman Brief Intelligence…	1
MacArthur Communicative…	1
Motivated Strategies for…	1
Mullen Scales of Early…	1
Peabody Individual…	1
Praxis Series	1