ERIC - Search Results

Publication Date

In 2025	1
Since 2024	2
Since 2021 (last 5 years)	6
Since 2016 (last 10 years)	18
Since 2006 (last 20 years)	53

Descriptor

Correlation	58
Evaluation Methods	58
Scoring	36
Scoring Rubrics	20
Foreign Countries	14
Scores	14
Student Evaluation	14
Comparative Analysis	13
Computer Assisted Testing	10
Computer Software	10
Writing Evaluation	10
Interrater Reliability	9
Statistical Analysis	9
Computation	8
Models	8
College Students	7
Essays	7
Item Response Theory	7
Teacher Effectiveness	7
Test Validity	7
Data Analysis	6
Evaluation Criteria	6
Evaluation Research	6
Evaluators	6
Teacher Evaluation	6
More ▼

Publication Type

Journal Articles	41
Reports - Research	39
Reports - Evaluative	9
Dissertations/Theses -…	4
Reports - Descriptive	4
Numerical/Quantitative Data	2
Tests/Questionnaires	2
Information Analyses	1
Opinion Papers	1
Speeches/Meeting Papers	1

Education Level

Higher Education	17
Postsecondary Education	14
Elementary Secondary Education	10
Secondary Education	10
High Schools	5
Middle Schools	5
Elementary Education	4
Junior High Schools	4
Adult Education	1
Grade 11	1
Grade 3	1
Grade 4	1
Grade 5	1
Grade 7	1
Grade 8	1
Grade 9	1
More ▼

Audience

Researchers	2
Teachers	1

Location

Arizona	3
Australia	3
China	3
Hong Kong	2
Pennsylvania	2
South Korea	2
Turkey	2
California	1
Canada	1
Colorado (Denver)	1
Denmark	1
Florida	1
Illinois	1
India	1
Japan	1
Nevada (Reno)	1
New York (New York)	1
North Carolina (Charlotte)	1
Russia	1
South Africa	1
Spain	1
Sweden	1
Taiwan	1
Tennessee (Memphis)	1
Texas	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

Graduate Record Examinations	3
Test of English as a Foreign…	2
ACT Assessment	1
Autism Diagnostic Observation…	1
College Board Achievement…	1
National Longitudinal Survey…	1
SAT (College Admission Test)	1
Vineland Adaptive Behavior…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 58 results Save | Export

Resolving and Re-Scoring Constructed Response Items in Mixed-Format Assessments: An Exploration of Three Approaches

Peer reviewed

Direct link

Stefanie A. Wind; Yangmeng Xu – Educational Assessment, 2024

We explored three approaches to resolving or re-scoring constructed-response items in mixed-format assessments: rater agreement, person fit, and targeted double scoring (TDS). We used a simulation study to consider how the three approaches impact the psychometric properties of student achievement estimates, with an emphasis on person fit. We found…

Descriptors: Interrater Reliability, Error of Measurement, Evaluation Methods, Examiners

Assessing Creativity across Multi-Step Intervention Using Generative AI Models

Peer reviewed
PDF on ERIC

Download full text

Eran Hadas; Arnon Hershkovitz – Journal of Learning Analytics, 2025

Creativity is an imperative skill for today's learners, one that has important contributions to issues of inclusion and equity in education. Therefore, assessing creativity is of major importance in educational contexts. However, scoring creativity based on traditional tools suffers from subjectivity and is heavily time- and labour-consuming. This…

Descriptors: Creativity, Evaluation Methods, Computer Assisted Testing, Artificial Intelligence

Language Models in Automated Essay Scoring: Insights for the Turkish Language

Peer reviewed
PDF on ERIC

Download full text

Tahereh Firoozi; Okan Bulut; Mark J. Gierl – International Journal of Assessment Tools in Education, 2023

The proliferation of large language models represents a paradigm shift in the landscape of automated essay scoring (AES) systems, fundamentally elevating their accuracy and efficacy. This study presents an extensive examination of large language models, with a particular emphasis on the transformative influence of transformer-based models, such as…

Descriptors: Turkish, Writing Evaluation, Essays, Accuracy

Test Assembly Implications for Providing Reliable and Valid Subscores

Peer reviewed

Direct link

Lee, Minji K.; Sweeney, Kevin; Melican, Gerald J. – Educational Assessment, 2017

This study investigates the relationships among factor correlations, inter-item correlations, and the reliability estimates of subscores, providing a guideline with respect to psychometric properties of useful subscores. In addition, it compares subscore estimation methods with respect to reliability and distinctness. The subscore estimation…

Descriptors: Scores, Test Construction, Test Reliability, Test Validity

A Comparison of Student and Research-Based Evaluations of Explanation Quality in an Introductory Physics Course for Engineers

Direct link

Joe Olsen – ProQuest LLC, 2023

Instructional explanations are an ubiquitous component of classroom instruction, but are relatively neglected in science education when compared to other facets of teaching and learning. The ubiquity of instructional explanations and their potential to stimulate learning in students suggests that they should garner more attention from science…

Descriptors: Physics, Comparative Analysis, Student Attitudes, Educational Quality

Validation of an Automated Procedure for Calculating Core Lexicon from Transcripts

Peer reviewed

Direct link

Dalton, Sarah Grace; Stark, Brielle C.; Fromm, Davida; Apple, Kristen; MacWhinney, Brian; Rensch, Amanda; Rowedder, Madyson – Journal of Speech, Language, and Hearing Research, 2022

Purpose: The aim of this study was to advance the use of structured, monologic discourse analysis by validating an automated scoring procedure for core lexicon (CoreLex) using transcripts. Method: Forty-nine transcripts from persons with aphasia and 48 transcripts from persons with no brain injury were retrieved from the AphasiaBank database. Five…

Descriptors: Validity, Discourse Analysis, Databases, Scoring

A Proof-of-Concept Study on Scoring Oral Presentation Videos in Higher Education. Research Report. ETS RR-19-22

Peer reviewed
PDF on ERIC

Download full text

Feng, Gary; Joe, Jilliam; Kitchen, Christopher; Mao, Liyang; Roohr, Katrina Crotts; Chen, Lei – ETS Research Report Series, 2019

This proof-of-concept study examined the feasibility of a new scoring procedure designed to reduce the time of scoring a video-based public speaking assessment task. Instead of scoring the video in its entirety, the performance was evaluated based on content-related (e.g., speech organization, word choice) and delivery-related (e.g., vocal…

Descriptors: Scoring, Public Speaking, Video Technology, Evaluation Methods

Wise Crowd Content Assessment and Educational Rubrics

Peer reviewed

Direct link

Passonneau, Rebecca J.; Poddar, Ananya; Gite, Gaurav; Krivokapic, Alisa; Yang, Qian; Perin, Dolores – International Journal of Artificial Intelligence in Education, 2018

Development of reliable rubrics for educational intervention studies that address reading and writing skills is labor-intensive, and could benefit from an automated approach. We compare a main ideas rubric used in a successful writing intervention study to a highly reliable wise-crowd content assessment method developed to evaluate…

Descriptors: Computer Assisted Testing, Writing Evaluation, Content Analysis, Scoring Rubrics

Can Automated Machine Translation Evaluation Metrics Be Used to Assess Students' Interpretation in the Language Learning Classroom?

Peer reviewed

Direct link

Han, Chao; Lu, Xiaolei – Computer Assisted Language Learning, 2023

The use of translation and interpreting (T&I) in the language learning classroom is commonplace, serving various pedagogical and assessment purposes. Previous utilization of T&I exercises is driven largely by their potential to enhance language learning, whereas the latest trend has begun to underscore T&I as a crucial skill to be…

Descriptors: Translation, Computational Linguistics, Correlation, Language Processing

Applying Kane's Validity Framework to a Simulation Based Assessment of Clinical Competence

Peer reviewed

Direct link

Tavares, Walter; Brydges, Ryan; Myre, Paul; Prpic, Jason; Turner, Linda; Yelle, Richard; Huiskamp, Maud – Advances in Health Sciences Education, 2018

Assessment of clinical competence is complex and inference based. Trustworthy and defensible assessment processes must have favourable evidence of validity, particularly where decisions are considered high stakes. We aimed to organize, collect and interpret validity evidence for a high stakes simulation based assessment strategy for certifying…

Descriptors: Competence, Simulation, Allied Health Personnel, Certification

Implementing a Contributory Scoring Approach for the "GRE"® Analytical Writing Section: A Comprehensive Empirical Investigation. Research Report. ETS RR-17-14

Peer reviewed
PDF on ERIC

Download full text

Breyer, F. Jay; Rupp, André A.; Bridgeman, Brent – ETS Research Report Series, 2017

In this research report, we present an empirical argument for the use of a contributory scoring approach for the 2-essay writing assessment of the analytical writing section of the "GRE"® test in which human and machine scores are combined for score creation at the task and section levels. The approach was designed to replace a currently…

Descriptors: College Entrance Examinations, Scoring, Essay Tests, Writing Evaluation

How Should Colleges Treat Multiple Admissions Test Scores? ACT Working Paper 2017-4

Download full text

Mattern, Krista; Radunzel, Justine; Bertling, Maria; Ho, Andrew – ACT, Inc., 2017

The percentage of students retaking college admissions tests is rising (Harmston & Crouse, 2016). Researchers and college admissions offices currently use a variety of methods for summarizing these multiple scores. Testing companies, interested in validity evidence like correlations with college first-year grade-point averages (FYGPA), often…

Descriptors: College Entrance Examinations, Grade Point Average, College Freshmen, Correlation

Peer and Self-Assessment Applied to Oral Presentations from a Multidisciplinary Perspective

Peer reviewed

Direct link

Suñol, Joan Josep; Arbat, Gerard; Pujol, Joan; Feliu, Lidia; Fraguell, Rosa Maria; Planas-Lladó, Anna – Assessment & Evaluation in Higher Education, 2016

This article analyses the use of peer and self-assessment in oral presentations as complementary tools to assessment by the professor. The analysis is based on a study conducted at the University of Girona (Spain) in seven different degree subjects and fields of knowledge. We designed and implemented two instruments to measure students' peer and…

Descriptors: Public Speaking, Peer Evaluation, Self Evaluation (Individuals), Evaluation Methods

Testing Methodology in the Student Learning Process

Peer reviewed
PDF on ERIC

Download full text

Gorbunova, Tatiana N. – European Journal of Contemporary Education, 2017

The subject of the research is to build methodologies to evaluate the student knowledge by testing. The author points to the importance of feedback about the mastering level in the learning process. Testing is considered as a tool. The object of the study is to create the test system models for defence practice problems. Special attention is paid…

Descriptors: Testing, Evaluation Methods, Feedback (Response), Simulation

Equating a Large-Scale Writing Assessment Using Pairwise Comparisons of Performances

Peer reviewed

Direct link

Humphry, Stephen M.; McGrane, Joshua A. – Australian Educational Researcher, 2015

This paper presents a method for equating writing assessments using pairwise comparisons which does not depend upon conventional common-person or common-item equating designs. Pairwise comparisons have been successfully applied in the assessment of open-ended tasks in English and other areas such as visual art and philosophy. In this paper,…

Descriptors: Writing Evaluation, Evaluation Methods, Comparative Analysis, Writing Tests

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4

ETS Research Report Series	4
ProQuest LLC	4
Applied Psychological…	3
Mathematica Policy Research,…	3
Assessment & Evaluation in…	2
Educational Assessment	2
Educational and Psychological…	2
Journal of Speech, Language,…	2
Regional Educational…	2
ACT, Inc.	1
Advances in Health Sciences…	1
Art Therapy: Journal of the…	1
Assessing Writing	1
Australian Educational…	1
College Board	1
Computer Assisted Language…	1
Computers & Education	1
Contemporary Issues in…	1
Creativity Research Journal	1
Developmental Psychology	1
Discourse Processes: A…	1
Education and Information…	1
Educational Psychology in…	1
Educational Research and…	1
Educational Testing Service	1
More ▼

Bridgeman, Brent	3
Gill, Brian	3
Chiang, Hanley	2
Davey, Tim	2
Lipscomb, Stephen	2
Ramineni, Chaitanya	2
Trapani, Catherine S.	2
Williamson, David M.	2
Alci, Bülent	1
Alexander, Patricia A.	1
Andersson, Marie	1
Apple, Kristen	1
Arbat, Gerard	1
Arnon Hershkovitz	1
Bertling, Maria	1
Berzin, Stephanie Cosner	1
Boedigheimer, Ralph	1
Breyer, F. Jay	1
Brown, Michelle Stallone	1
Brydges, Ryan	1
Carlson, Sybil B.	1
Chaudhary, Banshi D.	1
Chen, Lei	1
Chiou, Chuang-Kai	1
More ▼