ERIC - Search Results

Publication Date

In 2025	1
Since 2024	1
Since 2021 (last 5 years)	5
Since 2016 (last 10 years)	9
Since 2006 (last 20 years)	13

Descriptor

Evaluators	13
Models	13
Scoring	13
Computer Software	5
Essays	5
Comparative Analysis	4
Foreign Countries	4
Test Items	4
Accuracy	3
Automation	3
Computer Assisted Testing	3
Item Analysis	3
Scores	3
Writing Evaluation	3
Artificial Intelligence	2
Case Studies	2
Computational Linguistics	2
Educational Change	2
English (Second Language)	2
Essay Tests	2
Interrater Reliability	2
Item Response Theory	2
Language Tests	2
Mathematics Tests	2
Networks	2
More ▼

Source

ETS Research Report Series	2
Educational and Psychological…	2
Applied Measurement in…	1
Assessment & Evaluation in…	1
International Educational…	1
Journal of Experimental…	1
Journal of the Scholarship of…	1
Language Testing	1
Malaysian Online Journal of…	1
New Teacher Project	1
ProQuest LLC	1
More ▼

Publication Type

Journal Articles	10
Reports - Research	10
Dissertations/Theses -…	1
Information Analyses	1
Reports - Evaluative	1
Speeches/Meeting Papers	1
Tests/Questionnaires	1

Education Level

Elementary Secondary Education	2
High Schools	1
Higher Education	1
Postsecondary Education	1
Secondary Education	1

Audience

Location

Germany	1
Kentucky	1
Netherlands	1
Switzerland	1

Laws, Policies, & Programs

Race to the Top

Assessments and Surveys

Test of English as a Foreign…	1
Trends in International…	1

What Works Clearinghouse Rating

Showing all 13 results Save | Export

Employing a Hierarchical Rater Models for Automated Scoring: Scope Review on the Application in Educational Assessment

Peer reviewed
PDF on ERIC

Download full text

Direct link

Akif Avcu – Malaysian Online Journal of Educational Technology, 2025

This scope-review presents the milestones of how Hierarchical Rater Models (HRMs) become operable to used in automated essay scoring (AES) to improve instructional evaluation. Although essay evaluations--a useful instrument for evaluating higher-order cognitive abilities--have always depended on human raters, concerns regarding rater bias,…

Descriptors: Automation, Scoring, Models, Educational Assessment

Exploring Difficult-to-Score Essays with a Hyperbolic Cosine Accuracy Model and Coh-Metrix Indices

Peer reviewed

Direct link

Wang, Jue; Engelhard, George; Combs, Trenton – Journal of Experimental Education, 2023

Unfolding models are frequently used to develop scales for measuring attitudes. Recently, unfolding models have been applied to examine rater severity and accuracy within the context of rater-mediated assessments. One of the problems in applying unfolding models to rater-mediated assessments is that the substantive interpretations of the latent…

Descriptors: Writing Evaluation, Scoring, Accuracy, Computational Linguistics

Exploring the Impersonal Judgments and Personal Preferences of Raters in Rater-Mediated Assessments with Unfolding Models

Peer reviewed

Direct link

Wang, Jue; Engelhard, George, Jr. – Educational and Psychological Measurement, 2019

The purpose of this study is to explore the use of unfolding models for evaluating the quality of ratings obtained in rater-mediated assessments. Two different judgmental processes can be used to conceptualize ratings: impersonal judgments and personal preferences. Impersonal judgments are typically expected in rater-mediated assessments, and…

Descriptors: Evaluative Thinking, Preferences, Evaluators, Models

Modeling and Analyzing Scorer Preferences in Short-Answer Math Questions

Peer reviewed
PDF on ERIC

Download full text

Zhang, Mengxue; Heffernan, Neil; Lan, Andrew – International Educational Data Mining Society, 2023

Automated scoring of student responses to open-ended questions, including short-answer questions, has great potential to scale to a large number of responses. Recent approaches for automated scoring rely on supervised learning, i.e., training classifiers or fine-tuning language models on a small number of responses with human-provided score…

Descriptors: Scoring, Computer Assisted Testing, Mathematics Instruction, Mathematics Tests

More Efficient Processes for Creating Automated Essay Scoring Frameworks: A Demonstration of Two Algorithms

Peer reviewed

Direct link

Shin, Jinnie; Gierl, Mark J. – Language Testing, 2021

Automated essay scoring (AES) has emerged as a secondary or as a sole marker for many high-stakes educational assessments, in native and non-native testing, owing to remarkable advances in feature engineering using natural language processing, machine learning, and deep-neural algorithms. The purpose of this study is to compare the effectiveness…

Descriptors: Scoring, Essays, Writing Evaluation, Computer Software

Predictive Modeling of Rater Behavior: Implications for Quality Assurance in Essay Scoring

Peer reviewed

Direct link

Bejar, Isaac I.; Li, Chen; McCaffrey, Daniel – Applied Measurement in Education, 2020

We evaluate the feasibility of developing predictive models of rater behavior, that is, "rater-specific" models for predicting the scores produced by a rater under operational conditions. In the present study, the dependent variable is the score assigned to essays by a rater, and the predictors are linguistic attributes of the essays…

Descriptors: Scoring, Essays, Behavior, Predictive Measurement

Scoring Graphical Responses in TIMSS 2019 Using Artificial Neural Networks

Peer reviewed

Direct link

von Davier, Matthias; Tyack, Lillian; Khorramdel, Lale – Educational and Psychological Measurement, 2023

Automated scoring of free drawings or images as responses has yet to be used in large-scale assessments of student achievement. In this study, we propose artificial neural networks to classify these types of graphical responses from a TIMSS 2019 item. We are comparing classification accuracy of convolutional and feed-forward approaches. Our…

Descriptors: Scoring, Networks, Artificial Intelligence, Elementary Secondary Education

Automated Essay Scoring at Scale: A Case Study in Switzerland and Germany. TOEFL® Research Report. RR-86. ETS RR-19-12

Peer reviewed
PDF on ERIC

Download full text

Rupp, André A.; Casabianca, Jodi M.; Krüger, Maleika; Keller, Stefan; Köller, Olaf – ETS Research Report Series, 2019

In this research report, we describe the design and empirical findings for a large-scale study of essay writing ability with approximately 2,500 high school students in Germany and Switzerland on the basis of 2 tasks with 2 associated prompts, each from a standardized writing assessment whose scoring involved both human and automated components.…

Descriptors: Automation, Foreign Countries, English (Second Language), Language Tests

Modeling Rater Effects and Complex Learning Progressions Using Item Response Models

Direct link

Shin, Hyo Jeong – ProQuest LLC, 2015

This dissertation is comprised of three papers that propose and apply psychometric models to deal with complexities and challenges in large-scale assessments, focusing on modeling rater effects and complex learning progressions. In particular, three papers investigate extensions and applications of multilevel and multidimensional item response…

Descriptors: Item Response Theory, Psychometrics, Models, Measurement

Moving beyond Assessment to Improving Students' Critical Thinking Skills: A Model for Implementing Change

Peer reviewed
PDF on ERIC

Download full text

Haynes, Ada; Lisic, Elizabeth; Goltz, Michele; Stein, Barry; Harris, Kevin – Journal of the Scholarship of Teaching and Learning, 2016

This research examines how the use of the CAT (Critical thinking Assessment Test) and involvement in CAT-Apps (CAT Applications within the discipline) training can serve as an important part of a faculty development model that assists faculty in the assessment of students' critical thinking skills and in the development of these skills within…

Descriptors: Educational Change, Critical Thinking, Thinking Skills, Skill Development

Investigating the Suitability of Implementing the "e-rater"® Scoring Engine in a Large-Scale English Language Testing Program. Research Report. ETS RR-13-36

Peer reviewed
PDF on ERIC

Download full text

Zhang, Mo; Breyer, F. Jay; Lorenz, Florian – ETS Research Report Series, 2013

In this research, we investigated the suitability of implementing "e-rater"® automated essay scoring in a high-stakes large-scale English language testing program. We examined the effectiveness of generic scoring and 2 variants of prompt-based scoring approaches. Effectiveness was evaluated on a number of dimensions, including agreement…

Descriptors: Computer Assisted Testing, Computer Software, Scoring, Language Tests

Resetting Race to the Top: Why the Future of the Competition Depends on Improving the Scoring Process. Policy Brief

Download full text

New Teacher Project, 2010

Race to the Top represented a new paradigm in federal education. Instead of spreading relatively modest dollars evenly across all jurisdictions through funding formulas--as virtually all federal education funding has been and continues to be spent--a small number of successful states received all of the available funding, and in turn made it…

Descriptors: Federal Programs, Competition, Federal Aid, Educational Improvement

Developing and Validating a Design for Teacher Portfolio Assessment

Peer reviewed

Direct link

van der Schaaf, M. F.; Stokking, K. M. – Assessment & Evaluation in Higher Education, 2008

Developing and using a design for teacher portfolio assessment is a complex process including several components: the domain to be assessed (the teacher competences), the content standards or criteria, the portfolio format, the completion of the format (by teachers) with content, and the scoring of the portfolios (by raters). For a portfolio…

Descriptors: Portfolios (Background Materials), Portfolio Assessment, Scoring, Standards

Wang, Jue	2
Akif Avcu	1
Bejar, Isaac I.	1
Breyer, F. Jay	1
Casabianca, Jodi M.	1
Combs, Trenton	1
Engelhard, George	1
Engelhard, George, Jr.	1
Gierl, Mark J.	1
Goltz, Michele	1
Harris, Kevin	1
Haynes, Ada	1
Heffernan, Neil	1
Keller, Stefan	1
Khorramdel, Lale	1
Krüger, Maleika	1
Köller, Olaf	1
Lan, Andrew	1
Li, Chen	1
Lisic, Elizabeth	1
Lorenz, Florian	1
McCaffrey, Daniel	1
Rupp, André A.	1
Shin, Hyo Jeong	1
Shin, Jinnie	1
More ▼