ERIC - Search Results

Showing 1 to 15 of 43 results

A Comparison of Latent Semantic Analysis and Latent Dirichlet Allocation in Educational Measurement

Peer reviewed

Jordan M. Wheeler; Allan S. Cohen; Shiyu Wang – Journal of Educational and Behavioral Statistics, 2024

Topic models are mathematical and statistical models used to analyze textual data. The objective of topic models is to gain information about the latent semantic space of a set of related textual data. The semantic space of a set of textual data contains the relationship between documents and words and how they are used. Topic models are becoming…

Descriptors: Semantics, Educational Assessment, Evaluators, Reliability

Examining the Effect of Assessment Construct Characteristics on Machine Learning Scoring of Scientific Argumentation

Peer reviewed

Kevin C. Haudek; Xiaoming Zhai – International Journal of Artificial Intelligence in Education, 2024

Argumentation, a key scientific practice presented in the "Framework for K-12 Science Education," requires students to construct and critique arguments, but timely evaluation of arguments in large-scale classrooms is challenging. Recent work has shown the potential of automated scoring systems for open response assessments, leveraging…

Descriptors: Accuracy, Persuasive Discourse, Artificial Intelligence, Learning Management Systems

Rater Connections and the Detection of Bias in Performance Assessment

Peer reviewed

Wind, Stefanie A. – Measurement: Interdisciplinary Research and Perspectives, 2022

In many performance assessments, one or two raters from the complete rater pool score each performance, resulting in a sparse rating design, where there are limited observations of each rater relative to the complete sample of students. Although sparse rating designs can be constructed to facilitate estimation of student achievement, the…

Descriptors: Evaluators, Bias, Identification, Performance Based Assessment

Automated Essay Scoring and Revising Based on Open-Source Large Language Models

Peer reviewed

Yishen Song; Qianta Zhu; Huaibo Wang; Qinhua Zheng – IEEE Transactions on Learning Technologies, 2024

Manually scoring and revising student essays has long been a time-consuming task for educators. With the rise of natural language processing techniques, automated essay scoring (AES) and automated essay revising (AER) have emerged to alleviate this burden. However, current AES and AER models require large amounts of training data and lack…

Descriptors: Scoring, Essays, Writing Evaluation, Computer Software

Modeling and Analyzing Scorer Preferences in Short-Answer Math Questions

Peer reviewed

Zhang, Mengxue; Heffernan, Neil; Lan, Andrew – International Educational Data Mining Society, 2023

Automated scoring of student responses to open-ended questions, including short-answer questions, has great potential to scale to a large number of responses. Recent approaches for automated scoring rely on supervised learning, i.e., training classifiers or fine-tuning language models on a small number of responses with human-provided score…

Descriptors: Scoring, Computer Assisted Testing, Mathematics Instruction, Mathematics Tests

Combining Human and Automated Scoring Methods in Experimental Assessments of Writing: A Case Study Tutorial

Peer reviewed

Reagan Mozer; Luke Miratrix; Jackie Eunjung Relyea; James S. Kim – Journal of Educational and Behavioral Statistics, 2024

In a randomized trial that collects text as an outcome, traditional approaches for assessing treatment impact require that each document first be manually coded for constructs of interest by human raters. An impact analysis can then be conducted to compare treatment and control groups, using the hand-coded scores as a measured outcome. This…

Descriptors: Scoring, Evaluation Methods, Writing Evaluation, Comparative Analysis

More Efficient Processes for Creating Automated Essay Scoring Frameworks: A Demonstration of Two Algorithms

Peer reviewed

Shin, Jinnie; Gierl, Mark J. – Language Testing, 2021

Automated essay scoring (AES) has emerged as a secondary or as a sole marker for many high-stakes educational assessments, in native and non-native testing, owing to remarkable advances in feature engineering using natural language processing, machine learning, and deep-neural algorithms. The purpose of this study is to compare the effectiveness…

Descriptors: Scoring, Essays, Writing Evaluation, Computer Software

Meta-Analysis of Inter-Rater Agreement and Discrepancy Between Human and Automated English Essay Scoring

Peer reviewed

Jiyeo Yun – English Teaching, 2023

Studies on automatic scoring systems in writing assessments have also evaluated the relationship between human and machine scores for the reliability of automated essay scoring systems. This study investigated the magnitudes of indices for inter-rater agreement and discrepancy, especially regarding human and machine scoring, in writing assessment.…

Descriptors: Meta Analysis, Interrater Reliability, Essays, Scoring

Exploring the Impacts of Different Score Resolution Procedures on Person Fit and Estimated Achievement in Rater-Mediated Assessments

Peer reviewed

Wind, Stefanie A.; Walker, A. Adrienne – Language Assessment Quarterly, 2020

Scoring procedures for many rater-mediated performance assessments include score resolution procedures in which a third rater adjudicates discrepancies between two raters' ratings of the same performance. There are numerous approaches for calculating resolved scores that involve different combinations of the original and third ratings. Using data…

Descriptors: Scoring, Evaluators, Goodness of Fit, Content Area Writing

Accuracy and Reliability of Large Language Models in Assessing Learning Outcomes Achievement across Cognitive Domains

Peer reviewed

Swapna Haresh Teckwani; Amanda Huee-Ping Wong; Nathasha Vihangi Luke; Ivan Cherh Chiet Low – Advances in Physiology Education, 2024

The advent of artificial intelligence (AI), particularly large language models (LLMs) like ChatGPT and Gemini, has significantly impacted the educational landscape, offering unique opportunities for learning and assessment. In the realm of written assessment grading, traditionally viewed as a laborious and subjective process, this study sought to…

Descriptors: Accuracy, Reliability, Computational Linguistics, Standards

Automated Speech Scoring of Dialogue Response by Japanese Learners of English as a Foreign Language

Peer reviewed

Yuko Hayashi; Yusuke Kondo; Yutaka Ishii – Innovation in Language Learning and Teaching, 2024

Purpose: This study builds a new system for automatically assessing learners' speech elicited from an oral discourse completion task (DCT), and evaluates the prediction capability of the system with a view to better understanding factors deemed influential in predicting speaking proficiency scores and the pedagogical implications of the system.…

Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, Japanese

Validation of an Automated Procedure for Calculating Core Lexicon from Transcripts

Peer reviewed

Dalton, Sarah Grace; Stark, Brielle C.; Fromm, Davida; Apple, Kristen; MacWhinney, Brian; Rensch, Amanda; Rowedder, Madyson – Journal of Speech, Language, and Hearing Research, 2022

Purpose: The aim of this study was to advance the use of structured, monologic discourse analysis by validating an automated scoring procedure for core lexicon (CoreLex) using transcripts. Method: Forty-nine transcripts from persons with aphasia and 48 transcripts from persons with no brain injury were retrieved from the AphasiaBank database. Five…

Descriptors: Validity, Discourse Analysis, Databases, Scoring

Scoring Graphical Responses in TIMSS 2019 Using Artificial Neural Networks

Peer reviewed

von Davier, Matthias; Tyack, Lillian; Khorramdel, Lale – Educational and Psychological Measurement, 2023

Automated scoring of free drawings or images as responses has yet to be used in large-scale assessments of student achievement. In this study, we propose artificial neural networks to classify these types of graphical responses from a TIMSS 2019 item. We are comparing classification accuracy of convolutional and feed-forward approaches. Our…

Descriptors: Scoring, Networks, Artificial Intelligence, Elementary Secondary Education

Home-Grown Automated Essay Scoring in the Literature Classroom: A Solution for Managing the Crowd?

Peer reviewed

Uzun, Kutay – Contemporary Educational Technology, 2018

Managing crowded classes in terms of classroom assessment is a difficult task due to the amount of time which needs to be devoted to providing feedback to student products. In this respect, the present study aimed to develop an automated essay scoring environment as a potential means to overcome this problem. Secondarily, the study aimed to test…

Descriptors: Computer Assisted Testing, Essays, Scoring, English Literature

Assessing L2 English Speaking Using Automated Scoring Technology: Examining Automarker Reliability

Peer reviewed

Xu, Jing; Jones, Edmund; Laxton, Victoria; Galaczi, Evelina – Assessment in Education: Principles, Policy & Practice, 2021

Recent advances in machine learning have made automated scoring of learner speech widespread, and yet validation research that provides support for applying automated scoring technology to assessment is still in its infancy. Both the educational measurement and language assessment communities have called for greater transparency in describing…

Descriptors: Second Language Learning, Second Language Instruction, English (Second Language), Computer Software
