Publication Date
In 2025 | 0 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 3 |
Since 2016 (last 10 years) | 3 |
Since 2006 (last 20 years) | 3 |
Descriptor
Author
Chen, Dandan | 1 |
Hebert, Michael | 1 |
Kim, James S. | 1 |
Miratrixy, Luke | 1 |
Mozer, Reagan | 1 |
Relyea, Jackie Eunjung | 1 |
Wilson, Joshua | 1 |
Yi Gui | 1 |
Publication Type
Reports - Research | 2 |
Dissertations/Theses -… | 1 |
Journal Articles | 1 |
Education Level
Early Childhood Education | 3 |
Elementary Education | 3 |
Primary Education | 3 |
Middle Schools | 2 |
Elementary Secondary Education | 1 |
Grade 1 | 1 |
Grade 10 | 1 |
Grade 11 | 1 |
Grade 2 | 1 |
Grade 3 | 1 |
Grade 4 | 1 |
More ▼ |
Audience
Location
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Yi Gui – ProQuest LLC, 2024
This study explores using transfer learning in machine learning for natural language processing (NLP) to create generic automated essay scoring (AES) models, providing instant online scoring for statewide writing assessments in K-12 education. The goal is to develop an instant online scorer that is generalizable to any prompt, addressing the…
Descriptors: Writing Tests, Natural Language Processing, Writing Evaluation, Scoring
Chen, Dandan; Hebert, Michael; Wilson, Joshua – American Educational Research Journal, 2022
We used multivariate generalizability theory to examine the reliability of hand-scoring and automated essay scoring (AES) and to identify how these scoring methods could be used in conjunction to optimize writing assessment. Students (n = 113) included subsamples of struggling writers and non-struggling writers in Grades 3-5 drawn from a larger…
Descriptors: Reliability, Scoring, Essays, Automation
Mozer, Reagan; Miratrixy, Luke; Relyea, Jackie Eunjung; Kim, James S. – Annenberg Institute for School Reform at Brown University, 2021
In a randomized trial that collects text as an outcome, traditional approaches for assessing treatment impact require that each document first be manually coded for constructs of interest by human raters. An impact analysis can then be conducted to compare treatment and control groups, using the hand-coded scores as a measured outcome. This…
Descriptors: Scoring, Automation, Data Analysis, Natural Language Processing