Publication Date
In 2025 | 0 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 7 |
Since 2016 (last 10 years) | 8 |
Since 2006 (last 20 years) | 13 |
Descriptor
Source
International Educational… | 7 |
Grantee Submission | 3 |
American Psychologist | 1 |
Applied Measurement in… | 1 |
Applied Psychological… | 1 |
College Board | 1 |
Educational Testing Service | 1 |
Journal of Educational… | 1 |
Online Submission | 1 |
Author
Publication Type
Speeches/Meeting Papers | 66 |
Reports - Research | 31 |
Reports - Evaluative | 30 |
Journal Articles | 4 |
Opinion Papers | 4 |
Information Analyses | 2 |
Tests/Questionnaires | 2 |
Reference Materials -… | 1 |
Reports - Descriptive | 1 |
Education Level
Secondary Education | 3 |
Junior High Schools | 2 |
Middle Schools | 2 |
High Schools | 1 |
Higher Education | 1 |
Postsecondary Education | 1 |
Audience
Researchers | 13 |
Counselors | 1 |
Practitioners | 1 |
Location
Canada | 1 |
Iowa | 1 |
New Zealand | 1 |
United Kingdom | 1 |
Laws, Policies, & Programs
Education Consolidation… | 1 |
Elementary and Secondary… | 1 |
Assessments and Surveys
National Assessment of… | 4 |
ACT Assessment | 2 |
Flesch Kincaid Grade Level… | 1 |
Medical College Admission Test | 1 |
SAT (College Admission Test) | 1 |
Test of Standard Written… | 1 |
Wechsler Adult Intelligence… | 1 |
What Works Clearinghouse Rating
Sami Baral; Eamon Worden; Wen-Chiang Lim; Zhuang Luo; Christopher Santorelli; Ashish Gurung; Neil Heffernan – Grantee Submission, 2024
The effectiveness of feedback in enhancing learning outcomes is well documented within Educational Data Mining (EDM). Various prior research have explored methodologies to enhance the effectiveness of feedback to students in various ways. Recent developments in Large Language Models (LLMs) have extended their utility in enhancing automated…
Descriptors: Automation, Scoring, Computer Assisted Testing, Natural Language Processing
Zhang, Mengxue; Baral, Sami; Heffernan, Neil; Lan, Andrew – International Educational Data Mining Society, 2022
Automatic short answer grading is an important research direction in the exploration of how to use artificial intelligence (AI)-based tools to improve education. Current state-of-the-art approaches use neural language models to create vectorized representations of students responses, followed by classifiers to predict the score. However, these…
Descriptors: Grading, Mathematics Instruction, Artificial Intelligence, Form Classes (Languages)
Botarleanu, Robert-Mihai; Dascalu, Mihai; Allen, Laura K.; Crossley, Scott Andrew; McNamara, Danielle S. – Grantee Submission, 2022
Automated scoring of student language is a complex task that requires systems to emulate complex and multi-faceted human evaluation criteria. Summary scoring brings an additional layer of complexity to automated scoring because it involves two texts of differing lengths that must be compared. In this study, we present our approach to automate…
Descriptors: Automation, Scoring, Documentation, Likert Scales
Zhang, Mengxue; Heffernan, Neil; Lan, Andrew – International Educational Data Mining Society, 2023
Automated scoring of student responses to open-ended questions, including short-answer questions, has great potential to scale to a large number of responses. Recent approaches for automated scoring rely on supervised learning, i.e., training classifiers or fine-tuning language models on a small number of responses with human-provided score…
Descriptors: Scoring, Computer Assisted Testing, Mathematics Instruction, Mathematics Tests
Baral, Sami; Botelho, Anthony F.; Erickson, John A.; Benachamardi, Priyanka; Heffernan, Neil T. – International Educational Data Mining Society, 2021
Open-ended questions in mathematics are commonly used by teachers to monitor and assess students' deeper conceptual understanding of content. Student answers to these types of questions often exhibit a combination of language, drawn diagrams and tables, and mathematical formulas and expressions that supply teachers with insight into the processes…
Descriptors: Scoring, Automation, Mathematics Tests, Student Evaluation
Botarleanu, Robert-Mihai; Dascalu, Mihai; Allen, Laura K.; Crossley, Scott Andrew; McNamara, Danielle S. – Grantee Submission, 2021
Text summarization is an effective reading comprehension strategy. However, summary evaluation is complex and must account for various factors including the summary and the reference text. This study examines a corpus of approximately 3,000 summaries based on 87 reference texts, with each summary being manually scored on a 4-point Likert scale.…
Descriptors: Computer Assisted Testing, Scoring, Natural Language Processing, Computer Software
Doewes, Afrizal; Saxena, Akrati; Pei, Yulong; Pechenizkiy, Mykola – International Educational Data Mining Society, 2022
In Automated Essay Scoring (AES) systems, many previous works have studied group fairness using the demographic features of essay writers. However, individual fairness also plays an important role in fair evaluation and has not been yet explored. Initialized by Dwork et al., the fundamental concept of individual fairness is "similar people…
Descriptors: Scoring, Essays, Writing Evaluation, Comparative Analysis
Mostow, Jack; Gates, Donna; Ellison, Ross; Goutam, Rahul – International Educational Data Mining Society, 2015
Vocabulary knowledge is crucial to literacy development and academic success. Previous research has shown learning the meaning of a word requires encountering it in diverse informative contexts. In this work, we try to identify "nutritious" contexts for a word--contexts that help students build a rich mental representation of the word's…
Descriptors: Nutrition, Vocabulary Development, Accuracy, Scoring
Gautam, Dipesh; Swiecki, Zachari; Shaffer, David W.; Graesser, Arthur C.; Rus, Vasile – International Educational Data Mining Society, 2017
Virtual internships are online simulations of professional practice where students play the role of interns at a fictional company. During virtual internships, participants complete activities and then submit write-ups in the form of short answers, digital notebook entries. Prior work used classifiers trained on participant data to automatically…
Descriptors: Computer Simulation, Internship Programs, Semantics, College Students
Ostrow, Korinn; Donnelly, Chistopher; Heffernan, Neil – International Educational Data Mining Society, 2015
As adaptive tutoring systems grow increasingly popular for the completion of classwork and homework, it is crucial to assess the manner in which students are scored within these platforms. The majority of systems, including ASSISTments, return the binary correctness of a student's first attempt at solving each problem. Yet for many teachers,…
Descriptors: Intelligent Tutoring Systems, Scoring, Testing, Credits
Haertel, Edward H. – Educational Testing Service, 2013
Policymakers and school administrators have embraced value-added models of teacher effectiveness as tools for educational improvement. Teacher value-added estimates may be viewed as complicated scores of a certain kind. This suggests using a test validation model to examine their reliability and validity. Validation begins with an interpretive…
Descriptors: Reliability, Validity, Inferences, Teacher Effectiveness
Engelhard, George, Jr.; Wind, Stefanie A. – College Board, 2013
The major purpose of this study is to examine the quality of ratings assigned to CR (constructed-response) questions in large-scale assessments from the perspective of Rasch Measurement Theory. Rasch Measurement Theory provides a framework for the examination of rating scale category structure that can yield useful information for interpreting the…
Descriptors: Measurement Techniques, Rating Scales, Test Theory, Scores
Lang, W. Steve; Wilkerson, Judy R. – Online Submission, 2008
The construct of dispositions is defined in national standards, and colleges of education are required to assess candidate dispositions to meet accreditation requirements. Similarly, there is a need to review teacher dispositions in making hiring decisions about teachers, although this need may not yet be realized. Measurement is virtually…
Descriptors: Personality, Measurement Techniques, Measures (Individuals), Scoring
Some Empirical Results of Using Non-Linear Scoring Procedures for Yudofsky's Overt Aggression Scale.
Carifio, James; Lanza, Marilyn – 1990
The Yudofsky scale is considered to be one of the best scales for measuring aggressive behavior developed to date. One of the chief shortcomings of the scale, however, is appropriate methods for scoring it in ways that make the resulting scores well-suited for data analyses. The basic scoring problem with the Yudofsky scale is that the scale is…
Descriptors: Aggression, Models, Predictive Validity, Psychological Testing
Harris, Richard J. – 1992
Interpretation of emergent variables on the basis of structure coefficients (zero order correlations between original and emergent variables) is potentially very misleading and should be avoided in favor of interpretation on the basis of scoring coefficients. This is most apparent in multiple regression analysis and its special case, two-group…
Descriptors: Correlation, Discriminant Analysis, Mathematical Models, Multiple Regression Analysis