Showing 1 to 15 of 129 results
Peer reviewed
Andersen, Øistein E.; Yuan, Zheng; Watson, Rebecca; Cheung, Kevin Yet Fong – International Educational Data Mining Society, 2021
Automated essay scoring (AES), where natural language processing is applied to score written text, can underpin educational resources in blended and distance learning. AES performance has typically been reported in terms of correlation coefficients or agreement statistics calculated between a system and an expert human examiner. We describe the…
Descriptors: Evaluation Methods, Scoring, Essays, Computer Assisted Testing
Winter, Phoebe C.; Hansen, Mark; McCoy, Michelle – National Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2019
In order to accurately assess the English language proficiency of special populations of English learners, student assessment programs must maintain the comparability of standard and modified assessment formats, allowing for equivalent inferences to be made across student classifications. However, given the typically small size of special…
Descriptors: English Language Learners, Language Proficiency, Student Evaluation, Evaluation Methods
Peer reviewed
Baral, Sami; Botelho, Anthony F.; Erickson, John A.; Benachamardi, Priyanka; Heffernan, Neil T. – International Educational Data Mining Society, 2021
Open-ended questions in mathematics are commonly used by teachers to monitor and assess students' deeper conceptual understanding of content. Student answers to these types of questions often exhibit a combination of language, drawn diagrams and tables, and mathematical formulas and expressions that supply teachers with insight into the processes…
Descriptors: Scoring, Automation, Mathematics Tests, Student Evaluation
Kim, Dong-In; Julian, Marc; Hermann, Pam – Online Submission, 2022
In test equating, one critical equating property is the group invariance property which indicates that the equating function used to convert performance on each alternate form to the reporting scale should be the same for various subgroups. To mitigate the impact of disrupted learning on the item parameters during the COVID-19 pandemic, a…
Descriptors: COVID-19, Pandemics, Test Format, Equated Scores
Tomkowicz, Joanna; Kim, Dong-In; Wan, Ping – Online Submission, 2022
In this study we evaluated the stability of item parameters and student scores, using the pre-equated (pre-pandemic) parameters from Spring 2019 and post-equated (post-pandemic) parameters from Spring 2021 in two calibration and equating designs related to item parameter treatment: re-estimating all anchor parameters (Design 1) and holding the…
Descriptors: Equated Scores, Test Items, Evaluation Methods, Pandemics
Peer reviewed
Zhongdi Wu; Eric Larson; Makoto Sano; Doris Baker; Nathan Gage; Akihito Kamata – Grantee Submission, 2023
In this investigation we propose new machine learning methods for automated scoring models that predict the vocabulary acquisition in science and social studies of second grade English language learners, based upon free-form spoken responses. We evaluate performance on an existing dataset and use transfer learning from a large pre-trained language…
Descriptors: Prediction, Vocabulary Development, English (Second Language), Second Language Learning
Peer reviewed
Gary Weiser; Alison K. Billman; Christopher J. Harris; Lauren M. Brodsky; Damelin Daniel – Grantee Submission, 2022
The "Framework" and NGSS bring to the forefront the role of language in doing science and in learning from doing science. Yet, most existing science assessments for elementary learners do not integrate or attend to aspects of scientific language and literacy that are essential components of science proficiency. Accordingly, there is a…
Descriptors: Standards, Language Role, Science Instruction, Science Tests
Burfitt, Joan – Mathematics Education Research Group of Australasia, 2017
Multiple-choice items are used in large-scale assessments of mathematical achievement for secondary students in many countries. Research findings can be implemented to improve the quality of the items and hence increase the amount of information gathered about student learning from each item. One way to achieve this is to create items for which…
Descriptors: Multiple Choice Tests, Mathematics Tests, Credits, Knowledge Level
Peer reviewed
Feranchak, Bret; Deiger, Megan – AERA Online Paper Repository, 2017
Increasingly, content area projects and programs at the K-12 level, such as in mathematics, involve a programmatic component or project emphasis on developing "teacher leadership." However, there is no consistent definition or framework for this construct and even fewer validated tools for measuring it. This paper describes our efforts in…
Descriptors: Teacher Leadership, Mathematics Instruction, Guidelines, Elementary Secondary Education
Mostow, Jack; Gates, Donna; Ellison, Ross; Goutam, Rahul – International Educational Data Mining Society, 2015
Vocabulary knowledge is crucial to literacy development and academic success. Previous research has shown learning the meaning of a word requires encountering it in diverse informative contexts. In this work, we try to identify "nutritious" contexts for a word--contexts that help students build a rich mental representation of the word's…
Descriptors: Nutrition, Vocabulary Development, Accuracy, Scoring
Jacobs, George M.; Greliche, Nicholas – Online Submission, 2015
This article seeks to explain why, even in norm referenced assessment environments, students and other stakeholders should not be concerned that students who help peers learn are negatively impacting their own assessments. The article opens with a review of assessment options: norm referenced, criterion referenced and ipsative. Next, Social…
Descriptors: Norm Referenced Tests, Criterion Referenced Tests, Peer Influence, Social Theories
Peer reviewed
Cotter, Matthew; Hinkelman, Don – Research-publishing.net, 2019
Assessing student presentations can be made more reliable with video-recording and post-performance rating. Further, self assessment and peer assessment can aid in the learning process by students when using specific, easy-to-understand rubrics. A ten-year action research study involved video-recorded performance assessment tasks using a free,…
Descriptors: Student Evaluation, Video Technology, Self Evaluation (Individuals), Peer Evaluation
Ostrow, Korinn; Donnelly, Chistopher; Heffernan, Neil – International Educational Data Mining Society, 2015
As adaptive tutoring systems grow increasingly popular for the completion of classwork and homework, it is crucial to assess the manner in which students are scored within these platforms. The majority of systems, including ASSISTments, return the binary correctness of a student's first attempt at solving each problem. Yet for many teachers,…
Descriptors: Intelligent Tutoring Systems, Scoring, Testing, Credits
Peer reviewed
Xiong, Wenting; Litman, Diane – Grantee Submission, 2014
We propose a novel unsupervised extractive approach for summarizing online reviews by exploiting review helpfulness ratings. In addition to using the helpfulness ratings for review-level filtering, we suggest using them as the supervision of a topic model for sentence-level content scoring. The proposed method is metadata-driven, requiring no…
Descriptors: User Satisfaction (Information), Electronic Publishing, Documentation, Metadata
Peer reviewed
Crossley, Scott; Allen, Laura K.; Snow, Erica L.; McNamara, Danielle S. – Grantee Submission, 2015
This study investigates a new approach to automatically assessing essay quality that combines traditional approaches based on assessing textual features with new approaches that measure student attributes such as demographic information, standardized test scores, and survey results. The results demonstrate that combining both text features and…
Descriptors: Automation, Scoring, Essays, Evaluation Methods