Showing 1 to 15 of 24 results
Peer reviewed
Tahereh Firoozi; Okan Bulut; Mark J. Gierl – International Journal of Assessment Tools in Education, 2023
The proliferation of large language models represents a paradigm shift in the landscape of automated essay scoring (AES) systems, fundamentally elevating their accuracy and efficacy. This study presents an extensive examination of large language models, with a particular emphasis on the transformative influence of transformer-based models, such as…
Descriptors: Turkish, Writing Evaluation, Essays, Accuracy
Peer reviewed
Langlois, Jean; Bellemare, Christian; Toulouse, Josée; Wells, George A. – Anatomical Sciences Education, 2017
Anatomy knowledge has been found to include both spatial and non-spatial components. However, no systematic evaluation of studies relating spatial abilities and anatomy knowledge has been undertaken. The objective of this study was to conduct a systematic review of the relationship between spatial abilities test and anatomy knowledge assessment. A…
Descriptors: Anatomy, Spatial Ability, Knowledge Level, Correlation
Peer reviewed
Guo, Xiuyan; Lei, Pui-Wa – International Journal of Testing, 2020
Little research has been done on the effects of peer raters' quality characteristics on peer rating qualities. This study aims to address this gap and investigate the effects of key variables related to peer raters' qualities, including content knowledge, previous rating experience, training on rating tasks, and rating motivation. In an experiment…
Descriptors: Peer Evaluation, Error Patterns, Correlation, Knowledge Level
Peer reviewed
Breyer, F. Jay; Rupp, André A.; Bridgeman, Brent – ETS Research Report Series, 2017
In this research report, we present an empirical argument for the use of a contributory scoring approach for the 2-essay writing assessment of the analytical writing section of the "GRE"® test in which human and machine scores are combined for score creation at the task and section levels. The approach was designed to replace a currently…
Descriptors: College Entrance Examinations, Scoring, Essay Tests, Writing Evaluation
Buchanan, Phil – ProQuest LLC, 2016
This study is designed to gather information concerning a possible relationship between how dental students prefer to take in and communicate new information and how they prefer to be assessed. Though there are numerous references in the literature regarding the learning styles of students there are also references to the inaccuracy of such…
Descriptors: Correlation, Cognitive Style, Statistical Analysis, Dental Schools
Peer reviewed
MacArthur, Charles A.; Philippakos, Zoi A.; Graham, Steve – Learning Disability Quarterly, 2016
The purpose of the current study was to develop and validate a measure of motivation for use with basic college writers that would measure self-efficacy, achievement goals, beliefs, and affect. As part of a design research project on curriculum for community college developmental writing classes, 133 students in 11 classes completed the motivation…
Descriptors: College Students, Writing Instruction, Student Motivation, Evaluation Methods
Peer reviewed
Shukla, Archana; Chaudhary, Banshi D. – Education and Information Technologies, 2014
The quality of evaluation of essay type answer books involving multiple evaluators for courses with large number of enrollments is likely to be affected due to heterogeneity in experience, expertise and maturity of evaluators. In this paper, we present a strategy to detect anomalies in evaluation of essay type answers by multiple evaluators based…
Descriptors: Essays, Grading, Educational Strategies, Educational Quality
Peer reviewed
Smith, Dale L.; Cook, Patrick; Buskist, William – Teaching of Psychology, 2011
The perceived relation between assigned student grades and instructor evaluations of teaching has been the subject of much debate, though few laboratory studies have been conducted with adequate controls. Marsh and Roche suggested that experimental field studies may be a particularly promising avenue for further analyses of this relation. The…
Descriptors: Grades (Scholastic), Student Evaluation of Teacher Performance, Tests, Correlation
Peer reviewed
Ramineni, Chaitanya; Trapani, Catherine S.; Williamson, David M.; Davey, Tim; Bridgeman, Brent – ETS Research Report Series, 2012
Scoring models for the "e-rater"® system were built and evaluated for the "TOEFL"® exam's independent and integrated writing prompts. Prompt-specific and generic scoring models were built, and evaluation statistics, such as weighted kappas, Pearson correlations, standardized differences in mean scores, and correlations with…
Descriptors: Scoring, Prompting, Evaluators, Computer Software
Peer reviewed
Ramineni, Chaitanya; Trapani, Catherine S.; Williamson, David M.; Davey, Tim; Bridgeman, Brent – ETS Research Report Series, 2012
Automated scoring models for the "e-rater"® scoring engine were built and evaluated for the "GRE"® argument and issue-writing tasks. Prompt-specific, generic, and generic with prompt-specific intercept scoring models were built and evaluation statistics such as weighted kappas, Pearson correlations, standardized difference in…
Descriptors: Scoring, Test Scoring Machines, Automation, Models
Peer reviewed
Beers, Scott F.; Nagy, William E. – Reading and Writing: An Interdisciplinary Journal, 2009
This study examined the relationship of different measures of syntactic complexity with rated quality for two genres of text produced by middle school students. It was hypothesized that different measures would be associated with distinct aspects of syntactic complexity; words per clause with greater use of structures more typical of expository…
Descriptors: Middle School Students, Syntax, Essays, Grade 7
Peer reviewed
Isaksson, Sven – Assessment & Evaluation in Higher Education, 2008
A continuous classroom assessment technique, "Five-minute" essays, was applied during a short course called "Scientific Methods in Archaeology--Applications and Problems", given at the Archaeological Research Laboratory, Department of Archaeology and Classical Studies, Stockholm University, Sweden. There was a strong positive…
Descriptors: Minicourses, Student Evaluation, Archaeology, Foreign Countries
Peer reviewed
Furnham, Adrian; Christopher, Andrew; Garwood, Jeanette; Martin, Neil G. – Educational Psychology, 2008
More than 400 students from four universities in America and Britain completed measures of learning style preference, general knowledge (as a proxy for intelligence), and preference for examination method. Learning style was consistently associated with preferences: surface learners preferred multiple choice and group work options, and viewed…
Descriptors: Personality Traits, Cognitive Style, Demography, Theses
Peer reviewed
Kim, Seong-in; Hameed, Ibrahim A. – Art Therapy: Journal of the American Art Therapy Association, 2009
For mental health professionals, art assessment is a useful tool for patient evaluation and diagnosis. Consideration of various color-related elements is important in art assessment. This correlational study introduces the concept of variety of color as a new color-related element of an artwork. This term represents a comprehensive use of color,…
Descriptors: Mental Health Workers, Essays, Scoring, Visual Stimuli
Peer reviewed
Coniam, David – ReCALL, 2009
This paper describes a study of the computer essay-scoring program BETSY. While the use of computers in rating written scripts has been criticised in some quarters for lacking transparency or lack of fit with how human raters rate written scripts, a number of essay rating programs are available commercially, many of which claim to offer comparable…
Descriptors: Writing Tests, Scoring, Foreign Countries, Interrater Reliability