Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 6 |
Since 2006 (last 20 years) | 21 |
Descriptor
Correlation | 24 |
Evaluation Methods | 24 |
Essays | 20 |
Foreign Countries | 10 |
Scoring | 10 |
Computer Assisted Testing | 8 |
College Students | 7 |
Multiple Choice Tests | 7 |
Student Evaluation | 7 |
Writing Evaluation | 7 |
Comparative Analysis | 5 |
More ▼ |
Source
Author
Publication Type
Journal Articles | 19 |
Reports - Research | 13 |
Reports - Evaluative | 7 |
Information Analyses | 2 |
Collected Works - Proceedings | 1 |
Dissertations/Theses -… | 1 |
Reports - Descriptive | 1 |
Speeches/Meeting Papers | 1 |
Education Level
Higher Education | 13 |
Postsecondary Education | 6 |
Secondary Education | 4 |
Elementary Education | 2 |
High Schools | 2 |
Middle Schools | 2 |
Elementary Secondary Education | 1 |
Grade 3 | 1 |
Grade 4 | 1 |
Grade 5 | 1 |
Grade 6 | 1 |
More ▼ |
Audience
Location
United Kingdom | 3 |
California | 1 |
China | 1 |
Finland | 1 |
France | 1 |
Hong Kong | 1 |
South Korea | 1 |
Sweden | 1 |
Texas | 1 |
Turkey | 1 |
United Kingdom (Nottingham) | 1 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
Graduate Record Examinations | 2 |
Program for International… | 1 |
Test of English as a Foreign… | 1 |
Woodcock Johnson Tests of… | 1 |
What Works Clearinghouse Rating
Tahereh Firoozi; Okan Bulut; Mark J. Gierl – International Journal of Assessment Tools in Education, 2023
The proliferation of large language models represents a paradigm shift in the landscape of automated essay scoring (AES) systems, fundamentally elevating their accuracy and efficacy. This study presents an extensive examination of large language models, with a particular emphasis on the transformative influence of transformer-based models, such as…
Descriptors: Turkish, Writing Evaluation, Essays, Accuracy
Langlois, Jean; Bellemare, Christian; Toulouse, Josée; Wells, George A. – Anatomical Sciences Education, 2017
Anatomy knowledge has been found to include both spatial and non-spatial components. However, no systematic evaluation of studies relating spatial abilities and anatomy knowledge has been undertaken. The objective of this study was to conduct a systematic review of the relationship between spatial abilities test and anatomy knowledge assessment. A…
Descriptors: Anatomy, Spatial Ability, Knowledge Level, Correlation
Guo, Xiuyan; Lei, Pui-Wa – International Journal of Testing, 2020
Little research has been done on the effects of peer raters' quality characteristics on peer rating qualities. This study aims to address this gap and investigate the effects of key variables related to peer raters' qualities, including content knowledge, previous rating experience, training on rating tasks, and rating motivation. In an experiment…
Descriptors: Peer Evaluation, Error Patterns, Correlation, Knowledge Level
Breyer, F. Jay; Rupp, André A.; Bridgeman, Brent – ETS Research Report Series, 2017
In this research report, we present an empirical argument for the use of a contributory scoring approach for the 2-essay writing assessment of the analytical writing section of the "GRE"® test in which human and machine scores are combined for score creation at the task and section levels. The approach was designed to replace a currently…
Descriptors: College Entrance Examinations, Scoring, Essay Tests, Writing Evaluation
Buchanan, Phil – ProQuest LLC, 2016
This study is designed to gather information concerning a possible relationship between how dental students prefer to take in and communicate new information and how they prefer to be assessed. Though there are numerous references in the literature regarding the learning styles of students there are also references to the inaccuracy of such…
Descriptors: Correlation, Cognitive Style, Statistical Analysis, Dental Schools
MacArthur, Charles A.; Philippakos, Zoi A.; Graham, Steve – Learning Disability Quarterly, 2016
The purpose of the current study was to develop and validate a measure of motivation for use with basic college writers that would measure self-efficacy, achievement goals, beliefs, and affect. As part of a design research project on curriculum for community college developmental writing classes, 133 students in 11 classes completed the motivation…
Descriptors: College Students, Writing Instruction, Student Motivation, Evaluation Methods
Shukla, Archana; Chaudhary, Banshi D. – Education and Information Technologies, 2014
The quality of evaluation of essay type answer books involving multiple evaluators for courses with large number of enrollments is likely to be affected due to heterogeneity in experience, expertise and maturity of evaluators. In this paper, we present a strategy to detect anomalies in evaluation of essay type answers by multiple evaluators based…
Descriptors: Essays, Grading, Educational Strategies, Educational Quality
Smith, Dale L.; Cook, Patrick; Buskist, William – Teaching of Psychology, 2011
The perceived relation between assigned student grades and instructor evaluations of teaching has been the subject of much debate, though few laboratory studies have been conducted with adequate controls. Marsh and Roche suggested that experimental field studies may be a particularly promising avenue for further analyses of this relation. The…
Descriptors: Grades (Scholastic), Student Evaluation of Teacher Performance, Tests, Correlation
Ramineni, Chaitanya; Trapani, Catherine S.; Williamson, David M.; Davey, Tim; Bridgeman, Brent – ETS Research Report Series, 2012
Scoring models for the "e-rater"® system were built and evaluated for the "TOEFL"® exam's independent and integrated writing prompts. Prompt-specific and generic scoring models were built, and evaluation statistics, such as weighted kappas, Pearson correlations, standardized differences in mean scores, and correlations with…
Descriptors: Scoring, Prompting, Evaluators, Computer Software
Ramineni, Chaitanya; Trapani, Catherine S.; Williamson, David M.; Davey, Tim; Bridgeman, Brent – ETS Research Report Series, 2012
Automated scoring models for the "e-rater"® scoring engine were built and evaluated for the "GRE"® argument and issue-writing tasks. Prompt-specific, generic, and generic with prompt-specific intercept scoring models were built and evaluation statistics such as weighted kappas, Pearson correlations, standardized difference in…
Descriptors: Scoring, Test Scoring Machines, Automation, Models
Beers, Scott F.; Nagy, William E. – Reading and Writing: An Interdisciplinary Journal, 2009
This study examined the relationship of different measures of syntactic complexity with rated quality for two genres of text produced by middle school students. It was hypothesized that different measures would be associated with distinct aspects of syntactic complexity; words per clause with greater use of structures more typical of expository…
Descriptors: Middle School Students, Syntax, Essays, Grade 7
Isaksson, Sven – Assessment & Evaluation in Higher Education, 2008
A continuous classroom assessment technique, "Five-minute" essays, was applied during a short course called "Scientific Methods in Archaeology--Applications and Problems", given at the Archaeological Research Laboratory, Department of Archaeology and Classical Studies, Stockholm University, Sweden. There was a strong positive…
Descriptors: Minicourses, Student Evaluation, Archaeology, Foreign Countries
Furnham, Adrian; Christopher, Andrew; Garwood, Jeanette; Martin, Neil G. – Educational Psychology, 2008
More than 400 students from four universities in America and Britain completed measures of learning style preference, general knowledge (as a proxy for intelligence), and preference for examination method. Learning style was consistently associated with preferences: surface learners preferred multiple choice and group work options, and viewed…
Descriptors: Personality Traits, Cognitive Style, Demography, Theses
Kim, Seong-in; Hameed, Ibrahim A. – Art Therapy: Journal of the American Art Therapy Association, 2009
For mental health professionals, art assessment is a useful tool for patient evaluation and diagnosis. Consideration of various color-related elements is important in art assessment. This correlational study introduces the concept of variety of color as a new color-related element of an artwork. This term represents a comprehensive use of color,…
Descriptors: Mental Health Workers, Essays, Scoring, Visual Stimuli
Coniam, David – ReCALL, 2009
This paper describes a study of the computer essay-scoring program BETSY. While the use of computers in rating written scripts has been criticised in some quarters for lacking transparency or lack of fit with how human raters rate written scripts, a number of essay rating programs are available commercially, many of which claim to offer comparable…
Descriptors: Writing Tests, Scoring, Foreign Countries, Interrater Reliability
Previous Page | Next Page »
Pages: 1 | 2