Akif Avcu – Malaysian Online Journal of Educational Technology, 2025
This scope-review presents the milestones of how Hierarchical Rater Models (HRMs) became operable for use in automated essay scoring (AES) to improve instructional evaluation. Although essay evaluations--a useful instrument for evaluating higher-order cognitive abilities--have always depended on human raters, concerns regarding rater bias,…
Descriptors: Automation, Scoring, Models, Educational Assessment
Wheeler, Jordan M.; Engelhard, George; Wang, Jue – Measurement: Interdisciplinary Research and Perspectives, 2022
Objectively scoring constructed-response items on educational assessments has long been a challenge due to the use of human raters. Even well-trained raters using a rubric can inaccurately assess essays. Unfolding models measure raters' scoring accuracy by capturing the discrepancy between criterion and operational ratings by placing essays on an…
Descriptors: Accuracy, Scoring, Statistical Analysis, Models
Hosnia M. M. Ahmed; Shaymaa E. Sorour – Education and Information Technologies, 2024
Evaluating the quality of university exam papers is crucial for universities seeking institutional and program accreditation. Currently, exam papers are assessed manually, a process that can be tedious, lengthy, and in some cases, inconsistent. This is often due to the focus on assessing only the formal specifications of exam papers. This study…
Descriptors: Higher Education, Artificial Intelligence, Writing Evaluation, Natural Language Processing
Yanxuan Qu; Sandip Sinharay – ETS Research Report Series, 2024
The goal of this paper is to find better ways to estimate the internal consistency reliability of scores on tests with a specific type of design that are often encountered in practice: tests with constructed-response items clustered into sections that are not parallel or tau-equivalent, and one of the sections has only one item. To estimate the…
Descriptors: Test Reliability, Essay Tests, Construct Validity, Error of Measurement
Cleophas, Catherine; Hönnige, Christoph; Meisel, Frank; Meyer, Philipp – INFORMS Transactions on Education, 2023
As the COVID-19 pandemic motivated a shift to virtual teaching, exams have increasingly moved online too. Detecting cheating through collusion is not easy when tech-savvy students take online exams at home and on their own devices. Such online at-home exams may tempt students to collude and share materials and answers. However, online exams'…
Descriptors: Computer Assisted Testing, Cheating, Identification, Essay Tests
Jussi S. Jauhiainen; Agustin Bernardo Garagorry Guerra – Journal of Information Technology Education: Innovations in Practice, 2025
Aim/Purpose: This article investigates the process of identifying and correcting hallucinations in ChatGPT-4's recall of student-written responses as well as its evaluation of these responses, and provision of feedback. Effective prompting is examined to enhance the pre-evaluation, evaluation, and post-evaluation stages. Background: Advanced Large…
Descriptors: Artificial Intelligence, Student Evaluation, Writing Evaluation, Feedback (Response)
Angeline S. Lillard; Jessica Taggart – College Teaching, 2024
In large lecture courses, it can be challenging to imagine assessing student learning in ways other than multiple-choice exams and traditional point-based grading. Inspired by major pedagogical principles shared by Maria Montessori and Thomas Jefferson and supported by current understandings of effective teaching, assessment was reimagined in a…
Descriptors: College Faculty, Undergraduate Students, Teaching Assistants, Child Psychology
Naima Debbar – International Journal of Contemporary Educational Research, 2024
Intelligent essay grading systems are important tools in educational technology. They can replace much of the manual scoring effort and also provide instructional feedback. These systems typically include two main parts: a feature extractor and an automatic grading model. The latter is generally based on computational and…
Descriptors: Test Scoring Machines, Computer Uses in Education, Artificial Intelligence, Essay Tests
Wendler, Cathy; Glazer, Nancy; Bridgeman, Brent – Applied Measurement in Education, 2020
Efficient constructed response (CR) scoring requires both accuracy and speed from human raters. This study was designed to determine if setting scoring rate expectations would encourage raters to score at a faster pace, and if so, if there would be differential effects on scoring accuracy for raters who score at different rates. Three rater groups…
Descriptors: Scoring, Expectation, Accuracy, Time
Aulia Dhita Nanda; Rusdi Hasan; Akhmad Sukri; Marheny Lukitasari; Alice Tonido Rivera – Journal of Biological Education Indonesia (Jurnal Pendidikan Biologi Indonesia), 2023
Higher-order thinking skills (HOTS) are among the skills needed for 21st century challenges, especially for students. The aim of this study was to describe HOTS in students, especially the cognitive domain of analyzing and evaluating. This is a descriptive quantitative study employing a Pretest-Posttest One Group research design. The experiment…
Descriptors: Thinking Skills, Problem Based Learning, Teaching Methods, Skill Development
Lowe, Harriet – International Journal of Teaching and Learning in Higher Education, 2022
The educational value of portfolios as assessments has been widely acknowledged across the higher education sector and literature as providing a platform to promote student-centred and reflective learning (Brown, 1997; Snadden & Thomas, 1998; Karlowicz, 2000). While there is plentiful research investigating the benefits of providing portfolios…
Descriptors: Foreign Countries, Student Attitudes, College Students, Evaluation
Santi Lestari – Research Matters, 2024
Despite the increasing ubiquity of computer-based tests, many general qualifications examinations remain in a paper-based mode. Insufficient and unequal digital provision across schools is often identified as a major barrier to a full adoption of computer-based exams for general qualifications. One way to overcome this barrier is a gradual…
Descriptors: Keyboarding (Data Entry), Handwriting, Test Format, Comparative Analysis
Yelisey A. Shapovalov – ProQuest LLC, 2024
The Ethical Reasoning in Action program leveraged program assessment data to further promote student learning based on their effective educational framework for ethical reasoning: The Eight Key Question (8KQ) strategy. A cross-disciplinary team of five core researchers launched the first constructivist qualitative inquiry into students' ethical…
Descriptors: Ethics, Thinking Skills, Student Attitudes, Essay Tests
Ikmanisa Khairati; L. Lufri; Muhyiatul Fadilah – Journal of Biological Education Indonesia (Jurnal Pendidikan Biologi Indonesia), 2025
Education for Sustainable Development (ESD) serves as a key accelerator for achieving the Sustainable Development Goals (SDGs), emphasizing systems thinking as an essential competency that must be cultivated in the learning process. This study investigates students' systems thinking skills within the ESD framework through assessments on…
Descriptors: Systems Approach, Thinking Skills, Sustainable Development, Biology
Sone, Enongene Mirabeau; Oluwasuji, Olutoba Gboyega – Practical Assessment, Research & Evaluation, 2021
The paper attempts to give an overview of evaluation in higher education institutions with particular emphasis on the faculties of humanities, education and social sciences disciplines at the University of Eswatini (Swaziland) in Southern Africa. It describes the general methodology of evaluation and identifies obstacles and relevant strategies…
Descriptors: Foreign Countries, Universities, Evaluation Methods, Student Evaluation