ERIC - Search Results

Publication Date

In 2025	1
Since 2024	3
Since 2021 (last 5 years)	9
Since 2016 (last 10 years)	9
Since 2006 (last 20 years)	11

Descriptor

Computer Assisted Testing	12
Ethics	12
Scoring	12
Artificial Intelligence	5
Computer Software	4
Test Validity	4
Best Practices	3
Educational Assessment	3
Accountability	2
Accuracy	2
Automation	2
Computational Linguistics	2
Decision Making	2
Equal Education	2
Evaluation Methods	2
Grading	2
Models	2
Privacy	2
Psychometrics	2
Standardized Tests	2
Standards	2
Student Evaluation	2
Technology Uses in Education	2
Test Bias	2
Test Format	2
More ▼

Source

ACT, Inc.	1
Assessment	1
British Educational Research…	1
Communique	1
Innovations in Education and…	1
International Educational…	1
International Journal of…	1
Journal of Educational…	1
Measurement and Evaluation in…	1
National Council on…	1
Online Submission	1
Praeger	1
More ▼

Publication Type

Journal Articles	7
Reports - Research	6
Reports - Descriptive	3
Speeches/Meeting Papers	2
Books	1
Collected Works - General	1
Information Analyses	1
Reports - Evaluative	1

Education Level

Elementary Secondary Education	2
Higher Education	2
Postsecondary Education	2

Audience

Location

United Kingdom

Laws, Policies, & Programs

Family Educational Rights and…	1
Health Insurance Portability…	1
Individuals with Disabilities…	1

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 12 results Save | Export

Grading Exams Using Large Language Models: A Comparison between Human and AI Grading of Exams in Higher Education Using ChatGPT

Peer reviewed

Direct link

Jonas Flodén – British Educational Research Journal, 2025

This study compares how the generative AI (GenAI) large language model (LLM) ChatGPT performs in grading university exams compared to human teachers. Aspects investigated include consistency, large discrepancies and length of answer. Implications for higher education, including the role of teachers and ethics, are also discussed. Three…

Descriptors: College Faculty, Artificial Intelligence, Comparative Testing, Scoring

Reducing Workload in Short Answer Grading Using Machine Learning

Peer reviewed

Direct link

Rebecka Weegar; Peter Idestam-Almquist – International Journal of Artificial Intelligence in Education, 2024

Machine learning methods can be used to reduce the manual workload in exam grading, making it possible for teachers to spend more time on other tasks. However, when it comes to grading exams, fully eliminating manual work is not yet possible even with very accurate automated grading, as any grading mistakes could have significant consequences for…

Descriptors: Grading, Computer Assisted Testing, Introductory Courses, Computer Science Education

Assessing the Ethical Capabilities of Chat GPT in Healthcare: A Study on Its Proficiency in Situational Judgement Test

Peer reviewed

Direct link

Kunal Sareen – Innovations in Education and Teaching International, 2024

This study examines the proficiency of Chat GPT, an AI language model, in answering questions on the Situational Judgement Test (SJT), a widely used assessment tool for evaluating the fundamental competencies of medical graduates in the UK. A total of 252 SJT questions from the "Oxford Assess and Progress: Situational Judgement" Test…

Descriptors: Ethics, Decision Making, Artificial Intelligence, Computer Software

Validity Arguments Meet Artificial Intelligence in Innovative Educational Assessment

Peer reviewed

Direct link

Dorsey, David W.; Michaels, Hillary R. – Journal of Educational Measurement, 2022

We have dramatically advanced our ability to create rich, complex, and effective assessments across a range of uses through technology advancement. Artificial Intelligence (AI) enabled assessments represent one such area of advancement--one that has captured our collective interest and imagination. Scientists and practitioners within the domains…

Descriptors: Validity, Ethics, Artificial Intelligence, Evaluation Methods

Responsibilities of Users of Standardized Tests (Rust-4E)

Peer reviewed

Direct link

Lenz, A. Stephen; Ault, Haley; Balkin, Richard S.; Barrio Minton, Casey; Erford, Bradley T.; Hays, Danica G.; Kim, Bryan S. K.; Li, Chi – Measurement and Evaluation in Counseling and Development, 2022

In April 2021, The Association for Assessment and Research in Counseling Executive Council commissioned a time-referenced task group to revise the Responsibilities of Users of Standardized Tests (RUST) Statement (3rd edition) published by the Association for Assessment in Counseling (AAC) in 2003. The task group developed a work plan to implement…

Descriptors: Responsibility, Standardized Tests, Counselor Training, Ethics

Individual Fairness Evaluation for Automated Essay Scoring System

Peer reviewed
PDF on ERIC

Download full text

Doewes, Afrizal; Saxena, Akrati; Pei, Yulong; Pechenizkiy, Mykola – International Educational Data Mining Society, 2022

In Automated Essay Scoring (AES) systems, many previous works have studied group fairness using the demographic features of essay writers. However, individual fairness also plays an important role in fair evaluation and has not been yet explored. Initialized by Dwork et al., the fundamental concept of individual fairness is "similar people…

Descriptors: Scoring, Essays, Writing Evaluation, Comparative Analysis

Establishing Standards of Best Practice in Automated Scoring. ACT Research. Technical Brief

Download full text

Wood, Scott; Yao, Erin; Haisfield, Lisa; Lottridge, Susan – ACT, Inc., 2021

For assessment professionals who are also automated scoring (AS) professionals, there is no single set of standards of best practice. This paper reviews the assessment and AS literature to identify key standards of best practice and ethical behavior for AS professionals and codifies those standards in a single resource. Having a unified set of AS…

Descriptors: Standards, Best Practices, Computer Assisted Testing, Scoring

Results from NCME Survey on Revisions to the "Standards for Educational and Psychological Testing"

Download full text

Doris Zahner; Jeffrey T. Steedle; James Soland; Catherine Welch; Qi Qin; Kathryn Thompson; Richard Phelps – Online Submission, 2023

The "Standards for Educational and Psychological Testing" have served as a cornerstone for best practices in assessment. As the field evolves, so must these standards, with regular revisions ensuring they reflect current knowledge and practice. The National Council on Measurement in Education (NCME) conducted a survey to gather feedback…

Descriptors: Standards, Educational Assessment, Psychological Testing, Best Practices

Virtual Cognitive Assessment: Legal and Ethical Considerations

Direct link

Carlson, Tiffany; Crepeau-Hobson, Franci – Communique, 2021

When the coronavirus pandemic was declared a public health crisis in March 2020, school psychologists were forced into situations where face-to-face interaction with their students was discouraged and in some cases, prohibited. Consequently, the traditional practice of school psychology abruptly ended. Individualized Education Plans (IEP) and…

Descriptors: Cognitive Tests, Ethics, Decision Making, Models

Testing and Data Integrity in the Administration of Statewide Student Assessment Programs

Download full text

National Council on Measurement in Education, 2012

Testing and data integrity on statewide assessments is defined as the establishment of a comprehensive set of policies and procedures for: (1) the proper preparation of students; (2) the management and administration of the test(s) that will lead to accurate and appropriate reporting of assessment results; and (3) maintaining the security of…

Descriptors: State Programs, Integrity, Testing, Test Preparation

Ethical Perspectives and Practice Behaviors Involving Computer-Based Test Interpretation.

Peer reviewed

McMinn, Mark R.; Ellens, Brent M.; Soref, Erez – Assessment, 1999

Surveyed 364 members of the Society for Personality Assessment to determine how they use computer-based test interpretation software (CBTI) in their work, and their perspectives on the ethics of using CBTI. Psychologists commonly use CBTI for test scoring, but not to formulate a case or as an alternative to a written report. (SLD)

Descriptors: Behavior Patterns, Computer Assisted Testing, Computer Software, Ethics

Educational Measurement. Fourth Edition. ACE/Praeger Series on Higher Education

Direct link

Brennan, Robert L., Ed. – Praeger, 2006

"Educational Measurement" has been the bible in its field since the first edition was published by ACE in 1951. The importance of this fourth edition of "Educational Measurement" is to extensively update and extend the topics treated in the previous three editions. As such, the fourth edition documents progress in the field and…

Descriptors: Educational Testing, Educational Assessment, Test Validity, Test Reliability

Ault, Haley	1
Balkin, Richard S.	1
Barrio Minton, Casey	1
Brennan, Robert L., Ed.	1
Carlson, Tiffany	1
Catherine Welch	1
Crepeau-Hobson, Franci	1
Doewes, Afrizal	1
Doris Zahner	1
Dorsey, David W.	1
Ellens, Brent M.	1
Erford, Bradley T.	1
Haisfield, Lisa	1
Hays, Danica G.	1
James Soland	1
Jeffrey T. Steedle	1
Jonas Flodén	1
Kathryn Thompson	1
Kim, Bryan S. K.	1
Kunal Sareen	1
Lenz, A. Stephen	1
Li, Chi	1
Lottridge, Susan	1
McMinn, Mark R.	1
Michaels, Hillary R.	1
More ▼