ERIC - Search Results

Publication Date

In 2026	0
Since 2025	2
Since 2022 (last 5 years)	5
Since 2017 (last 10 years)	10
Since 2007 (last 20 years)	16

Descriptor

Computer Assisted Testing	20
High Stakes Tests	20
Scoring	20
Automation	8
Educational Technology	8
Foreign Countries	8
Evaluation Methods	7
Student Evaluation	6
Artificial Intelligence	5
Essays	5
Adaptive Testing	4
Feedback (Response)	4
Language Tests	4
Portfolio Assessment	4
Psychometrics	4
Summative Evaluation	4
Test Items	4
Comparative Analysis	3
Computer Software	3
Correlation	3
Educational Assessment	3
English (Second Language)	3
Formative Evaluation	3
Grading	3
Scores	3
More ▼

Source

Canadian Journal of Learning…	2
Journal of Educational…	2
Applied Measurement in…	1
Berkeley Review of Education	1
Computers & Education	1
ETS Research Report Series	1
Educational Process:…	1
Educational Testing Service	1
IAP - Information Age…	1
Journal of Applied Testing…	1
Journal of Technology,…	1
Language Assessment Quarterly	1
London Review of Education	1
ProQuest LLC	1
Society for Research on…	1
Technology, Pedagogy and…	1
More ▼

Publication Type

Journal Articles	14
Reports - Research	10
Reports - Evaluative	5
Reports - Descriptive	2
Books	1
Collected Works - General	1
Dissertations/Theses -…	1
Information Analyses	1
Numerical/Quantitative Data	1

Education Level

Elementary Secondary Education	5
Higher Education	4
Postsecondary Education	3
Secondary Education	3
Grade 8	1
High Schools	1
Middle Schools	1

Audience

Location

Australia	3
Canada	1
China	1
Greece	1
Israel	1
Maryland	1
Massachusetts	1
United Kingdom (England)	1

Laws, Policies, & Programs

Assessments and Surveys

Test of English as a Foreign…	2
Graduate Record Examinations	1
Praxis Series	1

What Works Clearinghouse Rating

Showing 1 to 15 of 20 results Save | Export

Evaluating the Consistency and Reliability of Attribution Methods in Automated Short Answer Grading (ASAG) Systems: Toward an Explainable Scoring System

Peer reviewed

Direct link

Wallace N. Pinto Jr.; Jinnie Shin – Journal of Educational Measurement, 2025

In recent years, the application of explainability techniques to automated essay scoring and automated short-answer grading (ASAG) models, particularly those based on transformer architectures, has gained significant attention. However, the reliability and consistency of these techniques remain underexplored. This study systematically investigates…

Descriptors: Automation, Grading, Computer Assisted Testing, Scoring

Validity Arguments Meet Artificial Intelligence in Innovative Educational Assessment

Peer reviewed

Direct link

Dorsey, David W.; Michaels, Hillary R. – Journal of Educational Measurement, 2022

We have dramatically advanced our ability to create rich, complex, and effective assessments across a range of uses through technology advancement. Artificial Intelligence (AI) enabled assessments represent one such area of advancement--one that has captured our collective interest and imagination. Scientists and practitioners within the domains…

Descriptors: Validity, Ethics, Artificial Intelligence, Evaluation Methods

Can AI Grade Like a Human? Validity, Reliability, and Fairness in University Coursework Assessment

Peer reviewed
PDF on ERIC

Download full text

Georgios Zacharis; Stamatios Papadakis – Educational Process: International Journal, 2025

Background/purpose: Generative artificial intelligence (GenAI) is often promoted as a transformative tool for assessment, yet evidence of its validity compared to human raters remains limited. This study examined whether an AI-based rater could be used interchangeably with trained faculty in scoring complex coursework. Materials/methods:…

Descriptors: Artificial Intelligence, Technology Uses in Education, Computer Assisted Testing, Grading

Rise of the Machines? The Evolving Role of AI Technologies in High-Stakes Assessment

Peer reviewed
PDF on ERIC

Download full text

Richardson, Mary; Clesham, Rose – London Review of Education, 2021

Our world has been transformed by technologies incorporating artificial intelligence (AI) within mass communication, employment, entertainment and many other aspects of our daily lives. However, within the domain of education, it seems that our ways of working and, particularly, assessing have hardly changed at all. We continue to prize…

Descriptors: Artificial Intelligence, High Stakes Tests, Computer Assisted Testing, Educational Change

Machine Learning, Natural Language Processing, and Psychometrics. The MARCES Book Series

Direct link

Hong Jiao, Editor; Robert W. Lissitz, Editor – IAP - Information Age Publishing, Inc., 2024

With the exponential increase of digital assessment, different types of data in addition to item responses become available in the measurement process. One of the salient features in digital assessment is that process data can be easily collected. This non-conventional structured or unstructured data source may bring new perspectives to better…

Descriptors: Artificial Intelligence, Natural Language Processing, Psychometrics, Computer Assisted Testing

Automated Scoring of Speaking and Writing: Starting to Hit Its Stride

Peer reviewed
PDF on ERIC

Download full text

Jones, Daniel Marc; Cheng, Liying; Tweedie, M. Gregory – Canadian Journal of Learning and Technology, 2022

This article reviews recent literature (2011-present) on the automated scoring (AS) of writing and speaking. Its purpose is to first survey the current research on automated scoring of language, then highlight how automated scoring impacts the present and future of assessment, teaching, and learning. The article begins by outlining the general…

Descriptors: Automation, Computer Assisted Testing, Scoring, Writing (Composition)

Validating Human and Automated Scoring of Essays against "True" Scores

Peer reviewed

Direct link

Cohen, Yoav; Levi, Effi; Ben-Simon, Anat – Applied Measurement in Education, 2018

In the current study, two pools of 250 essays, all written as a response to the same prompt, were rated by two groups of raters (14 or 15 raters per group), thereby providing an approximation to the essay's true score. An automated essay scoring (AES) system was trained on the datasets and then scored the essays using a cross-validation scheme. By…

Descriptors: Test Validity, Automation, Scoring, Computer Assisted Testing

Prediction of Writing True Scores in Automated Scoring of Essays by Best Linear Predictors and Penalized Best Linear Predictors. Research Report. ETS RR-19-13

Peer reviewed
PDF on ERIC

Download full text

Yao, Lili; Haberman, Shelby J.; Zhang, Mo – ETS Research Report Series, 2019

Many assessments of writing proficiency that aid in making high-stakes decisions consist of several essay tasks evaluated by a combination of human holistic scores and computer-generated scores for essay features such as the rate of grammatical errors per word. Under typical conditions, a summary writing score is provided by a linear combination…

Descriptors: Prediction, True Scores, Computer Assisted Testing, Scoring

Righting Technologies: How Large-Scale Assessment Can Foster a More Equitable Education System

Peer reviewed
PDF on ERIC

Download full text

Behizadeh, Nadia; Lynch, Tom Liam – Berkeley Review of Education, 2017

For the last century, the quality of large-scale assessment in the United States has been undermined by narrow educational theory and hindered by limitations in technology. As a result, poor assessment practices have encouraged low-level instructional practices that disparately affect students from the most disadvantaged communities and schools.…

Descriptors: Equal Education, Measurement, Educational Theories, Evaluation Methods

Computer-Based English Language Testing in China: Present and Future

Peer reviewed

Direct link

Yu, Guoxing; Zhang, Jing – Language Assessment Quarterly, 2017

In this special issue on high-stakes English language testing in China, the two articles on computer-based testing (Jin & Yan; He & Min) highlight a number of consistent, ongoing challenges and concerns in the development and implementation of the nationwide IB-CET (Internet Based College English Test) and institutional computer-adaptive…

Descriptors: Foreign Countries, Computer Assisted Testing, English (Second Language), Language Tests

Using Digital Technologies to Improve the Authenticity of Performance Assessment for High-Stakes Purposes

Peer reviewed

Direct link

Newhouse, C. Paul – Technology, Pedagogy and Education, 2015

This paper reports on the outcomes of a three-year study investigating the use of digital technologies to increase the authenticity of high-stakes summative assessment in four Western Australian senior secondary courses. The study involved 82 teachers and 1015 students and a range of digital forms of assessment using computer-based exams, digital…

Descriptors: Educational Technology, High Stakes Tests, Summative Evaluation, Secondary School Students

Digitizing Practical Production Work for High-Stakes Assessments

Peer reviewed
PDF on ERIC

Download full text

Newhouse, C. Paul; Tarricone, Pina – Canadian Journal of Learning and Technology, 2014

High-stakes external assessment for practical courses is fraught with problems impacting on the manageability, validity and reliability of scoring. Alternative approaches to assessment using digital technologies have the potential to address these problems. This paper describes a study that investigated the use of these technologies to create and…

Descriptors: High Stakes Tests, Student Evaluation, Evaluation Methods, Scoring

The Contribution of Constructed Response Items to Large Scale Assessment: Measuring and Understanding Their Impact

Peer reviewed

Direct link

Lissitz, Robert W.; Hou, Xiaodong; Slater, Sharon Cadman – Journal of Applied Testing Technology, 2012

This article investigates several questions regarding the impact of different item formats on measurement characteristics. Constructed response (CR) items and multiple choice (MC) items obviously differ in their formats and in the resources needed to score them. As such, they have been the subject of considerable discussion regarding the impact of…

Descriptors: Computer Assisted Testing, Scoring, Evaluation Problems, Psychometrics

Practice in Computer-Based Testing and Performance on the National Certification Examination for Nurse Anesthetists

Direct link

Dosch, Michael P. – ProQuest LLC, 2010

The general aim of the present retrospective study was to examine the test mode effect, that is, the difference in performance when tests are taken on computer (CBT), or by paper and pencil (PnP). The specific purpose was to examine the degree to which extensive practice in CBT in graduate students in nurse anesthesia would raise scores on a…

Descriptors: Feedback (Response), Graduate Students, Grade Point Average, Nurses

Using IT to Assess IT: Towards Greater Authenticity in Summative Performance Assessment

Peer reviewed

Direct link

Newhouse, C. Paul – Computers & Education, 2011

An applied Information Technology (IT) course that is assessed using pen and paper may sound incongruous but it is symptomatic of the state of high-stakes assessment in jurisdictions such as Western Australia. Whereas technology has permeated most aspects of modern life, including schooling, and more has been demanded of education systems in terms…

Descriptors: Foreign Countries, Portfolios (Background Materials), Student Evaluation, Performance Based Assessment

Previous Page | Next Page »

Pages: 1 | 2

Newhouse, C. Paul	3
Behizadeh, Nadia	1
Ben-Simon, Anat	1
Cheng, Liying	1
Clesham, Rose	1
Cohen, Yoav	1
Davey, Tim	1
Dikli, Semire	1
Dorsey, David W.	1
Dosch, Michael P.	1
Georgios Zacharis	1
Gobert, Janice D.	1
Haberman, Shelby J.	1
Herbert, Erin	1
Hong Jiao, Editor	1
Hou, Xiaodong	1
Jinnie Shin	1
Jones, Daniel Marc	1
Koedinger, Kenneth R.	1
Levi, Effi	1
Lissitz, Robert W.	1
Lynch, Tom Liam	1
Michaels, Hillary R.	1
Mills, Craig N.	1
O'Neil, Harold F., Jr.	1
More ▼