Publication Date
| In 2026 | 0 |
| Since 2025 | 2 |
| Since 2022 (last 5 years) | 5 |
| Since 2017 (last 10 years) | 10 |
| Since 2007 (last 20 years) | 16 |
Descriptor
| Computer Assisted Testing | 20 |
| High Stakes Tests | 20 |
| Scoring | 20 |
| Automation | 8 |
| Educational Technology | 8 |
| Foreign Countries | 8 |
| Evaluation Methods | 7 |
| Student Evaluation | 6 |
| Artificial Intelligence | 5 |
| Essays | 5 |
| Adaptive Testing | 4 |
| More ▼ | |
Source
Author
| Newhouse, C. Paul | 3 |
| Behizadeh, Nadia | 1 |
| Ben-Simon, Anat | 1 |
| Cheng, Liying | 1 |
| Clesham, Rose | 1 |
| Cohen, Yoav | 1 |
| Davey, Tim | 1 |
| Dikli, Semire | 1 |
| Dorsey, David W. | 1 |
| Dosch, Michael P. | 1 |
| Georgios Zacharis | 1 |
| More ▼ | |
Publication Type
| Journal Articles | 14 |
| Reports - Research | 10 |
| Reports - Evaluative | 5 |
| Reports - Descriptive | 2 |
| Books | 1 |
| Collected Works - General | 1 |
| Dissertations/Theses -… | 1 |
| Information Analyses | 1 |
| Numerical/Quantitative Data | 1 |
Education Level
| Elementary Secondary Education | 5 |
| Higher Education | 4 |
| Postsecondary Education | 3 |
| Secondary Education | 3 |
| Grade 8 | 1 |
| High Schools | 1 |
| Middle Schools | 1 |
Audience
Laws, Policies, & Programs
Assessments and Surveys
| Test of English as a Foreign… | 2 |
| Graduate Record Examinations | 1 |
| Praxis Series | 1 |
What Works Clearinghouse Rating
Wallace N. Pinto Jr.; Jinnie Shin – Journal of Educational Measurement, 2025
In recent years, the application of explainability techniques to automated essay scoring and automated short-answer grading (ASAG) models, particularly those based on transformer architectures, has gained significant attention. However, the reliability and consistency of these techniques remain underexplored. This study systematically investigates…
Descriptors: Automation, Grading, Computer Assisted Testing, Scoring
Dorsey, David W.; Michaels, Hillary R. – Journal of Educational Measurement, 2022
We have dramatically advanced our ability to create rich, complex, and effective assessments across a range of uses through technology advancement. Artificial Intelligence (AI) enabled assessments represent one such area of advancement--one that has captured our collective interest and imagination. Scientists and practitioners within the domains…
Descriptors: Validity, Ethics, Artificial Intelligence, Evaluation Methods
Georgios Zacharis; Stamatios Papadakis – Educational Process: International Journal, 2025
Background/purpose: Generative artificial intelligence (GenAI) is often promoted as a transformative tool for assessment, yet evidence of its validity compared to human raters remains limited. This study examined whether an AI-based rater could be used interchangeably with trained faculty in scoring complex coursework. Materials/methods:…
Descriptors: Artificial Intelligence, Technology Uses in Education, Computer Assisted Testing, Grading
Richardson, Mary; Clesham, Rose – London Review of Education, 2021
Our world has been transformed by technologies incorporating artificial intelligence (AI) within mass communication, employment, entertainment and many other aspects of our daily lives. However, within the domain of education, it seems that our ways of working and, particularly, assessing have hardly changed at all. We continue to prize…
Descriptors: Artificial Intelligence, High Stakes Tests, Computer Assisted Testing, Educational Change
Hong Jiao, Editor; Robert W. Lissitz, Editor – IAP - Information Age Publishing, Inc., 2024
With the exponential increase of digital assessment, different types of data in addition to item responses become available in the measurement process. One of the salient features in digital assessment is that process data can be easily collected. This non-conventional structured or unstructured data source may bring new perspectives to better…
Descriptors: Artificial Intelligence, Natural Language Processing, Psychometrics, Computer Assisted Testing
Jones, Daniel Marc; Cheng, Liying; Tweedie, M. Gregory – Canadian Journal of Learning and Technology, 2022
This article reviews recent literature (2011-present) on the automated scoring (AS) of writing and speaking. Its purpose is to first survey the current research on automated scoring of language, then highlight how automated scoring impacts the present and future of assessment, teaching, and learning. The article begins by outlining the general…
Descriptors: Automation, Computer Assisted Testing, Scoring, Writing (Composition)
Cohen, Yoav; Levi, Effi; Ben-Simon, Anat – Applied Measurement in Education, 2018
In the current study, two pools of 250 essays, all written as a response to the same prompt, were rated by two groups of raters (14 or 15 raters per group), thereby providing an approximation to the essay's true score. An automated essay scoring (AES) system was trained on the datasets and then scored the essays using a cross-validation scheme. By…
Descriptors: Test Validity, Automation, Scoring, Computer Assisted Testing
Yao, Lili; Haberman, Shelby J.; Zhang, Mo – ETS Research Report Series, 2019
Many assessments of writing proficiency that aid in making high-stakes decisions consist of several essay tasks evaluated by a combination of human holistic scores and computer-generated scores for essay features such as the rate of grammatical errors per word. Under typical conditions, a summary writing score is provided by a linear combination…
Descriptors: Prediction, True Scores, Computer Assisted Testing, Scoring
Behizadeh, Nadia; Lynch, Tom Liam – Berkeley Review of Education, 2017
For the last century, the quality of large-scale assessment in the United States has been undermined by narrow educational theory and hindered by limitations in technology. As a result, poor assessment practices have encouraged low-level instructional practices that disparately affect students from the most disadvantaged communities and schools.…
Descriptors: Equal Education, Measurement, Educational Theories, Evaluation Methods
Yu, Guoxing; Zhang, Jing – Language Assessment Quarterly, 2017
In this special issue on high-stakes English language testing in China, the two articles on computer-based testing (Jin & Yan; He & Min) highlight a number of consistent, ongoing challenges and concerns in the development and implementation of the nationwide IB-CET (Internet Based College English Test) and institutional computer-adaptive…
Descriptors: Foreign Countries, Computer Assisted Testing, English (Second Language), Language Tests
Newhouse, C. Paul – Technology, Pedagogy and Education, 2015
This paper reports on the outcomes of a three-year study investigating the use of digital technologies to increase the authenticity of high-stakes summative assessment in four Western Australian senior secondary courses. The study involved 82 teachers and 1015 students and a range of digital forms of assessment using computer-based exams, digital…
Descriptors: Educational Technology, High Stakes Tests, Summative Evaluation, Secondary School Students
Newhouse, C. Paul; Tarricone, Pina – Canadian Journal of Learning and Technology, 2014
High-stakes external assessment for practical courses is fraught with problems impacting on the manageability, validity and reliability of scoring. Alternative approaches to assessment using digital technologies have the potential to address these problems. This paper describes a study that investigated the use of these technologies to create and…
Descriptors: High Stakes Tests, Student Evaluation, Evaluation Methods, Scoring
Lissitz, Robert W.; Hou, Xiaodong; Slater, Sharon Cadman – Journal of Applied Testing Technology, 2012
This article investigates several questions regarding the impact of different item formats on measurement characteristics. Constructed response (CR) items and multiple choice (MC) items obviously differ in their formats and in the resources needed to score them. As such, they have been the subject of considerable discussion regarding the impact of…
Descriptors: Computer Assisted Testing, Scoring, Evaluation Problems, Psychometrics
Dosch, Michael P. – ProQuest LLC, 2010
The general aim of the present retrospective study was to examine the test mode effect, that is, the difference in performance when tests are taken on computer (CBT), or by paper and pencil (PnP). The specific purpose was to examine the degree to which extensive practice in CBT in graduate students in nurse anesthesia would raise scores on a…
Descriptors: Feedback (Response), Graduate Students, Grade Point Average, Nurses
Newhouse, C. Paul – Computers & Education, 2011
An applied Information Technology (IT) course that is assessed using pen and paper may sound incongruous but it is symptomatic of the state of high-stakes assessment in jurisdictions such as Western Australia. Whereas technology has permeated most aspects of modern life, including schooling, and more has been demanded of education systems in terms…
Descriptors: Foreign Countries, Portfolios (Background Materials), Student Evaluation, Performance Based Assessment
Previous Page | Next Page ยป
Pages: 1 | 2
Peer reviewed
Direct link
