ERIC - Search Results

Publication Date

In 2025	8
Since 2024	28
Since 2021 (last 5 years)	121
Since 2016 (last 10 years)	229
Since 2006 (last 20 years)	356

Descriptor

Computer Assisted Testing	509
Scoring	509
Test Items	110
Test Construction	102
Automation	91
Essays	82
Foreign Countries	80
Scores	79
Adaptive Testing	78
Evaluation Methods	77
Computer Software	75
Writing Evaluation	75
Comparative Analysis	72
Language Tests	70
Test Validity	67
Second Language Learning	66
Student Evaluation	66
Correlation	65
English (Second Language)	62
Test Format	59
Test Reliability	54
Models	52
Item Response Theory	51
Educational Technology	48
Accuracy	45
More ▼

Education Level

Higher Education	84
Postsecondary Education	66
Secondary Education	50
Elementary Education	45
Elementary Secondary Education	33
Middle Schools	31
Junior High Schools	25
High Schools	15
Grade 8	12
Intermediate Grades	11
Grade 4	10
Early Childhood Education	9
Grade 5	9
Grade 6	8
Grade 7	8
Grade 3	5
Primary Education	5
Adult Education	3
Grade 9	3
Preschool Education	3
Grade 2	2
Kindergarten	2
Grade 10	1
Grade 11	1
Grade 12	1
More ▼

Audience

Administrators	8
Practitioners	8
Researchers	7
Teachers	4
Students	2
Counselors	1
Policymakers	1

Location

Australia	10
China	10
New York	9
Japan	7
Netherlands	6
Canada	5
Germany	5
Iran	4
Taiwan	4
United Kingdom	4
United Kingdom (England)	4
United States	4
Europe	3
Indonesia	3
Singapore	3
South Korea	3
Spain	3
California	2
Connecticut	2
Czech Republic	2
Denmark	2
France	2
Hong Kong	2
Israel	2
Malaysia	2
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001	3
Every Student Succeeds Act…	2
Elementary and Secondary…	1
Elementary and Secondary…	1
Family Educational Rights and…	1
Health Insurance Portability…	1
Individuals with Disabilities…	1

What Works Clearinghouse Rating

Scoring X

Showing 1 to 15 of 509 results Save | Export

Peer reviewed

Direct link

Ramnarain-Seetohul, Vidasha; Bassoo, Vandana; Rosunally, Yasmine – Education and Information Technologies, 2022

In automated essay scoring (AES) systems, similarity techniques are used to compute the score for student answers. Several methods to compute similarity have emerged over the years. However, only a few of them have been widely used in the AES domain. This work shows the findings of a ten-year review on similarity techniques applied in AES systems…

Descriptors: Computer Assisted Testing, Essays, Scoring, Automation

Automated Scoring of Figural Tests of Creativity with Computer Vision

Peer reviewed

Direct link

Selcuk Acar; Peter Organisciak; Denis Dumas – Journal of Creative Behavior, 2025

In this three-study investigation, we applied various approaches to score drawings created in response to both Form A and Form B of the Torrance Tests of Creative Thinking-Figural (broadly TTCT-F) as well as the Multi-Trial Creative Ideation task (MTCI). We focused on TTCT-F in Study 1, and utilizing a random forest classifier, we achieved 79% and…

Descriptors: Scoring, Computer Assisted Testing, Models, Correlation

The Vulnerability of AI-Based Scoring Systems to Gaming Strategies: A Case Study

Peer reviewed

Direct link

Peter Baldwin; Victoria Yaneva; Kai North; Le An Ha; Yiyun Zhou; Alex J. Mechaber; Brian E. Clauser – Journal of Educational Measurement, 2025

Recent developments in the use of large-language models have led to substantial improvements in the accuracy of content-based automated scoring of free-text responses. The reported accuracy levels suggest that automated systems could have widespread applicability in assessment. However, before they are used in operational testing, other aspects of…

Descriptors: Artificial Intelligence, Scoring, Computational Linguistics, Accuracy

Grading Exams Using Large Language Models: A Comparison between Human and AI Grading of Exams in Higher Education Using ChatGPT

Peer reviewed

Direct link

Jonas Flodén – British Educational Research Journal, 2025

This study compares how the generative AI (GenAI) large language model (LLM) ChatGPT performs in grading university exams compared to human teachers. Aspects investigated include consistency, large discrepancies and length of answer. Implications for higher education, including the role of teachers and ethics, are also discussed. Three…

Descriptors: College Faculty, Artificial Intelligence, Comparative Testing, Scoring

Integration of Prediction Scores from Various Automated Essay Scoring Models Using Item Response Theory

Peer reviewed

Direct link

Uto, Masaki; Aomi, Itsuki; Tsutsumi, Emiko; Ueno, Maomi – IEEE Transactions on Learning Technologies, 2023

In automated essay scoring (AES), essays are automatically graded without human raters. Many AES models based on various manually designed features or various architectures of deep neural networks (DNNs) have been proposed over the past few decades. Each AES model has unique advantages and characteristics. Therefore, rather than using a single-AES…

Descriptors: Prediction, Scores, Computer Assisted Testing, Scoring

The Language of Creativity: Evidence from Humans and Large Language Models

Peer reviewed

Direct link

William Orwig; Emma R. Edenbaum; Joshua D. Greene; Daniel L. Schacter – Journal of Creative Behavior, 2024

Recent developments in computerized scoring via semantic distance have provided automated assessments of verbal creativity. Here, we extend past work, applying computational linguistic approaches to characterize salient features of creative text. We hypothesize that, in addition to semantic diversity, the degree to which a story includes…

Descriptors: Computer Assisted Testing, Scoring, Creativity, Computational Linguistics

The Impact of Scoring Later on Mixed Format Adaptive Testing

Direct link

Jing Ma – ProQuest LLC, 2024

This study investigated the impact of scoring polytomous items later on measurement precision, classification accuracy, and test security in mixed-format adaptive testing. Utilizing the shadow test approach, a simulation study was conducted across various test designs, lengths, number and location of polytomous item. Results showed that while…

Descriptors: Scoring, Adaptive Testing, Test Items, Classification

Automated Pipeline for Multi-Lingual Automated Essay Scoring with ReaderBench

Peer reviewed

Direct link

Stefan Ruseti; Ionut Paraschiv; Mihai Dascalu; Danielle S. McNamara – Grantee Submission, 2024

Automated Essay Scoring (AES) is a well-studied problem in Natural Language Processing applied in education. Solutions vary from handcrafted linguistic features to large Transformer-based models, implying a significant effort in feature extraction and model implementation. We introduce a novel Automated Machine Learning (AutoML) pipeline…

Descriptors: Computer Assisted Testing, Scoring, Automation, Essays

Automated Pipeline for Multi-Lingual Automated Essay Scoring with ReaderBench

Peer reviewed

Direct link

Stefan Ruseti; Ionut Paraschiv; Mihai Dascalu; Danielle S. McNamara – International Journal of Artificial Intelligence in Education, 2024

Descriptors: Computer Assisted Testing, Scoring, Automation, Essays

The Machines Take Over: A Comparison of Various Supervised Learning Approaches for Automated Scoring of Divergent Thinking Tasks

Peer reviewed

Direct link

Buczak, Philip; Huang, He; Forthmann, Boris; Doebler, Philipp – Journal of Creative Behavior, 2023

Traditionally, researchers employ human raters for scoring responses to creative thinking tasks. Apart from the associated costs this approach entails two potential risks. First, human raters can be subjective in their scoring behavior (inter-rater-variance). Second, individual raters are prone to inconsistent scoring patterns…

Descriptors: Computer Assisted Testing, Scoring, Automation, Creative Thinking

Online Calibration in Multidimensional Computerized Adaptive Testing with Polytomously Scored Items

Peer reviewed

Direct link

Yuan, Lu; Huang, Yingshi; Li, Shuhang; Chen, Ping – Journal of Educational Measurement, 2023

Online calibration is a key technology for item calibration in computerized adaptive testing (CAT) and has been widely used in various forms of CAT, including unidimensional CAT, multidimensional CAT (MCAT), CAT with polytomously scored items, and cognitive diagnostic CAT. However, as multidimensional and polytomous assessment data become more…

Descriptors: Computer Assisted Testing, Adaptive Testing, Computation, Test Items

Envisioning the Future of Creative Thinking Assessment

Peer reviewed

Direct link

Mathias Benedek; Roger E. Beaty – Journal of Creative Behavior, 2025

The PISA assessment 2022 of creative thinking was a moonshot effort that introduced significant advancements over existing creativity tests, including a broad range of domains (written, visual, social, and scientific), implementation in many languages, and sophisticated scoring methods. PISA 2022 demonstrated the general feasibility of assessing…

Descriptors: Creative Thinking, Creativity, Creativity Tests, Scoring

Evaluating the Consistency and Reliability of Attribution Methods in Automated Short Answer Grading (ASAG) Systems: Toward an Explainable Scoring System

Peer reviewed

Direct link

Wallace N. Pinto Jr.; Jinnie Shin – Journal of Educational Measurement, 2025

In recent years, the application of explainability techniques to automated essay scoring and automated short-answer grading (ASAG) models, particularly those based on transformer architectures, has gained significant attention. However, the reliability and consistency of these techniques remain underexplored. This study systematically investigates…

Descriptors: Automation, Grading, Computer Assisted Testing, Scoring

Evaluating Coherence in Writing: Comparing the Capacity of Automated Essay Scoring Technologies

Peer reviewed

Direct link

Shin, Jinnie; Gierl, Mark J. – Journal of Applied Testing Technology, 2022

Automated Essay Scoring (AES) technologies provide innovative solutions to score the written essays with a much shorter time span and at a fraction of the current cost. Traditionally, AES emphasized the importance of capturing the "coherence" of writing because abundant evidence indicated the connection between coherence and the overall…

Descriptors: Computer Assisted Testing, Scoring, Essays, Automation

A Comparison of Final Scoring Methods under the Multistage Adaptive Testing Framework

Direct link

Hacer Karamese – ProQuest LLC, 2022

Multistage adaptive testing (MST) has become popular in the testing industry because the research has shown that it combines the advantages of both linear tests and item-level computer adaptive testing (CAT). The previous research efforts primarily focused on MST design issues such as panel design, module length, test length, distribution of test…

Descriptors: Adaptive Testing, Scoring, Computer Assisted Testing, Design

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 34

ETS Research Report Series	31
Grantee Submission	25
ProQuest LLC	16
Journal of Educational…	14
Language Testing	11
Educational Measurement:…	10
Assessing Writing	9
International Educational…	9
Journal of Technology,…	9
Applied Measurement in…	8
Educational and Psychological…	8
Journal of Applied Testing…	8
Language Assessment Quarterly	8
International Journal of…	7
New York State Education…	7
Applied Psychological…	6
Computers & Education	6
Educational Technology &…	6
Education and Information…	5
Educational Testing Service	5
International Journal of…	5
Journal of Speech, Language,…	5
Educational Assessment	4
IEEE Transactions on Learning…	4
Journal of Creative Behavior	4
More ▼

Bennett, Randy Elliot	11
Attali, Yigal	9
Anderson, Paul S.	7
Williamson, David M.	6
Bejar, Isaac I.	5
Ramineni, Chaitanya	5
Stocking, Martha L.	5
Xi, Xiaoming	5
Zechner, Klaus	5
Bridgeman, Brent	4
Davey, Tim	4
Evanini, Keelan	4
Higgins, Derrick	4
Lee, Hee-Sun	4
Liu, Ou Lydia	4
McNamara, Danielle S.	4
Mulholland, Matthew	4
O'Neil, Harold F., Jr.	4
Pallant, Amy	4
Rupp, André A.	4
Weiss, David J.	4
Wilson, Joshua	4
Alonzo, Julie	3
Breyer, F. Jay	3
More ▼

Journal Articles	328
Reports - Research	271
Reports - Evaluative	95
Reports - Descriptive	75
Speeches/Meeting Papers	57
Tests/Questionnaires	22
Dissertations/Theses -…	16
Information Analyses	16
Numerical/Quantitative Data	15
Books	12
Guides - Non-Classroom	11
Collected Works - General	10
Opinion Papers	7
Collected Works - Proceedings	5
Book/Product Reviews	4
Guides - General	4
Guides - Classroom - Teacher	2
Non-Print Media	2
Reports - General	2
Reference Materials -…	1
More ▼

Test of English as a Foreign…	29
Graduate Record Examinations	16
National Assessment of…	9
Wechsler Intelligence Scale…	4
ACTFL Oral Proficiency…	2
Advanced Placement…	2
Armed Services Vocational…	2
Dynamic Indicators of Basic…	2
International English…	2
New York State Regents…	2
Praxis Series	2
Program for International…	2
Progress in International…	2
SAT (College Admission Test)	2
Torrance Tests of Creative…	2
Trends in International…	2
Wechsler Individual…	2
ACT Assessment	1
Behavior Assessment System…	1
Center for Epidemiologic…	1
Computer Attitude Scale	1
Conners Rating Scales	1
Expressive One Word Picture…	1
Flesch Kincaid Grade Level…	1
Graduate Management Admission…	1
More ▼