ERIC - Search Results

Publication Date

In 2026	0
Since 2025	4
Since 2022 (last 5 years)	23
Since 2017 (last 10 years)	56
Since 2007 (last 20 years)	112

Descriptor

Computer Assisted Testing	130
Classification	122
Foreign Countries	33
Accuracy	32
Test Items	30
Adaptive Testing	26
Comparative Analysis	21
Second Language Learning	21
Models	20
Item Response Theory	19
Evaluation Methods	16
Computer Software	14
English (Second Language)	14
Language Tests	14
Simulation	14
Task Analysis	14
College Students	13
Prediction	13
Correlation	12
Scoring	12
Probability	11
Item Analysis	10
Language Proficiency	10
Scores	10
Statistical Analysis	10
More ▼

Publication Type

Journal Articles	130
Reports - Research	85
Reports - Evaluative	25
Reports - Descriptive	15
Information Analyses	4
Tests/Questionnaires	4
Opinion Papers	1
Reports - General	1

Audience

Researchers

Location

Canada	4
China	3
Taiwan	3
Texas	3
United Kingdom	3
Florida	2
Germany	2
Greece	2
Australia	1
California (Los Angeles)	1
Europe	1
Georgia	1
Georgia (Atlanta)	1
Indonesia	1
Iran	1
Ireland (Dublin)	1
Israel	1
Japan	1
Malaysia	1
Massachusetts	1
Netherlands	1
New York	1
North Carolina	1
North Carolina (Charlotte)	1
North Dakota	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

Test of English as a Foreign…	6
California Achievement Tests	1
Child Behavior Checklist	1
Dynamic Indicators of Basic…	1
Florida Comprehensive…	1
Gates MacGinitie Reading Tests	1
International English…	1
Myers Briggs Type Indicator	1
Raven Progressive Matrices	1
Woodcock Johnson Tests of…	1
Woodcock Johnson Tests of…	1
Woodcock Reading Mastery Test	1
More ▼

What Works Clearinghouse Rating

Showing 1 to 15 of 130 results Save | Export

The Vulnerability of AI-Based Scoring Systems to Gaming Strategies: A Case Study

Peer reviewed

Direct link

Peter Baldwin; Victoria Yaneva; Kai North; Le An Ha; Yiyun Zhou; Alex J. Mechaber; Brian E. Clauser – Journal of Educational Measurement, 2025

Recent developments in the use of large-language models have led to substantial improvements in the accuracy of content-based automated scoring of free-text responses. The reported accuracy levels suggest that automated systems could have widespread applicability in assessment. However, before they are used in operational testing, other aspects of…

Descriptors: Artificial Intelligence, Scoring, Computational Linguistics, Accuracy

Embeddings for Automatic Short Answer Grading: A Scoping Review

Peer reviewed

Direct link

Putnikovic, Marko; Jovanovic, Jelena – IEEE Transactions on Learning Technologies, 2023

Automatic grading of short answers is an important task in computer-assisted assessment (CAA). Recently, embeddings, as semantic-rich textual representations, have been increasingly used to represent short answers and predict the grade. Despite the recent trend of applying embeddings in automatic short answer grading (ASAG), there are no…

Descriptors: Automation, Computer Assisted Testing, Grading, Natural Language Processing

Teaching Algorithm Design: A Literature Review

Peer reviewed

Direct link

Jonathan Liu; Seth Poulsen; Erica Goodwin; Hongxuan Chen; Grace Williams; Yael Gertner; Diana Franklin – ACM Transactions on Computing Education, 2025

Algorithm design is a vital skill developed in most undergraduate Computer Science (CS) programs, but few research studies focus on pedagogy related to algorithms coursework. To understand the work that has been done in the area, we present a systematic survey and literature review of CS Education studies. We search for research that is both…

Descriptors: Teaching Methods, Algorithms, Design, Computer Science Education

Detecting Compromised Items with Response Times Using a Bayesian Change-Point Approach

Peer reviewed

Direct link

Yang Du; Susu Zhang – Journal of Educational and Behavioral Statistics, 2025

Item compromise has long posed challenges in educational measurement, jeopardizing both test validity and test security of continuous tests. Detecting compromised items is therefore crucial to address this concern. The present literature on compromised item detection reveals two notable gaps: First, the majority of existing methods are based upon…

Descriptors: Item Response Theory, Item Analysis, Bayesian Statistics, Educational Assessment

The Effect of Item Pool and Selection Algorithms on Computerized Classification Testing (CCT) Performance

Peer reviewed
PDF on ERIC

Download full text

Demir, Seda – Journal of Educational Technology and Online Learning, 2022

The purpose of this research was to evaluate the effect of item pool and selection algorithms on computerized classification testing (CCT) performance in terms of some classification evaluation metrics. For this purpose, 1000 examinees' response patterns using the R package were generated and eight item pools with 150, 300, 450, and 600 items…

Descriptors: Test Items, Item Banks, Mathematics, Computer Assisted Testing

Automated Short Answer Scoring Using an Ensemble of Neural Networks and Latent Semantic Analysis Classifiers

Peer reviewed

Direct link

Ormerod, Christopher; Lottridge, Susan; Harris, Amy E.; Patel, Milan; van Wamelen, Paul; Kodeswaran, Balaji; Woolf, Sharon; Young, Mackenzie – International Journal of Artificial Intelligence in Education, 2023

We introduce a short answer scoring engine made up of an ensemble of deep neural networks and a Latent Semantic Analysis-based model to score short constructed responses for a large suite of questions from a national assessment program. We evaluate the performance of the engine and show that the engine achieves above-human-level performance on a…

Descriptors: Computer Assisted Testing, Scoring, Artificial Intelligence, Semantics

Impact of Categorization and Scaling on Classification Agreement and Prediction Accuracy Statistics. Research Report. ETS RR-21-26

Peer reviewed
PDF on ERIC

Download full text

Wang, Wei; Dorans, Neil J. – ETS Research Report Series, 2021

Agreement statistics and measures of prediction accuracy are often used to assess the quality of two measures of a construct. Agreement statistics are appropriate for measures that are supposed to be interchangeable, whereas prediction accuracy statistics are appropriate for situations where one variable is the target and the other variables are…

Descriptors: Classification, Scaling, Prediction, Accuracy

Identifying Enemy Item Pairs Using Natural Language Processing

Peer reviewed

Direct link

Becker, Kirk A.; Kao, Shu-chuan – Journal of Applied Testing Technology, 2022

Natural Language Processing (NLP) offers methods for understanding and quantifying the similarity between written documents. Within the testing industry these methods have been used for automatic item generation, automated scoring of text and speech, modeling item characteristics, automatic question answering, machine translation, and automated…

Descriptors: Item Banks, Natural Language Processing, Computer Assisted Testing, Scoring

Empowering Higher Education Students to Monitor Their Learning Progress: Opportunities of Computerised Classification Testing

Peer reviewed

Direct link

Ifenthaler, Dirk; Sahin, Muhittin – Interactive Technology and Smart Education, 2023

Purpose: This study aims to focus on providing a computerized classification testing (CCT) system that can easily be embedded as a self-assessment feature into the existing legacy environment of a higher education institution, empowering students with self-assessments to monitor their learning progress and following strict data protection…

Descriptors: College Students, Classification, Self Evaluation (Individuals), Progress Monitoring

Tagging Reading Comprehension Materials with Document Extraction Attention Networks

Peer reviewed

Direct link

Sun, Bo; Zhu, Yunzong; Yao, Zeng; Xiao, Rong; Xiao, Yongkang; Wei, Yungang – IEEE Transactions on Learning Technologies, 2020

Reading comprehension tasks are commonly used for developing students' reading ability. In order to adaptively recommend reading comprehension materials to students engaged in computerized testing, the information in an item bank (a collection of test items stored in a dataset) must be effectively indexed. Familiarity with the topics present in…

Descriptors: Automation, Indexing, Item Banks, Classification

Computer-Supported Assessment of Geometric Exploration Using Variation Theory

Peer reviewed

Direct link

Luz, Yael; Yerushalmy, Michal – Journal for Research in Mathematics Education, 2023

We report on an innovative design of algorithmic analysis that supports automatic online assessment of students' exploration of geometry propositions in a dynamic geometry environment. We hypothesized that difficulties with and misuse of terms or logic in conjectures are rooted in the early exploration stages of inquiry. We developed a generic…

Descriptors: Algorithms, Computer Assisted Testing, Geometry, Mathematics Instruction

Estimating Probabilities of Passing for Examinees with Incomplete Data in Mastery Tests

Peer reviewed

Direct link

Sinharay, Sandip – Educational and Psychological Measurement, 2022

Administrative problems such as computer malfunction and power outage occasionally lead to missing item scores and hence to incomplete data on mastery tests such as the AP and U.S. Medical Licensing examinations. Investigators are often interested in estimating the probabilities of passing of the examinees with incomplete data on mastery tests.…

Descriptors: Mastery Tests, Computer Assisted Testing, Probability, Test Wiseness

Some Sentences Prime Pragmatic Reasoning in the Verification and Evaluation of Comparisons

Peer reviewed

Direct link

Shukla, Vishakha; Long, Madeleine; Bhatia, Vrinda; Rubio-Fernandez, Paula – Journal of Experimental Psychology: Learning, Memory, and Cognition, 2022

While most research on scalar implicature has focused on the lexical scale "some" vs "all," here we investigated an understudied scale formed by two syntactic constructions: categorizations (e.g., "Wilma is a nurse") and comparisons ("Wilma is like a nurse"). An experimental study by Rubio-Fernandez et al.…

Descriptors: Cues, Pragmatics, Comparative Analysis, Syntax

Classification Accuracy of i-Ready and Prior Year State Exams on Year-End Outcomes

Peer reviewed

Direct link

Kayla V. Campaña; Benjamin G. Solomon – Assessment for Effective Intervention, 2025

The purpose of this study was to compare the classification accuracy of data produced by the previous year's end-of-year New York state assessment, a computer-adaptive diagnostic assessment ("i-Ready"), and the gating combination of both assessments to predict the rate of students passing the following year's end-of-year state assessment…

Descriptors: Accuracy, Classification, Diagnostic Tests, Adaptive Testing

Effects of Computer-Based Feedback on Lower- and Higher-Order Learning Outcomes: A Network Meta-Analysis

Peer reviewed

Direct link

Mertens, Ute; Finn, Bridgid; Lindner, Marlit Annalena – Journal of Educational Psychology, 2022

Feedback is one of the most important factors for successful learning. Contemporary computer-based learning and testing environments allow the implementation of automated feedback in a simple and efficient manner. Previous meta-analyses suggest that different types of feedback are not equally effective. This heterogeneity might depend on learner…

Descriptors: Computer Assisted Testing, Feedback (Response), Electronic Learning, Network Analysis

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9

Educational and Psychological…	14
Applied Psychological…	7
ETS Research Report Series	7
Journal of Experimental…	6
IEEE Transactions on Learning…	4
Journal of Applied Testing…	3
Journal of Educational…	3
Journal of Educational and…	3
Measurement:…	3
Second Language Research	3
Annals of Dyslexia	2
Assessment for Effective…	2
Cognition	2
Computer Assisted Language…	2
Educational Technology &…	2
International Journal of…	2
International Journal of…	2
Journal of Educational Data…	2
Language Testing	2
Language and Cognitive…	2
Online Submission	2
Practical Assessment,…	2
ZDM: The International…	2
ACM Transactions on Computing…	1
ALT-J: Research in Learning…	1
More ▼

Wang, Wen-Chung	4
Thompson, Nathan A.	3
Chung, Hyewon	2
Deane, Paul	2
Dodd, Barbara G.	2
Huebner, Alan	2
Kim, Jiseon	2
Liu, Chen-Wei	2
Park, Ryoungsun	2
Spray, Judith A.	2
Xi, Xiaoming	2
Zechner, Klaus	2
Abe, Mariko	1
Akar, Gozde B.	1
Aksu Dunya, Beyza	1
Alex J. Mechaber	1
Alexandron, Giora	1
Alexoudi, Kariofyllia	1
Almond, Russell	1
Antoniou, Faye	1
Aryadoust, Vahid	1
Asilkalkan, Abdullah	1
Auen, Amanda	1
Avraamidou, Lucy	1
Axelsson, Emma L.	1
More ▼

Higher Education	29
Postsecondary Education	19
Elementary Education	9
Secondary Education	8
Elementary Secondary Education	6
Early Childhood Education	5
Grade 3	5
High Schools	4
Middle Schools	4
Primary Education	4
Grade 4	3
Junior High Schools	3
Adult Education	2
Grade 1	2
Intermediate Grades	2
Grade 2	1
Grade 5	1
Grade 8	1
Grade 9	1
High School Equivalency…	1
Kindergarten	1
Preschool Education	1
More ▼