Publication Date
| In 2026 | 0 |
| Since 2025 | 14 |
Descriptor
| Accuracy | 14 |
| Computer Assisted Testing | 14 |
| Adaptive Testing | 5 |
| Artificial Intelligence | 5 |
| College Students | 4 |
| Computer Software | 4 |
| Test Items | 4 |
| Classification | 3 |
| Grading | 3 |
| Item Banks | 3 |
| Technology Uses in Education | 3 |
Author
| Agustín Garagorry Guerra | 1 |
| Aiping Yu | 1 |
| Akinbowale Natheniel Babatunde | 1 |
| Alex J. Mechaber | 1 |
| Anny Chan | 1 |
| Ayaka Sugawara | 1 |
| Benjamin G. Solomon | 1 |
| Beyza Aksu Dunya | 1 |
| Brian E. Clauser | 1 |
| Cheng-Chi Lee | 1 |
| Chunsong Jiang | 1 |
Publication Type
| Journal Articles | 14 |
| Reports - Research | 14 |
Education Level
| Higher Education | 4 |
| Postsecondary Education | 4 |
| Secondary Education | 2 |
| Early Childhood Education | 1 |
| Elementary Education | 1 |
| Grade 12 | 1 |
| Grade 3 | 1 |
| Grade 4 | 1 |
| Grade 5 | 1 |
| High Schools | 1 |
| Intermediate Grades | 1 |
Location
| Japan | 1 |
| New York | 1 |
| New York (Rochester) | 1 |
Assessments and Surveys
| Program for International… | 1 |
| Torrance Tests of Creative… | 1 |
Beyza Aksu Dunya; Stefanie Wind – International Journal of Testing, 2025
We explored the practicality of relatively small item pools in the context of low-stakes Computer-Adaptive Testing (CAT), such as CAT procedures that might be used for quick diagnostic or screening exams. We used a basic CAT algorithm without content balancing and exposure control restrictions to reflect low stakes testing scenarios. We examined…
Descriptors: Item Banks, Adaptive Testing, Computer Assisted Testing, Achievement
Ishaya Gambo; Faith-Jane Abegunde; Omobola Gambo; Roseline Oluwaseun Ogundokun; Akinbowale Natheniel Babatunde; Cheng-Chi Lee – Education and Information Technologies, 2025
The current educational system relies heavily on manual grading, posing challenges such as delayed feedback and grading inaccuracies. Automated grading tools (AGTs) offer solutions but come with limitations. To address this, "GRAD-AI" is introduced, an advanced AGT that combines automation with teacher involvement for precise grading,…
Descriptors: Automation, Grading, Artificial Intelligence, Computer Assisted Testing
Selcuk Acar; Peter Organisciak; Denis Dumas – Journal of Creative Behavior, 2025
In this three-study investigation, we applied various approaches to score drawings created in response to both Form A and Form B of the Torrance Tests of Creative Thinking-Figural (broadly TTCT-F) as well as the Multi-Trial Creative Ideation task (MTCI). We focused on TTCT-F in Study 1, and utilizing a random forest classifier, we achieved 79% and…
Descriptors: Scoring, Computer Assisted Testing, Models, Correlation
Peter Baldwin; Victoria Yaneva; Kai North; Le An Ha; Yiyun Zhou; Alex J. Mechaber; Brian E. Clauser – Journal of Educational Measurement, 2025
Recent developments in the use of large-language models have led to substantial improvements in the accuracy of content-based automated scoring of free-text responses. The reported accuracy levels suggest that automated systems could have widespread applicability in assessment. However, before they are used in operational testing, other aspects of…
Descriptors: Artificial Intelligence, Scoring, Computational Linguistics, Accuracy
Kylie Gorney; Mark D. Reckase – Journal of Educational Measurement, 2025
In computerized adaptive testing, item exposure control methods are often used to provide a more balanced usage of the item pool. Many of the most popular methods, including the restricted method (Revuelta and Ponsoda), use a single maximum exposure rate to limit the proportion of times that each item is administered. However, Barrada et al.…
Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Item Banks
Michael Bass; Scott Morris; Sheng Zhang – Measurement: Interdisciplinary Research and Perspectives, 2025
Administration of patient-reported outcome measures (PROs) using multidimensional computer adaptive tests (MCATs) has the potential to reduce patient burden, but the efficiency of MCAT depends on the degree to which an individual's responses fit the psychometric properties of the assessment. Assessing patients' symptom burden through the…
Descriptors: Adaptive Testing, Computer Assisted Testing, Patients, Outcome Measures
Xuefan Li; Marco Zappatore; Tingsong Li; Weiwei Zhang; Sining Tao; Xiaoqing Wei; Xiaoxu Zhou; Naiqing Guan; Anny Chan – IEEE Transactions on Learning Technologies, 2025
The integration of generative artificial intelligence (GAI) into educational settings offers unprecedented opportunities to enhance the efficiency of teaching and the effectiveness of learning, particularly within online platforms. This study evaluates the development and application of a customized GAI-powered teaching assistant, trained…
Descriptors: Artificial Intelligence, Technology Uses in Education, Student Evaluation, Academic Achievement
Yang Du; Susu Zhang – Journal of Educational and Behavioral Statistics, 2025
Item compromise has long posed challenges in educational measurement, jeopardizing both test validity and test security of continuous tests. Detecting compromised items is therefore crucial to address this concern. The present literature on compromised item detection reveals two notable gaps: First, the majority of existing methods are based upon…
Descriptors: Item Response Theory, Item Analysis, Bayesian Statistics, Educational Assessment
Jussi S. Jauhiainen; Agustín Garagorry Guerra – Innovations in Education and Teaching International, 2025
The study highlights ChatGPT-4's potential in educational settings for the evaluation of university students' open-ended written examination responses. ChatGPT-4 evaluated 54 written responses, ranging from 24 to 256 words in English. It assessed each response using five criteria and assigned a grade on a six-point scale from fail to excellent,…
Descriptors: Artificial Intelligence, Technology Uses in Education, Student Evaluation, Writing Evaluation
Chunsong Jiang; Xuan Chen; Aiping Yu; Guiqin Liang – Education and Information Technologies, 2025
Assignments and tests are the main forms of evaluation in the educational process, but students usually lose interest in boring exercises during course learning. Inspired by elements of human-computer battle games, a course test system is designed to encourage students to take tests more frequently and actively to achieve a better learning effect,…
Descriptors: Computer Games, Educational Games, Game Based Learning, Competition
Xiuxiu Tang; Yi Zheng; Tong Wu; Kit-Tai Hau; Hua-Hua Chang – Journal of Educational Measurement, 2025
Multistage adaptive testing (MST) has been recently adopted for international large-scale assessments such as Programme for International Student Assessment (PISA). MST offers improved measurement efficiency over traditional nonadaptive tests and improved practical convenience over single-item-adaptive computerized adaptive testing (CAT). As a…
Descriptors: Reaction Time, Test Items, Achievement Tests, Foreign Countries
Roha M. Kaipa; Sarah Wendelbo – International Journal of Multilingualism, 2025
The research on language acquisition and retention has primarily focused on monolinguals and bilinguals, with comparatively few studies including trilinguals. To address this gap, the current study compares the acquisition and retention of a novel morphosyntactic rule in Spanish in twelve monolinguals, twelve bilinguals, and twelve trilinguals.…
Descriptors: Multilingualism, Second Language Instruction, Second Language Learning, Spanish
Kayla V. Campaña; Benjamin G. Solomon – Assessment for Effective Intervention, 2025
The purpose of this study was to compare the classification accuracy of data produced by the previous year's end-of-year New York state assessment, a computer-adaptive diagnostic assessment ("i-Ready"), and the gating combination of both assessments to predict the rate of students passing the following year's end-of-year state assessment…
Descriptors: Accuracy, Classification, Diagnostic Tests, Adaptive Testing
Qiao Wang; Ralph L. Rose; Ayaka Sugawara; Naho Orita – Vocabulary Learning and Instruction, 2025
VocQGen is an automated tool designed to generate multiple-choice cloze (MCC) questions for vocabulary assessment in second language learning contexts. It leverages several natural language processing (NLP) tools and OpenAI's GPT-4 model to produce MCC items quickly from user-specified word lists. To evaluate its effectiveness, we used the first…
Descriptors: Vocabulary Skills, Artificial Intelligence, Computer Software, Multiple Choice Tests

