Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 8 |
| Since 2017 (last 10 years) | 21 |
| Since 2007 (last 20 years) | 27 |
Descriptor
| Accuracy | 27 |
| Computer Assisted Testing | 27 |
| Prediction | 27 |
| Correlation | 10 |
| Foreign Countries | 10 |
| Comparative Analysis | 8 |
| Reaction Time | 8 |
| Task Analysis | 8 |
| Classification | 7 |
| English (Second Language) | 6 |
| Language Processing | 5 |
| More ▼ | |
Source
Author
| Abayeva, Nella F. | 1 |
| Adjei, Seth A. | 1 |
| Anna Filighera | 1 |
| Attali, Yigal | 1 |
| Ballier, Nicolas | 1 |
| Barnes, Tiffany, Ed. | 1 |
| Bhatia, Vrinda | 1 |
| Bilki, Zeynep | 1 |
| Boss, Emily | 1 |
| Botelho, Anthony F. | 1 |
| Bouyé, Manon | 1 |
| More ▼ | |
Publication Type
| Journal Articles | 22 |
| Reports - Research | 22 |
| Collected Works - Proceedings | 2 |
| Dissertations/Theses -… | 2 |
| Reports - Descriptive | 1 |
| Speeches/Meeting Papers | 1 |
Education Level
| Higher Education | 6 |
| Postsecondary Education | 6 |
| Elementary Education | 5 |
| Adult Education | 2 |
| Elementary Secondary Education | 2 |
| Grade 4 | 2 |
| Middle Schools | 2 |
| Grade 10 | 1 |
| Grade 12 | 1 |
| Grade 5 | 1 |
| Grade 7 | 1 |
| More ▼ | |
Audience
Location
| Netherlands | 2 |
| Australia | 1 |
| Canada | 1 |
| Canada (Ottawa) | 1 |
| China | 1 |
| Czech Republic | 1 |
| Germany | 1 |
| Greece | 1 |
| Ireland | 1 |
| Ireland (Dublin) | 1 |
| Israel | 1 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
| Graduate Record Examinations | 2 |
| Measures of Academic Progress | 2 |
| Test of English as a Foreign… | 2 |
| ACT Aspire | 1 |
| Massachusetts Comprehensive… | 1 |
| Minnesota Comprehensive… | 1 |
| Praxis Series | 1 |
| Program for the International… | 1 |
What Works Clearinghouse Rating
Anna Filighera; Sebastian Ochs; Tim Steuer; Thomas Tregel – International Journal of Artificial Intelligence in Education, 2024
Automatic grading models are valued for the time and effort saved during the instruction of large student bodies. Especially with the increasing digitization of education and interest in large-scale standardized testing, the popularity of automatic grading has risen to the point where commercial solutions are widely available and used. However,…
Descriptors: Cheating, Grading, Form Classes (Languages), Computer Software
Susan Barnes Porter – ProQuest LLC, 2022
The data from universal screeners must be valid and reliable in order to use it to make appropriate decisions about how best to allocate resources to support students who are at risk of not passing the state achievement test. The instruments used as part of universal screening must also have diagnostic accuracy. This study examined the diagnostic…
Descriptors: Screening Tests, Accuracy, Computer Assisted Testing, Achievement Tests
Wang, Wei; Dorans, Neil J. – ETS Research Report Series, 2021
Agreement statistics and measures of prediction accuracy are often used to assess the quality of two measures of a construct. Agreement statistics are appropriate for measures that are supposed to be interchangeable, whereas prediction accuracy statistics are appropriate for situations where one variable is the target and the other variables are…
Descriptors: Classification, Scaling, Prediction, Accuracy
Hsu, Hao-Hsuan; Huang, Nen-Fu – IEEE Transactions on Learning Technologies, 2022
This article introduces Xiao-Shih, the first intelligent question answering bot on Chinese-based massive open online courses (MOOCs). Question answering is critical for solving individual problems. However, instructors on MOOCs must respond to many questions, and learners must wait a long time for answers. To address this issue, Xiao-Shih…
Descriptors: Foreign Countries, Artificial Intelligence, Online Courses, Natural Language Processing
Shukla, Vishakha; Long, Madeleine; Bhatia, Vrinda; Rubio-Fernandez, Paula – Journal of Experimental Psychology: Learning, Memory, and Cognition, 2022
While most research on scalar implicature has focused on the lexical scale "some" vs "all," here we investigated an understudied scale formed by two syntactic constructions: categorizations (e.g., "Wilma is a nurse") and comparisons ("Wilma is like a nurse"). An experimental study by Rubio-Fernandez et al.…
Descriptors: Cues, Pragmatics, Comparative Analysis, Syntax
McGuire, Katherine L. – Journal of Cognition and Development, 2022
Children have traditionally been viewed as less reliable witnesses than are adults. More recently, a concept known as developmental reversals, has brought this view into question. Developmental reversals have demonstrated that in certain contexts, children produce fewer false memories than adults. The primary paradigm used to demonstrate…
Descriptors: Memory, Cognitive Processes, Context Effect, Accuracy
Thompson, James J. – Measurement: Interdisciplinary Research and Perspectives, 2022
With the use of computerized testing, ordinary assessments can capture both answer accuracy and answer response time. For the Canadian Programme for the International Assessment of Adult Competencies (PIAAC) numeracy and literacy subtests, person ability, person speed, question difficulty, question time intensity, fluency (rate), person fluency…
Descriptors: Foreign Countries, Adults, Computer Assisted Testing, Network Analysis
Mingying Zheng – ProQuest LLC, 2024
The digital transformation in educational assessment has led to the proliferation of large-scale data, offering unprecedented opportunities to enhance language learning, and testing through machine learning (ML) techniques. Drawing on the extensive data generated by online English language assessments, this dissertation investigates the efficacy…
Descriptors: Artificial Intelligence, Computational Linguistics, Language Tests, English (Second Language)
Gaillat, Thomas; Simpkin, Andrew; Ballier, Nicolas; Stearns, Bernardo; Sousa, Annanda; Bouyé, Manon; Zarrouk, Manel – ReCALL, 2021
This paper focuses on automatically assessing language proficiency levels according to linguistic complexity in learner English. We implement a supervised learning approach as part of an automatic essay scoring system. The objective is to uncover Common European Framework of Reference for Languages (CEFR) criterial features in writings by learners…
Descriptors: Prediction, Rating Scales, English (Second Language), Second Language Learning
Harding, Bradley; Cousineau, Denis – Journal of Experimental Psychology: Learning, Memory, and Cognition, 2022
The same-different task is a classic paradigm that requires participants to judge whether two successively presented stimuli are the same or different. While this task is simple, with results that have been replicated many times, response times (RTs) and accuracy for both same and different decisions remain difficult to model. The biggest obstacle…
Descriptors: Self Concept, Task Analysis, Priming, Reaction Time
Muhl-Richardson, Alex; Cornes, Katherine; Godwin, Hayward J.; Garner, Matthew; Hadwin, Julie A.; Liversedge, Simon P.; Donnelly, Nick – Applied Cognitive Psychology, 2018
Target onsets in dynamically changing displays can be predicted when contingencies exist between different stimulus states over time. In the present study, we examined predictive monitoring when participants searched dynamically changing displays of numbers and colored squares for a color target, a number target, or both. Stimuli were presented in…
Descriptors: Prediction, Visual Stimuli, Color, Spatial Ability
VanMeveren, Kalie; Hulac, David; Wollersheim-Shervey, Sarah – Assessment for Effective Intervention, 2020
Reading screening assessments help educators identify students who are at risk of reading and determine the need for intervention and supports. However, some schools screen and assess students more often than needed, and the additional information does not improve the accuracy of decisions. This may be especially true for students at the upper…
Descriptors: Reading Tests, Screening Tests, Elementary School Students, High Stakes Tests
Park, Ryoungsun; Kim, Jiseon; Chung, Hyewon; Dodd, Barbara G. – Educational and Psychological Measurement, 2017
The current study proposes novel methods to predict multistage testing (MST) performance without conducting simulations. This method, called MST test information, is based on analytic derivation of standard errors of ability estimates across theta levels. We compared standard errors derived analytically to the simulation results to demonstrate the…
Descriptors: Testing, Performance, Prediction, Error of Measurement
Yao, Lili; Haberman, Shelby J.; Zhang, Mo – ETS Research Report Series, 2019
Many assessments of writing proficiency that aid in making high-stakes decisions consist of several essay tasks evaluated by a combination of human holistic scores and computer-generated scores for essay features such as the rate of grammatical errors per word. Under typical conditions, a summary writing score is provided by a linear combination…
Descriptors: Prediction, True Scores, Computer Assisted Testing, Scoring
Evans, William S.; Cavanaugh, Robert; Quique, Yina; Boss, Emily; Starns, Jeffrey J.; Hula, William D. – Journal of Speech, Language, and Hearing Research, 2021
Purpose: The purpose of this study was to develop and pilot a novel treatment framework called "BEARS" (Balancing Effort, Accuracy, and Response Speed). People with aphasia (PWA) have been shown to maladaptively balance speed and accuracy during language tasks. BEARS is designed to train PWA to balance speed-accuracy trade-offs and…
Descriptors: Accuracy, Semantics, Aphasia, Reaction Time
Previous Page | Next Page »
Pages: 1 | 2
Peer reviewed
Direct link
