ERIC - Search Results

Publication Date

In 2026	0
Since 2025	3
Since 2022 (last 5 years)	11
Since 2017 (last 10 years)	32
Since 2007 (last 20 years)	67

Descriptor

Computer Assisted Testing	79
Correlation	79
Scoring	65
Second Language Learning	25
English (Second Language)	24
Language Tests	24
Scores	24
Comparative Analysis	19
Essays	18
Evaluators	18
Accuracy	16
Computer Software	16
Foreign Countries	16
Test Validity	16
Scoring Rubrics	15
Writing Evaluation	15
Writing Tests	15
Automation	13
Interrater Reliability	13
Statistical Analysis	12
Test Reliability	12
Models	11
Regression (Statistics)	11
Evaluation Methods	10
Reliability	10

Publication Type

Journal Articles	65
Reports - Research	64
Reports - Evaluative	8
Tests/Questionnaires	5
Dissertations/Theses -…	3
Reports - Descriptive	3
Speeches/Meeting Papers	3
Information Analyses	1
Numerical/Quantitative Data	1
Opinion Papers	1

Education Level

Higher Education	17
Postsecondary Education	12
Secondary Education	8
Elementary Secondary Education	4
High Schools	4
Elementary Education	3
Middle Schools	3
Grade 8	2
Junior High Schools	2
Early Childhood Education	1
Grade 4	1
Grade 6	1
Grade 9	1
Preschool Education	1

Location

California	2
China	2
Australia	1
Canada	1
Denmark	1
Germany	1
Hong Kong	1
Iran	1
Israel	1
Massachusetts	1
North Carolina (Greensboro)	1
Philippines	1
Singapore	1
Taiwan	1
Texas	1
United Kingdom (Coventry)	1
United States	1
Vietnam	1

Assessments and Surveys

Test of English as a Foreign…	17
Graduate Record Examinations	4
Torrance Tests of Creative…	2
Foreign Language Classroom…	1
NEO Personality Inventory	1
SAT (College Admission Test)	1
United States Medical…	1

Showing 1 to 15 of 79 results

Automated Scoring of Figural Tests of Creativity with Computer Vision

Peer reviewed

Selcuk Acar; Peter Organisciak; Denis Dumas – Journal of Creative Behavior, 2025

In this three-study investigation, we applied various approaches to score drawings created in response to both Form A and Form B of the Torrance Tests of Creative Thinking-Figural (broadly TTCT-F) as well as the Multi-Trial Creative Ideation task (MTCI). We focused on TTCT-F in Study 1, and utilizing a random forest classifier, we achieved 79% and…

Descriptors: Scoring, Computer Assisted Testing, Models, Correlation

Accuracy and Reliability of Large Language Models in Assessing Learning Outcomes Achievement across Cognitive Domains

Peer reviewed

Swapna Haresh Teckwani; Amanda Huee-Ping Wong; Nathasha Vihangi Luke; Ivan Cherh Chiet Low – Advances in Physiology Education, 2024

The advent of artificial intelligence (AI), particularly large language models (LLMs) like ChatGPT and Gemini, has significantly impacted the educational landscape, offering unique opportunities for learning and assessment. In the realm of written assessment grading, traditionally viewed as a laborious and subjective process, this study sought to…

Descriptors: Accuracy, Reliability, Computational Linguistics, Standards

Online Calibration in Multidimensional Computerized Adaptive Testing with Polytomously Scored Items

Peer reviewed

Yuan, Lu; Huang, Yingshi; Li, Shuhang; Chen, Ping – Journal of Educational Measurement, 2023

Online calibration is a key technology for item calibration in computerized adaptive testing (CAT) and has been widely used in various forms of CAT, including unidimensional CAT, multidimensional CAT (MCAT), CAT with polytomously scored items, and cognitive diagnostic CAT. However, as multidimensional and polytomous assessment data become more…

Descriptors: Computer Assisted Testing, Adaptive Testing, Computation, Test Items

A Tool for Automatic Scoring of Spelling Performance

Peer reviewed

Themistocleous, Charalambos; Neophytou, Kyriaki; Rapp, Brenda; Tsapkini, Kyrana – Journal of Speech, Language, and Hearing Research, 2020

Purpose: The evaluation of spelling performance in aphasia reveals deficits in written language and can facilitate the design of targeted writing treatments. Nevertheless, manual scoring of spelling performance is time-consuming, laborious, and error prone. We propose a novel method based on the use of distance metrics to automatically score…

Descriptors: Computer Assisted Testing, Scoring, Spelling, Scores

Assessing Creativity across Multi-Step Intervention Using Generative AI Models

Peer reviewed
PDF on ERIC

Eran Hadas; Arnon Hershkovitz – Journal of Learning Analytics, 2025

Creativity is an imperative skill for today's learners, one that has important contributions to issues of inclusion and equity in education. Therefore, assessing creativity is of major importance in educational contexts. However, scoring creativity based on traditional tools suffers from subjectivity and is heavily time- and labour-consuming. This…

Descriptors: Creativity, Evaluation Methods, Computer Assisted Testing, Artificial Intelligence

Impact of Categorization and Scaling on Classification Agreement and Prediction Accuracy Statistics. Research Report. ETS RR-21-26

Peer reviewed
PDF on ERIC

Wang, Wei; Dorans, Neil J. – ETS Research Report Series, 2021

Agreement statistics and measures of prediction accuracy are often used to assess the quality of two measures of a construct. Agreement statistics are appropriate for measures that are supposed to be interchangeable, whereas prediction accuracy statistics are appropriate for situations where one variable is the target and the other variables are…

Descriptors: Classification, Scaling, Prediction, Accuracy

Validating Human and Automated Scoring of Essays against "True" Scores

Peer reviewed

Cohen, Yoav; Levi, Effi; Ben-Simon, Anat – Applied Measurement in Education, 2018

In the current study, two pools of 250 essays, all written as a response to the same prompt, were rated by two groups of raters (14 or 15 raters per group), thereby providing an approximation to the essay's true score. An automated essay scoring (AES) system was trained on the datasets and then scored the essays using a cross-validation scheme. By…

Descriptors: Test Validity, Automation, Scoring, Computer Assisted Testing

Meta-Analysis of Inter-Rater Agreement and Discrepancy Between Human and Automated English Essay Scoring

Peer reviewed
PDF on ERIC

Jiyeo Yun – English Teaching, 2023

Studies on automatic scoring systems in writing assessments have also evaluated the relationship between human and machine scores for the reliability of automated essay scoring systems. This study investigated the magnitudes of indices for inter-rater agreement and discrepancy, especially regarding human and machine scoring, in writing assessment.…

Descriptors: Meta Analysis, Interrater Reliability, Essays, Scoring

Measuring Original Thinking in Elementary School: Development and Validation of a Computational Psychometric Approach

Peer reviewed

Selcuk Acar; Denis Dumas; Peter Organisciak; Kelly Berthiaume – Grantee Submission, 2024

Creativity is highly valued in both education and the workforce, but assessing and developing creativity can be difficult without psychometrically robust and affordable tools. The open-ended nature of creativity assessments has made them difficult to score, expensive, often imprecise, and therefore impractical for school- or district-wide use. To…

Descriptors: Thinking Skills, Elementary School Students, Artificial Intelligence, Measurement Techniques

The Prevalence and Correlates of Accurate Singing

Peer reviewed

Pfordresher, Peter Q.; Demorest, Steven M. – Journal of Research in Music Education, 2021

The purpose of this study was to analyze a large sample of volunteers from the general population who were tested with an identical online measure of singing accuracy. A sample of 632 participants completed the Seattle Singing Accuracy Protocol (SSAP), a standardized measure of singing accuracy, available online, that includes a test of pitch…

Descriptors: Correlation, Accuracy, Singing, Computer Assisted Testing

The Effectiveness of Machine Score-Ability Ratings in Predicting Automated Scoring Performance

Peer reviewed

Lottridge, Susan; Wood, Scott; Shaw, Dan – Applied Measurement in Education, 2018

This study sought to provide a framework for evaluating machine score-ability of items using a new score-ability rating scale, and to determine the extent to which ratings were predictive of observed automated scoring performance. The study listed and described a set of factors that are thought to influence machine score-ability; these factors…

Descriptors: Program Effectiveness, Computer Assisted Testing, Test Scoring Machines, Scoring

Semantic Distance and the Alternate Uses Task: Recommendations for Reliable Automated Assessment of Originality

Peer reviewed

Beaty, Roger E.; Johnson, Dan R.; Zeitlen, Daniel C.; Forthmann, Boris – Creativity Research Journal, 2022

Semantic distance is increasingly used for automated scoring of originality on divergent thinking tasks, such as the Alternate Uses Task (AUT). Despite some psychometric support for semantic distance -- including positive correlations with human creativity ratings -- additional work is needed to optimize its reliability and validity, including…

Descriptors: Semantics, Scoring, Creative Thinking, Creativity

Using Latent Semantic Analysis to Score Short Answer Constructed Responses: Automated Scoring of the Consequences Test

Peer reviewed

LaVoie, Noelle; Parker, James; Legree, Peter J.; Ardison, Sharon; Kilcullen, Robert N. – Educational and Psychological Measurement, 2020

Automated scoring based on Latent Semantic Analysis (LSA) has been successfully used to score essays and constrained short answer responses. Scoring tests that capture open-ended, short answer responses poses some challenges for machine learning approaches. We used LSA techniques to score short answer responses to the Consequences Test, a measure…

Descriptors: Semantics, Evaluators, Essays, Scoring

Binding Costs in Processing Efficiency as Determinants of Cognitive Ability

Peer reviewed
PDF on ERIC

Goecke, Benjamin; Schmitz, Florian; Wilhelm, Oliver – Journal of Intelligence, 2021

Performance in elementary cognitive tasks is moderately correlated with fluid intelligence and working memory capacity. These correlations are higher for more complex tasks, presumably due to increased demands on working memory capacity. In accordance with the binding hypothesis, which states that working memory capacity reflects the limit of a…

Descriptors: Intelligence, Cognitive Processes, Short Term Memory, Reaction Time

Validation of an Automated Procedure for Calculating Core Lexicon from Transcripts

Peer reviewed

Dalton, Sarah Grace; Stark, Brielle C.; Fromm, Davida; Apple, Kristen; MacWhinney, Brian; Rensch, Amanda; Rowedder, Madyson – Journal of Speech, Language, and Hearing Research, 2022

Purpose: The aim of this study was to advance the use of structured, monologic discourse analysis by validating an automated scoring procedure for core lexicon (CoreLex) using transcripts. Method: Forty-nine transcripts from persons with aphasia and 48 transcripts from persons with no brain injury were retrieved from the AphasiaBank database. Five…

Descriptors: Validity, Discourse Analysis, Databases, Scoring

Source

ETS Research Report Series	13
Language Testing	7
Grantee Submission	3
Journal of Speech, Language,…	3
ProQuest LLC	3
Applied Measurement in…	2
Journal of Educational…	2
Turkish Online Journal of…	2
Advances in Physiology…	1
Applied Linguistics	1
Applied Psychological…	1
Assessing Writing	1
Assessment	1
CALICO Journal	1
College Student Journal	1
Computers & Education	1
Contemporary Issues in…	1
Council for Aid to Education	1
Creativity Research Journal	1
Educational Assessment	1
Educational Testing Service	1
Educational and Psychological…	1
English Teaching	1
IEEE Transactions on Learning…	1
International Journal of…	1

Author

Attali, Yigal	4
Bridgeman, Brent	4
Breyer, F. Jay	3
Anna-Maria Fall	2
Bennett, Randy Elliot	2
Beula M. Magimairaj	2
Crossley, Scott	2
Denis Dumas	2
Dikici, Ayhan	2
Gentile, Claudia	2
Greg Roberts	2
Kantor, Robert	2
Kunnan, Antony John	2
Lee, Yong-Won	2
McNamara, Danielle	2
Peter Organisciak	2
Philip Capin	2
Ramineni, Chaitanya	2
Ronald B. Gillam	2
Sandra L. Gillam	2
Selcuk Acar	2
Sharon Vaughn	2
Sinharay, Sandip	2
Tezci, Erdogan	2
Williamson, David M.	2