ERIC - Search Results

Publication Date

In 2025	3
Since 2024	6
Since 2021 (last 5 years)	15
Since 2016 (last 10 years)	26
Since 2006 (last 20 years)	74

Descriptor

Computer Assisted Testing	83
Reliability	83
Validity	34
Foreign Countries	21
Comparative Analysis	20
Scores	18
Scoring	18
Correlation	15
Student Evaluation	15
Evaluation Methods	14
Test Items	13
Item Response Theory	12
Psychometrics	12
Accuracy	10
Statistical Analysis	10
Test Construction	10
Measures (Individuals)	9
Writing Evaluation	9
Adaptive Testing	8
College Students	8
Computer Software	8
Error of Measurement	8
Essays	8
Feedback (Response)	8
Second Language Learning	8

Publication Type

Journal Articles	83
Reports - Research	53
Reports - Evaluative	19
Reports - Descriptive	7
Information Analyses	3
Opinion Papers	3
Tests/Questionnaires	3
Speeches/Meeting Papers	1

Education Level

Higher Education	28
Postsecondary Education	26
Elementary Education	8
Elementary Secondary Education	8
Secondary Education	8
High Schools	5
Middle Schools	3
Early Childhood Education	2
Grade 1	2
Grade 11	2
Grade 12	2
Grade 8	2
Primary Education	2
Adult Education	1
Grade 3	1
Grade 4	1
Grade 5	1
Grade 6	1
High School Equivalency…	1
Intermediate Grades	1
Junior High Schools	1
Kindergarten	1

Location

China	3
Australia	2
Singapore	2
Turkey	2
Arizona	1
Bangladesh	1
Brazil	1
California	1
Canada	1
Cyprus	1
Delaware	1
Florida	1
Hong Kong	1
Hungary	1
India	1
Indiana	1
Kentucky	1
Luxembourg	1
Mexico	1
Minnesota	1
Netherlands	1
North Carolina (Greensboro)	1
Oklahoma	1
Oman	1
Pennsylvania	1

Assessments and Surveys

Graduate Record Examinations	3
Test of English as a Foreign…	2
ACT Assessment	1
Beck Anxiety Inventory	1
Beck Depression Inventory	1
Dynamic Indicators of Basic…	1
Minnesota Multiphasic…	1
Need for Cognition Scale	1
Peabody Individual…	1
Pediatric Evaluation of…	1
SAT (College Admission Test)	1
Social Skills Rating System	1
United States Medical…	1
Wechsler Intelligence Scale…	1
Wisconsin Card Sorting Test	1

Showing 1 to 15 of 83 results

Grading Exams Using Large Language Models: A Comparison between Human and AI Grading of Exams in Higher Education Using ChatGPT

Peer reviewed

Jonas Flodén – British Educational Research Journal, 2025

This study compares how the generative AI (GenAI) large language model (LLM) ChatGPT performs in grading university exams compared to human teachers. Aspects investigated include consistency, large discrepancies and length of answer. Implications for higher education, including the role of teachers and ethics, are also discussed. Three…

Descriptors: College Faculty, Artificial Intelligence, Comparative Testing, Scoring

Investigating Students' Perception about LMS-Based Online Examination Practices

Peer reviewed

Shard; Devesh Kumar; Sapna Koul – International Journal of Information and Learning Technology, 2024

Purpose: This study aims to gain insights into how students perceive online examination practices and evaluation, as well as identify the key factors that impact their intentions toward online exams. Design/methodology/approach: This empirical study conducted in India utilized an online survey method between May 24 and June 14, 2022. The data were…

Descriptors: Foreign Countries, Undergraduate Students, Graduate Students, Student Attitudes

Triangulating Learner Corpus and Online Experimental Data: Evidence from Gender Agreement and Relative Clauses in L2 Greek

Peer reviewed

Despina Papadopoulou; Nikolaos Amvrazis; Gerakini Douka; Alexandros Tantos – Modern Language Journal, 2024

The article introduces triangulation to converge evidence from corpus and experimental data, by means of two case studies in second language (L2) learners of Greek. The first case study investigates the acquisition of gender agreement, while the second probes the development of relative clauses. In both studies, findings from the corpus are tested…

Descriptors: Greek, Phrase Structure, Second Language Learning, Second Language Instruction

Practical Randomly Selected Question Exam Design to Address Replicated and Sequential Questions in Online Examinations

Peer reviewed

Elkhatat, Ahmed M. – International Journal for Educational Integrity, 2022

Examinations form part of the assessment processes that constitute the basis for benchmarking individual educational progress, and must consequently fulfill credibility, reliability, and transparency standards in order to promote learning outcomes and ensure academic integrity. A randomly selected question examination (RSQE) is considered to be an…

Descriptors: Integrity, Monte Carlo Methods, Credibility, Reliability

Development, Reliability, and Concurrent Validity of the American Sign Language Version of the Computerized Revised Token Test

Peer reviewed

Emily B. Goldberg; Sheila R. Pratt; Malcolm R. McNeil; Neil Szuminsky; Kenneth DeHaan; Leslie Q. Zhen – Journal of Speech, Language, and Hearing Research, 2025

Purpose: The present study assessed the test-retest reliability of the American Sign Language (ASL) version of the Computerized Revised Token Test (CRTT-ASL) and compared the differences and similarities between ASL and English reading by Deaf and hearing users of ASL. Method: Creation of the CRTT-ASL involved filming, editing, and validating CRTT…

Descriptors: American Sign Language, Reliability, Validity, Test Construction

Examining Human and Automated Ratings of Elementary Students' Writing Quality: A Multivariate Generalizability Theory Application

Peer reviewed

Chen, Dandan; Hebert, Michael; Wilson, Joshua – American Educational Research Journal, 2022

We used multivariate generalizability theory to examine the reliability of hand-scoring and automated essay scoring (AES) and to identify how these scoring methods could be used in conjunction to optimize writing assessment. Students (n = 113) included subsamples of struggling writers and non-struggling writers in Grades 3-5 drawn from a larger…

Descriptors: Reliability, Scoring, Essays, Automation

The Impact of Artificial Intelligence on Online Assessment: A Preliminary Review

Peer reviewed
PDF on ERIC

Nejdet Karadag – Journal of Educational Technology and Online Learning, 2023

The purpose of this study is to examine the impact of artificial intelligence (AI) on online assessment in the context of opportunities and threats based on the literature. To this end, 19 articles related to the AI tool ChatGPT and online assessment were analysed through rapid literature review. In the content analysis, the themes of "AI's…

Descriptors: Artificial Intelligence, Computer Assisted Testing, Natural Language Processing, Grading

Exploring the Nexus between Assessment, Quality and Social Justice: Reflections on Remote Assessment Practices

Peer reviewed

Kershree Padayachee; M. Matimolane – Teaching in Higher Education, 2025

In the shift to Emergency Remote Teaching and Learning (ERT&L) during the COVID-19 pandemic, remote assessment and feedback became a major source of discontent and challenge for students and staff. This paper is a reflection and analysis of assessment practices during ERT&L, and our theorisation of the possibilities for shifts towards…

Descriptors: Educational Quality, Social Justice, Distance Education, Feedback (Response)

Accuracy and Reliability of Large Language Models in Assessing Learning Outcomes Achievement across Cognitive Domains

Peer reviewed

Swapna Haresh Teckwani; Amanda Huee-Ping Wong; Nathasha Vihangi Luke; Ivan Cherh Chiet Low – Advances in Physiology Education, 2024

The advent of artificial intelligence (AI), particularly large language models (LLMs) like ChatGPT and Gemini, has significantly impacted the educational landscape, offering unique opportunities for learning and assessment. In the realm of written assessment grading, traditionally viewed as a laborious and subjective process, this study sought to…

Descriptors: Accuracy, Reliability, Computational Linguistics, Standards

Semantic Distance and the Alternate Uses Task: Recommendations for Reliable Automated Assessment of Originality

Peer reviewed

Beaty, Roger E.; Johnson, Dan R.; Zeitlen, Daniel C.; Forthmann, Boris – Creativity Research Journal, 2022

Semantic distance is increasingly used for automated scoring of originality on divergent thinking tasks, such as the Alternate Uses Task (AUT). Despite some psychometric support for semantic distance -- including positive correlations with human creativity ratings -- additional work is needed to optimize its reliability and validity, including…

Descriptors: Semantics, Scoring, Creative Thinking, Creativity

Technical Characteristics of Curriculum-Based Measurement with Students Who Are Deaf

Peer reviewed

Lam, Elizabeth A.; Rose, Susan; McMaster, Kristen L. – Journal of Deaf Studies and Deaf Education, 2020

This study compared the reliability and validity of student scores from paper-pencil and e-based assessments using the "maze" and "silent reading fluency" (SRF) tasks. Forty students who were deaf and hard of hearing and reading between the second and fifth grade reading levels and their teachers (n = 21) participated. For…

Descriptors: Deafness, Hearing Impairments, Curriculum Based Assessment, Evaluation Methods

E-Assessment in Higher Education: Students' Perspective

Peer reviewed
PDF on ERIC

Huda, S. S. M.; Kabir, Md.; Siddiq, Tanvir – International Journal of Education and Development using Information and Communication Technology, 2020

This paper aims to examine the effectiveness of e-assessment in higher education from the perspective of students, and it also examines the student's reaction to this method. There are many developing countries that have begun to explore technology-based assessment systems. The new assessment system has benefits to institutions and to students.…

Descriptors: Computer Assisted Testing, Student Attitudes, Program Effectiveness, Technology Uses in Education

Validation of an Automated Procedure for Calculating Core Lexicon from Transcripts

Peer reviewed

Dalton, Sarah Grace; Stark, Brielle C.; Fromm, Davida; Apple, Kristen; MacWhinney, Brian; Rensch, Amanda; Rowedder, Madyson – Journal of Speech, Language, and Hearing Research, 2022

Purpose: The aim of this study was to advance the use of structured, monologic discourse analysis by validating an automated scoring procedure for core lexicon (CoreLex) using transcripts. Method: Forty-nine transcripts from persons with aphasia and 48 transcripts from persons with no brain injury were retrieved from the AphasiaBank database. Five…

Descriptors: Validity, Discourse Analysis, Databases, Scoring

Development of Information Functions and Indices for the GGUM-RANK Multidimensional Forced Choice IRT Model

Peer reviewed

Joo, Seang-Hwane; Lee, Philseok; Stark, Stephen – Journal of Educational Measurement, 2018

This research derived information functions and proposed new scalar information indices to examine the quality of multidimensional forced choice (MFC) items based on the RANK model. We also explored how GGUM-RANK information, latent trait recovery, and reliability varied across three MFC formats: pairs (two response alternatives), triplets (three…

Descriptors: Item Response Theory, Models, Item Analysis, Reliability

Attribute-Level Item Selection Method for DCM-CAT

Peer reviewed

Bao, Yu; Bradshaw, Laine – Measurement: Interdisciplinary Research and Perspectives, 2018

Diagnostic classification models (DCMs) can provide multidimensional diagnostic feedback about students' mastery levels of knowledge components or attributes. One advantage of using DCMs is the ability to accurately and reliably classify students into mastery levels with a relatively small number of items per attribute. Combining DCMs with…

Descriptors: Test Items, Selection, Adaptive Testing, Computer Assisted Testing

Source

ETS Research Report Series	4
Educational and Psychological…	4
International Journal of…	4
Applied Psychological…	3
Journal of Technology,…	3
Advances in Physiology…	2
Assessment	2
Assessment & Evaluation in…	2
Journal of Psychoeducational…	2
Journal of Speech, Language,…	2
American Educational Research…	1
Applied Linguistics	1
Applied Measurement in…	1
Arab World English Journal	1
Assessment for Effective…	1
British Educational Research…	1
British Journal of…	1
CALICO Journal	1
CBE - Life Sciences Education	1
Chemistry Education Research…	1
Child & Youth Care Forum	1
Computers in Human Behavior	1
Creativity Research Journal	1
Curriculum Journal	1
EURASIA Journal of…	1

Author

Attali, Yigal	3
Delen, Erhan	2
Knight, Jennifer K.	2
Wolfe, Edward W.	2
Afkhamizadeh, Mozhgan	1
Aghili, Zahra	1
Al-Bahlani, Sara	1
Al-Maqbali, Asma Hilal	1
Alexandros Tantos	1
Allehaiby, Wid Hasen	1
Amanda Huee-Ping Wong	1
Apple, Kristen	1
Arce-Ferrer, Alvaro J.	1
Baldwin, Peter	1
Bao, Yu	1
Baron-Cohen, Simon	1
Barron, Ann E.	1
Beaty, Roger E.	1
Beddow, Peter A.	1
Bell, John F.	1
Blanes, Erika	1
Bradshaw, Laine	1
Brown, Gavin T. L.	1
Brownell, Sara	1