ERIC - Search Results

Publication Date

In 2025	1
Since 2024	7

Descriptor

Computer Assisted Testing	7
Error of Measurement	7
Adaptive Testing	3
Test Reliability	3
Accuracy	2
Evaluation Methods	2
Student Evaluation	2
Test Validity	2
Thinking Skills	2
Adults	1
Aptitude Tests	1
Armed Forces	1
Artificial Intelligence	1
Autism Spectrum Disorders	1
Cognitive Measurement	1
Cognitive Tests	1
College Faculty	1
Comparative Analysis	1
Comparative Testing	1
Computation	1
Data	1
Data Analysis	1
Educational Diagnosis	1
Educational History	1
Elementary School Students	1
More ▼

Source

ProQuest LLC	2
British Educational Research…	1
ETS Research Institute	1
Education and Information…	1
Journal of Autism and…	1
Journal of Computer Assisted…	1

Publication Type

Journal Articles	4
Reports - Research	4
Dissertations/Theses -…	2
Reports - Evaluative	1

Education Level

Elementary Education	1
Elementary Secondary Education	1
High Schools	1
Higher Education	1
Junior High Schools	1
Middle Schools	1
Postsecondary Education	1
Secondary Education	1

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

Armed Services Vocational…

What Works Clearinghouse Rating

Showing all 7 results Save | Export

Grading Exams Using Large Language Models: A Comparison between Human and AI Grading of Exams in Higher Education Using ChatGPT

Peer reviewed

Direct link

Jonas Flodén – British Educational Research Journal, 2025

This study compares how the generative AI (GenAI) large language model (LLM) ChatGPT performs in grading university exams compared to human teachers. Aspects investigated include consistency, large discrepancies and length of answer. Implications for higher education, including the role of teachers and ethics, are also discussed. Three…

Descriptors: College Faculty, Artificial Intelligence, Comparative Testing, Scoring

Cognitive Diagnosis for Multiple-Choice Responses: Nonparametric Classification Method, Q-Matrix Theory, and Computerized Adaptive Testing

Direct link

Yu Wang – ProQuest LLC, 2024

The multiple-choice (MC) item format has been widely used in educational assessments across diverse content domains. MC items purportedly allow for collecting richer diagnostic information. The effectiveness and economy of administering MC items may have further contributed to their popularity not just in educational assessment. The MC item format…

Descriptors: Multiple Choice Tests, Cognitive Tests, Cognitive Measurement, Educational Diagnosis

A Two-Tier Computerized Adaptive Test to Measure Student Computational Thinking Skills

Peer reviewed

Direct link

Rizki Zakwandi; Edi Istiyono; Wipsar Sunu Brams Dwandaru – Education and Information Technologies, 2024

Computational Thinking (CT) skill was a part of the global framework of reference on Digital Literacy for Indicator 4.4.2, widely developed in mathematics and science learning. This study aimed to promote an assessment tool using a two-tier Computerized Adaptive Test (CAT). The study used the Design and Development Research (DDR) method with four…

Descriptors: Computer Assisted Testing, Adaptive Testing, Student Evaluation, Computation

The Social Shapes Test as a Self-Administered, Online Measure of Social Intelligence: Two Studies with Typically Developing Adults and Adults with Autism Spectrum Disorder

Peer reviewed

Direct link

Matt I. Brown; Patrick R. Heck; Christopher F. Chabris – Journal of Autism and Developmental Disorders, 2024

The Social Shapes Test (SST) is a measure of social intelligence which does not use human faces or rely on extensive verbal ability. The SST has shown promising validity among adults without autism spectrum disorder (ASD), but it is uncertain whether it is suitable for adults with ASD. We find measurement invariance between adults with (n = 229)…

Descriptors: Interpersonal Competence, Autism Spectrum Disorders, Emotional Intelligence, Verbal Ability

Duration versus Accuracy--What Matters for Computerised Adaptive Testing in Schools?

Peer reviewed

Direct link

Nikola Ebenbeck; Morten Bastian; Andreas Mühling; Markus Gebhardt – Journal of Computer Assisted Learning, 2024

Background: Computerised adaptive tests (CATs) are tests that provide personalised, efficient and accurate measurement while reducing testing time, depending on the desired level of precision. Schools have different types of assessments that can benefit from a significant reduction in testing time to varying degrees, depending on the area of…

Descriptors: Computer Assisted Testing, Elementary Secondary Education, Public Schools, Special Schools

Measuring and Modeling Human Capital: Confirmatory IRT, Poor-Proxy Bias, and Latent Convection

Direct link

Stefan Lorenz – ProQuest LLC, 2024

This dissertation develops and applies sophisticated Item Response Theory (IRT) methods to address fundamental measurement challenges in cognitive testing, focusing on the Armed Services Vocational Aptitude Battery (ASVAB) data from the National Longitudinal Survey of Youth (NLSY). The first chapter implements a confirmatory multidimensional IRT…

Descriptors: Human Capital, Item Response Theory, Vocational Aptitude, Armed Forces

Charting the Future of Assessments. Full Report

Download full text

Patrick C. Kyllonen; Amit Sevak; Teresa Ober; Ikkyu Choi; Jesse Sparks; Daniel Fishtein – ETS Research Institute, 2024

Assessment refers to a broad array of approaches for measuring or evaluating a person's (or group of persons') skills, behaviors, dispositions, or other attributes. Assessments range from standardized tests used in admissions, employee selection, licensure examinations, and domestic and international largescale assessments of cognitive and…

Descriptors: Performance Based Assessment, Evaluation Criteria, Evaluation Methods, Test Bias

Amit Sevak	1
Andreas Mühling	1
Christopher F. Chabris	1
Daniel Fishtein	1
Edi Istiyono	1
Ikkyu Choi	1
Jesse Sparks	1
Jonas Flodén	1
Markus Gebhardt	1
Matt I. Brown	1
Morten Bastian	1
Nikola Ebenbeck	1
Patrick C. Kyllonen	1
Patrick R. Heck	1
Rizki Zakwandi	1
Stefan Lorenz	1
Teresa Ober	1
Wipsar Sunu Brams Dwandaru	1
Yu Wang	1
More ▼