Publication Date
In 2025: 1
Since 2024: 7
Since 2021 (last 5 years): 19
Since 2016 (last 10 years): 38
Since 2006 (last 20 years): 58

Descriptor
Computer Assisted Testing: 103
Error of Measurement: 103
Adaptive Testing: 60
Test Items: 47
Item Response Theory: 39
Simulation: 25
Item Banks: 21
Comparative Analysis: 19
Test Construction: 17
Test Reliability: 16
Estimation (Mathematics): 14
…

Author
Ban, Jae-Chun: 3
De Ayala, R. J.: 3
Yi, Qing: 3
Zwick, Rebecca: 3
van der Linden, Wim J.: 3
Anna-Maria Fall: 2
Bergstrom, Betty A.: 2
Beula M. Magimairaj: 2
Dodd, Barbara G.: 2
Green, Donald Ross: 2
Greg Roberts: 2
…

Audience
Practitioners: 2
Researchers: 2

Location
Australia: 2
Indonesia: 2
Turkey: 2
Canada: 1
China: 1
Israel: 1
Japan: 1
Norway: 1
Portugal: 1
Saudi Arabia: 1
Sweden: 1
…
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 1 |
Race to the Top | 1 |
Jonas Flodén – British Educational Research Journal, 2025
This study compares how the generative AI (GenAI) large language model (LLM) ChatGPT and human teachers perform when grading university exams. Aspects investigated include consistency, large discrepancies, and answer length. Implications for higher education, including the role of teachers and ethics, are also discussed. Three…
Descriptors: College Faculty, Artificial Intelligence, Comparative Testing, Scoring
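Flodén's consistency questions are the kind usually answered with simple agreement statistics between the two graders. Below is a minimal, hypothetical sketch (the scores and the discrepancy threshold are invented, not the study's data or method): it computes the correlation, the exact-agreement rate, and the share of large discrepancies between teacher- and ChatGPT-assigned grades.

```python
import numpy as np

human = np.array([4, 7, 5, 9, 6, 3, 8, 5])     # teacher-assigned grades (invented)
ai    = np.array([5, 7, 4, 8, 6, 4, 8, 6])     # ChatGPT-assigned grades (invented)

r = np.corrcoef(human, ai)[0, 1]               # consistency as linear correlation
exact = np.mean(human == ai)                   # exact-agreement rate
large_gap = np.mean(np.abs(human - ai) >= 2)   # share of large discrepancies

print(f"r = {r:.2f}, exact agreement = {exact:.0%}, "
      f"large discrepancies = {large_gap:.0%}")
```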
Jackson, Kayla – ProQuest LLC, 2023
Prior research highlights the benefits of multimode surveys and best practices for item-by-item (IBI) and matrix-type survey items. Some researchers have explored whether mode differences for online and paper surveys persist for these survey item types. However, no studies discuss measurement invariance when both item types and online modes are…
Descriptors: Test Items, Surveys, Error of Measurement, Item Response Theory
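For context, measurement invariance across groups (here, item types or modes) is typically tested by comparing a constrained multi-group model against a freer one with a chi-square difference test. The sketch below shows only that decision rule; the fit statistics are entirely made up, and the dissertation's actual models will differ.

```python
from scipy.stats import chi2

# Fit statistics are invented for illustration.
chisq_free, df_free = 412.3, 180       # loadings free across modes
chisq_eq, df_eq = 431.8, 192           # loadings constrained equal

delta_chisq = chisq_eq - chisq_free
delta_df = df_eq - df_free
p = chi2.sf(delta_chisq, delta_df)

print(f"chi-square difference({delta_df}) = {delta_chisq:.1f}, p = {p:.3f}")
# p > .05 here: the equality constraints do not significantly worsen fit,
# which would support metric invariance across modes / item types.
```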
Cooperman, Allison W.; Weiss, David J.; Wang, Chun – Educational and Psychological Measurement, 2022
Adaptive measurement of change (AMC) is a psychometric method for measuring intra-individual change on one or more latent traits across testing occasions. Three hypothesis tests (a Z test, a likelihood ratio test, and a score ratio index) have demonstrated desirable statistical properties in this context, including low false positive rates and high…
Descriptors: Error of Measurement, Psychometrics, Hypothesis Testing, Simulation
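Of the three tests named in the abstract, the Z test is the simplest to state: the difference between two occasion-specific ability estimates is scaled by their pooled standard error. A minimal sketch with invented estimates follows; the likelihood ratio and score ratio tests are not reproduced here.

```python
from math import sqrt
from scipy.stats import norm

theta_1, se_1 = -0.40, 0.32    # occasion 1: ability estimate and SE (invented)
theta_2, se_2 =  0.35, 0.30    # occasion 2

z = (theta_2 - theta_1) / sqrt(se_1**2 + se_2**2)
p = 2 * norm.sf(abs(z))        # two-sided test of the "no change" hypothesis

print(f"z = {z:.2f}, p = {p:.3f}")
```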
Yu Wang – ProQuest LLC, 2024
The multiple-choice (MC) item format has been widely used in educational assessments across diverse content domains. MC items purportedly allow for collecting richer diagnostic information. The effectiveness and economy of administering MC items may have further contributed to their popularity beyond educational assessment. The MC item format…
Descriptors: Multiple Choice Tests, Cognitive Tests, Cognitive Measurement, Educational Diagnosis
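One common route to the richer diagnostic information the abstract mentions is to model the choice among all MC options rather than scoring right/wrong, for example with Bock's nominal response model. The sketch below is illustrative only; the option parameters are invented, and the dissertation may use a different model.

```python
import numpy as np

def nrm_probs(theta, a, c):
    """Option-choice probabilities under the nominal response model."""
    z = np.asarray(a) * theta + np.asarray(c)
    ez = np.exp(z - z.max())       # subtract max for numerical stability
    return ez / ez.sum()

a = [1.2, -0.3, -0.5, -0.4]        # slope per option, keyed option first (invented)
c = [0.5, 0.2, -0.3, -0.4]         # intercept per option (invented)

for theta in (-1.5, 0.0, 1.5):
    print(theta, np.round(nrm_probs(theta, a, c), 2))
```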
Sahin Kursad, Merve; Cokluk Bokeoglu, Omay; Cikrikci, Rahime Nukhet – International Journal of Assessment Tools in Education, 2022
Item parameter drift (IPD) is the systematic change in the parameter values of items over time, arising for various reasons. If it occurs in computer adaptive tests (CAT), it causes errors in the estimation of item and ability parameters. Identifying the conditions under which it arises in CAT is important for estimating item and…
Descriptors: Item Analysis, Computer Assisted Testing, Test Items, Error of Measurement
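To see how drift propagates into estimation error, consider a two-parameter logistic (2PL) item whose difficulty has drifted while the item bank still stores the calibrated value: the probabilities the scoring model assumes no longer match those examinees actually face. A toy illustration with invented parameters:

```python
from math import exp

def p_2pl(theta, a, b):
    """P(correct) under the two-parameter logistic model."""
    return 1.0 / (1.0 + exp(-a * (theta - b)))

a, b_calibrated, drift = 1.3, 0.0, 0.4          # invented values

for theta in (-1.0, 0.0, 1.0):
    print(f"theta = {theta:+.1f}: "
          f"P(bank b) = {p_2pl(theta, a, b_calibrated):.2f}, "
          f"P(drifted b) = {p_2pl(theta, a, b_calibrated + drift):.2f}")
```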
Rizki Zakwandi; Edi Istiyono; Wipsar Sunu Brams Dwandaru – Education and Information Technologies, 2024
Computational Thinking (CT) skill is part of the global framework of reference on Digital Literacy for Indicator 4.4.2 and is widely developed in mathematics and science learning. This study aimed to promote an assessment tool using a two-tier Computerized Adaptive Test (CAT). The study used the Design and Development Research (DDR) method with four…
Descriptors: Computer Assisted Testing, Adaptive Testing, Student Evaluation, Computation
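Whatever the item content, the adaptive core of a CAT is the selection step: administer the unused item with the greatest information at the current ability estimate. A minimal sketch under the 2PL model, with an invented item bank (the study's actual selection rule is not specified in the abstract):

```python
from math import exp

def info_2pl(theta, a, b):
    """Fisher information of a 2PL item at theta: a^2 * P * (1 - P)."""
    p = 1.0 / (1.0 + exp(-a * (theta - b)))
    return a * a * p * (1.0 - p)

bank = {1: (1.0, -0.8), 2: (1.5, 0.1), 3: (0.7, 0.9), 4: (1.2, 0.4)}
administered = {2}                 # items already given
theta_hat = 0.3                    # current ability estimate

next_item = max((i for i in bank if i not in administered),
                key=lambda i: info_2pl(theta_hat, *bank[i]))
print("next item:", next_item)
```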
Ozsoy, Seyma Nur; Kilmen, Sevilay – International Journal of Assessment Tools in Education, 2023
In this study, Kernel test equating methods were compared under NEAT and NEC designs. In the NEAT design, Kernel post-stratification and chain equating methods, taking optimal and large bandwidths into account, were compared. In the NEC design, gender and/or computer/tablet use was considered as a covariate, and Kernel test equating methods were…
Descriptors: Equated Scores, Testing, Test Items, Statistical Analysis
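The core of kernel equating is to continuize each discrete score distribution with a Gaussian kernel and then chain the CDFs, e_Y(x) = F_Y^{-1}(F_X(x)). The sketch below strips the method to that idea, with toy score distributions, a fixed bandwidth, and the variance-preserving adjustment of full kernel equating omitted.

```python
import numpy as np
from scipy.stats import norm
from scipy.optimize import brentq

def kernel_cdf(x, scores, probs, h):
    """Gaussian-kernel continuization of a discrete score distribution."""
    return float(np.sum(probs * norm.cdf((x - scores) / h)))

scores = np.arange(0, 11)                          # 0-10 point toy test
probs_x = np.full(11, 1 / 11)                      # form X distribution (toy)
probs_y = np.array([1, 2, 3, 4, 5, 6, 5, 4, 3, 2, 1]) / 36  # form Y (toy)

def equate(x, h=0.6):                              # h: fixed toy bandwidth
    target = kernel_cdf(x, scores, probs_x, h)
    return brentq(lambda y: kernel_cdf(y, scores, probs_y, h) - target,
                  -5.0, 15.0)

print(f"score 4 on form X maps to about {equate(4):.2f} on form Y")
```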
Matt I. Brown; Patrick R. Heck; Christopher F. Chabris – Journal of Autism and Developmental Disorders, 2024
The Social Shapes Test (SST) is a measure of social intelligence which does not use human faces or rely on extensive verbal ability. The SST has shown promising validity among adults without autism spectrum disorder (ASD), but it is uncertain whether it is suitable for adults with ASD. We find measurement invariance between adults with (n = 229)…
Descriptors: Interpersonal Competence, Autism Spectrum Disorders, Emotional Intelligence, Verbal Ability
Nikola Ebenbeck; Morten Bastian; Andreas Mühling; Markus Gebhardt – Journal of Computer Assisted Learning, 2024
Background: Computerised adaptive tests (CATs) provide personalised, efficient and accurate measurement while reducing testing time, depending on the desired level of precision. Schools have different types of assessments that can benefit from a significant reduction in testing time to varying degrees, depending on the area of…
Descriptors: Computer Assisted Testing, Elementary Secondary Education, Public Schools, Special Schools
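The time savings the authors describe come from precision-based termination: the test stops once the standard error of the ability estimate falls below a target, so lower precision demands mean fewer items. A simulated sketch of that stopping rule (item information values are invented):

```python
import numpy as np

rng = np.random.default_rng(1)

def items_needed(se_target, max_items=60):
    """Count items until SE = 1/sqrt(total information) drops below target."""
    total_info = 0.0
    for n in range(1, max_items + 1):
        total_info += rng.uniform(0.2, 0.6)   # info per item (simulated)
        if total_info ** -0.5 < se_target:
            return n
    return max_items

for se in (0.5, 0.4, 0.3):
    print(f"SE target {se}: about {items_needed(se)} items")
```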
Gilbert, Joshua B.; Kim, James S.; Miratrix, Luke W. – Journal of Educational and Behavioral Statistics, 2023
Analyses that reveal how treatment effects vary allow researchers, practitioners, and policymakers to better understand the efficacy of educational interventions. In practice, however, standard statistical methods for addressing heterogeneous treatment effects (HTE) fail to address the HTE that may exist "within" outcome measures. In…
Descriptors: Test Items, Item Response Theory, Computer Assisted Testing, Program Effectiveness
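A toy simulation makes the within-measure heterogeneity concrete: if treatment shifts performance on only some items, per-item effects diverge even though they are averaged into a single total-score effect. Everything below is invented for illustration and is not the authors' estimator.

```python
import numpy as np

rng = np.random.default_rng(7)
n = 2000
b = np.array([-1.0, -0.5, 0.0, 0.5, 1.0])          # item difficulties (invented)
item_effect = np.array([0.6, 0.6, 0.0, 0.0, 0.0])  # treatment moves items 1-2 only

theta = rng.normal(size=n)                         # same simulees, toy comparison

def responses(shift):
    p = 1.0 / (1.0 + np.exp(-(theta[:, None] + shift - b)))
    return (rng.uniform(size=p.shape) < p).astype(int)

ctrl, treat = responses(0.0), responses(item_effect)
print("per-item effects:  ", np.round(treat.mean(axis=0) - ctrl.mean(axis=0), 2))
print("total-score effect:",
      round(float(treat.sum(axis=1).mean() - ctrl.sum(axis=1).mean()), 2))
```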
Uysal, Ibrahim; Dogan, Nuri – International Journal of Assessment Tools in Education, 2021
Scoring constructed-response items can be highly difficult, time-consuming, and costly in practice. Improvements in computer technology have enabled automated scoring of constructed-response items. However, the application of automated scoring without an investigation of test equating can lead to serious problems. The goal of this study was to…
Descriptors: Computer Assisted Testing, Scoring, Item Response Theory, Test Format
Gönülates, Emre – Educational and Psychological Measurement, 2019
This article introduces the Quality of Item Pool (QIP) Index, a novel approach to quantifying the adequacy of an item pool of a computerized adaptive test for a given set of test specifications and examinee population. This index ranges from 0 to 1, with values close to 1 indicating the item pool presents optimum items to examinees throughout the…
Descriptors: Item Banks, Adaptive Testing, Computer Assisted Testing, Error of Measurement
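The abstract does not reproduce the QIP formula, so the sketch below only illustrates an index of the same flavor, assumed here to be the average ratio of the information actually delivered to the best the pool could have offered at each step. Treat it as a hypothetical stand-in, not Gönülateş's definition.

```python
import numpy as np

# Information of the item actually delivered at each step vs. the most
# informative item the pool could have offered there (all values invented).
delivered_info = np.array([0.42, 0.38, 0.35, 0.30])
best_available = np.array([0.45, 0.44, 0.40, 0.39])

index = float(np.mean(delivered_info / best_available))
print(f"information-ratio index = {index:.2f}")   # near 1 = near-optimal pool
```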
Lottridge, Sue; Burkhardt, Amy; Boyer, Michelle – Educational Measurement: Issues and Practice, 2020
In this digital ITEMS module, Dr. Sue Lottridge, Amy Burkhardt, and Dr. Michelle Boyer provide an overview of automated scoring. Automated scoring is the use of computer algorithms to score unconstrained open-ended test items by mimicking human scoring. The use of automated scoring is increasing in educational assessment programs because it allows…
Descriptors: Computer Assisted Testing, Scoring, Automation, Educational Assessment
Stefan Lorenz – ProQuest LLC, 2024
This dissertation develops and applies sophisticated Item Response Theory (IRT) methods to address fundamental measurement challenges in cognitive testing, focusing on the Armed Services Vocational Aptitude Battery (ASVAB) data from the National Longitudinal Survey of Youth (NLSY). The first chapter implements a confirmatory multidimensional IRT…
Descriptors: Human Capital, Item Response Theory, Vocational Aptitude, Armed Forces
Ozdemir, Burhanettin; Gelbal, Selahattin – Education and Information Technologies, 2022
Computerized adaptive tests (CAT) apply an adaptive process in which the items are tailored to individuals' ability scores. Multidimensional CAT (MCAT) designs differ in the item selection, ability estimation, and termination methods being used. This study aims at investigating the performance of the MCAT designs used to…
Descriptors: Scores, Computer Assisted Testing, Test Items, Language Proficiency
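Ability estimation is one of the design choices an (M)CAT fixes; a common option is the expected a posteriori (EAP) estimator, sketched unidimensionally below for brevity, with invented item parameters and responses.

```python
import numpy as np

quad = np.linspace(-4, 4, 81)                  # quadrature points
prior = np.exp(-0.5 * quad**2)                 # standard-normal prior (unnormalized)

a = np.array([1.2, 0.9, 1.5])                  # 2PL discriminations (invented)
b = np.array([-0.5, 0.3, 0.8])                 # 2PL difficulties (invented)
u = np.array([1, 1, 0])                        # observed responses

p = 1.0 / (1.0 + np.exp(-a * (quad[:, None] - b)))        # (points, items)
likelihood = np.prod(np.where(u == 1, p, 1.0 - p), axis=1)
posterior = prior * likelihood

theta_eap = float(np.sum(quad * posterior) / np.sum(posterior))
se_eap = float(np.sqrt(np.sum((quad - theta_eap) ** 2 * posterior)
                       / np.sum(posterior)))
print(f"EAP theta = {theta_eap:.2f}, SE = {se_eap:.2f}")
```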