ERIC - Search Results

Publication Date

In 2025	1
Since 2024	4
Since 2021 (last 5 years)	15
Since 2016 (last 10 years)	30
Since 2006 (last 20 years)	48

Descriptor

Computer Assisted Testing	64
Error of Measurement	64
Adaptive Testing	35
Item Response Theory	28
Test Items	27
Simulation	16
Comparative Analysis	15
Accuracy	11
Item Banks	11
Scoring	11
Test Reliability	11
Psychometrics	10
Scores	10
Foreign Countries	9
Test Construction	9
Test Length	9
Reliability	8
Test Bias	8
Test Format	7
Correlation	6
Difficulty Level	6
Evaluation Methods	6
Interrater Reliability	6
Item Analysis	6
Test Validity	6

Publication Type

Journal Articles	64
Reports - Research	45
Reports - Evaluative	14
Reports - Descriptive	4
Tests/Questionnaires	3
Book/Product Reviews	1
Guides - Non-Classroom	1
Speeches/Meeting Papers	1

Education Level

Higher Education	8
Postsecondary Education	6
Secondary Education	5
High Schools	4
Elementary Education	3
Elementary Secondary Education	3
Junior High Schools	3
Middle Schools	3
Adult Education	1
Early Childhood Education	1
Grade 3	1
Grade 9	1
High School Equivalency…	1
Primary Education	1

Audience

Practitioners

Location

Indonesia	2
Turkey	2
Canada	1
China	1
Japan	1
Portugal	1
Saudi Arabia	1
United Kingdom	1
Virginia	1

Laws, Policies, & Programs

No Child Left Behind Act 2001	1
Race to the Top	1

Assessments and Surveys

Armed Forces Qualification…	1
Cognitive Abilities Test	1
Rod and Frame Test	1
Test of English as a Foreign…	1

Showing 1 to 15 of 64 results

Grading Exams Using Large Language Models: A Comparison between Human and AI Grading of Exams in Higher Education Using ChatGPT

Peer reviewed

Jonas Flodén – British Educational Research Journal, 2025

This study compares how the generative AI (GenAI) large language model (LLM) ChatGPT performs in grading university exams compared to human teachers. Aspects investigated include consistency, large discrepancies and length of answer. Implications for higher education, including the role of teachers and ethics, are also discussed. Three…

Descriptors: College Faculty, Artificial Intelligence, Comparative Testing, Scoring

Robustness of Adaptive Measurement of Change to Item Parameter Estimation Error

Peer reviewed

Cooperman, Allison W.; Weiss, David J.; Wang, Chun – Educational and Psychological Measurement, 2022

Adaptive measurement of change (AMC) is a psychometric method for measuring intra-individual change on one or more latent traits across testing occasions. Three hypothesis tests--a Z test, likelihood ratio test, and score ratio index--have demonstrated desirable statistical properties in this context, including low false positive rates and high…

Descriptors: Error of Measurement, Psychometrics, Hypothesis Testing, Simulation

The Study of the Effect of Item Parameter Drift on Ability Estimation Obtained from Adaptive Testing under Different Conditions

Peer reviewed
PDF on ERIC

Sahin Kursad, Merve; Cokluk Bokeoglu, Omay; Cikrikci, Rahime Nukhet – International Journal of Assessment Tools in Education, 2022

Item parameter drift (IPD) is the systematic differentiation of parameter values of items over time due to various reasons. If it occurs in computer adaptive tests (CAT), it causes errors in the estimation of item and ability parameters. Identification of the underlying conditions of this situation in CAT is important for estimating item and…

Descriptors: Item Analysis, Computer Assisted Testing, Test Items, Error of Measurement

A Two-Tier Computerized Adaptive Test to Measure Student Computational Thinking Skills

Peer reviewed

Rizki Zakwandi; Edi Istiyono; Wipsar Sunu Brams Dwandaru – Education and Information Technologies, 2024

Computational Thinking (CT) skill was a part of the global framework of reference on Digital Literacy for Indicator 4.4.2, widely developed in mathematics and science learning. This study aimed to promote an assessment tool using a two-tier Computerized Adaptive Test (CAT). The study used the Design and Development Research (DDR) method with four…

Descriptors: Computer Assisted Testing, Adaptive Testing, Student Evaluation, Computation

Comparison of Kernel Equating Methods under NEAT and NEC Designs

Peer reviewed
PDF on ERIC

Ozsoy, Seyma Nur; Kilmen, Sevilay – International Journal of Assessment Tools in Education, 2023

In this study, Kernel test equating methods were compared under NEAT and NEC designs. In NEAT design, Kernel post-stratification and chain equating methods taking into account optimal and large bandwidths were compared. In the NEC design, gender and/or computer/tablet use was considered as a covariate, and Kernel test equating methods were…

Descriptors: Equated Scores, Testing, Test Items, Statistical Analysis

The Social Shapes Test as a Self-Administered, Online Measure of Social Intelligence: Two Studies with Typically Developing Adults and Adults with Autism Spectrum Disorder

Peer reviewed

Matt I. Brown; Patrick R. Heck; Christopher F. Chabris – Journal of Autism and Developmental Disorders, 2024

The Social Shapes Test (SST) is a measure of social intelligence which does not use human faces or rely on extensive verbal ability. The SST has shown promising validity among adults without autism spectrum disorder (ASD), but it is uncertain whether it is suitable for adults with ASD. We find measurement invariance between adults with (n = 229)…

Descriptors: Interpersonal Competence, Autism Spectrum Disorders, Emotional Intelligence, Verbal Ability

Duration versus Accuracy--What Matters for Computerised Adaptive Testing in Schools?

Peer reviewed

Nikola Ebenbeck; Morten Bastian; Andreas Mühling; Markus Gebhardt – Journal of Computer Assisted Learning, 2024

Background: Computerised adaptive tests (CATs) are tests that provide personalised, efficient and accurate measurement while reducing testing time, depending on the desired level of precision. Schools have different types of assessments that can benefit from a significant reduction in testing time to varying degrees, depending on the area of…

Descriptors: Computer Assisted Testing, Elementary Secondary Education, Public Schools, Special Schools

Modeling Item-Level Heterogeneous Treatment Effects with the Explanatory Item Response Model: Leveraging Large-Scale Online Assessments to Pinpoint the Impact of Educational Interventions

Peer reviewed

Gilbert, Joshua B.; Kim, James S.; Miratrix, Luke W. – Journal of Educational and Behavioral Statistics, 2023

Analyses that reveal how treatment effects vary allow researchers, practitioners, and policymakers to better understand the efficacy of educational interventions. In practice, however, standard statistical methods for addressing heterogeneous treatment effects (HTE) fail to address the HTE that may exist "within" outcome measures. In…

Descriptors: Test Items, Item Response Theory, Computer Assisted Testing, Program Effectiveness

Automated Essay Scoring Effect on Test Equating Errors in Mixed-Format Test

Peer reviewed
PDF on ERIC

Uysal, Ibrahim; Dogan, Nuri – International Journal of Assessment Tools in Education, 2021

Scoring constructed-response items can be highly difficult, time-consuming, and costly in practice. Improvements in computer technology have enabled automated scoring of constructed-response items. However, the application of automated scoring without an investigation of test equating can lead to serious problems. The goal of this study was to…

Descriptors: Computer Assisted Testing, Scoring, Item Response Theory, Test Format

Quality of Item Pool (QIP) Index: A Novel Approach to Evaluating CAT Item Pool Adequacy

Peer reviewed

Gönülates, Emre – Educational and Psychological Measurement, 2019

This article introduces the Quality of Item Pool (QIP) Index, a novel approach to quantifying the adequacy of an item pool of a computerized adaptive test for a given set of test specifications and examinee population. This index ranges from 0 to 1, with values close to 1 indicating the item pool presents optimum items to examinees throughout the…

Descriptors: Item Banks, Adaptive Testing, Computer Assisted Testing, Error of Measurement

Digital Module 18: Automated Scoring

Peer reviewed

Lottridge, Sue; Burkhardt, Amy; Boyer, Michelle – Educational Measurement: Issues and Practice, 2020

In this digital ITEMS module, Dr. Sue Lottridge, Amy Burkhardt, and Dr. Michelle Boyer provide an overview of automated scoring. Automated scoring is the use of computer algorithms to score unconstrained open-ended test items by mimicking human scoring. The use of automated scoring is increasing in educational assessment programs because it allows…

Descriptors: Computer Assisted Testing, Scoring, Automation, Educational Assessment

Measuring Language Ability of Students with Compensatory Multidimensional CAT: A Post-Hoc Simulation Study

Peer reviewed

Ozdemir, Burhanettin; Gelbal, Selahattin – Education and Information Technologies, 2022

The computerized adaptive tests (CAT) apply an adaptive process in which the items are tailored to individuals' ability scores. The multidimensional CAT (MCAT) designs differ in terms of different item selection, ability estimation, and termination methods being used. This study aims at investigating the performance of the MCAT designs used to…

Descriptors: Scores, Computer Assisted Testing, Test Items, Language Proficiency

Analyzing Different Module Characteristics in Computer Adaptive Multistage Testing

Peer reviewed
PDF on ERIC

Sahin, Melek Gulsah – International Journal of Assessment Tools in Education, 2020

Computer Adaptive Multistage Testing (ca-MST), which takes advantage of computer technology and the adaptive test form, is widely used and is now a popular topic in assessment and evaluation. This study aims at analyzing the effect of different panel designs, module lengths, and different sequence of a parameter value across stages and change in…

Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Item Response Theory

Evaluating a Computerized Adaptive Testing Version of a Cognitive Ability Test Using a Simulation Study

Peer reviewed

Tsaousis, Ioannis; Sideridis, Georgios D.; AlGhamdi, Hannan M. – Journal of Psychoeducational Assessment, 2021

This study evaluated the psychometric quality of a computerized adaptive testing (CAT) version of the general cognitive ability test (GCAT), using a simulation study protocol put forth by Han, K. T. (2018a). For the needs of the analysis, three different sets of items were generated, providing an item pool of 165 items. Before evaluating the…

Descriptors: Computer Assisted Testing, Adaptive Testing, Cognitive Tests, Cognitive Ability

Online Administration of the Test of Narrative Language--Second Edition: Psychometrics and Considerations for Remote Assessment

Peer reviewed
PDF on ERIC

Beula M. Magimairaj; Philip Capin; Sandra L. Gillam; Sharon Vaughn; Greg Roberts; Anna-Maria Fall; Ronald B. Gillam – Grantee Submission, 2022

Purpose: Our aim was to evaluate the psychometric properties of the online administered format of the Test of Narrative Language--Second Edition (TNL-2; Gillam & Pearson, 2017), given the importance of assessing children's narrative ability and considerable absence of psychometric studies of spoken language assessments administered online.…

Descriptors: Computer Assisted Testing, Language Tests, Story Telling, Language Impairments

Source

Educational and Psychological…	7
ETS Research Report Series	6
Applied Psychological…	5
International Journal of…	5
Journal of Educational…	4
Psychometrika	3
Applied Measurement in…	2
Education and Information…	2
Educational Measurement:…	2
International Journal of…	2
Journal of Educational and…	2
Journal of Psychoeducational…	2
Alberta Journal of…	1
Assessment	1
Assessment & Evaluation in…	1
British Educational Research…	1
CALICO Journal	1
Computers and Education	1
EURASIA Journal of…	1
Educational Research and…	1
Evaluation and the Health…	1
Grantee Submission	1
International Journal for…	1
International Journal of…	1
International Journal of…	1

Author

Anna-Maria Fall	2
Bergstrom, Betty A.	2
Beula M. Magimairaj	2
De Ayala, R. J.	2
Dodd, Barbara G.	2
Greg Roberts	2
Lai, Hollis	2
Philip Capin	2
Ronald B. Gillam	2
Sandra L. Gillam	2
Sharon Vaughn	2
Weiss, David J.	2
Yao, Lihua	2
Aksu Dunya, Beyza	1
AlGhamdi, Hannan M.	1
Andreas Mühling	1
Attali, Yigal	1
Ayan, Cansu	1
Ban, Jae-Chun	1
Barker, T.	1
Bossé, Michael J.	1
Boyer, Michelle	1
Britton, C.	1
Brown, Molly	1