ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	3
Since 2016 (last 10 years)	6
Since 2006 (last 20 years)	12

Descriptor

Measurement	13
Test Items	13
Test Length	13
Accuracy	9
Computer Assisted Testing	8
Adaptive Testing	6
Correlation	4
Foreign Countries	4
Item Response Theory	4
Simulation	4
Test Bias	3
Ability	2
Comparative Analysis	2
Computation	2
Decision Making	2
Item Analysis	2
Models	2
Psychometrics	2
Reading Tests	2
Reliability	2
Scoring	2
Test Construction	2
Test Format	2
Test Validity	2
Armed Forces	1
More ▼

Source

Educational and Psychological…	2
Journal of Educational…	2
Advanced Education	1
International Journal of…	1
Journal of Educational and…	1
Journal of Special Education…	1
Measurement:…	1
Pearson	1
Physical Review Physics…	1
ProQuest LLC	1
Universal Journal of…	1
More ▼

Publication Type

Journal Articles	11
Reports - Research	10
Dissertations/Theses -…	1
Reports - Descriptive	1
Reports - Evaluative	1
Speeches/Meeting Papers	1

Education Level

Higher Education	2
Postsecondary Education	2
Early Childhood Education	1
Elementary Education	1
Elementary Secondary Education	1
Grade 3	1
Primary Education	1

Audience

Location

Germany	1
Japan	1
Ukraine	1

Laws, Policies, & Programs

Assessments and Surveys

Force Concept Inventory

What Works Clearinghouse Rating

Showing all 13 results Save | Export

Differential Performance of Computerized Adaptive Testing in Students with and without Disabilities -- A Simulation Study

Peer reviewed

Direct link

Nikola Ebenbeck; Markus Gebhardt – Journal of Special Education Technology, 2024

Technologies that enable individualization for students have significant potential in special education. Computerized Adaptive Testing (CAT) refers to digital assessments that automatically adjust their difficulty level based on students' abilities, allowing for personalized, efficient, and accurate measurement. This article examines whether CAT…

Descriptors: Computer Assisted Testing, Students with Disabilities, Special Education, Grade 3

Improving Test Security and Efficiency of Computerized Adaptive Testing for the Force Concept Inventory

Peer reviewed

Direct link

Yasuda, Jun-ichiro; Hull, Michael M.; Mae, Naohiro – Physical Review Physics Education Research, 2022

This paper presents improvements made to a computerized adaptive testing (CAT)-based version of the FCI (FCI-CAT) in regards to test security and test efficiency. First, we will discuss measures to enhance test security by controlling for item overexposure, decreasing the risk that respondents may (i) memorize the content of a pretest for use on…

Descriptors: Adaptive Testing, Computer Assisted Testing, Test Items, Risk Management

How the Length and Characteristics of Routing Module Affect Ability Estimation in ca-MST?

Peer reviewed
PDF on ERIC

Download full text

Öztürk, Nagihan Boztunç – Universal Journal of Educational Research, 2019

In this study, how the length and characteristics of routing module in different panel designs affect measurement precision is examined. In the scope of the study, six different routing module length, nine different routing module characteristics, and two different panel design are handled. At the end of the study, the effects of conditions on…

Descriptors: Computer Assisted Testing, Adaptive Testing, Test Length, Test Format

Routing Strategies and Optimizing Design for Multistage Testing in International Large-Scale Assessments

Peer reviewed

Direct link

Svetina, Dubravka; Liaw, Yuan-Ling; Rutkowski, Leslie; Rutkowski, David – Journal of Educational Measurement, 2019

This study investigates the effect of several design and administration choices on item exposure and person/item parameter recovery under a multistage test (MST) design. In a simulation study, we examine whether number-correct (NC) or item response theory (IRT) methods are differentially effective at routing students to the correct next stage(s)…

Descriptors: Measurement, Item Analysis, Test Construction, Item Response Theory

Computer Adaptive Language Testing According to NATO STANAG 6001 Requirements

Peer reviewed
PDF on ERIC

Download full text

Gawliczek, Piotr; Krykun, Viktoriia; Tarasenko, Nataliya; Tyshchenko, Maksym; Shapran, Oleksandr – Advanced Education, 2021

The article deals with the innovative, cutting age solution within the language testing realm, namely computer adaptive language testing (CALT) in accordance with the NATO Standardization Agreement 6001 (NATO STANAG 6001) requirements for further implementation in foreign language training of personnel of the Armed Forces of Ukraine (AF of…

Descriptors: Computer Assisted Testing, Adaptive Testing, Language Tests, Second Language Instruction

The Matching Criterion Purification for Differential Item Functioning Analyses in a Large-Scale Assessment

Peer reviewed

Direct link

Lee, HyeSun; Geisinger, Kurt F. – Educational and Psychological Measurement, 2016

The current study investigated the impact of matching criterion purification on the accuracy of differential item functioning (DIF) detection in large-scale assessments. The three matching approaches for DIF analyses (block-level matching, pooled booklet matching, and equated pooled booklet matching) were employed with the Mantel-Haenszel…

Descriptors: Test Bias, Measurement, Accuracy, Statistical Analysis

Improving Measurement Precision of Hierarchical Latent Traits Using Adaptive Testing

Peer reviewed

Direct link

Wang, Chun – Journal of Educational and Behavioral Statistics, 2014

Many latent traits in social sciences display a hierarchical structure, such as intelligence, cognitive ability, or personality. Usually a second-order factor is linearly related to a group of first-order factors (also called domain abilities in cognitive ability measures), and the first-order factors directly govern the actual item responses.…

Descriptors: Measurement, Accuracy, Item Response Theory, Adaptive Testing

Test Length and Decision Quality in Personnel Selection: When Is Short Too Short?

Peer reviewed

Direct link

Kruyen, Peter M.; Emons, Wilco H. M.; Sijtsma, Klaas – International Journal of Testing, 2012

Personnel selection shows an enduring need for short stand-alone tests consisting of, say, 5 to 15 items. Despite their efficiency, short tests are more vulnerable to measurement error than longer test versions. Consequently, the question arises to what extent reducing test length deteriorates decision quality due to increased impact of…

Descriptors: Measurement, Personnel Selection, Decision Making, Error of Measurement

Balancing Flexible Constraints and Measurement Precision in Computerized Adaptive Testing

Peer reviewed

Direct link

Moyer, Eric L.; Galindo, Jennifer L.; Dodd, Barbara G. – Educational and Psychological Measurement, 2012

Managing test specifications--both multiple nonstatistical constraints and flexibly defined constraints--has become an important part of designing item selection procedures for computerized adaptive tests (CATs) in achievement testing. This study compared the effectiveness of three procedures: constrained CAT, flexible modified constrained CAT,…

Descriptors: Adaptive Testing, Computer Assisted Testing, Test Items, Item Analysis

A Comparison of Three Content Balancing Methods for Fixed and Variable Length Computerized Adaptive Tests

Direct link

Shin, Chingwei David; Chien, Yuehmei; Way, Walter Denny – Pearson, 2012

Content balancing is one of the most important components in the computerized adaptive testing (CAT) especially in the K to 12 large scale tests that complex constraint structure is required to cover a broad spectrum of content. The purpose of this study is to compare the weighted penalty model (WPM) and the weighted deviation method (WDM) under…

Descriptors: Computer Assisted Testing, Elementary Secondary Education, Test Content, Models

Three Essays on Teacher Education Programs and Test-Takers' Response Times on Test Items

Direct link

Qian, Hong – ProQuest LLC, 2013

This dissertation includes three essays: one essay focuses on the effect of teacher preparation programs on teacher knowledge while the other two focus on test-takers' response times on test items. Essay One addresses the problem of how opportunities to learn in teacher preparation programs influence future elementary mathematics teachers'…

Descriptors: Teacher Education Programs, Pedagogical Content Knowledge, Preservice Teacher Education, Preservice Teachers

The Hierarchy Consistency Index: Evaluating Person Fit for Cognitive Diagnostic Assessment

Peer reviewed

Direct link

Cui, Ying; Leighton, Jacqueline P. – Journal of Educational Measurement, 2009

In this article, we introduce a person-fit statistic called the hierarchy consistency index (HCI) to help detect misfitting item response vectors for tests developed and analyzed based on a cognitive model. The HCI ranges from -1.0 to 1.0, with values close to -1.0 indicating that students respond unexpectedly or differently from the responses…

Descriptors: Test Length, Simulation, Correlation, Research Methodology

The Second Century of Ability Testing: Some Predictions and Speculations

Peer reviewed

Direct link

Embretson, Susan E. – Measurement: Interdisciplinary Research and Perspectives, 2004

The last century was marked by dazzling changes in many areas, such as technology and communications. Predictions into the second century of testing are seemingly difficult in such a context. Yet, looking back to the turn of the last century, Kirkpatrick (1900), in his American Psychological Association presidential address, presented fundamental…

Descriptors: Ability, Testing, Futures (of Society), Psychometrics

Chien, Yuehmei	1
Cui, Ying	1
Dodd, Barbara G.	1
Embretson, Susan E.	1
Emons, Wilco H. M.	1
Galindo, Jennifer L.	1
Gawliczek, Piotr	1
Geisinger, Kurt F.	1
Hull, Michael M.	1
Kruyen, Peter M.	1
Krykun, Viktoriia	1
Lee, HyeSun	1
Leighton, Jacqueline P.	1
Liaw, Yuan-Ling	1
Mae, Naohiro	1
Markus Gebhardt	1
Moyer, Eric L.	1
Nikola Ebenbeck	1
Qian, Hong	1
Rutkowski, David	1
Rutkowski, Leslie	1
Shapran, Oleksandr	1
Shin, Chingwei David	1
Sijtsma, Klaas	1
Svetina, Dubravka	1
More ▼