ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	10
Since 2006 (last 20 years)	33

Descriptor

Adaptive Testing	35
Computation	35
Item Response Theory	35
Computer Assisted Testing	28
Test Items	17
Bayesian Statistics	10
Comparative Analysis	9
Simulation	9
Error of Measurement	8
Item Banks	7
Maximum Likelihood Statistics	7
Models	7
Accuracy	6
Classification	5
Foreign Countries	5
Statistical Analysis	5
Ability	4
Difficulty Level	4
Efficiency	4
Test Length	4
Goodness of Fit	3
Mathematics	3
Measurement Techniques	3
Robustness (Statistics)	3
Scoring	3
More ▼

Source

Applied Psychological…	9
Educational and Psychological…	6
Journal of Educational and…	5
ETS Research Report Series	2
Psychometrika	2
Applied Measurement in…	1
E-Learning and Digital Media	1
Educational Testing Service	1
Eurasian Journal of…	1
Grantee Submission	1
International Journal of…	1
Journal of Applied Testing…	1
Journal of Educational…	1
Online Submission	1
Practical Assessment,…	1
ProQuest LLC	1
More ▼

Publication Type

Journal Articles	31
Reports - Research	21
Reports - Evaluative	10
Reports - Descriptive	3
Dissertations/Theses -…	1
Numerical/Quantitative Data	1
Speeches/Meeting Papers	1

Education Level

Higher Education	3
Secondary Education	3
High Schools	2
Elementary Education	1
Elementary Secondary Education	1
Grade 11	1
Grade 12	1
Grade 4	1
Intermediate Grades	1
Postsecondary Education	1

Audience

Practitioners

Location

Netherlands	1
Taiwan	1
Turkey	1
United Kingdom	1

Laws, Policies, & Programs

Assessments and Surveys

California Achievement Tests	1
Graduate Record Examinations	1
National Assessment of…	1
Program for International…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 35 results Save | Export

Bayesian Logistic Regression: A New Method to Calibrate Pretest Items in Multistage Adaptive Testing

Peer reviewed

Direct link

TsungHan Ho – Applied Measurement in Education, 2023

An operational multistage adaptive test (MST) requires the development of a large item bank and the effort to continuously replenish the item bank due to concerns about test security and validity over the long term. New items should be pretested and linked to the item bank before being used operationally. The linking item volume fluctuations in…

Descriptors: Bayesian Statistics, Regression (Statistics), Test Items, Pretesting

Adaptive Weight Estimation of Latent Ability: Application to Computerized Adaptive Testing with Response Revision

Peer reviewed

Direct link

Wang, Shiyu; Xiao, Houping; Cohen, Allan – Journal of Educational and Behavioral Statistics, 2021

An adaptive weight estimation approach is proposed to provide robust latent ability estimation in computerized adaptive testing (CAT) with response revision. This approach assigns different weights to each distinct response to the same item when response revision is allowed in CAT. Two types of weight estimation procedures, nonfunctional and…

Descriptors: Computer Assisted Testing, Adaptive Testing, Computation, Robustness (Statistics)

A Fast and Simple Algorithm for Bayesian Adaptive Testing

Peer reviewed

Direct link

van der Linden, Wim J.; Ren, Hao – Journal of Educational and Behavioral Statistics, 2020

The Bayesian way of accounting for the effects of error in the ability and item parameters in adaptive testing is through the joint posterior distribution of all parameters. An optimized Markov chain Monte Carlo algorithm for adaptive testing is presented, which samples this distribution in real time to score the examinee's ability and optimally…

Descriptors: Bayesian Statistics, Adaptive Testing, Error of Measurement, Markov Processes

IRT and MIRT Models for Item Parameter Estimation with Multidimensional Multistage Tests

Peer reviewed

Direct link

Jewsbury, Paul A.; van Rijn, Peter W. – Journal of Educational and Behavioral Statistics, 2020

In large-scale educational assessment data consistent with a simple-structure multidimensional item response theory (MIRT) model, where every item measures only one latent variable, separate unidimensional item response theory (UIRT) models for each latent variable are often calibrated for practical reasons. While this approach can be valid for…

Descriptors: Item Response Theory, Computation, Test Items, Adaptive Testing

Item Calibration Methods with Multiple Sub-Scale Multistage Testing

Peer reviewed
PDF on ERIC

Download full text

Direct link

Wang, Chun; Chen, Ping; Jiang, Shengyu – Grantee Submission, 2019

Many large-scale educational surveys have moved from linear form design to multistage testing (MST) design. One advantage of MST is that it can provide more accurate latent trait [theta] estimates using fewer items than required by linear tests. However, MST generates incomplete response data by design; hence questions remain as to how to…

Descriptors: Adaptive Testing, Test Items, Item Response Theory, Maximum Likelihood Statistics

Influence of Context on Item Parameters in Forced-Choice Personality Assessments

Peer reviewed

Direct link

Lin, Yin; Brown, Anna – Educational and Psychological Measurement, 2017

A fundamental assumption in computerized adaptive testing is that item parameters are invariant with respect to context--items surrounding the administered item. This assumption, however, may not hold in forced-choice (FC) assessments, where explicit comparisons are made between items included in the same block. We empirically examined the…

Descriptors: Personality Measures, Measurement Techniques, Context Effect, Test Items

Investigating Robustness of Item Response Theory Proficiency Estimators to Atypical Response Behaviors under Two-Stage Multistage Testing. ETS GRE® Board Research Report. ETS GRE®-16-03. ETS Research Report No. RR-16-22

Peer reviewed
PDF on ERIC

Download full text

Kim, Sooyeon; Moses, Tim – ETS Research Report Series, 2016

The purpose of this study is to evaluate the extent to which item response theory (IRT) proficiency estimation methods are robust to the presence of aberrant responses under the "GRE"® General Test multistage adaptive testing (MST) design. To that end, a wide range of atypical response behaviors affecting as much as 10% of the test items…

Descriptors: Item Response Theory, Computation, Robustness (Statistics), Response Style (Tests)

A Comparison of IRT Proficiency Estimation Methods under Adaptive Multistage Testing

Peer reviewed

Direct link

Kim, Sooyeon; Moses, Tim; Yoo, Hanwook – Journal of Educational Measurement, 2015

This inquiry is an investigation of item response theory (IRT) proficiency estimators' accuracy under multistage testing (MST). We chose a two-stage MST design that includes four modules (one at Stage 1, three at Stage 2) and three difficulty paths (low, middle, high). We assembled various two-stage MST panels (i.e., forms) by manipulating two…

Descriptors: Comparative Analysis, Item Response Theory, Computation, Accuracy

Person Fit Analysis in Computerized Adaptive Testing Using Tests for a Change Point

Peer reviewed

Direct link

Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2016

Meijer and van Krimpen-Stoop noted that the number of person-fit statistics (PFSs) that have been designed for computerized adaptive tests (CATs) is relatively modest. This article partially addresses that concern by suggesting three new PFSs for CATs. The statistics are based on tests for a change point and can be used to detect an abrupt change…

Descriptors: Computer Assisted Testing, Adaptive Testing, Item Response Theory, Goodness of Fit

Economizing Education: Assessment Algorithms and Calculative Agencies

Peer reviewed

Direct link

O'Keeffe, Cormac – E-Learning and Digital Media, 2017

International Large Scale Assessments have been producing data about educational attainment for over 60 years. More recently however, these assessments as tests have become digitally and computationally complex and increasingly rely on the calculative work performed by algorithms. In this article I first consider the coordination of relations…

Descriptors: Achievement Tests, Foreign Countries, Secondary School Students, International Assessment

Effectiveness of Item Response Theory (IRT) Proficiency Estimation Methods under Adaptive Multistage Testing. Research Report. ETS RR-15-11

Peer reviewed
PDF on ERIC

Download full text

Kim, Sooyeon; Moses, Tim; Yoo, Hanwook Henry – ETS Research Report Series, 2015

The purpose of this inquiry was to investigate the effectiveness of item response theory (IRT) proficiency estimators in terms of estimation bias and error under multistage testing (MST). We chose a 2-stage MST design in which 1 adaptation to the examinees' ability levels takes place. It includes 4 modules (1 at Stage 1, 3 at Stage 2) and 3 paths…

Descriptors: Item Response Theory, Computation, Statistical Bias, Error of Measurement

Bad Questions: An Essay Involving Item Response Theory

Peer reviewed

Direct link

Thissen, David – Journal of Educational and Behavioral Statistics, 2016

David Thissen, a professor in the Department of Psychology and Neuroscience, Quantitative Program at the University of North Carolina, has consulted and served on technical advisory committees for assessment programs that use item response theory (IRT) over the past couple decades. He has come to the conclusion that there are usually two purposes…

Descriptors: Item Response Theory, Test Construction, Testing Problems, Student Evaluation

Impact of Violation of the Missing-at-Random Assumption on Full-Information Maximum Likelihood Method in Multidimensional Adaptive Testing

Peer reviewed
PDF on ERIC

Download full text

Han, Kyung T.; Guo, Fanmin – Practical Assessment, Research & Evaluation, 2014

The full-information maximum likelihood (FIML) method makes it possible to estimate and analyze structural equation models (SEM) even when data are partially missing, enabling incomplete data to contribute to model estimation. The cornerstone of FIML is the missing-at-random (MAR) assumption. In (unidimensional) computerized adaptive testing…

Descriptors: Maximum Likelihood Statistics, Structural Equation Models, Data, Computer Assisted Testing

Best Design for Multidimensional Computerized Adaptive Testing with the Bifactor Model

Peer reviewed

Direct link

Seo, Dong Gi; Weiss, David J. – Educational and Psychological Measurement, 2015

Most computerized adaptive tests (CATs) have been studied using the framework of unidimensional item response theory. However, many psychological variables are multidimensional and might benefit from using a multidimensional approach to CATs. This study investigated the accuracy, fidelity, and efficiency of a fully multidimensional CAT algorithm…

Descriptors: Computer Assisted Testing, Adaptive Testing, Accuracy, Fidelity

Uncertainties in the Item Parameter Estimates and Robust Automated Test Assembly

Peer reviewed

Direct link

Veldkamp, Bernard P.; Matteucci, Mariagiulia; de Jong, Martijn G. – Applied Psychological Measurement, 2013

Item response theory parameters have to be estimated, and because of the estimation process, they do have uncertainty in them. In most large-scale testing programs, the parameters are stored in item banks, and automated test assembly algorithms are applied to assemble operational test forms. These algorithms treat item parameters as fixed values,…

Descriptors: Test Construction, Test Items, Item Banks, Automation

Previous Page | Next Page »

Pages: 1 | 2 | 3

Kim, Sooyeon	3
Moses, Tim	3
Guo, Fanmin	2
He, Wei	2
Liu, Chen-Wei	2
Wang, Chun	2
Wang, Wen-Chung	2
Abad, Francisco J.	1
Boughton, Keith A.	1
Brown, Anna	1
Chang, Hua-Hua	1
Chang, Yuan-chin Ivan	1
Chen, Li-Ju	1
Chen, Ping	1
Cohen, Allan	1
Davey, Tim	1
Doebler, Anna	1
Finkelman, Matthew David	1
Glasnapp, Douglas R.	1
Han, Kyung T.	1
Herbert, Erin	1
Ho, Rong-Guey	1
Ho, Tsung-Han	1
Jewsbury, Paul A.	1
Jiang, Shengyu	1
More ▼