ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	3
Since 2016 (last 10 years)	6
Since 2006 (last 20 years)	19

Descriptor

Adaptive Testing	49
Bayesian Statistics	49
Computer Assisted Testing	32
Test Items	22
Item Response Theory	19
Maximum Likelihood Statistics	13
Test Construction	13
Comparative Analysis	12
Simulation	12
Item Banks	11
Latent Trait Theory	11
Computation	10
Item Analysis	9
Mathematical Models	9
Scores	8
Test Bias	8
Ability	7
Estimation (Mathematics)	7
Test Length	7
Test Reliability	6
Accuracy	5
Correlation	5
Psychometrics	5
Scoring	5
Testing Problems	5
More ▼

Source

Educational and Psychological…	4
Applied Measurement in…	3
Applied Psychological…	3
ETS Research Report Series	3
Journal of Educational…	2
Journal of Educational and…	2
Educational Technology &…	1
Journal of Speech, Language,…	1
Measurement:…	1
Psychological Methods	1
Psychometrika	1
More ▼

Publication Type

Reports - Research	49
Journal Articles	22
Speeches/Meeting Papers	6
Numerical/Quantitative Data	2
Reports - Evaluative	1

Education Level

Higher Education	3
Elementary Secondary Education	1
Postsecondary Education	1

Audience

Practitioners	1
Researchers	1

Location

Taiwan

Laws, Policies, & Programs

Assessments and Surveys

Armed Services Vocational…	2
Law School Admission Test	2
Early Childhood Longitudinal…	1
Graduate Management Admission…	1
Graduate Record Examinations	1
MacArthur Communicative…	1
School and College Ability…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 49 results Save | Export

Bayesian Logistic Regression: A New Method to Calibrate Pretest Items in Multistage Adaptive Testing

Peer reviewed

Direct link

TsungHan Ho – Applied Measurement in Education, 2023

An operational multistage adaptive test (MST) requires the development of a large item bank and the effort to continuously replenish the item bank due to concerns about test security and validity over the long term. New items should be pretested and linked to the item bank before being used operationally. The linking item volume fluctuations in…

Descriptors: Bayesian Statistics, Regression (Statistics), Test Items, Pretesting

On Bank Assembly and Block Selection in Multidimensional Forced-Choice Adaptive Assessments

Peer reviewed

Direct link

Kreitchmann, Rodrigo S.; Sorrel, Miguel A.; Abad, Francisco J. – Educational and Psychological Measurement, 2023

Multidimensional forced-choice (FC) questionnaires have been consistently found to reduce the effects of socially desirable responding and faking in noncognitive assessments. Although FC has been considered problematic for providing ipsative scores under the classical test theory, item response theory (IRT) models enable the estimation of…

Descriptors: Measurement Techniques, Questionnaires, Social Desirability, Adaptive Testing

Handling Extreme Scores in Vertically Scaled Fixed-Length Computerized Adaptive Tests

Peer reviewed

Direct link

Wyse, Adam E.; McBride, James R. – Measurement: Interdisciplinary Research and Perspectives, 2022

A common practical challenge is how to assign ability estimates to all incorrect and all correct response patterns when using item response theory (IRT) models and maximum likelihood estimation (MLE) since ability estimates for these types of responses equal -8 or +8. This article uses a simulation study and data from an operational K-12…

Descriptors: Scores, Adaptive Testing, Computer Assisted Testing, Test Length

A Bayesian-Inspired Item Response Theory-Based Framework to Produce Very Short Versions of MacArthur-Bates Communicative Development Inventories

Peer reviewed

Direct link

Chai, Jun Ho; Lo, Chang Huan; Mayor, Julien – Journal of Speech, Language, and Hearing Research, 2020

Purpose: This study introduces a framework to produce very short versions of the MacArthur-Bates Communicative Development Inventories (CDIs) by combining the Bayesian-inspired approach introduced by Mayor and Mani (2019) with an item response theory-based computerized adaptive testing that adapts to the ability of each child, in line with…

Descriptors: Bayesian Statistics, Item Response Theory, Measures (Individuals), Language Skills

Investigating Robustness of Item Response Theory Proficiency Estimators to Atypical Response Behaviors under Two-Stage Multistage Testing. ETS GRE® Board Research Report. ETS GRE®-16-03. ETS Research Report No. RR-16-22

Peer reviewed
PDF on ERIC

Download full text

Kim, Sooyeon; Moses, Tim – ETS Research Report Series, 2016

The purpose of this study is to evaluate the extent to which item response theory (IRT) proficiency estimation methods are robust to the presence of aberrant responses under the "GRE"® General Test multistage adaptive testing (MST) design. To that end, a wide range of atypical response behaviors affecting as much as 10% of the test items…

Descriptors: Item Response Theory, Computation, Robustness (Statistics), Response Style (Tests)

A Comparison of IRT Proficiency Estimation Methods under Adaptive Multistage Testing

Peer reviewed

Direct link

Kim, Sooyeon; Moses, Tim; Yoo, Hanwook – Journal of Educational Measurement, 2015

This inquiry is an investigation of item response theory (IRT) proficiency estimators' accuracy under multistage testing (MST). We chose a two-stage MST design that includes four modules (one at Stage 1, three at Stage 2) and three difficulty paths (low, middle, high). We assembled various two-stage MST panels (i.e., forms) by manipulating two…

Descriptors: Comparative Analysis, Item Response Theory, Computation, Accuracy

A Comparative Study of Online Item Calibration Methods in Multidimensional Computerized Adaptive Testing

Peer reviewed

Direct link

Chen, Ping – Journal of Educational and Behavioral Statistics, 2017

Calibration of new items online has been an important topic in item replenishment for multidimensional computerized adaptive testing (MCAT). Several online calibration methods have been proposed for MCAT, such as multidimensional "one expectation-maximization (EM) cycle" (M-OEM) and multidimensional "multiple EM cycles"…

Descriptors: Test Items, Item Response Theory, Test Construction, Adaptive Testing

Effectiveness of Item Response Theory (IRT) Proficiency Estimation Methods under Adaptive Multistage Testing. Research Report. ETS RR-15-11

Peer reviewed
PDF on ERIC

Download full text

Kim, Sooyeon; Moses, Tim; Yoo, Hanwook Henry – ETS Research Report Series, 2015

The purpose of this inquiry was to investigate the effectiveness of item response theory (IRT) proficiency estimators in terms of estimation bias and error under multistage testing (MST). We chose a 2-stage MST design in which 1 adaptation to the examinees' ability levels takes place. It includes 4 modules (1 at Stage 1, 3 at Stage 2) and 3 paths…

Descriptors: Item Response Theory, Computation, Statistical Bias, Error of Measurement

Information Utility: Quantifying the Total Psychometric Information Provided by a Measure

Peer reviewed

Direct link

Markon, Kristian E. – Psychological Methods, 2013

Although advances have improved our ability to describe the measurement precision of a test, it often remains challenging to summarize how well a test is performing overall. Reliability, for example, provides an overall summary of measurement precision, but it is sample-specific and might not reflect the potential usefulness of a test if the…

Descriptors: Measures (Individuals), Psychometrics, Statistical Analysis, Bayesian Statistics

The Random-Threshold Generalized Unfolding Model and Its Application of Computerized Adaptive Testing

Peer reviewed

Direct link

Wang, Wen-Chung; Liu, Chen-Wei; Wu, Shiu-Lien – Applied Psychological Measurement, 2013

The random-threshold generalized unfolding model (RTGUM) was developed by treating the thresholds in the generalized unfolding model as random effects rather than fixed effects to account for the subjective nature of the selection of categories in Likert items. The parameters of the new model can be estimated with the JAGS (Just Another Gibbs…

Descriptors: Computer Assisted Testing, Adaptive Testing, Models, Bayesian Statistics

Item Selection and Ability Estimation Procedures for a Mixed-Format Adaptive Test

Peer reviewed

Direct link

Ho, Tsung-Han; Dodd, Barbara G. – Applied Measurement in Education, 2012

In this study we compared five item selection procedures using three ability estimation methods in the context of a mixed-format adaptive test based on the generalized partial credit model. The item selection procedures used were maximum posterior weighted information, maximum expected information, maximum posterior weighted Kullback-Leibler…

Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Selection

The Problem of Bias in Person Parameter Estimation in Adaptive Testing

Peer reviewed

Direct link

Doebler, Anna – Applied Psychological Measurement, 2012

It is shown that deviations of estimated from true values of item difficulty parameters, caused for example by item calibration errors, the neglect of randomness of item difficulty parameters, testlet effects, or rule-based item generation, can lead to systematic bias in point estimation of person parameters in the context of adaptive testing.…

Descriptors: Adaptive Testing, Computer Assisted Testing, Computation, Item Response Theory

Item Pool Design for an Operational Variable-Length Computerized Adaptive Test

Peer reviewed

Direct link

He, Wei; Reckase, Mark D. – Educational and Psychological Measurement, 2014

For computerized adaptive tests (CATs) to work well, they must have an item pool with sufficient numbers of good quality items. Many researchers have pointed out that, in developing item pools for CATs, not only is the item pool size important but also the distribution of item parameters and practical considerations such as content distribution…

Descriptors: Item Banks, Test Length, Computer Assisted Testing, Adaptive Testing

Evaluating Knowledge Structure-Based Adaptive Testing Algorithms and System Development

Peer reviewed

Direct link

Wu, Huey-Min; Kuo, Bor-Chen; Yang, Jinn-Min – Educational Technology & Society, 2012

In recent years, many computerized test systems have been developed for diagnosing students' learning profiles. Nevertheless, it remains a challenging issue to find an adaptive testing algorithm to both shorten testing time and precisely diagnose the knowledge status of students. In order to find a suitable algorithm, four adaptive testing…

Descriptors: Adaptive Testing, Test Items, Computer Assisted Testing, Mathematics

Bayesian Procedures for Identifying Aberrant Response-Time Patterns in Adaptive Testing

Peer reviewed

Direct link

van der Linden, Wim J.; Guo, Fanmin – Psychometrika, 2008

In order to identify aberrant response-time patterns on educational and psychological tests, it is important to be able to separate the speed at which the test taker operates from the time the items require. A lognormal model for response times with this feature was used to derive a Bayesian procedure for detecting aberrant response times.…

Descriptors: Adaptive Testing, Bayesian Statistics, Reaction Time, College Entrance Examinations

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4

Weiss, David J.	6
Reckase, Mark D.	4
Kim, Sooyeon	3
McBride, James R.	3
Moses, Tim	3
Vale, C. David	3
van der Linden, Wim J.	3
Glas, Cees A. W.	2
Thayer, Dorothy T.	2
Zwick, Rebecca	2
Abad, Francisco J.	1
Chai, Jun Ho	1
Chen, Ping	1
Chou, Chih-Ping	1
De Ayala, R. J.	1
DeAyala, R. J.	1
Dodd, Barbara G.	1
Doebler, Anna	1
Gialluca, Kathleen A.	1
Glasnapp, Douglas R.	1
Green, Bert F.	1
Guo, Fanmin	1
Hambleton, Ronald K.	1
Hankins, Janette A.	1
More ▼