ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	0
Since 2006 (last 20 years)	44

Descriptor

Item Response Theory	78
Computer Assisted Testing	47
Adaptive Testing	40
Test Items	31
Simulation	26
Models	17
Test Construction	16
Item Banks	15
Computation	14
Evaluation Methods	11
Comparative Analysis	9
Item Analysis	9
Testing	9
Test Bias	8
Testing Programs	8
Bayesian Statistics	7
Equations (Mathematics)	7
Error of Measurement	7
Monte Carlo Methods	7
Psychometrics	7
Test Format	7
Computer Simulation	6
Data Analysis	6
Equated Scores	6
Goodness of Fit	6
More ▼

Source

Applied Psychological…

Publication Type

Journal Articles	78
Reports - Evaluative	33
Reports - Research	32
Reports - Descriptive	8
Book/Product Reviews	2
Information Analyses	2

Education Level

Higher Education	7
High Schools	2
Postsecondary Education	1
Secondary Education	1

Audience

Practitioners

Location

Netherlands	4
Taiwan	1

Laws, Policies, & Programs

Assessments and Surveys

Law School Admission Test	3
ACT Assessment	1
Armed Forces Qualification…	1
Armed Services Vocational…	1
California Achievement Tests	1
Center for Epidemiologic…	1
Eysenck Personality Inventory	1
Iowa Tests of Basic Skills	1
Multidimensional Personality…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 78 results Save | Export

Deriving Stopping Rules for Multidimensional Computerized Adaptive Testing

Peer reviewed

Direct link

Wang, Chun; Chang, Hua-Hua; Boughton, Keith A. – Applied Psychological Measurement, 2013

Multidimensional computerized adaptive testing (MCAT) is able to provide a vector of ability estimates for each examinee, which could be used to provide a more informative profile of an examinee's performance. The current literature on MCAT focuses on the fixed-length tests, which can generate less accurate results for those examinees whose…

Descriptors: Computer Assisted Testing, Adaptive Testing, Test Length, Item Banks

Uncertainties in the Item Parameter Estimates and Robust Automated Test Assembly

Peer reviewed

Direct link

Veldkamp, Bernard P.; Matteucci, Mariagiulia; de Jong, Martijn G. – Applied Psychological Measurement, 2013

Item response theory parameters have to be estimated, and because of the estimation process, they do have uncertainty in them. In most large-scale testing programs, the parameters are stored in item banks, and automated test assembly algorithms are applied to assemble operational test forms. These algorithms treat item parameters as fixed values,…

Descriptors: Test Construction, Test Items, Item Banks, Automation

The Random-Threshold Generalized Unfolding Model and Its Application of Computerized Adaptive Testing

Peer reviewed

Direct link

Wang, Wen-Chung; Liu, Chen-Wei; Wu, Shiu-Lien – Applied Psychological Measurement, 2013

The random-threshold generalized unfolding model (RTGUM) was developed by treating the thresholds in the generalized unfolding model as random effects rather than fixed effects to account for the subjective nature of the selection of categories in Likert items. The parameters of the new model can be estimated with the JAGS (Just Another Gibbs…

Descriptors: Computer Assisted Testing, Adaptive Testing, Models, Bayesian Statistics

Comparing the Performance of Five Multidimensional CAT Selection Procedures with Different Stopping Rules

Peer reviewed

Direct link

Yao, Lihua – Applied Psychological Measurement, 2013

Through simulated data, five multidimensional computerized adaptive testing (MCAT) selection procedures with varying test lengths are examined and compared using different stopping rules. Fixed item exposure rates are used for all the items, and the Priority Index (PI) method is used for the content constraints. Two stopping rules, standard error…

Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Selection

Firestar-"D": Computerized Adaptive Testing Simulation Program for Dichotomous Item Response Theory Models

Peer reviewed

Direct link

Choi, Seung W.; Podrabsky, Tracy; McKinney, Natalie – Applied Psychological Measurement, 2012

Computerized adaptive testing (CAT) enables efficient and flexible measurement of latent constructs. The majority of educational and cognitive measurement constructs are based on dichotomous item response theory (IRT) models. An integral part of developing various components of a CAT system is conducting simulations using both known and empirical…

Descriptors: Computer Assisted Testing, Adaptive Testing, Computer Software, Item Response Theory

A CUSUM to Detect Person Misfit: A Discussion and Some Alternatives for Existing Procedures

Peer reviewed

Direct link

Tendeiro, Jorge N.; Meijer, Rob R. – Applied Psychological Measurement, 2012

This article extends the work by Armstrong and Shi on CUmulative SUM (CUSUM) person-fit methodology. The authors present new theoretical considerations concerning the use of CUSUM person-fit statistics based on likelihood ratios for the purpose of detecting cheating and random guessing by individual test takers. According to the Neyman-Pearson…

Descriptors: Cheating, Individual Testing, Adaptive Testing, Statistics

Computerized Adaptive Testing Using a Class of High-Order Item Response Theory Models

Peer reviewed

Direct link

Huang, Hung-Yu; Chen, Po-Hsi; Wang, Wen-Chung – Applied Psychological Measurement, 2012

In the human sciences, a common assumption is that latent traits have a hierarchical structure. Higher order item response theory models have been developed to account for this hierarchy. In this study, computerized adaptive testing (CAT) algorithms based on these kinds of models were implemented, and their performance under a variety of…

Descriptors: Computer Assisted Testing, Adaptive Testing, Item Response Theory, Simulation

The Problem of Bias in Person Parameter Estimation in Adaptive Testing

Peer reviewed

Direct link

Doebler, Anna – Applied Psychological Measurement, 2012

It is shown that deviations of estimated from true values of item difficulty parameters, caused for example by item calibration errors, the neglect of randomness of item difficulty parameters, testlet effects, or rule-based item generation, can lead to systematic bias in point estimation of person parameters in the context of adaptive testing.…

Descriptors: Adaptive Testing, Computer Assisted Testing, Computation, Item Response Theory

Curtailment and Stochastic Curtailment to Shorten the CES-D

Peer reviewed

Direct link

Finkelman, Matthew D.; Smits, Niels; Kim, Wonsuk; Riley, Barth – Applied Psychological Measurement, 2012

The Center for Epidemiologic Studies-Depression (CES-D) scale is a well-known self-report instrument that is used to measure depressive symptomatology. Respondents who take the full-length version of the CES-D are administered a total of 20 items. This article investigates the use of curtailment and stochastic curtailment (SC), two sequential…

Descriptors: Measures (Individuals), Depression (Psychology), Test Length, Computer Assisted Testing

Exploring the Full-Information Bifactor Model in Vertical Scaling with Construct Shift

Peer reviewed

Direct link

Li, Ying; Lissitz, Robert W. – Applied Psychological Measurement, 2012

To address the lack of attention to construct shift in item response theory (IRT) vertical scaling, a multigroup, bifactor model was proposed to model the common dimension for all grades and the grade-specific dimensions. Bifactor model estimation accuracy was evaluated through a simulation study with manipulated factors of percentage of common…

Descriptors: Item Response Theory, Scaling, Models, Computation

Detecting Differential Item Functioning of Polytomous Items for an Ideal Point Response Process

Peer reviewed

Direct link

Wang, Wei; Tay, Louis; Drasgow, Fritz – Applied Psychological Measurement, 2013

There has been growing use of ideal point models to develop scales measuring important psychological constructs. For meaningful comparisons across groups, it is important to identify items on such scales that exhibit differential item functioning (DIF). In this study, the authors examined several methods for assessing DIF on polytomous items…

Descriptors: Test Bias, Effect Size, Item Response Theory, Statistical Analysis

An Empirical Evaluation of the Slip Correction in the Four Parameter Logistic Models with Computerized Adaptive Testing

Peer reviewed

Direct link

Yen, Yung-Chin; Ho, Rong-Guey; Laio, Wen-Wei; Chen, Li-Ju; Kuo, Ching-Chin – Applied Psychological Measurement, 2012

In a selected response test, aberrant responses such as careless errors and lucky guesses might cause error in ability estimation because these responses do not actually reflect the knowledge that examinees possess. In a computerized adaptive test (CAT), these aberrant responses could further cause serious estimation error due to dynamic item…

Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Response Style (Tests)

A Comment on Early Student Blunders on Computer-Based Adaptive Tests

Peer reviewed

Direct link

Green, Bert F. – Applied Psychological Measurement, 2011

This article refutes a recent claim that computer-based tests produce biased scores for very proficient test takers who make mistakes on one or two initial items and that the "bias" can be reduced by using a four-parameter IRT model. Because the same effect occurs with pattern scores on nonadaptive tests, the effect results from IRT scoring, not…

Descriptors: Adaptive Testing, Computer Assisted Testing, Test Bias, Item Response Theory

A Review of DIMPACK Version 1.0: Conditional Covariance-Based Test Dimensionality Analysis Package

Peer reviewed

Direct link

Deng, Nina; Han, Kyung T.; Hambleton, Ronald K. – Applied Psychological Measurement, 2013

DIMPACK Version 1.0 for assessing test dimensionality based on a nonparametric conditional covariance approach is reviewed. This software was originally distributed by Assessment Systems Corporation and now can be freely accessed online. The software consists of Windows-based interfaces of three components: DIMTEST, DETECT, and CCPROX/HAC, which…

Descriptors: Item Response Theory, Nonparametric Statistics, Statistical Analysis, Computer Software

A Comparison of Item Selection Techniques for Testlets

Peer reviewed

Direct link

Murphy, Daniel L.; Dodd, Barbara G.; Vaughn, Brandon K. – Applied Psychological Measurement, 2010

This study examined the performance of the maximum Fisher's information, the maximum posterior weighted information, and the minimum expected posterior variance methods for selecting items in a computerized adaptive testing system when the items were grouped in testlets. A simulation study compared the efficiency of ability estimation among the…

Descriptors: Simulation, Adaptive Testing, Item Analysis, Item Response Theory

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6

van der Linden, Wim J.	4
Chang, Hua-Hua	3
Lewis, Charles	3
Veldkamp, Bernard P.	3
Woods, Carol M.	3
Ackerman, Terry	2
Armstrong, Ronald D.	2
Belov, Dmitry I.	2
Brennan, Robert L.	2
Choi, Seung W.	2
Dodd, Barbara G.	2
Drasgow, Fritz	2
Finch, Holmes	2
Finkelman, Matthew D.	2
Green, Bert F.	2
Habing, Brian	2
Hambleton, Ronald K.	2
Hol, A. Michiel	2
Meijer, Rob R.	2
Mellenbergh, Gideon J.	2
Reise, Steven P.	2
Sheehan, Kathleen	2
Vorst, Harrie C. M.	2
Wang, Wen-Chung	2
More ▼