ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	0
Since 2006 (last 20 years)	11

Source

Applied Psychological…

Publication Type

Journal Articles	21
Reports - Research	8
Reports - Evaluative	7
Collected Works - Serials	3
Reports - Descriptive	2
Book/Product Reviews	1
Information Analyses	1

Education Level

Audience

Practitioners

Location

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing 1 to 15 of 21 results Save | Export

"MSTGen": Simulated Data Generator for Multistage Testing

Peer reviewed

Direct link

Han, Kyung T. – Applied Psychological Measurement, 2013

Multistage testing, or MST, was developed as an alternative to computerized adaptive testing (CAT) for applications in which it is preferable to administer a test at the level of item sets (i.e., modules). As with CAT, the simulation technique in MST plays a critical role in the development and maintenance of tests. "MSTGen," a new MST…

Descriptors: Computer Assisted Testing, Adaptive Testing, Computer Software, Simulation

Item Pocket Method to Allow Response Review and Change in Computerized Adaptive Testing

Peer reviewed

Direct link

Han, Kyung T. – Applied Psychological Measurement, 2013

Most computerized adaptive testing (CAT) programs do not allow test takers to review and change their responses because it could seriously deteriorate the efficiency of measurement and make tests vulnerable to manipulative test-taking strategies. Several modified testing methods have been developed that provide restricted review options while…

Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Testing

Variable-Length Computerized Adaptive Testing Based on Cognitive Diagnosis Models

Peer reviewed

Direct link

Hsu, Chia-Ling; Wang, Wen-Chung; Chen, Shu-Ying – Applied Psychological Measurement, 2013

Interest in developing computerized adaptive testing (CAT) under cognitive diagnosis models (CDMs) has increased recently. CAT algorithms that use a fixed-length termination rule frequently lead to different degrees of measurement precision for different examinees. Fixed precision, in which the examinees receive the same degree of measurement…

Descriptors: Computer Assisted Testing, Adaptive Testing, Cognitive Tests, Diagnostic Tests

Exploring the Full-Information Bifactor Model in Vertical Scaling with Construct Shift

Peer reviewed

Direct link

Li, Ying; Lissitz, Robert W. – Applied Psychological Measurement, 2012

To address the lack of attention to construct shift in item response theory (IRT) vertical scaling, a multigroup, bifactor model was proposed to model the common dimension for all grades and the grade-specific dimensions. Bifactor model estimation accuracy was evaluated through a simulation study with manipulated factors of percentage of common…

Descriptors: Item Response Theory, Scaling, Models, Computation

Setting Time Limits on Tests

Peer reviewed

Direct link

van der Linden, Wim J. – Applied Psychological Measurement, 2011

It is shown how the time limit on a test can be set to control the probability of a test taker running out of time before completing it. The probability is derived from the item parameters in the lognormal model for response times. Examples of curves representing the probability of running out of time on a test with given parameters as a function…

Descriptors: Testing, Timed Tests, Models, Probability

Testing for Nonuniform Differential Item Functioning with Multiple Indicator Multiple Cause Models

Peer reviewed

Direct link

Woods, Carol M.; Grimm, Kevin J. – Applied Psychological Measurement, 2011

In extant literature, multiple indicator multiple cause (MIMIC) models have been presented for identifying items that display uniform differential item functioning (DIF) only, not nonuniform DIF. This article addresses, for apparently the first time, the use of MIMIC models for testing both uniform and nonuniform DIF with categorical indicators. A…

Descriptors: Test Bias, Testing, Interaction, Item Response Theory

Ramsay-Curve Differential Item Functioning

Peer reviewed

Direct link

Woods, Carol M. – Applied Psychological Measurement, 2011

Differential item functioning (DIF) occurs when an item on a test, questionnaire, or interview has different measurement properties for one group of people versus another, irrespective of true group-mean differences on the constructs being measured. This article is focused on item response theory based likelihood ratio testing for DIF (IRT-LR or…

Descriptors: Simulation, Item Response Theory, Testing, Questionnaires

Performance of DIMTEST-and NOHARM-Based Statistics for Testing Unidimensionality

Peer reviewed

Direct link

Finch, Holmes; Habing, Brian – Applied Psychological Measurement, 2007

This Monte Carlo study compares the ability of the parametric bootstrap version of DIMTEST with three goodness-of-fit tests calculated from a fitted NOHARM model to detect violations of the assumption of unidimensionality in testing data. The effectiveness of the procedures was evaluated for different numbers of items, numbers of examinees,…

Descriptors: Guessing (Tests), Testing, Statistics, Monte Carlo Methods

Empirical Selection of Anchors for Tests of Differential Item Functioning

Peer reviewed

Direct link

Woods, Carol M. – Applied Psychological Measurement, 2009

Differential item functioning (DIF) occurs when items on a test or questionnaire have different measurement properties for one group of people versus another, irrespective of group-mean differences on the construct. Methods for testing DIF require matching members of different groups on an estimate of the construct. Preferably, the estimate is…

Descriptors: Test Results, Testing, Item Response Theory, Test Bias

Implementation and Measurement Efficiency of Multidimensional Computerized Adaptive Testing

Peer reviewed

Direct link

Wang, Wen-Chung; Chen, Po-Hsi – Applied Psychological Measurement, 2004

Multidimensional adaptive testing (MAT) procedures are proposed for the measurement of several latent traits by a single examination. Bayesian latent trait estimation and adaptive item selection are derived. Simulations were conducted to compare the measurement efficiency of MAT with those of unidimensional adaptive testing and random…

Descriptors: Item Analysis, Adaptive Testing, Computer Assisted Testing, Computer Simulation

Test Item Banking.

Peer reviewed

van der Linden, Wim J., Ed. – Applied Psychological Measurement, 1986

New theory and practice in testing is replacing the standard test by the test item bank and classical test theory by item response theory. Eight papers and a commentary are presented in this special issue concerning test item banking. (SLD)

Descriptors: Adaptive Testing, Algorithms, Bayesian Statistics, Computer Assisted Testing

Advances in Item Response Theory and Applications.

Peer reviewed

Hambleton, Ronald K., Ed.; van der Linden, Wim J., Ed. – Applied Psychological Measurement, 1982

Item response theory (IRT) is having a major impact on the field of testing. This special issue presents an introduction and seven papers concerning developments in IRT applications. Some important IRT research being conducted outside the United States is highlighted. (SLD)

Descriptors: Adaptive Testing, Equated Scores, Item Analysis, Latent Trait Theory

A Monte Carlo Approach to Unidimensionality Testing in Polytomous Rasch Models

Peer reviewed

Direct link

Christensen, Karl Bang; Kreiner, Svend – Applied Psychological Measurement, 2007

Many statistical tests are designed to test the different assumptions of the Rasch model, but only few are directed at detecting multidimensionality. The Martin-Lof test is an attractive approach, the disadvantage being that its null distribution deviates strongly from the asymptotic chi-square distribution for most realistic sample sizes. A Monte…

Descriptors: Item Response Theory, Monte Carlo Methods, Testing, Models

Variations on a Theme by Thurstone.

Peer reviewed

Lumsden, James – Applied Psychological Measurement, 1980

A test theory model based on the Thurstone judgmental model is described. By restricting various parameters of the model, 3 Rasch models, 2 pseudo-Rasch models, 3 two-parameter models, and a Weber's Law model are derived. (Author/CTM)

Descriptors: Latent Trait Theory, Mathematical Models, Scaling, Test Items

Estimating Consistency and Accuracy Indices for Multiple Classifications

Peer reviewed

Direct link

Lee, Won-Chan; Hanson, Bradley A.; Brennan, Robert L. – Applied Psychological Measurement, 2002

This article describes procedures for estimating various indices of classification consistency and accuracy for multiple category classifications using data from a single test administration. The estimates of the classification consistency and accuracy indices are compared under three different psychometric models: the two-parameter beta binomial,…

Descriptors: Classification, True Scores, Psychometrics, Item Response Theory

Previous Page | Next Page »

Pages: 1 | 2

Testing	21
Item Response Theory	9
Models	8
Adaptive Testing	6
Computer Assisted Testing	5
Simulation	5
Test Construction	5
Test Items	5
Evaluation Methods	4
Item Analysis	4
Latent Trait Theory	4
Computation	3
Equated Scores	3
Goodness of Fit	3
Maximum Likelihood Statistics	3
Measurement Techniques	3
Monte Carlo Methods	3
Psychometrics	3
Test Bias	3
Accuracy	2
Bayesian Statistics	2
Classification	2
Comparative Analysis	2
Computer Software	2
Correlation	2
More ▼

Woods, Carol M.	3
Hambleton, Ronald K., Ed.	2
Han, Kyung T.	2
Wang, Wen-Chung	2
van der Linden, Wim J., Ed.	2
Brennan, Robert L.	1
Chen, Po-Hsi	1
Chen, Shu-Ying	1
Christensen, Karl Bang	1
Finch, Holmes	1
Frary, Robert B.	1
Grimm, Kevin J.	1
Habing, Brian	1
Hanson, Bradley A.	1
Hsu, Chia-Ling	1
Kolen, Michael J.	1
Kreiner, Svend	1
Lee, Won-Chan	1
Li, Ying	1
Linn, Robert L.	1
Lissitz, Robert W.	1
Lumsden, James	1
May, Kim	1
Schwarz, Richard D.	1
Slinde, Jeffrey A.	1
More ▼