Han, Kyung T. – Applied Psychological Measurement, 2013
Multistage testing, or MST, was developed as an alternative to computerized adaptive testing (CAT) for applications in which it is preferable to administer a test at the level of item sets (i.e., modules). As with CAT, the simulation technique in MST plays a critical role in the development and maintenance of tests. "MSTGen," a new MST…
Descriptors: Computer Assisted Testing, Adaptive Testing, Computer Software, Simulation
Han, Kyung T. – Applied Psychological Measurement, 2013
Most computerized adaptive testing (CAT) programs do not allow test takers to review and change their responses because it could seriously deteriorate the efficiency of measurement and make tests vulnerable to manipulative test-taking strategies. Several modified testing methods have been developed that provide restricted review options while…
Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Testing
Hsu, Chia-Ling; Wang, Wen-Chung; Chen, Shu-Ying – Applied Psychological Measurement, 2013
Interest in developing computerized adaptive testing (CAT) under cognitive diagnosis models (CDMs) has increased recently. CAT algorithms that use a fixed-length termination rule frequently lead to different degrees of measurement precision for different examinees. Fixed precision, in which the examinees receive the same degree of measurement…
Descriptors: Computer Assisted Testing, Adaptive Testing, Cognitive Tests, Diagnostic Tests
Li, Ying; Lissitz, Robert W. – Applied Psychological Measurement, 2012
To address the lack of attention to construct shift in item response theory (IRT) vertical scaling, a multigroup, bifactor model was proposed to model the common dimension for all grades and the grade-specific dimensions. Bifactor model estimation accuracy was evaluated through a simulation study with manipulated factors of percentage of common…
Descriptors: Item Response Theory, Scaling, Models, Computation
van der Linden, Wim J. – Applied Psychological Measurement, 2011
It is shown how the time limit on a test can be set to control the probability of a test taker running out of time before completing it. The probability is derived from the item parameters in the lognormal model for response times. Examples of curves representing the probability of running out of time on a test with given parameters as a function…
Descriptors: Testing, Timed Tests, Models, Probability
Woods, Carol M.; Grimm, Kevin J. – Applied Psychological Measurement, 2011
In extant literature, multiple indicator multiple cause (MIMIC) models have been presented for identifying items that display uniform differential item functioning (DIF) only, not nonuniform DIF. This article addresses, for apparently the first time, the use of MIMIC models for testing both uniform and nonuniform DIF with categorical indicators. A…
Descriptors: Test Bias, Testing, Interaction, Item Response Theory
Woods, Carol M. – Applied Psychological Measurement, 2011
Differential item functioning (DIF) occurs when an item on a test, questionnaire, or interview has different measurement properties for one group of people versus another, irrespective of true group-mean differences on the constructs being measured. This article is focused on item response theory based likelihood ratio testing for DIF (IRT-LR or…
Descriptors: Simulation, Item Response Theory, Testing, Questionnaires
Finch, Holmes; Habing, Brian – Applied Psychological Measurement, 2007
This Monte Carlo study compares the ability of the parametric bootstrap version of DIMTEST with three goodness-of-fit tests calculated from a fitted NOHARM model to detect violations of the assumption of unidimensionality in testing data. The effectiveness of the procedures was evaluated for different numbers of items, numbers of examinees,…
Descriptors: Guessing (Tests), Testing, Statistics, Monte Carlo Methods
Woods, Carol M. – Applied Psychological Measurement, 2009
Differential item functioning (DIF) occurs when items on a test or questionnaire have different measurement properties for one group of people versus another, irrespective of group-mean differences on the construct. Methods for testing DIF require matching members of different groups on an estimate of the construct. Preferably, the estimate is…
Descriptors: Test Results, Testing, Item Response Theory, Test Bias
Wang, Wen-Chung; Chen, Po-Hsi – Applied Psychological Measurement, 2004
Multidimensional adaptive testing (MAT) procedures are proposed for the measurement of several latent traits by a single examination. Bayesian latent trait estimation and adaptive item selection are derived. Simulations were conducted to compare the measurement efficiency of MAT with those of unidimensional adaptive testing and random…
Descriptors: Item Analysis, Adaptive Testing, Computer Assisted Testing, Computer Simulation

van der Linden, Wim J., Ed. – Applied Psychological Measurement, 1986
New theory and practice in testing is replacing the standard test by the test item bank and classical test theory by item response theory. Eight papers and a commentary are presented in this special issue concerning test item banking. (SLD)
Descriptors: Adaptive Testing, Algorithms, Bayesian Statistics, Computer Assisted Testing

Hambleton, Ronald K., Ed.; van der Linden, Wim J., Ed. – Applied Psychological Measurement, 1982
Item response theory (IRT) is having a major impact on the field of testing. This special issue presents an introduction and seven papers concerning developments in IRT applications. Some important IRT research being conducted outside the United States is highlighted. (SLD)
Descriptors: Adaptive Testing, Equated Scores, Item Analysis, Latent Trait Theory
Christensen, Karl Bang; Kreiner, Svend – Applied Psychological Measurement, 2007
Many statistical tests are designed to test the different assumptions of the Rasch model, but only few are directed at detecting multidimensionality. The Martin-Löf test is an attractive approach, the disadvantage being that its null distribution deviates strongly from the asymptotic chi-square distribution for most realistic sample sizes. A Monte…
Descriptors: Item Response Theory, Monte Carlo Methods, Testing, Models

Lumsden, James – Applied Psychological Measurement, 1980
A test theory model based on the Thurstone judgmental model is described. By restricting various parameters of the model, 3 Rasch models, 2 pseudo-Rasch models, 3 two-parameter models, and a Weber's Law model are derived. (Author/CTM)
Descriptors: Latent Trait Theory, Mathematical Models, Scaling, Test Items
Lee, Won-Chan; Hanson, Bradley A.; Brennan, Robert L. – Applied Psychological Measurement, 2002
This article describes procedures for estimating various indices of classification consistency and accuracy for multiple category classifications using data from a single test administration. The estimates of the classification consistency and accuracy indices are compared under three different psychometric models: the two-parameter beta binomial,…
Descriptors: Classification, True Scores, Psychometrics, Item Response Theory