Kopf, Julia; Zeileis, Achim; Strobl, Carolin – Educational and Psychological Measurement, 2015
Differential item functioning (DIF) indicates the violation of the invariance assumption, for instance, in models based on item response theory (IRT). For item-wise DIF analysis using IRT, a common metric for the item parameters of the groups that are to be compared (e.g., for the reference and the focal group) is necessary. In the Rasch model,…
Descriptors: Test Items, Equated Scores, Test Bias, Item Response Theory
Seo, Dong Gi; Weiss, David J. – Educational and Psychological Measurement, 2015
Most computerized adaptive tests (CATs) have been studied using the framework of unidimensional item response theory. However, many psychological variables are multidimensional and might benefit from using a multidimensional approach to CATs. This study investigated the accuracy, fidelity, and efficiency of a fully multidimensional CAT algorithm…
Descriptors: Computer Assisted Testing, Adaptive Testing, Accuracy, Fidelity
Chen, Pei-Hua; Chang, Hua-Hua; Wu, Haiyan – Educational and Psychological Measurement, 2012
Two sampling-and-classification-based procedures were developed for automated test assembly: the Cell Only and the Cell and Cube methods. A simulation study based on a 540-item bank was conducted to compare the performance of the procedures with the performance of a mixed-integer programming (MIP) method for assembling multiple parallel test…
Descriptors: Test Items, Selection, Test Construction, Item Response Theory
Veldkamp, Bernard P. – Psicologica: International Journal of Methodology and Experimental Psychology, 2010
Application of Bayesian item selection criteria in computerized adaptive testing might result in improvement of bias and MSE of the ability estimates. The question remains how to apply Bayesian item selection criteria in the context of constrained adaptive testing, where large numbers of specifications have to be taken into account in the item…
Descriptors: Selection, Criteria, Bayesian Statistics, Computer Assisted Testing
Kang, Taehoon; Cohen, Allan S.; Sung, Hyun-Jung – Applied Psychological Measurement, 2009
This study examines the utility of four indices for use in model selection with nested and nonnested polytomous item response theory (IRT) models: a cross-validation index and three information-based indices. Four commonly used polytomous IRT models are considered: the graded response model, the generalized partial credit model, the partial credit…
Descriptors: Item Response Theory, Models, Selection, Simulation
Barrada, Juan Ramon; Olea, Julio; Ponsoda, Vicente; Abad, Francisco Jose – Applied Psychological Measurement, 2010
In a typical study comparing the relative efficiency of two item selection rules in computerized adaptive testing, the common result is that they simultaneously differ in accuracy and security, making it difficult to reach a conclusion on which is the more appropriate rule. This study proposes a strategy to conduct a global comparison of two or…
Descriptors: Test Items, Simulation, Adaptive Testing, Item Analysis
Cheng, Ying; Chang, Hua-Hua; Douglas, Jeffrey; Guo, Fanmin – Educational and Psychological Measurement, 2009
a-stratification is a method that utilizes items with small discrimination (a) parameters early in an exam and those with higher a values when more is learned about the ability parameter. It can achieve much better item usage than the maximum information criterion (MIC). To make a-stratification more practical and more widely applicable, a method…
Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Selection
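The idea of a-stratification described in this abstract can be sketched in a few lines. This is only an illustration under stated assumptions, not the method proposed in the article: the item bank is a hypothetical list of dicts with `id`, `a` (discrimination) and `b` (difficulty) fields, strata are formed by sorting on `a`, and within a stratum the item whose difficulty is closest to the current ability estimate is chosen. The function names `build_strata` and `pick_item` are made up for this example.

```python
def build_strata(items, n_strata):
    """Sort the bank by discrimination (a) and cut it into n_strata
    consecutive groups, lowest-a items first."""
    ordered = sorted(items, key=lambda it: it["a"])
    size, extra = divmod(len(ordered), n_strata)
    strata, start = [], 0
    for k in range(n_strata):
        # Spread any remainder over the first `extra` strata.
        end = start + size + (1 if k < extra else 0)
        strata.append(ordered[start:end])
        start = end
    return strata


def pick_item(stratum, theta_hat, used):
    """Within one stratum, pick the unused item whose difficulty (b)
    is closest to the current ability estimate theta_hat."""
    candidates = [it for it in stratum if it["id"] not in used]
    return min(candidates, key=lambda it: abs(it["b"] - theta_hat))


# Hypothetical four-item bank, for illustration only.
bank = [
    {"id": 1, "a": 1.5, "b": 0.0},
    {"id": 2, "a": 0.5, "b": 1.0},
    {"id": 3, "a": 0.8, "b": -1.0},
    {"id": 4, "a": 2.0, "b": 0.5},
]
strata = build_strata(bank, n_strata=2)
early_item = pick_item(strata[0], theta_hat=0.8, used=set())  # low-a stratum
late_item = pick_item(strata[1], theta_hat=0.4, used={4})     # high-a stratum
```

Early in the test the draw comes from `strata[0]`, so highly discriminating items are held back until the ability estimate is more stable, which is what spreads item usage more evenly than always taking the most informative item.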
Wedig, Timothy – PS: Political Science and Politics, 2010
Classroom simulations can make a significant contribution to learning outcomes in political science courses, provided that they are firmly linked to course content and learning objectives. This article offers a step-by-step decision framework for instructors seeking to use simulations as a core component of their courses, including selection of an…
Descriptors: Simulation, Political Science, Selection, Teacher Role
Haake, Magnus; Gulz, Agneta – International Journal of Artificial Intelligence in Education, 2009
The paper presents a theoretical framework addressing three aspects of embodied pedagogical agents: visual static appearance, pedagogical role, and communicative style. The framework is then applied to a user study where 90 school children (aged 12-15) in a dummy multimedia program were presented with either an instructor or a learning companion…
Descriptors: Foreign Countries, Computer Assisted Instruction, Multimedia Materials, Computer Graphics
Barnette, J. Jackson; McLean, James E. – 1999
Four of the most commonly used multiple comparison procedures were compared for pairwise comparisons and relative to control of per-experiment and experimentwise Type I errors when conducted as protected or unprotected tests. The methods are: (1) Dunn-Bonferroni; (2) Dunn-Sidak; (3) Holm's sequentially rejective; and (4) Tukey's honestly…
Descriptors: Comparative Analysis, Monte Carlo Methods, Research Methodology, Selection
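As a rough illustration of the first three procedures named in this abstract, the sketch below computes the per-comparison significance level for Dunn-Bonferroni and Dunn-Sidak and applies Holm's sequentially rejective rule to a list of p-values. It uses the textbook definitions, not the simulation code from the report; the function names and the example p-values are invented for illustration, and Tukey's procedure is left out because the abstract is truncated at that point.

```python
def bonferroni_alpha(alpha, m):
    """Per-comparison level for m comparisons (Dunn-Bonferroni)."""
    return alpha / m


def sidak_alpha(alpha, m):
    """Per-comparison level for m comparisons (Dunn-Sidak)."""
    return 1 - (1 - alpha) ** (1 / m)


def holm_reject(p_values, alpha=0.05):
    """Holm's sequentially rejective procedure.

    Returns a list of booleans in the original order of p_values,
    True where the corresponding hypothesis is rejected.
    """
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    reject = [False] * m
    for rank, idx in enumerate(order):
        # The smallest p-value is tested at alpha/m, the next at alpha/(m-1), ...
        if p_values[idx] <= alpha / (m - rank):
            reject[idx] = True
        else:
            break  # stop at the first non-rejection
    return reject


# Hypothetical p-values from three pairwise comparisons.
decisions = holm_reject([0.01, 0.04, 0.03], alpha=0.05)
```

For any number of comparisons greater than one, the Sidak level is slightly larger than the Bonferroni level, so it is marginally less conservative; Holm's step-down rule rejects at least as many hypotheses as Bonferroni while targeting the same familywise error rate.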

Kingsbury, G. Gage; Zara, Anthony R. – Applied Measurement in Education, 1991
This simulation investigated two procedures that reduce differences between paper-and-pencil testing and computerized adaptive testing (CAT) by making CAT content sensitive. Results indicate that the price in terms of additional test items of using constrained CAT for content balancing is much smaller than that of using testlets. (SLD)
Descriptors: Adaptive Testing, Comparative Analysis, Computer Assisted Testing, Computer Simulation

Marcoulides, George A. – Educational and Psychological Measurement, 1994
Effects of different weighting schemes on selecting the optimal number of observations in multivariate-multifacet generalizability designs are studied when cost constraints are imposed. Comparison of four schemes through simulation indicates that all four produce similar optimal values and that reliability should be similar. (SLD)
Descriptors: Budgeting, Comparative Analysis, Costs, Factor Analysis
Beasley, T. Mark; Sheehan, Janet K. – 1994
C. L. Olson (1976, 1979) suggests the Pillai-Bartlett trace (V) as an omnibus multivariate analysis of variance (MANOVA) test statistic for its superior robustness to heterogeneous variances. J. Stevens (1979, 1980) contends that the robustness of V, Wilks' lambda (W) and the Hotelling-Lawley trace (T) are similar, and that their power functions…
Descriptors: Analysis of Covariance, Comparative Analysis, Matrices, Monte Carlo Methods

Schumacker, Randall E. – 1994
A population data set was randomly generated from which a random sample was drawn. This sample was randomly divided into two data sets, one of which was used to generate parameter estimates, which were then used in the second data set for cross-validation purposes. The best variable subset models were compared between the two data sets on the…
Descriptors: Comparative Analysis, Criteria, Estimation (Mathematics), Factor Analysis
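The split-sample design described in this abstract (estimate on one half, apply the estimates to the other half) can be sketched as follows. This is a deliberately simplified illustration, not the study's procedure: it assumes a single predictor fitted by ordinary least squares rather than a best-subset multiple regression, and the helper names `split_half`, `fit_line` and `holdout_mse`, along with the toy data, are hypothetical.

```python
import random


def split_half(rows, seed=0):
    """Randomly divide a sample into two halves of (nearly) equal size."""
    rng = random.Random(seed)
    shuffled = rows[:]
    rng.shuffle(shuffled)
    mid = len(shuffled) // 2
    return shuffled[:mid], shuffled[mid:]


def fit_line(rows):
    """Least-squares slope and intercept for (x, y) pairs."""
    n = len(rows)
    mean_x = sum(x for x, _ in rows) / n
    mean_y = sum(y for _, y in rows) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in rows)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in rows)
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def holdout_mse(rows, slope, intercept):
    """Mean squared prediction error when the estimates from one half
    are applied unchanged to the other half."""
    return sum((y - (slope * x + intercept)) ** 2 for x, y in rows) / len(rows)


# Toy sample with an exact linear relation, for illustration only.
sample = [(x, 2 * x + 1) for x in range(10)]
calibration, validation = split_half(sample)
slope, intercept = fit_line(calibration)
error = holdout_mse(validation, slope, intercept)
```

Because the parameters are never re-estimated on the validation half, the holdout error gives a less optimistic picture of model fit than the error computed on the calibration half itself.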

Dodd, Barbara G. – Applied Psychological Measurement, 1990
Using one simulated and two real data sets, the effects of the systematic variation of the item-selection procedure and the stepsize method on the operating characteristics of computerized adaptive testing (CAT) for instruments with polychotomously scored rating scale items were studied. The six rating scale CAT procedures used performed well.…
Descriptors: Adaptive Testing, Attitude Measures, Comparative Analysis, Computer Assisted Testing