Showing all 9 results
Peer reviewed
Belov, Dmitry I.; Armstrong, Ronald D.; Weissman, Alexander – Applied Psychological Measurement, 2008
This article presents a new algorithm for computerized adaptive testing (CAT) when content constraints are present. The algorithm is based on shadow CAT methodology to meet content constraints but applies Monte Carlo methods and provides the following advantages over shadow CAT: (a) lower maximum item exposure rates, (b) higher utilization of the…
Descriptors: Test Items, Monte Carlo Methods, Law Schools, Adaptive Testing
Peer reviewed
Briggs, Derek C.; Wilson, Mark – Journal of Educational Measurement, 2007
An approach called generalizability in item response modeling (GIRM) is introduced in this article. The GIRM approach essentially incorporates the sampling model of generalizability theory (GT) into the scaling model of item response theory (IRT) by making distributional assumptions about the relevant measurement facets. By specifying a random…
Descriptors: Markov Processes, Generalizability Theory, Item Response Theory, Computation
Peer reviewed
De Corte, Wilfried – Educational and Psychological Measurement, 2004
The article describes a Windows program to estimate the expected value and sampling distribution function of the adverse impact ratio for general multistage selections. The results of the program can also be used to predict the risk that a future selection decision will result in an outcome that reflects the presence of adverse impact. The method…
Descriptors: Sampling, Measurement Techniques, Evaluation Methods, Computer Software
Peer reviewed
Laufer, Batia – Applied Linguistics, 2005
This paper is a response to Paul Meara's (2005) critique of the Lexical Frequency Profile (LFP). Using simulated data, he challenges the claim that LFP is a sensitive and reliable tool for assessing vocabulary use in L2 speakers. In my response to his paper, I discuss the nature of lexical competence, in light of which LFP results should be…
Descriptors: Word Frequency, Lexicology, Profiles, Criticism
Peer reviewed
Chen, Ru San; Dunlap, William P. – Journal of Educational Statistics, 1994
The present simulation study confirms that the corrected epsilon approximate test of B. Lecoutre yields a less biased estimate of population epsilon and reduces Type I error rates when compared to the epsilon approximate test of H. Huynh and L. S. Feldt. (SLD)
Descriptors: Computer Simulation, Estimation (Mathematics), Evaluation Methods, Monte Carlo Methods
Peer reviewed
Roznowski, Mary; And Others – Applied Psychological Measurement, 1991
Three heuristic methods of assessing the dimensionality of binary item pools were evaluated in a Monte Carlo investigation. The indices were based on (1) the local independence of unidimensional tests; (2) patterns of second-factor loadings derived from simplex theory; and (3) the shape of the curve of successive eigenvalues. (SLD)
Descriptors: Comparative Analysis, Computer Simulation, Correlation, Evaluation Methods
Levy, Roy; Mislevy, Robert J. – US Department of Education, 2004
The challenges of modeling students' performance in simulation-based assessments include accounting for multiple aspects of knowledge and skill that arise in different situations and the conditional dependencies among multiple aspects of performance in a complex assessment. This paper describes a Bayesian approach to modeling and estimating…
Descriptors: Probability, Markov Processes, Monte Carlo Methods, Bayesian Statistics
Ziomek, Robert L.; Szymczuk, Mike – 1983
In order to evaluate standard setting procedures, apart from the more commonly applied approach of simply comparing the derived standards or failure rates across various techniques, this study investigated the errors of classification associated with the contrasting groups procedures. Monte Carlo simulations were employed to produce…
Descriptors: Classification, Computer Simulation, Error of Measurement, Evaluation Methods
Rose, Andrew M.; And Others – 1985
This third of three volumes reports on analytic procedures conducted to address various aspects of the scalar properties of the Device Effectiveness Forecasting Technique (DEFT). DEFT, a series of microcomputer programs applied to data gathered from rating scales, is used to evaluate simulator devices used in U.S. Army weapons training. The…
Descriptors: Adults, Computer Oriented Programs, Computer Simulation, Data Interpretation