Showing 5,101 to 5,115 of 9,552 results
Peer reviewed
Zwick, Rebecca; Thayer, Dorothy T. – Applied Psychological Measurement, 2002
Used a simulation to investigate whether a differential item functioning (DIF) analysis method can be applied to computerized adaptive test data. Results show the performance of this empirical Bayes enhancement of the Mantel-Haenszel DIF analysis method to be quite promising. (SLD)
Descriptors: Adaptive Testing, Bayesian Statistics, Computer Assisted Testing, Item Bias
Peer reviewed
de Gruijter, Dato N. M. – Applied Psychological Measurement, 1990
Following a brief discussion of test construction by linear programing, the results of a study by F. B. Baker and others (1988) with respect to a uniform target are replicated. It is demonstrated that the result depends on characteristics of the item pool. (SLD)
Descriptors: Item Response Theory, Linear Programing, Mathematical Models, Test Construction
Peer reviewed
Nicewander, W. Alan – Psychometrika, 1990
An estimate and an upper-bound estimate for the reliability of a test composed of binary items are derived from the multidimensional latent trait theory of R. D. Bock and M. Aitkin (1981). The practical uses of such estimates are discussed. (SLD)
Descriptors: Estimation (Mathematics), Factor Analysis, Item Response Theory, Test Items
Peer reviewed
Baker, Frank B.; And Others – Applied Psychological Measurement, 1988
Linear programing was used to select items from item pools (N=1,500) based on one-, two-, and three-parameter models so that a target test information function (TTIF) was reached. Focus was on the distributional characteristics of selected items. The linear-programing approach focuses on the worst feature of the TTIF. (TJH)
Descriptors: Item Banks, Latent Trait Theory, Linear Programing, Test Construction
Peer reviewed
D'Amato, Rik Carl; And Others – Journal of School Psychology, 1988
Investigated the overlap between the Wechsler Intelligence Scale for Children - Revised (WISC-R) and the Halstead-Reitan Neuropsychological Battery (HRNB) in light of their use in diagnosing children's learning problems using scores for children (N=1,181) on the WISC-R and the HRNB. Results showed primary overlap between measures was attributed to…
Descriptors: Adolescents, Children, Intelligence Tests, Test Items
Peer reviewed
Parshall, Cynthia G.; Miller, Timothy R. – Journal of Educational Measurement, 1995
Exact testing was evaluated as a method for conducting Mantel-Haenszel differential item functioning (DIF) analyses with relatively small samples. A series of computer simulations found that the asymptotic Mantel-Haenszel and the exact method yielded very similar results across sample size, levels of DIF, and data sets. (SLD)
Descriptors: Comparative Analysis, Computer Simulation, Identification, Item Bias
Peer reviewed
Bacon, Donald R.; And Others – Educational and Psychological Measurement, 1995
The potential for bias in reliability estimation and for errors in item selection when alpha or unit-weighted omega coefficients are used is explored under simulated conditions. Results suggest that composite reliability may be an assessment tool but should not be an item selection tool in structural equations. (SLD)
Descriptors: Bias, Estimation (Mathematics), Reliability, Selection
Peer reviewed
Ackerman, Terry A.; Evans, John A. – Applied Psychological Measurement, 1994
The effect of the conditioning score on the results of differential item functioning (DIF) analysis was examined with simulated data. The study demonstrates that results of DIF that rely on a conditioning score can be quite different depending on the conditioning variable that is selected. (SLD)
Descriptors: Construct Validity, Identification, Item Bias, Selection
Peer reviewed
Engelhard, George, Jr. – Educational and Psychological Measurement, 1992
A historical perspective is provided of the concept of invariance in measurement theory, describing sample-invariant item calibration and item-invariant measurement of individuals. Invariance as a key measurement concept is illustrated through the measurement theories of E. L. Thorndike, L. L. Thurstone, and G. Rasch. (SLD)
Descriptors: Behavioral Sciences, Educational History, Measurement Techniques, Psychometrics
Peer reviewed
Matthews, Margaret – Reading in a Foreign Language, 1990
Presents a critical analysis of the paper "Testing Reading Comprehension Skills, Part One," focusing on the inadequacy of taxonomies of skills for describing individual readers' processes and, hence, on their usefulness in test construction. (15 references) (GLR)
Descriptors: Classification, Evaluation, Reading Comprehension, Second Language Learning
Peer reviewed
Oshima, T. C.; Miller, M. David – Applied Psychological Measurement, 1992
How item bias indexes based on item response theory (IRT) identify bias that results from multidimensionality is demonstrated. Simulation results suggest that IRT-based bias indexes detect multidimensional items with bias but do not detect multidimensional items without bias. They also do not confound between-group differences on the primary test.…
Descriptors: Computer Simulation, Item Bias, Item Response Theory, Mathematical Models
Peer reviewed
Muraki, Eiji – Applied Psychological Measurement, 1993
The concept of information functions developed for dichotomous item response models is adapted for the partial credit model, and the information function is used to investigate collapsing and recoding categories of polytomously scored items from the National Assessment of Educational Progress. (SLD)
Descriptors: Equations (Mathematics), Item Response Theory, National Surveys, Psychometrics
Peer reviewed
Kuder, Frederic; Diamond, Esther E.; Zytowski, Donald G. – Educational and Psychological Measurement, 1998
Predictive validity, generally taken to be the prime validity that occupationally normed interest inventories should demonstrate, is dependent on the capacity of an instrument to differentiate between occupations. A comparison of two methods of differentiation shows that a method using proportions of each occupational group to assign item-scoring…
Descriptors: Interest Inventories, Occupational Tests, Predictive Measurement, Predictive Validity
Peer reviewed
Bradlow, Eric T.; Thomas, Neal – Journal of Educational and Behavioral Statistics, 1998
A set of conditions is presented for the validity of inference for Item Response Theory (IRT) models applied to data collected from examinations that allow students to choose a subset of items. Common low-dimensional IRT models estimated by standard methods do not resolve the difficult problems posed by choice-based data. (SLD)
Descriptors: Inferences, Item Response Theory, Models, Selection
Peer reviewed
Katz, Irvin R.; Martinez, Michael E.; Sheehan, Kathleen M.; Tatsuoka, Kikumi K. – Journal of Educational and Behavioral Statistics, 1998
A technique is presented for applying the Rule Space methodology of cognitive diagnosis to assessment in a semantically rich domain. The approach bases diagnosis on item characteristics that are more abstract than individual problem-solving steps. The method is illustrated through a test of architectural knowledge completed by 122 architects. (SLD)
Descriptors: Architects, Architecture, Cognitive Tests, Diagnostic Tests