Parshall, Cynthia G.; Miller, Timothy R. – Journal of Educational Measurement, 1995 (peer reviewed)
Exact testing was evaluated as a method for conducting Mantel-Haenszel differential item functioning (DIF) analyses with relatively small samples. A series of computer simulations found that the asymptotic Mantel-Haenszel and the exact method yielded very similar results across sample size, levels of DIF, and data sets. (SLD)
Descriptors: Comparative Analysis, Computer Simulation, Identification, Item Bias
Bacon, Donald R.; And Others – Educational and Psychological Measurement, 1995 (peer reviewed)
The potential for bias in reliability estimation and for errors in item selection when alpha or unit-weighted omega coefficients are used is explored under simulated conditions. Results suggest that composite reliability may be an assessment tool but should not be an item selection tool in structural equations. (SLD)
Descriptors: Bias, Estimation (Mathematics), Reliability, Selection
Ackerman, Terry A.; Evans, John A. – Applied Psychological Measurement, 1994 (peer reviewed)
The effect of the conditioning score on the results of differential item functioning (DIF) analysis was examined with simulated data. The study demonstrates that results of DIF that rely on a conditioning score can be quite different depending on the conditioning variable that is selected. (SLD)
Descriptors: Construct Validity, Identification, Item Bias, Selection
Engelhard, George, Jr. – Educational and Psychological Measurement, 1992 (peer reviewed)
A historical perspective is provided of the concept of invariance in measurement theory, describing sample-invariant item calibration and item-invariant measurement of individuals. Invariance as a key measurement concept is illustrated through the measurement theories of E. L. Thorndike, L. L. Thurstone, and G. Rasch. (SLD)
Descriptors: Behavioral Sciences, Educational History, Measurement Techniques, Psychometrics
Matthews, Margaret – Reading in a Foreign Language, 1990 (peer reviewed)
Presents critical analysis of a paper "Testing Reading Comprehension Skills, Part One," in which the consideration concerns the inadequacy of taxonomies of skills to describe individual readers' processes and, hence, their usefulness in test construction. (15 references) (GLR)
Descriptors: Classification, Evaluation, Reading Comprehension, Second Language Learning
Oshima, T. C.; Miller, M. David – Applied Psychological Measurement, 1992 (peer reviewed)
How item bias indexes based on item response theory (IRT) identify bias that results from multidimensionality is demonstrated. Simulation results suggest that IRT-based bias indexes detect multidimensional items with bias but do not detect multidimensional items without bias. They also do not confound between-group differences on the primary test.…
Descriptors: Computer Simulation, Item Bias, Item Response Theory, Mathematical Models
Muraki, Eiji – Applied Psychological Measurement, 1993 (peer reviewed)
The concept of information functions developed for dichotomous item response models is adapted for the partial credit model, and the information function is used to investigate collapsing and recoding categories of polytomously scored items from the National Assessment of Educational Progress. (SLD)
Descriptors: Equations (Mathematics), Item Response Theory, National Surveys, Psychometrics
Kuder, Frederic; Diamond, Esther E.; Zytowski, Donald G. – Educational and Psychological Measurement, 1998 (peer reviewed)
Predictive validity, generally taken to be the prime validity that occupationally normed interest inventories should demonstrate, is dependent on the capacity of an instrument to differentiate between occupations. A comparison of two methods of differentiation shows that a method using proportions of each occupational group to assign item-scoring…
Descriptors: Interest Inventories, Occupational Tests, Predictive Measurement, Predictive Validity
Bradlow, Eric T.; Thomas, Neal – Journal of Educational and Behavioral Statistics, 1998 (peer reviewed)
A set of conditions is presented for the validity of inference for Item Response Theory (IRT) models applied to data collected from examinations that allow students to choose a subset of items. Common low-dimensional IRT models estimated by standard methods do not resolve the difficult problems posed by choice-based data. (SLD)
Descriptors: Inferences, Item Response Theory, Models, Selection
Katz, Irvin R.; Martinez, Michael E.; Sheehan, Kathleen M.; Tatsuoka, Kikumi K. – Journal of Educational and Behavioral Statistics, 1998 (peer reviewed)
A technique is presented for applying the Rule Space methodology of cognitive diagnosis to assessment in a semantically rich domain. The approach bases diagnosis on item characteristics that are more abstract than individual problem-solving steps. The method is illustrated through a test of architectural knowledge completed by 122 architects. (SLD)
Descriptors: Architects, Architecture, Cognitive Tests, Diagnostic Tests
Powers, Donald E.; Bennett, Randy Elliot – Applied Measurement in Education, 1999 (peer reviewed)
Explored how allowing examinees to select test questions affected examinee performance and test characteristics for a measure of ability to generate hypotheses about a situation. Results with 2,429 examinees who elected the choice condition on the Graduate Record Examination suggest that items are differentially attractive to examinees. (SLD)
Descriptors: Ability, College Students, Higher Education, Responses
Higgins, N. C.; Zumbo, Bruno D.; Hay, Jana L. – Educational and Psychological Measurement, 1999 (peer reviewed)
Confirmatory factor analysis of data from 1,346 respondents to the Attributional Style Questionnaire (ASQ) (C. Peterson and others, 1982) reveals that adequate fit is provided by a three-factor attributional style model that includes context-dependent item sets. Results suggest that there is no such thing as a nonsituational attributional style.…
Descriptors: Adults, Attribution Theory, Construct Validity, Context Effect
Perlow, Richard; Moore, D. De Wayne; Kyle, Rebecca; Killen, Thomas – Educational and Psychological Measurement, 1999 (peer reviewed)
Examined a set of working memory scales containing two versions of test items that are reading and mathematics based. Data from 201 undergraduates support the hypothesis that an oblique two-factor model in which the factors are based on item content would fit the data well. (SLD)
Descriptors: Factor Structure, Higher Education, Mathematics, Models
Nandakumar, Ratna; Yu, Feng; Li, Hsin-Hung; Stout, William – Applied Psychological Measurement, 1998 (peer reviewed)
Investigated the performance of the Poly-DIMTEST (PD) procedure (and associated computer program) in assessing the unidimensionality of test data produced by polytomous items through Monte Carlo simulation. Results show that PD can confirm unidimensionality for unidimensional simulated data and can detect lack of unidimensionality. (SLD)
Descriptors: Evaluation Methods, Item Response Theory, Monte Carlo Methods, Simulation
Raykov, Tenko – Applied Psychological Measurement, 1998 (peer reviewed)
Examines the relationship between Cronbach's coefficient alpha and the reliability of a composite of a prespecified set of interrelated nonhomogeneous components through simulation. Shows that alpha can over- or underestimate scale reliability at the population level. Illustrates the bias in terms of structural parameters. (SLD)
Descriptors: Reliability, Simulation, Statistical Bias, Structural Equation Models


