NotesFAQContact Us
Collection
Advanced
Search Tips
Publication Date
In 20250
Since 20240
Since 2021 (last 5 years)0
Since 2016 (last 10 years)3
Since 2006 (last 20 years)8
Audience
Researchers4
Laws, Policies, & Programs
What Works Clearinghouse Rating
Showing 1 to 15 of 25 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Hidalgo, Ma Dolores; Benítez, Isabel; Padilla, Jose-Luis; Gómez-Benito, Juana – Sociological Methods & Research, 2017
The growing use of scales in survey questionnaires warrants the need to address how does polytomous differential item functioning (DIF) affect observed scale score comparisons. The aim of this study is to investigate the impact of DIF on the type I error and effect size of the independent samples t-test on the observed total scale scores. A…
Descriptors: Test Items, Test Bias, Item Response Theory, Surveys
Peer reviewed Peer reviewed
Direct linkDirect link
Huggins-Manley, Anne Corinne – Educational and Psychological Measurement, 2017
This study defines subpopulation item parameter drift (SIPD) as a change in item parameters over time that is dependent on subpopulations of examinees, and hypothesizes that the presence of SIPD in anchor items is associated with bias and/or lack of invariance in three psychometric outcomes. Results show that SIPD in anchor items is associated…
Descriptors: Psychometrics, Test Items, Item Response Theory, Hypothesis Testing
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Öztürk-Gübes, Nese; Kelecioglu, Hülya – Educational Sciences: Theory and Practice, 2016
The purpose of this study was to examine the impact of dimensionality, common-item set format, and different scale linking methods on preserving equity property with mixed-format test equating. Item response theory (IRT) true-score equating (TSE) and IRT observed-score equating (OSE) methods were used under common-item nonequivalent groups design.…
Descriptors: Test Format, Item Response Theory, True Scores, Equated Scores
Powers, Sonya; Turhan, Ahmet; Binici, Salih – Pearson, 2012
The population sensitivity of vertical scaling results was evaluated for a state reading assessment spanning grades 3-10 and a state mathematics test spanning grades 3-8. Subpopulations considered included males and females. The 3-parameter logistic model was used to calibrate math and reading items and a common item design was used to construct…
Descriptors: Scaling, Equated Scores, Standardized Tests, Reading Tests
Peer reviewed Peer reviewed
Direct linkDirect link
Slonim-Nevo, Vered; Nevo, Isaac – Journal of Mixed Methods Research, 2009
Combining diverse methods in a single study raises a problem: What should be done when the findings of one method of investigation conflict with those of another? The authors illustrate this problem using an example in which three study phases--quantitative, qualitative, and intervention--are applied. The findings from the quantitative phase did…
Descriptors: Methods Research, Immigration, Statistical Analysis, Qualitative Research
Peer reviewed Peer reviewed
Vegelius, Jan – Educational and Psychological Measurement, 1977
Generalizations of the G index as a measure of similarity between persons beyond the dichotomous situation are discussed. An attempt is made to present a generalization that does not require dichotomization of the items for cases where the number of response alternatives may differ. (Author/JKS)
Descriptors: Correlation, Item Analysis, Measurement Techniques, Multidimensional Scaling
Peer reviewed Peer reviewed
Vegelius, Jan – Educational and Psychological Measurement, 1979
The computer program WEIGAN makes the weighted G analysis available for computer users. The input and output of the program are described. (Author/JKS)
Descriptors: Computer Programs, Correlation, Factor Analysis, Item Analysis
Peer reviewed Peer reviewed
Yen, Wendy M. – Psychometrika, 1985
An approximate relationship is devised between the unidimensional model used in data analysis and a multidimensional model hypothesized to be generating the item responses. Scale shrinkage is successfully predicted for several sets of simulated data. (Author/LMO)
Descriptors: Difficulty Level, Hypothesis Testing, Item Analysis, Latent Trait Theory
Peer reviewed Peer reviewed
Lautenschlager, Gary J.; Park, Dong-Gun – Applied Psychological Measurement, 1988
The consequences of using item response theory (IRT) item bias detecting procedures with multidimensional IRT item data are examined. Limitations in procedures for detecting item bias are discussed. (SLD)
Descriptors: Item Analysis, Latent Trait Theory, Mathematical Models, Multidimensional Scaling
Peer reviewed Peer reviewed
Reynolds, Thomas J. – Educational and Psychological Measurement, 1981
Cliff's Index "c" derived from an item dominance matrix is utilized in a clustering approach, termed extracting Reliable Guttman Orders (ERGO), to isolate Guttman-type item hierarchies. A comparison of factor analysis to the ERGO is made on social distance data involving multiple ethnic groups. (Author/BW)
Descriptors: Cluster Analysis, Difficulty Level, Factor Analysis, Item Analysis
Peer reviewed Peer reviewed
Direct linkDirect link
Walker, Cindy M.; Azen, Razia; Schmitt, Thomas – Educational and Psychological Measurement, 2006
It is believed by some that most tests are multidimensional, meaning that they measure more than one underlying construct. The primary objective of this study is to illustrate how variations in the secondary ability distribution affect the statistical detection of dimensionality and to demonstrate the difference between substantive and statistical…
Descriptors: Multidimensional Scaling, Item Response Theory, Comparative Testing, Statistical Analysis
Korpi, Meg; Haertel, Edward – 1984
The purpose of this paper is to further the cause of clarifying construct interpretations of tests, by proposing that non-metric multidimensional scaling may be more useful than factor analysis or other latent structure models for investigating the internal structure of tests. It also suggests that typical problems associated with scaling…
Descriptors: Correlation, Factor Structure, Intermediate Grades, Item Analysis
Baker, Frank B.; Hoyt, Cyril J. – 1972
A scaling technique known as the Method of Reciprocal Averages has been in use since the early 1930's. This technique yields a set of item response weights for a psychological inventory which maximizes the internal consistency of the inventory for a group of subjects. Although the technique has been used for many years, its mathematical…
Descriptors: Analysis of Variance, Correlation, Evaluation Methods, Item Analysis
Smith, Donald M. – 1974
The concept of scaled achievement tests is discussed and a method of selecting those items of a test that form the most scalable (i.e., having the highest coefficient of reproducibility) subset is presented. Sometimes called a monotonic-deterministic model, this type of test assumes that the test items may be sequentially ordered. To determine the…
Descriptors: Achievement Tests, Arithmetic, Difficulty Level, Item Analysis
Reynolds, Thomas J. – 1976
A method of factor extraction specific to a binary matrix, illustrated here as a person-by-item response matrix, is presented. The extraction procedure, termed ERGO, differs from the more commonly implemented dimensionalizing techniques, factor analysis and multidimensional scaling, by taking into consideration item difficulty. Utilized in the…
Descriptors: Discriminant Analysis, Factor Analysis, Item Analysis, Matrices
Previous Page | Next Page »
Pages: 1  |  2