NotesFAQContact Us
Collection
Advanced
Search Tips
Showing 1 to 15 of 20 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Wang, Wen-Chung – Applied Psychological Measurement, 2008
Raju and Oshima (2005) proposed two prophecy formulas based on item response theory in order to predict the reliability of ability estimates for a test after change in its length. The first prophecy formula is equivalent to the classical Spearman-Brown prophecy formula. The second prophecy formula is misleading because of an underlying false…
Descriptors: Test Reliability, Item Response Theory, Computation, Evaluation Methods
Peer reviewed Peer reviewed
Millsap, Roger E. – Applied Psychological Measurement, 1988
Two new methods for constructing a credibility interval (CI)--an interval containing a specified proportion of true validity description--are discussed, from a frequentist perspective. Tolerance intervals, unlike the current method of constructing the CI, have performance characteristics across repeated applications and may be useful in validity…
Descriptors: Bayesian Statistics, Meta Analysis, Statistical Analysis, Test Reliability
Peer reviewed Peer reviewed
Dawes, Robyn M. – Applied Psychological Measurement, 1977
Staff members of the Psychology department at the University of Oregon rated each other's height on five rating scales representative of those found in social psychology. Average ratings proved to be very good estimates of height. (Author/JKS)
Descriptors: College Faculty, Height, Males, Measurement Techniques
Peer reviewed Peer reviewed
Waters, Brian K. – Applied Psychological Measurement, 1977
The validity and utility of the stratified adaptive computerized testing model (stradaptive) developed by Weiss are empirically investigated. The model presents a tailored testing strategy based upon Binet IQ measurement theory and Lord's modern test theory. (Author/RC)
Descriptors: Ability, Adaptive Testing, Computer Oriented Programs, Item Banks
Peer reviewed Peer reviewed
Raben, Charles S.; And Others – Applied Psychological Measurement, 1978
Two studies are reported which investigated the construct validity and reliability of the Ghiselli Self-Description Inventory as a measure of self-esteem. The first study, using a multitrait-multimethod matrix, found little evidence for the construct validity of the instrument. The second study found a significant, although low, reliability. (…
Descriptors: Achievement Need, Higher Education, Locus of Control, Self Concept Measures
Peer reviewed Peer reviewed
McGarvey, Bill; And Others – Applied Psychological Measurement, 1977
The most consistently used scoring system for the rod-and-frame task has been the total number of degrees in error from the true vertical. Since a logical case can be made for at least four alternative scoring systems, a thorough comparison of all five systems was performed. (Author/CTM)
Descriptors: Analysis of Variance, Cognitive Style, Cognitive Tests, Elementary Education
Peer reviewed Peer reviewed
Patterson, Henry O,; Milakofsky, Louis – Applied Psychological Measurement, 1980
Adapting curricula to the cognitive developmental level of students has been hindered by the difficulty of assessing those levels in students. The reliability and validity of a paper-and-pencil Piagetian assessment are discussed. (Author/ JKS)
Descriptors: Cognitive Development, Cognitive Measurement, Elementary Secondary Education, Grade 3
Peer reviewed Peer reviewed
Laosa, Luis M. – Applied Psychological Measurement, 1980
A technique to measure maternal teaching strategies was developed for possible use in research and evaluation studies. Scores derived from the technique describe quality and quanitity of behaviors used by mothers to teach cognitive-perceptual tasks to their own young children. Reliability and validity data are presented. (Author/JKS)
Descriptors: Cultural Differences, Measurement Techniques, Mothers, Observation
Peer reviewed Peer reviewed
Burisch, Matthias – Applied Psychological Measurement, 1978
Sets of inventory scales were constructed from a common item pool, using variants of what are here called the Inductive, Deductive, and External strategies. Peer ratings for 21 traits served as criteria. Very little variation in validity was attributable to construction strategies. (Author/CTM)
Descriptors: Deduction, Foreign Countries, Higher Education, Induction
Peer reviewed Peer reviewed
Luecht, Richard M. – Applied Psychological Measurement, 1996
The example of a medical licensure test is used to demonstrate situations in which complex, integrated content must be balanced at the total test level for validity reasons, but items assigned to reportable subscore categories may be used under a multidimensional item response theory adaptive paradigm to improve subscore reliability. (SLD)
Descriptors: Adaptive Testing, Certification, Computer Assisted Testing, Licensing Examinations (Professions)
Peer reviewed Peer reviewed
Davison, Mark L.; Robbins, Stephen – Applied Psychological Measurement, 1978
Empirically weighted scores for Rest's Defining Issues Test were found to be more reliable than the simple sum of scores theoretically weighted sum, or Rest's p scores. They also had slightly higher correlations with Kohlberg's interview scores. Empirically weighted scores also showed more significant change in two longitudinal studies. (CTM)
Descriptors: Higher Education, Longitudinal Studies, Moral Development, Moral Values
Peer reviewed Peer reviewed
Schmeck, Ronald Ray; And Others – Applied Psychological Measurement, 1977
Five studies are presented describing the development of a self-report inventory for measuring individual differences in learning processes. Factor analysis of items yielded four scales: Synthesis-Analysis, Study Methods, Fact Retention, and Elaborative Processing. There were no sex differences, and the scales demonstrated acceptable reliabilities…
Descriptors: Factor Analysis, Higher Education, Learning Processes, Retention (Psychology)
Peer reviewed Peer reviewed
Menasco, Michael B.; Curry, David J. – Applied Psychological Measurement, 1978
Scores on the Role Construct Repertory Test exhibited significant correlations with other forms of cognitive functioning, including American College Test scores in science and mathematics for a group of 79 college students. The Grid Form of the test was used. Test-retest reliability was low. (Author/CTM)
Descriptors: Achievement Tests, Cognitive Processes, Cognitive Style, Cognitive Tests
Peer reviewed Peer reviewed
Goh, David S. – Applied Psychological Measurement, 1979
The advantages of using psychometric thoery to design short forms of intelligence tests are demonstrated by comparing such usage to a systematic random procedure that has previously been used. The Wechsler Intelligence Scale for Children Revised (WISC-R) Short Form is presented as an example. (JKS)
Descriptors: Elementary Secondary Education, Intelligence Tests, Item Analysis, Psychometrics
Peer reviewed Peer reviewed
And Others; Mann, Irene T. – Applied Psychological Measurement, 1979
Several methodological problems (particularly the assumed bipolarity of scales, instructions regarding use of the midpoint, and concept-scale interaction) which may contribute to a lack of precision in the semantic differential technique were investigated. Results generally supported the use of the semantic differential. (Author/JKS)
Descriptors: Analysis of Variance, Computer Assisted Testing, Higher Education, Rating Scales
Previous Page | Next Page ยป
Pages: 1  |  2