NotesFAQContact Us
Collection
Advanced
Search Tips
Showing 1 to 15 of 22 results Save | Export
Peer reviewed Peer reviewed
Burnett, J. Dale – Educational and Psychological Measurement, 1974
The general use of the Spearman-Brown formula for calculating the reliability of parallel tests with different lengths is reviewed. The importance of the assumption that the component tests be parallel is noted and the property that parallel tests must be non-negatively correlated is derived. (Author)
Descriptors: Statistical Analysis, Test Reliability, Testing Problems
Peer reviewed Peer reviewed
Green, Samuel B. – Educational and Psychological Measurement, 1981
The proportion of agreement, G, and kappa indexes are shown to differ in how they correct for chance agreements between two observers. On the basis of the findings, it is suggested that no single agreement index is appropriate for all sets of data. (Author/BW)
Descriptors: Comparative Analysis, Measurement Techniques, Test Reliability, Testing Problems
Peer reviewed Peer reviewed
MacCann, Robert G. – Educational and Psychological Measurement, 1989
Levine's equations for random groups and unequally reliable tests can be used to equate two tests through performance on an anchor test. Levine's assumption of a parallelism requirement is not necessary; it is sufficient to assume only that the tests are congeneric, an assumption implicit in linear test equating. (SLD)
Descriptors: Equated Scores, Equations (Mathematics), Latent Trait Theory, Test Reliability
Peer reviewed Peer reviewed
Wagner, Edwin E.; And Others – Educational and Psychological Measurement, 1990
Maximized correlation as an internal reliability estimate for tests with few items was investigated. An actual sampling distribution of maximum correlation--"r" max--was empirically derived from 100 samples of 50 cases each from Rorschach test data and compared with those of alpha and an odd/even split, using 2,020 Rorschach protocols.…
Descriptors: Comparative Analysis, Correlation, Estimation (Mathematics), Sample Size
Peer reviewed Peer reviewed
Piotrowski, Chris; Dunham, Frances Y. – Educational and Psychological Measurement, 1984
Research on the semantic differential technique has provided evidence for variance in Osgood's formulation of dimensions of connotative meaning. Retest data based on Piotrowski's original sample is reported. Results indicate support for stability and consistency of the Evaluation dimension. Moderate consistency was found in scales comprising the…
Descriptors: Elementary Education, Factor Analysis, Factor Structure, Semantic Differential
Peer reviewed Peer reviewed
Yarnold, Paul R. – Educational and Psychological Measurement, 1984
Unreliable profiles impose the difficulty that ordinal and interval relations among the individual's scores become uncertain or unstable. A profile reliability coefficient is derived to estimate the relative expected extent of this ordinal and interval "inversion" for any profile of K measures. (Author/DWH)
Descriptors: Error of Measurement, Mathematical Models, Profiles, Test Reliability
Peer reviewed Peer reviewed
Conger, Anthony J. – Educational and Psychological Measurement, 1983
A paradoxical phenomenon of decreases in reliability as the number of elements averaged over increases is shown to be possible in multifacet reliability procedures (intraclass correlations or generalizability coefficients). Conditions governing this phenomenon are presented along with implications and cautions. (Author)
Descriptors: Generalizability Theory, Test Construction, Test Items, Test Length
Peer reviewed Peer reviewed
Willson, Victor L.; Reynolds, Cecil R. – Educational and Psychological Measurement, 1984
Samples in research on individual and group differences may be selected based on whole scores which differ from the population mean. Children are diagnosed in clinical practice with a whole score. These procedures produce regression to the population mean which can affect accuracy and adequacy of part score interpretations. (Author/DWH)
Descriptors: Correlation, Intelligence Tests, Profiles, Scores
Peer reviewed Peer reviewed
Cureton, Edward E. – Educational and Psychological Measurement, 1971
A derivation of a formula for the stability coefficient is presented and discussed in terms of test reliability over time. (PR)
Descriptors: Error of Measurement, Raw Scores, Statistical Analysis, Test Reliability
Peer reviewed Peer reviewed
Raju, Nambury S. – Educational and Psychological Measurement, 1982
Rajaratnam, Cronbach and Gleser's generalizability formula for stratified-parallel tests and Raju's coefficient beta are generalized to estimate the reliability of a composite of criterion-referenced tests, where the parts have different cutting scores. (Author/GK)
Descriptors: Criterion Referenced Tests, Cutting Scores, Mathematical Formulas, Scoring Formulas
Peer reviewed Peer reviewed
Kuncel, Ruth Boutin; Fiske, Donald W. – Educational and Psychological Measurement, 1974
Four hypotheses regarding stability of response process and response in personality testing are tested and supported. (RC)
Descriptors: College Students, Item Analysis, Personality Measures, Response Style (Tests)
Peer reviewed Peer reviewed
Parish, Thomas S.; Rankin, Charles I. – Educational and Psychological Measurement, 1982
The Nonsexist Personal Attribute Inventory for Children (NPAIC) was administered along with the Piers-Harris scale to children in fifth through eighth grade. A correlation of .49 was found between the two scales. The NPAIC was found to be a reliable, valid self-concept scale for females and males. (Author/GK)
Descriptors: Elementary Secondary Education, Self Concept Measures, Sex Bias, Test Reliability
Peer reviewed Peer reviewed
Andrulis, Richard S.; And Others – Educational and Psychological Measurement, 1978
The effects of repeaters (testees included in both administrations of two forms of a test) on the test equating process are examined. It is shown that repeaters do effect test equating and tend to lower the cutoff point for passing the test. (JKS)
Descriptors: Cutting Scores, Equated Scores, Item Analysis, Scoring
Peer reviewed Peer reviewed
Gordon, Michael E.; Gross, Ronald H. – Educational and Psychological Measurement, 1978
Past practice of operationalizing the concept of fakeability of psychological tests is reviewed. The strengths and weaknesses of these indices are discussed in the light of a proposed new definition of fakeability based upon Naylor's model of measurement accuracy. (Author/JKS)
Descriptors: Psychological Testing, Rating Scales, Response Style (Tests), Test Reliability
Peer reviewed Peer reviewed
Sims, Ronald R. – Educational and Psychological Measurement, 1986
The Learning Style Inventory (LSI) and the newly revised Learning Style Inventory (LSI II) were examined for internal consistency, test-retest reliability, and stability of the four classifications resulting from their scores. Internal consistency was improved in LSI II, but problems with low test-retest indices and classifications stability…
Descriptors: Cognitive Measurement, Cognitive Style, College Students, Higher Education
Previous Page | Next Page ยป
Pages: 1  |  2