ERIC - Search Results

Descriptor

Test Reliability	22
Testing Problems	22
Test Validity	7
College Students	3
Correlation	3
Error of Measurement	3
Higher Education	3
Item Analysis	3
Response Style (Tests)	3
Statistical Analysis	3
Test Construction	3
Behavior Rating Scales	2
Comparative Analysis	2
Cutting Scores	2
Equated Scores	2
Generalizability Theory	2
High Schools	2
Intelligence Tests	2
Measurement Techniques	2
Multiple Choice Tests	2
Profiles	2
Rating Scales	2
Responses	2
Test Interpretation	2
Test Items	2
More ▼

Source

Educational and Psychological…

Publication Type

Journal Articles	14
Reports - Research	12
Reports - Evaluative	4

Education Level

Audience

Location

India

Laws, Policies, & Programs

Assessments and Surveys

Conners Teacher Rating Scale	1
Cornell Critical Thinking Test	1
Differential Aptitude Test	1
Learning Style Inventory	1
Piers Harris Childrens Self…	1
Rorschach Test	1
Watson Glaser Critical…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 22 results Save | Export

Parallel Measurements and the Spearman-Brown Formula

Peer reviewed

Burnett, J. Dale – Educational and Psychological Measurement, 1974

The general use of the Spearman-Brown formula for calculating the reliability of parallel tests with different lengths is reviewed. The importance of the assumption that the component tests be parallel is noted and the property that parallel tests must be non-negatively correlated is derived. (Author)

Descriptors: Statistical Analysis, Test Reliability, Testing Problems

A Comparison of Three Indexes of Agreement between Observers: Proportion of Agreement, G-Index, and Kappa.

Peer reviewed

Green, Samuel B. – Educational and Psychological Measurement, 1981

The proportion of agreement, G, and kappa indexes are shown to differ in how they correct for chance agreements between two observers. On the basis of the findings, it is suggested that no single agreement index is appropriate for all sets of data. (Author/BW)

Descriptors: Comparative Analysis, Measurement Techniques, Test Reliability, Testing Problems

A Derivation of Levine's Formulae (for Equating Unequally Reliable Tests Using Random Groups) without the Assumption of Parallelism.

Peer reviewed

MacCann, Robert G. – Educational and Psychological Measurement, 1989

Levine's equations for random groups and unequally reliable tests can be used to equate two tests through performance on an anchor test. Levine's assumption of a parallelism requirement is not necessary; it is sufficient to assume only that the tests are congeneric, an assumption implicit in linear test equating. (SLD)

Descriptors: Equated Scores, Equations (Mathematics), Latent Trait Theory, Test Reliability

An Empirical Demonstration of the Stability of the Maximized Correlation as an Internal-Consistency Reliability Estimate for Tests of Small Item Size.

Peer reviewed

Wagner, Edwin E.; And Others – Educational and Psychological Measurement, 1990

Maximized correlation as an internal reliability estimate for tests with few items was investigated. An actual sampling distribution of maximum correlation--"r" max--was empirically derived from 100 samples of 50 cases each from Rorschach test data and compared with those of alpha and an odd/even split, using 2,020 Rorschach protocols.…

Descriptors: Comparative Analysis, Correlation, Estimation (Mathematics), Sample Size

Stability of Factor Structure on the Semantic Differential: Retest Data.

Peer reviewed

Piotrowski, Chris; Dunham, Frances Y. – Educational and Psychological Measurement, 1984

Research on the semantic differential technique has provided evidence for variance in Osgood's formulation of dimensions of connotative meaning. Retest data based on Piotrowski's original sample is reported. Results indicate support for stability and consistency of the Evaluation dimension. Moderate consistency was found in scales comprising the…

Descriptors: Elementary Education, Factor Analysis, Factor Structure, Semantic Differential

The Reliability of a Profile.

Peer reviewed

Yarnold, Paul R. – Educational and Psychological Measurement, 1984

Unreliable profiles impose the difficulty that ordinal and interval relations among the individual's scores become uncertain or unstable. A profile reliability coefficient is derived to estimate the relative expected extent of this ordinal and interval "inversion" for any profile of K measures. (Author/DWH)

Descriptors: Error of Measurement, Mathematical Models, Profiles, Test Reliability

One Iota Fills the Quota: A Paradox in Multifacet Reliability Coefficients.

Peer reviewed

Conger, Anthony J. – Educational and Psychological Measurement, 1983

A paradoxical phenomenon of decreases in reliability as the number of elements averaged over increases is shown to be possible in multifacet reliability procedures (intraclass correlations or generalizability coefficients). Conditions governing this phenomenon are presented along with implications and cautions. (Author)

Descriptors: Generalizability Theory, Test Construction, Test Items, Test Length

Regression Effects on Part Scores Based on Whole-Score Selected Samples.

Peer reviewed

Willson, Victor L.; Reynolds, Cecil R. – Educational and Psychological Measurement, 1984

Samples in research on individual and group differences may be selected based on whole scores which differ from the population mean. Children are diagnosed in clinical practice with a whole score. These procedures produce regression to the population mean which can affect accuracy and adequacy of part score interpretations. (Author/DWH)

Descriptors: Correlation, Intelligence Tests, Profiles, Scores

The Stability Coefficient

Peer reviewed

Cureton, Edward E. – Educational and Psychological Measurement, 1971

A derivation of a formula for the stability coefficient is presented and discussed in terms of test reliability over time. (PR)

Descriptors: Error of Measurement, Raw Scores, Statistical Analysis, Test Reliability

The Reliability of a Criterion-Referenced Composite with the Parts of the Composite Having Different Cutting Scores.

Peer reviewed

Raju, Nambury S. – Educational and Psychological Measurement, 1982

Rajaratnam, Cronbach and Gleser's generalizability formula for stratified-parallel tests and Raju's coefficient beta are generalized to estimate the reliability of a composite of criterion-referenced tests, where the parts have different cutting scores. (Author/GK)

Descriptors: Criterion Referenced Tests, Cutting Scores, Mathematical Formulas, Scoring Formulas

Stability of Response Process and Response

Peer reviewed

Kuncel, Ruth Boutin; Fiske, Donald W. – Educational and Psychological Measurement, 1974

Four hypotheses regarding stability of response process and response in personality testing are tested and supported. (RC)

Descriptors: College Students, Item Analysis, Personality Measures, Response Style (Tests)

The Nonsexist Personal Attribute Inventory for Children: A Report on Its Validity and Reliability as a Self-Concept Scale.

Peer reviewed

Parish, Thomas S.; Rankin, Charles I. – Educational and Psychological Measurement, 1982

The Nonsexist Personal Attribute Inventory for Children (NPAIC) was administered along with the Piers-Harris scale to children in fifth through eighth grade. A correlation of .49 was found between the two scales. The NPAIC was found to be a reliable, valid self-concept scale for females and males. (Author/GK)

Descriptors: Elementary Secondary Education, Self Concept Measures, Sex Bias, Test Reliability

The Effects of Repeaters on Test Equating.

Peer reviewed

Andrulis, Richard S.; And Others – Educational and Psychological Measurement, 1978

The effects of repeaters (testees included in both administrations of two forms of a test) on the test equating process are examined. It is shown that repeaters do effect test equating and tend to lower the cutoff point for passing the test. (JKS)

Descriptors: Cutting Scores, Equated Scores, Item Analysis, Scoring

A Critique of Methods for Operationalizing the Concept of Fakeability.

Peer reviewed

Gordon, Michael E.; Gross, Ronald H. – Educational and Psychological Measurement, 1978

Past practice of operationalizing the concept of fakeability of psychological tests is reviewed. The strengths and weaknesses of these indices are discussed in the light of a proposed new definition of fakeability based upon Naylor's model of measurement accuracy. (Author/JKS)

Descriptors: Psychological Testing, Rating Scales, Response Style (Tests), Test Reliability

The Reliability and Classification Stability of the Learning Style Inventory.

Peer reviewed

Sims, Ronald R. – Educational and Psychological Measurement, 1986

The Learning Style Inventory (LSI) and the newly revised Learning Style Inventory (LSI II) were examined for internal consistency, test-retest reliability, and stability of the four classifications resulting from their scores. Internal consistency was improved in LSI II, but problems with low test-retest indices and classifications stability…

Descriptors: Cognitive Measurement, Cognitive Style, College Students, Higher Education

Previous Page | Next Page »

Pages: 1 | 2

Conger, Anthony J.	2
Andrulis, Richard S.	1
Burnett, J. Dale	1
Ceurvorst, Robert W.	1
Chatterji, S.	1
Cureton, Edward E.	1
Dunham, Frances Y.	1
Ebel, Robert L.	1
Fiske, Donald W.	1
Gordon, Michael E.	1
Green, Samuel B.	1
Gross, Ronald H.	1
Krus, David J.	1
Kuncel, Ruth Boutin	1
MacCann, Robert G.	1
Maurer, Todd J.	1
Michael, William B.	1
Modjeski, Richard B.	1
Mukerjee, Manjula	1
Parish, Thomas S.	1
Piotrowski, Chris	1
Raju, Nambury S.	1
Rankin, Charles I.	1
Reynolds, Cecil R.	1
More ▼