ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	0
Since 2006 (last 20 years)	4

Descriptor

Measurement Techniques	12
Reliability	8
Test Theory	5
Correlation	4
Scores	4
Test Reliability	4
Achievement Gains	3
Change	3
Sampling	3
Bias	2
Comparative Analysis	2
Computation	2
Error of Measurement	2
Test Items	2
Test Validity	2
Anxiety	1
Classification	1
Cognitive Tests	1
College Faculty	1
Criterion Referenced Tests	1
Cultural Differences	1
Elementary School Students	1
Equations (Mathematics)	1
Foreign Countries	1
Generalization	1
More ▼

Source

Applied Psychological…

Author

Alsawalmeh, Yousef M.	2
Feldt, Leonard S.	2
Almehrizi, Rashid S.	1
Biswas, Ajoy Kumar	1
Brennan, Robert L.	1
Collins, Linda M.	1
Dawes, Robyn M.	1
Geuens, Maggie	1
Hambleton, Ronald K., Ed.	1
Humphreys, Lloyd G.	1
Laosa, Luis M.	1
Lee, Won-Chan	1
Schillewaert, Niels	1
Wan, Lei	1
Weijters, Bert	1
Williams, Richard H.	1
Zimmerman, Donald W.	1
More ▼

Publication Type

Journal Articles	11
Reports - Evaluative	6
Book/Product Reviews	3
Reports - Research	3
Collected Works - Serials	1
Reports - Descriptive	1

Education Level

Early Childhood Education	1
Elementary Education	1
Grade 2	1
Higher Education	1
Postsecondary Education	1
Primary Education	1

Audience

Location

Belgium

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 12 results Save | Export

The Individual Consistency of Acquiescence and Extreme Response Style in Self-Report Questionnaires

Peer reviewed

Direct link

Weijters, Bert; Geuens, Maggie; Schillewaert, Niels – Applied Psychological Measurement, 2010

The severity of bias in respondents' self-reports due to acquiescence response style (ARS) and extreme response style (ERS) depends strongly on how consistent these response styles are over the course of a questionnaire. In the literature, different alternative hypotheses on response style (in)consistency circulate. Therefore, nine alternative…

Descriptors: Models, Response Style (Tests), Questionnaires, Measurement Techniques

Coefficient Alpha and Reliability of Scale Scores

Peer reviewed

Direct link

Almehrizi, Rashid S. – Applied Psychological Measurement, 2013

The majority of large-scale assessments develop various score scales that are either linear or nonlinear transformations of raw scores for better interpretations and uses of assessment results. The current formula for coefficient alpha (a; the commonly used reliability coefficient) only provides internal consistency reliability estimates of raw…

Descriptors: Raw Scores, Scaling, Reliability, Computation

Classification Consistency and Accuracy for Complex Assessments under the Compound Multinomial Model

Peer reviewed

Direct link

Lee, Won-Chan; Brennan, Robert L.; Wan, Lei – Applied Psychological Measurement, 2009

For a test that consists of dichotomously scored items, several approaches have been reported in the literature for estimating classification consistency and accuracy indices based on a single administration of a test. Classification consistency and accuracy have not been studied much, however, for "complex" assessments--for example,…

Descriptors: Classification, Reliability, Test Items, Scoring

Testing the Equality of Two Related Intraclass Reliability Coefficients.

Peer reviewed

Alsawalmeh, Yousef M.; Feldt, Leonard S. – Applied Psychological Measurement, 1994

An approximate statistical test of the equality of two intraclass reliability coefficients based on the same sample of people is derived. Such a test is needed when a researcher wishes to compare the reliability of two measurement procedures, and both procedures can be applied to results from the same group. (SLD)

Descriptors: Comparative Analysis, Measurement Techniques, Reliability, Sampling

Reliability of Total Test Scores When Considered as Ordinal Measurements

Peer reviewed

Direct link

Biswas, Ajoy Kumar – Applied Psychological Measurement, 2006

This article studies the ordinal reliability of (total) test scores. This study is based on a classical-type linear model of observed score (X), true score (T), and random error (E). Based on the idea of Kendall's tau-a coefficient, a measure of ordinal reliability for small-examinee populations is developed. This measure is extended to large…

Descriptors: True Scores, Test Theory, Test Reliability, Scores

Is Reliability Obsolete? A Commentary on "Are Simple Gain Scores Obsolete?"

Peer reviewed

Collins, Linda M. – Applied Psychological Measurement, 1996

The clarification provided by Williams and Zimmerman on the reliability of gain scores is translated into recognizable patterns of change that tend to produce reliable or unreliable gain scores. The relevance of the traditional idea of reliability to the measurement of change is also discussed. (SLD)

Descriptors: Achievement Gains, Change, Measurement Techniques, Reliability

Suppose We Measured Height With Rating Scales Instead of Rulers

Peer reviewed

Dawes, Robyn M. – Applied Psychological Measurement, 1977

Staff members of the Psychology department at the University of Oregon rated each other's height on five rating scales representative of those found in social psychology. Average ratings proved to be very good estimates of height. (Author/JKS)

Descriptors: College Faculty, Height, Males, Measurement Techniques

Test of the Hypothesis that the Intraclass Reliability Coefficient Is the Same for Two Measurement Procedures.

Peer reviewed

Alsawalmeh, Yousef M.; Feldt, Leonard S. – Applied Psychological Measurement, 1992

An approximate statistical test is derived for the hypothesis that the intraclass reliability coefficients associated with two measurement procedures are equal. Control of Type 1 error is investigated by comparing empirical sampling distributions of the test statistic with its derived theoretical distribution. A numerical illustration is…

Descriptors: Equations (Mathematics), Hypothesis Testing, Mathematical Models, Measurement Techniques

Linear Dependence on Gain Scores in Their Components Imposes Constraints on Their Use and Interpretation: Comment on "Are Simple Gain Scores Obsolete?"

Peer reviewed

Humphreys, Lloyd G. – Applied Psychological Measurement, 1996

The reliability of a gain is determined by the reliabilities of the components, the correlation between them, and their standard deviations. Reliability is not inherently low, but the components of gains in many investigations make low reliability likely and require caution in the use of gain scores. (SLD)

Descriptors: Achievement Gains, Change, Correlation, Error of Measurement

Commentary on the Commentaries of Collins and Humphreys.

Peer reviewed

Williams, Richard H.; Zimmerman, Donald W. – Applied Psychological Measurement, 1996

The critiques by L. Collins and L. Humphreys in this issue illustrate problems with the use of gain scores. Collins' examples show that familiar formulas for the reliability of differences do not reflect the precision of measures of change. Additional examples demonstrate flaws in the conventional approach to reliability. (SLD)

Descriptors: Achievement Gains, Change, Correlation, Error of Measurement

Measures for the Study of Maternal Teaching Strategies.

Peer reviewed

Laosa, Luis M. – Applied Psychological Measurement, 1980

A technique to measure maternal teaching strategies was developed for possible use in research and evaluation studies. Scores derived from the technique describe quality and quanitity of behaviors used by mothers to teach cognitive-perceptual tasks to their own young children. Reliability and validity data are presented. (Author/JKS)

Descriptors: Cultural Differences, Measurement Techniques, Mothers, Observation

Contributions to Criterion-Referenced Testing Technology.

Peer reviewed

Hambleton, Ronald K., Ed. – Applied Psychological Measurement, 1980

This special issue covers recent technical developments in the field of criterion-referenced testing. An introduction, six papers, and two commentaries dealing with test development, test score uses, and evaluation of scores review relevant literature, offer new models and/or results, and suggest directions for additional research. (SLD)

Descriptors: Criterion Referenced Tests, Mastery Tests, Measurement Techniques, Standard Setting (Scoring)