NotesFAQContact Us
Collection
Advanced
Search Tips
What Works Clearinghouse Rating
Showing 391 to 405 of 1,161 results Save | Export
Peer reviewed Peer reviewed
Collins, Linda M.; Cliff, Norman – Psychometrika, 1985
The axioms of a three-set Guttman simplex model are presented and the effects of relaxing the axioms for one of the three sets are examined. This model can be used to define longitudinal developmental scales. (NSF)
Descriptors: Mathematical Models, Measurement Techniques, Scaling, Test Construction
Peer reviewed Peer reviewed
Zimmerman, Donald W. – Educational and Psychological Measurement, 1983
A definition of test validity as the ratio of a covariance term to a variance term, analogous to the classical definition of test reliability, is proposed. When error scores on distinct tests are uncorrelated, the proposed definition coincides with the usual one, but it remains meaningful when error scores are correlated. (Author/BW)
Descriptors: Definitions, Mathematical Formulas, Mathematical Models, Test Theory
Marzano, Robert J. – 2000
There has been little discussion of two conventions common within classroom assessment: the convention of representing student's performance on an assessment using a single score; and the convention of using the average score to summarize a student's performance over a set of assessments. This paper attempts to demonstrate that the assumptions…
Descriptors: Elementary Secondary Education, Scoring, Teacher Made Tests, Test Theory
Peer reviewed Peer reviewed
Conger, Anthony J. – Educational and Psychological Measurement, 1980
Reliability maximizing weights are related to theoretically specified true score scaling weights to show a constant relationship that is invariant under separate linear tranformations on each variable in the system. Test theoretic relations should be derived for the most general model available and not for unnecessarily constrained models.…
Descriptors: Mathematical Formulas, Scaling, Test Reliability, Test Theory
Peer reviewed Peer reviewed
Divgi, D. R. – Applied Psychological Measurement, 1980
The dependence of reliability indices for mastery tests on mean and cutoff scores was examined in the case of three decision-theoretic indices. Dependence of kappa on mean and cutoff scores was opposite to that of the proportion of correct decisions, which was linearly related to average threshold loss. (Author/BW)
Descriptors: Classification, Cutting Scores, Mastery Tests, Test Reliability
Peer reviewed Peer reviewed
Vegelius, Jan – Educational and Psychological Measurement, 1979
A new measure of similarity between persons applicable in Q-analysis is proposed. It allows assumptions of non-orthogonality between the items, across which the similarity is computed. The similarity measure may also be applied in an R-analysis. (Author/JKS)
Descriptors: Correlation, Item Analysis, Q Methodology, Test Construction
Peer reviewed Peer reviewed
Collins, Linda M. – Applied Psychological Measurement, 1996
The clarification provided by Williams and Zimmerman on the reliability of gain scores is translated into recognizable patterns of change that tend to produce reliable or unreliable gain scores. The relevance of the traditional idea of reliability to the measurement of change is also discussed. (SLD)
Descriptors: Achievement Gains, Change, Measurement Techniques, Reliability
Peer reviewed Peer reviewed
Andrich, David – Studies in Educational Evaluation, 2002
Uses a framework previously developed to relate outcomes based education and B. Bloom's "Taxonomy of Educational Objectives" to consider ways in which modern test theory can be used to connect aspects of assessment to the curriculum framework and to consider insights this connection might provide. (SLD)
Descriptors: Curriculum, Models, Outcome Based Education, Test Construction
Peer reviewed Peer reviewed
Sanders, Piet F.; Verschoor, Alfred J. – Applied Psychological Measurement, 1998
Presents minimization and maximization models for parallel test construction under constraints. The minimization model constructs weakly and strongly parallel tests of minimum length, while the maximization model constructs weakly and strongly parallel tests with maximum test reliability. (Author/SLD)
Descriptors: Algorithms, Models, Reliability, Test Construction
Peer reviewed Peer reviewed
Direct linkDirect link
Molenaar, Peter C. M. – Measurement: Interdisciplinary Research and Perspectives, 2004
Psychology is focused on variation between cases (interindividual variation). Results thus obtained are considered to be generalizable to the understanding and explanation of variation within single cases (intraindividual variation). It is indicated, however, that the direct consequences of the classical ergodic theorems for psychology and…
Descriptors: Psychology, Psychometrics, Developmental Psychology, Personality Theories
Peer reviewed Peer reviewed
Direct linkDirect link
Krijnen, Wim P. – Psychometrika, 2004
In many instances it is reasonable to assume that the population covariance matrix has positive elements. This assumption implies for the single factor analysis model that the loadings and regression weights for best linear factor prediction are positive. For the multiple factor analysis model where each variable loads on a single factor and a…
Descriptors: Test Theory, Structural Equation Models, Factor Analysis, Prediction
Peer reviewed Peer reviewed
Direct linkDirect link
Bush, Martin E. – Quality Assurance in Education: An International Perspective, 2006
Purpose: To provide educationalists with an understanding of the key quality issues relating to multiple-choice tests, and a set of guidelines for the quality assurance of such tests. Design/methodology/approach: The discussion of quality issues is structured to reflect the order in which those issues naturally arise. It covers the design of…
Descriptors: Multiple Choice Tests, Test Reliability, Educational Quality, Quality Control
Peer reviewed Peer reviewed
Direct linkDirect link
Gest, Scott D.; Davidson, Alice J.; Rulison, Kelly L.; Moody, James; Welsh, Janet A. – New Directions for Child and Adolescent Development, 2007
The near universality of gender segregation in middle childhood and early adolescence has stimulated extensive research on sex differences in peer relationship processes. Recent reviews of the literature suggest that although some claims of two-cultures theory have clear empirical support, such as strong preference for same-sex peers over…
Descriptors: Early Adolescents, Peer Relationship, Friendship, Peer Groups
Helms, LuAnn Sherbeck – 1999
This paper discusses the fact that reliability is about scores and not tests and how reliability limits effect sizes. The paper also explores the classical reliability coefficients of stability, equivalence, and internal consistency. Stability is concerned with how stable test scores will be over time, while equivalence addresses the relationship…
Descriptors: Effect Size, Meta Analysis, Reliability, Scores
Peer reviewed Peer reviewed
Schulman, Robert S.; Haden, Richard L. – Psychometrika, 1975
A model is proposed for the description of ordinal test scores based on the definition of true score as expected rank; its deviations are compared with results from classical test theory. An unbiased estimator of population true score from sample data is calculated. Score variance and population reliability are examined. (Author/BJG)
Descriptors: Career Development, Mathematical Models, Test Reliability, Test Theory
Pages: 1  |  ...  |  23  |  24  |  25  |  26  |  27  |  28  |  29  |  30  |  31  |  ...  |  78