Peer reviewed: Papanastasiou, Elena C. – Structural Equation Modeling, 2003
This volume, based on papers presented at a 1998 conference, collects thinking and research on item generation for test development. It includes materials on psychometric and cognitive theory, construct-oriented approaches to item generation, the item generation process, and some applications of item generative principles. (SLD)
Descriptors: Item Banks, Test Construction, Test Items, Test Theory
Peer reviewed: Zimmerman, Donald W.; Zumbo, Bruno D. – International Journal of Testing, 2001
Presents a model of tests and measurement that identifies test scores with Hilbert space vectors and true and error components of scores with linear operators. This geometric point of view brings to light relations among elementary concepts in test theory, including reliability, validity, and parallel tests. (Author/SLD)
Descriptors: Models, Probability, Reliability, Scores
Peer reviewed: Keith, Timothy Z.; And Others – Journal of School Psychology, 1988
Studied whether Stanford-Binet Intelligence Scale: Fourth Edition corresponds to theory that guided its construction, using first-order confirmatory factor analysis with entire standardization sample and three age groups. Results generally support the four factors as reflecting the underlying structure of the new Binet, but were less supportive of…
Descriptors: Factor Analysis, Intelligence Tests, Test Theory, Test Validity
Kolawole, E. B. – Educational Research and Reviews, 2008
This study investigated the effects of cooperative and competitive learning on the academic performance of students in mathematics, in order to find out which of the two is the more effective learning strategy. The sample of the study was 400 Senior Secondary School III mathematics students, made up of 240 boys and 160 girls randomly selected…
Descriptors: Females, Males, Mathematics Achievement, Learning Strategies
Talbot, Robert M.; Briggs, Derek C. – Measurement: Interdisciplinary Research and Perspectives, 2007
At the core of the argument-based approach to test validation as it has been presented by Kane (1992, 2004, 2006) is a relatively simple premise: test validity is demonstrated by linking the score that is observed from a test instrument to the use of that score for some subsequent inference. Details, however, are not so simple: How does one craft…
Descriptors: Test Validity, Inferences, Knowledge Base for Teaching, Mathematics Education
Kieffer, Kevin M. – 1998
This paper discusses the benefits of using generalizability theory in lieu of classical test theory. Generalizability theory subsumes and extends the precepts of classical test theory by estimating the magnitude of multiple sources of measurement error and their interactions simultaneously in a single analysis. Since classical test theory examines…
Descriptors: Error of Measurement, Generalizability Theory, Heuristics, Interaction
Peer reviewed: Frank, Austin C.; Kirk, Barbara A. – Journal of Vocational Behavior, 1974
The Basic Interest Scales (BIS) and the Occupational Scales (O-S) of the revised Strong Vocational Interest Blank for Women (TW 398) were assigned Holland codes, and component scores for the BIS and O-S were separately developed, intercorrelated, and evaluated along with standardized composite scores representing each of the 11 O-S groups. (Author)
Descriptors: Comparative Analysis, Females, Occupational Aspiration, Test Theory
Peer reviewed: Nash, Roy – Interchange, 1987
Advances an argument that Binet must be regarded as a major theoretician of functional intelligence and should be considered for what is regarded as classical intelligence theory. Gives a discourse on Binet's theory, its intellectual context, and the developments it fostered. (JL)
Descriptors: Cognitive Development, Intelligence Quotient, Intelligence Tests, Psychometrics
Peer reviewed: Collins, Linda M.; Cliff, Norman – Psychometrika, 1985
The axioms of a three-set Guttman simplex model are presented and the effects of relaxing the axioms for one of the three sets are examined. This model can be used to define longitudinal developmental scales. (NSF)
Descriptors: Mathematical Models, Measurement Techniques, Scaling, Test Construction
Peer reviewed: Zimmerman, Donald W. – Educational and Psychological Measurement, 1983
A definition of test validity as the ratio of a covariance term to a variance term, analogous to the classical definition of test reliability, is proposed. When error scores on distinct tests are uncorrelated, the proposed definition coincides with the usual one, but it remains meaningful when error scores are correlated. (Author/BW)
Descriptors: Definitions, Mathematical Formulas, Mathematical Models, Test Theory
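For orientation, the classical definition of reliability that the abstract above refers to can be written as a ratio of a covariance term to a variance term. This is the standard textbook form, not the paper's own notation: it assumes observed scores $X = T + E$ and $X' = T + E'$ on parallel forms, with errors uncorrelated with true scores and with each other. The abstract does not state the specific covariance and variance terms used in the proposed validity ratio, so they are not reproduced here.

```latex
% Classical reliability for parallel forms X = T + E and X' = T + E'
% (assumes Cov(T,E) = Cov(T,E') = Cov(E,E') = 0 and Var(X) = Var(X'))
\rho_{XX'} \;=\; \frac{\operatorname{Cov}(X, X')}{\sigma_X^{2}}
          \;=\; \frac{\sigma_T^{2}}{\sigma_X^{2}}
```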
Marzano, Robert J. – 2000
There has been little discussion of two conventions common within classroom assessment: the convention of representing a student's performance on an assessment using a single score; and the convention of using the average score to summarize a student's performance over a set of assessments. This paper attempts to demonstrate that the assumptions…
Descriptors: Elementary Secondary Education, Scoring, Teacher Made Tests, Test Theory
Peer reviewed: Conger, Anthony J. – Educational and Psychological Measurement, 1980
Reliability maximizing weights are related to theoretically specified true score scaling weights to show a constant relationship that is invariant under separate linear transformations on each variable in the system. Test theoretic relations should be derived for the most general model available and not for unnecessarily constrained models.…
Descriptors: Mathematical Formulas, Scaling, Test Reliability, Test Theory
Peer reviewed: Divgi, D. R. – Applied Psychological Measurement, 1980
The dependence of reliability indices for mastery tests on mean and cutoff scores was examined in the case of three decision-theoretic indices. Dependence of kappa on mean and cutoff scores was opposite to that of the proportion of correct decisions, which was linearly related to average threshold loss. (Author/BW)
Descriptors: Classification, Cutting Scores, Mastery Tests, Test Reliability
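Two of the indices named in the abstract above, the proportion of consistent mastery decisions and Cohen's kappa, can be computed from a 2x2 classification table. The following is a minimal sketch, not the paper's procedure: the function name `mastery_agreement` and the example counts are hypothetical, and the table is assumed to cross-classify the same examinees on two administrations of a mastery test.

```python
def mastery_agreement(table):
    """Return (p_observed, kappa) for a 2x2 mastery classification table.

    table[i][j] is the number of examinees classified i on the first
    administration and j on the second (0 = nonmaster, 1 = master).
    """
    total = sum(sum(row) for row in table)

    # Proportion of examinees given the same decision both times.
    p_observed = (table[0][0] + table[1][1]) / total

    # Agreement expected by chance, from the marginal proportions.
    row_margins = [sum(row) / total for row in table]
    col_margins = [(table[0][j] + table[1][j]) / total for j in range(2)]
    p_chance = sum(r * c for r, c in zip(row_margins, col_margins))

    # Cohen's kappa: agreement beyond chance, rescaled to a maximum of 1.
    kappa = (p_observed - p_chance) / (1 - p_chance)
    return p_observed, kappa


# Hypothetical counts, for illustration only.
p_observed, kappa = mastery_agreement([[20, 5], [10, 65]])
print(round(p_observed, 3), round(kappa, 3))
```

Because kappa subtracts chance agreement, which depends on the marginal proportions of masters and nonmasters, it can move in a different direction from raw agreement as the mean or cutoff score shifts.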
Peer reviewed: Vegelius, Jan – Educational and Psychological Measurement, 1979
A new measure of similarity between persons applicable in Q-analysis is proposed. It allows assumptions of non-orthogonality between the items, across which the similarity is computed. The similarity measure may also be applied in an R-analysis. (Author/JKS)
Descriptors: Correlation, Item Analysis, Q Methodology, Test Construction
Peer reviewed: Collins, Linda M. – Applied Psychological Measurement, 1996
The clarification provided by Williams and Zimmerman on the reliability of gain scores is translated into recognizable patterns of change that tend to produce reliable or unreliable gain scores. The relevance of the traditional idea of reliability to the measurement of change is also discussed. (SLD)
Descriptors: Achievement Gains, Change, Measurement Techniques, Reliability
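The patterns mentioned in the abstract above can be explored with the standard classical test theory formula for the reliability of a difference score. This is a sketch under stated assumptions, not code from the article: the function name `gain_score_reliability` and the input values are hypothetical, and measurement errors on the pretest and posttest are assumed to be uncorrelated.

```python
def gain_score_reliability(sd_pre, sd_post, rel_pre, rel_post, r_pre_post):
    """Reliability of the gain score D = post - pre under classical test theory.

    Assumes uncorrelated measurement errors across the two occasions.
    """
    covariance = sd_pre * sd_post * r_pre_post
    # True-score variance of the gain: reliable parts of each variance
    # minus twice the pretest-posttest covariance.
    true_gain_variance = (sd_pre ** 2) * rel_pre + (sd_post ** 2) * rel_post - 2 * covariance
    # Observed variance of the gain.
    observed_gain_variance = sd_pre ** 2 + sd_post ** 2 - 2 * covariance
    return true_gain_variance / observed_gain_variance


# Hypothetical inputs: equal spread on both occasions versus increased spread at posttest.
equal_spread = gain_score_reliability(10, 10, 0.8, 0.8, 0.5)
unequal_spread = gain_score_reliability(10, 20, 0.8, 0.8, 0.5)
print(round(equal_spread, 3), round(unequal_spread, 3))
```

With these illustrative values, the gain score is more reliable when the posttest standard deviation differs from the pretest standard deviation, even though the component reliabilities and the pretest-posttest correlation are unchanged.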

