Publication Date
| In 2026 | 3 |
| Since 2025 | 675 |
| Since 2022 (last 5 years) | 3176 |
| Since 2017 (last 10 years) | 7417 |
| Since 2007 (last 20 years) | 15055 |
Descriptor
| Test Reliability | 15043 |
| Test Validity | 10279 |
| Reliability | 9761 |
| Foreign Countries | 7144 |
| Test Construction | 4825 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3526 |
| Interrater Reliability | 3124 |
| Correlation | 3040 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1328 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 253 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 217 |
| California | 215 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Peer reviewedFigueredo, Aurelio Jose; And Others – Multivariate Behavioral Research, 1995
Two longitudinal studies involving 29 raters concerning the construct validity, temporal stability, and interrater reliability of the latent common factors underlying subjective assessments by human raters of personality traits in the stumptail macaque and the zebra finch illustrate the use of generalizability analysis to test prespecified…
Descriptors: Animal Behavior, Construct Validity, Evaluation Methods, Generalizability Theory
Peer reviewedWakefield, John F. – Journal of Creative Behavior, 1991
This article reviews the history of divergent thinking tests and provides a projection of current research suggesting a bright outlook for creativity tests. A model relating problem finding and problem solving is described, as are approaches to increasing test reliability. (DB)
Descriptors: Creativity, Creativity Research, Creativity Tests, Divergent Thinking
Peer reviewedGreenberg, Karen L. – WPA: Writing Program Administration, 1992
Elaborates on and responds to challenges of direct writing assessment. Speculates on future directions in writing assessment. Suggests that, if writing instructors accept that writing is a multidimensional, situational construct that fluctuates across a wide variety of contexts, then they must also respect the complexity of teaching and testing…
Descriptors: Essay Tests, Higher Education, Multiple Choice Tests, Test Format
Peer reviewedEichelberger, R. Tony – Mid-Western Educational Researcher, 1992
Concerns about norm-referenced achievement tests involve (1) the purposes of the tests (comparability of results); (2) the content of the tests; (3) psychometric screening of test items; (4) the form of tests items; (5) the interpretation of summary scores; and (6) the construct validity of the tests. Offers guidelines for use of norm-referenced…
Descriptors: Achievement Tests, Elementary Secondary Education, Norm Referenced Tests, Reliability
Peer reviewedFisicaro, Sebastiano A.; Lautenschlager, Gary J. – Educational and Psychological Measurement, 1992
An equation derived by W. A. Nicewander and J. M. Price relating statistical power to reliability of dependent variable measures when true score regression is homogeneous across treatment conditions is enhanced through overcoming the problem of directly estimating the squared linear correlation between true scores for X and Y. (SLD)
Descriptors: Analysis of Variance, Correlation, Equations (Mathematics), Estimation (Mathematics)
Peer reviewedCaldwell, Linda L.; And Others – Journal of Leisure Research, 1992
Reports development of a battery of scales measuring four dimensions of adolescent leisure experience: boredom, awareness, anxiety, and challenge. Analysis confirmed the internal consistency reliability of a shortened version. Test-retest reliability was established over 12 months. Correlations with theoretically related measures suggested initial…
Descriptors: Adolescents, Construct Validity, Foreign Countries, Leisure Time
Peer reviewedvan Buuren, Stef; van Rijckevorsel, Jan L. A. – Psychometrika, 1992
A technique is presented to transform incomplete categorical data into complete data by imputing appropriate scores into missing cells. A solution of the optimization problem is suggested, and relevant psychometric theory is discussed. The average correlation should be at least 0.50 before the method becomes practical. (SLD)
Descriptors: Classification, Computer Simulation, Correlation, Equations (Mathematics)
Peer reviewedMcCroskey, Jacquelyn; And Others – Child Welfare, 1990
Discusses the development of the Family Assessment Form for use in in-home family support services at the Children's Bureau of Los Angeles, California. Focuses on details of the form, its reliability and validity, and refinements in its use. (BB)
Descriptors: Child Welfare, Family Programs, Program Evaluation, Research and Development
Peer reviewedZegers, Frits E. – Applied Psychological Measurement, 1991
The degree of agreement between two raters rating several objects for a single characteristic can be expressed through an association coefficient, such as the Pearson product-moment correlation. How to select an appropriate association coefficient, and the desirable properties and uses of a class of such coefficients--the Euclidean…
Descriptors: Classification, Correlation, Data Interpretation, Equations (Mathematics)
Peer reviewedHenriksen, Melvin, Ed.; Wagon, Stan, Ed. – American Mathematical Monthly, 1991
The discrete mathematics topics of trees and computational complexity are implemented in a simple reliability program which illustrates the process advantages of the PASCAL programing language. The discussion focuses on the impact that reliability research can provide in assessment of the risks found in complex technological ventures. (Author/JJK)
Descriptors: Algorithms, College Mathematics, Higher Education, Instructional Materials
Peer reviewedPage, Ellis Batten – Journal of Experimental Education, 1994
National Assessment of Educational Progress writing sample essays from 1988 and 1990 (495 and 599 essays) were subjected to computerized grading and human ratings. Cross-validation suggests that computer scoring is superior to a two-judge panel, a finding encouraging for large programs of essay evaluation. (SLD)
Descriptors: Computer Assisted Testing, Computer Software, Essays, Evaluation Methods
Peer reviewedCrone, Linda J.; And Others – Applied Measurement in Education, 1994
Scores from 324 Louisiana schools on the Louisiana Graduation Exit Examination and a within-school split sample of 255 schools indicate that a single subject or grade provides a less consistent and more narrow perspective on school effectiveness than a subcomposite made up of 2 subject areas. (SLD)
Descriptors: Classification, Effective Schools Research, Elementary Secondary Education, Exit Examinations
Peer reviewedSlate, John R. – Psychology in the Schools, 1994
Investigated correlations between two intelligence measures for exceptional children. Corrected correlations between the tests indicated differences with correlations reported in one manual. Relationships were generally higher than those reported elsewhere. Implications are discussed, especially those involving the use of correlations between…
Descriptors: Adolescents, Children, Correlation, Elementary Secondary Education
Peer reviewedDoll, Beth; Elliott, Stephen N. – Journal of Early Intervention, 1994
Nine comprehensive observations were conducted of 24 preschoolers (8 with disabilities) in free play settings, with social behavior categories based on the work of Strain. Comparison of partial and complete observational records demonstrated that at least five observations were required to represent the children's social behavior adequately.…
Descriptors: Behavior Rating Scales, Classroom Observation Techniques, Disabilities, Preschool Children
Peer reviewedCooil, Bruce; Rust, Roland T. – Psychometrika, 1994
It is proposed that proportional reduction in loss (PRL) be used as a theoretical basis to derive, justify, and interpret reliability measures to gauge reliability on a zero-to-one scale. This PRL approach simplifies the interpretation of existing measures (e.g., generalizability-theory measures). (SLD)
Descriptors: Data Analysis, Equations (Mathematics), Estimation (Mathematics), Generalizability Theory


