Publication Date
| In 2026 | 3 |
| Since 2025 | 675 |
| Since 2022 (last 5 years) | 3176 |
| Since 2017 (last 10 years) | 7417 |
| Since 2007 (last 20 years) | 15055 |
Descriptor
| Test Reliability | 15043 |
| Test Validity | 10279 |
| Reliability | 9761 |
| Foreign Countries | 7144 |
| Test Construction | 4825 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3526 |
| Interrater Reliability | 3124 |
| Correlation | 3040 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1328 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 253 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 217 |
| California | 215 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Capraro, Robert M.; Graham, James M. – 2002
This paper illustrates first how estimated Structural Equation Modeling (SEM) measurement error variances are actually estimates of score reliabilities. The major advantage of SEM over other analytic methods is that it accounts for measurement error. Score reliabilities are estimated as part of structural modeling, so that structural models test…
Descriptors: Error of Measurement, Estimation (Mathematics), Reliability, Scores
van der Linden, Wim J.; Vos, Hans J.; Chang, Lei – 2000
In judgmental standard setting experiments, it may be difficult to specify subjective probabilities that adequately take the properties of the items into account. As a result, these probabilities are not consistent with each other in the sense that they do not refer to the same borderline level of performance. Methods to check standard setting…
Descriptors: Interrater Reliability, Judges, Probability, Standard Setting
De Champlain, Andre F.; Gessaroli, Marc E.; Floreck, Lisa M. – 2000
The purpose of this study was to estimate the extent to which recording variability among standardized patients (SPs) has an impact on classification consistency with data sets simulated to reflect performances on a large-scale clinical skills examination. SPs are laypersons trained to portray patients in clinical encounters (cases) and to record…
Descriptors: Classification, Interrater Reliability, Licensing Examinations (Professions), Medical Education
Henson, Robin K.; Thompson, Bruce – 2001
Given the potential value of reliability generalization (RG) studies in the development of cumulative psychometric knowledge, the purpose of this paper is to provide a tutorial on how to conduct such studies and to serve as a guide for researchers wishing to use this methodology. After some brief comments on classical test theory, the paper…
Descriptors: Coding, Error of Measurement, Psychometrics, Reliability
Brualdi, Amy – 1999
Test validity refers to the degree to which the inferences based on test scores are meaningful, useful, and appropriate. Thus, test validity is a characteristic of a test when it is administered to a particular population. This article introduces the modern concepts of validity advanced by S. Messick (1989, 1996, 1996). Traditionally, the means of…
Descriptors: Criteria, Data Interpretation, Elementary Secondary Education, Reliability
Peer reviewedLavoie, Allan L.; Bentler, Peter M. – Journal of Educational Measurement, 1974
Descriptors: Measurement Techniques, Rating Scales, Semantic Differential, Test Construction
Peer reviewedRiviere, Michael S. – Educational and Psychological Measurement, 1973
Descriptors: Comparative Testing, Intelligence Tests, Mental Retardation, Test Reliability
Peer reviewedPercell, Lawrence P.; Delk, John L. – Journal of Consulting and Clinical Psychology, 1973
The results of this study showed that the relative usefulness of the three forms of the Mini Mult, a 71-item form of the MMPI, were not encouraging. (JC)
Descriptors: Personality Assessment, Personality Measures, Psychological Testing, Psychology
Peer reviewedMcCarthy, Karen A.; Steckler, Jane F. – Educational and Psychological Measurement, 1973
While the PPVT is widely used with both typical (i.e., regular classroom students) and atypical (i.e., special class students such as retardates or cerebral palsied) children, there is a dearth of reliability data available for the typical group. (Authors)
Descriptors: Grade 1, Intelligence Quotient, Test Reliability, Test Validity
Peer reviewedMartois, John S. – Educational and Psychological Measurement, 1973
Copies of this program may be obtained from the author at the University of Southern California, School of Pharmacy, University Park, Los Angeles 90007. (CB)
Descriptors: Comparative Analysis, Computer Programs, Input Output, Statistical Analysis
Hess, Lee R. – Training and Development Journal, 1973
A simple approach for achieving fair employment guidelines. (Editor)
Descriptors: Employment Qualifications, Predictive Validity, Rating Scales, Test Reliability
Peer reviewedStafford, Richard E. – Journal of Educational Measurement, 1971
Descriptors: Correlation, Statistical Analysis, Test Interpretation, Test Reliability
Peer reviewedGilman, David Alan; Ferry, Paula – Journal of Educational Measurement, 1972
Results indicate that scoring tests by the self-scoring method can result in a higher split half reliability than tests scored by the traditional right-wrong method. (Authors)
Descriptors: Data Analysis, Multiple Choice Tests, Scoring, Test Construction
McLaughlin, G. Harry – J Reading, 1969
Replies to Pauk's investigation, raising some questions about it and defending his own readability measure. (MD)
Descriptors: Correlation, Criteria, Predictive Validity, Readability
Brodsky, Stanley L.; Owens, Shirlee – Psychol Rep, 1969
Descriptors: College Students, Group Dynamics, Psychological Testing, Self Concept


