Peer reviewed. Feldt, Leonard S. – Educational and Psychological Measurement, 2003
Develops formulas to cope with the situation in which the reliability of test scores must be approximated even though no examinee has taken the complete instrument. Develops different estimators for part tests that are judged to be classically parallel, tau-equivalent, or congeneric. Proposes standards for differentiating among these three models.…
Descriptors: Estimation (Mathematics), Reliability, Scores, Test Results
Peer reviewed. Li, Heng – Psychometrika, 1997
A formally simple expression for the maximal reliability of a linear composite is provided. Its theoretical implications and its relation to existing results for reliability are discussed. (Author/SLD)
Descriptors: Reliability, Test Items, Theory Practice Relationship
Peer reviewed. Myford, Carol M. – Applied Measurement in Education, 2002
Studied the use of descriptive graphic rating scales by 11 raters to evaluate students' work, exploring different design features. Used a Rasch-model based rating scale analysis to determine that all the continuous scales could be considered to have at least five points, and that defined midpoints did not result in higher student separation…
Descriptors: Evaluators, Rating Scales, Reliability, Test Construction
Peer reviewed. Osborne, Jason W.; Waters, Elaine – Practical Assessment, Research & Evaluation, 2002
Discusses assumptions of multiple regression that are not robust to violation: linearity, reliability of measurement, homoscedasticity, and normality. Stresses the importance of checking assumptions. (SLD)
Descriptors: Error of Measurement, Regression (Statistics), Reliability
Peer reviewed. Feldt, Leonard S.; Charter, Richard A. – Measurement and Evaluation in Counseling and Development, 2003
Evaluating a test's reliability often requires dividing it into 3 or more unequal parts, which causes violation of the tau equivalence assumption of Cronbach's alpha. This article presents a criterion for abandoning alpha and an approach for computing a more appropriate estimate of reliability, the Gilmer-Feldt coefficient. (Author)
Descriptors: Counseling, Evaluation Methods, Psychometrics, Test Reliability
Peer reviewed. Kane, Michael – Journal of Educational Measurement, 2002
Reviews the criticisms of sampling assumptions in generalizability theory (and in reliability theory) and examines the feasibility of using representative sampling, stratification, homogeneity assumptions, and replications to address these criticisms. Suggests some general outlines for the conduct of generalizability theory studies. (SLD)
Descriptors: Generalizability Theory, Reliability, Research Methodology, Sampling
Peer reviewed. Raykov, Tenko; Shrout, Patrick E. – Structural Equation Modeling, 2002
Discusses a method for obtaining point and interval estimates of reliability for composites of measures with a general structure. The approach is based on fitting a correspondingly constrained structural equation model and generalizes earlier covariance structure analysis methods for scale reliability estimation with congeneric tests. (SLD)
Descriptors: Estimation (Mathematics), Reliability, Structural Equation Models
Peer reviewed. Baker, Herbert George; Spier, Morris S. – Public Personnel Management, 1990
Much criticism is leveled at the nature and usefulness of the employment interview. Despite its shortcomings and the availability of more objective means of selection, classification, and placement, the personal interview is used pervasively. A structured interview can increase the reliability and validity of the technique. (JOW)
Descriptors: Employment Interviews, Personnel Selection, Reliability, Validity
Peer reviewed. Blau, Gary J. – Journal of Vocational Behavior, 1988
Examined the reliability and validity of a career commitment measure using employees (N=266) of newspaper and insurance companies. Results showed career commitment could be reliably measured and was operationally distinct from job involvement and organizational commitment. Discusses findings in terms of meaning of career commitment. (Author/ABL)
Descriptors: Careers, Employees, Test Reliability, Test Validity
Peer reviewed. Berry, Kenneth J.; Mielke, Paul W., Jr. – Educational and Psychological Measurement, 1988
Cohen's kappa statistic is frequently used to measure agreement between two observers using categorical polytomies. Cohen's statistic is shown to be inherently multivariate in nature, expanded to analyze ordinal and interval data, and extended to more than two observers. A non-asymptotic test of significance is provided for the generalized statistic.…
Descriptors: Equations (Mathematics), Interrater Reliability, Multivariate Analysis
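As a point of reference for the record above, the basic two-observer, categorical form of Cohen's kappa can be sketched as follows. This is only the standard starting statistic, not the multivariate, ordinal/interval, or multi-observer generalization that the article develops. The function name `cohen_kappa` and the ratings are hypothetical.

```python
from collections import Counter


def cohen_kappa(ratings_a, ratings_b):
    """Cohen's kappa for two raters who each assign one category per item."""
    if len(ratings_a) != len(ratings_b) or not ratings_a:
        raise ValueError("both raters must rate the same non-empty set of items")
    n = len(ratings_a)
    # Observed agreement: share of items given the same category by both raters.
    p_observed = sum(a == b for a, b in zip(ratings_a, ratings_b)) / n
    # Chance agreement: product of the raters' marginal proportions, per category.
    counts_a = Counter(ratings_a)
    counts_b = Counter(ratings_b)
    p_chance = sum(
        (counts_a[c] / n) * (counts_b[c] / n) for c in set(counts_a) | set(counts_b)
    )
    if p_chance == 1:
        raise ValueError("kappa is undefined when chance agreement equals 1")
    return (p_observed - p_chance) / (1 - p_chance)


# Hypothetical ratings of four items by two observers.
kappa = cohen_kappa(["y", "y", "n", "n"], ["y", "n", "n", "n"])
```

Here the observers agree on three of four items (0.75) while chance agreement is 0.5, so kappa corrects the raw agreement downward.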
Peer reviewed. van der Linden, Wim J.; Boekkooi-Timminga, Ellen – Applied Psychological Measurement, 1988
Gulliksen's matched random subtests method is a graphical method to split a test into parallel test halves, allowing maximization of coefficient alpha as a lower bound to the classical test reliability coefficient. This problem is formulated as a zero-one programing problem solvable by algorithms that already exist. (TJH)
Descriptors: Algorithms, Equations (Mathematics), Programing, Test Reliability
Peer reviewed. Merriam, Sharan B. – PAACE Journal of Lifelong Learning, 1995
Deals with issues of validity and reliability in qualitative research in education. Discusses philosophical assumptions underlying the concepts of internal validity, reliability, and external validity or generalizability. Presents strategies congruent with a qualitative research perspective for ensuring the rigor and trustworthiness of findings.…
Descriptors: Educational Research, Qualitative Research, Reliability, Validity
Peer reviewed. Rosenthal, James A. – Social Work Research, 1994
Notes that conventional practice in social work research is to recommend a reliability level close to 0.80 as a minimum standard. Contends that the needed reliability varies by situation: where important decisions about individuals are being made, 0.90 provides a better standard, whereas in descriptive survey research with large…
Descriptors: Reliability, Research, Social Work, Statistical Analysis
Peer reviewed. Ahn, Woo-Kyoung; And Others – Cognition, 1995
Presents a series of four studies testing the hypothesis that people seek out and prefer information about causal mechanisms rather than information about covariation. Concludes that people attempt to seek out causal mechanisms in developing a causal explanation for a specific event. (DR)
Descriptors: College Students, Information Seeking, Motivation, Reliability
Peer reviewed. Zimmerman, Donald W. – Journal of Educational Measurement, 1994
An alternative formula is presented for the reliability of a difference score that contains the correlation between true scores instead of the correlation between observed scores. This approach provides more useful information and yields values that are not as anomalous as those usually obtained. (SLD)
Descriptors: Correlation, Equations (Mathematics), Reliability, Research Methodology
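The conventional formula that the record above sets out to replace, the classical test theory reliability of a difference score D = X - Y written with the observed-score correlation, can be sketched as below. The article's alternative formula is not reproduced in the abstract, so the second helper is an assumption: it only applies the standard attenuation identity to move from a true-score correlation to an observed one, and may not match the article's expression. All function and parameter names are hypothetical.

```python
import math


def difference_score_reliability(sd_x, sd_y, rel_x, rel_y, r_xy):
    """Conventional reliability of D = X - Y, using the observed correlation r_xy."""
    covariance_term = 2.0 * sd_x * sd_y * r_xy
    denominator = sd_x**2 + sd_y**2 - covariance_term
    if denominator <= 0:
        raise ValueError("difference scores have no variance")
    numerator = sd_x**2 * rel_x + sd_y**2 * rel_y - covariance_term
    return numerator / denominator


def observed_from_true_correlation(r_true, rel_x, rel_y):
    """Attenuation identity: observed r equals true-score r times sqrt(rel_x * rel_y).

    Assumption: used here only to restate the input in terms of the
    true-score correlation, not as the article's own formula.
    """
    return r_true * math.sqrt(rel_x * rel_y)


# Hypothetical values: equal standard deviations, reliabilities of 0.8, observed r of 0.5.
rel_d = difference_score_reliability(1.0, 1.0, 0.8, 0.8, 0.5)

# The same case entered through a true-score correlation of 0.625.
r_obs = observed_from_true_correlation(0.625, 0.8, 0.8)
rel_d_from_true = difference_score_reliability(1.0, 1.0, 0.8, 0.8, r_obs)
```

With these inputs the difference score is less reliable than either component, which illustrates the kind of anomalous-looking value the abstract refers to.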


