Publication Date
| In 2026 | 7 |
| Since 2025 | 690 |
| Since 2022 (last 5 years) | 3191 |
| Since 2017 (last 10 years) | 7432 |
| Since 2007 (last 20 years) | 15070 |
Descriptor
| Test Reliability | 15055 |
| Test Validity | 10290 |
| Reliability | 9763 |
| Foreign Countries | 7150 |
| Test Construction | 4828 |
| Validity | 4192 |
| Measures (Individuals) | 3880 |
| Factor Analysis | 3826 |
| Psychometrics | 3532 |
| Interrater Reliability | 3126 |
| Correlation | 3040 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1329 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 253 |
| Taiwan | 234 |
| Netherlands | 224 |
| Spain | 218 |
| California | 215 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Peer reviewedGutkin, Terry B.; And Others – Journal of School Psychology, 1984
Performed orthogonal and oblique factor analysis using the Wechsler Adult Intelligence Scale-Revised (WAIS-R) standardization sample (N=1,880). Analysis of the variance components for each subtest at every age level revealed a substantial proportion of subtests at a wide range of age levels evidenced high or intermediate levels of specific…
Descriptors: Adults, Factor Analysis, Intelligence Tests, Performance Factors
Peer reviewedKnight, Robert G. – Journal of Consulting and Clinical Psychology, 1983
Discusses the significance of confidence intervals around IQ scores based on a misleading interpretation of the standard error of measurement terms provided in the Wechsler Adult Intelligence Scale-Revised (WAIS-R) manual. Presents standard error values and a table for determining the abnormality of verbal and performance IQ discrepancies.…
Descriptors: Error of Measurement, Foreign Countries, Intelligence Tests, Test Interpretation
Peer reviewedShohamy, Elana – Language Learning, 1983
Reports on study in which students of Hebrew as a second language took four versions of oral proficiency test. Results indicate that different speech styles and topics significantly affected students' scores, and correlational analyses between pairs pointed to low reliability and lack of stability of the test. Urges caution in making decisions…
Descriptors: Hebrew, Interviews, Language Proficiency, Language Tests
Peer reviewedGough, Harrison G. – Journal of Creative Behavior, 1975
Descriptors: Creative Thinking, Creativity, Measurement Instruments, Sampling
Follman, John – Southern Journal of Educational Research, 1976
Three versions of a rating scale were compared for reliability and level of ratings. Results indicated that the wording of items influence these factors; that items should be written in order to maximize their meaning; and that response categories should be unique to each item. (Author/RW)
Descriptors: Evaluation Methods, Language Usage, Measurement Techniques, Rating Scales
Peer reviewedPrice-Bonham, Sharon – Journal of Marriage and the Family, 1976
This study compares differential relations between unweighted and weighted decision-making scores and selected resource variables. The findings suggest that there are few significant relationships between individual decisions and resource variables. (Author)
Descriptors: Decision Making, Marriage, Research Projects, Sex Role
Peer reviewedKane, Michael T.; And Others – Journal of Educational Measurement, 1976
This discussion illustrates the application of generalizability theory to a design commonly employed in the collection of evaluation data and provides a detailed analysis of the dependability of student evaluations of college teaching. (RC)
Descriptors: Course Evaluation, Student Evaluation of Teacher Performance, Test Reliability, True Scores
Peer reviewedPiotrowski, Richard J. – Psychology in the Schools, 1976
Changes in the full scale reliability of the WISC-R were computed at three age levels when each subtest was omitted by itself. The same procedure was followed with those subtests which independently had the smallest effect in lowering full scale reliability. Cautions were noted concerning the exclusion of subtests. (Author)
Descriptors: Intelligence Tests, Statistical Studies, Test Construction, Test Interpretation
Peer reviewedPielstick, N. L.; Thorndike, Robert M. – Psychology in the Schools, 1976
Reanalysis of Wakefield and Carlson's data confirmed canonical correlations of .84 and .69, but analysis of redundancies revealed that only 34 percent of the total WISC subtest variance is redundant with the ITPA and 39 percent of the ITPA subtest variance is redundant with the WISC. (Author)
Descriptors: Comparative Testing, Intelligence Tests, Statistical Analysis, Test Reliability
McCollum, Janet; Thompson, Bruce – Online Submission, 1980
Response error refers to the tendency to respond to items based on the perceived social desirability or undesirability of given responses. Response error can be particularly problematic when all or most of the items on a measure are extremely attractive or unattractive. The present paper proposes a method of (a) distinguishing among preferences…
Descriptors: Methods, Response Style (Tests), Social Desirability, Reliability
Dimitrov, Dimiter M. – 2003
This paper provides analytic evaluations of expected (marginal) true-score measures for binary items given their item response theory (IRT) calibration. Under the assumption of normal trait distributions, marginalized true scores, error variance, true score variance, and reliability for norm-referenced and criterion-references interpretations are…
Descriptors: Item Response Theory, Reliability, Test Construction, Test Items
Harrison, Judy A.; McAfee, Harold – 2002
To improve the admission process for teacher education at Henderson State University, Arkansas, and to ensure that teacher education candidates have appropriate dispositions for teaching, this study investigated the interview was an admissions criterion. Criteria for assessment and levels of proficiency were determined. A scoring rubric was…
Descriptors: College Admission, Criteria, Interviews, Preservice Teachers
Harrison, Judy A.; McAfee, Harold; Caldwell, Ana – 2002
To improve the admission process for teacher education at Henderson State University, Arkansas, and to ensure that teacher education candidates have appropriate dispositions for teaching, this study investigated the interview was an admissions criterion. Criteria for assessment and levels of proficiency were determined. A scoring rubric was…
Descriptors: College Admission, Interviews, Preservice Teachers, Reliability
Li, Jun Corser; Woodruff, David J. – 2002
Coefficient alpha is a simple and very useful index of test reliability that is widely used in educational and psychological measurement. Classical statistical inference for coefficient alpha is well developed. This paper presents two methods for Bayesian statistical inference for a single sample alpha coefficient. An approximate analytic method…
Descriptors: Bayesian Statistics, Markov Processes, Monte Carlo Methods, Reliability
Rudner, Lawrence M.; Schafer, William D. – 2001
This digest discusses sources of error in testing, several approaches to estimating reliability, and several ways to increase test reliability. Reliability has been defined in different ways by different authors, but the best way to look at reliability may be the extent to which measurements resulting from a test are characteristics of those being…
Descriptors: Educational Testing, Error of Measurement, Reliability, Scores


