Publication Date
| In 2026 | 7 |
| Since 2025 | 690 |
| Since 2022 (last 5 years) | 3191 |
| Since 2017 (last 10 years) | 7432 |
| Since 2007 (last 20 years) | 15070 |
Descriptor
| Test Reliability | 15055 |
| Test Validity | 10290 |
| Reliability | 9763 |
| Foreign Countries | 7150 |
| Test Construction | 4828 |
| Validity | 4192 |
| Measures (Individuals) | 3880 |
| Factor Analysis | 3826 |
| Psychometrics | 3532 |
| Interrater Reliability | 3126 |
| Correlation | 3040 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1329 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 253 |
| Taiwan | 234 |
| Netherlands | 224 |
| Spain | 218 |
| California | 215 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Peer reviewedWasserman, Theodore H. – Child and Family Behavior Therapy, 1983
Forty-four emotionally disturbed and 112 elementary school children were administered the Children's Dysfunctional Cognition Scale (CDC) for purposes of standardizing the instrument. Results indicate that the CDC is reliable and valid. The scale differentiates between normal and disturbed populations and is highly correlated with teacher's ratings…
Descriptors: Cognitive Processes, Elementary Education, Emotional Disturbances, Test Construction
Mardell-Czudnowski, Carol D.; Lessen, Elliott I. – Diagnostique, 1982
Special education tests evaluated by M. Thurlow and J. Ysseldyke with regard to technical adequacy were reevaluated using the same criteria. There was disagreement of 11 of the 30 tests used in both evaluations, demonstrating that current test standards are vague enough that experts disagree. (Author/CL)
Descriptors: Disabilities, Elementary Secondary Education, Evaluation Methods, Test Norms
Peer reviewedConger, Anthony J. – Educational and Psychological Measurement, 1983
A paradoxical phenomenon of decreases in reliability as the number of elements averaged over increases is shown to be possible in multifacet reliability procedures (intraclass correlations or generalizability coefficients). Conditions governing this phenomenon are presented along with implications and cautions. (Author)
Descriptors: Generalizability Theory, Test Construction, Test Items, Test Length
Peer reviewedNarrett, Carla M; And Others – Reading Teacher, 1984
Reviews the Kaufman Assessment Battery for Children, an individually administered test of intelligence and achievement. Finds it to be of high overall quality. (FL)
Descriptors: Achievement Tests, Intelligence Tests, Test Reliability, Test Reviews
Peer reviewedFulmer, Susanne; Fulmer, Robert – Journal of Learning Disabilities, 1983
The Pre-Reading-Screening Procedures and the Slingerland Screening Tests for Identifying Children with Specific Language Disability were administered to 1021 grade one to six students. Results indicated that reliability and validity coefficients were acceptable and that educational programing decisions could confidently be based on the test…
Descriptors: Elementary Education, Language Handicaps, Learning Disabilities, Screening Tests
Peer reviewedDroege, Robert C.; Hawk, John – Journal of Employment Counseling, 1976
This study was performed to resolve the question of comparability of manual dexterity scores obtained on wooden and plastic versions of the USES pegboard. Currently authorized equipment, either plastic or wooden, may be used with the confidence that scores are not affected by the type of equipment used. (Author)
Descriptors: Aptitude Tests, Individual Differences, Research Projects, Skill Analysis
Peer reviewedTaylor, Erwin K.; Griess, Thomas – Personnel Psychology, 1976
In most selection validation research, only the upper and lower tails of the criterion distribution are used, often yielding misleading or incorrect results. Provides formulas and tables which enable the researcher to account more accurately for the distribution of criterion within the middle range of population. (Author/RW)
Descriptors: Evaluation Methods, Measurement Techniques, Predictive Validity, Reliability
Peer reviewedAllison, Paul A. – Psychometrika, 1976
A direct proof is given for the generalized Spearman-Brown formula for any real multiple of test length. (Author)
Descriptors: Correlation, Error of Measurement, Raw Scores, Test Length
Peer reviewedHatcher, Roger P. – Mental Retardation, 1976
Descriptors: Infants, Intelligence Tests, Mental Retardation, Predictive Measurement
Peer reviewedRedelheim, Paul S. – Reading Teacher, 1976
Descriptors: Elementary Education, Multidimensional Scaling, Reading Research, Reading Tests
Jensen, Robert K. – Research Quarterly, 1976
Testing showed the dynamometer to be a sufficiently accurate instrument for recording force-time and force-displacement curves. (GW)
Descriptors: Measurement Instruments, Measurement Techniques, Motion, Physical Activities
Mertler, Craig A.; Earley, Mark A. – 2002
This paper discusses the results of a study comparing the psychometric qualities of two forms of a survey, one administered in paper-and-pencil format and the other administered in Web format. The survey addressed the topic of college course anxiety and was used to survey a sample of undergraduate students (n=36). The psychometric qualities…
Descriptors: Higher Education, Online Systems, Psychometrics, Reliability
Lasee, Michael J.; Smith, Douglas K. – 1991
This study compared the effectiveness of the recently-developed Early Screening Profiles (ESP) with the Kaufman Assessment Battery for Children (K-ABC), two screening tests designed to measure the cognitive, language, motor, and social development of preschool children. The tests were administered in counterbalanced order to a sample of 29…
Descriptors: Preschool Children, Preschool Education, Screening Tests, Student Evaluation
Identifying Undifferentiating Response Sets and Assessing Their Effects on the Measurement of Items.
Schulz, E. Matthew; Sun, Anji – 2001
Undifferentiating response sets, defined as "overuse" of any category of a Likert scale, were identified using a combination of simple criteria, such as whether a single-category response set involved more than four items, and statistical criteria based on D. Andrich's (1978) measurement model for Likert scales (the Rating Scale model). Data were…
Descriptors: College Students, High Schools, Likert Scales, Measurement Techniques
Hendrickson, Amy B. – 2001
The purpose of the study was to compare reliability estimates for a test composed of stimulus-dependent testlets as derived from item scores, testlet scores, and under the univariate generalizability theory and multivariate generalizability theory designs, as well as to determine the influence of the number of testlets and the number of items per…
Descriptors: Comparative Analysis, Reliability, Scores, Standardized Tests


