Publication Date
| In 2026 | 6 |
| Since 2025 | 481 |
| Since 2022 (last 5 years) | 1960 |
| Since 2017 (last 10 years) | 4532 |
| Since 2007 (last 20 years) | 7017 |
Descriptor
| Test Reliability | 15055 |
| Test Validity | 10022 |
| Test Construction | 4374 |
| Foreign Countries | 3840 |
| Psychometrics | 2435 |
| Factor Analysis | 2302 |
| Measures (Individuals) | 1787 |
| Evaluation Methods | 1410 |
| Higher Education | 1391 |
| Questionnaires | 1264 |
| Factor Structure | 1249 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 454 |
| Practitioners | 319 |
| Teachers | 128 |
| Administrators | 73 |
| Policymakers | 33 |
| Counselors | 31 |
| Students | 17 |
| Parents | 10 |
| Community | 6 |
| Support Staff | 5 |
Location
| Turkey | 840 |
| Australia | 239 |
| China | 211 |
| Canada | 207 |
| Indonesia | 163 |
| Spain | 131 |
| United States | 123 |
| United Kingdom | 121 |
| Germany | 112 |
| Taiwan | 108 |
| Netherlands | 103 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 2 |
| Meets WWC Standards with or without Reservations | 2 |
| Does not meet standards | 1 |
Erford, Bradley T.; Klein, Lauren – Educational and Psychological Measurement, 2007
The Slosson-Diagnostic Math Screener (S-DMS) was designed to help identify students in Grades 1 to 8 at risk for mathematics failure. Internal consistency, test-retest reliability, item analysis, decision efficiency, convergent validity, and factorial validity of all five levels of the S-DMS were studied using 20 independent samples of students…
Descriptors: Grade 1, Test Validity, Item Analysis, Test Reliability
Erford, Bradley T.; Balcom, Lindsey C.; Moore-Thomas, Cheryl – Measurement and Evaluation in Counseling and Development, 2007
This study provides preliminary analysis of reliability and validity of scores on the Screening Test for Emotional Problems, which was designed to identify students ages 5 to 18 years who are referred for wide-ranging emotional disturbances categorized under the Individuals With Disabilities Education Improvement Act (U.S. Department of Education,…
Descriptors: Emotional Problems, Disabilities, Test Validity, Screening Tests
Scherer, Marcia J.; McKee, Barbara G. – 1992
Validity and reliability data are presented for two instruments for assessing the predispositions that people have toward the use of assistive and educational technologies. The two instruments, the Assistive Technology Device Predisposition Assessment (ATDPA) and the Educational Technology Predisposition Assessment (ETPA), are self-report…
Descriptors: Assistive Devices (for Disabled), Attitude Measures, Check Lists, College Students
Kaplan, Bruce A.; Johnson, Eugene G. – 1992
Across the field of educational assessment the case has been made for alternatives to the multiple-choice item type. Most of the alternative types of items require a subjective evaluation by a rater. The reliability of this subjective rating is a key component of these types of alternative items. In this paper, measures of reliability are…
Descriptors: Educational Assessment, Elementary Secondary Education, Estimation (Mathematics), Evaluators
Aghbar, Ali-Asghar – 1986
The effectiveness of the "read-comp" technique in assessing writing ability and the usefulness of a rubric and procedure devised for scoring read-comp samples and essays were evaluated. Subjects were 100 freshman students enrolled in general and remedial English classes in a 6-week summer session at Indiana University of Pennsylvania.…
Descriptors: College Freshmen, Essay Tests, Evaluation Methods, Grading
Goldstein, Harvey; Wolf, Alison – 1986
Locally developed occupational tests were administered to 16- and 17-year-olds in a government-sponsored vocational education program in the United Kingdom over a six-month period in 1984. Job skills were tested in two occupational areas: use of a micrometer and invoice completion. Some performance tests were designed by researchers and some by…
Descriptors: Comparative Testing, Criterion Referenced Tests, Evaluation Criteria, Foreign Countries
Cronin, Linda; Capie, William – 1986
The influence of day-to-day variation in teacher performance on the reliability and validity of teacher assessment was examined. An attempt was made to identify and quantify sources of score variation attributable to differences in teacher performance, day of observation, observers, and test subscales; and to determine their effects on reliability…
Descriptors: Behavior Change, Behavior Rating Scales, Classroom Observation Techniques, Evaluation Methods
SCHWAGER, SIDNEY – 1967
IN THIS REPORT THE UNITED FEDERATION OF TEACHERS (UFT) ANALYZES SPECIFIC DATA FROM THE CENTER FOR URBAN EDUCATION'S (CUE) NEGATIVE EVALUATION OF NEW YORK CITY'S MORE EFFECTIVE SCHOOLS (MES) PROGRAM AND CHARGES THAT CUE'S CONCLUSIONS ARE INVALID. THE UFT MAINTAINS THAT SINCE 18 OF THE 21 MES WERE FORMER SPECIAL SERVICE (SS) SCHOOLS, CUE SHOULD HAVE…
Descriptors: Achievement Gains, Arithmetic, Comparative Analysis, Control Groups
Gillmore, Gerald M. – 1979
It is argued in this paper that generalizability theory provides a uniquely useful framework for defining and quantifying the dependability of data for decision making. It does so by requiring careful specification of the conditions of measurement and the anticipated sources of variation in the results of the measurement procedure. A distinction…
Descriptors: Analysis of Variance, Criterion Referenced Tests, Decision Making, Educational Assessment
Peer reviewedBradley, Robert H.; Corwyn, Robert F.; Caldwell, Betty M.; Whiteside-Mansell, Leanne; Mink, Iris T. – Journal of Research on Adolescence, 2000
Describes the development of the Early Adolescent version of the Home Observation for Measurement of the Environment (EA-HOME) Inventory. Presents information on its usefulness with African Americans, Chinese Americans, European Americans, Mexican Americans, and Dominican Americans. Notes findings indicating high interobserver agreement, with…
Descriptors: Black Youth, Child Development, Chinese Americans, Cultural Differences
Wang, Tianyou – 1996
In this paper, formulas for computing the weights that maximize the reliability of a test with multiple parts are derived using a congeneric model. A direct derivation for the three-part test and case and a two-step derivation for the n-part case are presented, and results for these two approaches are shown to be consistent for the three-part…
Descriptors: Computation, Equations (Mathematics), Matrices, Performance Based Assessment
Guthrie, John T.; And Others – 1994
Noting that the amount of reading students do is related to their reading achievement, this booklet presents an instrument designed to measure the amount and breadth of students' reading in and out of school. The first part of the booklet discusses the Reading Activity Inventory (RAI) and how it differs from other reading activity measures, uses…
Descriptors: Elementary Education, Evaluation Methods, Reading Ability, Reading Achievement
Peer reviewedFulton, Robert T.; And Others – Journal of Speech and Hearing Disorders, 1975
Evaluated with 12 children (9- to 25-months-old) were the efficacy and reliability of auditory stimulus-response control training and assessment procedures. (Author/LS)
Descriptors: Auditory Tests, Exceptional Child Research, Hearing Impairments, Infants
Peer reviewedHay, Nancy M.; Stewart, Norman R. – Journal of Counseling Psychology, 1974
This study determined internal consistency and test-retest reliability coefficients for the Willoughby Personality Schedule, currently used as an outcome measure in research and in clinical practice. The Hoyt analysis of variance yielded an internal consistency reliability coefficient of .90 on the first testing. The test-retest reliability…
Descriptors: Anxiety, College Students, Evaluation Methods, Personality Measures
Peer reviewedBalyeat, Ralph; Norman, Douglas – Reading Teacher, 1975
Research indicates that a special version of the cloze procedure is a reliable test of reading comprehension. (RB)
Descriptors: Cloze Procedure, Elementary Education, Reading Comprehension, Reading Research

Direct link
