Publication Date
| In 2026 | 6 |
| Since 2025 | 481 |
| Since 2022 (last 5 years) | 1960 |
| Since 2017 (last 10 years) | 4532 |
| Since 2007 (last 20 years) | 7017 |
Descriptor
| Test Reliability | 15055 |
| Test Validity | 10022 |
| Test Construction | 4374 |
| Foreign Countries | 3840 |
| Psychometrics | 2435 |
| Factor Analysis | 2302 |
| Measures (Individuals) | 1787 |
| Evaluation Methods | 1410 |
| Higher Education | 1391 |
| Questionnaires | 1264 |
| Factor Structure | 1249 |
Audience
| Researchers | 454 |
| Practitioners | 319 |
| Teachers | 128 |
| Administrators | 73 |
| Policymakers | 33 |
| Counselors | 31 |
| Students | 17 |
| Parents | 10 |
| Community | 6 |
| Support Staff | 5 |
Location
| Turkey | 840 |
| Australia | 239 |
| China | 211 |
| Canada | 207 |
| Indonesia | 163 |
| Spain | 131 |
| United States | 123 |
| United Kingdom | 121 |
| Germany | 112 |
| Taiwan | 108 |
| Netherlands | 103 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 2 |
| Meets WWC Standards with or without Reservations | 2 |
| Does not meet standards | 1 |
Akin, Ahmet; Cetin, Bayram – Educational Sciences: Theory and Practice, 2007
This study investigated the validity and reliability of the Turkish version of the Depression Anxiety Stress Scale (DASS). The sample of the study consisted of 590 university students, 121 English teachers and 136 emotionally disturbed individuals who sought treatment in various clinics and counseling centers. Factor loadings of the scale ranged…
Descriptors: Emotional Disturbances, Test Reliability, Measures (Individuals), English Teachers
Bean, Tammy; Mooijaart, A.; Eurelings-Bontekoe, Elisabeth; Spinhoven, Philip – Journal of Psychoeducational Assessment, 2007
The psychometric properties of the Dutch Teacher's Report Form (TRF) for teachers of Unaccompanied Refugee Minors (URM) were evaluated in this study. The teachers (n = 486) who participated received a Dutch TRF to report on the mental health of the unaccompanied minor. Hierarchical confirmative factor analysis and individual confirmatory factor…
Descriptors: Measures (Individuals), Teacher Surveys, Teachers, Observation
Erford, Bradley T.; Klein, Lauren – Educational and Psychological Measurement, 2007
The Slosson-Diagnostic Math Screener (S-DMS) was designed to help identify students in Grades 1 to 8 at risk for mathematics failure. Internal consistency, test-retest reliability, item analysis, decision efficiency, convergent validity, and factorial validity of all five levels of the S-DMS were studied using 20 independent samples of students…
Descriptors: Grade 1, Test Validity, Item Analysis, Test Reliability
Erford, Bradley T.; Balcom, Lindsey C.; Moore-Thomas, Cheryl – Measurement and Evaluation in Counseling and Development, 2007
This study provides preliminary analysis of reliability and validity of scores on the Screening Test for Emotional Problems, which was designed to identify students ages 5 to 18 years who are referred for wide-ranging emotional disturbances categorized under the Individuals With Disabilities Education Improvement Act (U.S. Department of Education,…
Descriptors: Emotional Problems, Disabilities, Test Validity, Screening Tests
Wang, Tianyou – 1996
In this paper, formulas for computing the weights that maximize the reliability of a test with multiple parts are derived using a congeneric model. A direct derivation for the three-part case and a two-step derivation for the n-part case are presented, and results for these two approaches are shown to be consistent for the three-part…
Descriptors: Computation, Equations (Mathematics), Matrices, Performance Based Assessment
Guthrie, John T.; And Others – 1994
Noting that the amount of reading students do is related to their reading achievement, this booklet presents an instrument designed to measure the amount and breadth of students' reading in and out of school. The first part of the booklet discusses the Reading Activity Inventory (RAI) and how it differs from other reading activity measures, uses…
Descriptors: Elementary Education, Evaluation Methods, Reading Ability, Reading Achievement
Fulton, Robert T.; And Others – Journal of Speech and Hearing Disorders, 1975 (peer reviewed)
Evaluated with 12 children (9- to 25-months-old) were the efficacy and reliability of auditory stimulus-response control training and assessment procedures. (Author/LS)
Descriptors: Auditory Tests, Exceptional Child Research, Hearing Impairments, Infants
Hay, Nancy M.; Stewart, Norman R. – Journal of Counseling Psychology, 1974 (peer reviewed)
This study determined internal consistency and test-retest reliability coefficients for the Willoughby Personality Schedule, currently used as an outcome measure in research and in clinical practice. The Hoyt analysis of variance yielded an internal consistency reliability coefficient of .90 on the first testing. The test-retest reliability…
Descriptors: Anxiety, College Students, Evaluation Methods, Personality Measures
Balyeat, Ralph; Norman, Douglas – Reading Teacher, 1975 (peer reviewed)
Research indicates that a special version of the cloze procedure is a reliable test of reading comprehension. (RB)
Descriptors: Cloze Procedure, Elementary Education, Reading Comprehension, Reading Research
Attali, Yigal – ETS Research Report Series, 2004
Contrary to common belief, reliability estimates of number-right multiple-choice tests are not inflated by speededness. Because examinees guess on questions when they run out of time, the responses to these questions show less consistency with the responses of other questions, and the reliability of the test will be decreased. The surprising…
Descriptors: Multiple Choice Tests, Timed Tests, Test Reliability, Guessing (Tests)
Winett, Richard A.; And Others – Developmental Psychology, 1975 (peer reviewed)
Descriptors: Basic Skills, Child Development, Day Care, Preschool Children
Horowitz, Frances Degen – 1987
Discussed are methodological aspects of three symposium papers on process approaches to individual differences in infancy. Fagan's (1987) research is viewed as an important contribution to the growing literature that demonstrates that process measures, that is, information processing behaviors, may provide a useful reflection of early to later…
Descriptors: Attention, Conference Papers, Individual Differences, Infant Behavior
Patience, Wayne; Auchter, Joan – 1988
A central aim in any assessment program is to ensure fair and stable scoring from administration to administration. When administrations are decentralized, not only in location, but in frequency and in logistical configuration, it is imperative to construct training, certifying, and monitoring systems that provide continuity between the original…
Descriptors: Equivalency Tests, Essay Tests, Scoring, Secondary Education
Littlefield, John H.; Troendle, G. Roger – 1987
The effect of different types of rating task instructions on rater behavior was examined using experts, as opposed to novices, as raters. The experts were instructed to (1) form a global categorical judgment (early hypothesis generation); (2) assess 19 detailed elements; or (3) both. Subjects were 8 dental faculty members who ranged in age from 28…
Descriptors: Dentistry, Evaluation Methods, Higher Education, Holistic Evaluation
St. Louis, Kenneth O.; Ruscello, Dennis M. – 1981
Although speech-language pathologists are expected to be able to administer and interpret oral examinations, there are currently no screening tests available that provide careful administration instructions and data for intra-examiner and inter-examiner reliability. The Oral Speech Mechanism Screening Examination (OSMSE) is designed primarily for…
Descriptors: Physiology, Screening Tests, Speech Evaluation, Speech Pathology

