[Peer reviewed] DeSanti, Roger J.; Sullivan, Vicki Gallo – Reading Psychology, 1984
Concludes that the Cloze Reading Inventory and its coding form can be reliably employed by a variety of teachers for a variety of grade levels and passages. (FL)
Descriptors: Cloze Procedure, Elementary Secondary Education, Interrater Reliability, Reading Comprehension
[Peer reviewed] Lee, Dong Yul; And Others – Journal of Clinical Psychology, 1985
Developed a 33-item, situation-specific instrument that measures assertiveness of adolescents. Based on data from 682 elementary and secondary school students, adequate reliability and validity of the Assertiveness Scale for Adolescents (ASA) were obtained when tested against several variables about which predictions could be made. (BH)
Descriptors: Adolescents, Assertiveness, Elementary Secondary Education, Foreign Countries
[Peer reviewed] Quereshi, M. Y.; And Others – Journal of Clinical Psychology, 1984
Administered the Wechsler Intelligence Scale for Children, Wechsler Intelligence Scale for Children-Revised, and Wechsler Preschool and Primary Scale of Intelligence in a counterbalanced design to randomly selected elementary school children (N=72). Results indicated that the verbal Intelligence Quotients (IQs) were comparable, but the performance and…
Descriptors: Comparative Testing, Elementary Education, Elementary School Students, Intelligence Tests
[Peer reviewed] Willson, Victor L.; Reynolds, Cecil R. – Educational and Psychological Measurement, 1984
Samples in research on individual and group differences may be selected based on whole scores which differ from the population mean. Children are diagnosed in clinical practice with a whole score. These procedures produce regression to the population mean which can affect accuracy and adequacy of part score interpretations. (Author/DWH)
Descriptors: Correlation, Intelligence Tests, Profiles, Scores
[Peer reviewed] Brown, Linda; Bryant, Brian R. – Remedial and Special Education (RASE), 1984
The article reviews Consumer's Guide to Tests in Print, noting its purposes (to provide objective information about technical characteristics of standardized tests); criteria for evaluating standardization, reliability, and validity; and its rating system based on evaluations of selected review panel members. (CL)
Descriptors: Elementary Secondary Education, Standardized Tests, Test Construction, Test Reliability
[Peer reviewed] Bryson, Susan E.; Pilon, David J. – Journal of Clinical Psychology, 1984
Carried out four experiments in which male and female undergraduates (N=384) completed the Beck Depression Inventory under conditions ranging from absolute anonymity to a face-to-face interview. Results showed no evidence that depression is more severe or common in females. Responses appeared essentially unaffected by method of administration.…
Descriptors: College Students, Depression (Psychology), Foreign Countries, Higher Education
[Peer reviewed] Hale, Gordon A.; And Others – Language Learning, 1983
Addresses the issue of whether test scores are affected by the prior availability of the items on a test. Concludes that, while disclosing items significantly affects test scores, the magnitude of the disclosure effect decreases with an increase in the size of the disclosed pool. (EKN)
Descriptors: English (Second Language), Language Tests, Scores, Second Language Learning
[Peer reviewed] O'Donnell, Michael P.; Wood, Margo – Journal of Reading, 1984
Concludes that The London Procedure does not reflect contemporary research in the fields of literacy acquisition and learning disabilities. (AEA)
Descriptors: Adult Basic Education, Adult Literacy, Reading Diagnosis, Test Reliability
[Peer reviewed] Dowaliby, Fred J.; And Others – American Annals of the Deaf, 1983
The Locus of Control Inventory for the Deaf (LCID), consisting of a 23-item Likert-like scale, and two commonly used scales to assess locus of control in hearing persons were administered to 174 deaf freshman students. Intercorrelation findings demonstrated greater soundness of the LCID as compared with the other scales. (Author)
Descriptors: Correlation, Deafness, Locus of Control, Measures (Individuals)
[Peer reviewed] Hopkins, Kenneth D. – Journal of Special Education, 1983
This article illustrates the use of generalizability theory in special education to estimate the reliability of a measure when there is more than one source of error in the universe of inference and how the effects from changing the number of items and/or raters can be evaluated. (Author)
Descriptors: Generalization, Item Analysis, Mathematics, Research Methodology
[Peer reviewed] Wood, R.; Quinn, B. – Educational Review, 1976
Impression marking of English Language essay and summary questions by pairs of examiners is shown, as expected, to be more reliable than single marking. Given the limited statistical information available, it is concluded that pairing of examiners can as well be done by random or quasi-random means as by attempts at calculated matching. (Editor/RK)
Descriptors: Bias, Educational Research, Essay Tests, Examiners
[Peer reviewed] Schwab, Donald P.; And Others – Personnel Psychology, 1975
Recently, an evaluation procedure, behaviorally anchored rating scales (BARS), has been developed that attempts to capture performance in multidimensional, behavior-specific terms. Article reviews and evaluates the research on BARS and suggests new directions for future research. (Author/RK)
Descriptors: Behavior Rating Scales, Performance Criteria, Psychological Studies, Tables (Data)
Zhang, Yanwei; Breithaupt, Krista; Tessema, Aster; Chuah, David – Online Submission, 2006
Two IRT-based procedures to estimate test reliability for a certification exam that used both adaptive (via an MST model) and non-adaptive designs were considered in this study. Both procedures rely on calibrated item parameters to estimate error variance. In terms of score variance, one procedure (Method 1) uses the empirical ability distribution…
Descriptors: Individual Testing, Test Reliability, Programming, Error of Measurement
Colton, Dean A.; Gao, Xiaohong; Harris, Deborah J.; Kolen, Michael J.; Martinovich-Barhite, Dara; Wang, Tianyou; Welch, Catherine J. – 1997
This collection consists of six papers, each dealing with some aspects of reliability and performance testing. Each paper has an abstract, and each contains its own references. Papers include: (1) "Using Reliabilities To Make Decisions" (Deborah J. Harris); (2) "Conditional Standard Errors, Reliability, and Decision Consistency…
Descriptors: Decision Making, Error of Measurement, Item Response Theory, Performance Based Assessment
Thompson, Bruce – 1998
After presenting a general linear model as a framework for discussion, this paper reviews five methodology errors that occur in educational research: (1) the use of stepwise methods; (2) the failure to consider in result interpretation the context specificity of analytic weights (e.g., regression beta weights, factor pattern coefficients,…
Descriptors: Educational Research, Effect Size, Research Methodology, Scores


