Publication Date
| In 2026 | 3 |
| Since 2025 | 656 |
| Since 2022 (last 5 years) | 3157 |
| Since 2017 (last 10 years) | 7398 |
| Since 2007 (last 20 years) | 15036 |
Descriptor
| Test Reliability | 15028 |
| Test Validity | 10265 |
| Reliability | 9757 |
| Foreign Countries | 7137 |
| Test Construction | 4821 |
| Validity | 4191 |
| Measures (Individuals) | 3876 |
| Factor Analysis | 3822 |
| Psychometrics | 3520 |
| Interrater Reliability | 3124 |
| Correlation | 3039 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1326 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 251 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Peer reviewedHanton, Samuel D.; Ryan, Julie B. – Journal of Optometric Education, 1986
A study of the reliability of a computer-assisted test of optometric clinical diagnostic skills that uses actual patient cases revealed that the test is most useful when used in conjunction with subjective clinical grading for evaluating problem-solving skills. (MSE)
Descriptors: Clinical Diagnosis, Computer Assisted Testing, Higher Education, Medical Case Histories
Peer reviewedWhitman, Barbara; Zachary, Robert A. – Educational and Psychological Measurement, 1986
Roth's Mother Child Relationship Evaluation was administered to 54 mothers and 20 fathers of children aged 3 to 11. The underlying dimensions--acceptance, overprotection, overindulgence, and rejection--were also assessed. Results suggested the need for both revision and renorming of the instrument. (Author/GDC)
Descriptors: Developmental Disabilities, Elementary Education, Factor Structure, Fathers
Peer reviewedConger, Anthony J.; And Others – Educational and Psychological Measurement, 1983
An investigation of the Conners' Teacher Rating Scale-Revised hyperactivity scale found that the referents for teacher ratings should be determined, teachers' ratings should be made more objective, standardization across teachers should be demonstrated before norms are preferred, and the rating scale should be validated via observations or other…
Descriptors: Behavior Rating Scales, Classroom Observation Techniques, Generalizability Theory, Hyperactivity
Peer reviewedAngoff, William H.; Schrader, William B. – Journal of Educational Measurement, 1984
The reported data provide a basis for evaluating the formula-scoring versus rights-scoring issue and for assessing the effects of directions on the reliability and parallelism of scores for sophisticated examinees taking professionally developed tests. Results support the invariance hypothesis rather than the differential effects hypothesis.…
Descriptors: College Entrance Examinations, Guessing (Tests), Higher Education, Hypothesis Testing
Peer reviewedAllison, Donald E. – Alberta Journal of Educational Research, 1984
Reports that no significant difference in reliability appeared between a heterogeneous and a homogeneous form of the same general science matching-item test administered to 316 sixth-grade students but that scores on the heterogeneous form of the test were higher, independent of the examinee's sex or intelligence. (SB)
Descriptors: Comparative Analysis, Comparative Testing, Elementary Education, Grade 6
Peer reviewedRogosa, David; And Others – Journal of Educational Psychology, 1984
Using observational data on classroom teachers, statistical procedures are presented for studying two questions on the stability of teacher behavior over time: (1) Is the individual teacher's behavior consistent? and (2) Are individual differences among teachers consistent? Approaches and methods of previous temporal stability studies are…
Descriptors: Classroom Observation Techniques, Elementary Secondary Education, Individual Differences, Mathematical Models
Peer reviewedHudson, Thom; Lynch, Brian – Language Testing, 1984
Presents approaches to test development analysis, reliability, and validity based on criterion-referenced measurement principles and compares them with norm-referenced approaches in terms of the types of decisions that result from either approach. This is done by using data from an English-as-a-second-language achievement testing project at the…
Descriptors: Achievement Tests, Criterion Referenced Tests, English (Second Language), Norm Referenced Tests
Peer reviewedSheret, Michael – Comparative Education Review, 1984
Addresses applications of the coefficient of variation as a measure of educational inequality or as a means of measuring changes of inequality status. Suggests the Gini coefficient has many advantages over the coefficient of variation since it can be used with the Lorenz curve (Lorenz provides detail Gini omits). (BRR)
Descriptors: Analysis of Variance, Comparative Analysis, Comparative Education, Data Analysis
American School Board Journal, 1985
A school administrator in 1928 expressed a favorable opinion of standardized intelligence and achievement tests, based on research findings correlating test scores with student achievement. Quotations from his article reveal how little the controversy over tests has changed in the intervening years. (TE)
Descriptors: Achievement Tests, Aptitude Tests, Educational History, Educational Trends
Peer reviewedMarsh, Herbert W. – Journal of Educational Psychology, 1984
Findings and research designs used to study university students' evaluations of teaching effectiveness are reviewed. A construct validation approach which recognizes the multidimensionality of both effective teaching and students' evaluations is recommended for further research. (BS)
Descriptors: College Faculty, Evaluation Criteria, Evaluation Methods, Evaluation Utilization
Peer reviewedWisxon, Stanton E. – Reading Teacher, 1985
Notes that, while the test is not recommended for children at the lower and extreme upper levels, it does provide a valid and reliable measure of some of the abilities that may be involved in the reading behaviors of five- and six-year-old children. (FL)
Descriptors: Early Reading, Preschool Children, Primary Education, Reading Ability
Shrock, Sharon A.; Foshay, Wellesley R. – Performance and Instruction, 1984
Discusses methods of sampling the best information from instruction/training developers/candidates for professional certification and examines the problems of interpreting that information and making classification decisions. Assessment strategies including criterion-referenced, multiple-choice, short answer, and essay questions, and portfolio…
Descriptors: Certification, Competence, Criterion Referenced Tests, Instructional Development
Peer reviewedHelfeldt, John P. – Reading Teacher, 1984
Concludes that the Brigance Screen is a well-organized criterion-referenced test designed to assist in the early identification of individuals who need further testing. (FL)
Descriptors: Criterion Referenced Tests, Early Identification, Grade 1, Kindergarten
Peer reviewedBristow, Page Simpson; And Others – Reading Teacher, 1983
Reports that reading instructional level scores of a teacher constructed informal reading inventory, the commercially prepared Basic Reading Inventory, the Metropolitan Achievement Test, and students' actual level of placement in books are roughly comparable but that the Wide Range Achievement Test reading subtest places children much higher. (FL)
Descriptors: Comparative Analysis, Elementary Education, Informal Reading Inventories, Readability
Peer reviewedVon Bargen, Donna M. – Merrill-Palmer Quarterly, 1983
Reviews literature on children up to age one to determine whether heart rate (HR) is a reliable, stable, and valid measure. Considers factors influencing resting HR and the Law of Initial Values, discusses information about response to stimulation and infant perception and affect obtained by using HR measures, and describes HR use in risk…
Descriptors: Affective Behavior, Conditioning, Heart Rate, High Risk Persons


