Publication Date
| In 2026 | 3 |
| Since 2025 | 656 |
| Since 2022 (last 5 years) | 3157 |
| Since 2017 (last 10 years) | 7398 |
| Since 2007 (last 20 years) | 15036 |
Descriptor
| Test Reliability | 15028 |
| Test Validity | 10265 |
| Reliability | 9757 |
| Foreign Countries | 7137 |
| Test Construction | 4821 |
| Validity | 4191 |
| Measures (Individuals) | 3876 |
| Factor Analysis | 3822 |
| Psychometrics | 3520 |
| Interrater Reliability | 3124 |
| Correlation | 3039 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1326 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 251 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Canfield, Robert – 1979
Prompted by the need to develop improved tools for describing and assessing teachers' skills in teaching reading, a study was undertaken to examine the effectiveness of a system for classifying and tallying teacher-pupil verbal interaction during classroom instruction. Specifically, the study investigated the extent of agreement among college…
Descriptors: Educational Research, Elementary Education, Evaluation Methods, Interaction
Saunders, Joseph C.; Huynh, Huynh – 1980
In most reliability studies, the precision of a reliability estimate varies inversely with the number of examinees (sample size). Thus, to achieve a given level of accuracy, some minimum sample size is required. An approximation for this minimum size may be made if some reasonable assumptions regarding the mean and standard deviation of the test…
Descriptors: Cutting Scores, Difficulty Level, Error of Measurement, Mastery Tests
Mitchell, Thomas E. – 1980
Kelley's cube model of attributions (1967) can be applied to moral judgments to predict how individuals arrive at attributions concerning dispositional or environmental causes. The relative contributions of the three dimensions of Kelley's cube to attributions of morality and trustworthiness were tested by presenting 37 male and 77 female subjects…
Descriptors: Adults, Attribution Theory, Behavior Standards, Behavioral Science Research
Hunter, Lyman R.; And Others – 1977
The purpose of this guide is to provide a short, ready source of information about the process and problems of the diagnostic assessment of preschool children. Formal test instruments frequently used in the assessment process are listed by category. Detailed discussions of the tests and useful information the classroom teacher can gain from the…
Descriptors: Diagnostic Teaching, Diagnostic Tests, Early Childhood Education, Educational Diagnosis
McDaniel, Ernest D.; Leddick, George R. – 1978
The Young Children's Self-Concept Scale, a 40-item revision of the Piers-Harris Children's Self Concept Scale, was designed specifically for young children. Its validity was investigated by comparing self concept scores to teachers' ratings of self concept. The sample included twenty teachers and 459 students in grades 1-4. Factor analyses were…
Descriptors: Age Differences, Elementary Education, Factor Structure, Research Reports
National Education Association, Washington, DC. Project on Utilization of Inservice Education R & D Outcomes. – 1977
The learning module described is for elementary or secondary level teachers who wish to improve their understanding of the meaning of test reliability and validity. The contents of the module are outlined, and activities and resources involved in its use are described. Ordering information on materials is provided, and a critique of the module is…
Descriptors: Elementary Secondary Education, Inservice Teacher Education, Instructional Materials, Learning Modules
Garfunkel, Frank – 1967
There are reasons why teaching behavior should be assessed, including (1) upgrading teacher education, (2) gaining insights into the learning of both teachers and children, and (3) studying social interactions. Two means of assessing teacher ability are quantification of teacher behavior by the use of rating scales, behavioral categories, etc.,…
Descriptors: Behavior Rating Scales, Classroom Research, Evaluation, Measurement Techniques
O'Connor, William J. – 1968
The relationship between the Bender-Gesalt Test was studied using the Koppitz Developmental Scoring System and the Marianne Frostig Developmental Test of Visual Perception in terms of age, sex, IQ, and socioeconomic status. A relationship to the Harrison Reading Readiness Test was also explored. Subjects were 89 first- and second-grade children…
Descriptors: Age, Grade 1, Grade 2, Intelligence Differences
Petrosko, Joseph M. – 1977
Three hundred-fifty-two standardized tests of reading comprehension and 373 standardized vocabulary measures were analyzed in terms of a number of criteria related to psychometric quality and educational ability. The criteria were based primarily on the Standards for Educational and Psychological Tests developed by the American Psychological…
Descriptors: Evaluation, Evaluation Criteria, Norms, Predictive Validity
Hisama, Kay Keiko Washiya – 1976
This study presents a new version of the cloze procedure used as a placement test for foreign students enrolled in an English language program for non-native speakers. Called the New Cloze Test (NCT), the test was administered to 136 foreign students who were beginning college students and who had not been in the United States longer than one…
Descriptors: Cloze Procedure, Doctoral Dissertations, English (Second Language), Foreign Students
Heun, Richard E.; And Others
The differences between edumetric and psychometric uses of tests were described and the relevance of the edumetric dimension for measuring student learning gains, especially in the context of individualized instruction involving multiple learning mode options, was clarified. Also, the procedures for edumetric reliability and validation assessment…
Descriptors: Achievement Gains, Cognitive Measurement, Comparative Analysis, Higher Education
Howell, John F.
After a survey of existing behavioral measures was made, a behavior rating scale was developed to measure the observable disruptive behavior of emotionally disturbed children in the classroom. Estimates of various types of reliability were calculated, and scale validity was examined. The scale was used to evaluate the effect of counseling on…
Descriptors: Behavior Change, Behavior Rating Scales, Classrooms, Counseling Effectiveness
Cliff, Norman – 1975
Measures of consistency and completeness of order relations derived from test-type data are proposed. The measures are generalized to apply to incomplete data such as tailored testing. The measures are based on consideration of the items-plus-persons by items-plus-persons matrix as an adjacency matrix in which a 1 means that the row element…
Descriptors: Adaptive Testing, Career Development, Computer Oriented Programs, Individual Differences
Ramsey-Klee, Diane M.; Richman, Vivian – 1975
The purpose of this research is to develop content analytic techniques capable of extracting the differentiating information in narrative performance evaluations for enlisted personnel in order to aid in the process of selecting personnel for advancement, duty assignment, training, or quality retention. Four tasks were performed. The first task…
Descriptors: Classification, Comparative Analysis, Content Analysis, Discriminant Analysis
Kirk, Samuel A.; Elkins, John – 1974
Summarized are 68 research studies from 1970 to 1975 on the Revised Illinois Test of Psycholinguistic Abilities (ITPA), particularly as it relates to learning disabilities. The reviews have been organized by the following areas (the number of studies in each section and sample study topics are in parentheses): studies comparing the experimental…
Descriptors: Academic Achievement, Educational Diagnosis, Exceptional Child Education, Learning Disabilities


