Publication Date
| In 2026 | 2 |
| Since 2025 | 462 |
| Since 2022 (last 5 years) | 1941 |
| Since 2017 (last 10 years) | 4513 |
| Since 2007 (last 20 years) | 6998 |
Descriptor
| Test Reliability | 15036 |
| Test Validity | 10004 |
| Test Construction | 4369 |
| Foreign Countries | 3831 |
| Psychometrics | 2428 |
| Factor Analysis | 2301 |
| Measures (Individuals) | 1785 |
| Evaluation Methods | 1410 |
| Higher Education | 1391 |
| Questionnaires | 1261 |
| Factor Structure | 1248 |
Audience
| Researchers | 454 |
| Practitioners | 319 |
| Teachers | 128 |
| Administrators | 73 |
| Policymakers | 33 |
| Counselors | 31 |
| Students | 17 |
| Parents | 10 |
| Community | 6 |
| Support Staff | 5 |
Location
| Turkey | 838 |
| Australia | 239 |
| China | 211 |
| Canada | 207 |
| Indonesia | 162 |
| Spain | 129 |
| United States | 123 |
| United Kingdom | 121 |
| Germany | 112 |
| Taiwan | 108 |
| Netherlands | 102 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 2 |
| Meets WWC Standards with or without Reservations | 2 |
| Does not meet standards | 1 |
Goodstein, H. A. – 1982
The proposed standard for judging proficiency test score reliability requires that the proportion of items passed for each objective assessed be a dependable estimate of the universe score for the domain strata established by the objective. Domain breadth is the focusing issue. Data from a field trial of the Tennessee Proficiency Test are analyzed…
Descriptors: Basic Skills, Criterion Referenced Tests, Educational Testing, Elementary Secondary Education
Thomas, Sandra P.; And Others – 1982
The most common approach to self-management research has been to apply it to a specific target behavior, without attending to the generalizability of changes to other facets of one's life. A procedure for measuring self-management effectiveness under real world conditions was developed which emphasized the successful application of self-change…
Descriptors: Age Differences, Behavior Modification, Behavior Patterns, College Students
Winters, Lynn; And Others – 1980
This text is designed to accompany workshop instruction in student assessment technique for the purpose of evaluating bilingual programs. It is addressed to those involved with bilingual programs generally, educators directly involved in planning and implementation, and research and evaluation specialists. The contextual variables that affect the…
Descriptors: Bilingual Education, Bilingual Students, Educational Planning, Elementary Secondary Education
Fuchs, Lynn S.; And Others – 1982
The effects of aggregation on the reliability of measures of academic performance were explored in two studies. In the first study, 30 elementary-age children were tested four times on the same forms of the Woodcock Reading Mastery Tests and the Ginn 720 Reading Passage measures. Group stability coefficients, within-subject reliability…
Descriptors: Academic Achievement, Elementary Education, Evaluation Methods, Measurement Objectives
Bliss, Leonard B. – 1982
A sample of slightly over 1,500 students was drawn from even-numbered grades in public schools of the U.S. Virgin Islands and was given the 1973 edition of the Stanford Achievement Test (grades 2, 4, 6, and 8) and the Test of Academic Skills (grades 10 and 12) to assess student academic achievement in the basic skill areas of mathematics,…
Descriptors: Achievement Tests, Basic Skills, Data Analysis, Educational Assessment
Maring, Gerald H. – 1983
Noting that use of the reading-related components of the National Assessment of Educational Progress (NAEP) by state education agencies has ranged from extensive to moderate to limited, this paper presents case studies of the ways in which states have used the NAEP models. The first half of the paper describes extensive use by Minnesota and…
Descriptors: Case Studies, Educational Assessment, Elementary Secondary Education, Models
Cunningham, J. W.; And Others – 1975
A study was done to explore the feasibility of developing a set of activity preference (interest) scales corresponding to twenty-two second (higher) order work dimensions derived from the Occupation Analysis Inventory (OAI). (The OAI is an instrument containing 622 work elements which are descriptions of work activities and conditions on which…
Descriptors: Employment, Employment Qualifications, Feasibility Studies, Interest Inventories
Subkoviak, Michael J. – 1977
Four different procedures were used for estimating the proportion of persons who would be classified consistently as either passing both of two parallel tests or failing both. These four methods were applied at each of four different mastery level scores for each of three different length tests. Data were based on 50 replications of each procedure…
Descriptors: Criterion Referenced Tests, Cutting Scores, Data Analysis, Data Collection
Horowitz, Frances Degen – 1977
This paper discusses issues connected with the reliability of the Neonatal Behavioral Assessment Scale (NBAS) in terms of behavior prediction, neonatal behavioral organization and stability, and consequent implications for study of newborns. Discussion focuses on: (1) reliability, and (2) prediction and neonatal assessment. The NBAS is seen as a…
Descriptors: Behavior Development, Behavior Rating Scales, Child Development, Environmental Influences
Popham, W. James; Husek, T. R. – J Educ Meas, 1969
Research partially supported by the UCLA Center for the Study of Evaluation of Instructional Programs.
Descriptors: Criterion Referenced Tests, Evaluation Criteria, Item Analysis, Measurement
Brazee, Edward N. – 1981
The Language Arts Test of Cognitive Functioning (LATCF) was investigated to determine if the measure could give specific information about the thinking required for language arts tasks. The LATCF was developed by the author and consists of six anecdotes or tasks that the student must solve or complete. The anecdotes are built on six functions that…
Descriptors: Adolescent Development, Cognitive Processes, Cognitive Style, Cognitive Tests
Mets, Jan – 1981
Oral proficiency tests were constructed for French, German, and English by the Dutch National Institute for Educational Measurement to determine how well students could cope linguistically with daily living situations, after 3 years of foreign language instruction. Originally the rating scale featured six categories. Because of difficulties in…
Descriptors: Achievement Rating, English (Second Language), Foreign Countries, French
Michopoulos, Aristotle – 1981
This paper addresses the lack of language dominance assessment instruments and curriculum materials for Greek-speaking children in the U.S. These children need appropriate language screening tests, based on research and data derived from their native language group, for diagnostic and placement purposes. The development of an instrument for…
Descriptors: Bilingual Education, Bilingual Students, Diagnostic Tests, Elementary Education
Miskel, Cecil; And Others – 1981
To assess structural coupling in schools, investigators must first have measures with established reliability and validity levels. Structural coupling refers to the mechanisms and norms in organizations that influence interactions among individuals. For three structural coupling measurement techniques--participant observation, interviews, and…
Descriptors: Elementary Secondary Education, Interprofessional Relationship, Interviews, Measurement Techniques
Robertson, David W.; And Others – 1977
A comparative study of item analysis was conducted on the basis of race to determine whether alternative test construction or processing might increase the proportion of black enlisted personnel among those passing various military technical knowledge examinations. The study used data from six specialists at four grade levels and investigated item…
Descriptors: Difficulty Level, Enlisted Personnel, Item Analysis, Occupational Tests


