Publication Date
| In 2026 | 3 |
| Since 2025 | 656 |
| Since 2022 (last 5 years) | 3157 |
| Since 2017 (last 10 years) | 7398 |
| Since 2007 (last 20 years) | 15036 |
Descriptor
| Test Reliability | 15028 |
| Test Validity | 10265 |
| Reliability | 9757 |
| Foreign Countries | 7137 |
| Test Construction | 4821 |
| Validity | 4191 |
| Measures (Individuals) | 3876 |
| Factor Analysis | 3822 |
| Psychometrics | 3520 |
| Interrater Reliability | 3124 |
| Correlation | 3039 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1326 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 251 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Zdorkowski, Richard Todd – 1985
A project was conducted to develop and test three criterion-referenced achievement tests for use in Oklahoma's Dietetic Support Personnel training program. The training program is a competency-based course of instruction that is designed to prepare food service and production workers for employment in Oklahoma's health care industry. Initial…
Descriptors: Achievement Tests, Allied Health Personnel, Criterion Referenced Tests, Dietetics
Moss, Jeffrey W.; Briers, Gary E. – 1982
A study examined the relationship of attitudes of vocational student teachers to their plans to teach. Also investigated during the study was an alternative method of scoring the Purdue Student-Teacher Opinionnaire. (The Purdue Student-Teacher Opinionnaire is a 60-item questionnaire that yields factor scores pertaining to the following areas:…
Descriptors: Agricultural Education, Attitude Measures, Career Choice, Educational Facilities
Cason, Gerald J.; And Others – 1983
Prior research in a single clinical training setting has shown Cason and Cason's (1981) simplified model of their performance rating theory can improve rating reliability and validity through statistical control of rater stringency error. Here, the model was applied to clinical performance ratings of 14 cohorts (about 250 students and 200 raters)…
Descriptors: Clinical Experience, Error of Measurement, Evaluation Methods, Higher Education
Goodstein, H. A. – 1982
The proposed standard for judging proficiency test score reliability requires that the proportion of items passed for each objective assessed be a dependable estimate of the universe score for the domain strata established by the objective. Domain breadth is the focusing issue. Data from a field trial of the Tennessee Proficiency Test are analyzed…
Descriptors: Basic Skills, Criterion Referenced Tests, Educational Testing, Elementary Secondary Education
Thomas, Sandra P.; And Others – 1982
The most common approach to self-management research has been to apply it to a specific target behavior, without attending to the generalizability of changes to other facets of one's life. A procedure for measuring self-management effectiveness under real world conditions was developed which emphasized the successful application of self-change…
Descriptors: Age Differences, Behavior Modification, Behavior Patterns, College Students
Winters, Lynn; And Others – 1980
This text is designed to accompany workshop instruction in student assessment technique for the purpose of evaluating bilingual programs. It is addressed to those involved with bilingual programs generally, educators directly involved in planning and implementation, and research and evaluation specialists. The contextual variables that affect the…
Descriptors: Bilingual Education, Bilingual Students, Educational Planning, Elementary Secondary Education
Fuchs, Lynn S.; And Others – 1982
The effects of aggregation on the reliability of measures of academic performance were explored in two studies. In the first study, 30 elementary-age children were tested four times on the same forms of the Woodcock Reading Mastery Tests and the Ginn 720 Reading Passage measures. Group stability coefficients, within-subject reliability…
Descriptors: Academic Achievement, Elementary Education, Evaluation Methods, Measurement Objectives
Bliss, Leonard B. – 1982
A sample of slightly over 1500 students was drawn from even-numbered grades in public schools of the U.S. Virgin Islands, and was given the 1973 edition of the Stanford Achievement Test (in grades 2,4,6, & 8) and the Test of Academic Skills (grades 10 and 12) to assess student academic achievement in the basic skill areas of mathematics,…
Descriptors: Achievement Tests, Basic Skills, Data Analysis, Educational Assessment
Jellema, William W.; Olliver, James – 1975
Uses of discipline and program cost data by Indiana private colleges and universities are discussed, and cost data are presented. Attention is directed to the immediate and longer range uses of the data, the applicability of the methods to smaller institutions, reliability of the data, and the future of this analysis. The findings were used for…
Descriptors: College Credits, College Programs, Comparative Analysis, Expenditure per Student
Maring, Gerald H. – 1983
Noting that use of the reading-related components of the National Assessment of Educational Progress (NAEP) by state education agencies has ranged from extensive to moderate to limited, this paper presents case studies of the ways in which states have used the NAEP models. The first half of the paper describes extensive use by Minnesota and…
Descriptors: Case Studies, Educational Assessment, Elementary Secondary Education, Models
Chan, Ke-Sheng – Asia-Pacific Forum on Science Learning and Teaching, 2005
This study attempts to determine whether there exists a negative interconnection between the creative and testable nature-of-science (NOS) conceptions in college students' conceptual ecology by investigating, through a pair of IHV-assisted teaching experiments, the effect of raising the status of each NOS conception in students' conceptual ecology…
Descriptors: Foreign Countries, College Students, Student Attitudes, Scientific Principles
Scanlan, Craig L. – Online Journal of Distance Learning Administration, 2003
U.S. universities and colleges offering distance education courses have increased immensely since 1998, and by 2004 it was expected that distance learners will constitute about 14% of all those enrolled in degree programs. In its preliminary review of distance learning, the Institute for Higher Education Policy (1998) emphasized the need for…
Descriptors: Distance Education, Higher Education, Universities, Program Evaluation
Stallings, Jane A.; Giesen, Philip A. – 1977
Observer reliability and the confusability of codes, two sources of error in the collection of classroom observational data, are examined. Confusability is defined as the extent to which one code is mistakenly recorded as another code. Observational data were collected in each of 172 first grade and 171 third grade Follow Through and comparison…
Descriptors: Bias, Classroom Observation Techniques, Classroom Research, Codification
Cunningham, J. W.; And Others – 1975
A study was done to explore the feasibility of developing a set of activity preference (interest) scales corresponding to twenty-two second (higher) order work dimensions derived from the Occupation Analysis Inventory (OAI). (The OAI is an instrument containing 622 work elements which are descriptions of work activities and conditions on which…
Descriptors: Employment, Employment Qualifications, Feasibility Studies, Interest Inventories
Subkoviak, Michael J. – 1977
Four different procedures were used for estimating the proportion of persons who would be classified consistently as either passing both of two parallel tests or failing both. These four methods were applied at each of four different mastery level scores for each of three different length tests. Data were based on 50 replications of each procedure…
Descriptors: Criterion Referenced Tests, Cutting Scores, Data Analysis, Data Collection

Peer reviewed
Direct link
