Publication Date
| In 2026 | 3 |
| Since 2025 | 666 |
| Since 2022 (last 5 years) | 3167 |
| Since 2017 (last 10 years) | 7408 |
| Since 2007 (last 20 years) | 15046 |
Descriptor
| Test Reliability | 15036 |
| Test Validity | 10272 |
| Reliability | 9759 |
| Foreign Countries | 7141 |
| Test Construction | 4823 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3525 |
| Interrater Reliability | 3124 |
| Correlation | 3039 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1327 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 252 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Haladyna, Thomas M.; Downing, Steven M. – 1988
The proposition that the optimal number of options in a multiple choice test item is three was examined. The concept of functional distractor, a plausible wrong answer that is negatively discriminating when total test performance is the criterion, is discussed. Three distinct groups of achievers (high, middle, and low) on a national standardized…
Descriptors: Achievement Tests, Item Analysis, Multiple Choice Tests, Physicians
Dana, Richard H.; And Others – 1986
Three standard assessment instruments (Rorschach, Millon Clinical Multiaxial Inventory and 16PF) were administered to 12 participating Rosebud Sioux Indians--6 males, 6 females. Reports were generated for each instrument. Consensual and unique concepts contained in all the reports were analyzed in order to describe the contents. Six judges, all…
Descriptors: American Indians, Cultural Differences, Objective Tests, Psychological Studies
Tollefson, Nona; Chung, Jing-Mei – 1986
Procedures for correcting for guessing and for assessing partial knowledge (correction-for-guessing, three-decision scoring, elimination/inclusion scoring, and confidence or probabilistic scoring) are discussed. Mean scores and internal consistency reliability estimates were compared across three administration and scoring procedures for…
Descriptors: Achievement Tests, Comparative Analysis, Evaluation Methods, Graduate Students
Brown, Cheryl – 1984
Two methods of researching second language learning are described, analyzed, and compared: student diaries giving a first-person account of the second language learning experience; and participant observation, in which the observer is in the language learning situation recording in descriptive terms all possible data about the situation.…
Descriptors: Attitude Measures, Comparative Analysis, Diaries, English (Second Language)
Bliss, Leonard B. – 1984
A model for the validation of standardized tests of academic achievement upon populations not represented in the samples used to standardize the tests is presented, and the results of a field testing of the model are described. The 1973 editions of the Stanford Achievement Test and the Test of Academic Skills were administered to a sample of…
Descriptors: Achievement Tests, Basic Skills, Elementary Secondary Education, Item Analysis
Holmes, Susan E.; Doody-Bogan, Evelyn N. – 1983
The accuracy of trait estimates obtained from three vertical equating methods was examined. The procedures studied included two anchor test designs and a single-group design. Data from two content areas and two grade combinations were studied. A three-parameter logistic model was used to perform the equatings. The results obtained were used to…
Descriptors: Achievement Tests, Equated Scores, Estimation (Mathematics), Latent Trait Theory
Fuchs, Lynn S.; And Others – 1983
The purposes of this study were to assess the usefulness of a variety of readability formulas in predicting the relative difficulty of passages, and to explore the contribution of pupils' background to text difficulty. Subjects were 285 special education students in grades 1-9, 117 of whom were based in rural and suburban Minnesota (MN) and 168 of…
Descriptors: Difficulty Level, Elementary Secondary Education, Predictive Measurement, Readability Formulas
Francis, Alexandria S.; Holmes, Susan E. – 1983
Discrepancies among the standards produced by different criterion-referenced standard-setting techniques may be the result of a failure to adequately define the minimally competent candidate. Current research in this area is reviewed in terms of three categories: studies in which no formal assistance in conceptualization is given to judges,…
Descriptors: Certification, Criterion Referenced Tests, Cutting Scores, Interrater Reliability
Rapaport, Ross J. – 1982
This investigation of survey research methodology examined the satisfaction differences between anonymous and identified respondents to a client satisfaction survey from a university counseling center. A questionnaire based on the Counseling Services Assessment Blank was mailed to 410 students terminated from counseling during the 1980-81 academic…
Descriptors: Attitude Measures, Counseling Services, Evaluation Methods, Higher Education
Bethscheider, Janine K. – 1988
An experimental test battery designed to measure several perceptual abilities was administered to 1,368 (51.8% male) paying clients of the Johnson O'Connor Research Foundation (JOCRF) in an effort to identify and measure three perceptual abilities: (1) flexibility of closure; (2) speed of closure; and (3) spatial scanning. Subjects, who ranged in…
Descriptors: Adolescents, Adults, Cognitive Processes, Perception
PDF pending restorationDaiker, Donald A.; Grogan, Nedra – 1985
The role of sample papers (i.e., anchor papers, prototypes, range-finders) in holistic evaluation of writing is discussed. When, where, and how many sample papers are to be selected, and who should perform the selection are covered. The process of sample selection should proceed as follows: (1) a general reading of papers by committee members to…
Descriptors: Advanced Placement, Essay Tests, Evaluators, Higher Education
Roberts, Clare; Pratt, Chris – 1988
This paper investigated the reliability and construct validity of a 30-item scale in measuring the attitudes of teachers in Australia toward the integration of handicapped children into regular schools. The Attitude Toward Mainstreaming Scale, which was designed by Larrivee and Cook (1979), has been used in evaluation studies in the United Sates…
Descriptors: Attitude Measures, Construct Validity, Disabilities, Elementary Education
Breland, Hunter M.; Jones, Robert J. – 1988
The reliability, validity, and score discrepancies of 94 expository essays scored in conference versus remote settings were studied. Focus was on comparing holistic ratings obtained in both settings. Essays written by college freshmen on two different topics were scored by readers working in a conference setting and by different readers working in…
Descriptors: College Freshmen, Comparative Analysis, Conferences, Essay Tests
Karchmer, Michael A.; Allen, Thomas E. – 1984
The final report describes the accomplishments of an 18-month study designed to adapt and standardize the 7th Edition of the Stanford Achievement Test with a national, randomly drawn sample of hearing-impaired students. The following objectives were accomplished: (1) test material and special procedures were developed and disseminated; (2) the…
Descriptors: Achievement Tests, Elementary Secondary Education, Hearing Impairments, Test Construction
Halpin, Gerald; Simpson, Robert – 1986
Forty adolescent subjects with behavior problems at home or at school were administered the Woodcock Reading Mastery Tests. Subjects ranged in age from 12 to 18, and included 25 males and 15 females. The Passage Comprehension Test for each subject was rescored using three different ceiling criteria: (1) five errors in six consecutive responses,…
Descriptors: Adolescents, Analysis of Variance, Behavior Problems, Raw Scores


