Publication Date
| In 2026 | 7 |
| Since 2025 | 690 |
| Since 2022 (last 5 years) | 3191 |
| Since 2017 (last 10 years) | 7432 |
| Since 2007 (last 20 years) | 15070 |
Descriptor
| Test Reliability | 15055 |
| Test Validity | 10290 |
| Reliability | 9763 |
| Foreign Countries | 7150 |
| Test Construction | 4828 |
| Validity | 4192 |
| Measures (Individuals) | 3880 |
| Factor Analysis | 3826 |
| Psychometrics | 3532 |
| Interrater Reliability | 3126 |
| Correlation | 3040 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1329 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 253 |
| Taiwan | 234 |
| Netherlands | 224 |
| Spain | 218 |
| California | 215 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Fagot, Beverly I.; Hagan, Richard – 1985
Covert checks of observational methodology reveal declines in reliability of observations. This appears to be particularly true when complex codes are used to track social interaction. The present study was undertaken to see whether reliability could be maintained through a combination of technological advancements and the development of improved…
Descriptors: Automation, Classroom Observation Techniques, Data Collection, Reliability
David, Jane L. – 1985
Three goals must be met in order for the National Center for Education Statistics (NCES) to improve the quality and utility of its data collection: (1) the choice of what to collect must be driven by the questions of interest to decisionmakers and the public; (2) procedures must insure validity and reliability of the data; and (3) the data must be…
Descriptors: Data Collection, Data Interpretation, Educational Research, Elementary Secondary Education
Woodruff, David J.; Sawyer, Richard L. – 1988
Two methods for estimating measures of pass-fail reliability are derived, by which both theta and kappa may be estimated from a single test administration. The methods require only a single test administration and are computationally simple. Both are based on the Spearman-Brown formula for estimating stepped-up reliability. The non-distributional…
Descriptors: Estimation (Mathematics), Licensing Examinations (Professions), Pass Fail Grading, Scores
Stelmachers, Zigfrids T.; Sherman, Robert E. – 1988
The clinical usefulness of various empirically derived suicide potential rating scales has been questioned by several suicidologists. This study used actual case histories in an attempt to anchor suicide risk ratings. Thirty-three brief case histories of suicidal patients were given to 19 experienced crisis workers for seven-point ratings of…
Descriptors: Clinical Diagnosis, Evaluation Criteria, Evaluation Methods, High Risk Persons
PDF pending restorationSpellman, Charles R.; And Others – 1982
The project was designed to develop an alternate testing method for the visual acuity assessment of preschool children with handicaps. Additional project objectives included evaluation and modification of existing experimental procedures for discrimination training and visual acuity testing of preschool handicapped children; establishment of a…
Descriptors: Infants, Preschool Education, Screening Tests, Test Construction
Lampe, Richard E. – 1984
This study examines the accuracy of the self-scoring efforts of 306 eighth-graders on the Kuder General Interest Survey (GIS), and suggests possible methods to improve self-scoring accuracy. The GIS is widely used to assist junior high school students with their educational and vocational planning. After the administration of the test by English…
Descriptors: Interest Inventories, Junior High Schools, Profiles, Scoring
Villanova, Robert M. – 1984
This paper reports on the development and refinement of the Connecticut School Effectiveness Questionnaire (CSEQ) and the Connecticut School Effectiveness Interview (CSEI), the primary data collection tools used in the Connecticut State Department of Education School Effectiveness Project. The primary purpose of both the CSEI and the CSEQ is to…
Descriptors: Elementary Secondary Education, Factor Structure, Institutional Characteristics, Interviews
PDF pending restorationRay, John J. – 1982
Projective measurement of achievement motivation can be achieved by using a large number of scales quite suitable for this purpose and related concepts. This paper is a guide to the literature on measuring achievement motivation rather than a detailed review of each scale. The author introduces his scale which he feels is unique because it…
Descriptors: Achievement Need, Forced Choice Technique, Measurement Techniques, Rating Scales
Jones, Paul L.; Young, Patricia – 1983
The initial form of the Coping with Death Scale consisted of 30 items designed to obtain responses arranged along a seven-point Likert-type scale. Each item on the scale was derived from personal responses of students who completed a death and dying seminar. The items appeared to fall into two categories: coping with self and coping with others.…
Descriptors: Death, Factor Analysis, Factor Structure, Higher Education
Sigmon, Gary L.; And Others – 1983
In recent years educators have been utilizing judgmental methods, such as the ones advocated by Ebel and Angoff, to set minimum competency standards on test items. This study was designed to investigate the reliability and validity of these two procedures in setting minimum levels of performance on 175 vocational evaluator competency statements.…
Descriptors: Comparative Analysis, Evaluation Methods, Evaluators, Minimum Competencies
Falbo, Toni; Belk, Sharyn S. – 1983
A seven item Likert-type scale was developed to measure self-righteousness, defined as the conviction that one's beliefs and actions are correct, especially in contrast to the beliefs and actions of others. The Self Righteousness Questionnaire (SRQ) measures three components of self-righteousness: belittlement, acceptance, and uncertainty. The…
Descriptors: Adults, Beliefs, Dogmatism, Opinions
Tsui, Anne S. – 1983
Quality of performance data yielded by subjective judgment is of major concern to researchers in performance appraisal. However, some confusion exists in the analysis of quality on ratings obtained from different rating scale formats and from different raters. To clarify this confusion, a study was conducted to assess the quality of judgmental…
Descriptors: Administrator Evaluation, Administrators, Error of Measurement, Evaluation Methods
Miller, Lucy Jane; Linder, Toni W. – 1982
The Miller Assessment for Preschoolers (MAP) is a new developmental screening tool for children aged 2 years 9 months to 5 years 8 months. The instrument, which has been standardized on 1,200 subjects representing nine geographic regions, identifies children who are functioning below the developmental level of their peers. The sampling method was…
Descriptors: Predictive Validity, Preschool Children, Preschool Education, Scoring
Doolittle, Allen E. – 1983
The stability of selected indices for detecting differential item performance (item bias), from one randomly equivalent sample to another, is addressed. Some recent research has criticized these indices as too unreliable for utility in measuring bias in achievement test items. Using data from a national testing of the ACT Assessment, however, this…
Descriptors: Black Students, Item Analysis, Racial Factors, Reliability
Edinger, Jack D.; Vosk, Barbara N. – 1983
Of the many short forms of the Minnesota Multiphasic Personality Inventory (MMPI) that have been developed, the MMPI-168 is among the most promising. To determine whether clinical judgments based on the MMPI-168 are comparable to judgments based on the standard MMPI, 30 clinical psychologists participated in a randomized block, repeated treatment…
Descriptors: Comparative Testing, Diagnostic Tests, Interrater Reliability, Personality Measures


