Publication Date
| In 2026 | 3 |
| Since 2025 | 666 |
| Since 2022 (last 5 years) | 3167 |
| Since 2017 (last 10 years) | 7408 |
| Since 2007 (last 20 years) | 15046 |
Descriptor
| Test Reliability | 15036 |
| Test Validity | 10272 |
| Reliability | 9759 |
| Foreign Countries | 7141 |
| Test Construction | 4823 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3525 |
| Interrater Reliability | 3124 |
| Correlation | 3039 |
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
Location
| Turkey | 1327 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 252 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
[Peer reviewed] Collins, Angelo – Journal of Personnel Evaluation in Education, 1991
The research of the Biology component of the Teacher Assessment Project (BioTAP) of Stanford (California) University is described. BioTAP uses portfolio development as an important aspect of teacher assessment. Advantages and drawbacks of teacher portfolios are discussed, including issues of validity and reliability. (SLD)
Descriptors: Assessment Centers (Personnel), Biology, Evaluation Methods, High Schools
[Peer reviewed] Alexander, Cheryl S.; And Others – Journal of Youth and Adolescence, 1990
The development and preliminary testing of a six-item scale to assess risk taking among young adolescents are described. Test construction was based on information provided by eighth graders. The measure, used in a longitudinal study of 758 eighth through tenth graders from 3 rural counties in Maryland, showed good reliability. (SLD)
Descriptors: Adolescents, Attitude Measures, Grade 8, Longitudinal Studies
[Peer reviewed] Jaeger, Richard M. – Educational Measurement: Issues and Practice, 1991
Issues concerning the selection of judges for standard setting are discussed. Determining the consistency of judges' recommendations, or their congruity with other expert recommendations, would help in selection. Enough judges must be chosen to allow estimation of recommendations by an entire population of judges. (SLD)
Descriptors: Cutting Scores, Evaluation Methods, Evaluators, Examiners
[Peer reviewed] Reid, Jerry B. – Educational Measurement: Issues and Practice, 1991
Training judges to generate item ratings in standard setting once the reference group has been defined is discussed. It is proposed that sensitivity to the factors that determine difficulty can be improved through training. Three criteria for determining when training is sufficient are offered. (SLD)
Descriptors: Computer Assisted Instruction, Difficulty Level, Evaluators, Interrater Reliability
[Peer reviewed] Harvill, Leo M. – Educational Measurement: Issues and Practice, 1991
This paper discusses standard error of measurement (SEM), the amount of variation or spread in the measurement errors for a test, and gives information needed to interpret test scores using SEMs. SEMs at various score levels should be used in calculating score bands rather than a single SEM value. (SLD)
Descriptors: Definitions, Equations (Mathematics), Error of Measurement, Estimation (Mathematics)
[Peer reviewed] Byrne, Brian; Fielding-Barnsley, Ruth – Journal of Educational Psychology, 1990
Results of 6 experiments with 109 Australian preschool children favor training in phoneme identity over segmentation as a component of initial reading instruction because it is easier to implement and its relation to alphabetic insight is stronger. Implications for the initial reading curriculum are discussed. (SLD)
Descriptors: Alphabets, Beginning Reading, Curriculum Development, Foreign Countries
[Peer reviewed] Greenan, James P.; Winters, Michael – Journal of Epsilon Pi Tau, 1991
A set of instruments designed to measure generalizable interpersonal relations skills was validated with students in Illinois area vocational centers. The instruments developed possess a relatively high degree of content and face validity and moderate to high internal consistency and test-retest reliability. (JOW)
Descriptors: Interpersonal Competence, Interpersonal Relationship, Measures (Individuals), Secondary Education
[Peer reviewed] Risucci, Donald A.; And Others – Evaluation and the Health Professions, 1992
The reliability and accuracy of evaluations of 126 surgical faculty made by 47 general surgery residents over 2 years were examined. The general accuracy and reliability over both years indicate that anonymous ratings of surgical faculty by groups of residents can be a valuable evaluation method. (SLD)
Descriptors: Correlation, Evaluation Methods, Graduate Medical Education, Graduate Medical Students
[Peer reviewed] Auster, Ethel; Choo, Chun Wei – Journal of the American Society for Information Science, 1993
A study investigating variables in Canadian Chief Executive Officers' use of information sources to scan external business environments is reported. Findings show that frequency of source use correlates with perceived uncertainty and that source quality is more important than source accessibility--a finding that contradicts past user studies. (28…
Descriptors: Access to Information, Administrators, Business, Correlation
[Peer reviewed] Neto, Felix – Journal of Youth and Adolescence, 1993
The applicability of the Satisfaction With Life Scale (SWLS), developed in the United States, to another culture was assessed by investigating reliability and validity of the SWLS with 99 boys and 118 girls from Portugal. The cross-national validity of the scale and its utility with different age groups are supported. (SLD)
Descriptors: Adolescents, Age Differences, Attitude Measures, Comparative Testing
[Peer reviewed] Schiel, Jeffrey L.; Shaw, Dale G. – Applied Measurement in Education, 1992
Changes in information retention resulting from changes in reliability and number of intervals in scale construction were studied to provide quantitative information to help in decisions about choosing intervals. Information retention reached a maximum when the number of intervals was about 8 or more and reliability was near 1.0. (SLD)
Descriptors: Decision Making, Knowledge Level, Mathematical Models, Monte Carlo Methods
Nicholson, Charles L. – Diagnostique, 1990
The Matrix Analogies Test measures nonverbal ability of handicapped and nonhandicapped children, ages 5-17, in a culture-fair fashion. It assesses pattern completion, reasoning by analogy, serial reasoning, and spatial visualization, with a short form available as a screening instrument. This paper describes the test's administration, format,…
Descriptors: Abstract Reasoning, Culture Fair Tests, Disabilities, Elementary Secondary Education
Buckhalt, Joseph A. – Diagnostique, 1990
The Wechsler Preschool and Primary Scale of Intelligence-Revised, intended for children ages 3-7, is used to diagnose exceptional intellectual ability in school settings. Its 12 subtests measure both Performance Intelligence Quotient and Verbal Intelligence Quotient. This paper describes the test's administration, summation of data,…
Descriptors: Ability Identification, Diagnostic Tests, Gifted, Handicap Identification
[Peer reviewed] van Aken, Marcel A. G.; van Lieshout, F. M. – International Journal of Behavioral Development, 1991
In a longitudinal study, California Child Q-Set descriptions of children aged 7-12 were given by teachers, mothers, peers, and the child. Consistency of descriptions was expected to have a causal relationship with two aspects of competence. A small reciprocal relationship between consistency of self- and child-descriptions and school achievement was found;…
Descriptors: Academic Achievement, Children, Competence, Elementary Education
[Peer reviewed] Elam, Carol L.; Andrykowski, Michael A. – Academic Medicine, 1991
Medical school admission interview ratings for four entering classes (n=356 students) were compared with preadmission academic variables (admission test scores, undergraduate grades), student characteristics (age, gender, residence), and interviewer characteristics (gender, professional background, admission committee membership). Recommendations…
Descriptors: Academic Achievement, Admission Criteria, College Admission, Higher Education