Publication Date
| In 2026 | 1 |
| Since 2025 | 599 |
| Since 2022 (last 5 years) | 2536 |
| Since 2017 (last 10 years) | 5571 |
| Since 2007 (last 20 years) | 9167 |
Descriptor
| Test Validity | 21743 |
| Test Reliability | 9997 |
| Test Construction | 5880 |
| Foreign Countries | 4941 |
| Psychometrics | 2956 |
| Factor Analysis | 2938 |
| Measures (Individuals) | 2370 |
| Higher Education | 2248 |
| Evaluation Methods | 2084 |
| College Students | 1810 |
| Correlation | 1722 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 728 |
| Practitioners | 429 |
| Teachers | 142 |
| Administrators | 96 |
| Policymakers | 57 |
| Counselors | 36 |
| Students | 20 |
| Parents | 13 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 2 |
| More ▼ | |
Location
| Turkey | 805 |
| Australia | 347 |
| Canada | 324 |
| China | 300 |
| United States | 188 |
| Indonesia | 170 |
| Spain | 168 |
| United Kingdom | 160 |
| Netherlands | 158 |
| California | 155 |
| Germany | 153 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 1 |
| Meets WWC Standards with or without Reservations | 3 |
| Does not meet standards | 1 |
Peer reviewedEinfeld, Stewart L.; Tonge, Bruce J. – Journal of Autism and Developmental Disorders, 1995
This article describes the development and validation of the Developmental Behavior Checklist for children with emotional and behavior problems along with mental retardation. The article discusses generating and refining the checklist items, results of a principal components analysis, establishing reliability and construct and criterion validity,…
Descriptors: Behavior Development, Behavior Disorders, Behavior Rating Scales, Check Lists
Peer reviewedChambers, Francine; Richards, Brian – Language Learning Journal, 1995
Discusses the use of "free conversation" in oral examinations. The use of "free conversation" to describe structured interviews where the teacher does most of the questioning and the candidate supplies most of the information is inaccurate unless the format of tasks can genuinely allow the exchange of previously unknown information. (CK)
Descriptors: Comparative Analysis, Course Descriptions, French, Interviews
Peer reviewedRusson, Craig; Koehly, Laura M. – Evaluation and Program Planning, 1995
A scale was developed for measuring the persuasive impact of qualitative and quantitative evaluation reports on decision makers. Using two exploratory (n=192 graduate and undergraduate students) and two confirmatory (n=200 administrators) samples, researchers developed a 28-item Likert-type scale that demonstrated high reliability and validity.…
Descriptors: Administrators, Attention, College Students, Comprehension
Peer reviewedWatkins, David; Thomas, Babu – Assessment and Evaluation in Higher Education, 1991
This study applied 2 U.S. instruments to assess 11 Indian graduate students' evaluations of teaching effectiveness. Results indicated high internal consistency, and the measures' items were seen as appropriate. Analysis suggested more overlap between teaching skill and enthusiasm than evident in Western studies. (Author/DB)
Descriptors: Comparative Education, Cultural Differences, Evaluation Methods, Foreign Countries
Peer reviewedWashington, Julie A.; Craig, Holly K. – Language, Speech, and Hearing Services in Schools, 1992
This study of 105 low-income, urban, African-American preschool and kindergarten children found that the performance of most of the children on the Peabody Picture Vocabulary Test-Revised was more than one standard deviation below the mean. Findings indicated that the test was not appropriate for use with this population. (Author/JDD)
Descriptors: Black Students, Diagnostic Tests, Kindergarten Children, Language Handicaps
Peer reviewedKane, Michael T. – Evaluation and the Health Professions, 1992
A proposed model for the validity of measures of professional competence treats validation as the evaluation of inferences drawn from test scores, focusing on evaluation, generalization, and extrapolation. The model is used to indicate strengths and weaknesses of assessments of professional competence: observations of performance, simulations, and…
Descriptors: Competence, Evaluation Methods, Generalization, Inferences
Peer reviewedBaird, William E.; Silvern, Steven B. – Journal of Research on Computing in Education, 1992
Describes a study of college students that investigated the interaction between instructional mode and testing mode. Computer learning and testing versus paper-and-pencil methods are compared, and treatments for the experimental and control groups are described. Areas for further research are suggested. (20 references) (LRW)
Descriptors: Analysis of Variance, Comparative Analysis, Computer Assisted Instruction, Computer Assisted Testing
Peer reviewedFrisbie, David A. – Educational Measurement: Issues and Practice, 1992
Literature related to the multiple true-false (MTF) item format is reviewed. Each answer cluster of a MTF item may have several true items and the correctness of each is judged independently. MTF tests appear efficient and reliable, although they are a bit harder than multiple choice items for examinees. (SLD)
Descriptors: Achievement Tests, Difficulty Level, Literature Reviews, Multiple Choice Tests
Melnyk, L.; Das, J. P. – American Journal on Mental Retardation, 1992
Twenty-six adolescents with educable mental retardation were identified as good or poor attenders based on teachers' ratings on an attention scale and were administered an auditory vigilance test and Posner's physical and name identity task. Although the vigilance task did not discriminate between groups, the more demanding Posner's task did.…
Descriptors: Attention Control, Attention Deficit Disorders, Attention Span, Behavior Rating Scales
Hoover, John H.; And Others – Education and Training in Mental Retardation, 1992
The development of a structured interview designed to assess leisure satisfaction in persons with mental retardation is described along with initial reliability, validity, and leisure satisfaction findings with 40 individuals with developmental disabilities. Also considered are the rationale for measuring leisure satisfaction based on quality of…
Descriptors: Adolescents, Adults, Interviews, Leisure Time
Peer reviewedMcGrew, Kevin S.; And Others – Exceptional Children, 1992
This study found significant relationships between measures of adaptive/maladaptive behavior and community adjustment in 239 adults with mild to severe mental retardation. Results provide evidence for the criterion-related validity of measures of adaptive/maladaptive behavior, and suggest the importance of such skills in community adaptation and…
Descriptors: Adaptive Behavior (of Disabled), Adjustment (to Environment), Adults, Behavior Problems
Peer reviewedCarver, Ronald P. – Educational and Psychological Measurement, 1992
Reliability and validity of a new measure of cognitive speed, the Speed of Thinking Test (SST), were investigated with 129 college students, who also completed a vocabulary test, a test of reading speed, and a test of reading comprehension. The SST appears to be a reliable and valid measure. (SLD)
Descriptors: Cognitive Ability, Cognitive Tests, College Students, Comparative Testing
Peer reviewedIrvin, Larry K.; Walker, Hill M. – Exceptional Children, 1994
This article reviews the content and procedural requirements of social competence assessment for children with disabilities and presents information on multiperspective prototype assessments using a videodisc and a microcomputer with a "touch screen." Preliminary psychometric data on sensitivity, reliability, and construct validity are…
Descriptors: Computer Assisted Testing, Disabilities, Educational Technology, Elementary Secondary Education
Peer reviewedAlexander, Cheryl S.; And Others – Journal of Youth and Adolescence, 1990
The development and preliminary testing of a six-item scale to assess risk taking among young adolescents are described. Test construction was based on information provided by eighth graders. The measure, used in a longitudinal study of 758 eighth through tenth graders from 3 rural counties in Maryland, showed good reliability. (SLD)
Descriptors: Adolescents, Attitude Measures, Grade 8, Longitudinal Studies
Peer reviewedDonleavy, G. D.; Lim, Amanda – Assessment and Evaluation in Higher Education, 1990
The study assessed the cross-cultural validity of the Thematic Apperception Test (TAT) as a gauge of motivation to succeed (as proposed in Atkinson's model of motivation) with 45 Hong Kong students. Although doubts about the TAT's validity were found to be unjustified, the question of whether the test captures the need to achieve remains.…
Descriptors: Achievement Need, Comparative Analysis, Cross Cultural Studies, Cultural Context


