Publication Date
| In 2026 | 12 |
| Since 2025 | 958 |
| Since 2022 (last 5 years) | 4567 |
| Since 2017 (last 10 years) | 10500 |
| Since 2007 (last 20 years) | 21963 |
Descriptor
| Test Validity | 21786 |
| Validity | 13791 |
| Test Reliability | 10864 |
| Foreign Countries | 9887 |
| Test Construction | 6897 |
| Factor Analysis | 5761 |
| Measures (Individuals) | 5633 |
| Predictive Validity | 5022 |
| Psychometrics | 4820 |
| Reliability | 4635 |
| Correlation | 4376 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 1169 |
| Practitioners | 629 |
| Teachers | 336 |
| Administrators | 165 |
| Policymakers | 110 |
| Counselors | 63 |
| Students | 63 |
| Parents | 15 |
| Community | 12 |
| Media Staff | 10 |
| Support Staff | 8 |
| More ▼ | |
Location
| Turkey | 1397 |
| Australia | 705 |
| Canada | 626 |
| China | 528 |
| United States | 439 |
| Indonesia | 389 |
| United Kingdom | 363 |
| Germany | 340 |
| California | 338 |
| Netherlands | 336 |
| Spain | 311 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 7 |
| Meets WWC Standards with or without Reservations | 12 |
| Does not meet standards | 10 |
Peer reviewedVan Bourgondien, Mary E.; Reichle, Nancy C.; Campbell, Duncan G.; Mesibov, Gary B. – Research in Developmental Disabilities, 1998
This study assessed the psychometric properties of the Environmental Rating Scale, a measure specifically designed to assess residential treatment programs for individuals with autism. The measure's reliability was demonstrated by assessments of the internal consistency, stability, and interrater reliability. Preliminary analysis of validity…
Descriptors: Adults, Autism, Evaluation Methods, Interrater Reliability
Peer reviewedWinn, Bradley A.; Cameron, Kim S. – Research in Higher Education, 1998
The Malcolm Baldridge National Quality Award (MBNQA) framework for defining organizational quality is widely accepted in for-profit organizations. A study examined the validity of the proposed relationships among MBNQA dimensions using data from higher education. The empirical results help identify a modified model that has implications for…
Descriptors: Awards, Educational Quality, Evaluation Criteria, Evaluation Methods
Peer reviewedSisto, Fermino Fernandes – Child Study Journal, 2000
Examined validity of use of human figure drawing to evaluate cognitive development status using Piagetian tasks with 7- to 11-year-olds. Found that scores for children's drawings of a man and a woman correlated significantly with mental imaging, conservation of mass, and conservation of length, suggesting the possibility of finding patterns to…
Descriptors: Children, Cognitive Development, Cognitive Measurement, Cognitive Tests
Peer reviewedWoo, Tae O.; Frank, Nancy – Journal of Social Psychology, 2000
Investigates the role of academic self-esteem and academic performance in 208 U. S. college students' perceptions of the validity of their grades. Data collected through a questionnaire. Finds that regardless of self-esteem, the students with higher GPAs saw grades as more valid. This finding is in accordance with weak forms of self-enhancement…
Descriptors: Academic Ability, Academic Achievement, College Students, Grades (Scholastic)
Peer reviewedWang, Tianyou; Kolen, Michael J. – Journal of Educational Measurement, 2001
Reviews research literature on comparability issues in computerized adaptive testing (CAT) and synthesizes issues specific to comparability and test security. Develops a framework for evaluating comparability that contains three categories of criteria: (1) validity; (2) psychometric property/reliability; and (3) statistical assumption/test…
Descriptors: Adaptive Testing, Comparative Analysis, Computer Assisted Testing, Criteria
Peer reviewedSwain, Merrill – Language Testing, 2001
Examines one aspect of the many interfaces between second language (L2) learning and L2 testing. The aspect is the oral interaction--the dialogue--that occurs within small groups. Discusses from within a sociocultural theory of mind, that in a group, performance is jointly constructed and distributed across the participants. (Author/VWL)
Descriptors: Dialogs (Language), Inferences, Interaction, Language Tests
Peer reviewedRoyer, James M. – Journal of Adolescent & Adult Literacy, 2001
Describes a team-based approach for creating Sentence Verification Technique (SVT) tests, a development procedure that allows teachers and other school personnel to develop comprehension tests from curriculum materials in use in their schools. Finds that if tests are based on materials that are appropriate for the population to be tested, the…
Descriptors: Elementary Secondary Education, Evaluation Methods, Listening Comprehension Tests, Reading Tests
Peer reviewedTimmerman, Thomas A. – Teaching of Psychology, 2000
Provides an in-class activity that enables students in research methods to learn about survey design and multiple regression. Explains that the students developed questionnaires in order to generate five items, as set in a multiple regression equation, that would best predict a particular outcome. Discusses benefits and drawbacks. (CMK)
Descriptors: Comparative Analysis, Course Content, Educational Strategies, Higher Education
Peer reviewedPentony, Joseph F.; Swank, Paul; Pentony, Carole G. – Community College Journal of Research and Practice, 2001
Describes a study involving 1,343 community college students, which examined the relationships between the Cultural Literacy Test (CLT), developed by E.D. Hirsch, and several academic factors: grade point average and first-semester grades in remedial and regular English, history, and government courses. Emphasizes the clinical usefulness of the…
Descriptors: Community Colleges, Cultural Literacy, Evaluation Research, Factor Analysis
Rotberg, Iris C. – School Administrator, 1996
Because educators have unrealistic expectations about tests, they use them inappropriately and draw inaccurate conclusions from results. This article debunks five myths about test-score comparisons: valid measurement of school quality; declining international competitiveness; "fixing" schools with more tests; development of new, improved…
Descriptors: Comparative Education, Competition, Elementary Secondary Education, Expenditure per Student
Peer reviewedAyres, Debbie M.; And Others – Communication Reports, 1995
Investigates whether a videotape designed to reduce public speaking apprehension (PSA) could be used to help at-risk students cope with PSA. Finds that the videotape condition was associated with lower levels of trait communication apprehension, state communication apprehension, and negative thinking than the placebo and control conditions. (SR)
Descriptors: Communication Apprehension, Communication Research, Elementary Secondary Education, High Risk Students
Peer reviewedLloyd, D.; And Others – Assessment & Evaluation in Higher Education, 1996
In an engineering technology course at Coventry University (England), the utility of computer-assisted tests was compared with that of traditional paper-based tests. It was found that the computer-based technique was acceptable to students, produced valid results, and demonstrated potential for saving staff time. (Author/MSE)
Descriptors: Comparative Analysis, Computer Assisted Testing, Efficiency, Engineering Education
Peer reviewedLassiter, Kerry S. – Psychology in the Schools, 1995
To test the validity of brief measures of intelligence and explore how well these instruments relate to academic performance, the WPPSI-R, the Kaufman Brief Intelligence Scale, Draw-A-Person: Quantitative Scoring System, and the K-ABC Achievement Scale were administered to 50 kindergarten and first-grade children. Results indicated all measures…
Descriptors: Academic Achievement, Cognitive Ability, Correlation, Grade 1
Peer reviewedDodds, A. G.; And Others – Journal of Visual Impairment & Blindness, 1996
This study found that Nottingham Adjustment Scale items on acceptance of sight loss and attitudes toward blindness were free of response bias. Respondents (n=559) who were given only negative items disagreed significantly more with them than did those given mixed positive and negative statements. Respondents with poor emotional adjustment were…
Descriptors: Adaptive Behavior (of Disabled), Adults, Attitudes, Beliefs
Peer reviewedStoner, Gary; And Others – Journal of Applied Behavior Analysis, 1994
Two case studies examined the utility of curriculum-based measurement of math and reading for evaluating the effects of methylphenidate on the academic performance of two students diagnosed with attention deficit hyperactivity disorder. Results suggest that the curriculum-based measures were sensitive indicators of the students' responses to the…
Descriptors: Academic Achievement, Attention Deficit Disorders, Case Studies, Curriculum Based Assessment


