Publication Date
| In 2026 | 0 |
| Since 2025 | 575 |
| Since 2022 (last 5 years) | 2511 |
| Since 2017 (last 10 years) | 5546 |
| Since 2007 (last 20 years) | 9142 |
Descriptor
| Test Validity | 21718 |
| Test Reliability | 9977 |
| Test Construction | 5864 |
| Foreign Countries | 4924 |
| Psychometrics | 2948 |
| Factor Analysis | 2936 |
| Measures (Individuals) | 2366 |
| Higher Education | 2245 |
| Evaluation Methods | 2083 |
| College Students | 1808 |
| Correlation | 1718 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 728 |
| Practitioners | 429 |
| Teachers | 142 |
| Administrators | 96 |
| Policymakers | 57 |
| Counselors | 36 |
| Students | 20 |
| Parents | 13 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 2 |
| More ▼ | |
Location
| Turkey | 799 |
| Australia | 347 |
| Canada | 324 |
| China | 300 |
| United States | 188 |
| Indonesia | 168 |
| Spain | 168 |
| United Kingdom | 160 |
| Netherlands | 158 |
| California | 155 |
| Germany | 153 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 1 |
| Meets WWC Standards with or without Reservations | 3 |
| Does not meet standards | 1 |
Peer reviewedRead, Robert R. – Educational and Psychological Measurement, 1979
Students' perception of overall teaching performance is strongly aligned with feelings about the course. A method is presented for separating the effects of the teacher, the course, and individual student groups. All three effects are highly significant; the effect of the course is by far the most prominent. (Author/JKS)
Descriptors: Analysis of Variance, Course Evaluation, Discriminant Analysis, Item Analysis
Peer reviewedBerry, Marianne; Cash, Scottye J.; Mathiesen, Sally G. – Child Welfare, 2003
Examined the validity and reliability of the Strengths and Stressors Tracking Device (SSTD), a rapid assessment measure of family well-being to help guide case planning and evaluate treatment effectiveness. Found high internal consistency in all domains measured: environmental conditions, social support, caregiver skills, and child well-being.…
Descriptors: Child Abuse, Child Neglect, Child Welfare, Children
Peer reviewedMasataka, Nobuo – Early Education and Development, 2002
Evaluated social competence and problem behaviors of 200 Japanese preschoolers. Found anxiety-withdrawal, anger- aggression, and social competence factors as well as age and gender differences in emotional and behavioral problems and social competence. Found consistency with previous findings from U.S. and Canadian samples. (DLH)
Descriptors: Age Differences, Behavior Problems, Cross Cultural Studies, Cultural Images
Determining Exemptions from Foreign Language Requirements: Use of the Modern Language Aptitude Test.
Peer reviewedGoodman, Joan F.; And Others – Contemporary Educational Psychology, 1990
The short form of the Modern Language Aptitude Test (MLAT) was administered to 587 university students, and scores were validated against final first and second semester grades in introductory foreign language classes for 529 and 365 students, respectively. The MLAT failed to distinguish good from poor students. (TJH)
Descriptors: Analysis of Variance, Aptitude Tests, College Entrance Examinations, Equivalency Tests
Peer reviewedStiggins, Richard J.; And Others – Journal of Educational Measurement, 1989
Classroom assessment procedures of 36 teachers in grades 2 to 12 were studied to determine the extent to which they measure students' higher order thinking skills in mathematics, science, social studies, and language arts. A striking finding was the absence of evaluation of comparative and evaluative thinking. (SLD)
Descriptors: Classroom Techniques, Cognitive Processes, Educational Assessment, Elementary Secondary Education
Peer reviewedCrocker, Linda; And Others – Journal of Educational Measurement, 1988
Using generalizability theory as a framework, the problem of assessing the content validity of standardized achievement tests is considered. Four designs to assess test-item fit to a curriculum are described, and procedures for determining the optimal number of raters and schools in a content-validation decision-making study are considered. (TJH)
Descriptors: Achievement Tests, Content Validity, Decision Making, Elementary Education
Peer reviewedKenkel, James M.; Tucker, Richard W. – World Englishes, 1989
Outlines an application of theoretical understandings of institutionalized or nativized varieties of English to the practical concern of English-as-a-Second-Language programs, including testing, placement, and pedagogy. (29 references) (Author/OD)
Descriptors: English (Second Language), Error Analysis (Language), Essays, Foreign Countries
Peer reviewedTenhaken, Ursula; Scheibner-Herzig, Gudrun – Journal of Experimental Education, 1988
Achievement in English was measured for 64 German eighth graders by a cloze test, which was evaluated by three methods, yielding similar results. Twenty-one native English speakers comprised a criterion group. The cloze test may be appropriate for evaluating oral communication when an oral interview is not feasible. (SLD)
Descriptors: Achievement Tests, Cloze Procedure, Communicative Competence (Languages), English (Second Language)
Peer reviewedSelby, Edwin C.; And Others – Journal of Creative Behavior, 1993
Innovative or adaptive behavior of 86 eighth-grade students was rated by themselves, parents, and teachers using the Kirton Adaption-Innovation Inventory. The inventory was found to be reliable, stable, and valid. No significant differences between male and female students' scores were exhibited. Correlation with scores on the Comprehensive Test…
Descriptors: Adjustment (to Environment), Adoption (Ideas), Basic Skills, Correlation
Peer reviewedAckerman, Terry A. – Journal of Educational Measurement, 1992
The difference between item bias and item impact and the way they relate to item validity are discussed from a multidimensional item response theory perspective. The Mantel-Haenszel procedure and the Simultaneous Item Bias strategy are used in a Monte Carlo study to illustrate detection of item bias. (SLD)
Descriptors: Causal Models, Computer Simulation, Construct Validity, Equations (Mathematics)
Widaman, Keith F.; And Others – American Journal on Mental Retardation, 1993
Measures of 4traits (cognitive competence, social competence, social maladaption, and personal maladaption) were obtained on 157 persons with mental retardation, using 3 measurements: standardized assessment instrument, day shift staff ratings, and evening shift staff ratings. The multitrait-multimethod matrix procedure demonstrated strong…
Descriptors: Adaptive Behavior (of Disabled), Behavior Rating Scales, Cognitive Ability, Construct Validity
Peer reviewedMoore, Don; And Others – Educational and Psychological Measurement, 1991
Correlations of National Teacher Examination (NTE) Core Battery scores and college grade point average (GPA) with a measure of teaching effectiveness for 493 first-year teachers indicate that the correlation is higher for GPA than for the Core Battery. NTE core scores do not predict effectiveness better than GPA alone. (SLD)
Descriptors: Beginning Teachers, College Graduates, Correlation, Elementary School Teachers
Peer reviewedMcLeod, P. J. – Evaluation and the Health Professions, 1991
Faculty opinions of an evaluation program for medical school clinical tutors were obtained through a survey of 24 undergraduate clinical tutors. Although students had been using the evaluation instrument to rate teachers for five years, faculty expressed many reservations about its reliability and validity. (SLD)
Descriptors: Clinical Teaching (Health Professions), Evaluation Methods, Higher Education, Medical Education
Peer reviewedNisbet, Steven – Mathematics Education Research Journal, 1991
Questionnaire responses of 155 student teachers were analyzed to develop meaningful attitude scales and to refine the instrument. Attitude scales identified in the analysis and built into the final form of the questionnaire were (1) anxiety; (2) confidence and enjoyment, (3) desire for recognition; and (4) pressure to conform. Includes a copy of…
Descriptors: Affective Measures, Attitude Measures, Education Majors, Elementary School Teachers
Peer reviewedBers, Trudy H.; Smith, Kerry E. – Community College Review, 1990
Describes a study of the validity and reliability of a writing skills assessment test taken by 4,284 2-year college students in 1986-87. Assesses interrater reliability, influences of nonperformance factors (e.g., gender, native language, and form of test), predictive validity of test for future performance, and implications of findings. (DMM)
Descriptors: Basic Writing, Community Colleges, High Risk Students, Predictive Validity


