Publication Date
| In 2026 | 2 |
| Since 2025 | 613 |
| Since 2022 (last 5 years) | 2550 |
| Since 2017 (last 10 years) | 5585 |
| Since 2007 (last 20 years) | 9181 |
Descriptor
| Test Validity | 21757 |
| Test Reliability | 10004 |
| Test Construction | 5884 |
| Foreign Countries | 4949 |
| Psychometrics | 2962 |
| Factor Analysis | 2941 |
| Measures (Individuals) | 2373 |
| Higher Education | 2249 |
| Evaluation Methods | 2084 |
| College Students | 1812 |
| Correlation | 1722 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 728 |
| Practitioners | 429 |
| Teachers | 142 |
| Administrators | 96 |
| Policymakers | 57 |
| Counselors | 36 |
| Students | 20 |
| Parents | 13 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 2 |
| More ▼ | |
Location
| Turkey | 806 |
| Australia | 347 |
| Canada | 324 |
| China | 300 |
| United States | 188 |
| Indonesia | 171 |
| Spain | 168 |
| United Kingdom | 160 |
| Netherlands | 158 |
| California | 155 |
| Germany | 153 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 1 |
| Meets WWC Standards with or without Reservations | 3 |
| Does not meet standards | 1 |
Abedi, Jamal – Center for Research on Evaluation Standards and Student Testing CRESST, 2004
Research reports major concerns over classification and measurement for students with limited English proficiency (LEP). A poor operational definition of the English language proficiency construct and validity concerns about existing language proficiency tests are among these issues. Decisions on including LEP students in large-scale assessments…
Descriptors: Federal Legislation, National Competency Tests, Language Proficiency, Limited English Speaking
Peer reviewedEbel, Robert L. – Educational Evaluation and Policy Analysis, 1980
Giving tests and assigning grades are stated to be important aspects of teachers' responsibility for facilitating student learning. Opposition to testing is discussed, objections are criticized, and beneficial consequences of evaluation are listed. It is maintained that tests do not cause cheating, or excessive discouragement, competition, or…
Descriptors: Academic Achievement, Achievement Tests, Affective Measures, Cheating
Peer reviewedRosenbach, John H.; Mowder, Barbara A. – Psychology in the Schools, 1981
Reviews some approaches to test bias and considers its fundamental causes. Suggests that because test validity is consistently high, the cultural bias of schooling is responsible. Proposes that because schooling reflects social values, resolution lies in social-political action, not psychological or psychometric advances. Discusses implications.…
Descriptors: Cultural Influences, Culture Fair Tests, Elementary Secondary Education, Intelligence Tests
Peer reviewedDeBlock, A.; And Others – Studies in Educational Evaluation, 1980
Dutch-speaking Belgian students scored higher than students from eight other non-English speaking countries on a reading comprehension test. There was a two-thirds overlap between the test and English curriculum. Teachers' expectations corresponded well with obtained scores. (CP)
Descriptors: Behavioral Objectives, Change Strategies, Cross Cultural Studies, Educational Policy
Peer reviewedBray, James H.; Howard, George S. – Journal of Educational Psychology, 1980
Training produced significant changes in the teaching behavior, self-ratings of teaching ability, and student ratings of instruction of graduate teaching assistants. Response-shift bias was noted in the self-reports and controlled through the collection of retrospective pretests. (Author/CP)
Descriptors: Higher Education, Inservice Teacher Education, Program Evaluation, Research Design
Peer reviewedRead, Robert R. – Educational and Psychological Measurement, 1979
Students' perception of overall teaching performance is strongly aligned with feelings about the course. A method is presented for separating the effects of the teacher, the course, and individual student groups. All three effects are highly significant; the effect of the course is by far the most prominent. (Author/JKS)
Descriptors: Analysis of Variance, Course Evaluation, Discriminant Analysis, Item Analysis
Peer reviewedBerry, Marianne; Cash, Scottye J.; Mathiesen, Sally G. – Child Welfare, 2003
Examined the validity and reliability of the Strengths and Stressors Tracking Device (SSTD), a rapid assessment measure of family well-being to help guide case planning and evaluate treatment effectiveness. Found high internal consistency in all domains measured: environmental conditions, social support, caregiver skills, and child well-being.…
Descriptors: Child Abuse, Child Neglect, Child Welfare, Children
Peer reviewedMasataka, Nobuo – Early Education and Development, 2002
Evaluated social competence and problem behaviors of 200 Japanese preschoolers. Found anxiety-withdrawal, anger- aggression, and social competence factors as well as age and gender differences in emotional and behavioral problems and social competence. Found consistency with previous findings from U.S. and Canadian samples. (DLH)
Descriptors: Age Differences, Behavior Problems, Cross Cultural Studies, Cultural Images
Determining Exemptions from Foreign Language Requirements: Use of the Modern Language Aptitude Test.
Peer reviewedGoodman, Joan F.; And Others – Contemporary Educational Psychology, 1990
The short form of the Modern Language Aptitude Test (MLAT) was administered to 587 university students, and scores were validated against final first and second semester grades in introductory foreign language classes for 529 and 365 students, respectively. The MLAT failed to distinguish good from poor students. (TJH)
Descriptors: Analysis of Variance, Aptitude Tests, College Entrance Examinations, Equivalency Tests
Peer reviewedStiggins, Richard J.; And Others – Journal of Educational Measurement, 1989
Classroom assessment procedures of 36 teachers in grades 2 to 12 were studied to determine the extent to which they measure students' higher order thinking skills in mathematics, science, social studies, and language arts. A striking finding was the absence of evaluation of comparative and evaluative thinking. (SLD)
Descriptors: Classroom Techniques, Cognitive Processes, Educational Assessment, Elementary Secondary Education
Peer reviewedCrocker, Linda; And Others – Journal of Educational Measurement, 1988
Using generalizability theory as a framework, the problem of assessing the content validity of standardized achievement tests is considered. Four designs to assess test-item fit to a curriculum are described, and procedures for determining the optimal number of raters and schools in a content-validation decision-making study are considered. (TJH)
Descriptors: Achievement Tests, Content Validity, Decision Making, Elementary Education
Peer reviewedKenkel, James M.; Tucker, Richard W. – World Englishes, 1989
Outlines an application of theoretical understandings of institutionalized or nativized varieties of English to the practical concern of English-as-a-Second-Language programs, including testing, placement, and pedagogy. (29 references) (Author/OD)
Descriptors: English (Second Language), Error Analysis (Language), Essays, Foreign Countries
Peer reviewedTenhaken, Ursula; Scheibner-Herzig, Gudrun – Journal of Experimental Education, 1988
Achievement in English was measured for 64 German eighth graders by a cloze test, which was evaluated by three methods, yielding similar results. Twenty-one native English speakers comprised a criterion group. The cloze test may be appropriate for evaluating oral communication when an oral interview is not feasible. (SLD)
Descriptors: Achievement Tests, Cloze Procedure, Communicative Competence (Languages), English (Second Language)
Peer reviewedSelby, Edwin C.; And Others – Journal of Creative Behavior, 1993
Innovative or adaptive behavior of 86 eighth-grade students was rated by themselves, parents, and teachers using the Kirton Adaption-Innovation Inventory. The inventory was found to be reliable, stable, and valid. No significant differences between male and female students' scores were exhibited. Correlation with scores on the Comprehensive Test…
Descriptors: Adjustment (to Environment), Adoption (Ideas), Basic Skills, Correlation
Peer reviewedAckerman, Terry A. – Journal of Educational Measurement, 1992
The difference between item bias and item impact and the way they relate to item validity are discussed from a multidimensional item response theory perspective. The Mantel-Haenszel procedure and the Simultaneous Item Bias strategy are used in a Monte Carlo study to illustrate detection of item bias. (SLD)
Descriptors: Causal Models, Computer Simulation, Construct Validity, Equations (Mathematics)


