Publication Date
| In 2026 | 0 |
| Since 2025 | 49 |
| Since 2022 (last 5 years) | 211 |
| Since 2017 (last 10 years) | 492 |
| Since 2007 (last 20 years) | 984 |
Descriptor
| Test Validity | 3908 |
| Test Reliability | 1517 |
| Testing | 1090 |
| Test Construction | 1014 |
| Testing Problems | 1008 |
| Computer Assisted Testing | 616 |
| Elementary Secondary Education | 553 |
| Foreign Countries | 494 |
| Higher Education | 490 |
| Standardized Tests | 488 |
| Test Interpretation | 433 |
| More ▼ | |
Source
Author
| Ebel, Robert L. | 16 |
| Hambleton, Ronald K. | 13 |
| Green, Donald Ross | 10 |
| Popham, W. James | 10 |
| Linn, Robert L. | 9 |
| Haney, Walt | 8 |
| Koretz, Daniel | 8 |
| Sireci, Stephen G. | 8 |
| Thompson, Bruce | 8 |
| Tindal, Gerald | 8 |
| Hilliard, Asa G., III | 7 |
| More ▼ | |
Publication Type
Education Level
Audience
| Practitioners | 137 |
| Researchers | 134 |
| Teachers | 51 |
| Administrators | 34 |
| Policymakers | 18 |
| Counselors | 11 |
| Students | 8 |
| Parents | 5 |
| Support Staff | 4 |
| Community | 2 |
Location
| Canada | 57 |
| Australia | 40 |
| California | 40 |
| China | 34 |
| United Kingdom (England) | 31 |
| United Kingdom | 29 |
| New York | 28 |
| United States | 26 |
| Florida | 22 |
| Germany | 21 |
| Turkey | 20 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Peer reviewedCizek, Gregory J. – Educational Measurement: Issues and Practice, 1988
Sources of current misuse of standardized tests in assessing the quality of home-based educational programs are identified. Development of new instruments and cooperation of concerned groups are suggested as a means of increasing educational alternatives, excellence, and accountability. (Author/TJH)
Descriptors: Educational Legislation, Educational Quality, Elementary Secondary Education, Home Schooling
Peer reviewedPhelps, LeAdelle; And Others – Psychology in the Schools, 1988
Compared Stanford-Binet (Fourth Edition) and the Wechsler Intelligence Scale for Children-Revised as instruments for assessing the intellectual strengths and weaknesses of students (N=35) classified as learning disabled in elementary and secondary grades. Results suggest the tests will yield similar intelligence quotients for the learning disabled…
Descriptors: Comparative Testing, Elementary School Students, Elementary Secondary Education, Intelligence Quotient
Bracey, Gerald W. – High School Magazine, 1993
Describes four criteria that can be used to evaluate methods of assessment: (1) "What are the consequences of using the test?" (2) "Is this assessment fair?" (3) "Do the skills and knowledge of this assessment transfer or generalize?" and (4) "Does this assessment cover cognitively complex task?" (KDP)
Descriptors: Alternative Assessment, Evaluation Methods, High Schools, Performance Based Assessment
Peer reviewedPeirce, Bonny Norton; Stein, Pippa – Harvard Educational Review, 1995
A pilot of a reading test to be used in college entrance examinations for black South Africans showed how students' interpretation of the test differed as the social context was altered. The multiple meanings produced by shifting power relations call into question the test's validity. (SK)
Descriptors: Black Students, College Entrance Examinations, Cultural Context, Equal Education
Peer reviewedBurrell, Brenda; And Others – Educational and Psychological Measurement, 1992
The reliability and validity of Child Abuse Potential Inventory (CAP) scores were investigated for 53 mothers of young children with disabilities and 60 mothers of young children without handicaps. Results generally support the integrity of CAP scores, although the random response subscale's reliability could be improved by omitting some items.…
Descriptors: Child Abuse, Comparative Testing, Disabilities, Measures (Individuals)
Peer reviewedSchlene, Vickie J. – Social Education, 1992
Presents citations in the ERIC database on testing and assessment in the social studies. Includes items concerning the nature of testing, tests as guides for curriculum, the impact of testing, and the validity of performance grades. Suggests works on context effect, history instruction, and what students need to know. (DK)
Descriptors: Context Effect, Curriculum Development, Educational Testing, Elementary Secondary Education
Peer reviewedPowers, Donald E.; Schedl, Mary A.; Leung, Susan Wilson; Butler, Frances A. – Language Testing, 1999
A communicative-competence orientation was undertaken to study the validity of test-score inferences derived from the revised Test of Spoken English (TSE). To implement the approach, a sample of undergraduate students, primarily native-English speakers, provided reactions to the test responses of a sample of TSE examinees. (Author/VWL)
Descriptors: College Students, Communicative Competence (Languages), English (Second Language), Inferences
Peer reviewedSwain, Merrill – Language Testing, 2001
Examines one aspect of the many interfaces between second language (L2) learning and L2 testing. The aspect is the oral interaction--the dialogue--that occurs within small groups. Discusses from within a sociocultural theory of mind, that in a group, performance is jointly constructed and distributed across the participants. (Author/VWL)
Descriptors: Dialogs (Language), Inferences, Interaction, Language Tests
Rotberg, Iris C. – School Administrator, 1996
Because educators have unrealistic expectations about tests, they use them inappropriately and draw inaccurate conclusions from results. This article debunks five myths about test-score comparisons: valid measurement of school quality; declining international competitiveness; "fixing" schools with more tests; development of new, improved…
Descriptors: Comparative Education, Competition, Elementary Secondary Education, Expenditure per Student
Peer reviewedLloyd, D.; And Others – Assessment & Evaluation in Higher Education, 1996
In an engineering technology course at Coventry University (England), the utility of computer-assisted tests was compared with that of traditional paper-based tests. It was found that the computer-based technique was acceptable to students, produced valid results, and demonstrated potential for saving staff time. (Author/MSE)
Descriptors: Comparative Analysis, Computer Assisted Testing, Efficiency, Engineering Education
Peer reviewedLassiter, Kerry S. – Psychology in the Schools, 1995
To test the validity of brief measures of intelligence and explore how well these instruments relate to academic performance, the WPPSI-R, the Kaufman Brief Intelligence Scale, Draw-A-Person: Quantitative Scoring System, and the K-ABC Achievement Scale were administered to 50 kindergarten and first-grade children. Results indicated all measures…
Descriptors: Academic Achievement, Cognitive Ability, Correlation, Grade 1
Domenech, Daniel A. – School Administrator, 2000
The question of validity, or how high-stakes tests are being used and interpreted, threatens to undermine the entire standards movement. Joint standards developed by three professional associations say decisions affecting students' life chances should not be based on test scores alone. Objectivity and teaching to tests are real concerns. (MLH)
Descriptors: Academic Standards, Data Interpretation, Elementary Secondary Education, High Stakes Tests
Peer reviewedBachman, Lyle F. – Language Testing, 2000
Reviews developments in language testing research and practice over the last 20 years, and suggests future directions in the areas of professionalizing the field and validation research. Argues that concerns for ethical conduct must be grounded in valid test use, so that professionalization and validation research are inseparable. (Author/VWL)
Descriptors: Ethics, Language Research, Language Tests, Second Language Instruction
Peer reviewedKoretz, Daniel; Stecher, Brian; Klein, Stephen; McCaffrey, Daniel – Educational Measurement: Issues and Practice, 1994
Reports on an ongoing evaluation of the Vermont portfolio assessment program. Indicates that the positive news about the instructional effects of the assessment program are in contrast with the empirical findings about the quality of the data the program has yielded. (SLD)
Descriptors: Accountability, Elementary Secondary Education, Performance Based Assessment, Portfolio Assessment
Peer reviewedSmith, Tina T.; Lee, Evan; McDade, Hiram L. – Communication Disorders Quarterly, 2001
This study investigated the dialectal sensitivity of the T-unit as a nonbiased alternative for assessing the oral grammatical skills of school-age, nonstandard English speakers. Analysis of language samples from 28 9-year-old children (half African-American) revealed no significant differences between groups, suggesting that the T-unit may be a…
Descriptors: Black Dialects, Black Students, Culture Fair Tests, Elementary Education


