Publication Date
| In 2026 | 3 |
| Since 2025 | 675 |
| Since 2022 (last 5 years) | 3176 |
| Since 2017 (last 10 years) | 7417 |
| Since 2007 (last 20 years) | 15055 |
Descriptor
| Test Reliability | 15043 |
| Test Validity | 10279 |
| Reliability | 9761 |
| Foreign Countries | 7144 |
| Test Construction | 4825 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3526 |
| Interrater Reliability | 3124 |
| Correlation | 3040 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1328 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 253 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 217 |
| California | 215 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Peer reviewedBlachman, Benita A. – Journal of Reading, 1983
Questions the credibility of the Diagnostic Spelling Potential Test. (FL)
Descriptors: Elementary Secondary Education, Norm Referenced Tests, Reading Instruction, Spelling Instruction
McLeod, John – Australian Journal of Reading, 1983
Illustrates ways to link the motivating power and efficiency of the computer to the effectiveness and necessity of traditional teaching practice and criterion referenced testing in order to assess and teach spelling. (FL)
Descriptors: Computer Assisted Testing, Criterion Referenced Tests, Microcomputers, Motivation Techniques
Peer reviewedChase, Clinton I. – Elementary School Journal, 1976
This article lists psychometric problems encountered when measuring change in achievement by improved test scores as is done in performance contracting. Examined are: regression toward the mean, reliability of gain scores, the nature of grade norms, and test content as representation of educational objectives. (SB)
Descriptors: Academic Achievement, Achievement Tests, Elementary Secondary Education, Evaluation
McKenna, Bernard H. – NJEA Review, 1976
Article presented a true story of how two cities ran testing programs and the lessons that can be learned from their failures. (Editor/RK)
Descriptors: Learning Processes, Scores, Standardized Tests, Student Attitudes
Johnstone, Christopher; Altman, Jason; Thurlow, Martha – National Center on Educational Outcomes, University of Minnesota, 2006
Universal design for assessments is an approach to educational assessment based on principles of accessibility for a wide variety of end users. Elements of universal design include inclusive test population; precisely defined constructs; accessible, non-biased items; tests that are amenable to accommodations; simple, clear and intuitive…
Descriptors: Educational Assessment, Test Construction, Testing Accommodations, Design Requirements
Simpson, Mary Ann; Gong, Brian; Marion, Scott – National Center on Educational Outcomes, University of Minnesota, 2006
This study addresses three questions: First, considering the full group of students and the special education subgroup, what is the likely effect of minimum cell size and confidence interval size on school-level Adequate Yearly Progress (AYP) determinations? Second, what effects do the changing minimum cell sizes have on inclusion of special…
Descriptors: Intervals, Educational Improvement, Reading Achievement, Achievement Tests
Borko, Hilda; Stecher, Brian M. – National Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2006
This report presents findings from two investigations of the use of classroom artifacts to measure the presence of reform-oriented teaching practices in middle-school science classes. It complements previous research on the use of artifacts to describe reform-oriented teaching practices in mathematics. In both studies, ratings based on collections…
Descriptors: Teaching Methods, Science Instruction, Investigations, Mathematics
Lomawaima, K. Tsianina; McCarty, Teresa L. – 2002
The constructs used to evaluate research quality--valid, objective, reliable, generalizable, randomized, accurate, authentic--are not value-free. They all require human judgment, which is affected inevitably by cultural norms and values. In the case of research involving American Indians and Alaska Natives, assessments of research quality must be…
Descriptors: Action Research, American Indian Education, Educational Research, Indigenous Knowledge
Howley, Craig B. – Online Submission, 2006
This essay explains the relevance of critique in rural education to novels about rural places. The most important quoted passage in the essay is from the noted physicist Richard Feynman: "Science is the belief in the ignorance of experts." Novelist-physicist C. P. Snow, historian Henry Adams, and poet and student-of-mathematics Kelly Cherry also…
Descriptors: Natural Sciences, Novels, Rural Education, Social Sciences
Sun, Anji; Valiga, Michael J. – 1997
In this study, the reliability of the American College Testing (ACT) Program's "Survey of Academic Advising" (SAA) was examined using both univariate and multivariate generalizability theory approaches. The primary purpose of the study was to compare the results of three generalizability theory models (a random univariate model, a mixed…
Descriptors: Academic Advising, Colleges, Faculty Advisers, Generalizability Theory
Jones, Lex – 1996
In England and Wales, a National Curriculum initiated in 1988 was designed to ensure that all schools provided a curriculum which represented different areas of knowledge. The past 20 years has increasingly seen more emphasis on the link between the financial amounts spent on education and subsequent return on this money. The impact of the…
Descriptors: British National Curriculum, Foreign Countries, Literacy, Performance Based Assessment
Lee, Yong-Won – 2001
An essay test is now an integral part of the computer based Test of English as a Foreign Language (TOEFL-CBT). This paper provides a brief overview of the current TOEFL-CBT essay test, describes the operational procedures for essay scoring, including the Online Scoring Network (OSN) of the Educational Testing Service (ETS), and discusses major…
Descriptors: Computer Assisted Testing, English (Second Language), Essay Tests, Interrater Reliability
McCarthy, Christopher J.; Lambert, Richard G.; Curlette, William L.; Seraphine, Anne E.; Beard, Michelle – 2001
This paper provides evidence for the reliability and validity of the Preventive Coping Resources Inventory (PCRI) instrument designed to measure coping resources useful for prevention based on previous research. It specifically looks at the construct validity of the PCRI; the convergent and discriminate validity of the PCRI with related…
Descriptors: College Students, Coping, Emotional Response, Higher Education
Mushi, Selina L. P. – 2001
Analysis of secondary data was used as a way to inform the researcher about the trends in her assessment practices over a 4-year period. This was an important initial step in an effort to develop and integrate high-quality classroom assessment tasks and make sense of assessment information for decision making. Scores from 26 groups of graduate and…
Descriptors: Educational Assessment, Evaluation Methods, Feedback, Graduate Students
Ridge, Kirk – 2001
This study investigated whether raters in two different training groups would demonstrate halo error when each rater scored all five responses to five different mathematics performance-based items from each student. One group of 20 raters was trained by an experienced scoring director with item-specific scoring rubrics and the opportunity to…
Descriptors: Evaluators, Feedback, Interrater Reliability, Junior High School Students


