Publication Date
| In 2026 | 3 |
| Since 2025 | 666 |
| Since 2022 (last 5 years) | 3167 |
| Since 2017 (last 10 years) | 7408 |
| Since 2007 (last 20 years) | 15046 |
Descriptor
| Test Reliability | 15036 |
| Test Validity | 10272 |
| Reliability | 9759 |
| Foreign Countries | 7141 |
| Test Construction | 4823 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3525 |
| Interrater Reliability | 3124 |
| Correlation | 3039 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1327 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 252 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Dickey, Edwin; Roblyer, M. D. – Learning & Leading with Technology, 1997
Examines the impact of technology on educational effectiveness in the United States as measured by the National Assessment of Educational Progress (NAEP) and the Third International Mathematics and Science Study (TIMSS). Presents five items from NAEP and TIMSS tests that may favor students with technology experience. Concludes that further…
Descriptors: Computer Literacy, Computer Uses in Education, Educational Assessment, Elementary Secondary Education
Peer reviewedLunz, Mary E.; Schumacker, Randall E. – Journal of Outcome Measurement, 1997
Results and interpretations of the data from a performance examination were compared for four methods of analysis for 74 medical specialty certification candidates: (1) traditional summary statistics; (2) inter-judge correlations; (3) generalizability theory; and (4) the multifaceted Rasch model. Advantages of the Rasch model are outlined. (SLD)
Descriptors: Comparative Analysis, Data Analysis, Generalizability Theory, Interrater Reliability
Chenoweth, Karin – Black Issues in Higher Education, 1997
While Scholastic Assessment Test (SAT) and American College Testing Program (ACT) scores are viewed as reliable, and colleges and universities continue to use them, they are often misunderstood and misused. They are reliable for predicting freshman grades only when comparisons are made within one racial group. They also do not account for student…
Descriptors: Academic Achievement, College Entrance Examinations, College Freshmen, Higher Education
Peer reviewedO'Leary, Michael; Shiel, Gerry – Educational Assessment, 1997
Six dimensions of curriculum profiles are identified: (1) function-purpose; (2) curriculum coverage; (3) criterion referencing; (4) validity-reliability; (5) manageability; and (6) interpretability. Each dimension in Australian National Profiles, the Victoria (Australia) Profiles, and the National Curriculum Assessment in England and Wales is…
Descriptors: Curriculum, Curriculum Development, Educational Administration, Elementary Secondary Education
Peer reviewedSupovitz, Jonathan A.; MacGowan, Andrew, III; Slattery, Jean – Educational Assessment, 1997
Reports on the interrater reliability of a language arts portfolio assessment in the primary grades of the Rochester (New York) school system. Results from approximately 400 primary grade portfolios rated by 2 raters show that teachers can assess their own students' work reliably. (SLD)
Descriptors: Evaluation Methods, Evaluators, Interrater Reliability, Portfolio Assessment
Peer reviewedMaxon, Antonia Brancia; White, Karl R.; Culpepper, Brandt; Vohr, Betty R. – Journal of Communication Disorders, 1997
Describes factors that can affect the referral rate for otoacoustic emission-based newborn hearing screening and discusses the screening results of 1,328 newborns screened with transient evoked otoaoustic emissions prior to hospital discharge. The youngest infants were as likely to pass as infants who were 24-27 hours old. (Author/CR)
Descriptors: Age Differences, Auditory Tests, Evaluation Methods, Hearing Impairments
Peer reviewedPlucker, Jonathan A. – Journal of Secondary Gifted Education, 1997
This study used a sample (n=967) of academically gifted adolescent students attending summer enrichment programs and participating in urban school districts' gifted programs to evaluate the reliability and validity of the Adolescent Coping Scale. Results suggest the instrument is sufficiently reliable for group administration and research purposes…
Descriptors: Academically Gifted, Adolescents, Coping, Elementary Secondary Education
Hill, Roger B. – Journal of Technology Education, 1997
The Observation Procedure for Technology Education Mental Processes, a computerized assessment tool, was based on duration and frequency of mental processes needed for problem solving. Videotapes of students completing problem-solving activities were used to identify the processes. Interrater reliability tests validated the program. (SK)
Descriptors: Cognitive Processes, Computer Software Development, Interrater Reliability, Measures (Individuals)
Peer reviewedWinston, Roger B., Jr.; Bledsoe, Tyrone; Goldstein, Adam R.; Wisbey, Martha E.; Street, James L.; Brown, Steven R.; Goyen, Kenneth D.; Rounds, Linda E. – Journal of College Student Development, 1997
Using M. R. Weisbord's model of organizational diagnosis, researchers developed the Student Organization Environment Scales to measure students' perceptions of the psychosocial environment or climate of college student organizations. Development of the instrument is described and estimates of its reliability and validity are reported. Describes…
Descriptors: College Environment, College Students, Higher Education, Models
Peer reviewedGoffman, Lisa; And Others – Journal of Child Language, 1996
The influence of information level on the production of accuracy of 20 children was examined. Data were children's productions of nouns in sets of utterances referring to triplets of pictures representing noun-verb-noun utterances. (Author/JL)
Descriptors: Acoustic Phonetics, Child Language, Cognitive Processes, Grammar
Peer reviewedOakland, Thomas; And Others – Gifted Child Quarterly, 1996
Eleven leadership measures for children, youth, and adults are reviewed in the context of current leadership theories and psychometric standards for test use. Measures for assessing leadership among children are considered inadequately normed and lacking in reliability and validity data, but leadership measures for adults are seen as more…
Descriptors: Adolescents, Adults, Children, Gifted
Peer reviewedWehby, Joseph H.; Symons, Frank J. – Behavioral Disorders, 1996
Primary among the issues in research on school-age children with emotional and behavioral disorders (EBD) is the prediction and control of aggressive behavior. This paper examines issues in the direct measurement of classroom aggression including low base rates, interactional sequences, and reliability. (Author/DB)
Descriptors: Aggression, Behavior Change, Behavior Disorders, Elementary Secondary Education
Peer reviewedMilligan, Frank – Nurse Education Today, 1996
Grading profiles for formative and summative assessment in a British nursing school were designed with criterion referencing to improve validity and interrater and intercourse reliability. Assessment was conceptualized as an ethical activity that clarifies expectations through specification of criteria. (SK)
Descriptors: Criterion Referenced Tests, Evaluation Criteria, Foreign Countries, Formative Evaluation
Peer reviewedCharman, Tony – Early Child Development and Care, 2003
Asserts that although no instrument has proved sufficiently robust to recommend universal screening, screening instruments for autism spectrum disorder (ASD) can play an important role. Discusses the clinical issues raised in screening for a developmental disorder, including risk status, management advice, and availability of services. Asserts…
Descriptors: Autism, Developmental Disabilities, Disability Identification, Early Identification
Peer reviewedKirby, Sheila Nataraj; McCaffrey, Daniel F.; Lockwood, J. R.; McCombs, Jennifer Sloan; Naftel, Scott; Barney, Heather – Peabody Journal of Education, 2002
Discusses the quality of school-level data collected as part of state accountability systems, including the reliability and validity of school-level test scores as a measure of the value added by schools to student learning, outlining ways that these data can be usefully analyzed; illustrating challenges inherent in doing so; and discussing…
Descriptors: Accountability, Data Collection, Educational Research, Elementary Secondary Education


