Publication Date
| In 2026 | 3 |
| Since 2025 | 666 |
| Since 2022 (last 5 years) | 3167 |
| Since 2017 (last 10 years) | 7408 |
| Since 2007 (last 20 years) | 15046 |
Descriptor
| Test Reliability | 15036 |
| Test Validity | 10272 |
| Reliability | 9759 |
| Foreign Countries | 7141 |
| Test Construction | 4823 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3525 |
| Interrater Reliability | 3124 |
| Correlation | 3039 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1327 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 252 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Peer reviewedSegall, Daniel O. – Psychometrika, 1996
Maximum likelihood and Bayesian procedures are presented for item selection and scoring of multidimensional adaptive tests. A demonstration with simulated response data illustrates that multidimensional adaptive testing can provide equal or higher reliabilities with fewer items than are required in one-dimensional adaptive testing. (SLD)
Descriptors: Adaptive Testing, Bayesian Statistics, Computer Assisted Testing, Equations (Mathematics)
Peer reviewedKinch, Carol; Lewis-Palmer, Teri; Hagan-Burke, Shanna; Sugai, George – Education and Treatment of Children, 2001
A study examined the usefulness of information secured from eight students displaying substantially more problem behaviors in one classroom (high-risk) than another, and 16 teachers. Students were able to provide reliable information in the functional assessment interview. Moderate to high agreement was obtained between students and teachers in…
Descriptors: Antisocial Behavior, Behavior Problems, Data Collection, Functional Behavioral Assessment
Peer reviewedVermeer, Anne – Language Testing, 2000
Discusses the reliability and validity of different measures of lexical richness in various language data research and computer simulations, and examines the behavior of these measures in spontaneous speech data of first language and second language children learning Dutch, aged 4 to 7, compared with their lexical abilities as measured by tests.…
Descriptors: Comparative Analysis, Dutch, Grade 1, Kindergarten
Peer reviewedTrusty, Jerry; Harris, Morag B. Colvin – Journal of Adolescent Research, 1999
Examined extent to which demographics, students' personal resources, and family resources predicted stable or lowered educational expectations from eighth grade to 2 years post-high school. Found that predictors of lost talent or lowered expectations over time included low SES and racial group membership. External locus of control predicted lost…
Descriptors: Adolescent Development, Adolescents, Age Differences, Expectation
Peer reviewedJulien, Heidi; Michels, David – Canadian Journal of Information and Library Science, 2000
Describes a study in New Zealand that examined the information behavior of people in daily life contexts. Results showed how users' expectations of the usefulness of information sources varied by gender and source characteristics, including accessibility, trustworthiness, and reliability. Suggests ways to encourage use of formal information…
Descriptors: Access to Information, Credibility, Foreign Countries, Gender Issues
Costello, E. Jane; Egger, Helen; Angold, Adrian – Journal of the American Academy of Child & Adolescent Psychiatry, 2005
Objective: To review recent progress in child and adolescent psychiatric epidemiology in the area of prevalence and burden. Method: The literature published in the past decade was reviewed under two headings: methods and findings. Results: Methods for assessing the prevalence and community burden of child and adolescent psychiatric disorders have…
Descriptors: Evidence, Early Intervention, Incidence, Mental Disorders
Hudson, Peter; Skamp, Keith; Brooks, Lyndon – Science Education, 2005
Perceptions of mentors' practices related to primary science teaching from nine Australian universities (N = 331 final-year preservice teachers) were gathered through a literature-based instrument. Five factors that characterize effective mentoring practices in primary science teaching were supported by confirmatory factory analysis. These…
Descriptors: Teaching Methods, Test Reliability, Preservice Teachers, Pedagogical Content Knowledge
Silvestrone, Judy M. – New Directions for Teaching and Learning, 2004
Whether in the science or language laboratory, carrying out health care procedures or demonstrating performance arts, faculty can improve skill evaluation through transparency and authenticity in exam construction, format, and grading.
Descriptors: Language Laboratories, Performance Based Assessment, Validity, Reliability
Spector, Janet E. – Psychology in the Schools, 2005
Informal Reading Inventories (IRI) are often recommended as instructionally relevant measures of reading. However, they have also been criticized for inattention to technical quality. Examination of reliability evidence in nine recently revised IRIs revealed that fewer than half report reliability. Several appear to have sufficient reliability for…
Descriptors: Informal Reading Inventories, Reading Instruction, Reading Difficulties, Reading Research
Sudweeks, Richard R.; Glissmeyer, Connie B.; Morrison, Timothy G.; Wilcox, Bradley R.; Tanner, Mark W. – Reading Research and Instruction, 2004
Oral retellings are strongly recommended as a way to measure reading comprehension for second language learners (Bernhardt, 1985, 1990, 1991). However, the reliability of such ratings is a matter of concern for a variety of reasons (Aiken, 1996; Cooper, 1981; Saal, Downey, & Lahey, 1980). The purpose of this study was to establish reliable rating…
Descriptors: Error of Measurement, Generalizability Theory, Reading Comprehension, Second Language Learning
Hong, Eunsook; Greene, Mary T.; Higgins, Kyle – Gifted Child Quarterly, 2006
An instrument to measure teachers' instructional practices, the Instructional Practice Questionnaire, was developed and validated in three phases. The questionnaires focused on three domains of instructional practices: cognitive, interpersonal, and interpersonal. First, an initial questionnaire was developed for a pilot study, and data were…
Descriptors: Teaching Methods, Questionnaires, Resource Room Programs, Regular and Special Education Relationship
Hartley, S. L.; MacLean, W. E., Jr. – Journal of Intellectual Disability Research, 2006
Background: Likert-type scales are increasingly being used among people with intellectual disability (ID). These scales offer an efficient method for capturing a wide range of variance in self-reported attitudes and behaviours. This review is an attempt to evaluate the reliability and validity of Likert-type scales in people with ID. Methods:…
Descriptors: Likert Scales, Test Reliability, Test Validity, Measurement Techniques
Kame'enui, Edward J.; Fuchs, Lynn; Francis, David J.; Good, Roland, III; O'Connor, Rollanda E.; Simmons, Deborah C.; Tindal, Gerald; Torgesen, Joseph K. – Educational Researcher, 2006
Assessment of student performance is critical for developing effective instructional policy and designing programs responsive to individual students' needs. To gauge the adequacy of available assessment tools for achieving these ends, the Reading First Assessment Committee (RFAC) developed criteria for evaluating the adequacy of reading measures…
Descriptors: Evaluation Methods, Primary Education, Student Evaluation, Reading Skills
Feldt, Leonard S.; Kim, Seonghoon – Educational and Psychological Measurement, 2006
Researchers sometimes need a statistical test of the hypothesis that two values of Cronbach's alpha reliability coefficient are equal. The situation may involve scores from two different measures administered to independent random samples or from the same measure administered to random samples from two different populations. Feldt derived a test…
Descriptors: Individual Testing, Test Items, Sample Size, Scores
Noens, I.; van Berckelaer-Onnes, I.; Verpoorten, R.; van Duijn, G. – Journal of Intellectual Disability Research, 2006
Background: The ComFor (Forerunners in Communication) is an instrument to explore underlying competence for augmentative communication. More specifically, it measures perception and sense-making of non-transient forms of communication at the levels of presentation and representation. The target group consists primarily of individuals with autism…
Descriptors: Foreign Countries, Comparative Analysis, Verbal Communication, Psychometrics

Direct link
