Publication Date
| In 2026 | 3 |
| Since 2025 | 636 |
| Since 2022 (last 5 years) | 3137 |
| Since 2017 (last 10 years) | 7378 |
| Since 2007 (last 20 years) | 15016 |
Descriptor
| Test Reliability | 15015 |
| Test Validity | 10252 |
| Reliability | 9751 |
| Foreign Countries | 7126 |
| Test Construction | 4811 |
| Validity | 4189 |
| Measures (Individuals) | 3875 |
| Factor Analysis | 3821 |
| Psychometrics | 3515 |
| Interrater Reliability | 3122 |
| Correlation | 3037 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1320 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 251 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Peer reviewedSquires, Jane K.; Potter, LaWanda; Bricker, Diane D.; Lamorey, Suzanne – Early Childhood Research Quarterly, 1998
Examined the use of the Ages and Stages Questionnaires with 96 low- and middle-income parents on their child from 4 to 30 months. Found that percent agreement between a professionally-administered standardized assessment and questionnaires completed by low and middle-income parents was 80% to 91% and 85% to 93%, respectively. (Author/KB)
Descriptors: Child Development, Comparative Analysis, Longitudinal Studies, Low Income Groups
Peer reviewedAllie, Saalih; Buffler, Andy; Kaunda, Loveness; Campbell, Bob; Lubben, Fred – International Journal of Science Education, 1998
Investigates the procedural understanding of first-year university science students in South Africa. Explores ideas related to the reliability of experimental data and discusses the types of reasoning underlying the responses. (DDR)
Descriptors: Cognitive Processes, College Students, Concept Formation, Context Effect
Peer reviewedSameroff, Arnold J.; Fiese, Barbara H. – Monographs of the Society for Research in Child Development, 1999
Investigated reliability of the Family Narrative Consortium (FNC) scales for measuring the effect of social context on construction of family narratives. Analyzed data from four FNC studies to determine reliability of scale dimensions of narrative coherence, narrative interaction, and relationship beliefs. Found that the scale was a set of…
Descriptors: Depression (Psychology), Family Environment, Family History, Family Influence
Peer reviewedNewport, John F. – Assessment & Evaluation in Higher Education, 1996
In the United States, college students' ratings of instructors are routinely used to make personnel decisions. However, closer examination of the qualifications of amateur student raters and novice public school teachers who have received training that should enable them to be good raters suggests that neither group is qualified to give reliable…
Descriptors: College Students, Employment Practices, Faculty Evaluation, Higher Education
Peer reviewedHutchinson, Thomas A. – Language, Speech, and Hearing Services in Schools, 1996
This article uses a question-and-answer format to describe the major categories of technical information usually presented in technical manuals for tests used by speech/language clinicians, including logical evidence of validity, empirical evidence of validity, types of reliability estimates, and practical issues in applying standardization data…
Descriptors: Communication Disorders, Elementary Secondary Education, Language Tests, Psychometrics
Peer reviewedNovak, John R.; And Others – Journal of Educational Research, 1996
Techniques for establishing the reliability and validity of student writing assessment are presented. Raters scored collections of elementary students' narrative writing with holistic scores from two rubrics (one established and one new, performance-based rubric). The new rubric proved reliable and valid, though correlational patterns were not…
Descriptors: Elementary Education, Elementary School Students, Evaluation Methods, Performance Based Assessment
Peer reviewedSegall, Daniel O. – Psychometrika, 1996
Maximum likelihood and Bayesian procedures are presented for item selection and scoring of multidimensional adaptive tests. A demonstration with simulated response data illustrates that multidimensional adaptive testing can provide equal or higher reliabilities with fewer items than are required in one-dimensional adaptive testing. (SLD)
Descriptors: Adaptive Testing, Bayesian Statistics, Computer Assisted Testing, Equations (Mathematics)
Peer reviewedKinch, Carol; Lewis-Palmer, Teri; Hagan-Burke, Shanna; Sugai, George – Education and Treatment of Children, 2001
A study examined the usefulness of information secured from eight students displaying substantially more problem behaviors in one classroom (high-risk) than another, and 16 teachers. Students were able to provide reliable information in the functional assessment interview. Moderate to high agreement was obtained between students and teachers in…
Descriptors: Antisocial Behavior, Behavior Problems, Data Collection, Functional Behavioral Assessment
Peer reviewedVermeer, Anne – Language Testing, 2000
Discusses the reliability and validity of different measures of lexical richness in various language data research and computer simulations, and examines the behavior of these measures in spontaneous speech data of first language and second language children learning Dutch, aged 4 to 7, compared with their lexical abilities as measured by tests.…
Descriptors: Comparative Analysis, Dutch, Grade 1, Kindergarten
Peer reviewedTrusty, Jerry; Harris, Morag B. Colvin – Journal of Adolescent Research, 1999
Examined extent to which demographics, students' personal resources, and family resources predicted stable or lowered educational expectations from eighth grade to 2 years post-high school. Found that predictors of lost talent or lowered expectations over time included low SES and racial group membership. External locus of control predicted lost…
Descriptors: Adolescent Development, Adolescents, Age Differences, Expectation
Peer reviewedJulien, Heidi; Michels, David – Canadian Journal of Information and Library Science, 2000
Describes a study in New Zealand that examined the information behavior of people in daily life contexts. Results showed how users' expectations of the usefulness of information sources varied by gender and source characteristics, including accessibility, trustworthiness, and reliability. Suggests ways to encourage use of formal information…
Descriptors: Access to Information, Credibility, Foreign Countries, Gender Issues
Costello, E. Jane; Egger, Helen; Angold, Adrian – Journal of the American Academy of Child & Adolescent Psychiatry, 2005
Objective: To review recent progress in child and adolescent psychiatric epidemiology in the area of prevalence and burden. Method: The literature published in the past decade was reviewed under two headings: methods and findings. Results: Methods for assessing the prevalence and community burden of child and adolescent psychiatric disorders have…
Descriptors: Evidence, Early Intervention, Incidence, Mental Disorders
Hudson, Peter; Skamp, Keith; Brooks, Lyndon – Science Education, 2005
Perceptions of mentors' practices related to primary science teaching from nine Australian universities (N = 331 final-year preservice teachers) were gathered through a literature-based instrument. Five factors that characterize effective mentoring practices in primary science teaching were supported by confirmatory factory analysis. These…
Descriptors: Teaching Methods, Test Reliability, Preservice Teachers, Pedagogical Content Knowledge
Silvestrone, Judy M. – New Directions for Teaching and Learning, 2004
Whether in the science or language laboratory, carrying out health care procedures or demonstrating performance arts, faculty can improve skill evaluation through transparency and authenticity in exam construction, format, and grading.
Descriptors: Language Laboratories, Performance Based Assessment, Validity, Reliability
Spector, Janet E. – Psychology in the Schools, 2005
Informal Reading Inventories (IRI) are often recommended as instructionally relevant measures of reading. However, they have also been criticized for inattention to technical quality. Examination of reliability evidence in nine recently revised IRIs revealed that fewer than half report reliability. Several appear to have sufficient reliability for…
Descriptors: Informal Reading Inventories, Reading Instruction, Reading Difficulties, Reading Research

Direct link
