Publication Date
| In 2026 | 3 |
| Since 2025 | 656 |
| Since 2022 (last 5 years) | 3157 |
| Since 2017 (last 10 years) | 7398 |
| Since 2007 (last 20 years) | 15036 |
Descriptor
| Test Reliability | 15028 |
| Test Validity | 10265 |
| Reliability | 9757 |
| Foreign Countries | 7137 |
| Test Construction | 4821 |
| Validity | 4191 |
| Measures (Individuals) | 3876 |
| Factor Analysis | 3822 |
| Psychometrics | 3520 |
| Interrater Reliability | 3124 |
| Correlation | 3039 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1326 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 251 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Peer reviewedBanville, Dominique; Desrosiers, Pauline; Genet-Volet, Yvette – Journal of Teaching in Physical Education, 2000
Presents a methodology developed by Vallerand (1989) in the psychological field that translates and validates questionnaires and inventories developed for specific cultures. This cross- cultural technique ensures that the instrument will provide data that are valid and reliable in the target population. Seven necessary steps are defined, and…
Descriptors: Cultural Influences, Cultural Relevance, Culturally Relevant Education, Culture Fair Tests
Peer reviewedGoh, Swee Chiew; Fraser, Barry J. – Journal of Research in Childhood Education, 2000
Examined teachers' interpersonal behavior and its association with affective and cognitive outcomes among 1,512 elementary mathematics students in Singapore. Validated a widely applicable and convenient questionnaire to assess teacher interpersonal behavior. Found that different data analysis methods yielded consistent associations between teacher…
Descriptors: Academic Achievement, Elementary Education, Elementary School Students, Elementary School Teachers
Peer reviewedScott, Marcia S.; Deuel, Lois-Lynn Stoyko; Urbano, Richard C.; Fletcher, Kathryn L.; Torres, Carolyn – Education and Training in Mental Retardation and Developmental Disabilities, 1998
The performance on a cognitive screening test of 37 children (ages 4-6) with mild mental retardation or learning disabilities was compared to their peers. The tasks constituting the initial version of the battery were evaluated in terms of their classification accuracy and yielded a set of five different cognitive tasks. (CR)
Descriptors: Classification, Cognitive Ability, Disability Identification, Kindergarten Children
Peer reviewedBrennan, Robert L. – Educational Measurement: Issues and Practice, 1998
Explores the relationship between measurement theory and practice, considering five broad categories of: (1) models, assumptions, and terminology; (2) reliability; (3) validity; (4) scaling; and (5) setting performance standards. It must be recognized that measurement is not an end in itself. (SLD)
Descriptors: Educational Assessment, Educational Practices, Measurement Techniques, Models
Caudell, Lee Sherman – Northwest Education, 1996
Most states have expanded their statewide testing programs to include alternative educational assessments, and two (Kentucky and Maine) have completely abandoned the multiple-choice format. However, over half of states designing alternative assessments are encountering major difficulties related to the high cost of performance-based assessments,…
Descriptors: Accountability, Alternative Assessment, Costs, Educational Assessment
Peer reviewedSquires, Jane K.; Potter, LaWanda; Bricker, Diane D.; Lamorey, Suzanne – Early Childhood Research Quarterly, 1998
Examined the use of the Ages and Stages Questionnaires with 96 low- and middle-income parents on their child from 4 to 30 months. Found that percent agreement between a professionally-administered standardized assessment and questionnaires completed by low and middle-income parents was 80% to 91% and 85% to 93%, respectively. (Author/KB)
Descriptors: Child Development, Comparative Analysis, Longitudinal Studies, Low Income Groups
Peer reviewedAllie, Saalih; Buffler, Andy; Kaunda, Loveness; Campbell, Bob; Lubben, Fred – International Journal of Science Education, 1998
Investigates the procedural understanding of first-year university science students in South Africa. Explores ideas related to the reliability of experimental data and discusses the types of reasoning underlying the responses. (DDR)
Descriptors: Cognitive Processes, College Students, Concept Formation, Context Effect
Peer reviewedSameroff, Arnold J.; Fiese, Barbara H. – Monographs of the Society for Research in Child Development, 1999
Investigated reliability of the Family Narrative Consortium (FNC) scales for measuring the effect of social context on construction of family narratives. Analyzed data from four FNC studies to determine reliability of scale dimensions of narrative coherence, narrative interaction, and relationship beliefs. Found that the scale was a set of…
Descriptors: Depression (Psychology), Family Environment, Family History, Family Influence
Peer reviewedNewport, John F. – Assessment & Evaluation in Higher Education, 1996
In the United States, college students' ratings of instructors are routinely used to make personnel decisions. However, closer examination of the qualifications of amateur student raters and novice public school teachers who have received training that should enable them to be good raters suggests that neither group is qualified to give reliable…
Descriptors: College Students, Employment Practices, Faculty Evaluation, Higher Education
Peer reviewedHutchinson, Thomas A. – Language, Speech, and Hearing Services in Schools, 1996
This article uses a question-and-answer format to describe the major categories of technical information usually presented in technical manuals for tests used by speech/language clinicians, including logical evidence of validity, empirical evidence of validity, types of reliability estimates, and practical issues in applying standardization data…
Descriptors: Communication Disorders, Elementary Secondary Education, Language Tests, Psychometrics
Peer reviewedNovak, John R.; And Others – Journal of Educational Research, 1996
Techniques for establishing the reliability and validity of student writing assessment are presented. Raters scored collections of elementary students' narrative writing with holistic scores from two rubrics (one established and one new, performance-based rubric). The new rubric proved reliable and valid, though correlational patterns were not…
Descriptors: Elementary Education, Elementary School Students, Evaluation Methods, Performance Based Assessment
Peer reviewedSegall, Daniel O. – Psychometrika, 1996
Maximum likelihood and Bayesian procedures are presented for item selection and scoring of multidimensional adaptive tests. A demonstration with simulated response data illustrates that multidimensional adaptive testing can provide equal or higher reliabilities with fewer items than are required in one-dimensional adaptive testing. (SLD)
Descriptors: Adaptive Testing, Bayesian Statistics, Computer Assisted Testing, Equations (Mathematics)
Peer reviewedKinch, Carol; Lewis-Palmer, Teri; Hagan-Burke, Shanna; Sugai, George – Education and Treatment of Children, 2001
A study examined the usefulness of information secured from eight students displaying substantially more problem behaviors in one classroom (high-risk) than another, and 16 teachers. Students were able to provide reliable information in the functional assessment interview. Moderate to high agreement was obtained between students and teachers in…
Descriptors: Antisocial Behavior, Behavior Problems, Data Collection, Functional Behavioral Assessment
Peer reviewedVermeer, Anne – Language Testing, 2000
Discusses the reliability and validity of different measures of lexical richness in various language data research and computer simulations, and examines the behavior of these measures in spontaneous speech data of first language and second language children learning Dutch, aged 4 to 7, compared with their lexical abilities as measured by tests.…
Descriptors: Comparative Analysis, Dutch, Grade 1, Kindergarten
Peer reviewedTrusty, Jerry; Harris, Morag B. Colvin – Journal of Adolescent Research, 1999
Examined extent to which demographics, students' personal resources, and family resources predicted stable or lowered educational expectations from eighth grade to 2 years post-high school. Found that predictors of lost talent or lowered expectations over time included low SES and racial group membership. External locus of control predicted lost…
Descriptors: Adolescent Development, Adolescents, Age Differences, Expectation


