Publication Date
| In 2026 | 3 |
| Since 2025 | 666 |
| Since 2022 (last 5 years) | 3167 |
| Since 2017 (last 10 years) | 7408 |
| Since 2007 (last 20 years) | 15046 |
Descriptor
| Test Reliability | 15036 |
| Test Validity | 10272 |
| Reliability | 9759 |
| Foreign Countries | 7141 |
| Test Construction | 4823 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3525 |
| Interrater Reliability | 3124 |
| Correlation | 3039 |
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
Location
| Turkey | 1327 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 252 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Pell, A. W.; Manganye, H. T. – Evaluation and Research in Education, 2007
Attitudes to science scales developed in England have been used in a rural region of South Africa with children aged 10 and 11 years in a two-stage data collection investigation. Cultural constraints on the use of "foreign" scales are explored. Factor analyses reveal differences between the South African and English children. A South…
Descriptors: African Languages, Curriculum Development, Foreign Countries, Science Teachers
Resnick, Lauren; And Others – 1993
The New Standards Project (NSP) is an effort to create a state- and district-based assessment and professional development system to serve as a catalyst for major educational reform. As part of a professional development strategy tied to assessment, 114 teachers, curriculum supervisors, and assessment directors, representing 23 states and…
Descriptors: Academic Standards, Educational Assessment, Educational Change, Elementary Secondary Education
Green, Sylvia; Oates, Tim – Educational Research, 2009
Background: In this article we address some of the challenges posed by the development of national assessment systems and discuss the need for high quality information on trends in attainment; support for school improvement processes and ways in which learning should be enhanced through valid assessment. Purpose: Key elements are explored,…
Descriptors: Educational Objectives, National Standards, Educational Quality, Educational Change
Oriogun, Peter K.; Cave, Diana – Journal of Interactive Online Learning, 2008
This article empirically validates an existing content analysis scheme and addresses a main concern of researchers about text-based, online transcripts in the form of code-recoding by mapping our scheme to the practical inquiry, cognitive presence model's five phases directly to realise higher-order thinking or critical thinking aspects for our…
Descriptors: Foreign Countries, Coding, Critical Thinking, Asynchronous Communication
Tekinarslan, Erkan – Online Submission, 2008
The purpose of this study is to develop an attitude scale toward Internet-based learning (IBL) and to investigate whether attitude levels of Turkish distance learners in an IBL environment differ according to their demographical characteristics (i.e. age, gender, marital status, parental status, employment status, grade point average (GPA).…
Descriptors: Grade Point Average, Measures (Individuals), Factor Analysis, Foreign Countries
Trafimow, David; Rice, Stephen – Psychological Review, 2008
People can use a variety of different strategies to perform tasks and these strategies all have two characteristics in common. First, they can be evaluated in comparison with either an absolute or a relative standard. Second, they can be used at varying levels of consistency. In the present article, the authors develop a general theory of task…
Descriptors: Behavior Theories, Performance, Scores, Performance Factors
Buser, Karen P. – 1995
Analysis of covariance (ANCOVA) has been recommended as one vehicle with which to evaluate special education and other intervention impacts (M. J. Taylor and M. S. Innocenti, 1993). Common misinterpretations of this methodology for these purposes are explained. These misapplications of ANCOVA include: (1) ignoring the assumption of homogeneity of…
Descriptors: Analysis of Covariance, Compensatory Education, Evaluation Methods, Evaluation Problems
Mislevy, Robert J. – 1991
This paper lays out a framework for comparing the qualities and the quantities of information about student competence provided by multiple-choice and free-response test items. After discussing the origins of multiple-choice testing and recent influences for change, the paper outlines an "inference network" approach to test theory, in…
Descriptors: Cognitive Psychology, Competence, Elementary Secondary Education, Inferences
Wintre, Maxine G.; Crowley, Jeannine – 1993
The Perception of Parental Reciprocity Scale (POPRS) was originally developed with a late adolescent population to assess the extent of perceived reciprocity in adolescent-parent relations. This study examined the reliability and validity of using POPRS with younger adolescents. Subjects, 655 males and 636 females ranging in age from 13 to 18,…
Descriptors: Adolescents, Affective Measures, Attitude Measures, Foreign Countries
Ceci, Stephen J.; Bruck, Maggie – Social Policy Report, 1993
This report provides an overview of the research on the testimony of young children in cases of sexual abuse, focusing on preschoolers' presumed suggestibility and the role of researchers and mental health professionals as expert witnesses in such cases. It does so in light of the McMartin preschool case, in which seven defendants were acquitted,…
Descriptors: Age Differences, Child Abuse, Court Litigation, Incidence
Pierce, Sarah; And Others – 1995
The development of the Home and Family Questionnaire (HFQ) is described and exploratory factor analyses and initial reliability studies are reported. The HFQ, which incorporates many of the items from the HOME Observation Inventory for Elementary School Children developed by B. M. Caldwell and R. H. Bradley (1984), evaluates home setting and home…
Descriptors: Elementary School Students, Estimation (Mathematics), Factor Structure, Grade 3
College of the Canyons, Santa Clarita, CA. – 1994
In July 1994, College of the Canyons (COC) in California conducted several predictive validity studies of the College Board Assessment and Placement Services (APS) Reading Test. COC began using the 35-question objective format test in spring 1993. Fall 1993 test scores were used in determining the ability of the APS Reading Test to predict student…
Descriptors: Community Colleges, Predictive Validity, Reading Tests, Standardized Tests
Bridgeman, Brent; And Others – 1996
If a student writes two essays, the score reliability can be estimated from the correlation between essays. However, if the essays are in different modes or require different skills, the reliability may be underestimated from the correlation. In Advanced Placement history examinations, students wrote one standard essay and one essay that required…
Descriptors: Advanced Placement, Constructed Response, Correlation, Essay Tests
Bezruczko, Nikolaus; And Others – 1989
The stability of bias estimates from J. Scheuneman's chi-square method, the transformed Delta method, Rasch's one-parameter residual analysis, and the Mantel-Haenszel procedure was compared across small and large samples for a data set of 30,000 cases. Bias values for 30 samples were estimated for each method, and means and variances of item…
Descriptors: Chi Square, Classification, Estimation (Mathematics), Identification
Schwarz, Julie A.; Collins, Michelle L. – 1995
Behaviorally Anchored Rating Scales (BARS) were developed to score responses from a previously designed police written communication test that lacked reliability. Rating scales for each of the 9 dimensions of the test consisted of the scale definition and a 5-point continuum, with the scores of 5, 3, and 1 defined by specified behavioral…
Descriptors: Graduate Students, Graduate Study, Higher Education, Interrater Reliability

