Publication Date
| In 2026 | 3 |
| Since 2025 | 675 |
| Since 2022 (last 5 years) | 3176 |
| Since 2017 (last 10 years) | 7417 |
| Since 2007 (last 20 years) | 15055 |
Descriptor
| Test Reliability | 15043 |
| Test Validity | 10279 |
| Reliability | 9761 |
| Foreign Countries | 7144 |
| Test Construction | 4825 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3526 |
| Interrater Reliability | 3124 |
| Correlation | 3040 |
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
Location
| Turkey | 1328 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 253 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 217 |
| California | 215 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
De Houwer, Jan – Learning and Motivation, 2006
Implicit measures such as the Implicit Association Test (IAT) have recently become popular as tools in research on evaluative conditioning. The reason is that these measures are thought to be impervious to changes in valence that are due to conscious propositional knowledge about the relation between the conditioned stimulus (CS) and the…
Descriptors: Association Measures, Conditioning, Stimuli, Interrater Reliability
Hellman, Chan M.; Fuqua, Dale R.; Worley, Jody – Educational and Psychological Measurement, 2006
The Survey of Perceived Organizational Support (SPOS) is a unidimensional measure of the general belief held by an employee that the organization is committed to him or her, values his or her continued membership, and is generally concerned about the employee's well-being. In the interest of efficiency, researchers are often compelled to use a…
Descriptors: Reliability, Generalization, Employee Attitudes, Beliefs
Goldberg, Mark F. – Education Digest: Essential Readings Condensed for Quick Review, 2004
Tests are a natural part of education, from the quizzes, essays, and classroom tests that teachers have traditionally administered to the high-stakes tests that states use to make decisions about graduation, promotion, and school funding and governance. In this article, the author stresses the need to learn the unintended consequences of…
Descriptors: Testing, High Stakes Tests, Standardized Tests, Federal Legislation
Raju, Nambury S.; Oshima, T.C. – Educational and Psychological Measurement, 2005
Two new prophecy formulas for estimating item response theory (IRT)-based reliability of a shortened or lengthened test are proposed. Some of the relationships between the two formulas, one of which is identical to the well-known Spearman-Brown prophecy formula, are examined and illustrated. The major assumptions underlying these formulas are…
Descriptors: Item Response Theory, Test Reliability, Evaluation Methods, Computation
O'Rourke, Norm – Educational and Psychological Measurement, 2004
The Center for Epidemiologic Studies-Depression (CES-D) Scale is among the most commonly used measures of depressive symptomatology. Despite this, a paucity of research has been undertaken to examine the psychometric properties of responses to this scale. This meta-analytic study examined previously published studies of caregiving to identify…
Descriptors: Measures (Individuals), Psychometrics, Generalization, Depression (Psychology)
Hintze, John M.; Matthews, William J. – School Psychology Review, 2004
This study examined the generalizability of systematic direct observation across setting and time. Participants included 14 students from an intact inclusionary fifth grade classroom. On-task/off-task behavior was directly observed using momentary time-sampling recording, twice a day, for 10 school days. Using Generalizability (G) theory, results…
Descriptors: Grade 5, Psychometrics, Classroom Observation Techniques, Interrater Reliability
Newgent, Rebecca A.; Parr, Patricia E.; Newman, Isadore; Higgins, Kristin K. – Measurement and Evaluation in Counseling and Development, 2004 (peer reviewed)
This investigation was conducted to estimate the reliability and validity of scores on the Riso-Hudson Enneagram Type Indicator (D. R. Riso & R. Hudson, 1999a). Results of 287 participants were analyzed. Alpha suggests an adequate degree of internal consistency. Evidence provides mixed support for construct validity using correlational and…
Descriptors: Personality Traits, Test Validity, Construct Validity, Personality Measures
Kochenderfer-Ladd, Becky – Merrill-Palmer Quarterly, 2003 (peer reviewed)
A study demonstrated the utility of cluster analysis to classify a racially diverse group of children from kindergarten to Grade 3. Four victim subtypes were identified: nonaggressive nonasocial; aggressive; asocial; and both aggressive and asocial. Early aggression levels predicted increases in victimization and chronicity. The role of asocial…
Descriptors: Aggression, Child Behavior, Classification, Cluster Analysis
Goerss, Jean; Amend, Edward R.; Webb, James T.; Webb, Nadia; Beljan, Paul – Roeper Review, 2006
The Hartnett, Nelson, and Rinn 2004 study indicates that diagnostic confusion between ADD/ADHD and giftedness exists, and that research on medication practices is warranted. Mika disagrees, saying that there is no empirical evidence of misdiagnosis of gifted children as having ADD/ADHD. We disagree with Mika's logic, and describe evidence that…
Descriptors: Evidence, Gifted, Reader Response, Diagnostic Tests
Voyer, Daniel – Brain and Cognition, 2004
The purpose of the present study was to replicate and extend to word recognition previous findings of reduced magnitude and reliability of laterality effects when exogenous cueing was used in a dichotic listening task with syllable pairs. Twenty right-handed undergraduate students with normal hearing (10 females, 10 males) completed a dichotic…
Descriptors: Reliability, Effect Size, Listening, Cues
Raykov, Tenko – Structural Equation Modeling: A Multidisciplinary Journal, 2006
A structural equation modeling based method is outlined that accomplishes interval estimation of individual optimal scores resulting from multiple-component measuring instruments evaluating single underlying latent dimensions. The procedure capitalizes on the linear combination of a prespecified set of measures that is associated with maximal…
Descriptors: Scores, Structural Equation Models, Reliability, Validity
Davanzo, Pablo; Kerwin, Lauren; Nikore, Vipan; Esparza, Claudia; Forness, Steve; Murrelle, Lenn – Child Psychiatry and Human Development, 2004
The goal of this study was to test the internal reliability of a Spanish translation of the CDI (i.e., the CDI-LA), a potentially useful screening instrument for Hispanic youngsters in their native language at a primary-care level. Self-reported symptoms of depression were assessed with the CDI-LA in a school sample of 205 Hispanic students. Girls…
Descriptors: Spanish, Translation, Test Reliability, Hispanic American Students
Sloutsky, Vladimir M.; Fisher, Anna V. – Journal of Experimental Psychology: General, 2006
This article is a response to E. Heit and B. K. Hayes's comment on the target article "Induction and Categorization in Young Children: A Similarity-Based Model" (V. M. Sloutsky & A. V. Fisher, 2004a). The response discusses points of agreement and disagreement with Heit and Hayes; phenomena predicted by similarity, induction, naming, and…
Descriptors: Logical Thinking, Classification, Young Children, Recognition (Psychology)
Hunter, Simon C.; Boyle, James M. E.; Warden, David – Educational and Psychological Measurement, 2006
The stability of scores on the Peer-Relations subscale of the Self-Esteem Questionnaire (SEQ) was examined over 11 to 13 months, longer than in previous research. Participants were 839 mainstream Scottish pupils aged 8 to 14 years old (48% male), allowing for the psychometric qualities of the scale to be assessed in a younger sample than…
Descriptors: Test Reliability, Self Esteem, Questionnaires, Peer Relationship
Luhmann, Christian C.; Ahn, Woo-kyoung – Psychological Review, 2005
This paper comments on the response offered by Cheng and Novick to Luhmann and Ahn's initial comments on Cheng's and Cheng and Novick's previous articles. Cheng and Novick argue that people's willingness to generalize across contexts contradicts our hypothesis. They argue that previous studies demonstrate that participants generalize their…
Descriptors: Criticism, Reader Response, Generalization, Hypothesis Testing