Publication Date
| In 2026 | 3 |
| Since 2025 | 656 |
| Since 2022 (last 5 years) | 3157 |
| Since 2017 (last 10 years) | 7398 |
| Since 2007 (last 20 years) | 15036 |
Descriptor
| Test Reliability | 15028 |
| Test Validity | 10265 |
| Reliability | 9757 |
| Foreign Countries | 7137 |
| Test Construction | 4821 |
| Validity | 4191 |
| Measures (Individuals) | 3876 |
| Factor Analysis | 3822 |
| Psychometrics | 3520 |
| Interrater Reliability | 3124 |
| Correlation | 3039 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1326 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 251 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Ito, Akihiro – System: An International Journal of Educational Technology and Applied Linguistics, 2004
This study examines the reliability and validity of translation tests as a reading comprehension measure. The following tests were administered to 70 Japanese high school students: (1) open-ended translation (OE-TRAN) test from English to Japanese; (2) multiple-choice translation (MC-TRAN) test; (3) cloze test (CLOZE-T); and (4) short-answer…
Descriptors: Cloze Procedure, Reading Comprehension, Translation, Reading Achievement
Moradi, Bonnie; Subich, Linda Mezydlo – Counseling Psychologist, 2002
Reliability and validity of three current instruments (Feminist Identity Scale [FIS], Feminist Identity Development Scale [FIDS]J Feminist Identity Composite [FIC]) used to operationalize Downing and Roush's model of feminist identity development were compared. A sample of 245 women completed all three instruments, and a separate sample of 35…
Descriptors: Feminism, Social Desirability, Females, Content Validity
Bosman, Anna M. T.; Vonk, Wietske; van Zwam, Margriet – Annals of Dyslexia, 2006
Lexical-decision studies with experienced English and French readers have shown that visual-word identification is not only affected by pronunciation inconsistency of a word (i.e., multiple ways to pronounce a spelling body), but also by spelling inconsistency (i.e., multiple ways to spell a pronunciation rime). The aim of this study was to…
Descriptors: Spelling, Reliability, Dyslexia, Word Recognition
Dillon, Frank; Worthington, Roger L. – Journal of Counseling Psychology, 2003
Five studies on the development of the Lesbian, Gay, and Bisexual Affirmative Counseling Self-Efficacy Inventory (LGB-CSI) were conducted. Exploratory and confirmatory factor analyses of an initial pool of 64 items yielded 5 factors that assess counselor self-efficacy to perform lesbian, gay, and bisexual (LGB) affirmative counseling behaviors…
Descriptors: Validity, Self Efficacy, Social Desirability, Homosexuality
Varrella, Gary F.; Veronesi, Peter D. – Journal of Elementary Science Education, 2004
This paper represents Part I of a two-part study examining preservice teachers' development of a personalized, research-based Science Teaching Rationale (STR). Researchers have historically documented the application of the "rationale paper" (Clough, 1992; Veronesi, 1998) using qualitative methodologies. Since the rationale paper continues to…
Descriptors: Preservice Teachers, Elementary School Science, Statistical Analysis, Evaluation Methods
Katzenmeyer, Conrad; Lawrenz, Frances – New Directions for Evaluation, 2006
This article presents a brief history of the National Science Foundation's involvement in education evaluation, reviews the program evaluations that NSF's Division of Research, Evaluation, and Communication (REC) has conducted in recent years, and describes future directions. The evaluations have reflected the field-driven nature of NSF's…
Descriptors: Teacher Characteristics, Program Evaluation, Teaching Methods, Evaluators
Lecavalier, Luc; Aman, Michael G.; Scahill, Lawrence; McDougle, Christopher J.; McCracken, James T.; Vitiello, Benedetto; Tierney, Elaine; Arnold, L. Eugene; Ghuman, Jaswinder K.; Loftin, Rachel L.; Cronin, Pegeen; Koenig, Kathleen; Posey, David J.; Martin, Andres; Hollway, Jill; Lee, Lisa S.; Kau, Alice S. M. – American Journal on Mental Retardation, 2006
The factor structure, internal consistency, and convergent validity of the Autism Diagnostic Interview-Revised (ADI-R) algorithm items were examined in a sample of 226 youngsters with pervasive developmental disabilities. Exploratory factor analyses indicated a three-factor solution closely resembling the original algorithm and explaining 38% of…
Descriptors: Test Validity, Measures (Individuals), Measurement Techniques, Autism
Wright, Steven; McNeill, Michael; Fry, Joan; Tan, Steven; Tan, Clara; Schempp, Paul – Journal of Teaching in Physical Education, 2006
This study examined 49 student teachers' actions and perspectives when implementing a curricular innovation (the tactical games approach). Data were collected via videotaped lessons, interviews, and follow-up questionnaires. Questions for interviews and questionnaires were pilot tested and data were analyzed using the constant comparison method.…
Descriptors: Educational Innovation, Student Teachers, Student Teacher Attitudes, Videotape Recordings
Cheung, Hoi-yan – Journal of Education for Teaching: International Research and Pedagogy, 2006
This study sought to measure general teacher efficacy levels of in-service primary teachers in Hong Kong. Participants included 725 Hong Kong in-service teachers, who were invited to take part in the study. These in-service teachers came from 28 different primary schools ranging from government, aided, private and direct subsidy schools. The…
Descriptors: Foreign Countries, Measures (Individuals), Teaching Experience, Teacher Effectiveness
Roberti, Jonathan W.; Harrington, Lisa N.; Storch, Eric A. – Journal of College Counseling, 2006
Because of increased stress conditions in college students, updated psychometrics of the Perceived Stress Scale, 10-item version (PSS-10; S. Cohen & G. Williamson, 1988) are necessary. Participants were 281 undergraduates at 3 public universities. An exploratory factor analysis revealed a 2-factor structure measuring Perceived Helplessness and…
Descriptors: Measures (Individuals), Universities, Psychometrics, Undergraduate Students
Attali, Yigal – ETS Research Report Series, 2007
This study examined the construct validity of the "e-rater"® automated essay scoring engine as an alternative to human scoring in the context of TOEFL® essay writing. Analyses were based on a sample of students who repeated the TOEFL within a short time period. Two "e-rater" scores were investigated in this study, the first…
Descriptors: Construct Validity, Computer Assisted Testing, Scoring, English (Second Language)
Natural, Jim – Reclaiming Children and Youth: The Journal of Strength-based Interventions, 2007
Effective group programs with challenging youth need to be grounded in a clear understanding of core principles that guide effective practice. This article examines the operating philosophy of an established outdoor education and therapy program which combines the Re-ED philosophy of Nicholas Hobbs (1994) with adventure education developed by…
Descriptors: Adventure Education, Emotional Disturbances, Group Experience, Educational Principles
Mattsson, Matts; Kemmis, Stephen – Pedagogy, Culture and Society, 2007
This article elucidates criteria which might be helpful in evaluating praxis-related research. The authors explore both sides of the research and development (R & D) project. They examine different ways of understanding contributions to knowledge through research but more especially exploring ideas about contributions to changing praxis. Changing…
Descriptors: Research and Development, Social Sciences, Praxis, Evaluation Criteria
Higgs, Philip; Keevy, James – Perspectives in Education, 2007
This article reflects on the reliability of the evidence contained in the National Qualifications Framework Impact Study, a longitudinal comparative study conducted by the South African Qualifications Authority since 2002. In so doing, the veracity of evidence-based research in determining the impact of the South African Qualifications Framework…
Descriptors: Educational Research, Program Effectiveness, Comparative Analysis, Foreign Countries
Sharifi Ashtiani, Nahid; Babaii, Esmat – Studies in Educational Evaluation, 2007
For decades traditional methods of testing have been criticized for saying relatively little reliably about students' ability as well as causing anxiety, which can negatively affect students' recall of learned information. The reform movement with its innovative approaches focusing on learner-centered education perceives assessment as an…
Descriptors: Teaching Methods, Program Effectiveness, Grade 11, Test Construction

Peer reviewed
Direct link
