Publication Date
| In 2026 | 3 |
| Since 2025 | 666 |
| Since 2022 (last 5 years) | 3167 |
| Since 2017 (last 10 years) | 7408 |
| Since 2007 (last 20 years) | 15046 |
Descriptor
| Test Reliability | 15036 |
| Test Validity | 10272 |
| Reliability | 9759 |
| Foreign Countries | 7141 |
| Test Construction | 4823 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3525 |
| Interrater Reliability | 3124 |
| Correlation | 3039 |
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
Location
| Turkey | 1327 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 252 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Liu, Ou Lydia – Journal of Psychoeducational Assessment, 2009
Learning strategies have been increasingly recognized as a useful tool to promote effective learning. In response to the lack of available learning strategies measures for middle school students, this study evaluated an instrument assessing cognitive, behavioral, and metacognitive strategy use among middle school students. The instrument, called…
Descriptors: Middle School Students, Grades (Scholastic), Learning Strategies, Reliability
Kreiner, Janice; Flexer, Robert – Education and Training in Developmental Disabilities, 2009
The purpose of this study was to develop and to evaluate the Preferences for Leisure Attributes (PLA) Assessment, a forced-choice computer software program for students with severe disabilities and communication difficulties. In order to determine content validity of the PLA Assessment, four experts in related fields assigned critical attributes…
Descriptors: Leisure Time, Developmental Disabilities, Content Validity, Test Validity
Webster, Beverley J.; Hazari, Anjali – Learning Environments Research, 2009
The purpose of this study was to explore a new learning environment instrument which could be used by teaching practitioners and other educators to measure the language learning environment in the secondary science classroom. The science teacher is central in creating science classrooms conducive to the language needs of students and should be…
Descriptors: Student Attitudes, Factor Analysis, Science Teachers, Vocabulary Development
Horzum, Mehmet Baris; Cakir, Ozlem – Educational Sciences: Theory and Practice, 2009
The aim of the present study is to adapt a scale of self-efficacy towards online technologies which was developed by Miltiadou and Yu (2000) to Turkish. In order to adapt the scale, first, the scale items were translated to Turkish by the researchers. Then, a translation form was further developed by consulting eight specialists. These English and…
Descriptors: Undergraduate Students, Intervals, Self Efficacy, Test Reliability
Alderson, J. Charles – Language Testing, 2009
In this article, the author reviews the TOEFL iBT which is the latest version of the TOEFL, whose history stretches back to 1961. The TOEFL iBT was introduced in the USA, Canada, France, Germany and Italy in late 2005. Currently the TOEFL test is offered in two testing formats: (1) Internet-based testing (iBT); and (2) paper-based testing (PBT).…
Descriptors: Oral Language, Writing Tests, Listening Comprehension Tests, Test Reviews
Tsai, Meng-Jung – Educational Technology & Society, 2009
This paper presents the Model of Strategic e-Learning to explain and evaluate student e-learning from metacognitive perspectives. An in-depth interview, pilot study and main study are employed to construct the model and develop an instrument--the Online Learning Strategies Scale (OLSS). The model framework is constructed and illustrated by four…
Descriptors: Learning Strategies, Computer Uses in Education, Construct Validity, Reliability
Chiang, Karl S.; Green, Kathy E.; Cox, Enid O. – Gerontologist, 2009
Purpose: The purpose of this study was to examine scale dimensionality, reliability, invariance, targeting, continuity, cutoff scores, and diagnostic use of the Geriatric Depression Scale-Short Form (GDS-SF) over time with a sample of 177 English-speaking U.S. elders. Design and Methods: An item response theory, Rasch analysis, was conducted with…
Descriptors: Intervention, Geriatrics, Measures (Individuals), Depression (Psychology)
Maiano, Christophe; Begarie, Jerome; Morin, Alexandre J. S.; Ninot, Gregory – Journal of Autism and Developmental Disorders, 2009
The purpose of this study was to test the factor validity and reliability of the Very Short Form of the Physical Self-Inventory (PSI-VSF) within a sample of adolescents with mild to moderate Intellectual Disability (ID). A total of 362 ID adolescents were involved in two studies. In Study 1, the content and format scale response of the PSI-VSF…
Descriptors: Mental Retardation, Test Validity, Factor Structure, Adolescents
Rossiter, Marian J. – Canadian Modern Language Review, 2009
This article explores perceptions of the speaking fluency of 24 adult ESL learners (11 men, 13 women) who narrated picture stories at Time 1 and again 10 weeks later at Time 2. One-minute excerpts from each rendition were randomized and played to 15 novice and six expert native speakers of English (undergraduate education students and experienced…
Descriptors: Native Speakers, English (Second Language), Adult Students, Student Attitudes
Boldt, R. F. – 1992
The Test of Spoken English (TSE) is an internationally administered instrument for assessing nonnative speakers' proficiency in speaking English. The research foundation of the TSE examination described in its manual refers to two sources of variation other than the achievement being measured: interrater reliability and internal consistency.…
Descriptors: Adults, Analysis of Variance, Interrater Reliability, Language Proficiency
Weare, Jane; And Others – 1987
This annotated bibliography was developed upon noting a deficiency of information in the literature regarding the training of raters for establishing agreement. The ERIC descriptor, "Interrater Reliability", was used to locate journal articles. Some of the 33 resulting articles focus on mathematical concepts and present formulas for computing…
Descriptors: Annotated Bibliographies, Cloze Procedure, Correlation, Essay Tests
Livingston, Samuel A. – 1976
A distinction is made between reliability of measurement and reliability of classification; the "criterion-referenced reliability coefficient" describes the former. Application of this coefficient to the probability distribution of possible scores for a single student yields a meaningful way to describe the reliability of a single score. (Author)
Descriptors: Classification, Criterion Referenced Tests, Error of Measurement, Measurement
Newtson, Darren; And Others – 1976
Two five-week test-retest reliability studies of a measure of the unit of perception of ongoing behavior were conducted. In the first, 25 females and 23 males segmented a 7-minute action sequence under fine-unit or gross-unit instructional sets. Number of units marked at first viewing correlated .87 with number of units at retest. Correlations…
Descriptors: Attribution Theory, Behavior Patterns, Behavior Rating Scales, Cognitive Processes
O'Hara, Michael W.; Rehm, Lynn P. – Journal of Consulting and Clinical Psychology, 1983 (peer reviewed)
Used the intraclass correlation coefficient to estimate the interrater reliability of judgments of clinician and novice raters of depressed females (N=20) who took the Hamilton Rating Scale for Depression (HRSD). Expert and student raters both made reliable ratings on the HRSD. Criterion validity for student raters was also satisfactory.…
Descriptors: College Students, Comparative Testing, Cost Effectiveness, Counselor Role
Watkins, Marley W.; Canivez, Gary L. – Diagnostique, 1997
A study of 71 students (ages 7-17) with disabilities investigated the interrater agreement of the Adjustment Scales for Children and Adolescents (ASCA), a behavior rating scale used in school settings. Participants were rated by 29 educational professionals in 24 classrooms. Results found ASCA produced acceptable levels of interrater agreement.…
Descriptors: Behavior Rating Scales, Disabilities, Elementary Secondary Education, Evaluation Methods

