Publication Date
| In 2026 | 3 |
| Since 2025 | 666 |
| Since 2022 (last 5 years) | 3167 |
| Since 2017 (last 10 years) | 7408 |
| Since 2007 (last 20 years) | 15046 |
Descriptor
| Test Reliability | 15036 |
| Test Validity | 10272 |
| Reliability | 9759 |
| Foreign Countries | 7141 |
| Test Construction | 4823 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3525 |
| Interrater Reliability | 3124 |
| Correlation | 3039 |
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
Location
| Turkey | 1327 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 252 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Stansfield, Charles W. – Language Testing, 2008
In this speech, the author covers a lot of ground. In the first half of his speech, the author gives a brief summary of the last 40 years of the history of language testing, from his perspective. The author reviews these years more or less by decade. Additionally, he discusses the evolution of the profession of language testing during this period,…
Descriptors: History, Testing, Language Tests, Role
Hove, Oddbjorn; Havik, Odd E. – Research in Developmental Disabilities: A Multidisciplinary Journal, 2008
The DC-LD is a new classification system that provides operationalized diagnostic criteria, in recognition of the lack of applicability of standard psychiatric criteria to adults with intellectual disability. This study attempts to evaluate internal consistency, inter-rater reliability and factor structure of the Psychopathology Checklists for Adults with…
Descriptors: Psychometrics, Psychopathology, Check Lists, Classification
Zhou, Xinyue; Xu, Qian; Ingles, Candido J.; Hidalgo, Maria D.; La Greca, Annette M. – Child Psychiatry and Human Development, 2008
This study evaluated the psychometric properties of the Chinese version of the Social Anxiety Scale for Adolescents (SAS-A) in a sample of 296 adolescents (49% boys) in Grades 7, 8, 9, 10, and 12 with a mean age of 15.52 years. Confirmatory factor analysis replicated the three-factor structure of the SAS-A in the Chinese sample: Fear of Negative…
Descriptors: Measures (Individuals), Anxiety, Adolescents, Test Validity
Sykes, Robert C.; Ito, Kyoko; Wang, Zhen – Educational Measurement: Issues and Practice, 2008
Student responses to a large number of constructed response items in three Math and three Reading tests were scored on two occasions using three ways of assigning raters: single reader scoring, a different reader for each response (item-specific), and three readers each scoring a rater item block (RIB) containing approximately one-third of a…
Descriptors: Test Items, Mathematics Tests, Reading Tests, Scoring
Rader, Martha H.; Bailey, Glenn A.; Kurth, Linda A. – Delta Pi Epsilon Journal, 2008
This study examined the validity of various measures of speed and accuracy for assessing proficiency in speech recognition. The study specifically compared two different word-count indices for speed and accuracy (the 5-stroke word and the 1.4-syllable standard word) on a timing administered to 114 speech recognition students measured at 1-, 2-,…
Descriptors: Speech, Recognition (Psychology), Syllables, Intervals
Fowell, S. L.; Fewtrell, R.; McLaughlin, P. J. – Advances in Health Sciences Education, 2008
Absolute standard setting procedures are recommended for assessment in medical education. Absolute, test-centred standard setting procedures were introduced for written assessments in the Liverpool MBChB in 2001. The modified Angoff and Ebel methods have been used for short answer question-based and extended matching question-based papers,…
Descriptors: Medical Education, Standard Setting (Scoring), Judges, Interrater Reliability
Swanson, Mark – Journal of School Health, 2008
Background: Assessing actual consumption of school cafeteria meals presents challenges, given recall problems of children, the cost of direct observation, and the time constraints in the school cafeteria setting. This study assesses the use of digital photography as a technique to measure what elementary-aged students select and actually consume…
Descriptors: Food Service, Photography, Nutrition, Interrater Reliability
Thorne, John C.; Coggins, Truman – International Journal of Language & Communication Disorders, 2008
Background: Foetal Alcohol Spectrum Disorders (FASD) include the range of disabilities that occur in children exposed to alcohol during pregnancy, with Foetal Alcohol Syndrome (FAS) on the severe end of the spectrum. Clinical research has documented a range of cognitive, social, and communication deficits in FASD and it indicates the need for…
Descriptors: Fetal Alcohol Syndrome, Children, Diagnostic Tests, Speech Communication
Matson, Johnny L.; Dempsey, Timothy; Rivet, Tessa – Research in Autism Spectrum Disorders, 2008
Asperger's syndrome (AS), first diagnosed in 1944, has only recently begun to receive a great deal of research attention. An emerging controversy has been whether AS is a distinct condition from high functioning autism (HFA) and, if so, whether it can be reliably and validly diagnosed. While measures designed specifically to aid in the screening and…
Descriptors: Autism, Asperger Syndrome, Symptoms (Individual Disorders), Identification
Matson, Johnny L.; Gonzalez, Melissa L.; Rivet, Tessa T. – Research in Autism Spectrum Disorders, 2008
Considerable attention has been paid to the diagnosis and treatment of Autism Spectrum Disorders (ASDs) in children and youth. Furthermore, the rationale for using the most restrictive of the applied behavior analysis methods and medication has been largely based on the presence of severe challenging behaviors such as…
Descriptors: Behavior Problems, Autism, Asperger Syndrome, Factor Structure
Ancis, Julie R.; Szymanski, Dawn M.; Ladany, Nicholas – Counseling Psychologist, 2008
This article describes the development and psychometric evaluation of the Counseling Women Competencies Scale (CWCS). The CWCS is designed to assess clinicians' self-perceived competencies with regard to therapeutic practice with diverse female clients. Through an extensive review of the literature on counseling women and expert review by 32…
Descriptors: Graduate Students, Females, Content Validity, Self Concept
Armstrong, Patrick Ian; Allison, Wyndolyn; Rounds, James – Journal of Vocational Behavior, 2008
Although commercially developed interest measures based on Holland's RIASEC types are effectively used in a variety of applied settings, these measures have somewhat limited research utility due to their length and copyright restrictions placed by the test publishers. In the present study, two sets of 8-item RIASEC scales were developed using…
Descriptors: College Students, Copyrights, Validity, Vocational Interests
Lembke, Erica S.; Foegen, Anne; Whittaker, Tiffany A.; Hampton, David – Assessment for Effective Intervention, 2008
The purpose of this study was to examine the use of three early numeracy measures to monitor the mathematics progress of students across time. One hundred and seven kindergarten and Grade 1 students were administered quantity discrimination, number identification, and missing-number measures once each month for 7 months. Alternate form reliability…
Descriptors: Standardized Tests, Numeracy, Reliability, Identification
Holbrook, Allyson; Bourke, Sid; Lovat, Terry; Fairbairn, Hedy – Australian Journal of Education, 2008
This is a mixed methods investigation of consistency in PhD examination. At its core is the quantification of the content and conceptual analysis of examiner reports for 804 Australian theses. First, the level of consistency between what examiners say in their reports and the recommendation they provide for a thesis is explored, followed by an…
Descriptors: Academic Standards, Examiners, Student Evaluation, Foreign Countries
Brennan, David J. – Higher Education Research and Development, 2008
This paper provides an overview of the issue of student anonymity in the summative assessment of student work in higher education. It considers both theoretical literature pertaining to bias in the evaluation of the work of others and the limited empirical work undertaken on this issue in higher education. It then describes the experience of three…
Descriptors: Higher Education, Student Evaluation, Interrater Reliability, Test Bias

