Publication Date
| In 2026 | 3 |
| Since 2025 | 675 |
| Since 2022 (last 5 years) | 3176 |
| Since 2017 (last 10 years) | 7417 |
| Since 2007 (last 20 years) | 15055 |
Descriptor
| Test Reliability | 15043 |
| Test Validity | 10279 |
| Reliability | 9761 |
| Foreign Countries | 7144 |
| Test Construction | 4825 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3526 |
| Interrater Reliability | 3124 |
| Correlation | 3040 |
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
Location
| Turkey | 1328 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 253 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 217 |
| California | 215 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Landgraf, Jeanne M. – Journal of Attention Disorders, 2007
Objective: To develop and evaluate a questionnaire, the ADHD Impact Module for Adults (AIM-A), to dimension quality of life for adults with attention-deficit/hyperactivity disorder (ADHD). Method: Six multi-item AIM scales were developed and evaluated in 317 participants enrolled in an open-label trial. Multitrait scaling analysis and correlations…
Descriptors: Scaling, Quality of Life, Hyperactivity, Test Reliability
Henry, Gary T.; Mashburn, Andrew J.; Konold, Timothy – Journal of Psychoeducational Assessment, 2007
High-stakes testing may potentially influence young children's attitudes toward school and learning. This study describes the development and evaluation of child and teacher versions of a measure called Children's Attitudes Toward School (CATS). Exploratory factor analyses of responses by 335 first graders and 130 first-grade teachers each…
Descriptors: Student Attitudes, School Attitudes, Attitude Measures, Grade 1
Kim, Do-Hong; Huynh, Huynh – Journal of Technology, Learning, and Assessment, 2007
This study examined comparability of student scores obtained from computerized and paper-and-pencil formats of the large-scale statewide end-of-course (EOC) examinations in the two subject areas of Algebra and Biology. Evidence in support of comparability of computerized and paper-based tests was sought by examining scale scores, item parameter…
Descriptors: Computer Assisted Testing, Measures (Individuals), Biology, Algebra
Martin, Lynn; Hirdes, John P.; Fries, Brant E.; Smith, Trevor F. – Journal of Policy and Practice in Intellectual Disabilities, 2007
This paper describes the development of the interRAI-Intellectual Disability (interRAI ID), a comprehensive instrument that assesses all key domains of interest to service providers relative to a person with an intellectual disability (ID). The authors report on the reliability and validity of embedded scales for cognition, self-care, aggression,…
Descriptors: Mental Retardation, Dementia, Psychometrics, Depression (Psychology)
Raulston, Cassie; Moellinger, Donna – Understanding Our Gifted, 2007
With the evolution of technology, students can now take online classes that may not be offered in their home schools. While online courses are commonly found in many high schools, WebQuests are used more commonly in elementary schools. Through the exploration of WebQuests, students are able to integrate the Internet into classroom activities. The…
Descriptors: Multiple Intelligences, Class Activities, Learning Activities, Student Interests
Reading, Suzanne; Richie, Carolyn – Child Language Teaching and Therapy, 2007
The Structured Observation System (SOS) is a data collection method developed to document changes in the communication behaviours of children identified with speech and language delays. The system employs a rating scale which reflects the occurrence of communication behaviours as well as the amount of assistance needed for behaviours to occur.…
Descriptors: Observation, Rating Scales, Delayed Speech, Evaluation Methods
Lee, Yong-Won; Kantor, Robert – International Journal of Testing, 2007
Possible integrated and independent tasks were pilot tested for the writing section of a new generation of the TOEFL® (Test of English as a Foreign Language™). This study examines the impact of various rating designs and of the number of tasks and raters on the reliability of writing scores based on integrated and independent tasks from the…
Descriptors: Generalizability Theory, Writing Tests, English (Second Language), Second Language Learning
Trigg, Richard; Skevington, Suzanne M.; Jones, Roy W. – Gerontologist, 2007
Purpose: The study aim was to develop a measure of self-reported quality of life (QoL) for people with mild to moderate dementia based on their views--the Bath Assessment of Subjective Quality of Life in Dementia (BASQID). Design and Methods: We developed the measure through multiple stages. Two field tests of the measure (ns = 60 and 150)…
Descriptors: Alzheimers Disease, Quality of Life, Field Tests, Questionnaires
Liu, Chien-Hung; Chiang, Tzu-Chiang; Huang, Yueh-Min – Interactive Learning Environments, 2007
e-Learning is bringing training to the attention of upper management in a way that other learning technologies have never done. Web-based training will remain predominant to the design and delivery of workplace learning in the 21st century because of its advantages over traditional classroom-based training. A comprehensive framework that…
Descriptors: Training, Problem Solving, Program Effectiveness, Learning Experience
Falk, Beverly; Ort, Suzanne Wichterle; Moirs, Katie – Educational Assessment, 2007
This article describes the findings of studies conducted on a large-scale, classroom-based performance assessment of literacy for the early grades designed to provide information that is useful for reporting, as well as teaching. Technical studies found the assessment to be a promising instrument that is reliable and valid. Follow-up studies of…
Descriptors: Program Effectiveness, Performance Based Assessment, Student Evaluation, Evaluation Research
Zwick, Rebecca; And Others – 1993
Although the belief has been expressed that performance assessments are intrinsically more fair than multiple-choice measures, some forms of performance assessment may in fact be more likely than conventional tests to tap construct-irrelevant factors. As performance assessment grows in popularity, it will be increasingly important to monitor the…
Descriptors: Educational Assessment, Item Bias, Multiple Choice Tests, Performance Based Assessment
Schumacker, Randall E.; Bembry, Karen – 1995
Research has suggested that important research questions can be addressed with meaningful interpretations using hierarchical linear modeling (HLM). The proper interpretation of results, however, is invariably linked to the choice of centering for the Level-1 predictor variables that produce the outcome measure for the Level-2 regression analysis.…
Descriptors: Estimation (Mathematics), Grade 9, High School Students, High Schools
Scholfield, Phil – 1995
This book is a guide to categorizing, measuring, testing, and assessing aspects of language, and is intended for language teachers, speech therapists and other language-related practitioners, and researchers, in conjunction with other resources on research methods and statistics. The first part is a discussion of basic terminology and the varied…
Descriptors: Data Collection, Language Proficiency, Language Skills, Language Tests
Lukhele, Robert; Sireci, Stephen G. – 1995
Free-response (FR) item formats, such as essay questions, are popular in educational assessment. The criticisms against FR items are that they are more expensive to score, take up more testing time, provide less content coverage, and are less reliable than multiple-choice (MC) items. For these reasons, FR items are often combined with MC items.…
Descriptors: Educational Assessment, Essay Tests, Item Response Theory, Multiple Choice Tests
Wang, Wen-chung – 1997
Traditional approaches to the investigation of the objectivity of ratings for constructed-response items are based on classical test theory, which is item-dependent and sample-dependent. Item response theory overcomes this drawback by decomposing item difficulties into genuine difficulties and rater severity. In so doing, objectivity of ability…
Descriptors: College Entrance Examinations, Constructed Response, Foreign Countries, Interrater Reliability
