Publication Date
| In 2026 | 3 |
| Since 2025 | 666 |
| Since 2022 (last 5 years) | 3167 |
| Since 2017 (last 10 years) | 7408 |
| Since 2007 (last 20 years) | 15046 |
Descriptor
| Test Reliability | 15036 |
| Test Validity | 10272 |
| Reliability | 9759 |
| Foreign Countries | 7141 |
| Test Construction | 4823 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3525 |
| Interrater Reliability | 3124 |
| Correlation | 3039 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1327 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 252 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Laenen, Annouschka; Alonso, Ariel; Molenberghs, Geert; Vangeneugden, Tony; Mallinckrodt, Craig H. – Applied Psychological Measurement, 2010
Longitudinal studies are permeating clinical trials in psychiatry. Therefore, it is of utmost importance to study the psychometric properties of rating scales, frequently used in these trials, within a longitudinal framework. However, intrasubject serial correlation and memory effects are problematic issues often encountered in longitudinal data.…
Descriptors: Psychiatry, Rating Scales, Memory, Psychometrics
Goh, Jonathan W. P.; Lee, Ong Kim; Salleh, Hairon – Educational Research, 2010
Background: Most empirical investigations in survey research have been conducted using self-reported or self-evaluated item responses. Such measures are common because they are relatively easy to obtain and are often the only feasible way to assess constructs of interest. In order to improve on the validity of self-reports it has become a common…
Descriptors: Validity, Confidentiality, Foreign Countries, Raw Scores
Watson, Amy C.; Angell, Beth; Vidalon, Theresa; Davis, Kristin – Journal of Community Psychology, 2010
Despite increased recent attention to improving the quality of encounters between police officers and people with serious mental illness, there are no measures available for assessing how consumers perceive their interactions with police officers. Drawing upon conceptual frameworks developed within social psychology, this study reports the…
Descriptors: Mental Disorders, Rating Scales, Social Psychology, Police Community Relationship
Elbaum, Batya; Gattamorta, Karina A.; Penfield, Randall D. – Journal of Early Intervention, 2010
This study evaluated the Battelle Developmental Inventory, 2nd Edition, Screening Test (BDI-2 ST) for use in states' child outcomes accountability systems under the Individuals with Disabilities Education Act. Complete Battelle Developmental Inventory, 2nd Edition (BDI-2), assessment data were obtained for 142 children, ages 2 to 62 months, who…
Descriptors: Early Intervention, Screening Tests, Disabilities, Data Analysis
Rutkiene, Ausra; Tereseviciene, Margarita – Quality of Higher Education, 2010
The article presents the stages of the experiment planning that are necessary to ensure the validity and reliability of it. The research data reveal that doctoral students of Educational Research approach the planning of the experiment as the planning of the whole dissertation research; and the experiment as a research method is often confused…
Descriptors: Experiments, Planning, Educational Research, Graduate Students
Martin, Andrew J.; Hau, Kit-Tai – International Journal of Testing, 2010
The present study explored motivation and engagement among Chinese and Australian school students. Based on a sample of 528 Hong Kong Chinese 12-13 year olds and an archive sample of 6,366 Australian 12-13 year olds, achievement motivation was assessed using the Motivation and Engagement Scale-High School (MES-HS). Confirmatory factor analysis and…
Descriptors: Foreign Countries, Achievement Need, Student Motivation, Learner Engagement
Rutkowski, Leslie; Rutkowski, David – Journal of Curriculum Studies, 2010
In addition to collecting achievement data, international large-scale assessment programmes gather auxiliary information from students and schools regarding the context of teaching and learning. In an effort to clarify some of the opacity surrounding international large-scale assessment programmes and the potential problems associated with less…
Descriptors: Measures (Individuals), Data Collection, Questionnaires, Academic Achievement
Chen, Huey-Shys; Sheu, Jiunn-Jye; Ho, Ching-Sung – Journal of School Health, 2010
Background: Cigarette smoking is a health-risk behavior of global proportions. Self-efficacy plays an important role in both smoking acquisition and smoking resistance. Reliability and validity of an instrument is fundamental to research results, particularly in its simplified form on a different population. The purpose of this study was to…
Descriptors: Middle School Students, Smoking, Self Efficacy, Nurses
Rezaei, Ali Reza; Lovorn, Michael – Assessing Writing, 2010
This experimental project investigated the reliability and validity of rubrics in assessment of students' written responses to a social science "writing prompt". The participants were asked to grade one of the two samples of writing assuming it was written by a graduate student. In fact both samples were prepared by the authors. The…
Descriptors: Spelling, Sentence Structure, Punctuation, Social Sciences
Stamou, Lelouda; Schmidt, Charles P.; Humphreys, Jere T. – Journal of Research in Music Education, 2010
The purpose of this study was to standardize the Primary Measures of Music Audiation in Greece ( N = 1,188). Split-halves reliability was acceptable across grade levels (K through 3) for the Tonal and Rhythm subtests, but test-retest reliability was generally unacceptable, especially for the Rhythm subtest. Concurrent validity was mixed, with…
Descriptors: Music, Validity, Intelligence Tests, Foreign Countries
Lee, Yong-Won; Gentile, Claudia; Kantor, Robert – Applied Linguistics, 2010
The main purpose of the study was to investigate the distinctness and reliability of analytic (or multi-trait) rating dimensions and their relationships to holistic scores and "e-rater"[R] essay feature variables in the context of the TOEFL[R] computer-based test (TOEFL CBT) writing assessment. Data analyzed in the study were holistic…
Descriptors: Writing Evaluation, Writing Tests, Scoring, Essays
Memon, Muhammed Ashraf; Joughin, Gordon Rowland; Memon, Breda – Advances in Health Sciences Education, 2010
The purpose of this review was to examine the practice of oral assessment in postgraduate medical education in the context of the core assessment constructs of validity, reliability and fairness. Although oral assessment has a long history in the certification process of medical specialists and is a well-established part of such proceedings for a…
Descriptors: Medical Education, Certification, Exit Examinations, Licensing Examinations (Professions)
Porter, Andrew C.; Polikoff, Morgan S.; Goldring, Ellen; Murphy, Joseph; Elliott, Stephen N.; May, Henry – Educational Administration Quarterly, 2010
Research has consistently shown that principal leadership matters for successful schools. Evaluating principals on the behaviors shown to improve student learning should be an important leverage point for raising leadership quality. Yet principals are often evaluated with the use of instruments with no theoretical background and little, if any,…
Descriptors: Psychometrics, Instructional Leadership, Principals, Test Construction
Marson, Stephen M.; DeAngelis, Donna; Mittal, Nisha – Research on Social Work Practice, 2010
Objectives: The purpose of this article is to create transparency for the psychometric methods employed for the development of the Association of Social Work Boards' (ASWB) exams. Results: The article includes an assessment of the macro (political) and micro (statistical) environments of testing social work competence. The seven-step process used…
Descriptors: Content Validity, Test Validity, Psychometrics, Social Work
Naude, Kevin A.; Greyling, Jean H.; Vogts, Dieter – Computers & Education, 2010
We present a novel approach to the automated marking of student programming assignments. Our technique quantifies the structural similarity between unmarked student submissions and marked solutions, and is the basis by which we assign marks. This is accomplished through an efficient novel graph similarity measure ("AssignSim"). Our experiments…
Descriptors: Grading, Assignments, Correlation, Interrater Reliability

Peer reviewed
Direct link
