Publication Date
| In 2026 | 5 |
| Since 2025 | 627 |
| Since 2022 (last 5 years) | 2564 |
| Since 2017 (last 10 years) | 5599 |
| Since 2007 (last 20 years) | 9195 |
Descriptor
| Test Validity | 21771 |
| Test Reliability | 10011 |
| Test Construction | 5891 |
| Foreign Countries | 4955 |
| Psychometrics | 2963 |
| Factor Analysis | 2941 |
| Measures (Individuals) | 2377 |
| Higher Education | 2250 |
| Evaluation Methods | 2085 |
| College Students | 1813 |
| Correlation | 1723 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 728 |
| Practitioners | 429 |
| Teachers | 142 |
| Administrators | 96 |
| Policymakers | 57 |
| Counselors | 36 |
| Students | 20 |
| Parents | 13 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 2 |
| More ▼ | |
Location
| Turkey | 807 |
| Australia | 347 |
| Canada | 324 |
| China | 300 |
| United States | 188 |
| Indonesia | 172 |
| Spain | 169 |
| United Kingdom | 160 |
| Netherlands | 159 |
| California | 156 |
| Germany | 153 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 1 |
| Meets WWC Standards with or without Reservations | 3 |
| Does not meet standards | 1 |
Tristan, Agustin; Vidal, Rafael – Online Submission, 2007
Wright and Stone had proposed three features to assess the quality of the distribution of the items difficulties in a test, on the so called "most probable response map": line, stack and gap. Once a line is accepted as a design model for a test, gaps and stacks are practically eliminated, producing an evidence of the "scale…
Descriptors: Test Validity, Models, Difficulty Level, Test Items
Lu, Ying; Sireci, Stephen G. – Educational Measurement: Issues and Practice, 2007
Speededness refers to the situation where the time limits on a standardized test do not allow substantial numbers of examinees to fully consider all test items. When tests are not intended to measure speed of responding, speededness introduces a severe threat to the validity of interpretations based on test scores. In this article, we describe…
Descriptors: Test Items, Timed Tests, Standardized Tests, Test Validity
Sawyer, Richard – Applied Measurement in Education, 2007
Current thinking on validity suggests that educational institutions and individuals should evaluate their uses of test scores in the context of their fundamental goals. Regression coefficients and other traditional criterion-related validity statistics provide relevant information, but often do not, by themselves, address the fundamental reasons…
Descriptors: College Admission, Regression (Statistics), Test Validity, Scores
Hauser, Peter C.; Cohen, Julie; Dye, Matthew W. G.; Bavelier, Daphne – Journal of Deaf Studies and Deaf Education, 2007
Visual constructive and visual-motor skills in the deaf population were investigated by comparing performance of deaf native signers (n = 20) to that of hearing nonsigners (n = 20) on the Beery-Buktenica Developmental Test of Visual-Motor Integration, Rey-Osterrieth Complex Figure Test, Wechsler Memory Scale Visual Reproduction subtest, and…
Descriptors: Measures (Individuals), Deafness, Sign Language, Test Validity
Lagumen, Niko G.; Butterwick, Dale J.; Paskevich, David M.; Fung, Tak S.; Donnon, Tyrone L. – Athletic Training Education Journal, 2008
Objective: To establish the intra-rater reliability of nine content-validated Technical Skill Assessment Instruments (TSAI) for the skills of athletic taping. Setting: University of Calgary. Subjects: Canadian Certified Athletic Therapists, CAT(C), with a mean ± SD of 9.6 ± 10.8 years as a CAT(C), 7.8 ± 10.9 years as a Supervisory Athletic…
Descriptors: Athletics, Skills, Health Services, Measures (Individuals)
Mukherji, Sandip; Rustagi, Narendra – Journal of College Teaching & Learning, 2008
This study conducts a survey of students and faculty at a business school on critical issues regarding student evaluations of teaching and identifies several significant differences between their perceptions. Students agreed more strongly than faculty that evaluations are higher in courses where the instructor teaches effectively and students…
Descriptors: Higher Education, Student Evaluation of Teacher Performance, Teacher Effectiveness, Student Attitudes
Vignes, Celine; Coley, Nicola; Grandjean, Helene; Godeau, Emmanuelle; Arnaud, Catherine – Developmental Medicine & Child Neurology, 2008
This study aimed to identify instruments for measuring children's attitudes towards their peers with disabilities that are suitable for use in epidemiological studies and to report on their psychometric properties. A literature review was conducted to identify instruments measuring at least one of the three components of children's attitudes…
Descriptors: Test Reliability, Test Validity, Disabilities, Measures (Individuals)
Mak, Winnie W. S.; Cheung, Rebecca Y. M. – Journal of Applied Research in Intellectual Disabilities, 2008
Background: Affiliate stigma refers to the extent of self-stigmatization among associates of the targeted minorities. Given previous studies on caregiver stigma were mostly qualitative in nature, a conceptually based, unified, quantitative instrument to measure affiliate stigma is still lacking. Materials and Methods: Two hundred and ten…
Descriptors: Mental Retardation, Mental Disorders, Caregivers, Predictive Validity
Sondenaa, E.; Rasmussen, K.; Palmstierna, T.; Nottestad, J. – Journal of Intellectual Disability Research, 2008
Background: The objective of the study was to calculate the prevalence of inmates with intellectual disabilities (ID), and identify historical, medical and criminological characteristics of a certain impact. Methods: A random sample of 143 inmates from a Norwegian prison cross sectional sample was studied. The Hayes Ability Screening Index (HASI)…
Descriptors: Incidence, Mental Retardation, Correctional Institutions, Mental Disorders
Sikora, Darryn M.; Hall, Trevor A.; Hartley, Sigan L.; Gerrard-Morris, Aimee E.; Cagle, Sarah – Journal of Autism and Developmental Disorders, 2008
Behavior checklists are often utilized to screen for Autism Spectrum Disorders (ASDs) when comprehensive evaluations are unfeasible. The usefulness of two behavioral checklists, the Gilliam Autism Rating Scale (GARS) and Child Behavior Checklist (CBCL), in identifying ASDs was investigated among 109 children with Autism, 32 children with ASD, and…
Descriptors: Check Lists, Autism, Child Behavior, Rating Scales
Ma, Lingling; Cronin, John – Northwest Evaluation Association, 2009
Virtual Comparison Groups (VCG) were developed by the Northwest Evaluation Association as an alternative to conventional controlled experiments for social science researchers working in the field of education. The VCG is generally a group of up to 51 students who are matched, based on key characteristics of the student and school, to a single…
Descriptors: Social Science Research, Comparative Analysis, Sampling, Student Characteristics
Sandifer, Cody Clark – ProQuest LLC, 2009
The purpose of this study was to determine if the adoption of the Deming philosophy by teachers and use of the LtoJ[R] process resulted in greater academic achievement. Results of internal consistency analysis indicated that the instrument, the "Commitment to Quality Inventory for Educators," was a reliable measure of the Deming…
Descriptors: Formative Evaluation, Program Effectiveness, Intermediate Grades, Educational Philosophy
Weinberg, Anna; Klonsky, E. David – Psychological Assessment, 2009
The construct of emotion dysregulation increasingly has been used to explain diverse psychopathologies across the lifespan. The Difficulties in Emotion Regulation Scale (DERS; K. L. Gratz & L. Roemer, 2004) represents the most comprehensive measure of the construct to date and exhibits good reliability and validity in adults; however, the…
Descriptors: Eating Disorders, Construct Validity, Drug Use, Test Validity
Amrein-Beardsley, Audrey; Haladyna, Thomas – Journal of College Teaching & Learning, 2009
For over 30 years survey instruments have been used in colleges of higher education to measure instructional effectiveness. Extensive research has been conducted to determine which items best capture this construct. This research study was triggered by a college of education's enthusiastic but failed attempt to create a new and improved instructor…
Descriptors: Teacher Effectiveness, College Faculty, Educational Quality, Teacher Evaluation
Cadiz, David; Sawyer, John E.; Griffith, Terri L. – Educational and Psychological Measurement, 2009
Research on knowledge transfer in organizations has been hampered by the lack of tools yielding valid scores for studying critical constructs in concert. The authors developed survey measures of absorptive capacity (the ability to transform new knowledge into usable knowledge) and experienced community of practice (the extent to which a person is…
Descriptors: Test Validity, Path Analysis, Factor Analysis, Test Construction

Peer reviewed
Direct link
