Publication Date
| In 2026 | 7 |
| Since 2025 | 690 |
| Since 2022 (last 5 years) | 3191 |
| Since 2017 (last 10 years) | 7432 |
| Since 2007 (last 20 years) | 15070 |
Descriptor
| Test Reliability | 15055 |
| Test Validity | 10290 |
| Reliability | 9763 |
| Foreign Countries | 7150 |
| Test Construction | 4828 |
| Validity | 4192 |
| Measures (Individuals) | 3880 |
| Factor Analysis | 3826 |
| Psychometrics | 3532 |
| Interrater Reliability | 3126 |
| Correlation | 3040 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1329 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 253 |
| Taiwan | 234 |
| Netherlands | 224 |
| Spain | 218 |
| California | 215 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Fuchs, Douglas; And Others – 1985
The present investigation represents a systematic effort to determine whether handicapped children have been included in the development of test norms, items, and indices of reliability and validity. It analysed up-to-date user manuals and technical supplements of 27 well known and widely used aptitude and achievement tests. Study procedure…
Descriptors: Achievement Tests, Aptitude Tests, Disabilities, Elementary Secondary Education
Peer reviewedMeisels, Samuel J.; And Others – Early Childhood Research Quarterly, 1995
Examined the reliability and validity of the Work Sampling System (WSS) for evaluating the schoolwork of 100 kindergarten children. Results indicated that the WSS checklist and summary report had very high internal and moderately high interrater reliability. The WSS accurately predicted the performance of the children on a norm-referenced…
Descriptors: Academic Achievement, Achievement Tests, Check Lists, Early Childhood Education
Peer reviewedDunbar, Stephen B.; And Others – Applied Measurement in Education, 1991
Issues pertaining to the quality of performance assessments, including reliability and validity, are discussed. The relatively limited generalizability of performance across tasks is indicative of the care needed to evaluate performance assessments. Quality control is an empirical matter when measurement is intended to inform public policy. (SLD)
Descriptors: Educational Assessment, Generalization, Interrater Reliability, Measurement Techniques
Cordier, Deborah – ProQuest LLC, 2009
A renewed focus on foreign language (FL) learning and speech for communication has resulted in computer-assisted language learning (CALL) software developed with Automatic Speech Recognition (ASR). ASR features for FL pronunciation (Lafford, 2004) are functional components of CALL designs used for FL teaching and learning. The ASR features…
Descriptors: Feedback (Response), Computer Assisted Instruction, Validity, Computer Software
Skolits, Gary J.; Richards, Jennifer – Canadian Journal of Program Evaluation, 2009
This article argues that intervention pilot test evaluations have focused insufficient attention on the measurement of project fidelity and the subsequent use of fidelity results for (a) interpreting variations in project outcomes and (b) understanding the rationale for teachers' deviations from implementation protocols. The authors report on the…
Descriptors: Intervention, Pilot Projects, Instructional Improvement, Middle Schools
Garside, Sarah; Levinson, Anthony; Kuziora, Sophie; Bay, Michael; Norman, Geoffrey – Electronic Journal of e-Learning, 2009
Background: Every physician in Ontario needs to know how to fill out a Form 1 in order to legally hold a person against their will for a psychiatric assessment. These forms are frequently inaccurately filled out, which could constitute wrongful confinement and, in extreme circumstances, could lead to fines as large as $25,000. Training people to…
Descriptors: Electronic Learning, Medical Education, Medical Students, Intervention
Renzulli, Joseph S.; Siegle, Del; Reis, Sally M.; Gavin, M. Katherine; Sytsma Reed, Rachael E. – Journal of Advanced Academics, 2009
Teacher rating scales have been used widely throughout the United States as part of a comprehensive plan for identifying potentially gifted and talented students. The Scales for Rating the Behavioral Characteristics for Superior Students (SRBCSS) are among the most frequently used teacher rating scales to assess the characteristics of and nominate…
Descriptors: Gifted, Teacher Evaluation, Talent, Factor Structure
Costantino, Giuseppe; Malgady, Robert G.; Primavera, Louis H. – Journal of Consulting and Clinical Psychology, 2009
This study investigated a new 2-factor construct, termed "cultural congruence", which is related to cultural competence in the delivery of mental health services to ethnic minority clients. Cultural congruence was defined as the distance between the cultural competence characteristics of the health care organization and the clients' perception of…
Descriptors: Health Services, Cultural Awareness, Mental Health Programs, Health Conditions
Westhoff, Gerard J. – Language Teaching, 2009
Teachers' competence to estimate the effectiveness of learning materials is important and often neglected in programmes for teacher education. In this lecture I will try to explore the possibilities of designing scaffolding instruments for a "priori" assessment of language learning tasks, based on insights from SLA and cognitive psychology, more…
Descriptors: Cognitive Psychology, Instructional Materials, Instructional Effectiveness, Test Reliability
Phelps, James L. – Educational Considerations, 2009
The purpose of this article is to illustrate how a valid and reliable state accountability system could be developed that identifies effective schools and school districts in a comprehensive, understandable, and practical way. The author presents an overview of the strategy used in the analysis, discusses the use of education production functions…
Descriptors: School Effectiveness, Accountability, Federal Legislation, Validity
Cunningham, Wm. Scott; Duffee, David E.; Huang, Yufan; Steinke, Camela M.; Naccarato, Toni – Research on Social Work Practice, 2009
Objective: This study describes the development of an engagement scale for use with youth in residential treatment centers. Engagement includes attitude about treatment, bond with providers, and participation in treatment activities. Method: Interview data were collected at the midpoint in residence of 130 youth in two centers. Items were selected…
Descriptors: Residential Programs, Content Validity, Factor Analysis, Logical Thinking
Helms, LuAnn Sherbeck – 1999
This paper discusses the fact that reliability is about scores and not tests and how reliability limits effect sizes. The paper also explores the classical reliability coefficients of stability, equivalence, and internal consistency. Stability is concerned with how stable test scores will be over time, while equivalence addresses the relationship…
Descriptors: Effect Size, Meta Analysis, Reliability, Scores
Pepin, Michel – 1983
This paper presents three different ways of computing the internal consistency coefficient alpha for a same set of data. The main objective of the paper is the illustration of a method for maximizing coefficient alpha. The maximization of alpha can be achieved with the aid of a principal component analysis. The relation between alpha max. and the…
Descriptors: Research Methodology, Research Problems, Statistical Analysis, Test Items
Peer reviewedSchulman, Robert S.; Haden, Richard L. – Psychometrika, 1975
A model is proposed for the description of ordinal test scores based on the definition of true score as expected rank; its deviations are compared with results from classical test theory. An unbiased estimator of population true score from sample data is calculated. Score variance and population reliability are examined. (Author/BJG)
Descriptors: Career Development, Mathematical Models, Test Reliability, Test Theory
Peer reviewedSilverstein, A. B.; Fisher, Gary – Multivariate Behavioral Research, 1975
Responses of male prisoners to the Personal Orientation Inventory were clustered, using hierarchical linkage analysis. Six second-order clusters accounted for all the items. Reliabilities of these clusters were comparable to those of the first-order clusters. Relative validity of cluster scores and scale scores remains to be determined. (RC)
Descriptors: Cluster Analysis, Correlation, Item Analysis, Personality Measures

Direct link
