Publication Date
| In 2026 | 3 |
| Since 2025 | 656 |
| Since 2022 (last 5 years) | 3157 |
| Since 2017 (last 10 years) | 7398 |
| Since 2007 (last 20 years) | 15036 |
Descriptor
| Test Reliability | 15028 |
| Test Validity | 10265 |
| Reliability | 9757 |
| Foreign Countries | 7137 |
| Test Construction | 4821 |
| Validity | 4191 |
| Measures (Individuals) | 3876 |
| Factor Analysis | 3822 |
| Psychometrics | 3520 |
| Interrater Reliability | 3124 |
| Correlation | 3039 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1326 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 251 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Taylor, Marcia B; Porterfield, William D. – 1984
This paper describes the Measure of Epistemological Reflection (MER), an instrument to assess cognitive developmental level according to the Perry scheme of intellectual and ethical development. It contains sets of questions for each of the six cognitive domains: decision making, learner role, instructor role in the learning process, peer role in…
Descriptors: Cognitive Development, Cognitive Tests, Epistemology, Higher Education
Tillinghast, B. S., Jr.; Renzulli, Joseph S. – Journal of Educational Research, 1968
The purpose of this study was to further examine the reliability of the Peabody Picture Vocabulary Test (PPVT), a new instrument to measure hearing vocabulary so that a student's verbal intelligence may be inferred. A group testing procedure was utilized by reproducing the PPVT plates on 35 millimeter transparent slides and projecting them onto a…
Descriptors: Aptitude Tests, Elementary School Students, Evaluation, Group Testing
Livingston, Samuel A. – 1970
The assumptions of the classical test-theory model are used to develop a theory of reliability for criterion-referenced measures which parallels that for norm-referenced measures. It is shown that the Spearman-Brown formula holds for criterion-referenced measures and that the criterion-referenced reliability coefficient can be used to correct…
Descriptors: Correlation, Criterion Referenced Tests, Measurement Instruments, Norm Referenced Tests
Baker, J. Philip – 1971
The usefulness of generalizability theory in assessing the reliability of classroom observation instruments is illustrated, with a new index of reliability, called the coefficient of generalizability, given as an index of how well one can generalize from the instrument to the universe score according to the conditions of observation. Data from an…
Descriptors: Analysis of Variance, Bias, Classroom Observation Techniques, Data Analysis
Peer reviewedBehuniak, Peter, Jr.; And Others – Educational and Psychological Measurement, 1982
This study examined how local content specialists performed when applying the Angoff and Nedelsky standard setting procedures to objective-referenced instruments in reading and mathematics. Results revealed several differences between the standard setting procedures in terms of both level and consistency of the cut scores generated. (Author/BW)
Descriptors: Comparative Analysis, Criterion Referenced Tests, Cutting Scores, Interrater Reliability
Peer reviewedLivingston, Samuel A.; Wingersky, Marilyn A. – Journal of Educational Measurement, 1979
Procedures are described for studying the reliability of decisions based on specific passing scores with tests made up of discrete items and designed to measure continuous rather than categorical traits. These procedures are based on the estimation of the joint distribution of true scores and observed scores. (CTM)
Descriptors: Cutting Scores, Decision Making, Efficiency, Error of Measurement
Peer reviewedMagnusson, D.; Backteman, G. – Applied Psychological Measurement, 1979
A longitudinal study of approximately 1,000 students aged 10-16 showed high stability of intelligence and creativity. Stability coefficients for intelligence were higher than those for creativity. Results supported the construct validity of creativity. (MH)
Descriptors: Creativity, Creativity Tests, Elementary Secondary Education, Foreign Countries
Rojahn, Johannes; Tasse, Marc J.; Sturmey, Peter – American Journal on Mental Retardation, 1997
Development of the Stereotyped Behavior Scale for adolescents and adults with mental retardation is described. Use with 600 individuals resulted in refinement and a 26-item scale with an internal consistency alpha of 0.88, test-retest reliability of p=0.90, and interrater reliability of p=0.76. (DB)
Descriptors: Adolescents, Adults, Behavior Patterns, Behavior Rating Scales
Beuttler, Marybeth Grant; Leininger, Peter M.; Palisano, Robert J. – Physical & Occupational Therapy in Pediatrics, 2004
Purpose: The purpose of this study was to examine the test-retest and inter-rater reliability of a measure of muscle extensibility developed by Tardieu, de la Tour, Bret, and Tardieu (1982) in fullterm and preterm newborns. Method: Twenty-one fullterm infants and twenty preterm infants were examined by two physical therapists. Each physical…
Descriptors: Premature Infants, Neonates, Human Body, Motor Development
Hafner, John C.; Hafner, Patti M. – International Journal of Science Education, 2003
Although the rubric has emerged as one of the most popular assessment tools in progressive educational programs, there is an unfortunate dearth of information in the literature quantifying the actual effectiveness of the rubric as an assessment tool "in the hands of the students." This study focuses on the validity and reliability of the rubric as…
Descriptors: Interrater Reliability, Generalizability Theory, Biology, Scoring Rubrics
Tennessee Department of Education, 2012
In the summer of 2011, the Tennessee Department of Education contracted with the National Institute for Excellence in Teaching (NIET) to provide a four-day training for all evaluators across the state. NIET trained more than 5,000 evaluators intensively in the state model (districts using alternative instruments delivered their own training).…
Descriptors: Video Technology, Feedback (Response), Evaluators, Interrater Reliability
Kiboss, Joel Kipkemboi – Journal of Educational Computing Research, 2012
Achievement in mathematics is an issue of great concern not only to students and parents but also to employers and researchers in Kenya. This is because the Kenya National Examination Council (KNEC) has continuously reported dismal results in this area, and especially in geometry. Also, KNEC indicates that it presents difficulties to both the…
Descriptors: Foreign Countries, Developing Nations, Electronic Learning, Computer Assisted Instruction
Fostering Close and Effective Relationships in Youth Mentoring Programs. Research in Action. Issue 4
Rhodes, Jean – MENTOR, 2007
Successful mentors seem to understand and appreciate their mentees, entering their worlds to uncover their unique strengths and capabilities. This sort of empathy and sensitivity goes a long way toward facilitating close relationships, as does the mentee's willingness to fully engage in the mentoring experience. And, since initial resistances may…
Descriptors: Mentors, Empathy, Motivation, Interpersonal Relationship
Zhang, Yixin – Computers & Education, 2007
This paper describes the development and validation of a new 40-item Internet Attitude Scale (IAS), a one-dimensional inventory for measuring the Internet attitudes. The first experiment initiated a generic Internet attitude questionnaire, ensured construct validity, and examined factorial validity and reliability. The second experiment further…
Descriptors: Predictive Validity, Test Validity, Computer Attitudes, Internet
Worrell, Frank C.; Mello, Zena R. – Educational and Psychological Measurement, 2007
In this study, the authors examined the reliability, structural validity, and concurrent validity of Zimbardo Time Perspective Inventory (ZTPI) scores in a group of 815 academically talented adolescents. Reliability estimates of the purported factors' scores were in the low to moderate range. Exploratory factor analysis supported a five-factor…
Descriptors: Adolescents, Validity, Time Perspective, Factor Analysis

Direct link
