Publication Date
| In 2026 | 3 |
| Since 2025 | 675 |
| Since 2022 (last 5 years) | 3176 |
| Since 2017 (last 10 years) | 7417 |
| Since 2007 (last 20 years) | 15055 |
Descriptor
| Test Reliability | 15043 |
| Test Validity | 10279 |
| Reliability | 9761 |
| Foreign Countries | 7144 |
| Test Construction | 4825 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3526 |
| Interrater Reliability | 3124 |
| Correlation | 3040 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1328 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 253 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 217 |
| California | 215 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Peer reviewedJagacinski, Carolyn M.; Duda, Joan L. – Educational and Psychological Measurement, 2001
Studied three measures of task and ego achievement goal orientations in terms of factorial and construct validity, internal consistency reliability, and distributional characteristics. Results for 393 undergraduates suggest that the Patterns of Adaptive Learning Survey scales fared better than the other 2 in terms of distributional…
Descriptors: Achievement, Construct Validity, Factor Structure, Goal Orientation
Peer reviewedCarney, Amy G.; Merrell, Kenneth W. – Psychology in the Schools, 2002
Comparability of a Spanish language translation of the Preschool and Kindergarten Behavior Scales (PKBS) was examined in relation to the English language version. Children were rated concurrently by respondents on English and Spanish versions of the PKBS. Results showed virtually identical internal consistency of scores on both forms on Social…
Descriptors: Kindergarten Children, Preschool Children, Preschool Education, Primary Education
Peer reviewedMurgolo-Poore, Marie E.; Pitt, Leyland F.; Ewing, Michael T. – Public Relations Review, 2002
Describes a process directed at developing a simple paper-and-pencil checklist to assess Intranet effectiveness. Discusses the checklist purification procedure, and attempts to establish reliability and validity for the list. Concludes by identifying managerial applications of the checklist, recognizing the limitations of the approach, and…
Descriptors: Check Lists, Higher Education, Online Systems, Program Effectiveness
Peer reviewedBrowning, Mary; Jones, Robert – Journal of Learning Disabilities (United Kingdom), 2002
This study examined the validity and reliability of a staff-completed rating instrument (to determine relationship patterns and compatibility during the resettlement of 57 people with learning difficulties from a hospital to group home settings in Wales). Although ratings were relatively consistent across raters, there was limited support for the…
Descriptors: Adults, Deinstitutionalization (of Disabled), Foreign Countries, Mental Retardation
Peer reviewedUhlenbeck, Anne M.; Verloop, Nico; Beijaard, Douwe – Teachers College Record, 2002
Examined the best approach to the development of procedures for assessing beginning teachers, reviewing studies on teacher thinking, development, learning, and knowledge; examining studies on new approaches to teacher evaluation and on issues of validity and reliability; and proposing a framework with 15 implications for the development of…
Descriptors: Beginning Teachers, Elementary Secondary Education, Evaluation Methods, Faculty Development
Peer reviewedLaurent, Jeff – Journal of School Psychology, 1997
Using an independent college sample, examines the characteristics of the Woodcock-Johnson Tests of Cognitive Ability-Revised. Differences exists in subtest performance based on the order in which the Standard and Supplemental Batteries were administered. Gender differences exist on the Visual Matching, Picture Vocabulary and Cross Out subtests.…
Descriptors: Adults, Cognitive Ability, College Students, Higher Education
Peer reviewedYoshinaga-Itano, Christine; Snyder, Lynn S.; Day, Diane – Volta Review, 1999
The internal reliability and concurrent validity of the Play Assessment Questionnaire (PAQ) was compared to that of the Minnesota Child Development Inventory with 170 deaf or hard of hearing infants and toddlers. The PAQ was found to be a useful nonverbal tool that assesses symbolic play behaviors and demonstrates a parallel development with…
Descriptors: Deafness, Hearing Impairments, Infants, Language Acquisition
Peer reviewedRaines-Eudy, Ruth – Structural Equation Modeling, 2000
Demonstrates empirically a structural equation modeling technique for group comparison of reliability and validity. Data, which are from a study of 495 mothers' attitudes toward pregnancy, have a one-factor measurement model and three sets of subpopulation comparisons. (SLD)
Descriptors: Factor Analysis, Factor Structure, Mothers, Parent Attitudes
Peer reviewedVanLeeuwen, Dawn M.; Dormody, Thomas J.; Seevers, Brenda S. – Journal of Agricultural Education, 1999
Presents a generalizability theory (GT) analysis of data from 531 agricultural students' evaluations of teaching. Illustrates GT estimations of reliability for student and class means and the use of GT in obtaining standard errors. (SK)
Descriptors: Agricultural Education, Generalizability Theory, Higher Education, Measures (Individuals)
Peer reviewedLongford, N. T. – Journal of Educational and Behavioral Statistics, 1994
Presents a model-based approach to rater reliability for essays read by multiple raters. The approach is motivated by generalizability theory, and variation of rater severity and rater inconsistency is considered in the presence of between-examinee variations. Illustrates methods with data from standardized educational tests. (Author/SLD)
Descriptors: Educational Testing, Essay Tests, Generalizability Theory, Interrater Reliability
Tasker, Raymond S. – ESL Magazine, 2001
Looks at the use of the Test of English as a Foreign Language (TOEFL), the most widely used test to assess the English language proficiency of students applying to colleges and universities in the United States or Canada. Examines the TOEFL in relation to backwash, reliability, administration, validity, and ethics. (Author/VWL)
Descriptors: College Entrance Examinations, Ethics, Language Proficiency, Language Tests
Peer reviewedMeijer, Joost; Elshout-Mohr, Marianne; van Hout-Wolters, Bernadette H. A. M. – Educational Research and Evaluation: An International Journal on Theory and Practice, 2001
Constructed a multiple choice instrument to assess the level of competence of students aged 15 and 16 on eight cross-curricular skills. A pilot study involving 465 students and a main study with 9,000 students supported the developed measure, the Cross-Curriculum Skills Test, as a valid and reliable test of cross-curricular skills. (SLD)
Descriptors: Competence, High School Students, High Schools, Reliability
Peer reviewedFitzpatrick, Anne R.; Yen, Wendy M. – Applied Measurement in Education, 2001
Examined the effects of test length and sample size on the alternate forms reliability and equating of simulated mathematics tests composed of constructed response items scaled using the two-parameter partial credit model. Results suggest that, to obtain acceptable reliabilities and accurate equated scores, tests should have at least 8 6-point…
Descriptors: Constructed Response, Equated Scores, Mathematics Tests, Reliability
Peer reviewedCostenbader, Virginia; Ngari, Stephen Mbugua – School Psychology International, 2001
Establishes a Kenyan standardization of the Raven's Coloured Progressive Matrices (RCPM), a nonverbal instrument widely used to assess academic aptitude in young children. Data was gathered from a sample of 1,370 children between the ages of 6 and 10 years. Using the current data, the RCPM appears to be a reliable and valid instrument for use in…
Descriptors: Elementary Education, Foreign Countries, Intelligence Tests, Screening Tests
Peer reviewedMartin, Carol Lynn; Fabes, Richard A. – Developmental Psychology, 2001
Examined whether preschool children's play-partner choices were stable over time and how they influenced behavior. Found that partner preferences were highly sex differentiated and stable over time. Identified two types of consequences of partner choice: a binary effect that influenced differences between the sexes and a social dosage effect that…
Descriptors: Individual Differences, Longitudinal Studies, Peer Influence, Peer Relationship


