Publication Date
| In 2026 | 3 |
| Since 2025 | 656 |
| Since 2022 (last 5 years) | 3157 |
| Since 2017 (last 10 years) | 7398 |
| Since 2007 (last 20 years) | 15036 |
Descriptor
| Test Reliability | 15028 |
| Test Validity | 10265 |
| Reliability | 9757 |
| Foreign Countries | 7137 |
| Test Construction | 4821 |
| Validity | 4191 |
| Measures (Individuals) | 3876 |
| Factor Analysis | 3822 |
| Psychometrics | 3520 |
| Interrater Reliability | 3124 |
| Correlation | 3039 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1326 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 251 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Peer reviewedLynch, Clifford A. – Journal of the American Society for Information Science, 1988
Describes the unique reliability problems of very large databases that necessitate specialized techniques for hardware problem management. The discussion covers the use of controlled partial redundancy to improve reliability, issues in operating systems and database management systems design, and the impact of disk technology on very large…
Descriptors: Algorithms, Computer System Design, Database Management Systems, Databases
Peer reviewedOlivarez, Arturo, Jr.; Tallent-Runnels, Mary K. – Journal of Experimental Education, 1994
The latent composition of the Learning and Study Strategies Inventory for High School (LASSI-HS) was studied through exploratory and confirmatory factor analysis of results from 367 ninth-grade students. Evidence supports a three-factor model. Interrelationships among the constructs are examined, and use of the instrument is discussed. (SLD)
Descriptors: Affective Behavior, Cognitive Processes, Factor Structure, Goal Orientation
Peer reviewedDaud, Nuraihan Mat – Computer Assisted Language Learning, 1995
Discusses the development of a scale to measure variables that may have an effect on teacher's affective attitudes towards computer-assisted language learning. Both qualitative and quantitative methodologies were used in the development of the scale to ensure its validity and reliability. (13 references) (Author/CK)
Descriptors: Affective Behavior, Case Studies, Computer Assisted Instruction, Construct Validity
Peer reviewedReid, William J.; And Others – Journal of Social Work Education, 1996
In a study with 13 social work and counseling interns, field supervisors' ratings of students' field performance were compared to an independent judge's content analysis of performance. Results revealed significant correlations between the evaluations, providing evidence of validity of the supervisors' assessments. Validity may have been enhanced…
Descriptors: Evaluation Methods, Field Experience Programs, Higher Education, Interrater Reliability
Peer reviewedLevine, Phyllis; Edgar, Eugene – Exceptional Children, 1994
High school graduates in regular (n=280) and special education (n=223) and their parents were interviewed. Parent-student agreement percentages were high for the variables of attending postsecondary school, employment status, type of residence, marital status, and number of children. Low agreement rates were obtained for salary level, hours…
Descriptors: Disabilities, Employment, Followup Studies, Graduate Surveys
Peer reviewedMessick, Samuel – American Psychologist, 1995
Presents a comprehensive review of validity that includes an empirical evaluation of the actual and potential consequences of score interpretation and use, how those consequences come about, and what determines them. Six distinguishable aspects of construct validity are highlighted as a means of addressing central issues implicit in the notion of…
Descriptors: Concurrent Validity, Construct Validity, Content Validity, Models
Peer reviewedDempsey, Ian – Australia and New Zealand Journal of Developmental Disabilities, 1995
The Enabling Practices Scale was developed to measure the level of empowering practices used by employment organizations serving individuals with disabilities and was tested with 127 Australian parents who have adult children with intellectual disability. Factor analysis revealed three factors: comfort with parent-staff relationship,…
Descriptors: Adult Children, Cooperation, Decision Making, Employment Services
Peer reviewedThompson, Irene – Foreign Language Annals, 1995
Considers the interrater reliability of certified testers in five European languages, the relationship between interviewer-assigned ratings and second ratings based on audio replay, interrater reliability as a function of proficiency level, effect of different languages on interrater agreement, and interrater disagreements with regard to…
Descriptors: Audiotape Recordings, English (Second Language), Evaluators, French
Peer reviewedFollman, John – Child Study Journal, 1995
Reviews research on elementary school pupils' ratings of teacher effectiveness. Concludes adequate research exists to generalize that pupils can rate reliably, pupils may be no more vulnerable than others to rating leniency and halo, psychometric characteristics and factor structures of rating scales resemble those of college students,…
Descriptors: Elementary School Students, Evaluation Methods, Evaluation Problems, Evaluation Research
Peer reviewedHalberstadt, Amy G.; And Others – Psychological Assessment, 1995
The Self-Expressiveness in the Family Questionnaire is introduced as a measure of emotional expressiveness. Four studies involving 499 mothers and 362 fathers provided evidence of good convergent, discriminant, and construct validity for the instrument and a preliminary short form. Factor analyses support a two-factor solution across the studies.…
Descriptors: Emotional Response, Factor Analysis, Factor Structure, Family Environment
Peer reviewedBlack, Paul – Studies in Educational Evaluation, 1995
The role of assessment in science education is explored, focusing on summative assessment in British public certificate examinations. Examples of test items are presented to illustrate difficulties in making valid and reliable assessments, and issues with implications for formative assessment are discussed. (SLD)
Descriptors: Educational Assessment, Feedback, Foreign Countries, Formative Evaluation
Peer reviewedVu, Nu Viet; And Others – Academic Medicine, 1992
The use of a performance-based assessment of senior medical students' clinical skills utilizing standardized patients was evaluated, with 6,804 student-patient encounters involving 405 students over 6 years. Results provide evidence for test security, content validity, construct validity, reliability, and test ability to discriminate a wide range…
Descriptors: Clinical Experience, Evaluation Methods, Higher Education, Medical Education
Peer reviewedDornyei, Zoltan; Katona, Lucy – Language Testing, 1992
A total of 102 university English majors were administered 4 different language tests to form a General Language Proficiency measure against which the C-test was evaluated. Results confirmed its reliability and validity and also provided data on text difficulty/appropriateness, word structure, content, and different scoring methods. (13…
Descriptors: College Students, English (Second Language), Higher Education, Language Proficiency
Peer reviewedMarsh, Herbert W.; Bailey, Michael – Journal of Higher Education, 1993
The "Students Evaluation of Educational Quality" is an instrument measuring dimensions of college teaching effectiveness. A study showed that, for ratings of 123 instructors in 3,079 classes over 13 years, each instructor has a distinct profile of ratings that generalizes over time, different courses, and course level. (Author/MSE)
Descriptors: Graduate Study, Higher Education, Longitudinal Studies, Profiles
Peer reviewedKolen, Michael J.; And Others – Journal of Educational Measurement, 1992
A procedure is described for estimating the reliability and conditional standard errors of measurement of scale scores incorporating the discrete transformation of raw scores to scale scores. The method is illustrated using a strong true score model, and practical applications are described. (SLD)
Descriptors: College Entrance Examinations, Equations (Mathematics), Error of Measurement, Estimation (Mathematics)


