Publication Date
| In 2026 | 7 |
| Since 2025 | 690 |
| Since 2022 (last 5 years) | 3191 |
| Since 2017 (last 10 years) | 7432 |
| Since 2007 (last 20 years) | 15070 |
Descriptor
| Test Reliability | 15055 |
| Test Validity | 10290 |
| Reliability | 9763 |
| Foreign Countries | 7150 |
| Test Construction | 4828 |
| Validity | 4192 |
| Measures (Individuals) | 3880 |
| Factor Analysis | 3826 |
| Psychometrics | 3532 |
| Interrater Reliability | 3126 |
| Correlation | 3040 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1329 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 253 |
| Taiwan | 234 |
| Netherlands | 224 |
| Spain | 218 |
| California | 215 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
O'Neill, Thomas R.; Lunz, Mary E. – 1996
To generalize test results beyond the particular test administration, an examinee's ability estimate must be independent of the particular items attempted, and the item difficulty calibrations must be independent of the particular sample of people attempting the items. This stability is a key concept of the Rasch model, a latent trait model of…
Descriptors: Ability, Benchmarking, Comparative Analysis, Difficulty Level
Giota, Joanna – 1995
This study examined the concept of quality in child day care and how this can be measured by the Early Childhood Environment Rating Scale (ECERS). Swedish day care centers in three communities were administered a version of the ECERS, which was translated from the original scale to accommodate conceptual differences between Sweden and the United…
Descriptors: Day Care, Day Care Centers, Foreign Countries, Interrater Reliability
August, Diane, Ed.; McArthur, Edith, Ed. – 1996
This report presents the results of a working meeting to provide guidance to staff at the National Center for Education Statistics on: (1) establishing guidelines for inclusion of limited-English-proficient (LEP) students in the National Assessment of Educational Progress (NAEP), field tests, research, and development; (2) modifications in the…
Descriptors: Elementary Secondary Education, Federal Legislation, Federal Regulation, Identification
Terry, Julie – 1996
Year after year, students, teachers, administrators, politicians, and parents are faced with the dilemma of reading assessment. Sheila Valencia (1990) feels that reading assessment has become a hot topic because the outcome tends to differ from school to school. Evaluations should be authentic and trustworthy. Roger Farr (1992) has noted that the…
Descriptors: Academic Achievement, Elementary Secondary Education, Evaluation Methods, Performance Based Assessment
Braun, Henry I.; And Others – 1989
The use of constructed response items in large scale standardized testing has been hampered by the costs and difficulties associated with obtaining reliable scores. The advent of expert systems may signal the eventual removal of this impediment. This study investigated the accuracy with which expert systems could score a new, non-multiple choice…
Descriptors: Computer Science, Constructed Response, Expert Systems, High School Seniors
Motika, Robert T. – 1997
Data from performance measures that were part of two foreign language teacher certification examinations were used in a generalizability study of the quality of their performance ratings. A total of 775 examinees from the Spanish K-12 and 192 examinees from the French K-12 subject area tests of the Florida Teacher Certification Examinations were…
Descriptors: Elementary Secondary Education, Error of Measurement, French, Generalizability Theory
Cashin, William E. – 1995
This paper attempts to summarize the conclusions of the major reviews of the literature on student ratings of teaching. It is an update of a paper by the same name published as IDEA Paper No. 20 from the Center for Faculty Evaluation and Development in 1988. Viewing student ratings as data rather than evaluations may help to put them in proper…
Descriptors: College Faculty, Data Collection, Evaluation Methods, Higher Education
Assessment in Counseling: A Guide to the Use of Psychological Assessment Procedures. Second Edition.
Hood, Albert B.; Johnson, Richard W. – 1997
Assessment has always played an important role in counseling. This book provides information about the various psychological assessment procedures that are relevant for practicing counselors. The text deals with the use of tests that are most often employed by counselors and it includes case studies. Its purpose is to help counselors become better…
Descriptors: Counseling, Counselors, Developmental Psychology, Evaluation Methods
Shapley, Kelly S.; And Others – 1997
The implementation of the Dallas (Texas) Public Schools 1995-96 Title I PK-2 reading and language arts portfolio entailed monitoring and data collection to determine the student outcomes and the technical quality of the instrument. A total of 2,001 portfolios were reviewed for prekindergarten through grade 2. The components were generally in place…
Descriptors: Data Collection, Language Arts, Performance Based Assessment, Portfolio Assessment
Rodriguez-Aragon, Graciela; And Others – 1993
The predictive power of the Split-Half version of the Wechsler Intelligence Scale for Children--Revised (WISC-R) Object Assembly (OA) subtest was compared to that of the full administration of the OA subtest. A cohort of 218 male and 49 female adolescent offenders detained in a Texas juvenile detention facility between 1990 and 1992 was used. The…
Descriptors: Adolescents, Cohort Analysis, Comparative Testing, Correlation
Thompson, Bruce; Crowley, Susan – 1994
Most training programs in education and psychology focus on classical test theory techniques for assessing score dependability. This paper discusses generalizability theory and explores its concepts using a small heuristic data set. Generalizability theory subsumes and extends classical test score theory. It is able to estimate the magnitude of…
Descriptors: Analysis of Variance, Cutting Scores, Decision Making, Error of Measurement
Wasem, Jim – 1993
"Pickleball" is a new racquet sport which is one of the fastest growing educational activities in the Northwest. This paper describes the development of a test battery designed to measure students' pickleball skills for purposes of classification; to determine improvement of playing skills; and to aid in grading of individual…
Descriptors: Higher Education, Physical Education, Preservice Teacher Education, Racquet Sports
Willmington, S. Clay; Steinbrecher, Milda M. – 1993
A "Fundamentals of Speech Communication" course is required of all college students, and upon completion of such a course students should possess those basic speaking and listening skills necessary to complete successfully their college educations. With a view toward developing a new, more effective listening test, a study examined…
Descriptors: Communication Research, Higher Education, Introductory Courses, Listening Comprehension
Huang, Shenghui Cindy; Lloyd, Paul; Mikulecky, Larry – 1999
The development and validation of a scale to assess English-as-a-Second-Language (ESL) learners' perceived self-efficacy are described. Self-efficacy expectations are beliefs about one's ability to perform a given task successfully. Research on self-efficacy and related concepts is reviewed, noting their significant role in predictor human…
Descriptors: English (Second Language), Learning Motivation, Literacy, Rating Scales
Ortiz, Camilo; Arnold, David H.; Stowe, Rebecca M. – 1997
Despite its supposed importance, children's emergent interest in literacy has been seldom studied. As a result, no easy-to-use and psychometrically sound measure of children's emergent interest in literacy exists. This study made an initial attempt at validating such a measure. On three separate occasions, 24 parents and their 2- to 3-year-old…
Descriptors: Childhood Attitudes, Childrens Literature, Measurement Techniques, Picture Books


