Publication Date
| In 2026 | 2 |
| Since 2025 | 613 |
| Since 2022 (last 5 years) | 2550 |
| Since 2017 (last 10 years) | 5585 |
| Since 2007 (last 20 years) | 9181 |
Descriptor
| Test Validity | 21757 |
| Test Reliability | 10004 |
| Test Construction | 5884 |
| Foreign Countries | 4949 |
| Psychometrics | 2962 |
| Factor Analysis | 2941 |
| Measures (Individuals) | 2373 |
| Higher Education | 2249 |
| Evaluation Methods | 2084 |
| College Students | 1812 |
| Correlation | 1722 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 728 |
| Practitioners | 429 |
| Teachers | 142 |
| Administrators | 96 |
| Policymakers | 57 |
| Counselors | 36 |
| Students | 20 |
| Parents | 13 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 2 |
| More ▼ | |
Location
| Turkey | 806 |
| Australia | 347 |
| Canada | 324 |
| China | 300 |
| United States | 188 |
| Indonesia | 171 |
| Spain | 168 |
| United Kingdom | 160 |
| Netherlands | 158 |
| California | 155 |
| Germany | 153 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 1 |
| Meets WWC Standards with or without Reservations | 3 |
| Does not meet standards | 1 |
Peer reviewedVispoel, Walter P.; Coffman, Don D. – Applied Measurement in Education, 1994
Computerized-adaptive (CAT) and self-adapted (SAT) music listening tests were compared for efficiency, reliability, validity, and motivational benefits with 53 junior high school students. Results demonstrate trade-offs, with greater potential motivational benefits for SAT and greater efficiency for CAT. SAT elicited more favorable responses from…
Descriptors: Adaptive Testing, Computer Assisted Testing, Efficiency, Item Response Theory
Peer reviewedBudgell, Glen R.; And Others – Applied Psychological Measurement, 1995
The usefulness of three item response theory-based methods and the Mantel Haenszel technique in evaluating the measurement equivalence of translated assessment instruments was demonstrated in a study involving 2,000 French-speaking Canadian adults who took a French test translation and 2,000 English-speaking adults who took the English original.…
Descriptors: Adults, Chi Square, Cultural Awareness, Culture Fair Tests
East, John W. – Australian Academic & Research Libraries, 2006
This article reviews the methods commonly used to evaluate journals, looking particularly at indicators relevant to journals in the humanities. It then applies these methods to a sample of Australian humanities journals. The indicators used are: level of holdings in large overseas academic libraries, coverage in international databases, standards…
Descriptors: Citations (References), Humanities, Classification, Evaluation Criteria
Cantrell, Pamela – School Science and Mathematics, 2003
The difference in gain scores produced by traditional pretests and those produced by retrospective pretests when compared to posttest scores on the Science Teaching Efficacy Belief Instrument for preservice teachers was investigated in this study. Results indicated that gain scores using the traditional pretest produced significant improvement in…
Descriptors: Pretests Posttests, Validity, Scores, Preservice Teachers
Tortella-Feliu, Miquel; Fullana, Miquel A.; Caseras, Xavier; Andion, Oscar; Torrubia, Rafael; Mataix-Cols, David – Behavior Modification, 2006
The factor structure, psychometric properties, and relationship with personality variables of a Spanish version of the Savings Inventory-Revised (SI-R) are investigated in a sample of 381 undergraduate students. A maximum likelihood factor analysis suggests a three-factor structure, which is similar but not identical to that of the original…
Descriptors: Spanish, Psychometrics, Undergraduate Students, Factor Structure
Eaves, Ronald C.; Williams, Thomas O., Jr. – Journal of Genetic Psychology, 2006
In this study, the authors examined the construct validity of the Pervasive Developmental Disorder Rating Scale (PDDRS; R. C. Eaves, 1993), which is a screening instrument used to identify individuals with autistic disorder and other pervasive developmental disorders. The PDDRS is purported to measure 3 factors--arousal, affect, and…
Descriptors: Pervasive Developmental Disorders, Construct Validity, Test Validity, Factor Structure
Slomp, David H. – English Teaching: Practice and Critique, 2005
A goal of this double issue of English Teaching: Practice and Critique is to collectively consider what we mean when we talk about knowledge about language. How have our understandings changed over time? What are the implications of these new understandings for pedagogy in the field of language teaching? These are necessary and important…
Descriptors: Writing Evaluation, Writing Tests, High School Students, Standardized Tests
Shields, Jennifer; Konold, Timothy R.; Glutting, Joseph J. – Journal of Psychoeducational Assessment, 2004
This study investigated the differential validity of the Wide Range Intelligence Test, which is a new, brief measure of ability. Participants (N = 744) ranged in age from 5 through 85 years (M = 26.7 years, SD = 21.4 years) and varied by the demographic variables of gender, race/ethnicity (Anglo, African American, Hispanic), and education level…
Descriptors: Intelligence, High Schools, Ethnic Groups, Test Validity
Hawks, Steven; Merrill, Ray M.; Madanat, Hala N. – American Journal of Health Education, 2004
This article describes the development and validation of an instrument designed to measure the concept of intuitive eating. To ensure face and content validity for items used in the Likert-type Intuitive Eating Scale (IES), content domain was clearly specified and a panel of experts assessed the validity of each item. Based on responses from 391…
Descriptors: College Students, Obesity, Eating Disorders, Content Validity
Chard, David J.; Clarke, Ben; Baker, Scott; Otterstedt, Janet; Braun, Drew; Katz, Rachell – Assessment for Effective Intervention, 2005
As recent research efforts have focused on preventing reading difficulties and enhancing the effectiveness of special education services for students with reading problems, similar efforts in mathematics have not been realized. This article describes the development and preliminary field testing of a set of measures designed to screen students in…
Descriptors: Test Validity, Predictive Validity, Field Tests, Kindergarten
Santhanam, Elizabeth; Hicks, Owen – Teaching in Higher Education, 2002
The prevalent use of student ratings in teaching evaluations, particularly the reliability of such data, has been debated for many years. Reports in the literature indicate that there are many factors influencing student perceptions of teaching. Three of these factors were investigated at the University of Western Australia, namely the broad…
Descriptors: Foreign Countries, Student Evaluation of Teacher Performance, Intellectual Disciplines, Student Attitudes
Secolsky, Charles, Ed.; Denison, D. Brian, Ed. – Routledge, Taylor & Francis Group, 2011
Increased demands for colleges and universities to engage in outcomes assessment for accountability purposes have accelerated the need to bridge the gap between higher education practice and the fields of measurement, assessment, and evaluation. The "Handbook on Measurement, Assessment, and Evaluation in Higher Education" provides higher…
Descriptors: Generalizability Theory, Higher Education, Institutional Advancement, Teacher Effectiveness
de la Cruz, Rey E. – 1996
This paper reviews the literature on assessment bias issues in special education. While assessment instruments yielding a single IQ score are seen as useful components in a comprehensive multifactored assessment, and are the primary tool of diagnosis for mental retardation, they are found to be irrelevant when applied to students with learning…
Descriptors: Ability Identification, Court Litigation, Culture Fair Tests, Disabilities
Yepes-Baraya, Mario – 1995
This paper describes the task analysis of performance-based science tasks that were designed for the 1994 National Assessment of Educational Progress (NAEP) science assessment, now postponed until 1996, and field tested in 1993. A brief description of the science performance tasks is followed by a description of the task analyses performed and a…
Descriptors: Cognitive Processes, Educational Assessment, Elementary Secondary Education, Field Tests
Chalhoub-Deville, Micheline; Tarone, Elaine – 1996
A discussion of second language testing focuses on the need for collaboration among researchers in second language learning, teaching, and testing concerning development of context-appropriate language tests. It is argued that the nature of the proficiency construct in language is not constant, but that different linguistic, functional, and…
Descriptors: Educational Environment, Evaluation Criteria, Language Proficiency, Language Tests

Direct link
