Publication Date
| In 2026 | 6 |
| Since 2025 | 481 |
| Since 2022 (last 5 years) | 1960 |
| Since 2017 (last 10 years) | 4532 |
| Since 2007 (last 20 years) | 7017 |
Descriptor
| Test Reliability | 15055 |
| Test Validity | 10022 |
| Test Construction | 4374 |
| Foreign Countries | 3840 |
| Psychometrics | 2435 |
| Factor Analysis | 2302 |
| Measures (Individuals) | 1787 |
| Evaluation Methods | 1410 |
| Higher Education | 1391 |
| Questionnaires | 1264 |
| Factor Structure | 1249 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 454 |
| Practitioners | 319 |
| Teachers | 128 |
| Administrators | 73 |
| Policymakers | 33 |
| Counselors | 31 |
| Students | 17 |
| Parents | 10 |
| Community | 6 |
| Support Staff | 5 |
Location
| Turkey | 840 |
| Australia | 239 |
| China | 211 |
| Canada | 207 |
| Indonesia | 163 |
| Spain | 131 |
| United States | 123 |
| United Kingdom | 121 |
| Germany | 112 |
| Taiwan | 108 |
| Netherlands | 103 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 2 |
| Meets WWC Standards with or without Reservations | 2 |
| Does not meet standards | 1 |
Koretz, Daniel; And Others – 1993
The 1992-93 school year was the second year of the implementation of the Vermont assessment program. Evaluation of the 1991-92 year yielded mixed results, with some evidence that the assessment program was having a strong impact on instruction, but other indications that the reliability of the portfolio scoring in both writing and mathematics was…
Descriptors: Educational Assessment, Elementary Secondary Education, Evaluation Methods, Evaluation Utilization
Sullivan, Francis J. – 1986
A study examined how pragmatic form influences evaluation of student essays in university placement testing. Specifically, the study documented how patterns in students' use of information (assumed to be either old, inferable, or new for readers) affected the holistic scores for quality given to the essays. Subjects, 99 randomly selected entering…
Descriptors: College Freshmen, Essay Tests, Evaluation Criteria, Evaluation Methods
Schempp, Paul G. – 1986
The stability of teaching behavior was examined by observing student/teacher interaction over one academic year. One teacher was studied using a time-series analysis. He had 14 years experience and taught physical education in grades K-6 in a single school. Data were collected over one academic year using the Cheffers Adaptation of Flanders…
Descriptors: Behavior Change, Case Studies, Classroom Observation Techniques, Classroom Research
Perkins, Kyle – 1986
Based on the premise that composition skills and their evaluation are crucial to the educational process, this paper presents a tentative research program for conducting future English as a second language (ESL) composition evaluation studies. The program developed in the paper covers the following topics as areas which merit further rigorous…
Descriptors: Elementary Secondary Education, English (Second Language), Error Analysis (Language), Evaluation Criteria
Bejar, Isaac I. – 1985
The feasibility of reducing scoring costs for the Test of Spoken English (TSE) by using one rater was investigated. Currently, two raters are used. It was found that, because of the possibility of different standards used by potential raters, it does not appear feasible to use a single rater as the sole determiner of speaking proficiency under the…
Descriptors: Analysis of Covariance, Cost Effectiveness, English (Second Language), Evaluation Criteria
Olejnik, Stephen F.; Porter, Andrew C. – 1978
The statistical properties of two methods of estimating gain scores for groups in quasi-experiments are compared: (1) gains in scores standardized separately for each group; and (2) analysis of covariance with estimated true pretest scores. The fan spread hypothesis is assumed for groups but not necessarily assumed for members of the groups.…
Descriptors: Academic Achievement, Achievement Gains, Analysis of Covariance, Analysis of Variance
Peer reviewedTurner, Jean – Annual Review of Applied Linguistics, 1998
This review of research on second-language oral testing outlines the nature of early research in interview-format proficiency testing, then reports on new directions in investigation of construct validity of interview-format and other oral skills tests through examination of examinee, interviewer, and rater performance. Research on empirically…
Descriptors: Construct Validity, Educational Trends, Interrater Reliability, Interviews
Gordon, Howard R. D. – 1996
The purpose of this study was to profile the preferred productivity and learning style preferences of participants enrolled in distance education courses at Marshall University (West Virginia) (Spring of 1995). The accessible population of this study consisted of 167 distance education participants in nursing, education, and paralegal programs. A…
Descriptors: Cognitive Style, College Students, Distance Education, Higher Education
Brown, James Dean; Ross, Jacqueline A. – 1993
This study investigates the Test of English as a Foreign Language (TOEFL), in particular the relative contributions to score dependability (analogous to classical theory reliability) of various numbers of items and subtests as well as the decision dependability at different cut points. Research questions that apply to the overall TOEFL battery and…
Descriptors: English (Second Language), Language Tests, Statistical Analysis, Test Reliability
PDF pending restorationValiga, Michael J. – 1983
An analysis of variance approach to estimating reliability is examined. This approach uses an internal-consistency index for estimating the reliability of survey instruments containing ranked items, and is recommended when the relative ranking of item means is of interest to survey researchers. The computation of this index is demonstrated using…
Descriptors: Analysis of Variance, Higher Education, Institutional Research, Item Analysis
Peer reviewedRubin, Martha; Ventry, Ira M. – American Annals of the Deaf, 1975
Descriptors: Audiology, Auditory Tests, Elementary Secondary Education, Exceptional Child Research
Peer reviewedMartuza, Victor R.; Kallstrom, Dale W. – Psychological Reports, 1974
The multi-trait-multimethod procedure was used to assess the validity of Spillberger's dual conception of anxiety and the interpretation of his scales in a graduate level, educational environment. (Author/KM)
Descriptors: Affective Behavior, Anxiety, Educational Environment, Graduate Study
Peer reviewedAdams, Jack; Creamer, Lyle R. – Psychological Reports, 1974
Findings indicate that students are capable of matching the relative changes in the level of experiences test anxiety with changes in the amplitude of auditory stimulus. (Author/KM)
Descriptors: Anxiety, Auditory Stimuli, College Students, Higher Education
Peer reviewedCrehan, Kevin D. – Journal of Educational Measurement, 1974
Various item selection techniques are compared on criterion-referenced reliability and validity. Techniques compared include three nominal criterion-referenced methods, a traditional point biserial selection, teacher selection, and random selection. (Author)
Descriptors: Comparative Analysis, Criterion Referenced Tests, Item Analysis, Item Banks
Peer reviewedWagener, J. Mark – Journal of Personality Assessment, 1974
Descriptors: Adults, Interpersonal Competence, Males, Maturity (Individuals)


