Publication Date
| In 2026 | 3 |
| Since 2025 | 666 |
| Since 2022 (last 5 years) | 3167 |
| Since 2017 (last 10 years) | 7408 |
| Since 2007 (last 20 years) | 15046 |
Descriptor
| Test Reliability | 15036 |
| Test Validity | 10272 |
| Reliability | 9759 |
| Foreign Countries | 7141 |
| Test Construction | 4823 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3525 |
| Interrater Reliability | 3124 |
| Correlation | 3039 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1327 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 252 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Goldstein, Irwin; And Others – 1979
The purpose of this test is to evaluate a non-native speaking student's speaking knowledge of the basic structures of English, using the most frequently used words in the English Language. The test does not attempt to determine vocabulary level or student's ability to learn vocabulary effectively, rather the test focuses exclusively on aural/oral…
Descriptors: English (Second Language), Language Proficiency, Language Tests, Listening Comprehension
Brown, Mary M.; Brown, Scott W. – 1990
An issue facing researchers who study very select populations is how to obtain reliability estimates on instruments. When the populations and resulting samples are very small and select, the ability to obtain reliability estimates becomes very difficult. As a result, many researchers ignore reliability concerns and forge ahead with data…
Descriptors: Estimation (Mathematics), Higher Education, Likert Scales, Measurement Techniques
Cheal, Jennifer Putnam – 1991
The refinement process of a survey instrument developed to operationalize the construct of organizational climate by identifying and describing the dimensions of middle-level school climate is described in this paper. Seven dimensions of organizational climate were identified: administrative support, administrative control, teacher intimacy,…
Descriptors: Construct Validity, Factor Analysis, Intermediate Grades, Junior High Schools
Haladyna, Thomas M. – 1984
The purpose of this study is to examine an option-weighting method as it affects pass-fail decisions in formative and summative evaluation of student achievement for instructional units, certification, advancement, licensure, admissions, placement, and selection. A database was constructed using high school achievement test data where a…
Descriptors: Achievement Tests, Cutting Scores, High Schools, Multiple Choice Tests
Peer reviewedPersons, W. Scott; And Others – Psychology in the Schools, 1976
The authors present a quick and simple procedure for observing four behaviors relevant to classroom management: student disruption, student attention, and the teacher's use of both positive and negative events. The procedure utilizes paraprofessionals as raters and is validated by high interrater reliabilities. (Author/EJT)
Descriptors: Classroom Observation Techniques, Correlation, Observation, Rating Scales
Peer reviewedSilverstein, A. B.; And Others – American Journal of Mental Deficiency, 1975
Descriptors: Adolescents, Cognitive Processes, Evaluation Methods, Exceptional Child Research
Peer reviewedSiegel, Arthur I.; Bergman, Brian A. – Personnel Psychology, 1975
In coping with the problems that have arisen from the rulings of the Supreme Court on equal employment opportunities, a preemployment test was devised that combined miniature job training with tests which are content relevant to all who are tested. (RK)
Descriptors: Job Applicants, Job Training, Psychological Studies, Research Methodology
Peer reviewedScott, Norval C.; And Others – Journal of Medical Education, 1975
Research on the reliability of various measures of interviewing skills used an experimental design involving patient interviews of medical students in their sophomore and senior years. Results suggest caution against overinterpretation of data from faculty raters and illustrate that nonprofessionals can be trained to use interaction analysis…
Descriptors: College Faculty, Educational Research, Higher Education, Interaction Process Analysis
Peer reviewedNelson, Edward A.; Uhl, Norman P. – Multivariate Behavioral Research, 1974
Descriptors: Attitude Change, Attitude Measures, Black Colleges, College Freshmen
Politzer, Robert L.; Brown, Dwight – Florida FL Reporter, 1973
As part of the development of a battery of tests to determine proficiency in black standard and nonstandard speech, a test, consisting of 20 items involving verbal and pictorial cues, was developed and administered to 27 third graders and 32 sixth graders. Results were analyzed to determine test reliability and correlation with other test scores.…
Descriptors: Black Dialects, Elementary Education, Language Proficiency, Language Tests
Peer reviewedChaney, Lillian H.; Billett, Nancy J. – Business Education Forum, 1975
The pilot study, limited to 37 beginning shorthand students at Memphis State University, was an attempt to develop a "credit by examination" instrument. (MW)
Descriptors: Business Education, College Credits, Equivalency Tests, Measurement Instruments
Lee, Yong-Won; Kantor, Robert – ETS Research Report Series, 2005
Possible integrated and independent tasks were pilot tested for the writing section of a new generation of TOEFL® (Test of English as a Foreign Language™) examination. This study examines the impact of various rating designs as well as the impact of the number of tasks and raters on the reliability of writing scores based on integrated and…
Descriptors: Language Tests, English (Second Language), Second Language Learning, Writing Tests
Kim, Sooyeon; von Davier, Alina A.; Haberman, Shelby – ETS Research Report Series, 2006
This study addresses the sample error and linking bias that occur with small and unrepresentative samples in a non-equivalent groups anchor test (NEAT) design. We propose a linking method called the "synthetic function," which is a weighted average of the identity function (the trivial equating function for forms that are known to be…
Descriptors: Equated Scores, Sample Size, Test Items, Statistical Bias
Peer reviewedCarrow, Elizabeth – Journal of Speech and Hearing Disorders, 1974
Descriptors: Evaluation Methods, Exceptional Child Research, Imitation, Language Handicaps
Lichtenstein, Robert – 1988
The Gesell School Readiness Screening Test (GSRST) is widely used to identify "developmentally immature" children for placement in extra-year, transition programs in spite of a problematic absence of psychometric evidence and research support. In this study of psychometric characteristics of the GSRST, teacher ratings of classroom…
Descriptors: Concurrent Validity, Kindergarten, Kindergarten Children, Primary Education


