Publication Date
| In 2026 | 3 |
| Since 2025 | 666 |
| Since 2022 (last 5 years) | 3167 |
| Since 2017 (last 10 years) | 7408 |
| Since 2007 (last 20 years) | 15046 |
Descriptor
| Test Reliability | 15036 |
| Test Validity | 10272 |
| Reliability | 9759 |
| Foreign Countries | 7141 |
| Test Construction | 4823 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3525 |
| Interrater Reliability | 3124 |
| Correlation | 3039 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1327 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 252 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
DeVore, Jerry R.; Handal, Paul J. – Journal of College Student Personnel, 1981
Reports the test-retest reliability coefficients for each of the five scales of the College Student Satisfaction Questionnaire over a seven-day interval. The reliability coefficients for both male and female private university undergraduate students (N=89) were significantly and uniformly high. (Author)
Descriptors: Affective Measures, College Students, Higher Education, Measures (Individuals)
Peer reviewedJohnson, James H.; And Others – Journal of Clinical Psychology, 1980
Describes a computer assisted system for intake assessment. Reports on two experiments that compared the reliability of a diagnostic procedure that involves technicians, a structured interview schedule, and a computerized diagnostic program with diagnoses made by clinicians. Results show the computer assisted technician approach is as reliable as…
Descriptors: Clinical Diagnosis, Computer Oriented Programs, Measurement Techniques, Mental Health Clinics
Peer reviewedHsu, Louis M. – Applied Psychological Measurement, 1979
A comparison of the relative ordering power of separate and grouped-items true-false tests indicated that neither type of test was uniformly superior to the other across all levels of knowledge of examinees. Grouped-item tests were found superior for examinees with low levels of knowledge. (Author/CTM)
Descriptors: Academic Ability, Knowledge Level, Multiple Choice Tests, Scores
Peer reviewedEbel, Robert L. – Educational Horizons, 1979
The basic rationale for using objective tests is that they are relevant and reliable measures of the most important kind of learning that schools and colleges seek to foster. Measurement of student achievement in learning is necessary, if educational excellence is to be achieved. (Author/SJL)
Descriptors: Academic Ability, Accountability, College Entrance Examinations, Educational Quality
Mattran, Kenneth J. – Adult Literacy and Basic Education, 1979
Gives an overview of the development and testing of an interview instrument used by teachers of English as a second language to assess the oral proficiency of nonnative speakers. (The actual instrument is reprinted in the article that follows this one). (SK)
Descriptors: Adult Basic Education, Educational Diagnosis, English (Second Language), Informal Assessment
Peer reviewedGardner, R. C.; Smythe, P. C. – Canadian Modern Language Review, 1981
Outlines stages followed in initial development of an attitude/motivation test battery to study individual differences associated with achievement in French and presents reliability and validity data. Focuses attention on procedures for test construction and evaluation of test adequacy. (Author/BK)
Descriptors: Attitude Measures, French, Language Tests, Learning Motivation
Peer reviewedRobinson, Elizabeth A.; Eyberg, Sheila M. – Journal of Consulting and Clinical Psychology, 1981
Problem-child families and normal families were classified using the dyadic parent-child interaction coding system (DPICS). The DPICS was found to be a reliable, clinically practical research instrument that correctly classified 94 percent of the families and predicted 64 percent of the variance in parent report of home behavior problems. (Author)
Descriptors: Behavior Problems, Classification, Evaluation Methods, Family (Sociological Unit)
Peer reviewedHolloway, Richard L. – American Journal of Pharmaceutical Education, 1979
Since competency-based instruction is based on the testing process, this study suggests that careful attention to the construction of tests can produce a reliable, valid measure. The next steps to be taken include developing larger banks of test items, increasingly accurate tests, and instruction to meet specified objectives of the curriculum.…
Descriptors: Competency Based Education, Criterion Referenced Tests, Higher Education, Pharmaceutical Education
Peer reviewedEllerman, D. A. – British Journal of Educational Psychology, 1980
Samples of children (N=1,267) in rural Australian primary schools completed the "Where Are You Game" for the assessment of self-regard. Results indicated acceptable estimated test reliability and considerable convergent validity. Comparatively, self-regard was higher in younger children, boys, and higher academic achievers. (Author/SJL)
Descriptors: Academic Achievement, Age Differences, Children, Elementary Education
Peer reviewedHattie, John – Journal of Educational Psychology, 1980
Three conditions for administering creativity tests by Torrance and by Wallach and Kogan were compared: (1) untimed, gamelike; (2) conventional testlike; and (3) administration of measures under testlike conditions on two adjacent days, using the second testing as the predictor. The conventional testlike condition seems optimal. (Author/CP)
Descriptors: Correlation, Creativity, Creativity Tests, Foreign Countries
Sykes, Barbara – CORE, 1979
Problems in grading and evaluating English compositions are discussed. Factors include methods of marking, reliability, prediction, characteristics of markers, and handwriting. The effect of time passage on evaluation criteria was determined by comparing essays written in 1922 with new ones written on the same topic (f=fiche number). (MH)
Descriptors: Elementary Education, Essays, Evaluation Criteria, Factor Structure
Weinrach, Stephen G.; Diamond, Esther E. – Vocational Guidance Quarterly, 1980
Interpreting interest inventories correctly encourages client understanding and decision-making skills. Inadequate time and information overload can be major obstacles. Weinrach's Discrepancy Identification can minimize these problems in interpreting the Kuder Occupational Interest Survey. Includes a response by Diamond on scoring. (JAC)
Descriptors: Career Counseling, Career Planning, Counseling Techniques, Counselors
Peer reviewedWaddell, Deborah D. – Journal of School Psychology, 1980
A review of the technical data available on the 1972 norms edition of the Stanford-Binet demonstrates how inadequate these data are. The Stanford-Binet should not continue to be used in important decision making processes unless this weakness is corrected. (Author)
Descriptors: Educational Assessment, Elementary Secondary Education, Intelligence Quotient, Intelligence Tests
Peer reviewedNeely, Margery A.; Steffan, John D. – Journal of Vocational Education Research, 1979
This investigation sought to establish how documentation of relevant unpaid work experience can be reliably rated by administrators, and how well these ratings correlate with other assessments of administrative skills. A portfolio technique was developed and validated as a documentation device. (SK)
Descriptors: Administrator Qualifications, Evaluation Methods, Experiential Learning, Females
Peer reviewedKlein, Raymond S.; And Others – Journal of Vocational Education Research, 1980
To validate the National Occupational Competency Testing Institute examinations to be used to select vocational instructors in Georgia, a random sample of teachers by occupation and race was tested. Test results were compared to national norms; local norms and a method for determining cut-off scores were developed. (SK)
Descriptors: Cutting Scores, National Competency Tests, National Norms, Postsecondary Education


