[Peer reviewed] Jafarpur, Abdoljavad – System, 1988
Investigation of non-native English speakers' ratings of other non-native English learners' oral proficiency. Results indicate that the judges' ratings significantly differed, and the average of three judges' ratings was a better appraisal of the testee's true ability than that of any single rating or pair of ratings. (Author/CB)
Descriptors: English (Second Language), Evaluation Methods, Foreign Countries, Interrater Reliability
[Peer reviewed] Hutton, Jerry B.; And Others – Psychology in the Schools, 1987
Special education, basic, and honors ninth-grade students (n=60) rated the severity of stress for each of the life events on the Source of Stress Inventory (Chandler, 1981). There was a significant positive relationship between the Chandler rankings (teachers and mental health workers) and the student rankings. (Author/NB)
Descriptors: Grade 9, Interrater Reliability, Mental Health, Secondary Education
[Peer reviewed] Luecht, Richard M. – Educational and Psychological Measurement, 1987
Test Pac, a test scoring and analysis computer program for moderate-sized sample designs using dichotomous response items, performs comprehensive item analyses and multiple reliability estimates. It also performs single-facet generalizability analysis of variance, single-parameter item response theory analyses, test score reporting, and computer…
Descriptors: Computer Assisted Testing, Computer Software, Computer Software Reviews, Item Analysis
[Peer reviewed] Jones, Randy M.; Streitmatter, Janice L. – Adolescence, 1987
Examined Extended Objective Measure of Ego Identity Status for reliability and validity among 467 secondary school students. Results were supportive of appropriateness of all measures for the subjects. Analysis of reliability, validity, demographic characteristics, and psychosocial maturity yielded results which parallel theoretical framework and…
Descriptors: Adolescent Development, Adolescents, College Students, Secondary Education
[Peer reviewed] Lustig, Myron W. – Small Group Behavior, 1987
Investigated reliability and dimensionality of Bales's Interpersonal Rating Forms (IRF) using volunteer subjects (N=266) enrolled in an undergraduate communications course. Results documented shortcomings of the IRF as a measuring instrument, finding the subscales neither reliable nor dimensionally structured; only 2 of 18 items in each subscale are…
Descriptors: College Students, Group Behavior, Groups, Higher Education
[Peer reviewed] Looney, Marilyn A. – Research Quarterly for Exercise and Sport, 1987
The characteristics of three threshold loss agreement indices which reflect the agreement or consistency in assignment to mastery-nonmastery status are reviewed. These are proportion of agreement, coefficient kappa, and modified kappa. (Author/MT)
Descriptors: Confidence Testing, Criterion Referenced Tests, Cutting Scores, Elementary Secondary Education
[Peer reviewed] Madsen, Harold – CALICO Journal, 1986
Evaluates one of the first operational computerized-adaptive English-as-a-second-language tests in the United States, showing an overwhelmingly positive student reaction to the tests and higher effectiveness than conventional paper-and-pencil tests. (Author/CB)
Descriptors: Anxiety, Computer Assisted Testing, English (Second Language), Language Tests
Winston, Roger B., Jr.; Polkosnik, Mark C. – Journal of College Student Personnel, 1986
Summarizes reliability and validity studies reported about the Student Developmental Task Inventory, second edition (SDTI-2), an objective assessment instrument based on Chickering's theory of psychosocial development described in Education and Identity. Outlines other findings related to differences in psychosocial development. (Author/ABB)
Descriptors: College Students, Developmental Tasks, Higher Education, Self Concept
[Peer reviewed] Caldwell, JoAnne – Journal of Reading, 1987
Concludes that the test has basic problems in construction, interpretation, validity, and reliability. (FL)
Descriptors: Cognitive Style, Individual Testing, Reading Instruction, Reading Tests
[Peer reviewed] Nevo, Baruch – Journal of Educational Measurement, 1985
A literature review and a proposed means of measuring face validity, a test's appearance of being valid, are presented. Empirical evidence from examinees' perceptions of a college entrance examination supports the reliability of measuring face validity. (GDC)
Descriptors: College Entrance Examinations, Evaluation Methods, Evaluators, Foreign Countries
[Peer reviewed] Sims, Ronald R. – Educational and Psychological Measurement, 1986
The Learning Style Inventory (LSI) and the newly revised Learning Style Inventory (LSI II) were examined for internal consistency, test-retest reliability, and stability of the four classifications resulting from their scores. Internal consistency was improved in LSI II, but problems with low test-retest indices and classifications stability…
Descriptors: Cognitive Measurement, Cognitive Style, College Students, Higher Education
[Peer reviewed] Danielson, Kathy Everts – Reading Horizons, 1987
Lists the dangers of readability formulas and concludes that while they may be a necessary evil, they should not be used as the underlying structure of a reading program. (FL)
Descriptors: Elementary Education, Readability Formulas, Reading Comprehension, Reading Instruction
[Peer reviewed] Silverman, William H.; And Others – Personnel Psychology, 1986
Examined how assessment center methods affect the way assessors organize and process assessment center information and affect the ratings they make. Results suggested that methods for evaluating assessment center candidates affected the way the assessors organized the assessment center information and affected the obtained ratings. (Author/ABB)
Descriptors: Assessment Centers (Personnel), Cognitive Processes, Evaluation Methods, Evaluators
[Peer reviewed] Adkins, Meredith C.; Lucas, Kathleen C. – Electronic Library, 1986
This checklist provides guidance on how to measure adequacy of control and security of computer systems, identifies areas of risk, raises management and user awareness of their stewardship responsibilities, and reviews issues relative to system documentation, maintenance, integrity, and reliability. (MBR)
Descriptors: Check Lists, Computer Software, Cost Effectiveness, Integrity
[Peer reviewed] Zimmerman, Mark; Coryell, William – Journal of Consulting and Clinical Psychology, 1987
Assessed the Inventory to Diagnose Depression (IDD) by having 398 relatives of psychiatric patients and normal controls complete the IDD and then interviewing them with the Diagnostic Interview Scale (DIS). Found the point prevalence of major depressive disorder (MDD) to be nearly identical according to the IDD (3.0 percent) and the DIS (2.8…
Descriptors: Clinical Diagnosis, Depression (Psychology), Diagnostic Tests, Measures (Individuals)