Peer reviewed: Jaradat, Derar; Tollefson, Nona – Educational and Psychological Measurement, 1988
This study compared the reliability and validity indexes of randomly parallel tests administered under inclusion, exclusion, and correction for guessing directions, using 54 graduate students. It also compared the criterion-referenced grading decisions based on the different scoring methods. (TJH)
Descriptors: Criterion Referenced Tests, Grading, Graduate Students, Guessing (Tests)
Peer reviewed: Greenan, James P.; McCabe, Connie C. – Journal of Industrial Teacher Education, 1989
The authors developed, tested, and validated a set of student self-ratings, teacher ratings, and performance assessment instruments designed to measure generalizable reasoning skills of students enrolled in secondary vocational programs. They were found to be sufficiently reliable and valid indicators of functional learning strengths and…
Descriptors: Logical Thinking, Performance Based Assessment, Secondary Education, Self Evaluation (Individuals)
Peer reviewed: Roberts, Clare; Pratt, Chris – Australasian Journal of Special Education, 1988
The study evaluated the reliability and construct validity of the Attitude Toward Mainstreaming Scale (ATMS) in an Australian context. It was concluded that the scale is both reliable and factorially valid in that context. (Author/DB)
Descriptors: Attitude Measures, Cultural Differences, Elementary Secondary Education, Foreign Countries
Peer reviewed: Streufert, Siegfried; And Others – Personnel Psychology, 1988
Evaluated quasi-experimental simulation technique designed to measure impact of individual differences in managerial styles on executive performance. Tested 20 simulation-based measures for reliability and validity. Data from two samples suggest that this quasi-experimental simulation technology may be useful in assessing managerial styles not…
Descriptors: Administrator Qualifications, Competence, Evaluation Methods, Individual Differences
Peer reviewed: Szajna, Bernadette – Educational and Psychological Measurement, 1994
Predictive validities of computer aptitude and computer anxiety were studied using nonprogramming computer performance as the criterion variable for 162 young adults. Effects of computer anxiety on performance were negligible, and computer aptitude yielded uncertain results. The measurement instruments' reliability was also reported. (SLD)
Descriptors: Aptitude, Computer Anxiety, Computer Attitudes, Computer Literacy
Peer reviewed: May, Kim; Nicewander, W. Alan – Journal of Educational Measurement, 1994
Reliabilities and information functions for percentile ranks and number-right scores were compared using item response theory, modeling standardized achievement tests. Results demonstrate that situations exist in which the percentage of items known by examinees can be accurately estimated, but the percentage of persons falling below a given score…
Descriptors: Achievement Tests, Difficulty Level, Equations (Mathematics), Estimation (Mathematics)
Peer reviewed: Song, Li-yu; And Others – Psychological Assessment, 1994
Measurement fidelity (reliability, factor structure, and validity) of Achenbach's Youth Self-Report scale was studied with 226 adolescents at a psychiatric hospital. Findings confirm convergent validity and reliability of four of the measure's seven narrowband syndromes, and seven meaningful subdimensions were extracted from the other three…
Descriptors: Adolescents, Factor Analysis, Factor Structure, Measurement Techniques
Peer reviewed: Reckase, Mark D. – Educational Measurement: Issues and Practice, 1995
An example application of portfolio assessment was developed and the model and estimates of reliability derived from the literature were then used to estimate the characteristics of an operational large-scale portfolio assessment program. Costs were estimated to put results in a realistic context. (SLD)
Descriptors: Cost Estimates, Educational Assessment, Educational Theories, Models
Peer reviewed: Higbee, Katherine R.; Roberts, Robert E. – Hispanic Journal of Behavioral Sciences, 1994
Eight-item revision of the UCLA Loneliness Scale was administered to 2,614 students, aged 11-14. Loneliness did not differ by age or between Anglo- and Mexican-American students, but was higher for girls than boys in each ethnic group. Principal components factor analysis and correlations with other related measures indicate good reliability and…
Descriptors: Affective Measures, Anglo Americans, Early Adolescents, Loneliness
Peer reviewed: Stumpf, Steven H. – Evaluation and the Health Professions, 1994
A five-year curriculum evaluation project is described that treated students' course ratings, examination reliability coefficients, and item-discrimination data as a battery of data points for determining annual revision efforts. Histograms were constructed to make valid demonstrations of successful efforts immediately comprehensible to faculty.…
Descriptors: College Faculty, Comprehension, Curriculum Evaluation, Longitudinal Studies
Peer reviewed: Antonak, Richard F.; Larrivee, Barbara – Exceptional Children, 1995
Evidence supporting the use of a revision of the Opinions Relative to Mainstreaming scale, called Opinions Relative to Integration of Students with Disabilities, is presented. Scale testing with 376 professionals revealed satisfactory item characteristics, adequate reliability and homogeneity, and initial support for construct validity. The scale…
Descriptors: Attitude Measures, Disabilities, Elementary Secondary Education, Inclusive Schools
Peer reviewed: Allan, Alistair – Language Testing, 1992
The design of a valid and reliable test of test-wiseness is reported: a 33-item multiple-choice instrument with 4 subscales trialed with several groups of English-as-a-Second-Language students. Findings indicate differential skills in test-taking; some learner scores are influenced by skills that are not the focus of the test. (13 references)…
Descriptors: English (Second Language), Language Research, Language Tests, Multiple Choice Tests
Schouten, Peter G. W. – Diagnostique, 1992
The Miller Assessment for Preschoolers, intended for children ages 2-5, is designed as both a screening and a diagnostic assessment, measuring neuromaturational variables, coordination, language, memory, problem solving, and visual perception. This review examines test administration, summation of data, standardization, reliability, and validity,…
Descriptors: Diagnostic Tests, Educational Diagnosis, High Risk Students, Preschool Education
Peer reviewed: Pentony, Joseph F. – Educational and Psychological Measurement, 1992
The reliability and validity of E. D. Hirsch's (1988) Cultural Literacy Test (CLT) were studied with 150 first-year college students at the University of St. Thomas in Houston (Texas). The test appears reliable, with a split-half reliability estimate of 0.93, and the cultural literacy construct and the CLT are valid. (SLD)
Descriptors: College Freshmen, Concurrent Validity, Construct Validity, Correlation
Peer reviewed: Frisbie, David A.; Becker, Douglas F. – Applied Measurement in Education, 1990
Seventeen educational measurement textbooks were reviewed to analyze current perceptions regarding true-false achievement testing. A synthesis of the rules for item writing is presented, and the purported advantages and disadvantages of the true-false format derived from those texts are reviewed. (TJH)
Descriptors: Achievement Tests, Higher Education, Methods Courses, Objective Tests


