Publication Date
| In 2026 | 3 |
| Since 2025 | 675 |
| Since 2022 (last 5 years) | 3176 |
| Since 2017 (last 10 years) | 7417 |
| Since 2007 (last 20 years) | 15055 |
Descriptor
| Test Reliability | 15043 |
| Test Validity | 10279 |
| Reliability | 9761 |
| Foreign Countries | 7144 |
| Test Construction | 4825 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3526 |
| Interrater Reliability | 3124 |
| Correlation | 3040 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1328 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 253 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 217 |
| California | 215 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Riley, Thomas; Davani, Holly; Chason, Pat; Findley, Ken; Druyor, Dale – Educational Technology, 2003
Discusses level 3 evaluation from Donald Kirkpatrick's Four Level Evaluation Model (level: 1-reaction; 2-learning; 3-behavior; 4-results) to measure training program performance. Highlights include an evaluation planning matrix; increasing evaluation data accuracy and reliability; data collection method selection; designing data collection…
Descriptors: Data Collection, Evaluation Methods, Interviews, Matrices
Peer reviewedCreed, Peter A.; Patton, Wendy; Watson, Mark B. – Journal of Career Assessment, 2002
The Career Decision-Making Self-Efficacy Scale-Short Form was completed by 416 South African and 563 Australian high school students. Analysis found three factors in each sample, but the factors differed between samples and from those of a U.S. sample, suggesting cultural differences in self-efficacy and lack of cultural equivalence for the…
Descriptors: Career Choice, Cultural Relevance, Decision Making, Foreign Countries
Peer reviewedMacDonald, Colla J.; Breithaupt, Krista; Stodel, Emma J.; Farres, Laura G.; Gabriel, Martha A. – International Journal of Testing, 2002
Developed and tested an online survey to assess Web-based learning (WBL) educational programs, extending theoretical work on the Demand Driven Learning Model. Data from 93 adult learners from 3 WBL programs found high internal reliability and adequate construct validity for the 5 scales of the online measure. (SLD)
Descriptors: Adult Education, Adult Students, Distance Education, Educational Demand
Peer reviewedHegarty, Mary; Richardson, Anthony E.; Montello, Daniel R.; Lovelace, Kristin; Subbiah, Ilavanil – Intelligence, 2002
Developed a standardized self-report scale of environmental spatial ability, the Santa Barbara Sense of Direction Scale and evaluated it in six studies with 544 college students. Results supported the reliability of the scale and suggested that the scale is related to tasks that require one to update location in space as a result of self-motion.…
Descriptors: College Students, Higher Education, Measures (Individuals), Reliability
Peer reviewedHolbert, R. Lance; Stephenson, Michael T. – Human Communication Research, 2002
Notes that structural equation modeling (SEM) is a viable multivariate tool used by communication researchers for the past quarter century. Summarizes the use of this technique from 1995-2000 in 37 communication-based academic journals. Identifies and critically assesses 3 unique methods for testing structural relationships via SEM in terms of the…
Descriptors: Communication (Thought Transfer), Communication Research, Higher Education, Multivariate Analysis
Peer reviewedWetherby, Amy M.; Allen, Lori; Cleary, Julie; Kublin, Kary; Goldstein, Howard – Journal of Speech, Language, and Hearing Research, 2002
Three studies with approximately 600 children were conducted to evaluate the validity and reliability of the three measures of the Communication and Symbolic Behavior Scales Developmental Profile. Findings support the use of the profile as a screening and evaluation tool for identifying children with developmental delays at 12 to 24 months of age.…
Descriptors: Developmental Delays, Disability Identification, Early Childhood Education, Infants
Peer reviewedHanson, Bradley A.; Brennan, Robert L. – Journal of Educational Measurement, 1990
Using several data sets, the relative performance of the beta binomial model and two more general strong true score models in estimating several indices of classification consistency is examined. It appears that the beta binomial model can provide inadequate fits to raw score distributions compared to more general models. (TJH)
Descriptors: Classification, Comparative Analysis, Equations (Mathematics), Estimation (Mathematics)
Peer reviewedMcCloskey, George – Topics in Early Childhood Special Education, 1990
The article examines issues related to establishing a rationale for the use of early childhood rating scales and to selecting a rating scale using such criteria as user needs, user friendliness, scale format, score interpretation, reliability, and validity. Methods of establishing the validity of a rating scale are suggested. (Author/DB)
Descriptors: Behavior Problems, Behavior Rating Scales, Disabilities, Early Childhood Education
Peer reviewedHamilton, J. S.; McLone, R. R. – Studies in Educational Evaluation, 1989
Influences on the educational validity of examinations are reviewed. Changes occurring in approaches to standard setting are traced. A view of reliability is presented, with emphasis on assessment of project work, which often involves individual investigation and design by students. A consistency index formula for grading standards is presented.…
Descriptors: Cutting Scores, Educational Assessment, Elementary Secondary Education, Standard Setting (Scoring)
Peer reviewedCarver, Ronald P. – Journal of Reading Behavior, 1990
Argues that the original Degrees of Reading Power (DRP) test scores indicated large mismatches between the average difficulty of the textbooks used in each grade and the average ability of students in that grade. Presents a rescaling procedure to provide new, valid DRP test scores and grade equivalent scores for selecting instructional materials.…
Descriptors: Elementary Secondary Education, Instructional Materials, Reading Research, Test Reliability
Peer reviewedParsons, Nancy K.; And Others – Journal of Counseling Psychology, 1990
Developed and administered Reasoning about Abortion Questionnaire (RAQ) to measure how persons view abortions. Pilot tested the RAQ on 134 college students and modified scale on basis of data. Administered revised RAQ to college students (N=230) replicating factor pattern and obtaining evidence for validity of polarity scores through structured…
Descriptors: Abortions, Attitude Measures, College Students, Higher Education
Peer reviewedJanikowski, Timothy P.; And Others – Rehabilitation Counseling Bulletin, 1989
Describes development of computer-based case simulation designed to assess skill in predicting client behavior and conceptual ability, resolution of informational dissonance, and experiential learning. Reviews theoretical basis for simulation. Summarizes preliminary investigation of simulation's reliability and validity. Discusses future research.…
Descriptors: Computer Simulation, Counselor Performance, Counselor Qualifications, Counselor Training
Peer reviewedSackett, Paul R.; And Others – Personnel Psychology, 1989
Reviews recent developments in the use of commercially available written integrity tests for employee selection. Discusses legal issues related to the use of the polygraph and integrity tests, and reviews empirical research on the reliability, criterion-related validity, construct validity, fakeability, and adverse effects of integrity tests. (TE)
Descriptors: Ethics, Integrity, Occupational Tests, Personnel Evaluation
Peer reviewedWesterman, Gary H.; And Others – Journal of Dental Education, 1989
A study in four first-year dental school classes found that while the Myers-Briggs Type Indicator is a good measure of dental students' personality traits, it is not a useful measure for predicting first-year achievement. (MSE)
Descriptors: Academic Achievement, Dental Students, Higher Education, Perception
Peer reviewedKlein, Howard A. – Reading Improvement, 1989
Examines whether using a combined silent reading-listening mode to administer the "Social Studies Inference Test" optimized information gathering. Finds that the combined modality produced more correct inferences than did silent reading alone. Finds only one gender difference--girls'"caution score" was higher than that for…
Descriptors: Data Collection, Educational Testing, Grade 6, Intermediate Grades


