Publication Date
| In 2026 | 3 |
| Since 2025 | 656 |
| Since 2022 (last 5 years) | 3157 |
| Since 2017 (last 10 years) | 7398 |
| Since 2007 (last 20 years) | 15036 |
Descriptor
| Test Reliability | 15028 |
| Test Validity | 10265 |
| Reliability | 9757 |
| Foreign Countries | 7137 |
| Test Construction | 4821 |
| Validity | 4191 |
| Measures (Individuals) | 3876 |
| Factor Analysis | 3822 |
| Psychometrics | 3520 |
| Interrater Reliability | 3124 |
| Correlation | 3039 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1326 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 251 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Peer reviewedBartel, Lee R. – Bulletin of the Council for Research in Music Education, 1994
Addresses the validity of the Cognitive-Affective Response Test-Music by clarifying the theoretical assertions on which the construct validity of the instrument rests. Presents nine specific assertions and data from two studies. Concludes that the test is valid for college-bound high school students. (CFR)
Descriptors: Academic Ability, Academic Aptitude, Grade 10, High Schools
Peer reviewedHuesmann, L. Rowell; And Others – Psychological Assessment, 1994
The Teacher Prediction of Peer-Nominated Aggression, an alternative assessment of aggressive disposition in children, was evaluated with 82 boys and 92 girls in grades 2 through 5. The measure is highly reliable and valid and is a better measure than teacher checklist ratings of aggression. (SLD)
Descriptors: Aggression, Check Lists, Children, Elementary Education
Peer reviewedSalend, Spencer J. – Intervention in School and Clinic, 1995
Test modifications and techniques that teachers can employ to adapt their tests to meet individualized needs of mainstreamed students with disabilities are considered. Suggestions are offered to assist special education teachers in helping general educators design tests; address test reliability, validity, content, and format; and develop…
Descriptors: Disabilities, Elementary Secondary Education, Evaluation Methods, Learning Problems
Peer reviewedLarose, Simon; Roy, Roland – Journal of Educational Psychology, 1995
The Test of Reactions and Adaptation in College (TRAC), a measure of students' personal dispositions toward learning, is presented. Results from 2 different samples of college students (n=1,081) indicate acceptable psychometric characteristics for the TRAC. The TRAC predicted college success better than high school weighted academic average did.…
Descriptors: Academic Achievement, Behavior Patterns, Beliefs, College Students
Peer reviewedBain, Alan – Educational Psychology: An International Journal of Experimental Educational Psychology, 1991
Presents results of a study developing an informal social skills teacher rating scale for use with early adolescents. Reports that the scale includes skill identification and situational analysis. Provides information on identifying and ranking responses and checking a content validity and reliability. Concludes that the matrix can generate…
Descriptors: Adolescents, Behavior Rating Scales, Informal Assessment, Interpersonal Competence
Peer reviewedGrych, John H.; And Others – Child Development, 1992
Reports on the development and testing of the Children's Perception of Interparental Conflict Scale. Three subscales were derived from factor analysis. The validity of the conflict subscale was supported by parent reports of conflict. The threat and self-blame subscales correlated with children's responses to conflict vignettes. (BC)
Descriptors: Adjustment (to Environment), Childhood Attitudes, Conflict, Elementary Education
Bracey, Gerald W. – Principal, 1993
Describes recent efforts of the Center for Research in Evaluation, Standards and Student Testing (CRESST) to evaluate authentic assessment methods, such as portfolios and performance tests. When comparing the merits of authentic versus multiple-choice testing, it is wise to consider validity, reliability, consequences, fairness, generalization,…
Descriptors: Cost Effectiveness, Efficiency, Elementary Education, Evaluation Criteria
Baird, Samera Major; Turner, Keith D. – Diagnostique, 1991
The Competence Enjoyment Scale (CES) was developed to measure parental feelings of competence and enjoyment as they interact with their infants. Mothers of 60 full-term healthy, preterm healthy, and preterm atypical infants were administered the CES to evaluate the instrument. Preliminary results support the reliability and validity of the scale.…
Descriptors: Affective Measures, Birth Weight, Competence, Disabilities
Peer reviewedWalker, Richard N. – Early Childhood Research Quarterly, 1992
A test battery that corresponded to the Gesell Developmental Assessment (GDA) was given to 400 4-6 year olds. A truncated version of the GDA had moderate reliability and predictive power. Experienced judges sometimes differed in their assessments of a child's developmental level and recommendations for grade placement. (GLR)
Descriptors: Age Differences, Early Childhood Education, Individual Development, Maturity (Individuals)
Peer reviewedBeller, Michal – Educational Measurement: Issues and Practice, 1994
A broad description is given of admissions procedures to Israeli universities. In Israel, a single unified test, the Psychometric Entrance Test, is used for admission to the various universities. Issues of validity and reliability and problems of ensuring fairness for non-Hebrew speakers are considered. (SLD)
Descriptors: Admission (School), College Bound Students, College Entrance Examinations, Equal Education
Peer reviewedGao, Xiaohong; And Others – Applied Measurement in Education, 1994
This study provides empirical evidence about the sampling variability and generalizability (reliability) of a statewide performance assessment for grade six. Results for 600 students at individual and school levels indicate that task-sampling variability was the major source of measurement error. Rater-sampling variability was negligible. (SLD)
Descriptors: Achievement Tests, Educational Assessment, Elementary School Students, Error of Measurement
Peer reviewedCornell, Dewey G.; And Others – Journal for the Education of the Gifted, 1994
Gifted (n=675) and regular education (n=322) students in grades two and three were compared for incidence of behavior problems as rated by parents and teachers. After controlling for grade and minority status, no significant differences were found between groups in incidence or type of behavior problems. Agreement between parent and teacher…
Descriptors: Behavior Problems, Behavior Rating Scales, Elementary School Students, Emotional Adjustment
Peer reviewedShavelson, R. J., Ed. – International Journal of Educational Research, 1994
The seven chapters of this issue focus on technical qualities of performance assessments and report the findings of systematic empirical inquiries. They add to the body of research that shows that performance assessment, although time consuming and expensive, merits the serious attention of educators. (SLD)
Descriptors: Educational Assessment, Elementary Secondary Education, Evaluation Methods, International Studies
Peer reviewedBejar, Isaac I.; Yocom, Peter – Applied Psychological Measurement, 1991
An approach to test modeling is illustrated that encompasses both response consistency and response difficulty. This generative approach makes validation an ongoing process. An analysis of hidden figure items with 60 high school students supports the feasibility of the method. (SLD)
Descriptors: Construct Validity, Difficulty Level, Evaluation Methods, High School Students
Peer reviewedSkaggs, Gary; Lissitz, Robert W. – Journal of Educational Measurement, 1992
The consistency of several item bias detection methods was studied across different test administrations of the same items using data from a mathematics test given to approximately 6,600 eighth grade students in all. The Mantel Haenszel and item-response-theory-based sum-of-squares methods were the most consistent. (SLD)
Descriptors: Comparative Testing, Grade 8, Item Bias, Item Response Theory


