Publication Date
| In 2026 | 0 |
| Since 2025 | 49 |
| Since 2022 (last 5 years) | 211 |
| Since 2017 (last 10 years) | 492 |
| Since 2007 (last 20 years) | 984 |
Descriptor
| Test Validity | 3908 |
| Test Reliability | 1517 |
| Testing | 1090 |
| Test Construction | 1014 |
| Testing Problems | 1008 |
| Computer Assisted Testing | 616 |
| Elementary Secondary Education | 553 |
| Foreign Countries | 494 |
| Higher Education | 490 |
| Standardized Tests | 488 |
| Test Interpretation | 433 |
Author
| Ebel, Robert L. | 16 |
| Hambleton, Ronald K. | 13 |
| Green, Donald Ross | 10 |
| Popham, W. James | 10 |
| Linn, Robert L. | 9 |
| Haney, Walt | 8 |
| Koretz, Daniel | 8 |
| Sireci, Stephen G. | 8 |
| Thompson, Bruce | 8 |
| Tindal, Gerald | 8 |
| Hilliard, Asa G., III | 7 |
Audience
| Practitioners | 137 |
| Researchers | 134 |
| Teachers | 51 |
| Administrators | 34 |
| Policymakers | 18 |
| Counselors | 11 |
| Students | 8 |
| Parents | 5 |
| Support Staff | 4 |
| Community | 2 |
Location
| Canada | 57 |
| Australia | 40 |
| California | 40 |
| China | 34 |
| United Kingdom (England) | 31 |
| United Kingdom | 29 |
| New York | 28 |
| United States | 26 |
| Florida | 22 |
| Germany | 21 |
| Turkey | 20 |
Smith, Richard M.; And Others – Journal of Dental Education, 1990 (Peer reviewed)
A study of inconsistent person-response patterns in the test with 1,000 students from each of 2 test administrations found a high incidence of atypical response patterns. The findings are seen as having serious implications for the admissions procedure, since atypical response patterns invalidate the standard scores reported for those students.…
Descriptors: College Entrance Examinations, Dental Schools, Higher Education, Logical Thinking
Gross, Alan L. – Psychometrika, 1990 (Peer reviewed)
A model is proposed for investigating test validity as a predictor of a criterion variable when there are both missing and censored scores in the data set. Implications for maximum likelihood estimation are discussed, and the method is illustrated with hypothetical data sets. (SLD)
Descriptors: Equations (Mathematics), Mathematical Models, Maximum Likelihood Statistics, Predictive Measurement
Allen, W. B. – Journal of Vocational Behavior, 1988 (Peer reviewed)
Draws on examples from college athletics, education, and personal experience to describe racial unfairness both of using scholastic tests where they should not be used and of not using them where they should be used. Suggests that greater consideration be given to reasons for administering or withholding tests and whether such action is…
Descriptors: Aptitude Tests, College Athletics, Higher Education, Occupational Tests
Kamil, Michael S.; Tierney, Robert J. – Illinois Schools Journal, 1988
In conjunction with testing mandates, some states have developed new measures intended to reflect changes in thinking about reading. Discusses, in dialogue form, whether these new measures support educational improvement or limit it. (BJV)
Descriptors: Educational Assessment, Educational Improvement, Reading Tests, Scores
Akande, Adebowale – Early Child Development and Care, 1994 (Peer reviewed)
Tested 21 low-functioning children with mental retardation to determine validity of the Motivation Assessment Scale (MAS). Found that the interrater measures within the MAS were essentially uncorrelated and of independent dimensions and that the MAS is not suitable for use with African primary school children. (HTH)
Descriptors: Elementary School Students, Foreign Countries, Interrater Reliability, Mental Retardation
Brambring, M.; Troster, H. – Journal of Visual Impairment and Blindness, 1994 (Peer reviewed)
This study evaluated the Bielefeld Developmental Test for Blind Infants and Preschoolers by comparing cognitive performance of blind and sighted children (ages three and four). Results indicated that even this test (with "blind-neutral" items) did not permit a fair comparative assessment, though it did prove suitable for within-group…
Descriptors: Blindness, Cognitive Development, Cognitive Tests, Infants
The Constant Danger of Sacrificing Validity to Reliability: Making Writing Assessment Serve Writers.
Wiggins, Grant – Assessing Writing, 1994 (Peer reviewed)
Suggests that assessment must be built into the curriculum and focused upon the kinds of skills students need. Considers much educational testing in writing to be reductionist, unrealistic, and detrimental to learning. Critiques writing assessment's trust and reliance on a single or small sample of student work collected and scored outside of a…
Descriptors: Elementary Secondary Education, Evaluation Methods, Reliability, Student Evaluation
Henning, Grant – System, 1992 (Peer reviewed)
Reports results of variety of validity analyses involving American Council on Teaching of Foreign Languages (ACTFL) Oral Proficiency Interview as it was administered to 59 learners of English and 60 learners of French. Concludes that ACTFL guidelines can be useful as an assessment tool and offer advantages that warrant serious consideration in the…
Descriptors: English (Second Language), French, Language Proficiency, Language Tests
Allison, David B.; And Others – Psychological Assessment, 1992 (Peer reviewed)
A subscale of the Three-Factor Eating Questionnaire (TFEQ), a subscale of the Dutch Eating Behavior Questionnaire, and the Revised Restraint Scale were compared with 901 undergraduates. The TFEQ had the greatest discriminant validity with respect to social desirability and was the least susceptible to dissimulation. (SLD)
Descriptors: Comparative Testing, Dietetics, Eating Habits, Higher Education
McDonald, Joseph P. – Phi Delta Kappan, 1993
Describes several ways to view senior exhibits at an urban high school employing the Coalition of Essential Schools' "graduation by exhibition" assessment method. The coalition advocates a pedagogy combining a personalized, caring environment with a focus on student production. Judges must balance warm regard with cool, critical appraisal…
Descriptors: Exhibits, Graduation Requirements, High Schools, Performance Based Assessment
Tyler-Wood, Tandra; Carri, Louis – Roeper Review, 1991 (Peer reviewed)
This study examined the scores obtained by 21 elementary-level gifted students on 4 different intellectual measures--Stanford-Binet (LM), Stanford-Binet (Fourth Edition), Otis-Lennon School Abilities Test, and the Cognitive Abilities Test. Results showed that the population of gifted students identified will vary greatly depending upon which test…
Descriptors: Ability Identification, Cognitive Ability, Elementary Education, Evaluation Methods
Arcia, Emily; And Others – Journal of School Psychology, 1991 (Peer reviewed)
Explored validity of Neurobehavioral Evaluation System, a set of computerized tests, and examined validity of reaction time variability as index of sustained attention. Findings from 105 children showed children able to complete 4 of tests. Findings from subsample of 88 children showed test performance significantly associated with teacher ratings of…
Descriptors: Attention, Children, Computer Assisted Testing, Elementary Education
Black, Leora; Piercy, Fred P. – Journal of Marital and Family Therapy, 1991 (Peer reviewed)
Reports on development and psychometric properties of Feminist Family Therapy Scale (FFTS), a 17-item instrument intended to reflect degree to which family therapists conceptualize process of family therapy from feminist-informed perspective. Found that the instrument discriminated between self-identified feminists and nonfeminists, women and men,…
Descriptors: Family Counseling, Feminism, Psychological Testing, Psychometrics
Glascoe, Frances Page; Byrne, Karen E. – Journal of Early Intervention, 1993 (Peer reviewed)
The accuracy of 3 developmental screening tests administered to 89 young children was compared. The Battelle Developmental Inventory Screening Test was more accurate than the Academic Scale of the Developmental Profile-II and the Denver-II, identifying correctly 72% of children with difficulties and 76% of children without diagnoses. (Author/JDD)
Descriptors: Child Development, Disabilities, Disability Identification, Early Identification
Hippisley, Jonathan; Douglas, Graham – British Journal of Educational Technology, 1998 (Peer reviewed)
A study of 331 elementary school children tested the reliability of a computer resident interactive arithmetic test and found high levels of reliability, using single sitting and parallel forms methods. The study also tried to determine the validity of the interactive test by comparing it with arithmetic subtests of the Key Math Test (KMT). (PEN)
Descriptors: Arithmetic, Computer Assisted Testing, Diagnostic Tests, Educational Technology


