Publication Date
| In 2026 | 0 |
| Since 2025 | 49 |
| Since 2022 (last 5 years) | 211 |
| Since 2017 (last 10 years) | 492 |
| Since 2007 (last 20 years) | 984 |
Descriptor
| Test Validity | 3908 |
| Test Reliability | 1517 |
| Testing | 1090 |
| Test Construction | 1014 |
| Testing Problems | 1008 |
| Computer Assisted Testing | 616 |
| Elementary Secondary Education | 553 |
| Foreign Countries | 494 |
| Higher Education | 490 |
| Standardized Tests | 488 |
| Test Interpretation | 433 |
| More ▼ | |
Source
Author
| Ebel, Robert L. | 16 |
| Hambleton, Ronald K. | 13 |
| Green, Donald Ross | 10 |
| Popham, W. James | 10 |
| Linn, Robert L. | 9 |
| Haney, Walt | 8 |
| Koretz, Daniel | 8 |
| Sireci, Stephen G. | 8 |
| Thompson, Bruce | 8 |
| Tindal, Gerald | 8 |
| Hilliard, Asa G., III | 7 |
| More ▼ | |
Publication Type
Education Level
Audience
| Practitioners | 137 |
| Researchers | 134 |
| Teachers | 51 |
| Administrators | 34 |
| Policymakers | 18 |
| Counselors | 11 |
| Students | 8 |
| Parents | 5 |
| Support Staff | 4 |
| Community | 2 |
Location
| Canada | 57 |
| Australia | 40 |
| California | 40 |
| China | 34 |
| United Kingdom (England) | 31 |
| United Kingdom | 29 |
| New York | 28 |
| United States | 26 |
| Florida | 22 |
| Germany | 21 |
| Turkey | 20 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Turner, Carol J.; Smith, Jeffrey K. – Measurement and Evaluation in Guidance, 1982
Used aggregate ratings of teacher behavior as data for a multitrait-multimethod validity analysis. Scaled ratings using Rasch latent trait scaling model and traditional scaling techniques. Compared Rasch-scaled multitrait-multimethod matrix to the traditionally scaled multitrait-multimethod matrix. Results showed Rasch scaling resulted in higher…
Descriptors: Children, Comparative Testing, Data Analysis, Elementary Education
Imrie, B. W. – Assessment in Higher Education, 1979
Three case studies are presented that describe variations used to obtain student perceptions of examinations and examination questions. Student perceptions can provide feedback about the quality of the examination. Results are presented and discussed and some examples of test evaluation questions are given. (Author/MLW)
Descriptors: Case Studies, Educational Testing, Feedback, Higher Education
Cervero, Ronald M. – Adult Education, 1980
Researchers reanalyzed the original Adult Performance Level Test survey data for the test's validity and reliability, and they concluded that (1) the test is not content valid because it assumes that functional competence can be logically defined and adult success accurately measured; and (2) the test is a valid measure of verbal, writing, and…
Descriptors: Adult Basic Education, Basic Skills, Factor Analysis, Functional Literacy
Peer reviewedHeneman, Herbert G., III – Personnel Psychology, 1980
There are many opportunities for research on how self-assessment affects behavior, especially external and internal mobility. Problems often occur in choice of ability dimensions and selection context. A firm theoretical base is necessary. (JAC)
Descriptors: Behavioral Science Research, Job Applicants, Occupational Mobility, Research Problems
Peer reviewedThornton, George C., III; Gierasch, Paul F., III – Journal of Personality Assessment, 1980
Ninety-four college males completed a management trainees' selection test that had been developed by criterion-keying. They were instructed once to answer honestly, and once to answer as a highly motivated job applicant would. "Faking" instructions resulted in significantly higher scores. (Author/GDC)
Descriptors: Higher Education, Males, Managerial Occupations, Motivation
Peer reviewedPage, Roger; Bode, James – Educational and Psychological Measurement, 1980
The Ethical Reasoning Inventory (ERI) is an objective test derived from Kohlberg's Moral Judgment Interview. It correlated higher with Kohlberg , and has higher internal consistency than the Defining Issues Test and the Moral Judgment Scale. (CP)
Descriptors: Abstract Reasoning, Higher Education, Item Analysis, Moral Issues
Peer reviewedBailey, Brenda S.; Richmond, Burt O. – Journal of School Psychology, 1979
Scores on the WISC-R and the AAMD Adaptive Behavior Scale, Part I, Public School Version, were obtained for elementary school children referred for psychological services. Some adaptive behavior scores differentiated among children classified as EMR, slow-learners, or average intelligence. (Author)
Descriptors: Academically Handicapped, Adjustment (to Environment), Behavior Patterns, Comparative Testing
Peer reviewedGross, Karen; Rothenberg, Stephen – Journal of Learning Disabilities, 1979
Two methodological problems often arising in dyslexia research are considered. The first problem concerns the validity of experimental measures and the related problem of interpreting null results. The second problem involves the effects of sampling from a disabled population if the disorder under investigation has multiple unknown origins.…
Descriptors: Cognitive Processes, Dyslexia, Hypothesis Testing, Learning Disabilities
Peer reviewedHoward, Francoise – Canadian Modern Language Review, 1980
Discusses the concept of communicative competence in second language learning and various models for testing this skill. (AM)
Descriptors: Communicative Competence (Languages), French, Language Instruction, Language Tests
Peer reviewedDiLalla, David L. – Assessment, 1996
A computer-administered (CA) form of the Multidimensional Personality Questionnaire (MPQ) (A. Tellegen, 1982) was created, and scores of 126 college students were compared to those of 101 who took a paper-and-pencil MPQ. Multivariate analyses found no group differences for the CA format, and likelihood of profile validity was decreased. Results…
Descriptors: College Students, Computer Assisted Testing, Higher Education, Multivariate Analysis
Peer reviewedChapelle, Carol A.; Jamieson, Joan; Hegelheimer, Volker – Language Testing, 2003
Presents the design and validation of an English-as-a-Second-Language (ESL) test for a commercial publisher. (Author/VWL)
Descriptors: English (Second Language), Language Tests, Second Language Instruction, Second Language Learning
Peer reviewedSouth, Mikle; Williams, Brenda J.; McMahon, William M.; Owley, Thomas; Filipek, Pauline A.; Shernoff, E.; Corsello, Christina; Lainhart, Janet E.; Landa, Rebecca; Ozonoff, Sally – Journal of Autism and Developmental Disorders, 2002
A study examined the validity of the Gilliam Autism Rating Scale (GARS) with 119 children (ages 3-10) with strict DSM-IV (Diagnostic Statistical Manual of Mental Disorders-IV) diagnoses of autism. The GARS consistently underestimated the likelihood that children would be classified as having autism. Limitations of ratings scales and of the GARS…
Descriptors: Autism, Behavior Rating Scales, Classification, Clinical Diagnosis
Peer reviewedVeloski, J. Jon; And Others – Evaluation and the Health Professions, 1990
Part III of the National Board Examination--a certifying examination of medical knowledge and patient management abilities--was assessed using 1,866 first-year residents. This 15-year study comparing Part III results with those of Parts I and II and with superiors' ratings indicates Part III's validity and provides a model for future research.…
Descriptors: Analysis of Covariance, Clinical Diagnosis, Computer Assisted Testing, Licensing Examinations (Professions)
Peer reviewedRobinson, Dale O. – Language, Speech, and Hearing Services in Schools, 1988
The study used 32 children (mean age 11 years) with moderate sensorineural hearing losses to examine whether Test of Auditory Comprehension of Language (TACL) scores were significantly affected by mode of presentation. No differences were found between auditory-only and auditory-visual modes of presentation. (Author/DB)
Descriptors: Auditory Perception, Children, Hearing Impairments, Language Tests
Peer reviewedStanley, Julian C.; Brody, Linda E. – Gifted Child Quarterly, 1989
This article responds to criticisms made in the Ebmeier and Schmulbach study (EC 221 845) of the Scholastic Aptitude Test as used by talent search programs such as the Center for the Advancement of Academically Talented Youth (CTY). The history of CTY's uses of cutoff scores and alternative interpretations of statistics are discussed. (PB)
Descriptors: Achievement Tests, Aptitude Tests, Gifted, Predictor Variables


