Publication Date
| In 2026 | 0 |
| Since 2025 | 49 |
| Since 2022 (last 5 years) | 211 |
| Since 2017 (last 10 years) | 492 |
| Since 2007 (last 20 years) | 984 |
Descriptor
| Test Validity | 3908 |
| Test Reliability | 1517 |
| Testing | 1090 |
| Test Construction | 1014 |
| Testing Problems | 1008 |
| Computer Assisted Testing | 616 |
| Elementary Secondary Education | 553 |
| Foreign Countries | 494 |
| Higher Education | 490 |
| Standardized Tests | 488 |
| Test Interpretation | 433 |
| More ▼ | |
Source
Author
| Ebel, Robert L. | 16 |
| Hambleton, Ronald K. | 13 |
| Green, Donald Ross | 10 |
| Popham, W. James | 10 |
| Linn, Robert L. | 9 |
| Haney, Walt | 8 |
| Koretz, Daniel | 8 |
| Sireci, Stephen G. | 8 |
| Thompson, Bruce | 8 |
| Tindal, Gerald | 8 |
| Hilliard, Asa G., III | 7 |
| More ▼ | |
Publication Type
Education Level
Audience
| Practitioners | 137 |
| Researchers | 134 |
| Teachers | 51 |
| Administrators | 34 |
| Policymakers | 18 |
| Counselors | 11 |
| Students | 8 |
| Parents | 5 |
| Support Staff | 4 |
| Community | 2 |
Location
| Canada | 57 |
| Australia | 40 |
| California | 40 |
| China | 34 |
| United Kingdom (England) | 31 |
| United Kingdom | 29 |
| New York | 28 |
| United States | 26 |
| Florida | 22 |
| Germany | 21 |
| Turkey | 20 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Herman, Joan L. – 1986
Issues in designing valid tests for the National Assessment of Educational Progress (NAEP) are discussed. Test scores are often provided without any information on the nature of the tasks represented. Because test domains are defined by individual item writers, the generalizability between tests and items is suspect. While typical content…
Descriptors: Achievement Tests, Content Validity, Criterion Referenced Tests, Educational Assessment
Osborn, William C.; Ford, J. Patrick – 1976
A synthetic performance test is a job performance test that has been degraded to some degree in the range of tasks covered or in the fidelity of stimulus/response features. Since further development is needed before synthetic performance testing is valid and efficient, this research project focused on three objectives: to (1) identify problems…
Descriptors: Adults, Group Testing, Intelligence, Job Analysis
Stratton, Julius A. – 1974
The relationship between the Instructional Process, Instructional Objectives, and Assessment Tasks, identified at the School City of Gary, Indiana, necessitate an effective testing program. Four characteristics perceived crucial to a sound program were: (1) The program should be continuous, (2) The testing program should be comprehensive, (3)…
Descriptors: Content Analysis, Elementary Secondary Education, Guides, Measurement Techniques
Peer reviewedYsseldyke, James E.; Marston, Douglas – School Psychology Review, 1982
When selecting standardized reading tests for purposes of decision making, the school psychologist must answer several questions, such as "What reading skills do I wish to assess?" or "How do I judge if the test is technically adequate?" Recommendations for test selection are made within the context of these questions.…
Descriptors: Criterion Referenced Tests, Elementary Education, Group Testing, Individual Testing
Peer reviewedSkakun, Ernest N.; And Others – Educational and Psychological Measurement, 1979
Factor analysis was used to determine whether computerized patient management problems had the same factor structure as multiple choice examinations and rating scales. It was determined that the factor structure was similar to the examinations but not the rating scale. (JKS)
Descriptors: Comparative Testing, Computer Assisted Testing, Computer Programs, Factor Structure
Green, Donald Ross – 1998
As the new version of the "Standards for Educational and Psychological Testing" is being developed, it is apparent that putting together a set of standards for test publishing involves many difficulties. Although the basic intent of almost all parties involved is similar, there are many potential areas of disagreement among parties to…
Descriptors: Definitions, Educational Testing, Elementary Secondary Education, Limited English Speaking
Peer reviewedWen, Shih-Sung – Journal of Educational Measurement, 1975
The relationship between students' scores on a verbal meaning test and their degrees of confidence in item responses was investigated. Subjects were black undergraduate students and they were administered a verbal meaning test by following a confidence testing procedure. (Author/BJG)
Descriptors: Blacks, Confidence Testing, Higher Education, Language Skills
Johnson, Jeffrey G.; Bornstein, Robert F. – 1989
The validity of the Personality Diagnostic Questionnaire-Revised (PDQ-R) was examined. The PDQ-R and the Crowne-Marlowe Social Desirability Scale (SD) were administered in the spring of 1989 to 45 undergraduates (26 females and 19 males) at Gettysburg College in Pennsylvania. One month later, the Hopkins Symptom Check List (SCL-90) was…
Descriptors: Comparative Testing, Correlation, Diagnostic Tests, Higher Education
Hambleton, Ronald K.; And Others – 1988
Tests to assess problem-solving ability being provided for the Air Force are described, and some details on the development and validation of these computer-administered diagnostic achievement tests are discussed. Three measurement approaches were employed: (1) sequential problem solving; (2) context-free assessment of fundamental skills and…
Descriptors: Achievement Tests, Aircraft Pilots, Computer Assisted Testing, Occupational Tests
ERIC Clearinghouse on Tests, Measurement, and Evaluation, Princeton, NJ. – 1985
This Digest overviews legal challenges in five areas of test use for decision-making in schools: ability tracking, placement in special education classes, test scores as college admissions criteria, test disclosure, and teacher competency testing. Cases illustrating these challenges are described and include: Hobson v. Hansen (1967), Moses v.…
Descriptors: Court Litigation, Educational Testing, Intelligence Tests, Legal Problems
Suhadolnik, Debra; Weiss, David J. – 1983
The present study was an attempt to alleviate some of the difficulties inherent in multiple-choice items by having examinees respond to multiple-choice items in a probabilistic manner. Using this format, examinees are able to respond to each alternative and to provide indications of any partial knowledge they may possess concerning the item. The…
Descriptors: Confidence Testing, Multiple Choice Tests, Probability, Response Style (Tests)
Office of Personnel Management, Washington, DC. – 1979
The stimulus for this colloquium was the convergence of several significant developments bearing on the construct validation of standardized tests and other assessment methods. Of these developments, some were fundamental to psychology as a science; others reflected socio-political pressures on measurement in education and employment. The ten…
Descriptors: Aptitude Tests, Educational Practices, Educational Testing, Employment Practices
Farr, Roger; Roser, Nancy – 1974
This article presents views of proponents and opponents to standardized tests, isolates the major weakness of testing--questionable validity--and offers several recommendations for the betterment of test development and use. Some major misuses of tests include the following: (a) tests are at times administered with no clear purpose; (b) test…
Descriptors: Accountability, Criterion Referenced Tests, Educational Assessment, Educational Testing
Chandler, Theodore A. – 1974
Many self-concept measures employ several different scales to which the subject responds in a set order at one sitting. This study examined the effects of different testing conditions on such scales. Bill's Index of Adjustment and Values was administered to 191 graduate students under two different sequences, and two time delay conditions. The…
Descriptors: Feedback, Graduate Students, Reaction Time, Self Concept
Koehler, Roger A. – 1974
A potentially valuable measure of overconfidence on probabilistic multiple-choice tests was evaluated. The measure of overconfidence was based on probabilistic responses to nonsense items embedded in a vocabulary test. The test was administered under both confidence response and conventional choice response directions to 208 undergraduate…
Descriptors: Confidence Testing, Guessing (Tests), Measurement Techniques, Multiple Choice Tests


