Anderson, Paul S. – 1987
A recent innovation in the area of educational measurement is MDT multi-digit testing, a machine-scored near-equivalent to "fill-in-the-blank" testing. The MDT method is based on long lists (or "Answer Banks") that contain up to 1,000 discrete answers, each with a three-digit label. Students taking an MDT multi-digit test mark…
Descriptors: College Students, Computer Assisted Testing, Higher Education, Scoring
Slate, John R. – 1986
Studies have revealed significant problems in correctly scoring ambiguous verbal responses to test items on the Wechsler Intelligence Scale for Children-Revised (WISC-R). This study evaluated the effectiveness of an instructional design procedure developed to reduce examiner scoring errors on the WISC-R. Data concerning frequent sources of error…
Descriptors: Clinical Psychology, Error of Measurement, Graduate Students, Higher Education
Harnisch, Delwyn L.; And Others – 1987
The capabilities and hardware requirements of four microcomputer software packages produced by the Office of Educational Testing, Research and Service at the University of Illinois at Urbana-Champaign are described. These programs are: (1) the Scan-Tron Forms Analysis Package Version 2.0, an interface between an IBM-compatible and a Scan-Tron…
Descriptors: Authoring Aids (Programing), Computer Assisted Testing, Computer Software, Item Banks
Mislevy, Robert J. – 1988
Large-scale educational assessments differ from familiar educational measurements by attempting to provide information about the levels and natures of skills in populations rather than in individuals. That the distinct purposes of assessment require different methodologies than individual measurement was recognized by the development of…
Descriptors: Educational Assessment, Evaluation Methods, Item Analysis, Latent Trait Theory
New Jersey Basic Skills Council, Trenton. – 1983
The purpose of the essay portion of the New Jersey College Basic Skills Placement Test is to identify students whose English language skills are not sufficiently strong to ensure that they can successfully manage the writing required in regular freshman classes. Students are given 20 minutes to read and respond to the essay topic. The essays…
Descriptors: College Entrance Examinations, Essay Tests, Higher Education, Holistic Approach
Gray, James; And Others – 1982
Five studies of holistic writing assessment procedures examined interactive relationships of the participants, processes, and products of writing assessment episodes. The first study examined practices in designing writing test prompts. The second study investigated the effects of variation in the specification of audience in a writing test prompt…
Descriptors: Data Collection, Evaluators, Holistic Evaluation, Longitudinal Studies
Yen, Wendy M. – 1984
Two of the most popular methods for obtaining equal-interval scales for educational measurement are discussed: Thurstone's method and Item Response Theory (IRT). Between-grade growth on these scales is compared; while unstandardized differences show different trends for the two scales, standardized differences that take standard deviations into…
Descriptors: Academic Achievement, Achievement Tests, Educational Research, Latent Trait Theory
van der Linden, Wim J. – 1981
It has often been argued that all techniques of standard setting are arbitrary and likely to yield different results for different techniques or persons. This paper deals with a related but hitherto ignored aspect of standard setting, namely, the possibility that Angoff or Nedelsky judges misspecify the probabilities of the borderline student's…
Descriptors: Error of Measurement, Evaluators, Foreign Countries, Latent Trait Theory
Angoff, William H. – 1985
This paper points out that there are certain generalizations about directions for guessing and methods of scoring that require that data be derived from random groups design. It supports the viewpoint that it is neither sufficient nor appropriate to make such generalizations on the basis of an analysis of scores obtained from the answer sheets of…
Descriptors: Correlation, Guessing (Tests), Research Design, Scoring Formulas
Ellington, Henry – 1987
The second of three sequels to the booklet "Student Assessment," this booklet begins by describing and giving examples of three different forms that short-answer questions can take: (1) completion items; (2) unique-answer questions; and (3) open short-answer questions. Guidelines are then provided for deciding which type of question to…
Descriptors: Foreign Countries, Higher Education, Instructional Material Evaluation, Questioning Techniques
George Washington Univ., Washington, DC. Inst. for Educational Leadership. – 1980
The transcript of a six-part National Public Radio broadcast on standardized testing is presented. The first part focuses on the reasons tests are administered; these reasons are discussed by proponents and opponents of testing. Part Two contains a discussion of the possible bias of tests, and their validity. The third part discusses the…
Descriptors: College Entrance Examinations, Scoring, Standardized Tests, Student Attitudes
Hogan, Thomas P.; Mishler, Carol – 1982
This literature review summarizes what is currently known about the agreement among six measures of writing skills. Three of these methods involve the application of human judgment in scoring or rating a piece of writing: holistic, analytical, and primary trait scoring. Two methods involve anatomical or taxonomic analysis of a piece of writing:…
Descriptors: Comparative Testing, Criterion Referenced Tests, Measurement Techniques, Scoring
Plake, Barbara S.; And Others – 1983
Differential test performance by undergraduate males and females enrolled in a developmental educational psychology course (n=167) was reported on a quantitative examination as a function of item arrangement. Males were expected to perform better than females on tests whose items were arranged from easy to hard. Plake and Ansorge (1982) speculated this may…
Descriptors: Difficulty Level, Feedback, Higher Education, Scoring
Simner, Marvin L. – 1982
The Printing Performance School Readiness Test is an empirically derived instrument designed to aid in the early identification of preschool children who are at risk for school failure. The test is based on the outcome of a research program dealing with various aspects of children's printing that involved over 400 normal, non-repeating, native…
Descriptors: Guides, Handwriting Skills, Preschool Education, Preschool Tests
Gipps, G.; Ewen, E. – Educational Research, 1974 (peer reviewed)
Evaluated the use of the T-unit in the scoring of spoken and written work by children learning a second language. (RK)
Descriptors: Children, Educational Research, English (Second Language), Scoring


