Peer reviewed: Stephens, M. Irene; Montgomery, Allen A. – Topics in Language Disorders, 1985
Six recently published language tests are examined in terms of theoretical models, choice of subtests, test format, reliability, norming population, and reporting scores. The tests are the Screening Test of Adolescent Language, the Word Test, the Test of Language Development-Intermediate, the Fullerton Language Tests for Adolescents, Clinical…
Descriptors: Elementary Secondary Education, Language Handicaps, Scoring, Standardized Tests
Peer reviewed: Replogle, William H.; Eicke, F. J. – Journal of School Psychology, 1985
Evaluated an automated analysis system for the Wechsler Intelligence Scale Revised. Results indicated significantly higher ratings for the automated analysis on an overall item and on items addressing Verbal-Performance discrepancies, relative weaknesses, and relative lack of irresponsible interpretation. These results support cautious use of the…
Descriptors: Automation, Data Processing, Evaluation Methods, Test Interpretation
Peer reviewed: Mullis, Ina V.S. – Educational Measurement: Issues and Practice, 1984
Scoring systems for direct writing assessment are described. In holistic scoring, a global quality judgment of the writing sample is made. Primary trait scoring, developed by the National Assessment of Educational Progress, is conducted in accordance with specific goals. Analytic scoring identifies characteristics and quality of writing. These…
Descriptors: Elementary Secondary Education, Essay Tests, Holistic Evaluation, Scoring
Peer reviewed: Aiken, Lewis R. – Educational and Psychological Measurement, 1983
A procedure and a computing diagram for assigning score boundaries to grading categories on classroom tests are described. The procedure takes both the median ability level of the class and the test performance of the class relative to that of other classes into account. (Author)
Descriptors: Grades (Scholastic), Grading, Scores, Scoring
Peer reviewed: McKenna, Michael – Journal of Reading, 1976
Descriptors: Cloze Procedure, Educational Research, Elementary Education, Measurement Techniques
Lippey, Gerald; Partos, Nathan – Educational Technology, 1976
This article describes some of the programming improvements made to a computer-based instructional support system after it was operating as had been envisioned during its design. Changes were made to accommodate differences between the original objectives and what users discovered they really wished to do. (Author/BD)
Descriptors: Computer Assisted Testing, Computer Programs, Item Banks, Test Construction
Gillmore, Gerald M.; Stallings, William M. – Improving College and University Teaching, 1976
Instead of shying away from problem-solving items, testing specialists should devise and offer guidelines for constructing and scoring them. (Editor/LBH)
Descriptors: Divergent Thinking, Higher Education, Problem Solving, Scoring
Howe, Roger; Scheaffer, Richard; Lindquist, Mary; Philip, Frank; Halbrook, Arthur – US Department of Education, 2004
This document contains the framework and a set of recommendations for the 2005 NAEP mathematics assessment. It includes descriptions of the mathematical content of the test, the types of test questions, and recommendations for administration of the test. In broad terms, this framework attempts to answer the question: What mathematics should be…
Descriptors: National Competency Tests, Student Evaluation, Mathematics Achievement, Test Items
Reese, Lynda M. – 1999
This study represented a first attempt to evaluate the impact of local item dependence (LID) for Item Response Theory (IRT) scoring in computerized adaptive testing (CAT). The most basic CAT design and a simplified design for simulating CAT item pools with varying degrees of LID were applied. A data generation method that allows the LID among…
Descriptors: College Entrance Examinations, Item Response Theory, Law Schools, Scoring
Allen, Sally; Sudweeks, Richard R. – 2001
A study was conducted to identify local item dependence (LID) in the context-dependent item sets used in an examination prepared for use in an introductory university physics class and to assess the effects of LID on estimates of the reliability and standard error of measurement. Test scores were obtained for 487 students in the physics class. The…
Descriptors: College Students, Error of Measurement, Higher Education, Physics
Bastick, Tony – 2002
This paper makes two criticisms of dichotomously scored instruments. One is that dichotomous scoring restrains the scores to ipsative measures that should not be compared, and the other is that dichotomous scoring ignores the strength with which a subject endorses a response so that the resulting count may imply a different construct measure from…
Descriptors: Adolescents, Foreign Countries, High School Students, High Schools
Mushi, Selina L. P. – 2003
This document is a scoring guide that presents the Teacher-candidates' Overall Program Portfolio Scoring: Systematic, Comprehensive and Hierarchical Evaluative Measures of Excellence (TOPPS SCHEME) approach to scoring teacher candidates' program portfolios. The guide facilitates systematic, comprehensive, and hierarchical means of measuring and…
Descriptors: Portfolio Assessment, Portfolios (Background Materials), Preservice Teachers, Scoring
Schafer, William D. – 2003
Three groups of persons are involved in the testing enterprise: test producers, test users, and test takers. A wide literature is available to guide the first two groups, but only recently have measurement professionals considered the interests of test takers in any careful way. The content of this chapter is presented as a set of 26…
Descriptors: Educational Assessment, Educational Testing, Evaluation Methods, Guidelines
Capa, Yesim; Loadman, William E. – 2003
The purpose of this study was to investigate the effect of a Rasch-based procedure to calibrate responses for funding applications. The data set included 112 proposals and 66 readers, who independently scored randomly assigned proposals using a scoring instrument. The data were analyzed using FACETS (Linacre, 1999). The analysis indicated that the…
Descriptors: Evaluators, Financial Support, Grants, Interrater Reliability
Wise, Steven L. – 1999
Outside of large-scale testing programs, the computerized adaptive test (CAT) has thus far had only limited impact on measurement practice. In smaller-scale testing contexts, limited data are often available, which precludes the establishment of calibrated item pools for use by traditional (i.e., item response theory (IRT) based) CATs. This paper…
Descriptors: Adaptive Testing, Computer Assisted Testing, Item Response Theory, Scores


