Publication Date
| In 2026 | 0 |
| Since 2025 | 215 |
| Since 2022 (last 5 years) | 1084 |
| Since 2017 (last 10 years) | 2594 |
| Since 2007 (last 20 years) | 4955 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 653 |
| Teachers | 563 |
| Researchers | 250 |
| Students | 201 |
| Administrators | 81 |
| Policymakers | 22 |
| Parents | 17 |
| Counselors | 8 |
| Community | 7 |
| Support Staff | 3 |
| Media Staff | 1 |
| More ▼ | |
Location
| Turkey | 226 |
| Canada | 223 |
| Australia | 155 |
| Germany | 116 |
| United States | 99 |
| China | 90 |
| Florida | 86 |
| Indonesia | 82 |
| Taiwan | 78 |
| United Kingdom | 73 |
| California | 66 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 4 |
| Meets WWC Standards with or without Reservations | 4 |
| Does not meet standards | 1 |
Wright, Benjamin D. – 1998
This videotape contains a series of presentations by Benjamin D. Wright of the University of Chicago about the Rasch model and measurement in testing. The first presentation is excerpts from a workshop held in 1974 for educators from the Portland (Oregon) Public Schools and the Oregon Department of Education. Dr. Wright's lectures were part of a…
Descriptors: Educational Testing, Elementary Secondary Education, Estimation (Mathematics), Higher Education
Kentucky State Dept. of Education, Frankfort. – 1999
In 1998, the Kentucky General Assembly called for a new testing system known as the Commonwealth Accountability Testing System (CATS) to determine how much children are learning in school. This brochure tells the basics about Kentucky's system of public education. The "Core Content" document describes what students should know and be…
Descriptors: Academic Achievement, Accountability, Achievement Tests, Educational Assessment
Pennsylvania State Dept. of Education, Harrisburg. Bureau of Curriculum and Academic Services. – 2000
Beginning in 1999, all of the Pennsylvania System of School Assessment (PSSA) had to be aligned with the Pennsylvania Academic Standards. This handbook describes the PSSA reading assessment. It contains samples and instructions for developing assessment items for grades 5, 8, and 11. Although designed for teachers, the Handbook is meant to be a…
Descriptors: Achievement Tests, Elementary Secondary Education, Guides, Reading Achievement
Takala, Sauli – 1998
This paper discusses recent developments in language testing. It begins with a review of the traditional criteria that are applied to all measurement and outlines recent emphases that derive from the expanding range of stakeholders. Drawing on Alderson's seminal work, criteria are presented for evaluating communicative language tests. Developments…
Descriptors: Alternative Assessment, Communicative Competence (Languages), Comparative Analysis, Evaluation Criteria
Delaware State Dept. of Education, Dover. – 2002
The Delaware Student Testing Program (DSTP) is designed to assess progress toward the Delaware Content Standards. Every year a certain number of items are removed from the test and then selected for public release. This booklet contains items released from the 1998 administration of the DSTP mathematics component tests. Taken as a whole, these…
Descriptors: Academic Standards, Elementary Secondary Education, Information Dissemination, Mathematics Tests
Sykes, Robert C.; Heidorn, Mark; Lee, Guemin – 1999
A study was conducted to evaluate the effect of different modes (modalities) of assigning raters to test items. The impact on total constructed response (c.r.) score, and subsequently on total test score, of assigning a single versus multiple raters to an examination reading of a student's set of c.r. responses was evaluated for several mixed-item…
Descriptors: Constructed Response, Elementary School Students, Elementary Secondary Education, Evaluators
Colvin, Stephen S. – Bureau of Education, Department of the Interior, 1924
A decade ago intelligence testing was in its beginnings in the United States. There were no standardized tests available except those of the Binet-Simon scale. These tests had been used but little, and chiefly for the detection and classification of the backward and the feeble-minded. Goddard had just begun pioneer work in this field, while…
Descriptors: Intelligence Tests, Intelligence, Performance Tests, Testing
Peer reviewedKolstad, Rosemarie K.; And Others – Journal of Research and Development in Education, 1983
A study compared college students' performance on complex multiple-choice tests with scores on multiple true-false clusters. Researchers concluded that the multiple-choice tests did not accurately measure students' knowledge and that cueing and guessing led to grade inflation. (PP)
Descriptors: Achievement Tests, Difficulty Level, Guessing (Tests), Higher Education
Peer reviewedLukmani, Yasmeen – ELT Journal, 1982
The approach taken to testing reading comprehension in a Bombay University English program is described. A distinction is drawn between communicative and communicational teaching approaches. Reading skills and the traditional techniques for teaching them are examined, and sample reading comprehension test items using the communicational approach…
Descriptors: Classification, Communicative Competence (Languages), English (Second Language), Foreign Countries
Peer reviewedMeredith, Gerald M. – Perceptual and Motor Skills, 1983
Two brief scales were proposed to assess effectiveness of teaching in laboratory and seminar/discussion group classes. (Author)
Descriptors: College Students, Course Evaluation, Discussion Groups, Factor Analysis
Peer reviewedSecolsky, Charles – Journal of Educational Measurement, 1983
A model is presented using examinee judgements in detecting ambiguous/misinterpreted items on teacher-made criterion-referenced tests. A computational example and guidelines for constructing domain categories and interpreting the indices are presented. (Author/PN)
Descriptors: Criterion Referenced Tests, Higher Education, Item Analysis, Mathematical Models
Peer reviewedSandoval, Jonathan; And Others – Journal of School Psychology, 1983
Item difficulty patterns of four groups of nonreferred, average children (ages 7 1/2 and l0 1/2)--Anglos, Blacks, Chicanos and Bermudians--were compared on each of the verbal subtests of the Wechsler Intelligence Scale for Children-Revised. Item difficulty curves were remarkably parallel. (Author/HLM)
Descriptors: Anglo Americans, Black Youth, Cultural Differences, Difficulty Level
Peer reviewedHuck, Schuyler W.; And Others – Educational and Psychological Measurement, 1981
Believing that examinee-by-item interaction should be conceptualized as true score variability rather than as a result of errors of measurement, Lu proposed a modification of Hoyt's analysis of variance reliability procedure. Via a computer simulation study, it is shown that Lu's approach does not separate interaction from error. (Author/RL)
Descriptors: Analysis of Variance, Comparative Analysis, Computer Programs, Difficulty Level
Peer reviewedGhatala, Elizabeth S.; And Others – Contemporary Educational Psychology, 1981
Two experiments on multiple-choice assessment of students learning from sentences were conducted. Interference arising from intersentence similarity was a function of both the kind of learning strategies students were instructed to employ and the kind of strategies they reported having employed spontaneously. Implications for test construction and…
Descriptors: Cognitive Processes, Context Clues, Control Groups, Elementary Secondary Education
Peer reviewedDonlon, Thomas F.; And Others – Applied Psychological Measurement, 1980
The scope and nature of sex differences in the Graduate Record Examination are explored by identifying individual test items that differ from the other items in terms of the magnitude of the difference in item difficulty for the sexes. In general, limited evidence of differences was established. (Author/CTM)
Descriptors: Aptitude Tests, College Entrance Examinations, Graduate Students, Higher Education


