Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 4 |
| Since 2017 (last 10 years) | 38 |
| Since 2007 (last 20 years) | 132 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Researchers | 12 |
| Practitioners | 10 |
| Community | 5 |
| Parents | 5 |
| Teachers | 3 |
| Policymakers | 2 |
Location
| Florida | 7 |
| United Kingdom | 6 |
| United Kingdom (England) | 6 |
| Australia | 5 |
| Canada | 5 |
| United States | 5 |
| Georgia | 3 |
| New York | 3 |
| North Carolina | 3 |
| Turkey | 3 |
| California | 2 |
| More ▼ | |
Laws, Policies, & Programs
| Elementary and Secondary… | 3 |
| No Child Left Behind Act 2001 | 3 |
| Education for All Handicapped… | 1 |
| Individuals with Disabilities… | 1 |
| Serrano v Priest | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Peer reviewedKoenke, Karl – Journal of Reading, 1971
Descriptors: Difficulty Level, Evaluation Methods, Measurement Instruments, Readability
Peer reviewedArkin, Robert M.; Walts, Elizabeth A. – Journal of Educational Psychology, 1983
The effects of corrective testing and how such feedback might affect high- and low-test-anxious students differently are indicated. Subjects were 286 college students in three classes--one using mastery testing and two using multiple choice tests. (Author/PN)
Descriptors: Attribution Theory, Feedback, Higher Education, Mastery Tests
Peer reviewedZughoul, Muhammad R.; Kambal, M. Osman – International Review of Applied Linguistics in Language Teaching, 1983
Based on the responses of 50 ESL instructors to a composition-scoring exercise, a detailed method of scoring compositions was developed that divides the writing into basic components (structure, content, vocabulary, organization, and mechanics) and provides a scoring mechanism for each component for each of three competency levels. (MSE)
Descriptors: English (Second Language), Evaluation Criteria, Evaluation Methods, Measurement Techniques
van der Linden, Wim J. – Evaluation in Education: International Progress, 1982
In mastery testing a linear relationship between an optimal passing score and test length is presented with a new optimization criterion. The usual indifference zone approach, a binomial error model, decision errors, and corrections for guessing are discussed. Related results in sequential testing and the latent class approach are included. (CM)
Descriptors: Cutting Scores, Educational Testing, Mastery Tests, Mathematical Models
Peer reviewedUpshur, John A.; Turner, Carolyn E. – ELT Journal, 1995
Reviews the place of rating scales in second-language measurement and summarizes some of the problems associated with them. Standard and alternative scales were studied. High agreement among raters can be achieved even under conditions not favorable to high interrater reliability. The full range of score categories are effectively utilized. (17…
Descriptors: Evaluation Problems, Interrater Reliability, Language Tests, Measurement Techniques
Multiple Choice and True/False Tests: Reliability Measures and Some Implications of Negative Marking
Burton, Richard F. – Assessment & Evaluation in Higher Education, 2004
The standard error of measurement usefully provides confidence limits for scores in a given test, but is it possible to quantify the reliability of a test with just a single number that allows comparison of tests of different format? Reliability coefficients do not do this, being dependent on the spread of examinee attainment. Better in this…
Descriptors: Multiple Choice Tests, Error of Measurement, Test Reliability, Test Items
Aghbar, Ali A.; Tang, Huixing – 1991
A study was undertaken to develop a partial credit scheme for scoring cloze-type questions on an English collocation test, obtain construct validity evidence for the test and the scoring scheme using the Rasch Partial Credit Model, and compare partial credit scoring with the more commonly used dichotomous scoring with the same test instrument.…
Descriptors: Cloze Procedure, College Students, English (Second Language), Language Tests
Alberta Dept. of Education, Edmonton. – 1989
This English Language Arts Achievement Test was designed to evaluate the writing skills of third grade students. It includes instructions in which students are asked to write their own stories after reading a "story starter." The test instructs students to use their imaginations to finish the story and encourages them to do a prewriting…
Descriptors: Achievement Tests, Descriptive Writing, Foreign Countries, Grade 3
Angoff, William H.; Schrader, William B. – 1982
In a study to determine whether a shift from Formula scoring to Rights scoring can be made without causing a discontinuity in the test scale, the analysis of special administrations of the Scholastic Aptitude Test and Chemistry Achievement Test and the variable section of an operational form of the Graduate Management Admission Test (GMAT) is…
Descriptors: Comparative Analysis, Equated Scores, Guessing (Tests), Higher Education
Cole, Nancy S. – 1982
The advantages and disadvantages of grade equivalent (GE) scores are explored, including appropriate uses for GE type scores and how to bring current GE scales closer to the type of information educators appear to desire. Although GE scores are not an equal interval scale, not comparable across school subjects, and do not indicate the grade level…
Descriptors: Academic Achievement, Elementary Secondary Education, Evaluation Methods, Formative Evaluation
Smith, Richard M. – 1982
There have been many attempts to formulate a procedure for extracting information from incorrect responses to multiple choice items, i.e., the assessment of partial knowledge. The results of these attempts can be described as inconsistent at best. It is hypothesized that these inconsistencies arise from three methodological problems: the…
Descriptors: Difficulty Level, Evaluation Methods, Goodness of Fit, Guessing (Tests)
Lowry, Stephen R. – 1977
The effects of luck and misinformation on ability of multiple-choice test scores to estimate examinee ability were investigated. Two measures of examinee ability were defined. Misinformation was shown to have little effect on ability of raw scores and a substantial effect on ability of corrected-for-guessing scores to estimate examinee ability.…
Descriptors: Ability, College Students, Guessing (Tests), Multiple Choice Tests
Peer reviewedJacobs, Stanley S. – Journal of Educational Measurement, 1975
Descriptors: Criterion Referenced Tests, Guessing (Tests), Multiple Choice Tests, Response Style (Tests)
Saunders, Joseph C.; Huynh, Huynh – 1980
In most reliability studies, the precision of a reliability estimate varies inversely with the number of examinees (sample size). Thus, to achieve a given level of accuracy, some minimum sample size is required. An approximation for this minimum size may be made if some reasonable assumptions regarding the mean and standard deviation of the test…
Descriptors: Cutting Scores, Difficulty Level, Error of Measurement, Mastery Tests
Cross, Lawrence H.; Frary, Robert B. – 1976
It has been demonstrated that corrected-for-guessing scores will be superior to number-right scores in providing estimates of examinee standing on the trait measured by a multiple-choice test, if it can be assumed that examinees can and will comply with the appropriate directions. The purpose of the present study was to test the validity of that…
Descriptors: Achievement Tests, Guessing (Tests), Individual Characteristics, Multiple Choice Tests

Direct link
