Glory Tobiason; Adrienne Lavine – Change: The Magazine of Higher Learning, 2025
Current methods for evaluating faculty teaching fall short, and one way to address this is through campus-wide initiatives that focus on change at the level of academic units. The complex context of higher education makes meaningful teaching evaluation difficult; in particular, four sobering realities of this context must be taken into account in…
Descriptors: Teacher Evaluation, Evaluation Methods, Testing Problems, Educational Change
Moy, Raymond H. – 1981
The problem of standard setting on language proficiency tests is often approached by the use of norms derived from the group being tested, a process commonly known as "grading on the curve." One particular problem with this ad hoc method of standard setting is that it will usually result in a fluctuating standard dependent on the particular group…
Descriptors: Cutting Scores, Higher Education, Language Proficiency, Norm Referenced Tests
Kingston, Neal M. – 1984
In October 1981, the Graduate Record Examinations (GRE) Program introduced a new version of the General Test (GT) that differed from the previous version in three major ways. The GT was altered to: reduce the verbal measure's speededness and allow the addition of several quantitative items; delete two item types from the analytical measure; and…
Descriptors: College Entrance Examinations, Equated Scores, Higher Education, Mathematics Tests
Peer reviewed: Rudner, Lawrence M.; Eissenberg, Thomas E. – Journal of Personnel Evaluation in Education, 1990
Standard-setting practices in states using the National Teachers Examination (NTE) were examined. Passing score-setting procedures, recommended study scores, passing scores established by the states, and the implications of the passing scores were studied. States have typically adopted standards lower than cut scores recommended by advisory…
Descriptors: Cutting Scores, Higher Education, Licensing Examinations (Professions), Minimum Competency Testing
Melican, Gerald; Plake, Barbara S. – 1984
The validity of combining a correction for guessing with the Nedelsky-based cutscore was investigated. A five option multiple choice Mathematics Achievement Test was used in the study. Items were selected to meet several criteria. These included: the capability of measuring mathematics concepts related to performance in introductory statistics;…
Descriptors: Cutting Scores, Guessing (Tests), Higher Education, Multiple Choice Tests
Angoff, William H.; Schrader, William B. – 1981
The purpose of this study was to determine whether it would be possible to equate rights-scored to formula-scored tests without causing a discontinuity in the meaning of the score scale. Several other subsidiary studies--of the characteristics of the two scoring methods, of nonresponse and guessing, and of reliability and parallelism--were also…
Descriptors: Academic Ability, College Entrance Examinations, Equated Scores, Guessing (Tests)
Larkin, Kevin C.; Weiss, David J. – 1975
A 15-stage pyramidal test and a 40-item two-stage test were constructed and administered by computer to 111 college undergraduates. The two-stage test was found to utilize a smaller proportion of its potential score range than the pyramidal test. Score distributions for both tests were positively skewed but not significantly different from the…
Descriptors: Ability, Aptitude Tests, Comparative Analysis, Computer Programs
Pike, Lewis W. – 1980
This study describes intergroup guessing differences in response to tests and to test-like tasks. It is a composite of seven component inquiries with three substudies in Phase 1 and four in Phase 2. These seven studies cover the Graduate Record Examination (GRE) item-type domain from a number of viewpoints relevant to implicit guessing behavior.…
Descriptors: Aptitude Tests, Black Students, College Entrance Examinations, Ethnic Groups
Peer reviewed: Sher, Lawrence – Two-Year College Mathematics Journal, 1977
A formula for converting raw test scores to refined, more meaningful scores is presented. Formula scores are easily computed. (SD)
Descriptors: College Mathematics, Educational Testing, Higher Education, Mathematics Education
Peer reviewed: Creaser, James W.; Jacobs, Mitchell – Journal of Counseling Psychology, 1987
Strong-Campbell Interest Inventory answer sheets for 300 male university freshmen were scored via both the 1981 and 1985 scoring systems. Communalities of the profiles generated by the two scoring systems indicated considerable profile variance. Counselors should thoroughly understand changes made in the new instrument. (Author/NB)
Descriptors: College Freshmen, Higher Education, Interest Inventories, Males
Flaherty, Jane F. – 1981
The history and institutional use of cutoff scores is reviewed, emphasizing the changes in these scores used over the years and the rationale used in selecting cutoff scores. A brief description of the College Level Examination Program (CLEP) is provided, as is background about how it works. Cutoff score recommendations of the American Council on…
Descriptors: Academic Standards, Cutting Scores, Educational Trends, Higher Education
Slate, John R. – 1986
Studies have revealed significant problems in correctly scoring ambiguous verbal responses to test items on the Wechsler Intelligence Scale for Children-Revised (WISC-R). This study evaluated the effectiveness of an instructional design procedure developed to reduce examiner scoring errors on the WISC-R. Data concerning frequent sources of error…
Descriptors: Clinical Psychology, Error of Measurement, Graduate Students, Higher Education
Plake, Barbara S.; And Others – 1983
Differential test performance by undergraduate males and females enrolled in a developmental educational psychology course (n=167) was reported on a quantitative examination as a function of item arrangement. Males were expected to perform better than females on tests whose items were arranged easy to hard. Plake and Ansorge (1982) speculated this may…
Descriptors: Difficulty Level, Feedback, Higher Education, Scoring
Frary, Robert B.; And Others – 1985
Students in an introductory college course (n=275) responded to equivalent 20-item halves of a test under number-right and formula-scoring instructions. Formula scores of those who omitted items averaged about one point lower than their comparable (formula adjusted) scores on the test half administered under number-right instructions. In contrast,…
Descriptors: Guessing (Tests), Higher Education, Multiple Choice Tests, Questionnaires
Kingston, Neal M. – 1985
This research investigated the effect on estimated lower asymptotes of the instructions to Graduate Record Examination (GRE) examinees about how the test would be scored. This effect was assessed for four different verbal item types (analogies, antonyms, sentence completion, and reading comprehension) using a two-way, unweighted means analysis of…
Descriptors: Analysis of Variance, College Entrance Examinations, Guessing (Tests), Higher Education

