Frary, Robert B. – Applied Measurement in Education, 1991
The use of the "none-of-the-above" option (NOTA) in 20 college-level multiple-choice tests was evaluated for classes with 100 or more students. Eight academic disciplines were represented, and 295 NOTA and 724 regular test items were used. It appears that the NOTA can be compatible with good classroom measurement. (TJH)
Descriptors: College Students, Comparative Testing, Difficulty Level, Discriminant Analysis
Assessing the Effects of Computer Administration on Scores and Parameter Estimates Using IRT Models.
Sykes, Robert C.; And Others – 1991
To investigate the psychometric feasibility of replacing a paper-and-pencil licensing examination with a computer-administered test, a validity study was conducted. The computer-administered test (Cadm) was a common set of items for all test takers, distinct from computerized adaptive testing, in which test takers receive items appropriate to…
Descriptors: Adults, Certification, Comparative Testing, Computer Assisted Testing

Friedman, Stephen J.; Ansley, Timothy N. – Journal of Experimental Education, 1990
To investigate the relationship between reading and listening test scores, 3 different sets of listening items accompanied by answer sheets requiring varying amounts of reading were administered to 1,200 students in grades 3 through 8. Listening scores increased as more printed information was added to the answer sheet. (SLD)
Descriptors: Answer Sheets, Comparative Testing, Elementary Education, Elementary School Students
Mazzeo, John; And Others – 1991
Two studies investigated the comparability of scores from paper-and-pencil and computer-administered versions of the College-Level Examination Program (CLEP) General Examinations in mathematics and English composition. The first study used a prototype computer-administered version of each examination for 94 students for mathematics and 116 for…
Descriptors: College Entrance Examinations, College Students, Comparative Testing, Computer Assisted Testing

Forsyth, Robert A.; And Others – Applied Measurement in Education, 1992
Eighth grade teachers in three local school districts helped customize two standardized norm-referenced tests for ninth graders to investigate effects of deleting some items and adding locally constructed items. Results indicate that percentile ranks for the customized tests could be very different from those for the complete test. (SLD)
Descriptors: Adaptive Testing, Comparative Testing, Elementary Secondary Education, Grade 9

Colliver, Jerry A.; And Others – Evaluation and the Health Professions, 1991
A study was conducted to assess the effect of station position in a multiple-stations performance-based examination administered to 127 senior medical students. There was no evidence for a sequence effect on student performance, with no improvement on scores awarded for standardized cases across successive cases in the examination. (SLD)
Descriptors: Clinical Experience, Comparative Testing, Higher Education, Licensing Examinations (Professions)
McManus, Barbara Luger – 1992
This paper discusses whether or not revisions of the Scholastic Aptitude Test (SAT) and the American College Test (ACT) have created such significant differences between the two tests that a student could conceivably score significantly higher on one than the other. The SAT has been revised to meet the needs of an increasingly diverse student…
Descriptors: Ability, Achievement Tests, Aptitude Tests, College Entrance Examinations
Wasserman, John D.; And Others – 1993
The original Bayley Scales of Infant Development (BSID) have been among the most popular measures of performance and aptitude of infants. In this study, the construct validity of scores on the Behavior Rating Scale of the revised Bayley Scales, the BSID-II, was investigated using national standardization and clinical samples of children ranging in…
Descriptors: Age Differences, Aptitude Tests, Behavior Rating Scales, Child Development
Semple, Brian McLean – 1992
The second International Assessment of Educational Progress focused on the mathematics and science achievement of 13-year-olds. Performance assessments were used as part of the overall assessment in four countries (England, Scotland, Soviet Union, and Taiwan) and five Canadian provinces. The performance assessment approach drew heavily on the…
Descriptors: Academic Achievement, Adolescents, Comparative Testing, Educational Assessment
Anderson, Paul S.; Kanzler, Eileen M. – 1985
Test scores were compared for two types of objective achievement tests--multiple choice tests and the recently developed Multi-Digit Test (MDT) procedure. MDT is an approximation of the fill-in-the-blank technique. Students select their answers from long lists of alphabetized terms, with each answer corresponding to a number from 001 to 999. The…
Descriptors: Achievement Tests, Cloze Procedure, Comparative Testing, Computer Assisted Testing

Wise, Steven L.; And Others – Journal of Educational Measurement, 1992
Performance of 156 undergraduate and 48 graduate students on a self-adapted test (SFAT)--students choose the difficulty level of their test items--was compared with performance on a computer-adapted test (CAT). Those taking the SFAT obtained higher ability scores and reported lower posttest state anxiety than did CAT takers. (SLD)
Descriptors: Adaptive Testing, Comparative Testing, Computer Assisted Testing, Difficulty Level

Armstrong, Anne-Marie – Educational Measurement: Issues and Practice, 1993
The effects on test performance of differentially written multiple-choice tests and test takers' cognitive style were studied for 47 graduate students and 35 public school and college teachers. Adhering to item-writing guidelines resulted in mean scores that were basically the same for two groups of differing cognitive style. (SLD)
Descriptors: Cognitive Style, College Faculty, Comparative Testing, Graduate Students
Des Brisay, Margaret – TESL Canada Journal, 1994
Data from the Canadian Test of English for Scholars and Trainees (CanTEST) are compared to data from the Test of English as a Foreign Language (TOEFL) to establish CanTEST as a valid admissions tool for English-as-a-Second Language college applicants. Data are taken from four groups of examinees who took both tests. (eight references) (LR)
Descriptors: Admission Criteria, Comparative Analysis, Comparative Testing, Correlation