Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 7 |
| Since 2017 (last 10 years) | 11 |
| Since 2007 (last 20 years) | 32 |
Descriptor
Source
Author
| Badger, Elizabeth | 4 |
| Melancon, Janet G. | 3 |
| Thomas, Brenda | 3 |
| Thompson, Bruce | 3 |
| Anderson, Paul S. | 2 |
| Colliver, Jerry A. | 2 |
| Enger, John M. | 2 |
| Huntley, Renee M. | 2 |
| Lissitz, Robert W. | 2 |
| Lunz, Mary E. | 2 |
| Sykes, Robert C. | 2 |
| More ▼ | |
Publication Type
Education Level
| Higher Education | 17 |
| Postsecondary Education | 9 |
| Elementary Secondary Education | 6 |
| Elementary Education | 4 |
| High Schools | 4 |
| Secondary Education | 3 |
| Adult Education | 1 |
| Grade 3 | 1 |
| Grade 4 | 1 |
| Grade 5 | 1 |
| Grade 6 | 1 |
| More ▼ | |
Audience
| Researchers | 11 |
| Practitioners | 1 |
| Teachers | 1 |
Location
| Canada | 3 |
| United Kingdom | 3 |
| China | 2 |
| Czech Republic | 2 |
| Ireland | 2 |
| United States | 2 |
| California | 1 |
| India | 1 |
| Israel | 1 |
| Israel (Tel Aviv) | 1 |
| Jamaica | 1 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Peer reviewedBirenbaum, Menucha; And Others – Applied Psychological Measurement, 1992
The effect of multiple-choice (MC) or open-ended (OE) response format on diagnostic assessment of algebra test performance was investigated with 231 eighth and ninth graders in Tel Aviv (Israel) using bug or rule space analysis. Both analyses indicated closer similarity between parallel OE subsets than between stem-equivalent OE and MC subsets.…
Descriptors: Algebra, Comparative Testing, Educational Assessment, Educational Diagnosis
van der Linden, Wim J. – Applied Psychological Measurement, 2006
Two local methods for observed-score equating are applied to the problem of equating an adaptive test to a linear test. In an empirical study, the methods were evaluated against a method based on the test characteristic function (TCF) of the linear test and traditional equipercentile equating applied to the ability estimates on the adaptive test…
Descriptors: Adaptive Testing, Computer Assisted Testing, Test Format, Equated Scores
Siskind, Theresa G.; And Others – 1992
The instructional validity of computer administered tests was studied with a focus on whether differences in test scores and item behavior are a function of instructional mode (computer versus non-computer). In the first of 3 studies, performance test scores for approximately 400 high school students in 1990-91 for tasks accomplished with the…
Descriptors: Comparative Testing, Comprehension, Computer Assisted Instruction, Computer Assisted Testing
Melancon, Janet G.; Thompson, Bruce – 1989
Classical measurement theory was used to investigate the measurement (psychometric) characteristics of both parts of the Finding Embedded Figures Test (FEFT) administered in either a "no guessing" supply format or a multiple-choice selection format to undergraduate college students or to middle school students. Three issues were…
Descriptors: Comparative Testing, Construct Validity, Higher Education, Junior High School Students
Vispoel, Walter P.; Twing, Jon S. – 1989
The measurement precision, efficiency, and validity of an adaptive test and four conventional listening tests designed to assess musical ability were compared. The conventional tests were the Seashore Tonal Memory Test and three tests (peaked, rectangular, and maximum discrimination) constructed from items in the 278-item adaptive test pool. The…
Descriptors: Adaptive Testing, College Students, Comparative Testing, High School Students
Olsen, James B.; And Others – 1986
Student achievement test scores were compared and equated, using three different testing methods: paper-administered, computer-administered, and computerized adaptive testing. The tests were developed from third and sixth grade mathematics item banks of the California Assessment Program. The paper and the computer-administered tests were identical…
Descriptors: Achievement Tests, Adaptive Testing, Comparative Testing, Computer Assisted Testing
Moon, Russ – 1988
Since the emergence of the General Certificate of Secondary Education (GCSE) there have been calls for improved methods of assessing economics. Oral assessment has been suggested as a possible technique and this study investigated whether it might be used to allow students to demonstrate achievement in GCSE economics. The empirical study compared…
Descriptors: Achievement Tests, Comparative Analysis, Comparative Testing, Economics Education
Peer reviewedKinicki, Angelo J.; And Others – Educational and Psychological Measurement, 1985
Using both the Behaviorally Anchored Rating Scales (BARS) and the Purdue University Scales, 727 undergraduates rated 32 instructors. The BARS had less halo effect, more leniency error, and lower interrater reliability. Both formats were valid. The two tests did not differ in rate discrimination or susceptibility to rating bias. (Author/GDC)
Descriptors: Behavior Rating Scales, College Faculty, Comparative Testing, Higher Education
Peer reviewedHarris, Deborah J. – Applied Psychological Measurement, 1991
Effects of passage and item-scrambling on equipercentile and item-response theory equating were investigated using 2 scrambled versions of the American College Testing Program Assessment for approximately 25,000 examinees. Results indicate that using a base-form conversion table with a scrambled form affects the individual examinee level. (SLD)
Descriptors: College Entrance Examinations, Comparative Testing, Context Effect, Equated Scores
Trevisan, Michael S.; Sax, Gilbert – 1991
The purpose of this study was to compare the reliabilities of two-, three-, four-, and five-choice tests using an incremental option paradigm. Test forms were created incrementally, a method approximating actual test construction procedures. Participants were 154 12th-grade students from the Portland (Oregon) area. A 45-item test with two options…
Descriptors: Comparative Testing, Distractors (Tests), Estimation (Mathematics), Grade 12
Chang, Lei – 1993
Equivalence in reliability and validity across 4-point and 6-point scales was assessed by fitting different measurement models through confirmatory factor analysis of a multitrait-multimethod covariance matrix. Responses to nine Likert-type items designed to measure perceived quantitative ability, self-perceived usefulness of quantitative…
Descriptors: Ability, Comparative Testing, Education Majors, Graduate Students
Lyon, Mark A.; Smith, Douglas K. – 1986
This study examined agreement rates between identified strengths and weaknesses in shared abilities and influences on the Wechsler Intelligence Scale for Children-Revised (WISC-R) and the Kaufman Assessment Battery for Children (K-ABC). Sixty-seven students in the first through seventh grades referred for learning disabilities (LD) evaluation were…
Descriptors: Ability Identification, Comparative Testing, Concurrent Validity, Elementary Education
Peer reviewedStricker, Lawrence J. – Journal of Educational Measurement, 1991
To study whether different forms of the Scholastic Aptitude Test (SAT) used since the mid-1970s varied in their correlations with academic performance criteria, 1975 and 1985 forms were administered to 1,554 and 1,753 high school juniors, respectively. The 1975 form did not have greater validity than the 1985 form. (SLD)
Descriptors: Class Rank, College Entrance Examinations, Comparative Testing, Correlation
Peer reviewedTrevisan, Michael S.; And Others – Educational and Psychological Measurement, 1991
The reliability and validity of multiple-choice tests were computed as a function of the number of options per item and student ability for 435 parochial high school juniors, who were administered the Washington Pre-College Test Battery. Results suggest the efficacy of the three-option item. (SLD)
Descriptors: Ability, Comparative Testing, Distractors (Tests), Grade Point Average
Wise, Steven L.; And Others – 1993
This study assessed whether providing examinees with a choice between computerized adaptive testing (CAT) and self-adaptive testing (SAT) affects test performance in comparison with being assigned a CAT or SAT, and evaluated variables influencing examinee choice of either test form. The relative influences of test type and test choice on examinee…
Descriptors: Ability, Adaptive Testing, Algebra, College Students

Direct link
