ERIC - Search Results

Descriptor

Comparative Testing	13
Scores	13
Test Construction	13
Higher Education	7
Test Format	6
Computer Assisted Testing	4
Multiple Choice Tests	4
Test Items	4
Test Use	4
Difficulty Level	3
Achievement Tests	2
Adaptive Testing	2
Aptitude Tests	2
College Entrance Examinations	2
College Students	2
Graduate Students	2
High School Students	2
Item Response Theory	2
Licensing Examinations…	2
Mathematics Achievement	2
Mathematics Tests	2
Performance	2
Performance Based Assessment	2
Secondary School Teachers	2
Standardized Tests	2
More ▼

Source

Applied Measurement in…	2
Educational Measurement:…	1
Evaluation and the Health…	1
Journal of Educational…	1
Journal of Experimental…	1
TESL Canada Journal	1

Author

Anderson, Paul S.	1
Ansley, Timothy N.	1
Armstrong, Anne-Marie	1
Colliver, Jerry A.	1
Des Brisay, Margaret	1
Forsyth, Robert A.	1
Frary, Robert B.	1
Friedman, Stephen J.	1
Kanzler, Eileen M.	1
Mazzeo, John	1
McManus, Barbara Luger	1
Semple, Brian McLean	1
Sykes, Robert C.	1
Wasserman, John D.	1
Wise, Steven L.	1
More ▼

Publication Type

Reports - Research	10
Journal Articles	7
Reports - Evaluative	3
Speeches/Meeting Papers	3
Tests/Questionnaires	2
Numerical/Quantitative Data	1
Opinion Papers	1

Education Level

Audience

Researchers

Location

Laws, Policies, & Programs

Assessments and Surveys

Bayley Scales of Infant…	1
College Level Examination…	1
SAT (College Admission Test)	1
Test of English as a Foreign…	1

What Works Clearinghouse Rating

Showing all 13 results Save | Export

The None-of-the-Above Option: An Empirical Study.

Peer reviewed

Frary, Robert B. – Applied Measurement in Education, 1991

The use of the "none-of-the-above" option (NOTA) in 20 college-level multiple-choice tests was evaluated for classes with 100 or more students. Eight academic disciplines were represented, and 295 NOTA and 724 regular test items were used. It appears that the NOTA can be compatible with good classroom measurement. (TJH)

Descriptors: College Students, Comparative Testing, Difficulty Level, Discriminant Analysis

Assessing the Effects of Computer Administration on Scores and Parameter Estimates Using IRT Models.

Download full text

Sykes, Robert C.; And Others – 1991

To investigate the psychometric feasibility of replacing a paper-and-pencil licensing examination with a computer-administered test, a validity study was conducted. The computer-administered test (Cadm) was a common set of items for all test takers, distinct from computerized adaptive testing, in which test takers receive items appropriate to…

Descriptors: Adults, Certification, Comparative Testing, Computer Assisted Testing

The Influence of Reading on Listening Test Scores.

Peer reviewed

Friedman, Stephen J.; Ansley, Timothy N. – Journal of Experimental Education, 1990

To investigate the relationship between reading and listening test scores, 3 different sets of listening items accompanied by answer sheets requiring varying amounts of reading were administered to 1,200 students in grades 3 through 8. Listening scores increased as more printed information was added to the answer sheet. (SLD)

Descriptors: Answer Sheets, Comparative Testing, Elementary Education, Elementary School Students

Comparability of Computer and Paper-and-Pencil Scores for Two CLEP General Examinations. College Board Report No. 91-5.

Mazzeo, John; And Others – 1991

Two studies investigated the comparability of scores from paper-and-pencil and computer-administered versions of the College-Level Examination Program (CLEP) General Examinations in mathematics and English composition. The first study used a prototype computer-administered version on each examination for 94 students for mathematics and 116 for…

Descriptors: College Entrance Examinations, College Students, Comparative Testing, Computer Assisted Testing

Three Applications of Customized Testing in Local School Districts.

Peer reviewed

Forsyth, Robert A.; And Others – Applied Measurement in Education, 1992

Eighth grade teachers in three local school districts helped customize two standardized norm-referenced tests for ninth graders to investigate effects of deleting some items and adding locally constructed items. Results indicate that percentile ranks for the customized tests could be very different from those for the complete test. (SLD)

Descriptors: Adaptive Testing, Comparative Testing, Elementary Secondary Education, Grade 9

Effect of Position-within-Sequence on Case Performance in a Multiple-Stations Examination Using Standardized-Patient Cases.

Peer reviewed

Colliver, Jerry A.; And Others – Evaluation and the Health Professions, 1991

A study was conducted to assess the effect of station position in a multiple-stations performance-based examination administered to 127 senior medical students. There was no evidence for a sequence effect on student performance, with no improvement on scores awarded for standardized cases across successive cases in the examination. (SLD)

Descriptors: Clinical Experience, Comparative Testing, Higher Education, Licensing Examinations (Professions)

The Revised SAT's and the ACT's--Are They Really Different?

Download full text

McManus, Barbara Luger – 1992

This paper discusses whether or not revisions of the Scholastic Aptitude Test (SAT) and the American College Test (ACT) have created such significant differences between the two tests that a student could conceivably score significantly higher on one than the other. The SAT has been revised to meet the needs of an increasingly diverse student…

Descriptors: Ability, Achievement Tests, Aptitude Tests, College Entrance Examinations

The Factor Structure of the Behavior Rating Scale of the Bayley Scales of Infant Development-II: Cross-Sample, Cross-Sectional, and Cross-Method Investigations of Construct Validity.

Download full text

Wasserman, John D.; And Others – 1993

The original Bayley Scales of Infant Development (BSID) have been among the most popular measures of performance and aptitude of infants. In this study, the construct validity of scores on the Behavior Rating Scale of the revised Bayley Scales, the BSID-II, was investigated using national standardization and clinical samples of children ranging in…

Descriptors: Age Differences, Aptitude Tests, Behavior Rating Scales, Child Development

Performance Assessment: An International Experiment.

Download full text

Semple, Brian McLean – 1992

The second International Assessment of Educational Progress focused on the mathematics and science achievement of 13-year-olds. Performance assessments were used as part of the overall assessment in four countries (England, Scotland, Soviet Union, and Taiwan) and five Canadian provinces. The performance assessment approach drew heavily on the…

Descriptors: Academic Achievement, Adolescents, Comparative Testing, Educational Assessment

Comparison of Cognitive Achievement in Objective Testing: Multi-Digit and Multiple-Choice Tests.

Download full text

Anderson, Paul S.; Kanzler, Eileen M. – 1985

Test scores were compared for two types of objective achievement tests--multiple choice tests and the recently developed Multi-Digit Test (MDT) procedure. MDT is an approximation of the fill-in-the-blank technique. Students select their answers from long lists of alphabetized terms, with each answer corresponding to a number from 001 to 999. The…

Descriptors: Achievement Tests, Cloze Procedure, Comparative Testing, Computer Assisted Testing

A Comparison of Self-Adapted and Computerized Adaptive Tests.

Peer reviewed

Wise, Steven L.; And Others – Journal of Educational Measurement, 1992

Performance of 156 undergraduate and 48 graduate students on a self-adapted test (SFAT)--students choose the difficulty level of their test items--was compared with performance on a computer-adapted test (CAT). Those taking the SFAT obtained higher ability scores and reported lower posttest state anxiety than did CAT takers. (SLD)

Descriptors: Adaptive Testing, Comparative Testing, Computer Assisted Testing, Difficulty Level

Cognitive-Style Differences in Testing Situations.

Peer reviewed

Armstrong, Anne-Marie – Educational Measurement: Issues and Practice, 1993

The effects of test performance of differentially written multiple-choice tests and test takers' cognitive style were studied for 47 graduate students and 35 public school and college teachers. Adhering to test-writing item guidelines resulted in mean scores basically the same for two groups of differing cognitive style. (SLD)

Descriptors: Cognitive Style, College Faculty, Comparative Testing, Graduate Students

Problems in Developing an Alternative to the TOEFL.

Peer reviewed
PDF on ERIC

Download full text

Des Brisay, Margaret – TESL Canada Journal, 1994

Data from the Canadian Test of English for Scholars and Trainees (CanTEST) are compared to data from the Test of English as a Foreign Language (TOEFL) to establish CanTEST as a valid admissions tool for English-as-a-Second Language college applicants. Data are taken from four groups of examinees who took both tests. (eight references) (LR)

Descriptors: Admission Criteria, Comparative Analysis, Comparative Testing, Correlation