ERIC - Search Results

Showing 91 to 105 of 141 results

Effects of Response Format on Diagnostic Assessment of Scholastic Achievement.

Peer reviewed

Birenbaum, Menucha; And Others – Applied Psychological Measurement, 1992

The effect of multiple-choice (MC) or open-ended (OE) response format on diagnostic assessment of algebra test performance was investigated with 231 eighth and ninth graders in Tel Aviv (Israel) using bug or rule space analysis. Both analyses indicated closer similarity between parallel OE subsets than between stem-equivalent OE and MC subsets.…

Descriptors: Algebra, Comparative Testing, Educational Assessment, Educational Diagnosis

Equating Scores from Adaptive to Linear Tests

Peer reviewed

van der Linden, Wim J. – Applied Psychological Measurement, 2006

Two local methods for observed-score equating are applied to the problem of equating an adaptive test to a linear test. In an empirical study, the methods were evaluated against a method based on the test characteristic function (TCF) of the linear test and traditional equipercentile equating applied to the ability estimates on the adaptive test…

Descriptors: Adaptive Testing, Computer Assisted Testing, Test Format, Equated Scores

The Instructional Validity of Computer Administered Tests.

Siskind, Theresa G.; And Others – 1992

The instructional validity of computer administered tests was studied with a focus on whether differences in test scores and item behavior are a function of instructional mode (computer versus non-computer). In the first of 3 studies, performance test scores for approximately 400 high school students in 1990-91 for tasks accomplished with the…

Descriptors: Comparative Testing, Comprehension, Computer Assisted Instruction, Computer Assisted Testing

Measurement Characteristics of the Finding Embedded Figures Test with Middle School Students.

Melancon, Janet G.; Thompson, Bruce – 1989

Classical measurement theory was used to investigate the measurement (psychometric) characteristics of both parts of the Finding Embedded Figures Test (FEFT) administered in either a "no guessing" supply format or a multiple-choice selection format to undergraduate college students or to middle school students. Three issues were…

Descriptors: Comparative Testing, Construct Validity, Higher Education, Junior High School Students

A Comparison of the Efficiency, Reliability and Validity of Adaptive and Conventional Listening Tests.

Vispoel, Walter P.; Twing, Jon S. – 1989

The measurement precision, efficiency, and validity of an adaptive test and four conventional listening tests designed to assess musical ability were compared. The conventional tests were the Seashore Tonal Memory Test and three tests (peaked, rectangular, and maximum discrimination) constructed from items in the 278-item adaptive test pool. The…

Descriptors: Adaptive Testing, College Students, Comparative Testing, High School Students

Comparison and Equating of Paper-Administered, Computer-Administered and Computerized Adaptive Tests of Achievement.

Olsen, James B.; And Others – 1986

Student achievement test scores were compared and equated, using three different testing methods: paper-administered, computer-administered, and computerized adaptive testing. The tests were developed from third and sixth grade mathematics item banks of the California Assessment Program. The paper and the computer-administered tests were identical…

Descriptors: Achievement Tests, Adaptive Testing, Comparative Testing, Computer Assisted Testing

Oral Assessment in GCSE Economics. Research Papers in Economics Education, Number 14.

Moon, Russ – 1988

Since the emergence of the General Certificate of Secondary Education (GCSE) there have been calls for improved methods of assessing economics. Oral assessment has been suggested as a possible technique and this study investigated whether it might be used to allow students to demonstrate achievement in GCSE economics. The empirical study compared…

Descriptors: Achievement Tests, Comparative Analysis, Comparative Testing, Economics Education

Behaviorally Anchored Rating Scales vs. Summated Rating Scales: Psychometric Properties and Susceptibility to Rating Bias.

Peer reviewed

Kinicki, Angelo J.; And Others – Educational and Psychological Measurement, 1985

Using both the Behaviorally Anchored Rating Scales (BARS) and the Purdue University Scales, 727 undergraduates rated 32 instructors. The BARS had less halo effect, more leniency error, and lower interrater reliability. Both formats were valid. The two tests did not differ in ratee discrimination or susceptibility to rating bias. (Author/GDC)

Descriptors: Behavior Rating Scales, College Faculty, Comparative Testing, Higher Education

Effects of Passage and Item Scrambling on Equating Relationships.

Peer reviewed

Harris, Deborah J. – Applied Psychological Measurement, 1991

Effects of passage and item-scrambling on equipercentile and item-response theory equating were investigated using 2 scrambled versions of the American College Testing Program Assessment for approximately 25,000 examinees. Results indicate that using a base-form conversion table with a scrambled form affects the individual examinee level. (SLD)

Descriptors: College Entrance Examinations, Comparative Testing, Context Effect, Equated Scores

Estimating the Optimum Choice Format Using an Incremental Option Paradigm.

Trevisan, Michael S.; Sax, Gilbert – 1991

The purpose of this study was to compare the reliabilities of two-, three-, four-, and five-choice tests using an incremental option paradigm. Test forms were created incrementally, a method approximating actual test construction procedures. Participants were 154 12th-grade students from the Portland (Oregon) area. A 45-item test with two options…

Descriptors: Comparative Testing, Distractors (Tests), Estimation (Mathematics), Grade 12

Using Confirmatory Factor Analysis of Multitrait-Multimethod Data To Assess the Psychometrical Equivalence of 4-Point and 6-Point Likert-Type Scales.

Chang, Lei – 1993

Equivalence in reliability and validity across 4-point and 6-point scales was assessed by fitting different measurement models through confirmatory factor analysis of a multitrait-multimethod covariance matrix. Responses to nine Likert-type items designed to measure perceived quantitative ability, self-perceived usefulness of quantitative…

Descriptors: Ability, Comparative Testing, Education Majors, Graduate Students

Concordance between Shared Abilities and Influences on the WISC-R and K-ABC.

Lyon, Mark A.; Smith, Douglas K. – 1986

This study examined agreement rates between identified strengths and weaknesses in shared abilities and influences on the Wechsler Intelligence Scale for Children-Revised (WISC-R) and the Kaufman Assessment Battery for Children (K-ABC). Sixty-seven students in the first through seventh grades referred for learning disabilities (LD) evaluation were…

Descriptors: Ability Identification, Comparative Testing, Concurrent Validity, Elementary Education

Current Validity of 1975 and 1985 SATs: Implications for Validity Trends since the Mid-1970s.

Peer reviewed

Stricker, Lawrence J. – Journal of Educational Measurement, 1991

To study whether different forms of the Scholastic Aptitude Test (SAT) used since the mid-1970s varied in their correlations with academic performance criteria, 1975 and 1985 forms were administered to 1,554 and 1,753 high school juniors, respectively. The 1975 form did not have greater validity than the 1985 form. (SLD)

Descriptors: Class Rank, College Entrance Examinations, Comparative Testing, Correlation

The Effects of the Number of Options per Item and Student Ability on Test Validity and Reliability.

Peer reviewed

Trevisan, Michael S.; And Others – Educational and Psychological Measurement, 1991

The reliability and validity of multiple-choice tests were computed as a function of the number of options per item and student ability for 435 parochial high school juniors, who were administered the Washington Pre-College Test Battery. Results suggest the efficacy of the three-option item. (SLD)

Descriptors: Ability, Comparative Testing, Distractors (Tests), Grade Point Average

The Role of Anxiety in Examinee Preference for Self-Adapted Testing.

Wise, Steven L.; And Others – 1993

This study assessed whether providing examinees with a choice between computerized adaptive testing (CAT) and self-adaptive testing (SAT) affects test performance in comparison with being assigned a CAT or SAT, and evaluated variables influencing examinee choice of either test form. The relative influences of test type and test choice on examinee…

Descriptors: Ability, Adaptive Testing, Algebra, College Students
