Showing all 8 results
Peer reviewed
Kim, Sooyeon; Walker, Michael E.; McHale, Frederick – Journal of Educational Measurement, 2010
In this study we examined variations of the nonequivalent groups equating design for tests containing both multiple-choice (MC) and constructed-response (CR) items to determine which design was most effective in producing equivalent scores across the two tests to be equated. Using data from a large-scale exam, this study investigated the use of…
Descriptors: Measures (Individuals), Scoring, Equated Scores, Test Bias
Peer reviewed
Bennett, Randy Elliot; And Others – Journal of Educational Measurement, 1991
The relationship of multiple-choice and free-response items on the College Board's Advanced Placement Computer Science Examination was studied using confirmatory factor analysis. Results with 2 samples of 1,000 high school students suggested that the most parsimonious fit was achieved using a single factor. Implications for construct validity are…
Descriptors: Chi Square, College Entrance Examinations, Comparative Testing, Computer Science
Peer reviewed
Skaggs, Gary; Lissitz, Robert W. – Journal of Educational Measurement, 1992
The consistency of several item bias detection methods was studied across different administrations of the same items, using data from a mathematics test given to approximately 6,600 eighth-grade students. The Mantel-Haenszel and item-response-theory-based sum-of-squares methods were the most consistent. (SLD)
Descriptors: Comparative Testing, Grade 8, Item Bias, Item Response Theory
Peer reviewed
Stricker, Lawrence J. – Journal of Educational Measurement, 1991
To study whether different forms of the Scholastic Aptitude Test (SAT) used since the mid-1970s varied in their correlations with academic performance criteria, 1975 and 1985 forms were administered to 1,554 and 1,753 high school juniors, respectively. The 1975 form did not have greater validity than the 1985 form. (SLD)
Descriptors: Class Rank, College Entrance Examinations, Comparative Testing, Correlation
Peer reviewed
Bolger, Niall; Kellaghan, Thomas – Journal of Educational Measurement, 1990
Gender differences in scholastic achievement as a function of measurement method were examined by comparing performance of 739 15-year-old boys and 758 15-year-old girls in Irish high schools on multiple-choice and free-response tests of mathematics, Irish, and English achievement. Method-based gender differences are discussed. (SLD)
Descriptors: Academic Achievement, Adolescents, Comparative Testing, English
Peer reviewed
Bridgeman, Brent – Journal of Educational Measurement, 1992
Examinees in a regular administration of the quantitative portion of the Graduate Record Examination responded to particular items in a machine-scannable multiple-choice format. Volunteers (n=364) used a computer to answer open-ended counterparts of these items. Scores for both formats demonstrated similar correlational patterns. (SLD)
Descriptors: Answer Sheets, College Entrance Examinations, College Students, Comparative Testing
Peer reviewed
Martinez, Michael E. – Journal of Educational Measurement, 1991
Figural response items (FRIs) in science were administered to 347 fourth graders, 365 eighth graders, and 322 twelfth graders. Item and test statistics from parallel FRIs and multiple-choice questions show that FRIs are more difficult and more discriminating. The relevance of guessing to FRIs and the diagnostic value of the item type are highlighted.
Descriptors: Comparative Testing, Constructed Response, Elementary School Students, Elementary Secondary Education
Peer reviewed
Wise, Steven L.; And Others – Journal of Educational Measurement, 1992
Performance of 156 undergraduate and 48 graduate students on a self-adapted test (SFAT), in which students choose the difficulty level of their test items, was compared with performance on a computer-adapted test (CAT). Those taking the SFAT obtained higher ability scores and reported lower posttest state anxiety than did CAT takers. (SLD)
Descriptors: Adaptive Testing, Comparative Testing, Computer Assisted Testing, Difficulty Level