ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	4
Since 2006 (last 20 years)	5

Descriptor

Multiple Choice Tests	17
Test Items	17
Test Length	17
Test Format	8
Test Reliability	7
Higher Education	6
Difficulty Level	5
Test Construction	5
Testing Problems	5
Achievement Tests	4
Guessing (Tests)	4
Item Analysis	4
Test Validity	4
Equated Scores	3
Foreign Countries	3
Item Response Theory	3
Scoring	3
Cheating	2
Comparative Analysis	2
Criterion Referenced Tests	2
Error of Measurement	2
Latent Trait Theory	2
Monte Carlo Methods	2
Objective Tests	2
Sample Size	2
More ▼

Source

Educational and Psychological…	2
Assessment & Evaluation in…	1
Assessment and Evaluation in…	1
English Teaching	1
International Journal of…	1
Journal of Educational…	1
Journal of Educational and…	1
Journal of Experimental…	1
Participatory Educational…	1
ProQuest LLC	1

Publication Type

Reports - Research	12
Journal Articles	10
Speeches/Meeting Papers	3
Dissertations/Theses -…	1
Guides - Non-Classroom	1
Numerical/Quantitative Data	1
Opinion Papers	1
Reports - Descriptive	1
Reports - Evaluative	1

Education Level

Higher Education	2
Postsecondary Education	1

Audience

Researchers

Location

Australia	1
Israel	1
South Korea	1

Laws, Policies, & Programs

Assessments and Surveys

Advanced Placement…

What Works Clearinghouse Rating

Showing 1 to 15 of 17 results Save | Export

A Randomization P-Value Test for Detecting Copying on Multiple-Choice Exams

Peer reviewed

Direct link

Lang, Joseph B. – Journal of Educational and Behavioral Statistics, 2023

This article is concerned with the statistical detection of copying on multiple-choice exams. As an alternative to existing permutation- and model-based copy-detection approaches, a simple randomization p-value (RP) test is proposed. The RP test, which is based on an intuitive match-score statistic, makes no assumptions about the distribution of…

Descriptors: Identification, Cheating, Multiple Choice Tests, Item Response Theory

Effect of Item Parameter Drift in Mixed Format Common Items on Test Equating

Peer reviewed
PDF on ERIC

Download full text

Uysal, Ibrahim; Sahin-Kürsad, Merve; Kiliç, Abdullah Faruk – Participatory Educational Research, 2022

The aim of the study was to examine the common items in the mixed format (e.g., multiple-choices and essay items) contain parameter drifts in the test equating processes performed with the common item nonequivalent groups design. In this study, which was carried out using Monte Carlo simulation with a fully crossed design, the factors of test…

Descriptors: Test Items, Test Format, Item Response Theory, Equated Scores

Effects of Test Level Discrimination and Difficulty on Answer-Copying Indices

Peer reviewed
PDF on ERIC

Download full text

Sunbul, Onder; Yormaz, Seha – International Journal of Evaluation and Research in Education, 2018

In this study Type I Error and the power rates of omega (?) and GBT (generalized binomial test) indices were investigated for several nominal alpha levels and for 40 and 80-item test lengths with 10,000-examinee sample size under several test level restrictions. As a result, Type I error rates of both indices were found to be below the acceptable…

Descriptors: Difficulty Level, Cheating, Duplication, Test Length

Effects of Text Length and Question Type on Test-Takers' Performance on Fill-in-the-Blank Items in Korean CSAT

Peer reviewed
PDF on ERIC

Download full text

Bae, Minryoung; Lee, Byungmin – English Teaching, 2018

This study examines the effects of text length and question type on Korean EFL readers' reading comprehension of the fill-in-the-blank items in Korean CSAT. A total of 100 Korean EFL college students participated in the study. After divided into three different proficiency groups, the participants took a reading comprehension test which consisted…

Descriptors: Test Items, Language Tests, Second Language Learning, Second Language Instruction

Mixed-Format Test Score Equating: Effect of Item-Type Multidimensionality, Length and Composition of Common-Item Set, and Group Ability Difference

Direct link

Wang, Wei – ProQuest LLC, 2013

Mixed-format tests containing both multiple-choice (MC) items and constructed-response (CR) items are now widely used in many testing programs. Mixed-format tests often are considered to be superior to tests containing only MC items although the use of multiple item formats leads to measurement challenges in the context of equating conducted under…

Descriptors: Equated Scores, Test Format, Test Items, Test Length

What's Wrong with Three-Option Multiple Choice Items?

Peer reviewed

Owen, Steven V.; Froman, Robin D. – Educational and Psychological Measurement, 1987

To test further for efficacy of three-option achievement items, parallel three- and five-option item tests were distributed randomly to college students. Results showed no differences in mean item difficulty, mean discrimination or total test score, but a substantial reduction in time spent on three-option items. (Author/BS)

Descriptors: Achievement Tests, Higher Education, Multiple Choice Tests, Test Format

Determining the Length of Multiple Choice Criterion-Referenced Tests When an Answer-Until-Correct Scoring Procedure Is Used.

Peer reviewed

Wilcox, Rand R. – Educational and Psychological Measurement, 1982

When determining criterion-referenced test length, problems of guessing are shown to be more serious than expected. A new method of scoring is presented that corrects for guessing without assuming that guessing is random. Empirical investigations of the procedure are examined. Test length can be substantially reduced. (Author/CM)

Descriptors: Criterion Referenced Tests, Guessing (Tests), Multiple Choice Tests, Scoring

Multiple Choice and True-False: Reliability and Validity Compared.

Peer reviewed

Green, Kathy – Journal of Experimental Education, 1979

Reliabilities and concurrent validities of teacher-made multiple-choice and true-false tests were compared. No significant differences were found even when multiple-choice reliability was adjusted to equate testing time. (Author/MH)

Descriptors: Comparative Testing, Higher Education, Multiple Choice Tests, Test Format

Multiple-Choice and True/False Tests: Myths and Misapprehensions

Peer reviewed

Direct link

Burton, Richard F. – Assessment and Evaluation in Higher Education, 2005

Examiners seeking guidance on multiple-choice and true/false tests are likely to encounter various faulty or questionable ideas. Twelve of these are discussed in detail, having to do mainly with the effects on test reliability of test length, guessing and scoring method (i.e. number-right scoring or negative marking). Some misunderstandings could…

Descriptors: Guessing (Tests), Multiple Choice Tests, Objective Tests, Test Reliability

Multiple Choice and True/False Tests: Reliability Measures and Some Implications of Negative Marking

Peer reviewed

Direct link

Burton, Richard F. – Assessment & Evaluation in Higher Education, 2004

The standard error of measurement usefully provides confidence limits for scores in a given test, but is it possible to quantify the reliability of a test with just a single number that allows comparison of tests of different format? Reliability coefficients do not do this, being dependent on the spread of examinee attainment. Better in this…

Descriptors: Multiple Choice Tests, Error of Measurement, Test Reliability, Test Items

Test Length and Validity: An Application of Test Theory to a Finite World.

Myers, Charles T. – 1978

The viewpoint is expressed that adding to test reliability by either selecting a more homogeneous set of items, restricting the range of item difficulty as closely as possible to the most efficient level, or increasing the number of items will not add to test validity and that there is considerable danger that efforts to increase reliability may…

Descriptors: Achievement Tests, Item Analysis, Multiple Choice Tests, Test Construction

Q. How Many Options Should a Multiple-Choice Question Have? (a) 2. (b) 3. (c) 4. At-a-glance Research Report.

Catts, Ralph – 1978

The reliability of multiple choice tests--containing different numbers of response options--was investigated for 260 students enrolled in technical college economics courses. Four test forms, constructed from previously used four-option items, were administered, consisting of (1) 60 two-option items--two distractors randomly discarded; (2) 40…

Descriptors: Answer Sheets, Difficulty Level, Foreign Countries, Higher Education

Comparison of Difficulties and Reliabilities of Math-Completion and Multiple-Choice Item Formats.

Download full text

Oosterhof, Albert C.; Coats, Pamela K. – 1981

Instructors who develop classroom examinations that require students to provide a numerical response to a mathematical problem are often very concerned about the appropriateness of the multiple-choice format. The present study augments previous research relevant to this concern by comparing the difficulty and reliability of multiple-choice and…

Descriptors: Comparative Analysis, Difficulty Level, Grading, Higher Education

The Effect of Keying All Options Correct on Equating Functions and Scores.

Download full text

Lenel, Julia C.; Gilmer, Jerry S. – 1986

In some testing programs an early item analysis is performed before final scoring in order to validate the intended keys. As a result, some items which are flawed and do not discriminate well may be keyed so as to give credit to examinees no matter which answer was chosen. This is referred to as allkeying. This research examined how varying the…

Descriptors: Equated Scores, Item Analysis, Latent Trait Theory, Licensing Examinations (Professions)

Effect of the Guessing Parameter on the Estimation of the Item Discrimination and Difficulty Parameters When Three-Parameter Logistic Model Is Assumed.

Samejima, Fumiko – 1986

Item analysis data fitting the normal ogive model were simulated in order to investigate the problems encountered when applying the three-parameter logistic model. Binary item tests containing 10 and 35 items were created, and Monte Carlo methods simulated the responses of 2,000 and 500 examinees. Item parameters were obtained using Logist 5.…

Descriptors: Computer Simulation, Difficulty Level, Guessing (Tests), Item Analysis

Previous Page | Next Page »

Pages: 1 | 2

Burton, Richard F.	2
Bae, Minryoung	1
Budescu, David V.	1
Catts, Ralph	1
Coats, Pamela K.	1
Froman, Robin D.	1
Gilmer, Jerry S.	1
Green, Kathy	1
Kiliç, Abdullah Faruk	1
Lang, Joseph B.	1
Lee, Byungmin	1
Lenel, Julia C.	1
Millman, Jason	1
Myers, Charles T.	1
Nevo, Baruch	1
Oosterhof, Albert C.	1
Owen, Steven V.	1
Sahin-Kürsad, Merve	1
Samejima, Fumiko	1
Sunbul, Onder	1
Uysal, Ibrahim	1
Wang, Wei	1
Wilcox, Rand R.	1
Yormaz, Seha	1
More ▼