Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 2 |
Since 2006 (last 20 years) | 5 |
Descriptor
Responses | 9 |
Test Format | 9 |
Test Reliability | 9 |
Test Items | 6 |
Multiple Choice Tests | 5 |
Adults | 2 |
College Students | 2 |
Comparative Analysis | 2 |
Difficulty Level | 2 |
Higher Education | 2 |
Item Analysis | 2 |
More ▼ |
Source
Assessment & Evaluation in… | 1 |
ETS Research Report Series | 1 |
Educational Measurement:… | 1 |
Educational and Psychological… | 1 |
Eurasian Journal of… | 1 |
International Journal of… | 1 |
Journal of Educational and… | 1 |
Author
Aiken, Lewis R. | 1 |
Bush, Martin | 1 |
Chiu, Chia-Yi | 1 |
Demir, Ergul | 1 |
Frisbie, David A. | 1 |
Gurdil Ege, Hatice | 1 |
Henning, Grant | 1 |
Köhn, Hans Friedrich | 1 |
Maeda, Hotaka | 1 |
Schuldberg, David | 1 |
Wang, Yu | 1 |
More ▼ |
Publication Type
Reports - Research | 8 |
Journal Articles | 7 |
Information Analyses | 1 |
Reports - Evaluative | 1 |
Speeches/Meeting Papers | 1 |
Tests/Questionnaires | 1 |
Education Level
Audience
Location
North Carolina | 1 |
Washington | 1 |
Laws, Policies, & Programs
Assessments and Surveys
Minnesota Multiphasic… | 1 |
Test of English as a Foreign… | 1 |
What Works Clearinghouse Rating
Gurdil Ege, Hatice; Demir, Ergul – Eurasian Journal of Educational Research, 2020
Purpose: The present study aims to evaluate how the reliabilities computed using a, Stratified a, Angoff-Feldt, and Feldt-Raju estimators may differ when sample size (500, 1000, and 2000) and item type ratio of dichotomous to polytomous items (2:1; 1:1, 1:2) included in the scale are varied. Research Methods: In this study, Cronbach's a,…
Descriptors: Test Format, Simulation, Test Reliability, Sample Size
Wang, Yu; Chiu, Chia-Yi; Köhn, Hans Friedrich – Journal of Educational and Behavioral Statistics, 2023
The multiple-choice (MC) item format has been widely used in educational assessments across diverse content domains. MC items purportedly allow for collecting richer diagnostic information. The effectiveness and economy of administering MC items may have further contributed to their popularity not just in educational assessment. The MC item format…
Descriptors: Multiple Choice Tests, Nonparametric Statistics, Test Format, Educational Assessment
Maeda, Hotaka – International Journal of Social Research Methodology, 2015
Likert scales are used to make relative and absolute judgments about measures of attitude. Despite its ubiquitous use, only few studies have investigated the effects of altering the configurations of the response options. The purpose of this experiment was to explore the effects of response option orientation and directionality in Likert scales…
Descriptors: Likert Scales, Online Surveys, Responses, Test Reliability
Bush, Martin – Assessment & Evaluation in Higher Education, 2015
The humble multiple-choice test is very widely used within education at all levels, but its susceptibility to guesswork makes it a suboptimal assessment tool. The reliability of a multiple-choice test is partly governed by the number of items it contains; however, longer tests are more time consuming to take, and for some subject areas, it can be…
Descriptors: Guessing (Tests), Multiple Choice Tests, Test Format, Test Reliability
Wang, Zhen; Yao, Lihua – ETS Research Report Series, 2013
The current study used simulated data to investigate the properties of a newly proposed method (Yao's rater model) for modeling rater severity and its distribution under different conditions. Our study examined the effects of rater severity, distributions of rater severity, the difference between item response theory (IRT) models with rater effect…
Descriptors: Test Format, Test Items, Responses, Computation
Schuldberg, David – 1988
Indices were constructed to measure individual differences in the effects of the automated testing format and repeated testing on Minnesota Multiphasic Personality Inventory (MMPI) responses. Two types of instability measures were studied within a data set from the responses of 150 undergraduate students who took a computer-administered and…
Descriptors: College Students, Computer Assisted Testing, Higher Education, Individual Differences

Aiken, Lewis R. – Educational and Psychological Measurement, 1983
Each of six forms of a 10-item teacher evaluation rating scale, having two to seven response categories per form, was administered to over 100 college students. Means of item responses and item variances increased with the number of response categories. Internal consistency of total scores did not change systematically. (Author/PN)
Descriptors: College Students, Higher Education, Item Analysis, Rating Scales
Henning, Grant – 1991
In order to evaluate the Test of English as a Foreign Language (TOEFL) vocabulary item format and to determine the effectiveness of alternative vocabulary test items, this study investigated the functioning of eight different multiple-choice formats that differed with regard to: (1) length and inference-generating quality of the stem; (2) the…
Descriptors: Adults, Context Effect, Difficulty Level, English (Second Language)

Frisbie, David A. – Educational Measurement: Issues and Practice, 1992
Literature related to the multiple true-false (MTF) item format is reviewed. Each answer cluster of a MTF item may have several true items and the correctness of each is judged independently. MTF tests appear efficient and reliable, although they are a bit harder than multiple choice items for examinees. (SLD)
Descriptors: Achievement Tests, Difficulty Level, Literature Reviews, Multiple Choice Tests