ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	2
Since 2006 (last 20 years)	5

Descriptor

Responses	9
Test Format	9
Test Reliability	9
Test Items	6
Multiple Choice Tests	5
Adults	2
College Students	2
Comparative Analysis	2
Difficulty Level	2
Higher Education	2
Item Analysis	2
Item Response Theory	2
Models	2
Sample Size	2
Test Validity	2
Ability	1
Accuracy	1
Achievement Tests	1
Algorithms	1
Classification	1
Computation	1
Computer Assisted Testing	1
Context Effect	1
Diagnostic Tests	1
Educational Assessment	1
More ▼

Source

Assessment & Evaluation in…	1
ETS Research Report Series	1
Educational Measurement:…	1
Educational and Psychological…	1
Eurasian Journal of…	1
International Journal of…	1
Journal of Educational and…	1

Author

Aiken, Lewis R.	1
Bush, Martin	1
Chiu, Chia-Yi	1
Demir, Ergul	1
Frisbie, David A.	1
Gurdil Ege, Hatice	1
Henning, Grant	1
Köhn, Hans Friedrich	1
Maeda, Hotaka	1
Schuldberg, David	1
Wang, Yu	1
Wang, Zhen	1
Yao, Lihua	1
More ▼

Publication Type

Reports - Research	8
Journal Articles	7
Information Analyses	1
Reports - Evaluative	1
Speeches/Meeting Papers	1
Tests/Questionnaires	1

Education Level

Audience

Location

North Carolina	1
Washington	1

Laws, Policies, & Programs

Assessments and Surveys

Minnesota Multiphasic…	1
Test of English as a Foreign…	1

What Works Clearinghouse Rating

Showing all 9 results Save | Export

Examining of Internal Consistency Coefficients in Mixed-Format Tests in Different Simulation Conditions

Peer reviewed
PDF on ERIC

Download full text

Gurdil Ege, Hatice; Demir, Ergul – Eurasian Journal of Educational Research, 2020

Purpose: The present study aims to evaluate how the reliabilities computed using a, Stratified a, Angoff-Feldt, and Feldt-Raju estimators may differ when sample size (500, 1000, and 2000) and item type ratio of dichotomous to polytomous items (2:1; 1:1, 1:2) included in the scale are varied. Research Methods: In this study, Cronbach's a,…

Descriptors: Test Format, Simulation, Test Reliability, Sample Size

Nonparametric Classification Method for Multiple-Choice Items in Cognitive Diagnosis

Peer reviewed

Direct link

Wang, Yu; Chiu, Chia-Yi; Köhn, Hans Friedrich – Journal of Educational and Behavioral Statistics, 2023

The multiple-choice (MC) item format has been widely used in educational assessments across diverse content domains. MC items purportedly allow for collecting richer diagnostic information. The effectiveness and economy of administering MC items may have further contributed to their popularity not just in educational assessment. The MC item format…

Descriptors: Multiple Choice Tests, Nonparametric Statistics, Test Format, Educational Assessment

Response Option Configuration of Online Administered Likert Scales

Peer reviewed

Direct link

Maeda, Hotaka – International Journal of Social Research Methodology, 2015

Likert scales are used to make relative and absolute judgments about measures of attitude. Despite its ubiquitous use, only few studies have investigated the effects of altering the configurations of the response options. The purpose of this experiment was to explore the effects of response option orientation and directionality in Likert scales…

Descriptors: Likert Scales, Online Surveys, Responses, Test Reliability

Reducing the Need for Guesswork in Multiple-Choice Tests

Peer reviewed

Direct link

Bush, Martin – Assessment & Evaluation in Higher Education, 2015

The humble multiple-choice test is very widely used within education at all levels, but its susceptibility to guesswork makes it a suboptimal assessment tool. The reliability of a multiple-choice test is partly governed by the number of items it contains; however, longer tests are more time consuming to take, and for some subject areas, it can be…

Descriptors: Guessing (Tests), Multiple Choice Tests, Test Format, Test Reliability

The Effects of Rater Severity and Rater Distribution on Examinees' Ability Estimation for Constructed-Response Items. Research Report. ETS RR-13-23

Peer reviewed
PDF on ERIC

Download full text

Wang, Zhen; Yao, Lihua – ETS Research Report Series, 2013

The current study used simulated data to investigate the properties of a newly proposed method (Yao's rater model) for modeling rater severity and its distribution under different conditions. Our study examined the effects of rater severity, distributions of rater severity, the difference between item response theory (IRT) models with rater effect…

Descriptors: Test Format, Test Items, Responses, Computation

Indices of Individuals' Sensitivities To Computerized Test Administration and Repeated Testing.

Download full text

Schuldberg, David – 1988

Indices were constructed to measure individual differences in the effects of the automated testing format and repeated testing on Minnesota Multiphasic Personality Inventory (MMPI) responses. Two types of instability measures were studied within a data set from the responses of 150 undergraduate students who took a computer-administered and…

Descriptors: College Students, Computer Assisted Testing, Higher Education, Individual Differences

Number of Response Categories and Statistics on a Teacher Rating Scale.

Peer reviewed

Aiken, Lewis R. – Educational and Psychological Measurement, 1983

Each of six forms of a 10-item teacher evaluation rating scale, having two to seven response categories per form, was administered to over 100 college students. Means of item responses and item variances increased with the number of response categories. Internal consistency of total scores did not change systematically. (Author/PN)

Descriptors: College Students, Higher Education, Item Analysis, Rating Scales

A Study of the Effects of Contextualization and Familiarization on Responses to the TOEFL Vocabulary Test Items.

Download full text

Henning, Grant – 1991

In order to evaluate the Test of English as a Foreign Language (TOEFL) vocabulary item format and to determine the effectiveness of alternative vocabulary test items, this study investigated the functioning of eight different multiple-choice formats that differed with regard to: (1) length and inference-generating quality of the stem; (2) the…

Descriptors: Adults, Context Effect, Difficulty Level, English (Second Language)

The Multiple True-False Item Format: A Status Review.

Peer reviewed

Frisbie, David A. – Educational Measurement: Issues and Practice, 1992

Literature related to the multiple true-false (MTF) item format is reviewed. Each answer cluster of a MTF item may have several true items and the correctness of each is judged independently. MTF tests appear efficient and reliable, although they are a bit harder than multiple choice items for examinees. (SLD)

Descriptors: Achievement Tests, Difficulty Level, Literature Reviews, Multiple Choice Tests