Showing all 12 results
Peer reviewed
Direct link
Bush, Martin – Assessment & Evaluation in Higher Education, 2015
The humble multiple-choice test is very widely used within education at all levels, but its susceptibility to guesswork makes it a suboptimal assessment tool. The reliability of a multiple-choice test is partly governed by the number of items it contains; however, longer tests are more time-consuming to take, and for some subject areas, it can be…
Descriptors: Guessing (Tests), Multiple Choice Tests, Test Format, Test Reliability
Peer reviewed
Direct link
Beserra, Vagner; Nussbaum, Miguel; Grass, Antonio – Interactive Learning Environments, 2017
When using educational video games, particularly drill-and-practice video games, there are several ways of providing an answer to a quiz. The majority of paper-based options can be classified as being either multiple-choice or constructed-response. Therefore, in the process of creating an educational drill-and-practice video game, one fundamental…
Descriptors: Multiple Choice Tests, Drills (Practice), Educational Games, Video Games
Peer reviewed
Direct link
Kim, Ahyoung Alicia; Lee, Shinhye; Chapman, Mark; Wilmes, Carsten – TESOL Quarterly: A Journal for Teachers of English to Speakers of Other Languages and of Standard English as a Second Dialect, 2019
This study aimed to investigate how Grade 1-2 English language learners (ELLs) differ in their performance on a writing test in two test modes: paper and online. Participants were 139 ELLs in the United States. They completed three writing tasks, representing three test modes: (1) a paper in which students completed their writing using a…
Descriptors: Elementary School Students, English (Second Language), Second Language Learning, Second Language Instruction
Peer reviewed
Direct link
Stevenson, Claire E.; Heiser, Willem J.; Resing, Wilma C. M. – Journal of Psychoeducational Assessment, 2016
Multiple-choice (MC) analogy items are often used in cognitive assessment. However, in dynamic testing, where the aim is to provide insight into potential for learning and the learning process, constructed-response (CR) items may be of benefit. This study investigated whether training with CR or MC items leads to differences in the strategy…
Descriptors: Logical Thinking, Multiple Choice Tests, Test Items, Cognitive Tests
Peer reviewed
PDF on ERIC Download full text
Wang, Zhen; Yao, Lihua – ETS Research Report Series, 2013
The current study used simulated data to investigate the properties of a newly proposed method (Yao's rater model) for modeling rater severity and its distribution under different conditions. Our study examined the effects of rater severity, distributions of rater severity, the difference between item response theory (IRT) models with rater effect…
Descriptors: Test Format, Test Items, Responses, Computation
Peer reviewed
PDF on ERIC Download full text
Kim, Sooyeon; Walker, Michael E. – ETS Research Report Series, 2009
We examined the appropriateness of the anchor composition in a mixed-format test, which includes both multiple-choice (MC) and constructed-response (CR) items, using subpopulation invariance indices. We derived linking functions in the nonequivalent groups with anchor test (NEAT) design using two types of anchor sets: (a) MC only and (b) a mix of…
Descriptors: Test Format, Equated Scores, Test Items, Multiple Choice Tests
Peer reviewed
PDF on ERIC Download full text
DeCarlo, Lawrence T. – ETS Research Report Series, 2008
Rater behavior in essay grading can be viewed as a signal-detection task, in that raters attempt to discriminate between latent classes of essays, with the latent classes being defined by a scoring rubric. The present report examines basic aspects of an approach to constructed-response (CR) scoring via a latent-class signal-detection model. The…
Descriptors: Scoring, Responses, Test Format, Bias
Peer reviewed
PDF on ERIC Download full text
Kim, Sooyeon; Walker, Michael E.; McHale, Frederick – ETS Research Report Series, 2008
This study examined variations of a nonequivalent groups equating design used with constructed-response (CR) tests to determine which design was most effective in producing equivalent scores across the two tests to be equated. Using data from a large-scale exam, the study investigated the use of anchor CR item rescoring in the context of classical…
Descriptors: Equated Scores, Comparative Analysis, Test Format, Responses
Peer reviewed
Direct link
Kim, Seonghoon; Kolen, Michael J. – Applied Measurement in Education, 2006
Four item response theory linking methods (2 moment methods and 2 characteristic curve methods) were compared to concurrent (CO) calibration with the focus on the degree of robustness to format effects (FEs) when applying the methods to multidimensional data that reflected the FEs associated with mixed-format tests. Based on the quantification of…
Descriptors: Item Response Theory, Robustness (Statistics), Test Format, Comparative Analysis
Johanson, George A.; Gips, Crystal J. – 1993
The decision to use a forced-choice test item format versus an item format where choice is not forced (e.g., a Likert scale) might best be determined by the nature of the information sought, since the difficult decisions required by forced-choice formats may result in different scaling than an unforced method would. If a forced choice is desired,…
Descriptors: Administrator Attitudes, Comparative Analysis, Likert Scales, Principals
Peer reviewed
Jaeger, Richard M.; Wolf, Marian B. – Journal of Educational Measurement, 1982
The effectiveness of a Likert-scale format and three paired-choice presentation formats in discriminating among parents' preferences for curriculum elements was compared. Paired-choice formats gave more reliable discriminations, which increased with stimulus specificity. Similarities and differences in preference orderings are discussed. (Author/CM)
Descriptors: Comparative Analysis, Elementary Education, Parent Attitudes, Parent School Relationship
Park, Chung; Allen, Nancy L. – 1994
This study is part of continuing research into the meaning of future National Assessment of Educational Progress (NAEP) science scales. In this study, the test framework, as examined by NAEP's consensus process, and attributes of the items, identified by science experts, cognitive scientists, and measurement specialists, are examined. Preliminary…
Descriptors: Communication (Thought Transfer), Comparative Analysis, Construct Validity, Content Validity