ERIC - Search Results

Descriptor

Objective Tests	12
Testing Problems	12
Multiple Choice Tests	6
Test Items	6
Test Construction	5
Achievement Tests	4
Test Format	4
Test Use	4
Test Validity	4
Educational Assessment	3
Educational Change	3
Elementary Secondary Education	3
Foreign Countries	3
Higher Education	3
Measurement Techniques	3
Standardized Tests	3
Test Bias	3
Ability Grouping	2
Decision Making	2
Difficulty Level	2
Guessing (Tests)	2
Language Tests	2
Report Cards	2
Standards	2
State Programs	2
More ▼

Source

Applied Measurement in…	2
Journal of Educational…	1
Journal of Research and…	1

Author

Neill, Monty	2
Anderson, Paul S.	1
Ebel, Robert L.	1
Haladyna, Thomas A.	1
Hambleton, Ronald K.	1
Kolstad, Rosemarie K.	1
Murphy, Edward	1
Saliba, Alcyone	1
Samejima, Fumiko	1
Terwilliger, James S.	1
Wang, Jianjun	1
de Jong, John H. A. L.	1
van Roosmalen, Willem M. M.	1
More ▼

Publication Type

Reports - Evaluative	12
Speeches/Meeting Papers	5
Journal Articles	4
Information Analyses	1
Tests/Questionnaires	1

Education Level

Audience

Location

Netherlands

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 12 results Save | Export

Critical Values of Guessing on True-False and Multiple-Choice Tests.

Download full text

Wang, Jianjun – 1995

Effects of blind guessing on the success of passing true-false and multiple-choice tests are investigated under a stochastic binomial model. Critical values of guessing are thresholds which signify when the effect of guessing is negligible. By checking a table of critical values assembled in this paper, one can make a decision with 95% confidence…

Descriptors: Bayesian Statistics, Grading, Guessing (Tests), Models

Proposed Solutions to Two Problems of Test Construction.

Peer reviewed

Ebel, Robert L. – Journal of Educational Measurement, 1982

Reasonable and practical solutions to two major problems confronting the developer of any test of educational achievement (what to measure and how to measure it) are proposed, defended, and defined. (Author/PN)

Descriptors: Measurement Techniques, Objective Tests, Test Construction, Test Items

Complex Multiple-Choice Items Fail to Measure Achievement.

Peer reviewed

Kolstad, Rosemarie K.; And Others – Journal of Research and Development in Education, 1983

A study compared college students' performance on complex multiple-choice tests with scores on multiple true-false clusters. Researchers concluded that the multiple-choice tests did not accurately measure students' knowledge and that cueing and guessing led to grade inflation. (PP)

Descriptors: Achievement Tests, Difficulty Level, Guessing (Tests), Higher Education

Distinctions between Item Format and Objectivity in Scoring.

Terwilliger, James S. – 1991

This paper clarifies important distinctions in item writing and item scoring and considers the implications of these distinctions for developing guidelines related to test construction for training teachers. The terminology used to describe and classify paper and pencil test questions frequently confuses two distinct features of questions:…

Descriptors: Classroom Techniques, Educational Testing, Higher Education, Measurement Techniques

The Effectiveness of Several Multiple-Choice Formats.

Peer reviewed

Haladyna, Thomas A. – Applied Measurement in Education, 1992

Several multiple-choice item formats are examined in the current climate of test reform. The reform movement is discussed as it affects use of the following formats: (1) complex multiple-choice; (2) alternate choice; (3) true-false; (4) multiple true-false; and (5) the context dependent item set. (SLD)

Descriptors: Cognitive Psychology, Comparative Testing, Context Effect, Educational Change

A Psychometric Perspective on Authentic Measurement.

Peer reviewed

Hambleton, Ronald K.; Murphy, Edward – Applied Measurement in Education, 1992

The validity of several criticisms of objective tests is addressed, and the viability of some alternatives to objective testing is discussed. Evidence against multiple-choice tests is not as strong as has been claimed. Authentic assessments may not always be better, and research about new forms of assessment is necessary. (SLD)

Descriptors: Achievement Tests, Educational Assessment, Literature Reviews, Measurement Techniques

Changing World Patterns of Machine-Scored Objective Testing: The Expected Impact of the Multi-Digit Method.

Download full text

Anderson, Paul S.; Saliba, Alcyone – 1987

The use of optical scanners and computers in educational testing is common where objective testing methods (such as true-false, matching, and multiple-choice items) are well-established means of evaluating educational achievement. Where non-objective testing methods (such as fill-in-the-blank, short-answer, and essay items) have been more common,…

Descriptors: Computer Assisted Testing, Developed Nations, Developing Nations, Educational Technology

Content-Based Observation of Informative Distractors and Efficiency of Ability Estimation.

Samejima, Fumiko – 1990

The shortcomings of the conventional way of using and interpreting multiple-choice tests are summarized. Some theories and methodologies that can be applied for better use multiple-choice test items are described. Empirical facts are introduced to support the theoretical observations. New strategies are proposed that will reduce "noise"…

Descriptors: Ability Identification, Distractors (Tests), Equations (Mathematics), Estimation (Mathematics)

Objective Based Tests for Reading Comprehension.

van Roosmalen, Willem M. M. – 1983

The construction of objective tests for native language reading comprehension is described. The tests were designed for the early secondary school years in several kinds of schools, vocational and non-vocational. The description focuses on the use of the Rasch model in test development, to develop a large pool of homogenous items and establish…

Descriptors: Ability Grouping, Difficulty Level, Foreign Countries, Item Banks

Testing Our Children: A Report Card on State Assessment Systems.

Download full text

Neill, Monty – 1997

In this study, FairTest evaluated how well state assessment practices live up to the promise of high standards without standardization. The practices of states were measured against standards derived from the "Principles and Indicators for Student Assessment Systems," a 1995 publication of education and civil rights groups working…

Descriptors: Achievement Tests, Decision Making, Educational Assessment, Educational Change

Testing Our Children: A Report Card on State Assessment Systems. Executive Summary and Summary of State Findings.

Download full text

Neill, Monty – 1997

FairTest evaluated how well state assessment practices live up to the promise of high standards without standardization. The practices of states were measured against standards derived from the "Principles and Indicators for Student Assessment Systems," a 1995 publication of education and civil rights groups working through the National…

Descriptors: Achievement Tests, Decision Making, Educational Assessment, Educational Change

Focusing in on a Latent Trait: An Attempt at Construct Validation by Means of the Rasch Model.

Download full text

de Jong, John H. A. L. – 1983

The Rasch model for test analysis is a latent-trait model, which specifies the relationship between observable test performance and the unobservable traits or abilities assumed under test performance. In most cases, the test constructor has no clue as to whether the latent traits postulated by the model are indeed the abilities he wants to…

Descriptors: Ability Grouping, Cloze Procedure, Correlation, English (Second Language)