Showing 1 to 15 of 25 results
Engelhard, George, Jr.; Wind, Stefanie A. – College Board, 2013
The major purpose of this study is to examine the quality of ratings assigned to CR (constructed-response) questions in large-scale assessments from the perspective of Rasch Measurement Theory. Rasch Measurement Theory provides a framework for the examination of rating scale category structure that can yield useful information for interpreting the…
Descriptors: Measurement Techniques, Rating Scales, Test Theory, Scores
Peer reviewed
van der Linden, Wim J. – Measurement: Interdisciplinary Research and Perspectives, 2010
The traditional way of equating the scores on a new test form X to those on an old form Y is equipercentile equating for a population of examinees. Because the population is likely to change between the two administrations, a popular approach is to equate for a "synthetic population." The authors of the articles in this issue of the…
Descriptors: Test Format, Equated Scores, Population Distribution, Population Trends
Berger, Martijn P. F.; Veerkamp, Wim J. J. – 1994
The designing of tests has been a source of concern for test developers over the past decade. Various kinds of test forms have been applied. Among these are the fixed-form test, the adaptive test, and the testlet. Each of these forms has its own design. In this paper, the construction of test forms is placed within the general framework of optimal…
Descriptors: Adaptive Testing, Foreign Countries, Research Design, Selection
Peer reviewed
Hawkins, Katherine W. – Communication Education, 1987
Provides a brief, nontechnical overview of latent trait models and argues for the preferability of these models (particularly the Rasch logistic model) over classical test models. Offers an example application of the Rasch model and discusses implications for the use of latent trait models for communication educators. (AEW)
Descriptors: Higher Education, Journalism Education, Latent Trait Theory, Teaching Methods
Peer reviewed
Haladyna, Thomas M.; Downing, Steven M. – Applied Measurement in Education, 1989
A taxonomy of 43 rules for writing multiple-choice test items is presented, based on a consensus of 46 textbooks. These guidelines are presented as complete and authoritative, with solid consensus apparent for 33 of the rules. Four rules lack consensus, and 5 rules were cited fewer than 10 times. (SLD)
Descriptors: Classification, Interrater Reliability, Multiple Choice Tests, Objective Tests
Peer reviewed
Dudley, Albert – Language Testing, 2006
This study examined the multiple true-false (MTF) test format in second language testing by comparing multiple-choice (MCQ) and multiple true-false (MTF) test formats in two language areas of general English: vocabulary and reading. Two counter-balanced experimental designs--one for each language area--were examined in terms of the number of MCQ…
Descriptors: Second Language Learning, Test Format, Validity, Testing
Peer reviewed
Balch, William R. – Teaching of Psychology, 1989
Studies the effect of item order on test scores and completion time. Students scored slightly higher when test items were ordered sequentially (following the text and lectures) than when items were grouped by text chapter but ordered randomly within each group, or when items were ordered entirely at random. Found no differences in completion time. (Author/LS)
Descriptors: Educational Research, Higher Education, Performance, Psychology
Peer reviewed
Bruno, James E.; Dirkzwager, A. – Educational and Psychological Measurement, 1995
Determining the optimal number of choices on a multiple-choice test is explored analytically from an information theory perspective. The analysis revealed that, in general, three choices seem optimal. This finding is in agreement with previous statistical and psychometric research. (SLD)
Descriptors: Distractors (Tests), Information Theory, Multiple Choice Tests, Psychometrics
Peer reviewed
Xu, Yuejin; Iran-Nejad, Asghar; Thoma, Stephen J. – Journal of Interactive Online Learning, 2007
The purpose of the study was to determine comparability of an online version to the original paper-pencil version of Defining Issues Test 2 (DIT2). This study employed methods from both Classical Test Theory (CTT) and Item Response Theory (IRT). Findings from CTT analyses supported the reliability and discriminant validity of both versions.…
Descriptors: Computer Assisted Testing, Test Format, Comparative Analysis, Test Theory
Peer reviewed
Leary, Linda F.; Dorans, Neil J. – Review of Educational Research, 1985
Research on the potential effects of different item arrangement schemes on item statistics is reviewed for three separate periods. Earliest studies investigated the simple main effect of item order on test performance. The late 1960s emphasized interactions between item order and examinees' characteristics. Current concern focuses on item…
Descriptors: Achievement Tests, Aptitude Tests, Item Analysis, Latent Trait Theory
Peer reviewed
Wiliam, Dylan – Review of Research in Education, 2010
The idea that validity should be considered a property of inferences, rather than of assessments, has developed slowly over the past century. In early writings about the validity of educational assessments, validity was defined as a property of an assessment. The most common definition was that an assessment was valid to the extent that it…
Descriptors: Educational Assessment, Validity, Inferences, Construct Validity
Norris, Stephen P. – 1988
A study examined whether the process of gathering verbal reports of subjects' thinking while taking multiple-choice critical thinking tests could be used to infer the reasoning process used and identify test items which do not require critical thinking skills. Four factors can render an inference of a subject's critical thinking skills…
Descriptors: Cognitive Processes, Critical Thinking, High School Students, High Schools
Lancaster, Diana M.; And Others – 1987
Difficulty and discrimination ability were compared between multiple choice and short answer items in midterm and final examinations for the internal medicine course at Louisiana State University School of Dentistry. The examinations were administered to 67 sophomore dental students in that course. Additionally, the impact of the source of the…
Descriptors: Dental Schools, Dentistry, Difficulty Level, Discriminant Analysis
White, David M. – 1986
This book discusses tricks for answering questions on the Law School Admission Test (LSAT). The tricks are based on an analysis of 12 editions of the LSAT which have been made public pursuant to New York's Truth in Testing Law. Sample LSAT questions published by the Law School Admission Council are referenced to exemplify the tricks' applications…
Descriptors: College Entrance Examinations, Higher Education, Multiple Choice Tests, Pretesting
White, David M. – 1985
This book discusses tricks for answering questions on the Graduate Management Admission Test (GMAT). The tricks are based on an analysis of 20 editions of the GMAT which have been made public pursuant to New York's Truth in Testing Law. Sample GMAT questions published by the Graduate Management Admission Council are referenced to exemplify the…
Descriptors: College Entrance Examinations, Higher Education, Multiple Choice Tests, Pretesting