NotesFAQContact Us
Collection
Advanced
Search Tips
Showing all 14 results Save | Export
Peer reviewed Peer reviewed
Grosse, Martin E.; Wright, Benjamin D. – Educational and Psychological Measurement, 1985
A model of examinee behavior was used to generate hypotheses about the operation of true-false scores. Confirmation of hypotheses supported the contention that true-false scores contain an error component that makes these tests less reliable than multiple-choice tests. Examinee response style may invalidate a total true-false score. (Author/DWH)
Descriptors: Objective Tests, Response Style (Tests), Test Format, Test Reliability
Johanson, George; Motlomelo, Samuel – 1998
Many textbooks in educational measurement and classroom assessment have chapters devoted to specific item formats. There may be attempts to relate one item format to another, but the chapters and item formats are largely seem as distinct entities with only loose and uncertain connections. This paper synthesizes these discussions. An item format…
Descriptors: Educational Assessment, Essay Tests, Measurement Techniques, Objective Tests
Hardy, Helen – Georgia Social Science Journal, 1981
This report describes the design and field testing of a 50-item objective test designed to measure high school students' understanding of the state of Georgia's history, geography, government, econcomics, and culture. A copy of the test is included in the appendix. (AM)
Descriptors: Cultural Context, Objective Tests, Secondary Education, Social Studies
Peer reviewed Peer reviewed
Downing, Steven M.; And Others – Applied Measurement in Education, 1995
The criterion-related validity evidence and other psychometric characteristics of multiple-choice and multiple true-false (MTF) items in medical specialty certification examinations were compared using results from 21,346 candidates. Advantages of MTF items and implications for test construction are discussed. (SLD)
Descriptors: Cognitive Ability, Licensing Examinations (Professions), Medical Education, Objective Tests
Sax, Gilbert; Reiter, Pauline B. – 1980
Despite the popularity of both multiple-choice (MC) and true-false (TF) items, most investigations comparing the two formats have done so to determine the optimum number of choices to be given to students within a given time period. The purpose of this investigation was to compare the reliabilities and the validities of both formats when the items…
Descriptors: Analysis of Variance, Correlation, Higher Education, Item Analysis
Peer reviewed Peer reviewed
Ory, John C.; And Others – Journal of Educational Psychology, 1980
The study investigated the structural corroboration of instructional evaluation information collected from one source (students) by three different methods: responses to objective questionnaire items, written comments to open-ended questions, and group interview results. The three types of information presented a similar general impression of…
Descriptors: Course Evaluation, Data Collection, Evaluation Methods, Higher Education
Peer reviewed Peer reviewed
Parkes, Jay – Educational Research, 2000
Data from 77 ninth-grade Spanish students who took an objective test, a performance assessment, and a measure of perceptions of control indicate that control perceptions predict scores on performance assessments, not objective tests. Performance assessments thus reflect motivational variables beyond the constructs being tested. (SK)
Descriptors: High Schools, Locus of Control, Motivation, Objective Tests
White, Karl R.; Carcelli, Larry – 1982
The effect on children's test scores of different item formats used in standardized mathematics achievement tests was investigated. In a Utah school, 40 second grade students completed a mathematics computation test using eight different formats derived from five standardized achievement tests. Identical content, taken from the most frequently…
Descriptors: Achievement Tests, Comparative Analysis, Computation, Elementary School Mathematics
Peer reviewed Peer reviewed
Pomplun, Mark; Omar, Md Hafidz – Educational and Psychological Measurement, 1997
Four threats to validity of an alternative objective test item format, the multiple-mark format, were studied with data from a state-mandated assessment with about 30,000 students at each of three grade levels. Reliability and validity coefficients show that the format has promise as an objective format that can be aligned with new curriculum…
Descriptors: Curriculum Development, Elementary School Students, Elementary Secondary Education, Objective Tests
van Roosmalen, Willem M. M. – 1983
The construction of objective tests for native language reading comprehension is described. The tests were designed for the early secondary school years in several kinds of schools, vocational and non-vocational. The description focuses on the use of the Rasch model in test development, to develop a large pool of homogenous items and establish…
Descriptors: Ability Grouping, Difficulty Level, Foreign Countries, Item Banks
Doyle, Teresa F.; Lin, Thung-Rung – 1991
Supervisory performance appraisals may be of limited utility in the validation of bilingual tests because incumbents are often hired to be the only employee in a unit who possesses the skills necessary to do the job. In an effort to provide criterion-related validity for four equivalent forms of a Spanish/English bilingual test for school district…
Descriptors: Adults, Bilingual Teachers, Bilingualism, English
Ory, John C.; Ryan, Katherine E. – 1993
This book for college faculty provides a resource for developing, using, and grading classroom exams. The first chapter addresses ways to determine what content should be included on an exam. The second chapter identifies testing considerations such as number of exams, difficulty level of items, and test length. Chapters 3 and 4 provide guidelines…
Descriptors: Classroom Techniques, Codes of Ethics, Essay Tests, Evaluation Methods
Weltin, Mary M.; Popelka, Beverly A. – 1983
The composite of Armed Services Vocational Aptitude Battery (ASVAB) subtests used to select applicants for entry-level training in Army clerical schools was evaluated by correlating composite scores with training performance scores. Comparisons were made between the multiple R for this optimal set of predictors and that for the composite of…
Descriptors: Achievement, Aptitude Tests, Armed Forces, Clerical Occupations
de Jong, John H. A. L. – 1982
The development and validation of a test of listening comprehension for English as a second language at the Dutch National Institute for Educational Measurement (Cito) is described. The test uses two distinct item formats: true-false items and modified cloze items with two options. Both item formats were found to measure foreign language listening…
Descriptors: Cloze Procedure, English (Second Language), Evaluation Criteria, Foreign Countries