Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 4 |
Since 2006 (last 20 years) | 6 |
Descriptor
Objective Tests | 12 |
Scores | 12 |
Test Items | 12 |
Multiple Choice Tests | 5 |
Mathematics Tests | 4 |
Test Reliability | 4 |
Achievement Tests | 3 |
Difficulty Level | 3 |
Higher Education | 3 |
Mathematics Achievement | 3 |
Response Style (Tests) | 3 |
More ▼ |
Source
Author
Katz, Irvin R. | 2 |
Keehner, Madeleine | 2 |
Moon, Jung Aa | 2 |
Bordage, Georges | 1 |
Burton, Richard F. | 1 |
Clingman, Joy M. | 1 |
Crehan, Kevin D. | 1 |
Daniels, Vijay J. | 1 |
Daughtry, Don | 1 |
Fowler, Robert L. | 1 |
Gierl, Mark J. | 1 |
More ▼ |
Publication Type
Journal Articles | 9 |
Reports - Research | 8 |
Reports - Evaluative | 3 |
Reports - Descriptive | 1 |
Speeches/Meeting Papers | 1 |
Tests/Questionnaires | 1 |
Education Level
Higher Education | 3 |
Postsecondary Education | 3 |
Elementary Secondary Education | 1 |
Grade 12 | 1 |
Grade 4 | 1 |
Grade 8 | 1 |
Secondary Education | 1 |
Audience
Policymakers | 1 |
Teachers | 1 |
Location
Turkey | 1 |
Laws, Policies, & Programs
Assessments and Surveys
National Assessment of… | 1 |
Program for International… | 1 |
What Works Clearinghouse Rating
Moon, Jung Aa; Keehner, Madeleine; Katz, Irvin R. – Educational Assessment, 2020
We investigated how item formats influence test takers' response tendencies under uncertainty. Adult participants solved content-equivalent math items in three formats: multiple-selection multiple-choice, grid with forced-choice (true-false) options, and grid with non-forced-choice options. Participants showed a greater tendency to commit (rather…
Descriptors: College Students, Test Wiseness, Test Format, Test Items
Kelly, William E.; Daughtry, Don – College Student Journal, 2018
This study developed an abbreviated form of Barron's (1953) Ego Strength Scale for use in research among college student samples. A version of Barron's scale was administered to 100 undergraduate college students. Using item-total score correlations and internal consistency, the scale was reduced to 18 items (Es18). The Es18 possessed adequate…
Descriptors: Undergraduate Students, Self Concept Measures, Test Length, Scores
Moon, Jung Aa; Keehner, Madeleine; Katz, Irvin R. – Educational Measurement: Issues and Practice, 2019
The current study investigated how item formats and their inherent affordances influence test-takers' cognition under uncertainty. Adult participants solved content-equivalent math items in multiple-selection multiple-choice and four alternative grid formats. The results indicated that participants' affirmative response tendency (i.e., judge the…
Descriptors: Affordances, Test Items, Test Format, Test Wiseness
Özkan, Yesim Özer; Özaslan, Nesrin – International Journal of Evaluation and Research in Education, 2018
The aim of this study is to determine the level of achievement of students participating in Programme for International Student Assessment (PISA) 2003 and PISA 2012 tests in Turkey according to questions in the mathematical literacy test. This study is a descriptive survey. Within the scope of the study, the mathematical literacy test items were…
Descriptors: Foreign Countries, Academic Achievement, Mathematics Tests, Test Items
Daniels, Vijay J.; Bordage, Georges; Gierl, Mark J.; Yudkowsky, Rachel – Advances in Health Sciences Education, 2014
Objective structured clinical examinations (OSCEs) are used worldwide for summative examinations but often lack acceptable reliability. Research has shown that reliability of scores increases if OSCE checklists for medical students include only clinically relevant items. Also, checklists are often missing evidence-based items that high-achieving…
Descriptors: Graduate Medical Education, Check Lists, Scores, Internal Medicine
National Assessment Governing Board, 2012
Since 1973, the National Assessment of Educational Progress (NAEP) has gathered information about student achievement in mathematics. Results of these periodic assessments, produced in print and web-based formats, provide valuable information to a wide variety of audiences. They inform citizens about the nature of students' comprehension of the…
Descriptors: Academic Achievement, Mathematics Achievement, National Competency Tests, Grade 4
Perrin, David W.; Kerasotes, Dean L. – 1979
It was hypothesized that using asterisks as attention focusing devices would cause students to read all asteriked test items more carefully and would improve test scores of undergraduate education students. Sixty-three undergraduates majoring in elementary or special education were administered a 36-item objective test. Asterisks were used to…
Descriptors: Difficulty Level, Higher Education, Objective Tests, Response Style (Tests)

McMorris, Robert F.; And Others – Journal of Educational Measurement, 1987
Consistency of gain from changing test answers was tested for students instructed about answer-changing research results, and composition of the gain was analyzed by examining the students' reasons for changing. Mean gain remained positive and consistent with gain for previously studied uninstructed groups; amount of change was also stable.…
Descriptors: Difficulty Level, Graduate Students, Higher Education, Instruction
Multiple Choice and True/False Tests: Reliability Measures and Some Implications of Negative Marking
Burton, Richard F. – Assessment & Evaluation in Higher Education, 2004
The standard error of measurement usefully provides confidence limits for scores in a given test, but is it possible to quantify the reliability of a test with just a single number that allows comparison of tests of different format? Reliability coefficients do not do this, being dependent on the spread of examinee attainment. Better in this…
Descriptors: Multiple Choice Tests, Error of Measurement, Test Reliability, Test Items

Harasym, P. H.; And Others – Evaluation and the Health Professions, 1980
Coded, as opposed to free response items, in a multiple choice physiology test had a cueing effect which raised students' scores, especially for lower achievers. Reliability of coded items was also lower. Item format and scoring method had an effect on test results. (GDC)
Descriptors: Achievement Tests, Comparative Testing, Cues, Higher Education

Fowler, Robert L.; Clingman, Joy M. – Educational and Psychological Measurement, 1992
Monte Carlo techniques are used to examine the power of the "B" statistic of R. L. Brennan (1972) to detect negatively discriminating items drawn from a variety of nonnormal population distributions. A simplified procedure is offered for conducting an item-discrimination analysis on typical classroom objective tests. (SLD)
Descriptors: Classroom Techniques, Elementary Secondary Education, Equations (Mathematics), Item Analysis
Crehan, Kevin D.; And Others – 1993
A strategy is proposed for combining scores from multiple-choice achievement measures with performance assessments. The specific situation discussed involves the revision of a curriculum-based multiple-choice and performance assessment testing program for grades 1 through 6 for a large school district. Reading, language-arts, and mathematics…
Descriptors: Achievement Tests, Curriculum Based Assessment, Educational Testing, Elementary Education