ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	2
Since 2006 (last 20 years)	2

Descriptor

Difficulty Level	11
Item Analysis	11
Scoring Formulas	11
Test Items	9
Test Construction	5
Guessing (Tests)	4
Higher Education	3
Latent Trait Theory	3
Multiple Choice Tests	3
Psychometrics	3
Statistical Studies	3
Test Reliability	3
Test Validity	3
College Entrance Examinations	2
Computer Programs	2
Confidence Testing	2
Equated Scores	2
Mathematical Models	2
Scaling	2
Scores	2
Statistical Analysis	2
Test Bias	2
Testing Problems	2
Weighted Scores	2
Ability	1
More ▼

Source

Advances in Health Sciences…	1
Educational and Psychological…	1
Journal of Educational…	1
Review of Educational Research	1

Publication Type

Reports - Research	9
Speeches/Meeting Papers	6
Journal Articles	3
Information Analyses	1

Education Level

Audience

Researchers

Location

Laws, Policies, & Programs

Assessments and Surveys

Graduate Record Examinations	1
Matching Familiar Figures Test	1
SAT (College Admission Test)	1

What Works Clearinghouse Rating

Showing all 11 results Save | Export

Multiple True-False Items: A Comparison of Scoring Algorithms

Peer reviewed

Direct link

Lahner, Felicitas-Maria; Lörwald, Andrea Carolin; Bauer, Daniel; Nouns, Zineb Miriam; Krebs, René; Guttormsen, Sissel; Fischer, Martin R.; Huwendiek, Sören – Advances in Health Sciences Education, 2018

Multiple true-false (MTF) items are a widely used supplement to the commonly used single-best answer (Type A) multiple choice format. However, an optimal scoring algorithm for MTF items has not yet been established, as existing studies yielded conflicting results. Therefore, this study analyzes two questions: What is the optimal scoring algorithm…

Descriptors: Scoring Formulas, Scoring Rubrics, Objective Tests, Multiple Choice Tests

Developing, Analyzing, and Using Distractors for Multiple-Choice Tests in Education: A Comprehensive Review

Peer reviewed

Direct link

Gierl, Mark J.; Bulut, Okan; Guo, Qi; Zhang, Xinxin – Review of Educational Research, 2017

Multiple-choice testing is considered one of the most effective and enduring forms of educational assessment that remains in practice today. This study presents a comprehensive review of the literature on multiple-choice testing in education focused, specifically, on the development, analysis, and use of the incorrect options, which are also…

Descriptors: Multiple Choice Tests, Difficulty Level, Accuracy, Error Patterns

New Directions in Matching Familiar Figures Test Research Resulting From Scoring and Item Analyses.

Download full text

Brinzer, Raymond J. – 1979

The problem engendered by the Matching Familiar Figures (MFF) Test is one of instrument integrity (II). II is delimited by validity, reliability, and utility of MFF as a measure of the reflective-impulsive construct. Validity, reliability and utility of construct assessment may be improved by utilizing: (1) a prototypic scoring model that will…

Descriptors: Conceptual Tempo, Difficulty Level, Item Analysis, Research Methodology

The Impact of Item Deletion on Equating Conversions and Reported Score Distributions.

Peer reviewed

Dorans, Neil J. – Journal of Educational Measurement, 1986

The analytical decomposition demonstrates how the effects of item characteristics, test properties, individual examinee responses, and rounding rules combine to produce the item deletion effect on the equating/scaling function and candidate scores. The empirical portion of the report illustrates the effects of item deletion on reported score…

Descriptors: Difficulty Level, Equated Scores, Item Analysis, Latent Trait Theory

Some Exploratory Indices for Selection of a Test Equating Method.

Jaeger, Richard M. – 1980

Five statistical indices are developed and described which may be used for determining (1) when linear equating of two approximately parallel tests is adequate, and (2) whan a more complex method such as equipercentile equating must be used. The indices were based on: (1) similarity of cumulative score distributions; (2) shape of the raw-score to…

Descriptors: College Entrance Examinations, Difficulty Level, Equated Scores, Higher Education

Robbins-Monro Procedures for Tailored Testing

Peer reviewed

Lord, Frederic M. – Educational and Psychological Measurement, 1971

Descriptors: Ability, Adaptive Testing, Computer Oriented Programs, Difficulty Level

The Statistical Structure of Multiple-Choice Items.

Donlon, Thomas F.; Fitzpatrick, Anne R. – 1978

On the basis of past research efforts to improve multiple-choice test information through differential weighting of responses to wrong answers (distractors), two statistical indices are developed. Each describes the properties of response distributions across the options of an item. Jaspen's polyserial generalization of the biserial correlation…

Descriptors: Confidence Testing, Difficulty Level, Guessing (Tests), High Schools

The Use of Precalibrated Item Bank to Establish and Maintain Cutoff Scores: A Case Study of the Florida Teacher Certification Examination.

Download full text

Legg, Sue M. – 1982

A case study of the Florida Teacher Certification Examination (FTCE) program was described to assist others launching the development of large scale item banks. FTCE has four subtests: Mathematics, Reading, Writing, and Professional Education. Rasch calibrated item banks have been developed for all subtests except Writing. The methods used to…

Descriptors: Cutting Scores, Difficulty Level, Field Tests, Item Analysis

Scoreing and Analyzing Confidence Tests. Final Report.

Download full text

Rippey, Robert M. – 1971

Technical improvements, which may be made in the reliability and validity of tests through confidence scores, are discussed. However, studies indicate that subjects do not handle their confidence uniformly. (MS)

Descriptors: Computer Programs, Confidence Testing, Correlation, Difficulty Level

Assessing Guessing Behavior Using the Three-Parameter Logistic Model.

Download full text

Kingston, Neal M. – 1985

Birnbaum's three-parameter logistic item response model was used to study guessing behavior of low ability examinees on the Graduate Record Examinations (GRE) General Test, Verbal Measure. GRE scoring procedures had recently changed, from a scoring formula which corrected for guessing, to number-right scoring. The three-parameter theory was used…

Descriptors: Academic Aptitude, Analysis of Variance, College Entrance Examinations, Difficulty Level

Improving the Predictive Ability of Placement Tests Using the Rasch Model for Scoring.

Smith, Richard M.; Mitchell, Virginia P. – 1979

To improve the accuracy of college placement, Rasch scoring and person-fit statistics on the Comparative Guidance and Placement test (CGP) was compared to the traditional right-only scoring. Correlations were calculated between English and mathematics course grades and scores of 1,448 entering freshmen on the reading, writing, and mathematics…

Descriptors: Academic Ability, Computer Programs, Difficulty Level, Goodness of Fit

Bauer, Daniel	1
Brinzer, Raymond J.	1
Bulut, Okan	1
Donlon, Thomas F.	1
Dorans, Neil J.	1
Fischer, Martin R.	1
Fitzpatrick, Anne R.	1
Gierl, Mark J.	1
Guo, Qi	1
Guttormsen, Sissel	1
Huwendiek, Sören	1
Jaeger, Richard M.	1
Kingston, Neal M.	1
Krebs, René	1
Lahner, Felicitas-Maria	1
Legg, Sue M.	1
Lord, Frederic M.	1
Lörwald, Andrea Carolin	1
Mitchell, Virginia P.	1
Nouns, Zineb Miriam	1
Rippey, Robert M.	1
Smith, Richard M.	1
Zhang, Xinxin	1
More ▼