ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	0
Since 2017 (last 10 years)	2
Since 2007 (last 20 years)	2

Descriptor

Difficulty Level	18
Scoring Formulas	18
Test Items	18
Item Analysis	9
Higher Education	7
Multiple Choice Tests	6
Test Construction	6
Guessing (Tests)	5
Test Reliability	5
Mathematical Models	4
Statistical Analysis	4
Test Validity	4
College Entrance Examinations	3
Cutting Scores	3
Latent Trait Theory	3
Psychometrics	3
Statistical Studies	3
Test Format	3
Testing Problems	3
Adaptive Testing	2
Computer Programs	2
Confidence Testing	2
Equated Scores	2
Error of Measurement	2
Evaluation Methods	2
More ▼

Source

Journal of Experimental…	2
Educational and Psychological…	1
Journal of Educational…	1
Journal of Psychoeducational…	1
Review of Educational Research	1

Publication Type

Reports - Research	15
Speeches/Meeting Papers	9
Journal Articles	5
Information Analyses	1
Reports - Descriptive	1

Education Level

Audience

Researchers

Location

Laws, Policies, & Programs

Assessments and Surveys

Comprehensive Tests of Basic…	1
Graduate Record Examinations	1
Matching Familiar Figures Test	1
SAT (College Admission Test)	1
State Trait Anxiety Inventory	1

What Works Clearinghouse Rating

Showing 1 to 15 of 18 results Save | Export

On Using Simulations to Inform Decision Making during Instrument Development

Peer reviewed

Direct link

Morgan, Grant B.; Moore, Courtney A.; Floyd, Harlee S. – Journal of Psychoeducational Assessment, 2018

Although content validity--how well each item of an instrument represents the construct being measured--is foundational in the development of an instrument, statistical validity is also important to the decisions that are made based on the instrument. The primary purpose of this study is to demonstrate how simulation studies can be used to assist…

Descriptors: Simulation, Decision Making, Test Construction, Validity

Developing, Analyzing, and Using Distractors for Multiple-Choice Tests in Education: A Comprehensive Review

Peer reviewed

Direct link

Gierl, Mark J.; Bulut, Okan; Guo, Qi; Zhang, Xinxin – Review of Educational Research, 2017

Multiple-choice testing is considered one of the most effective and enduring forms of educational assessment that remains in practice today. This study presents a comprehensive review of the literature on multiple-choice testing in education focused, specifically, on the development, analysis, and use of the incorrect options, which are also…

Descriptors: Multiple Choice Tests, Difficulty Level, Accuracy, Error Patterns

New Directions in Matching Familiar Figures Test Research Resulting From Scoring and Item Analyses.

Download full text

Brinzer, Raymond J. – 1979

The problem engendered by the Matching Familiar Figures (MFF) Test is one of instrument integrity (II). II is delimited by validity, reliability, and utility of MFF as a measure of the reflective-impulsive construct. Validity, reliability and utility of construct assessment may be improved by utilizing: (1) a prototypic scoring model that will…

Descriptors: Conceptual Tempo, Difficulty Level, Item Analysis, Research Methodology

The Impact of Item Deletion on Equating Conversions and Reported Score Distributions.

Peer reviewed

Dorans, Neil J. – Journal of Educational Measurement, 1986

The analytical decomposition demonstrates how the effects of item characteristics, test properties, individual examinee responses, and rounding rules combine to produce the item deletion effect on the equating/scaling function and candidate scores. The empirical portion of the report illustrates the effects of item deletion on reported score…

Descriptors: Difficulty Level, Equated Scores, Item Analysis, Latent Trait Theory

Effect of Item Arrangement, Knowledge of Arrangement, and Test Anxiety on Two Scoring Methods.

Peer reviewed

Plake, Barbara S.; And Others – Journal of Experimental Education, 1981

Number right and elimination scores were analyzed on a college level mathematics exam assembled from pretest data. Anxiety measures were administered along with the experimental forms to undergraduates. Results suggest that neither test scores nor attitudes are influenced by item order knowledge thereof, or anxiety level. (Author/GK)

Descriptors: College Mathematics, Difficulty Level, Higher Education, Multiple Choice Tests

Some Exploratory Indices for Selection of a Test Equating Method.

Jaeger, Richard M. – 1980

Five statistical indices are developed and described which may be used for determining (1) when linear equating of two approximately parallel tests is adequate, and (2) whan a more complex method such as equipercentile equating must be used. The indices were based on: (1) similarity of cumulative score distributions; (2) shape of the raw-score to…

Descriptors: College Entrance Examinations, Difficulty Level, Equated Scores, Higher Education

Number Right and Elimination Score as a Function of Item Arrangement, Knowledge of Arrangement, and Test Anxiety.

Plake, Barbara S.; And Others – 1980

Number right and elimination scores were analyzed on a 48-item college level mathematics test that was assembled from pretest data in three forms by varying the item orderings: easy-hard, uniform, or random. Half of the forms contained information explaining the item arrangement and suggesting strategies for taking the test. Several anxiety…

Descriptors: Difficulty Level, Higher Education, Multiple Choice Tests, Quantitative Tests

The Merits of Multiple-Answer Items as Evaluated by Using Six Scoring Formulas.

Peer reviewed

Hsu, Tse-Chi; And Others – Journal of Experimental Education, 1984

The indices of item difficulty and discrimination, the coefficients of effective length, and the average item information for both single- and multiple-answer items using six different scoring formulas were computed and compared. These formulas vary in terms of the assignment of partial credit and the correction for guessing. (Author/BW)

Descriptors: College Entrance Examinations, Comparative Analysis, Difficulty Level, Guessing (Tests)

A Model for Evaluating the Assessment of Partial Knowledge.

Smith, Richard M. – 1982

There have been many attempts to formulate a procedure for extracting information from incorrect responses to multiple choice items, i.e., the assessment of partial knowledge. The results of these attempts can be described as inconsistent at best. It is hypothesized that these inconsistencies arise from three methodological problems: the…

Descriptors: Difficulty Level, Evaluation Methods, Goodness of Fit, Guessing (Tests)

Consideration for Sample Size in Reliability Studies for Mastery Tests. Publication Series in Mastery Testing.

Download full text

Saunders, Joseph C.; Huynh, Huynh – 1980

In most reliability studies, the precision of a reliability estimate varies inversely with the number of examinees (sample size). Thus, to achieve a given level of accuracy, some minimum sample size is required. An approximation for this minimum size may be made if some reasonable assumptions regarding the mean and standard deviation of the test…

Descriptors: Cutting Scores, Difficulty Level, Error of Measurement, Mastery Tests

Robbins-Monro Procedures for Tailored Testing

Peer reviewed

Lord, Frederic M. – Educational and Psychological Measurement, 1971

Descriptors: Ability, Adaptive Testing, Computer Oriented Programs, Difficulty Level

Adjusting Scores on Examinations Offering a Choice of Questions.

Download full text

Livingston, Samuel A. – 1986

This paper deals with test fairness regarding a test consisting of two parts: (1) a "common" section, taken by all students; and (2) a "variable" section, in which some students may answer a different set of questions from other students. For example, a test taken by several thousand students each year contains a common multiple-choice portion and…

Descriptors: Difficulty Level, Error of Measurement, Essay Tests, Mathematical Models

Applications of Adaptive Testing in Measuring Achievement and Performance.

Download full text

Bejar, Issac I. – 1976

The concept of testing for partial knowledge is considered with the concept of tailored testing. Following the special usage of latent trait theory, the word valdity is used to mean the correlation of a test with the construct the test measures. The concept of a method factor in the test is also considered as a part of the validity. The possible…

Descriptors: Achievement Tests, Adaptive Testing, Computer Assisted Testing, Confidence Testing

The Use of Precalibrated Item Bank to Establish and Maintain Cutoff Scores: A Case Study of the Florida Teacher Certification Examination.

Download full text

Legg, Sue M. – 1982

A case study of the Florida Teacher Certification Examination (FTCE) program was described to assist others launching the development of large scale item banks. FTCE has four subtests: Mathematics, Reading, Writing, and Professional Education. Rasch calibrated item banks have been developed for all subtests except Writing. The methods used to…

Descriptors: Cutting Scores, Difficulty Level, Field Tests, Item Analysis

Predictive Validity of Short Form Placement Tests under Two Scoring Systems.

Hisama, Kay K.; And Others – 1977

The optimal test length, using predictive validity as a criterion, depends on two major conditions: the appropriate item-difficulty rather than the total number of items, and the method used in scoring the test. These conclusions were reached when responses to a 100-item multi-level test of reading comprehension from 136 non-native speakers of…

Descriptors: College Students, Difficulty Level, English (Second Language), Foreign Students

Previous Page | Next Page »

Pages: 1 | 2

Plake, Barbara S.	2
Smith, Richard M.	2
Bejar, Issac I.	1
Brinzer, Raymond J.	1
Bulut, Okan	1
Dorans, Neil J.	1
Floyd, Harlee S.	1
Gierl, Mark J.	1
Guo, Qi	1
Hisama, Kay K.	1
Hsu, Tse-Chi	1
Huynh, Huynh	1
Jaeger, Richard M.	1
Kingston, Neal M.	1
Legg, Sue M.	1
Livingston, Samuel A.	1
Lord, Frederic M.	1
Mitchell, Virginia P.	1
Moore, Courtney A.	1
Morgan, Grant B.	1
Rippey, Robert M.	1
Saunders, Joseph C.	1
Zhang, Xinxin	1
More ▼