ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	0
Since 2006 (last 20 years)	3

Descriptor

Scoring Formulas	12
Test Reliability	6
Multiple Choice Tests	5
Guessing (Tests)	4
Higher Education	4
Item Analysis	4
Scoring	4
Test Items	4
Mathematical Models	3
Scores	3
Test Validity	3
Weighted Scores	3
Comparative Analysis	2
Evaluation Methods	2
Item Response Theory	2
Models	2
Predictive Validity	2
Probability	2
Psychometrics	2
Achievement Tests	1
Analysis of Variance	1
Answer Keys	1
Cognitive Style	1
Cognitive Tests	1
Comparative Testing	1
More ▼

Source

Applied Psychological…

Author

Frary, Robert B.	2
Attali, Yigal	1
Bonett, Douglas G.	1
Claudy, John G.	1
Downey, Ronald G.	1
Drasgow, Fritz	1
Frederiksen, Norman	1
Garcia-Perez, Miguel A.	1
Kane, Michael	1
Kreiner, Svend	1
McGarvey, Bill	1
Moloney, James	1
Poizner, Sharon B.	1
Ward, William C.	1
More ▼

Publication Type

Journal Articles	7
Reports - Research	4
Reports - Evaluative	2
Reports - Descriptive	1

Education Level

Adult Education	1
Higher Education	1

Audience

Location

Denmark

Laws, Policies, & Programs

Assessments and Surveys

Armed Services Vocational…	1
Graduate Record Examinations	1
Rod and Frame Test	1

What Works Clearinghouse Rating

Showing all 12 results Save | Export

Immediate Feedback and Opportunity to Revise Answers: Application of a Graded Response IRT Model

Peer reviewed

Direct link

Attali, Yigal – Applied Psychological Measurement, 2011

Recently, Attali and Powers investigated the usefulness of providing immediate feedback on the correctness of answers to constructed response questions and the opportunity to revise incorrect answers. This article introduces an item response theory (IRT) model for scoring revised responses to questions when several attempts are allowed. The model…

Descriptors: Feedback (Response), Item Response Theory, Models, Error Correction

A Note on Item-Restscore Association in Rasch Models

Peer reviewed

Direct link

Kreiner, Svend – Applied Psychological Measurement, 2011

To rule out the need for a two-parameter item response theory (IRT) model during item analysis by Rasch models, it is important to check the Rasch model's assumption that all items have the same item discrimination. Biserial and polyserial correlation coefficients measuring the association between items and restscores are often used in an informal…

Descriptors: Item Analysis, Correlation, Item Response Theory, Models

Biserial Weights: A New Approach to Test Item Option Weighting

Peer reviewed

Claudy, John G. – Applied Psychological Measurement, 1978

Option weighting is an alternative to increasing test length as a means of improving the reliability of a test. The effects on test reliability of option weighting procedures were compared in two empirical studies using four independent sets of items. Biserial weights were found to be superior. (Author/CTM)

Descriptors: Higher Education, Item Analysis, Scoring Formulas, Test Items

The Effect of Misinformation, Partial Information, and Guessing on Expected Multiple-Choice Test Item Scores.

Peer reviewed

Frary, Robert B. – Applied Psychological Measurement, 1980

Six scoring methods for assigning weights to right or wrong responses according to various instructions given to test takers are analyzed with respect to expected change scores and the effect of various levels of information and misinformation. Three of the methods provide feedback to the test taker. (Author/CTM)

Descriptors: Guessing (Tests), Knowledge Level, Multiple Choice Tests, Scores

Modeling Incorrect Responses to Multiple-Choice Items with Multilinear Formula Score Theory.

Peer reviewed

Drasgow, Fritz; And Others – Applied Psychological Measurement, 1989

Multilinear formula scoring (MFS) is reviewed, with emphasis on estimating option characteristic curves (OCSs). MFS was used to estimate OCSs for the arithmetic reasoning subtest of the Armed Services Vocational Aptitude Battery for 2,978 examinees. A second analysis obtained OCSs for simulated data. The use of MFS is discussed. (SLD)

Descriptors: Estimation (Mathematics), Mathematical Models, Multiple Choice Tests, Scores

Psychometric Properties of Finite-State Scores versus Number-Correct and Formula Scores: A Simulation Study.

Peer reviewed

Garcia-Perez, Miguel A.; Frary, Robert B. – Applied Psychological Measurement, 1989

Simulation techniques were used to generate conventional test responses and track the proportion of alternatives examinees could classify independently before and after taking the test. Finite-state scores were compared with these actual values and with number-correct and formula scores. Finite-state scores proved useful. (TJH)

Descriptors: Comparative Analysis, Computer Simulation, Guessing (Tests), Mathematical Models

Robust Confidence Interval for a Ratio of Standard Deviations

Peer reviewed

Direct link

Bonett, Douglas G. – Applied Psychological Measurement, 2006

Comparing variability of test scores across alternate forms, test conditions, or subpopulations is a fundamental problem in psychometrics. A confidence interval for a ratio of standard deviations is proposed that performs as well as the classic method with normal distributions and performs dramatically better with nonnormal distributions. A simple…

Descriptors: Intervals, Mathematical Concepts, Comparative Analysis, Psychometrics

Scoring Field Dependence: A Methodological Analysis of Five Rod-and-Frame Scoring Systems

Peer reviewed

McGarvey, Bill; And Others – Applied Psychological Measurement, 1977

The most consistently used scoring system for the rod-and-frame task has been the total number of degrees in error from the true vertical. Since a logical case can be made for at least four alternative scoring systems, a thorough comparison of all five systems was performed. (Author/CTM)

Descriptors: Analysis of Variance, Cognitive Style, Cognitive Tests, Elementary Education

The Effect of Guessing on Item Reliability under Answer-Until-Correct Scoring

Peer reviewed

Kane, Michael; Moloney, James – Applied Psychological Measurement, 1978

The answer-until-correct (AUC) procedure requires that examinees respond to a multi-choice item until they answer it correctly. Using a modified version of Horst's model for examinee behavior, this paper compares the effect of guessing on item reliability for the AUC procedure and the zero-one scoring procedure. (Author/CTM)

Descriptors: Guessing (Tests), Item Analysis, Mathematical Models, Multiple Choice Tests

Alternative Response and Scoring Methods for Multiple Choice Items: An Empirical Study of Probabilistic and Ordinal Response Modes

Peer reviewed

Poizner, Sharon B.; And Others – Applied Psychological Measurement, 1978

Binary, probability, and ordinal scoring procedures for multiple-choice items were examined. In two situations, it was found that both the probability and ordinal scoring systems were more reliable than the binary scoring method. (Author/CTM)

Descriptors: Confidence Testing, Guessing (Tests), Higher Education, Multiple Choice Tests

Measures for the Study of Creativity in Scientific Problem-Solving

Peer reviewed

Frederiksen, Norman; Ward, William C. – Applied Psychological Measurement, 1978

A set of Tests of Scientific Thinking were developed for possible use as criterion measures in research on creativity. Scores on the tests describe both quality and quantity of ideas produced in formulating hypotheses, evaluating proposals, solving methodological problems, and devising methods for measuring constructs. (Author/CTM)

Descriptors: Creativity Tests, Higher Education, Item Sampling, Predictive Validity

Item-Option Weighting of Achievement Tests: Comparative Study of Methods.

Peer reviewed

Downey, Ronald G. – Applied Psychological Measurement, 1979

This research attempted to interrelate several methods of producing option weights (i.e., Guttman internal and external weights and judges' weights) and examined their effects on reliability and on concurrent, predictive, and face validity. It was concluded that option weighting offered limited, if any, improvement over unit weighting. (Author/CTM)

Descriptors: Achievement Tests, Answer Keys, Comparative Testing, High Schools