ERIC - Search Results

Descriptor

Scoring Formulas	5
Test Items	5
Test Length	5
Test Reliability	4
Difficulty Level	2
Error of Measurement	2
Latent Trait Theory	2
Monte Carlo Methods	2
Multiple Choice Tests	2
Scores	2
Achievement Tests	1
Adaptive Testing	1
College Students	1
Computer Assisted Testing	1
Cutting Scores	1
English (Second Language)	1
Equated Scores	1
Foreign Students	1
Guessing (Tests)	1
Higher Education	1
Item Analysis	1
Licensing Examinations…	1
Mastery Tests	1
Maximum Likelihood Statistics	1
Models	1
More ▼

Source

Assessment & Evaluation in…

Author

Burton, Richard F.	1
Gilmer, Jerry S.	1
Hisama, Kay K.	1
Huynh, Huynh	1
Lenel, Julia C.	1
Maurelli, Vincent A.	1
Saunders, Joseph C.	1
Weiss, David J.	1

Publication Type

Reports - Research	4
Speeches/Meeting Papers	2
Journal Articles	1
Reports - Evaluative	1

Education Level

Audience

Researchers

Location

Laws, Policies, & Programs

Assessments and Surveys

Comprehensive Tests of Basic…

What Works Clearinghouse Rating

Showing all 5 results Save | Export

Multiple Choice and True/False Tests: Reliability Measures and Some Implications of Negative Marking

Peer reviewed

Direct link

Burton, Richard F. – Assessment & Evaluation in Higher Education, 2004

The standard error of measurement usefully provides confidence limits for scores in a given test, but is it possible to quantify the reliability of a test with just a single number that allows comparison of tests of different format? Reliability coefficients do not do this, being dependent on the spread of examinee attainment. Better in this…

Descriptors: Multiple Choice Tests, Error of Measurement, Test Reliability, Test Items

Consideration for Sample Size in Reliability Studies for Mastery Tests. Publication Series in Mastery Testing.

Download full text

Saunders, Joseph C.; Huynh, Huynh – 1980

In most reliability studies, the precision of a reliability estimate varies inversely with the number of examinees (sample size). Thus, to achieve a given level of accuracy, some minimum sample size is required. An approximation for this minimum size may be made if some reasonable assumptions regarding the mean and standard deviation of the test…

Descriptors: Cutting Scores, Difficulty Level, Error of Measurement, Mastery Tests

The Effect of Keying All Options Correct on Equating Functions and Scores.

Download full text

Lenel, Julia C.; Gilmer, Jerry S. – 1986

In some testing programs an early item analysis is performed before final scoring in order to validate the intended keys. As a result, some items which are flawed and do not discriminate well may be keyed so as to give credit to examinees no matter which answer was chosen. This is referred to as allkeying. This research examined how varying the…

Descriptors: Equated Scores, Item Analysis, Latent Trait Theory, Licensing Examinations (Professions)

Factors Influencing the Psychometric Characteristics of an Adaptive Testing Strategy for Test Batteries.

Download full text

Maurelli, Vincent A.; Weiss, David J. – 1981

A monte carlo simulation was conducted to assess the effects in an adaptive testing strategy for test batteries of varying subtest order, subtest termination criterion, and variable versus fixed entry on the psychometric properties of an existent achievement test battery. Comparisons were made among conventionally administered tests and adaptive…

Descriptors: Achievement Tests, Adaptive Testing, Computer Assisted Testing, Latent Trait Theory

Predictive Validity of Short Form Placement Tests under Two Scoring Systems.

Hisama, Kay K.; And Others – 1977

The optimal test length, using predictive validity as a criterion, depends on two major conditions: the appropriate item-difficulty rather than the total number of items, and the method used in scoring the test. These conclusions were reached when responses to a 100-item multi-level test of reading comprehension from 136 non-native speakers of…

Descriptors: College Students, Difficulty Level, English (Second Language), Foreign Students