Descriptor
Difficulty Level | 5 |
Scoring Formulas | 5 |
Testing Problems | 5 |
Test Construction | 3 |
Test Items | 3 |
Equated Scores | 2 |
Higher Education | 2 |
Item Analysis | 2 |
Latent Trait Theory | 2 |
Scaling | 2 |
Statistical Analysis | 2 |
More ▼ |
Source
Author
Church, Austin T. | 1 |
Jaeger, Richard M. | 1 |
Legg, Sue M. | 1 |
Livingston, Samuel A. | 1 |
Weiss, David J. | 1 |
Yen, Wendy M. | 1 |
Publication Type
Reports - Research | 4 |
Speeches/Meeting Papers | 4 |
Reports - Evaluative | 1 |
Education Level
Audience
Researchers | 1 |
Location
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Yen, Wendy M. – 1982
Test scores that are not perfectly reliable cannot be strictly equated unless they are strictly parallel. This fact implies that tau equivalence can be lost if an equipercentile equating is applied to observed scores that are not strictly parallel. Thirty-six simulated data sets are produced to simulate equating tests with different difficulties…
Descriptors: Difficulty Level, Equated Scores, Latent Trait Theory, Methods
Jaeger, Richard M. – 1980
Five statistical indices are developed and described which may be used for determining (1) when linear equating of two approximately parallel tests is adequate, and (2) whan a more complex method such as equipercentile equating must be used. The indices were based on: (1) similarity of cumulative score distributions; (2) shape of the raw-score to…
Descriptors: College Entrance Examinations, Difficulty Level, Equated Scores, Higher Education
Livingston, Samuel A. – 1986
This paper deals with test fairness regarding a test consisting of two parts: (1) a "common" section, taken by all students; and (2) a "variable" section, in which some students may answer a different set of questions from other students. For example, a test taken by several thousand students each year contains a common multiple-choice portion and…
Descriptors: Difficulty Level, Error of Measurement, Essay Tests, Mathematical Models
Legg, Sue M. – 1982
A case study of the Florida Teacher Certification Examination (FTCE) program was described to assist others launching the development of large scale item banks. FTCE has four subtests: Mathematics, Reading, Writing, and Professional Education. Rasch calibrated item banks have been developed for all subtests except Writing. The methods used to…
Descriptors: Cutting Scores, Difficulty Level, Field Tests, Item Analysis
Church, Austin T.; Weiss, David J. – 1980
A pilot study on the development and administration of a test using a spatial reasoning problem, the 15-puzzle, is described. The test utilizes on-line capabilities of a real-time computer to record an examinee's progress on each problem through a sequence of problem-solving "moves", and to collect additional on-line data that might be…
Descriptors: Adaptive Testing, Cognitive Measurement, Computer Assisted Testing, Difficulty Level