NotesFAQContact Us
Collection
Advanced
Search Tips
Showing all 12 results Save | Export
Sachar, Jane; Suppes, Patrick – 1977
It is sometimes desirable to obtain an estimated total-test score for an individual who was administered only a subset of the items in a total test. The present study compared six methods, two of which utilize the content structure of items, to estimate total-test scores using 450 students in grades 3-5 and 60 items of the ll0-item Stanford Mental…
Descriptors: Comparative Analysis, Elementary Education, Item Analysis, Item Banks
Myerberg, N. James – 1975
The effect of stratified sampling of items on the estimation of test score distribution parameters by multiple matrix sampling was studied. Item difficulty and/or interitem correlations were the bases of stratification. Various item iniverses were created by computer simulation and sampled according to several plans. The results indicate that…
Descriptors: Computer Programs, Item Analysis, Item Sampling, Matrices
Woodson, M. I. Charles E.
It has been argued that item variance and test variance are not necessary characteristics for criterion-referenced tests, although they are necessary for norm-referenced tests. This position is in error because it considers sample statistics as the criteria for evaluating items and tests. Within a particular sample, an item or test may have no…
Descriptors: Criterion Referenced Tests, Evaluation Criteria, Item Analysis, Item Sampling
Davis, Richard W.; Loadman, William E. – 1973
A subject by item matrix of test responses is shown to be a useful heuristic in criterion referenced and norm referenced test analysis, and in the teaching of measurement. The pattern of responses within the matrix provides indications of item interactions, weak deceptors, and conventional test statistics. The strong visual analogy between the…
Descriptors: Computer Programs, Item Analysis, Item Sampling, Matrices
Epstein, Kenneth I.; Knerr, Claramae S. – 1976
The literature on criterion referenced testing is full of discussions concerning whether classical measurement techniques are appropriate, whether variance is necessary, whether new indices of reliability are needed, and the like. What appears to be lacking, however, is a clear and simple discussion of why the problems occur. This paper suggests…
Descriptors: Career Development, Criterion Referenced Tests, Item Analysis, Item Sampling
Berk, Ronald A. – 1978
Sixteen item statistics recommended for use in the development of criterion-referenced tests were evaluated. There were two major criteria: (1) practicability in terms of ease of computation and interpretation and (2) meaningfulness in the context of the development process. Most of the statistics were based on a comparison of performance changes…
Descriptors: Achievement Tests, Criterion Referenced Tests, Difficulty Level, Guides
Fruchter, Dorothy A.; Ree, Malcolm James – 1977
In order to meet the needs of all the Armed Services, new forms of the Armed Services Vocational Aptitude Battery (ASVAB) must periodically be developed, refined, and standardized on an appropriate normative sample. Since one of the uses of the ASVAB is to determine candidate suitability for military service, it is necessary for the…
Descriptors: Aptitude Tests, Armed Forces, Equated Scores, Item Analysis
Scheetz, James P.; Forsyth, Robert A. – 1977
Empirical evidence is presented related to the effects of using a stratified sampling of items in multiple matrix sampling on the accuracy of estimates of the population mean. Data were obtained from a sample of 600 high school students for a 36-item mathematics test and a 40-item vocabulary test, both subtests of the Iowa Tests of Educational…
Descriptors: Achievement Tests, Difficulty Level, Item Analysis, Item Sampling
Douglass, James B. – 1979
A general process for testing the feasibility of applying alternative mathematical or statistical models to the solution of a practical problem is presented and flowcharted. The system is used to compare five models for test equating: (1) anchor test equating using classical test theory; (2) anchor test equating using the one-parameter logistic…
Descriptors: Comparative Analysis, Equated Scores, Flow Charts, Goodness of Fit
Harris, Chester W.; And Others – 1977
The implications of a mathematical model of test scores are explored where the data are limited to a random sample of items without replacement from an indefinitely large population or item domain in which items are scored either zero or one. The purpose is to obtain an unbiased estimate of a student's proportion of items correct in the item…
Descriptors: Academic Achievement, Achievement Tests, Annotated Bibliographies, Bibliographies
Lewy, Arieh; Doron, Rina – 1977
The concept of tailored testing for individuals is applied to the construction of tests for special groups and extended to apply to item content as well as item difficulty. It is suggested that evaluators may decide to construct tests on the basis of a unique combination of items drawn from an item bank to fit the need of a particular group. At…
Descriptors: Achievement Tests, Adaptive Testing, Criterion Referenced Tests, Group Norms
Lorton, Paul, Jr.; Searle, Barbara W. – 1976
A linear regression model was used to select items from a pool of 700 arithmetic word problems to be used in a computer-assisted mathematics curriculum for elementary school students. The experimental procedure first involved a stepwise linear regression analysis of a student's performance over a set of 25 problems. The probability correct for…
Descriptors: Computer Assisted Instruction, Computer Oriented Programs, Correlation, Elementary School Mathematics