ERIC - Search Results

Showing 1 to 15 of 18 results

The Effect of Item Stratification on the Estimation of the Mean and Variance of Universe Scores in Multiple Matrix Sampling.

Peer reviewed

Myerberg, N. James – Educational and Psychological Measurement, 1979

The effect of stratified sampling of items based on item difficulty and/or interitem correlations on the estimation of test score distribution parameters using multiple matrix sampling was studied. Results indicated that stratification did not consistently improve the stability of parameter estimation. (Author/JKS)

Descriptors: Item Analysis, Item Sampling, Matrices, Technical Reports

Estimating Total-test Scores from Partial Scores in a Matrix Sampling Design.

Sachar, Jane; Suppes, Patrick – 1977

It is sometimes desirable to obtain an estimated total-test score for an individual who was administered only a subset of the items in a total test. The present study compared six methods, two of which utilize the content structure of items, to estimate total-test scores using 450 students in grades 3-5 and 60 items of the 110-item Stanford Mental…

Descriptors: Comparative Analysis, Elementary Education, Item Analysis, Item Banks

The Effect of Item Stratification in Multiple Matrix Sampling.

Myerberg, N. James – 1975

The effect of stratified sampling of items on the estimation of test score distribution parameters by multiple matrix sampling was studied. Item difficulty and/or interitem correlations were the bases of stratification. Various item universes were created by computer simulation and sampled according to several plans. The results indicate that…

Descriptors: Computer Programs, Item Analysis, Item Sampling, Matrices

Estimating Total-Test Scores from Partial Scores in a Matrix Sampling Design.

Peer reviewed

Sachar, Jane; Suppes, Patrick – Educational and Psychological Measurement, 1980

The present study compared six methods, two of which utilize the content structure of items, to estimate total-test scores using 450 students and 60 items of the 110-item Stanford Mental Arithmetic Test. Three methods yielded fairly good estimates of the total-test score. (Author/RL)

Descriptors: Content Analysis, Correlation, Item Analysis, Item Sampling

The Effects of Various Item Selection Methods on the Classification Accuracy and Classification Consistency of Criterion-Referenced Instruments.

Smith, Douglas U. – 1978

This study examined the effects of certain item selection methods on the classification accuracy and classification consistency of criterion-referenced instruments. Three item response data sets, representing varying situations of instructional effectiveness, were simulated. Five methods of item selection were then applied to each data set for the…

Descriptors: Criterion Referenced Tests, Item Analysis, Item Sampling, Latent Trait Theory

Criterion-Referenced Test Interpretations of "Classical" Measurement Theory.

Epstein, Kenneth I.; Knerr, Claramae S. – 1976

The literature on criterion referenced testing is full of discussions concerning whether classical measurement techniques are appropriate, whether variance is necessary, whether new indices of reliability are needed, and the like. What appears to be lacking, however, is a clear and simple discussion of why the problems occur. This paper suggests…

Descriptors: Career Development, Criterion Referenced Tests, Item Analysis, Item Sampling

Decision Reliability and Classification Validity for Decision Oriented Criterion-Referenced Tests.

Faggen, Jane – 1978

Formulas are presented for decision reliability and for classification validity for mastery/nonmastery decisions based on criterion referenced tests. Two item parameters are used: the probability of a master answering an item correctly, and the probability of a nonmaster answering an item incorrectly. The theory explores the relationships of…

Descriptors: Bayesian Statistics, Criterion Referenced Tests, Item Analysis, Item Banks

A Basic Test Theory Generalizable to Tailored Testing. Technical Report No. 1.

Cliff, Norman – 1975

Measures of consistency and completeness of order relations derived from test-type data are proposed. The measures are generalized to apply to incomplete data such as tailored testing. The measures are based on consideration of the items-plus-persons by items-plus-persons matrix as an adjacency matrix in which a 1 means that the row element…

Descriptors: Adaptive Testing, Career Development, Computer Oriented Programs, Individual Differences

A Consumers' Guide to Criterion-Referenced Test Item Statistics.

Berk, Ronald A. – 1978

Sixteen item statistics recommended for use in the development of criterion-referenced tests were evaluated. There were two major criteria: (1) practicability in terms of ease of computation and interpretation and (2) meaningfulness in the context of the development process. Most of the statistics were based on a comparison of performance changes…

Descriptors: Achievement Tests, Criterion Referenced Tests, Difficulty Level, Guides

Development of the Armed Services Vocational Aptitude Battery: Forms 8, 9, and 10. Final Report 19 December 1975-31 January 1977.

Fruchter, Dorothy A.; Ree, Malcolm James – 1977

In order to meet the needs of all the Armed Services, new forms of the Armed Services Vocational Aptitude Battery (ASVAB) must periodically be developed, refined, and standardized on an appropriate normative sample. Since one of the uses of the ASVAB is to determine candidate suitability for military service, it is necessary for the…

Descriptors: Aptitude Tests, Armed Forces, Equated Scores, Item Analysis

A Comparison of Simple Random Sampling Versus Stratification for Allocating Items to Subtests in Multiple Matrix Sampling.

Scheetz, James P.; Forsyth, Robert A. – 1977

Empirical evidence is presented related to the effects of using a stratified sampling of items in multiple matrix sampling on the accuracy of estimates of the population mean. Data were obtained from a sample of 600 high school students for a 36-item mathematics test and a 40-item vocabulary test, both subtests of the Iowa Tests of Educational…

Descriptors: Achievement Tests, Difficulty Level, Item Analysis, Item Sampling

Scale-Score Reporting of National Assessment Data (Final Report).

Mislevy, Robert J.; And Others – 1982

An approach was developed based on item-response models defined at the level of salient subject groups rather than at the level of individuals, designed for use with multiple-matrix sampling designs. In each of three National Assessment of Educational Progress (NAEP) mathematics subtopics, Reiser's group-effects latent trait model was fitted to…

Descriptors: Educational Assessment, Item Analysis, Item Sampling, Latent Trait Theory

Riding the Rasch Tiger. Part 1: Laying the Item Bank Foundation (Paul Volker Would Approve).

Forster, Fred – 1987

Studies carried out over a 12-year period addressed fundamental questions on the use of Rasch-based item banks. Large field tests administered in grades 3-8 of reading, mathematics, and science items, as well as standardized test results were used to explore the possible effects of many factors on item calibrations. In general, the results…

Descriptors: Achievement Tests, Difficulty Level, Elementary Education, Item Analysis

Group Tailored Tests and Some Problems of their Utilization.

Lewy, Arieh; Doron, Rina – 1977

The concept of tailored testing for individuals is applied to the construction of tests for special groups and extended to apply to item content as well as item difficulty. It is suggested that evaluators may decide to construct tests on the basis of a unique combination of items drawn from an item bank to fit the need of a particular group. At…

Descriptors: Achievement Tests, Adaptive Testing, Criterion Referenced Tests, Group Norms

Characteristics of Samples and Linking Items Affecting a Partial Pre-Calibrations Design.

Cook, Linda L.; And Others – 1987

This study tests several explanations for discrepant results in an earlier study (Cook et al., 1985) which presented a partial pre-calibration method for equating new editions of the Scholastic Aptitude Test (SAT) to the same scale as older editions. In contrast to full pre-calibration, which seeks to equate all items from two or more editions,…

Descriptors: College Entrance Examinations, Concurrent Validity, Equated Scores, Estimation (Mathematics)
