Süleyman Demir; Derya Çobanoglu Aktan; Nese Güler – International Journal of Assessment Tools in Education, 2023
This study has two main purposes: first, to compare the different item selection methods and stopping rules used in Computerized Adaptive Testing (CAT) applications, using simulated data generated from the item parameters of the Vocational Maturity Scale; second, to test the validity of CAT application scores. For the first purpose,…
Descriptors: Computer Assisted Testing, Adaptive Testing, Vocational Maturity, Measures (Individuals)
Floyd, Randy G.; Shands, Elizabeth I.; Rafael, Fawziya A.; Bergeron, Renee; McGrew, Kevin S. – Intelligence, 2009
To understand the extent to which the general-factor loadings of tests are inherent in their characteristics or due to the sampling of tests, the number of tests in the correlation matrix, and the factor-extraction methods used to obtain them, test scores from a large sample of young adults were inserted into independent and overlapping batteries…
Descriptors: Generalizability Theory, Young Adults, Factor Analysis, Correlation
Waller, Niels G. – Applied Psychological Measurement, 2008
Reliability is a property of test scores from individuals who have been sampled from a well-defined population. Reliability indices, such as coefficient alpha and related formulas for internal consistency reliability (KR-20, Hoyt's reliability), yield lower bound reliability estimates when (a) subjects have been sampled from a single population and when…
Descriptors: Test Items, Reliability, Scores, Psychometrics
Kleinke, David J. – 1973
In a post mortem study, it is demonstrated that linear prediction is as effective as computing a negative hyper-geometric distribution for estimating test norms following matrix sampling from a total test with a highly skewed score distribution, provided the same prediction coefficient is used for all examinee groups. It is also demonstrated…
Descriptors: Item Sampling, Norms, Predictive Measurement, Research Reports

Shoemaker, David M. – Journal of Educational Measurement, 1973
The relative merits of two alternative procedures for allocating items to subtests in multiple matrix sampling, and the feasibility of using the jackknife to approximate standard errors of estimate, were investigated empirically through post mortem item-examinee samplings. (Editor)
Descriptors: Databases, Error of Measurement, Item Sampling, Research Design

Jarjoura, David – Psychometrika, 1983
The problem of predicting universe scores for samples of examinees based on their responses to samples of items is treated. The measurement model categorizes items according to the cells of a table of test specifications, and the linear function derived for minimizing error variance in prediction uses responses to these categories. (Author/JKS)
Descriptors: Error of Measurement, Generalizability Theory, Item Sampling, Prediction
Sachar, Jane; Suppes, Patrick – 1977
It is sometimes desirable to obtain an estimated total-test score for an individual who was administered only a subset of the items in a total test. The present study compared six methods, two of which utilize the content structure of items, to estimate total-test scores using 450 students in grades 3-5 and 60 items of the 110-item Stanford Mental…
Descriptors: Comparative Analysis, Elementary Education, Item Analysis, Item Banks

Hill, Richard K. – 1974
When norming tests, it may be preferable to use the matrix sampling technique. The results from the samples may be used to estimate what the distribution of scores would have been if each subject had taken all the items. This paper compares four methods for making these estimates. The sample size made it possible to compare the techniques in a…
Descriptors: Bayesian Statistics, Comparative Analysis, Data Analysis, Item Sampling

Misanchuk, Earl R. – 1978
Multiple matrix sampling of three subscales of the California Psychological Inventory was used to investigate the effects of four variables on error estimates of the mean (EEM) and variance (EEV). The four variables were examinee population size (600, 450, 300, 150, 100, and 75); number of subtests (2, 3, 4, 5, 6, and 7), hence the number of…
Descriptors: Adults, Analysis of Variance, Error of Measurement, Item Sampling
Bock, R. Darrell; Mislevy, Robert J. – New Directions for Testing and Measurement, 1981
The California Assessment Program's application of matrix sampling and item response curve theory to the scaling and reporting of state assessment data is described. It is designed to express educational outcomes in an efficient and interpretable form that is both immediately informative and suited to analysis over extended periods of time.…
Descriptors: Basic Skills, Educational Assessment, Factor Analysis, Item Banks
Mislevy, Robert J.; And Others – 1982
An approach was developed based on item-response models defined at the level of salient subject groups rather than at the level of individuals, designed for use with multiple-matrix sampling designs. In each of three National Assessment of Educational Progress (NAEP) mathematics subtopics, Reiser's group-effects latent trait model was fitted to…
Descriptors: Educational Assessment, Item Analysis, Item Sampling, Latent Trait Theory
Nitko, Anthony J. – 1970
Criterion-referenced testing is defined and some of its background is discussed. A distinction is made between criterion-referenced scores, norm-referenced scores, cut-off scores, criterion scores, criterion variables, and content-standard scores. The relationship between norm-referenced information and criterion-referenced information is…
Descriptors: Academic Achievement, Criterion Referenced Tests, Decision Making, Educational Objectives
Educational Testing Service, Princeton, NJ. – 1977
The 1976 Educational Testing Service (ETS) Invitational Conference served as a platform for individuals who have been prominent in educational measurement and research to present their views on issues surrounding the testing controversy. The 1976 ETS "The Testing Scene: Chaos and Controversy," presents a historical review of events surrounding the…
Descriptors: Achievement Tests, Adaptive Testing, Awards, Career Development