Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 1 |
Since 2006 (last 20 years) | 5 |
Descriptor
Error of Measurement | 22 |
Item Sampling | 22 |
Statistical Analysis | 10 |
Matrices | 6 |
Test Reliability | 6 |
Sampling | 5 |
Statistical Bias | 5 |
Measurement Techniques | 4 |
Achievement Tests | 3 |
Item Analysis | 3 |
Item Response Theory | 3 |
More ▼ |
Source
Educational and Psychological… | 4 |
Psychometrika | 2 |
Applied Measurement in… | 1 |
ETS Research Report Series | 1 |
Journal of Educational… | 1 |
Journal of Educational and… | 1 |
Psychological Methods | 1 |
Studies in Educational… | 1 |
Author
Publication Type
Reports - Research | 9 |
Journal Articles | 8 |
Reports - Descriptive | 4 |
Non-Print Media | 1 |
Reference Materials -… | 1 |
Speeches/Meeting Papers | 1 |
Education Level
Elementary Secondary Education | 1 |
Grade 8 | 1 |
Junior High Schools | 1 |
Middle Schools | 1 |
Secondary Education | 1 |
Audience
Location
Laws, Policies, & Programs
Assessments and Surveys
National Assessment of… | 2 |
Trends in International… | 2 |
California Psychological… | 1 |
Program for International… | 1 |
Progress in International… | 1 |
What Works Clearinghouse Rating
Martin, Michael O.; Mullis, Ina V. S. – Journal of Educational and Behavioral Statistics, 2019
International large-scale assessments of student achievement such as International Association for the Evaluation of Educational Achievement's Trends in International Mathematics and Science Study (TIMSS) and Progress in International Reading Literacy Study and Organization for Economic Cooperation and Development's Program for International…
Descriptors: Achievement Tests, International Assessment, Mathematics Tests, Science Achievement
Rutkowski, Leslie – Applied Measurement in Education, 2014
Large-scale assessment programs such as the National Assessment of Educational Progress (NAEP), Trends in International Mathematics and Science Study (TIMSS), and Programme for International Student Assessment (PISA) use a sophisticated assessment administration design called matrix sampling that minimizes the testing burden on individual…
Descriptors: Measurement, Testing, Item Sampling, Computation
Puhan, Gautam; Moses, Tim; Grant, Mary; McHale, Fred – ETS Research Report Series, 2008
A single group (SG) equating design with nearly equivalent test forms (SiGNET) design was developed by Grant (2006) to equate small volume tests. The basis of this design is that examinees take two largely overlapping test forms within a single administration. The scored items for the operational form are divided into mini-tests called testlets.…
Descriptors: Data Collection, Equated Scores, Item Sampling, Sample Size
Schumacker, Randall E.; Smith, Everett V., Jr. – Educational and Psychological Measurement, 2007
Measurement error is a common theme in classical measurement models used in testing and assessment. In classical measurement models, the definition of measurement error and the subsequent reliability coefficients differ on the basis of the test administration design. Internal consistency reliability specifies error due primarily to poor item…
Descriptors: Measurement Techniques, Error of Measurement, Item Sampling, Item Response Theory

Barcikowski, Robert S. – Educational and Psychological Measurement, 1974
Descriptors: Error of Measurement, Item Sampling, Testing Problems

Pandey, Tej N.; Shoemaker, David M. – Educational and Psychological Measurement, 1975
Described herein are formulas and computational procedures for estimating the mean and second through fourth central moments of universe scores through multiple matrix sampling. Additionally, procedures are given for approximating the standard error associated with each estimate. All procedures are applicable when items are scored either…
Descriptors: Error of Measurement, Item Sampling, Matrices, Scoring Formulas
Pandey, Tej N. – 1975
Standard errors of pooled mean estimate in multiple matrix sampling were compared for two procedures. The data were from tests involving items with and without replacement. The two procedures involve the formulations of Madow and Lord, and Novick; the former permits sampling of item, with or without replacement, whereas the latter is to be used…
Descriptors: Comparative Analysis, Error of Measurement, Item Sampling, Matrices
Shoemaker, David M. – 1972
Investigated empirically through post mortem item-examinee sampling was the feasibility of the jackknife as a procedure for approximating standard errors of estimate in multiple matrix sampling. The parameters estimated were the mean test score, second through fourth central moments of the test score distribution, and the variance of the item…
Descriptors: Error of Measurement, Error Patterns, Item Sampling, Matrices
Shoemaker, David M. – 1972
Investigated empirically through post mortem item-examinee sampling were the relative merits of two alternative procedures for allocating items to subtests in multiple matrix sampling and the feasibility of using the jackknife in approximating standard errors of estimate. The results indicate clearly that a partially balanced incomplete block…
Descriptors: Error of Measurement, Item Sampling, Matrices, Sampling

Shoemaker, David M. – Journal of Educational Measurement, 1973
Investigated empirically through post mortem item-examinee samplings were the relative merits of two alternative procedures for allocating items to subtests in multiple matrix sampling and the feasibility of using the jackknife in approximating standard errors of estimate. (Editor)
Descriptors: Databases, Error of Measurement, Item Sampling, Research Design

Jarjoura, David – Psychometrika, 1983
The problem of predicting universe scores for samples of examinees based on their responses to samples of items is treated. The measurement model categorizes items according to the cells of a table of test specifications, and the linear function derived for minimizing error variance in prediction uses responses to these categories. (Author/JKS)
Descriptors: Error of Measurement, Generalizability Theory, Item Sampling, Prediction

Shoemaker, David M. – Educational and Psychological Measurement, 1972
Descriptors: Difficulty Level, Error of Measurement, Item Sampling, Simulation
Adams, Raymond J. – Studies in Educational Evaluation, 2005
Test reliability is a concept central to classical test theory and it is commonly stated as a requirement that a test attain a certain level of reliability before it be considered of sufficient quality for practical use. This article discusses the role of reliability in item response theory, and in particular the role of reliability in contexts…
Descriptors: Test Reliability, Error of Measurement, Item Sampling, Item Response Theory
Moy, Mabel L. Y.; Barcikowski, Robert S. – 1973
Using a computer-based Monte Carlo approach to generate item responses, the results of this study indicate that, when item discrimination indices are considered, item-examinee sampling procedures having the same number of observations have different standard errors in estimating both test mean and test variance. With certain types of tests, a…
Descriptors: Error of Measurement, Evaluation Methods, Item Sampling, Monte Carlo Methods

Lord, Frederic M. – Psychometrika, 1985
Given a loss function, an asymptotic method for optimal test design for a specified target population of examinees is presented. Also, of more practical use, given an existing unidimensional test and target population, a way is presented to find the loss function for which the test is optimal. (NSF)
Descriptors: Error of Measurement, Higher Education, Item Sampling, Latent Trait Theory
Previous Page | Next Page ยป
Pages: 1 | 2