Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 7 |
| Since 2017 (last 10 years) | 15 |
| Since 2007 (last 20 years) | 36 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Researchers | 7 |
| Policymakers | 1 |
| Practitioners | 1 |
| Students | 1 |
Location
| Oregon | 2 |
| Pennsylvania | 2 |
| Turkey | 2 |
| United Kingdom (England) | 2 |
| Australia | 1 |
| Bosnia and Herzegovina | 1 |
| China | 1 |
| Croatia | 1 |
| Florida | 1 |
| Germany | 1 |
| Israel | 1 |
| More ▼ | |
Laws, Policies, & Programs
| Elementary and Secondary… | 1 |
| No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
OECD Publishing (NJ1), 2012
The "PISA 2009 Technical Report" describes the methodology underlying the PISA 2009 survey. It examines additional features related to the implementation of the project at a level of detail that allows researchers to understand and replicate its analyses. The reader will find a wealth of information on the test and sample design,…
Descriptors: Quality Control, Research Reports, Research Methodology, Evaluation Criteria
PDF pending restorationScheetz, James P.; Forsyth, Robert A. – 1977
The choice of design parameters (e.g., number of subtests, number of items per subtest, and number of examinees per subtest) can be controlled by the test constructor in a multiple matrix sampling evaluation. The purpose of this study was to determine empirically which combination of the above parameters produces the smallest standard errors of…
Descriptors: Error Patterns, Item Sampling
Hess, Karin K.; Jones, Ben S.; Carlock, Dennis; Walkup, John R. – Online Submission, 2009
To teach the rigorous skills and knowledge students need to succeed in future college-entry courses and workforce training programs, education stakeholders have increasingly called for more rigorous curricula, instruction, and assessments. Identifying the critical attributes of rigor and measuring its appearance in curricular materials is…
Descriptors: Educational Objectives, Classification, Matrices, Curriculum Development
Peer reviewedShoemaker, David M. – Journal of Educational Measurement, 1970
Descriptors: Item Sampling, Norms, Test Interpretation
Fishbein, Ronald L.; Shoemaker, David M. – 1977
The administrative and political ramifications of a form of multiple matrix sampling where all students in the population are tested on a subtest is discussed in the context of a statewide testing program. The conclusion is drawn that large-scale testing programs will have to adopt a testing framework which samples items from the total item domain…
Descriptors: Item Sampling, State Programs, Testing Programs
Scheetz, James P. – 1976
When performing large scale evaluations (e.g., on a state-wide or national level) it may not be possible to administer all items in the item universe to all respondents in the subject population. One method which has been proposed to sample both items and respondents is multiple matrix sampling (MMS) in which a sample of the items is administered…
Descriptors: Item Sampling, Statistical Analysis, Testing Programs
Peer reviewedForsyth, Robert A. – Educational and Psychological Measurement, 1976
Shoemaker's conclusions related to the influence of various data base characteristics (reliability, variability of item difficulty indices, and degree of skewness in the normative distribution) on the standard error of a mean estimated via multiple matrix sampling procedures are examined. (Author/RC)
Descriptors: Item Sampling, Statistical Analysis, Test Reliability
Peer reviewedWellington, Roger – Psychometrika, 1976
Generalized symmetric means are redefined in a way which allows them to be calculated for any matrix sampling design. It is proved that these sample generalized symmetric means are unbiased estimates of the analogous population generalized symmetric means. Illustrative examples are given. (Author)
Descriptors: Item Sampling, Matrices, Research Design, Sampling
Peer reviewedSirotnik, Ken – Educational and Psychological Measurement, 1972
This note refers to EJ 056 482. (CB)
Descriptors: Item Analysis, Item Sampling, Mathematical Applications
Rudd, Andy; Johnson, R. Burke – Studies in Educational Evaluation, 2008
As a result of the federal No Child Left Behind Act (NCLB) of 2002, the field of education has seen a heavy emphasis on the use of "scientifically based research" for designing and testing the effectiveness of new and existing educational programs. According to NCLB, when addressing basic cause and effect questions scientifically based…
Descriptors: Quasiexperimental Design, Scientific Research, Educational Research, Federal Legislation
Schumacker, Randall E.; Smith, Everett V., Jr. – Educational and Psychological Measurement, 2007
Measurement error is a common theme in classical measurement models used in testing and assessment. In classical measurement models, the definition of measurement error and the subsequent reliability coefficients differ on the basis of the test administration design. Internal consistency reliability specifies error due primarily to poor item…
Descriptors: Measurement Techniques, Error of Measurement, Item Sampling, Item Response Theory
Peer reviewedBarcikowski, Robert S. – Educational and Psychological Measurement, 1974
Descriptors: Error of Measurement, Item Sampling, Testing Problems
Peer reviewedSerlin, Ronald C.; Kaiser, Henry F. – Educational and Psychological Measurement, 1976
Internal consistency as one rationale for item selection from the unverse of possible test items is discussed and formulae are presented which relate the maximum internal consistency of a test to the largest eigenvalue of the interitem correlation matrix. A computer program to perform these calculations is presented. (Author/JKS)
Descriptors: Computer Programs, Item Sampling, Matrices, Test Construction
Peer reviewedShoemaker, David M.; Knapp, Thomas R. – Journal of Educational Measurement, 1974
Descriptors: Classification, Definitions, Item Sampling, Subject Index Terms
Peer reviewedShoemaker, David M. – Journal of Educational Measurement, 1971
Results indicate that scale values can be approximated satisfactorily through item-examinee sampling. Defining one observation as the response made by one examinee to one item, the similarity between the estimated scale values and normative scale values increased generally with increases in the number of observations acquired by the sampling plan.…
Descriptors: Attitudes, Item Sampling, Norms, Statistical Analysis

Direct link
