Marc Brysbaert – Cognitive Research: Principles and Implications, 2024
Experimental psychology is witnessing an increase in research on individual differences, which requires the development of new tasks that can reliably assess variations among participants. To do this, cognitive researchers need statistical methods that many researchers have not learned during their training. The lack of expertise can pose…
Descriptors: Experimental Psychology, Individual Differences, Statistical Analysis, Task Analysis
Bashkov, Bozhidar M.; Clauser, Jerome C. – Practical Assessment, Research & Evaluation, 2019
Successful testing programs rely on high-quality test items to produce reliable scores and defensible exams. However, determining what statistical screening criteria are most appropriate to support these goals can be daunting. This study describes and demonstrates cost-benefit analysis as an empirical approach to determining appropriate screening…
Descriptors: Test Items, Test Reliability, Evaluation Criteria, Accuracy
Inglés, Cándido J.; Aparisi, D.; Delgado, B.; Granados, L.; García-Fernández, José M. – Electronic Journal of Research in Educational Psychology, 2017
Introduction: The aim of this study was to analyze the relationship between sociometric types, behavioral categories and self-attributions for academic failure ("Ability", "Effort" or "External Causes") in "Reading", "Mathematics" and "General". Method: The total sample was composed of…
Descriptors: Academic Failure, Secondary Education, Sociometric Techniques, Foreign Countries
Hecht, Martin; Weirich, Sebastian; Siegle, Thilo; Frey, Andreas – Educational and Psychological Measurement, 2015
Multiple matrix designs are commonly used in large-scale assessments to distribute test items to students. These designs comprise several booklets, each containing a subset of the complete item pool. Besides reducing the test burden of individual students, using various booklets allows aligning the difficulty of the presented items to the assumed…
Descriptors: Measurement, Item Sampling, Statistical Analysis, Models
Lorié, William A. – Online Submission, 2013
A reverse engineering approach to automatic item generation (AIG) was applied to a figure-based publicly released test item from the Organisation for Economic Cooperation and Development (OECD) Programme for International Student Assessment (PISA) mathematical literacy cognitive instrument as part of a proof of concept. The author created an item…
Descriptors: Numeracy, Mathematical Concepts, Mathematical Logic, Difficulty Level
Waller, Niels G. – Applied Psychological Measurement, 2008
Reliability is a property of test scores from individuals who have been sampled from a well-defined population. Reliability indices, such as coefficient alpha and related formulas for internal consistency reliability (KR-20, Hoyt's reliability), yield lower bound reliability estimates when (a) subjects have been sampled from a single population and when…
Descriptors: Test Items, Reliability, Scores, Psychometrics
Scheetz, James P. – 1976
When performing large scale evaluations (e.g., on a state-wide or national level) it may not be possible to administer all items in the item universe to all respondents in the subject population. One method which has been proposed to sample both items and respondents is multiple matrix sampling (MMS) in which a sample of the items is administered…
Descriptors: Item Sampling, Statistical Analysis, Testing Programs
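The multiple matrix sampling idea summarized in the abstract above can be illustrated with a minimal Python sketch. It is not taken from the cited report: the function name `mms_mean_estimate`, its parameters, and the design choices (items split without replacement into subtests, each subtest given to a separate random group of examinees, the mean total score estimated as the sum of the subtest means) are assumptions made for illustration.

```python
import random


def mms_mean_estimate(scores, n_subtests, examinees_per_subtest, seed=0):
    """Estimate the mean total test score from a multiple matrix sample.

    scores: list of rows (one per examinee), each a list of item scores.
    Hypothetical design: items are split without replacement into
    n_subtests subtests, and each subtest is administered to a different
    random group of examinees_per_subtest examinees.
    """
    if examinees_per_subtest < 1:
        raise ValueError("each subtest needs at least one examinee")
    n_examinees, n_items = len(scores), len(scores[0])
    if n_subtests * examinees_per_subtest > n_examinees:
        raise ValueError("not enough examinees for disjoint groups")

    rng = random.Random(seed)
    items = list(range(n_items))
    people = list(range(n_examinees))
    rng.shuffle(items)
    rng.shuffle(people)

    estimate = 0.0
    for t in range(n_subtests):
        # Every n_subtests-th shuffled item goes to subtest t.
        subtest_items = items[t::n_subtests]
        start = t * examinees_per_subtest
        group = people[start:start + examinees_per_subtest]
        # Only these examinee-by-item cells are "observed".
        subtest_totals = [sum(scores[p][i] for i in subtest_items) for p in group]
        # The subtest mean estimates the contribution of its items
        # to the mean score on the full item universe.
        estimate += sum(subtest_totals) / len(group)
    return estimate
```

Each call observes only `n_subtests` blocks of the full examinee-by-item matrix, which is the point of the design: no examinee answers every item, yet the group-level mean is still estimated.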

Forsyth, Robert A. – Educational and Psychological Measurement, 1976
Shoemaker's conclusions related to the influence of various data base characteristics (reliability, variability of item difficulty indices, and degree of skewness in the normative distribution) on the standard error of a mean estimated via multiple matrix sampling procedures are examined. (Author/RC)
Descriptors: Item Sampling, Statistical Analysis, Test Reliability

Shoemaker, David M. – Journal of Educational Measurement, 1971
Results indicate that scale values can be approximated satisfactorily through item-examinee sampling. Defining one observation as the response made by one examinee to one item, the similarity between the estimated scale values and normative scale values increased generally with increases in the number of observations acquired by the sampling plan.…
Descriptors: Attitudes, Item Sampling, Norms, Statistical Analysis

Dziuban, Charles D.; And Others – Educational and Psychological Measurement, 1979
The distributional characteristics of Kaiser's Measure of Sampling Adequacy (MSA) were investigated in sample matrices generated from multivariate normal populations of specified correlation levels. Systematic variation of sample size and number of variables revealed the overall MSA to be most influenced by the number of variables. (Author/JKS)
Descriptors: Correlation, Factor Analysis, Item Sampling, Psychometrics
Pandey, Tej N.; Hubert, Lawrence J. – 1974
This investigation had two major purposes. The first was to explore the use of an inferential technique called Tukey's Jackknife in establishing a confidence interval about coefficient alpha reliability. The second purpose was to study the robustness of the Feldt and the jackknife procedures when the data fail to satisfy usual normality…
Descriptors: Comparative Analysis, Item Sampling, Statistical Analysis, Statistics
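A jackknife confidence interval for coefficient alpha, as mentioned in the abstract above, can be sketched in Python as follows. This is a generic delete-one-examinee jackknife and not necessarily the procedure used in the cited study: the function names, the absence of any transformation of alpha, and the default normal critical value of 1.96 (a t quantile may be preferable for small samples) are all assumptions.

```python
from statistics import mean, stdev, variance


def coefficient_alpha(scores):
    """Coefficient alpha for a list of examinee rows of item scores."""
    n_items = len(scores[0])
    item_vars = [variance([row[i] for row in scores]) for i in range(n_items)]
    total_var = variance([sum(row) for row in scores])
    return n_items / (n_items - 1) * (1 - sum(item_vars) / total_var)


def jackknife_alpha_interval(scores, critical_value=1.96):
    """Return (estimate, lower, upper) from a delete-one-examinee jackknife.

    Assumes at least three examinees and nonzero total-score variance
    in every delete-one subsample.
    """
    n = len(scores)
    full = coefficient_alpha(scores)
    pseudo_values = []
    for j in range(n):
        reduced = scores[:j] + scores[j + 1:]  # drop examinee j
        pseudo_values.append(n * full - (n - 1) * coefficient_alpha(reduced))
    estimate = mean(pseudo_values)
    std_error = stdev(pseudo_values) / n ** 0.5
    margin = critical_value * std_error
    return estimate, estimate - margin, estimate + margin
```

For example, `jackknife_alpha_interval(data)` on a small examinee-by-item score matrix returns the jackknife estimate of alpha together with the lower and upper interval limits.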

Sirotnik, Ken – Educational and Psychological Measurement, 1970
Descriptors: Analysis of Variance, Item Sampling, Mathematical Models, Statistical Analysis

Pandey, Tej N.; Shoemaker, David M. – Educational and Psychological Measurement, 1975
Described herein are formulas and computational procedures for estimating the mean and second through fourth central moments of universe scores through multiple matrix sampling. Additionally, procedures are given for approximating the standard error associated with each estimate. All procedures are applicable when items are scored either…
Descriptors: Error of Measurement, Item Sampling, Matrices, Scoring Formulas
Pandey, Tej N. – 1975
Standard errors of the pooled mean estimate in multiple matrix sampling were compared for two procedures. The data were from tests involving items sampled with and without replacement. The two procedures involve the formulations of Madow and of Lord and Novick; the former permits sampling of items with or without replacement, whereas the latter is to be used…
Descriptors: Comparative Analysis, Error of Measurement, Item Sampling, Matrices
Hill, Richard K., Jr. – 1973
A model for multiple choice test-taking behavior is proposed which is different from those presently used for item sampling theory. A new theory is developed, which includes a new concept to facilitate comprehension of item sampling theory, a "number-known" score distribution. A major advantage of the model is that it accommodates data from…
Descriptors: Item Sampling, Mathematical Models, Norms, Speeches