Publication Date
In 2025 | 0 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 1 |
Since 2006 (last 20 years) | 1 |
Descriptor
Item Sampling | 13 |
Statistical Analysis | 13 |
Test Construction | 13 |
Item Analysis | 6 |
Test Items | 6 |
Achievement Tests | 4 |
Criterion Referenced Tests | 4 |
Test Interpretation | 4 |
Difficulty Level | 3 |
Reliability | 3 |
Tables (Data) | 3 |
More ▼ |
Author
Berk, Ronald A. | 1 |
Doron, Rina | 1 |
Douglass, James B. | 1 |
Forsyth, Robert A. | 1 |
Fruchter, Dorothy A. | 1 |
Haladyna, Thomas | 1 |
Harris, Chester W. | 1 |
Lewis, Charles | 1 |
Lewy, Arieh | 1 |
Marc Brysbaert | 1 |
Ree, Malcolm James | 1 |
More ▼ |
Publication Type
Reports - Research | 6 |
Speeches/Meeting Papers | 2 |
Journal Articles | 1 |
Reports - Descriptive | 1 |
Reports - General | 1 |
Education Level
Audience
Location
Laws, Policies, & Programs
Assessments and Surveys
Armed Services Vocational… | 1 |
What Works Clearinghouse Rating
Marc Brysbaert – Cognitive Research: Principles and Implications, 2024
Experimental psychology is witnessing an increase in research on individual differences, which requires the development of new tasks that can reliably assess variations among participants. To do this, cognitive researchers need statistical methods that many researchers have not learned during their training. The lack of expertise can pose…
Descriptors: Experimental Psychology, Individual Differences, Statistical Analysis, Task Analysis
Further Results on the Standard Errors of Estimate Associated with Item-Examinee Sampling Procedures

Shoemaker, David M. – Journal of Educational Measurement, 1971
Descriptors: Difficulty Level, Item Sampling, Statistical Analysis, Test Construction
Woodson, M. I. Charles E.
It has been argued that item variance and test variance are not necessary characteristics for criterion-referenced tests, although they are necessary for norm-referenced tests. This position is in error because it considers sample statistics as the criteria for evaluating items and tests. Within a particular sample, an item or test may have no…
Descriptors: Criterion Referenced Tests, Evaluation Criteria, Item Analysis, Item Sampling

Scott, William A. – Educational and Psychological Measurement, 1972
Descriptors: Item Sampling, Mathematical Applications, Scoring Formulas, Statistical Analysis
Harris, Chester W. – 1975
Achievement tests which are specifically linked to an instructional program and have been developed in relation to an objectives base and/or to an item generation rule are considered, as well as student response data. Three types of studies are outlined and the kind of procedures thought useful illustrated. As various methods for examining…
Descriptors: Achievement Tests, Instructional Programs, Item Banks, Item Sampling

Tucker, Ledyard R.; Lewis, Charles – Psychometrika, 1973
Maximum likelihood factor analysis provides an effective method for estimation of factor matrices and a useful test statistic in the likelihood ratio for rejection of overly simple factor models. A reliability coefficient is proposed for analysis of factor solution. (Author/RK)
Descriptors: Analysis of Variance, Factor Analysis, Goodness of Fit, Item Sampling
Berk, Ronald A. – 1978
Sixteen item statistics recommended for use in the development of criterion-referenced tests were evaluated. There were two major criteria: (1) practicability in terms of ease of computation and interpretation and (2) meaningfulness in the context of the development process. Most of the statistics were based on a comparison of performance changes…
Descriptors: Achievement Tests, Criterion Referenced Tests, Difficulty Level, Guides
Fruchter, Dorothy A.; Ree, Malcolm James – 1977
In order to meet the needs of all the Armed Services, new forms of the Armed Services Vocational Aptitude Battery (ASVAB) must periodically be developed, refined, and standardized on an appropriate normative sample. Since one of the uses of the ASVAB is to determine candidate suitability for military service, it is necessary for the…
Descriptors: Aptitude Tests, Armed Forces, Equated Scores, Item Analysis
Scheetz, James P.; Forsyth, Robert A. – 1977
Empirical evidence is presented related to the effects of using a stratified sampling of items in multiple matrix sampling on the accuracy of estimates of the population mean. Data were obtained from a sample of 600 high school students for a 36-item mathematics test and a 40-item vocabulary test, both subtests of the Iowa Tests of Educational…
Descriptors: Achievement Tests, Difficulty Level, Item Analysis, Item Sampling
Douglass, James B. – 1979
A general process for testing the feasibility of applying alternative mathematical or statistical models to the solution of a practical problem is presented and flowcharted. The system is used to compare five models for test equating: (1) anchor test equating using classical test theory; (2) anchor test equating using the one-parameter logistic…
Descriptors: Comparative Analysis, Equated Scores, Flow Charts, Goodness of Fit
Lewy, Arieh; Doron, Rina – 1977
The concept of tailored testing for individuals is applied to the construction of tests for special groups and extended to apply to item content as well as item difficulty. It is suggested that evaluators may decide to construct tests on the basis of a unique combination of items drawn from an item bank to fit the need of a particular group. At…
Descriptors: Achievement Tests, Adaptive Testing, Criterion Referenced Tests, Group Norms
Ward, Barbara – 1980
The National Assessment of Educational Progress (NAEP) has completed two assessments of mathematics, the first conducted in 1972-73 and the second during 1977-78. Each assessment surveyed the mathematics achievement of American 9-, 13-, and 17-year-olds, using a deeply stratified, multi-stage probability sample design. This report documents…
Descriptors: Academic Achievement, Data Analysis, Data Collection, Educational Assessment
Haladyna, Thomas – 1975
A central problem for the user of domain-referenced tests in instruction is deciding who has passed and who has failed. Two procedures were presented and discussed. The first, employing classical test theory, was found to be more useful for larger domains and where the passing standard is 70 percent or less. The sampling procedure suggested by…
Descriptors: Academic Achievement, Academic Standards, Criterion Referenced Tests, Decision Making Skills