Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 0 |
| Since 2017 (last 10 years) | 0 |
| Since 2007 (last 20 years) | 2 |
Descriptor
| Evaluation Methods | 14 |
| Testing Problems | 14 |
| Sampling | 12 |
| Test Reliability | 5 |
| Testing Programs | 5 |
| Educational Testing | 4 |
| Elementary Secondary Education | 4 |
| Response Rates… | 4 |
| Test Construction | 4 |
| Accountability | 3 |
| Data Collection | 3 |
Author
| Altschuld, James W. | 1 |
| Askegaard, Lewis D. | 1 |
| Austin, Dean A. | 1 |
| Carifio, James | 1 |
| Foster, Jeff L. | 1 |
| Ingels, Steven J. | 1 |
| Jaeger, Richard M. | 1 |
| Meyer, Kevin D. | 1 |
| Novak, Carl D. | 1 |
| Phillips, Gary W. | 1 |
| Porter, Andrew C. | 1 |
Publication Type
| Journal Articles | 7 |
| Reports - Research | 5 |
| Opinion Papers | 3 |
| Reports - Evaluative | 3 |
| Reports - Descriptive | 2 |
| Speeches/Meeting Papers | 2 |
| Guides - Non-Classroom | 1 |
| Information Analyses | 1 |
Assessments and Surveys
| National Assessment of… | 2 |
Phillips, Gary W. – Applied Measurement in Education, 2015
This article proposes that sampling design effects have potentially huge, unrecognized impacts on the results reported by large-scale district and state assessments in the United States. When design effects are unrecognized and unaccounted for, they lead to underestimating the sampling error in item and test statistics. Underestimating the sampling…
Descriptors: State Programs, Sampling, Research Design, Error of Measurement
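As a rough illustration of why an ignored design effect understates sampling error, the sketch below uses the standard Kish approximation for equal-sized clusters, deff = 1 + (m - 1) * rho. It is not taken from the article: the function names, the cluster size, the intraclass correlation, and the score standard deviation are all hypothetical values chosen for the example.

```python
import math


def design_effect(cluster_size: float, icc: float) -> float:
    """Kish approximation for equal-sized clusters: 1 + (m - 1) * rho."""
    return 1.0 + (cluster_size - 1.0) * icc


def standard_error_of_mean(sd: float, n: int, deff: float = 1.0) -> float:
    """Standard error of a mean, inflated by the square root of deff."""
    return sd * math.sqrt(deff / n)


# Hypothetical inputs (assumptions, not figures from any cited study).
n_students = 2000    # total students tested
cluster_size = 25    # students per sampled classroom
icc = 0.20           # intraclass correlation of scores within classrooms
sd = 10.0            # standard deviation of the test score

deff = design_effect(cluster_size, icc)
naive_se = standard_error_of_mean(sd, n_students)           # treats sample as simple random
adjusted_se = standard_error_of_mean(sd, n_students, deff)  # accounts for clustering
effective_n = n_students / deff                              # equivalent simple random sample size

print(f"design effect: {deff:.2f}")
print(f"naive SE: {naive_se:.3f}, adjusted SE: {adjusted_se:.3f}")
print(f"effective sample size: {effective_n:.0f}")
```

Under these assumed values the adjusted standard error is larger than the naive one by a factor of the square root of the design effect, which is the kind of underestimation the abstract describes.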
National Education Association, Washington, DC. – 1975
The National Education Association's Task Force on Testing has stated its opinion that standardized tests are overused. The task force suggests that the application of sampling techniques and a variety of alternatives to current testing practices would accomplish the same purposes. Representatives of the testing industry have indicated that the…
Descriptors: Accountability, Alternative Assessment, Cost Effectiveness, Educational Testing
Austin, Dean A.; Novak, Carl D. – Health Education (Washington D.C.), 1976
This study demonstrates that multiple matrix sampling procedures can be used to collect assessment data efficiently, unobtrusively, and reliably. (MB)
Descriptors: Data Collection, Educational Testing, Evaluation Methods, Item Sampling
Peer reviewed: Altschuld, James W.; And Others – Evaluation and Program Planning, 1992
An original study with a 96 percent questionnaire return rate and four replications with high return rates were compared in terms of populations sampled, implementation, and results. Future use of this procedure, which includes advance mailing, telephone contact, telephone explanations, and followups, is discussed. (SLD)
Descriptors: Comparative Analysis, Evaluation Methods, Mail Surveys, Questionnaires
Roeber, Edward D. – 1996
This paper is based on guidelines developed in 1989 for training workshops for state and local educators to demonstrate the processes by which performance assessments could be created, validated, and used in statewide assessment programs. These guidelines are based on work with the National Assessment of Educational Progress and several statewide…
Descriptors: Evaluation Methods, Performance Based Assessment, Sampling, Scoring
Carifio, James; And Others – 1990
Possible bias due to sampling problems or low response rates has been a troubling "nuisance" variable in empirical research since seminal and classical studies were done on these problems at the beginning of this century. Recent research suggests that: (1) earlier views of the alleged bias problem were misleading; (2) under a variety of fairly…
Descriptors: Data Collection, Evaluation Methods, Research Problems, Response Rates (Questionnaires)
Peer reviewed: Askegaard, Lewis D.; Umila, Benwardo V. – Journal of Educational Measurement, 1982
Multiple matrix sampling of items and examinees was applied to an 18-item rank order instrument administered to a randomly assigned group and compared to the ordering and ranking of all items by control subjects. High correlations between ranks suggest the methodology may viably reduce respondent effort on long rank ordering tasks. (Author/CM)
Descriptors: Evaluation Methods, Item Sampling, Junior High Schools, Student Reaction
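The general idea of multiple matrix sampling, in which each examinee sees only a subset of the items, can be sketched as follows. This is a minimal illustration and not the authors' procedure: only the 18-item instrument length comes from the abstract, while the function name `matrix_sample`, the three forms, the 90 examinees, and the fixed seed are assumptions made for the example.

```python
import random


def matrix_sample(item_ids, examinee_ids, n_forms, seed=0):
    """Split items into n_forms disjoint forms and assign each examinee one form."""
    rng = random.Random(seed)  # fixed seed so the example is reproducible

    items = list(item_ids)
    rng.shuffle(items)
    examinees = list(examinee_ids)
    rng.shuffle(examinees)

    # Deal the shuffled items out round-robin so the forms are disjoint
    # and as equal in length as possible.
    forms = [items[i::n_forms] for i in range(n_forms)]

    # Rotate through the forms so each one is answered by a similar
    # number of randomly ordered examinees.
    assignment = {e: forms[k % n_forms] for k, e in enumerate(examinees)}
    return forms, assignment


# 18 items (from the abstract); 3 forms and 90 examinees are hypothetical.
forms, assignment = matrix_sample(range(1, 19), range(1, 91), n_forms=3)

for number, form in enumerate(forms, start=1):
    print(f"form {number}: items {sorted(form)}")
print(f"items per examinee: {len(assignment[1])} of 18")
```

Each examinee in this sketch responds to 6 of the 18 items, which is how the technique reduces respondent effort while still covering every item across the group.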
Peer reviewed: Jaeger, Richard M. – Educational Measurement: Issues and Practice, 1991
Issues concerning the selection of judges for standard setting are discussed. Determining the consistency of judges' recommendations, or their congruity with other expert recommendations, would help in selection. Enough judges must be chosen to allow estimation of recommendations by an entire population of judges. (SLD)
Descriptors: Cutting Scores, Evaluation Methods, Evaluators, Examiners
Peer reviewed: Stockman, Ida J. – Language, Speech, and Hearing Services in Schools, 1996
This article discusses the use of language sample analysis (LSA) as a screening tool for preschool linguistic minority children due to the difficulty of using standardized tests in assessing language delays in speakers of minority dialects and languages. The use of LSA with seven African American preschoolers is examined. (CR)
Descriptors: Black Students, Diagnostic Tests, Evaluation Methods, Language Minorities
Womer, Frank B. – 1971
This symposium deals with recent issues in the development of the National Assessment model. General goals are outlined and the following topics are discussed: "Objectives and Exercises" (Jack C. Merwin); "Sampling" (A. Finkner); and "Data Analysis" (John Milholland). (CK) Aspect of National Assessment (NAEP) dealt…
Descriptors: Academic Achievement, Conferences, Data Analysis, Demonstration Programs
Meyer, Kevin D.; Foster, Jeff L. – International Journal of Testing, 2008
With the increasing globalization of human resources practices, a commensurate increase in demand has occurred for multi-language ("global") personality norms for use in selection and development efforts. The combination of data from multiple translations of a personality assessment into a single norm engenders error from multiple sources. This…
Descriptors: Global Approach, Cultural Differences, Norms, Human Resources
Ingels, Steven J.; And Others – 1989
Nonresponse issues are investigated for the base year (1988) survey of the United States Department of Education's National Education Longitudinal Study of 1988 (NELS:88), a national probability sample of middle schools and eighth-grade students in the spring of 1988. The total eighth-grade enrollment for the NELS:88 sample of schools was 203,002;…
Descriptors: Data Collection, Educational Assessment, Elementary Secondary Education, Estimation (Mathematics)
Peer reviewed: Shepard, Lorrie – Studies in Educational Evaluation, 1979
Assessment generally refers to large-scale, system-wide measurement programs for pupil diagnosis; pupil certification; program evaluation; research; accountability; resource allocations; or teacher evaluation. The purpose of assessment should determine the test content, construction, administration, and examinees sampled. Assessment methods for…
Descriptors: Accountability, Diagnostic Tests, Educational Assessment, Educational Research
Porter, Andrew C. – 1990
The measurement dilemmas involved in assessing the national educational goals established by the President and governors at the 1989 education summit are discussed. The first and most important choice is what to assess and whether to align assessment to the vision of curriculum reform or to the curriculum that students are actually experiencing.…
Descriptors: Academic Achievement, Accountability, Criterion Referenced Tests, Educational Assessment
