Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 0 |
Since 2006 (last 20 years) | 4 |
Descriptor
Source
Applied Psychological… | 2 |
Applied Measurement in… | 1 |
Educational Measurement:… | 1 |
Studies in Educational… | 1 |
Author
Petersen, Nancy S. | 2 |
Brennan, Robert L. | 1 |
Cook, Linda L. | 1 |
Daniel, Mark | 1 |
Phillips, Gary W. | 1 |
Porter, Andrew C. | 1 |
Roeber, Edward D. | 1 |
Shepard, Lorrie | 1 |
Wu, Margaret | 1 |
Publication Type
Journal Articles | 5 |
Reports - Research | 5 |
Reports - Descriptive | 3 |
Information Analyses | 2 |
Opinion Papers | 2 |
Guides - Non-Classroom | 1 |
Numerical/Quantitative Data | 1 |
Speeches/Meeting Papers | 1 |
Education Level
Higher Education | 3 |
Adult Education | 1 |
Elementary Secondary Education | 1 |
Audience
Researchers | 1 |
Location
Laws, Policies, & Programs
Assessments and Surveys
National Assessment of… | 1 |
What Works Clearinghouse Rating
Phillips, Gary W. – Applied Measurement in Education, 2015
This article proposes that sampling design effects have potentially huge unrecognized impacts on the results reported by large-scale district and state assessments in the United States. When design effects are unrecognized and unaccounted for they lead to underestimating the sampling error in item and test statistics. Underestimating the sampling…
Descriptors: State Programs, Sampling, Research Design, Error of Measurement
Wu, Margaret – Educational Measurement: Issues and Practice, 2010
In large-scale assessments, such as state-wide testing programs, national sample-based assessments, and international comparative studies, there are many steps involved in the measurement and reporting of student achievement. There are always sources of inaccuracies in each of the steps. It is of interest to identify the source and magnitude of…
Descriptors: Testing Programs, Educational Assessment, Measures (Individuals), Program Effectiveness
Brennan, Robert L. – Applied Psychological Measurement, 2008
The discussion here covers five articles that are linked in the sense that they all treat population invariance. This discussion of population invariance is a somewhat broader treatment of the subject than simply a discussion of these five articles. In particular, occasional reference is made to publications other than those in this issue. The…
Descriptors: Advanced Placement, Law Schools, Science Achievement, Achievement Tests
Petersen, Nancy S. – Applied Psychological Measurement, 2008
This article discusses the five studies included in this issue. Each article addressed the same topic, population invariance of equating. They all used data from major standardized testing programs, and they all used essentially the same statistics to evaluate their results, namely, the root mean square difference and root expected mean square…
Descriptors: Testing Programs, Standardized Tests, Equated Scores, Evaluation Methods
Roeber, Edward D. – 1996
This paper is based on guidelines developed in 1989 for training workshops for state and local educators to demonstrate the processes by which performance assessments could be created, validated, and used in statewide assessment programs. These guidelines are based on work with the National Assessment of Educational Progress and several statewide…
Descriptors: Evaluation Methods, Performance Based Assessment, Sampling, Scoring
Daniel, Mark – 1983
The correlations of each of the 22 tests in the Johnson O'Connor Research Foundation battery with all other tests in the battery are listed. Four fairly large samples are used, each including cases of one sex and a narrow age range. These cases come from a file of 3,555 examinees tested between June 1981 and the fall of 1982. The purpose of…
Descriptors: Adults, Age Differences, Aptitude Tests, Comparative Analysis
National Education Association, Washington, DC. – 1975
The National Education Association's Task Force on Testing has stated its opinion that standardized tests are overused. The task force suggests that the application of sampling techniques and a variety of alternatives to current testing practices would accomplish the same purposes. Representatives of the testing industry have indicated that the…
Descriptors: Accountability, Alternative Assessment, Cost Effectiveness, Educational Testing
Cook, Linda L.; Petersen, Nancy S. – 1986
This paper examines how various equating methods are affected by: (1) sampling error; (2) sample characteristics; and (3) characteristics of anchor test items. It reviews empirical studies that investigated the invariance of equating transformations, and it discusses empirical and simulation studies that focus on how the properties of anchor tests…
Descriptors: Educational Research, Equated Scores, Error of Measurement, Evaluation Methods

Shepard, Lorrie – Studies in Educational Evaluation, 1979
Assessment generally refers to large-scale, system-wide measurement programs for pupil diagnosis; pupil certification; program evaluation; research; accountability; resource allocations; or teacher evaluation. The purpose of assessment should determine the test content, construction, administration, and examinees sampled. Assessment methods for…
Descriptors: Accountability, Diagnostic Tests, Educational Assessment, Educational Research
Porter, Andrew C. – 1990
The measurement dilemmas involved in assessing the national educational goals established by the President and governors at the 1989 education summit are discussed. The first and most important choice is what to assess and whether to align assessment to the vision of curriculum reform or to the curriculum that students are actually experiencing.…
Descriptors: Academic Achievement, Accountability, Criterion Referenced Tests, Educational Assessment
Educational Testing Service, Princeton, NJ. Center for Statewide Educational Assessment. – 1975
Six research papers that have been published by the staff of the Center for Statewide Educational Assessment at Educational Testing Service are presented. In "A Selection of Self Concept Measures," Joan Knapp explores questions of defining and measuring self concept. In another paper, she examines some problems of measuring attitudes toward…
Descriptors: Academic Achievement, Attitude Measures, Data Analysis, Data Collection