ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	0
Since 2006 (last 20 years)	4

Descriptor

Evaluation Methods	11
Sampling	11
Testing Programs	11
Equated Scores	5
Testing Problems	5
Educational Assessment	4
State Programs	4
Academic Achievement	3
Accountability	3
Educational Testing	3
Elementary Secondary Education	3
Error of Measurement	3
Evaluation Problems	3
Group Testing	3
Item Response Theory	3
Measurement Techniques	3
Standardized Tests	3
Test Construction	3
Achievement Tests	2
College Entrance Examinations	2
Data Collection	2
Educational Research	2
Law Schools	2
Racial Differences	2
Sample Size	2
More ▼

Source

Applied Psychological…	2
Applied Measurement in…	1
Educational Measurement:…	1
Studies in Educational…	1

Author

Petersen, Nancy S.	2
Brennan, Robert L.	1
Cook, Linda L.	1
Daniel, Mark	1
Phillips, Gary W.	1
Porter, Andrew C.	1
Roeber, Edward D.	1
Shepard, Lorrie	1
Wu, Margaret	1

Publication Type

Journal Articles	5
Reports - Research	5
Reports - Descriptive	3
Information Analyses	2
Opinion Papers	2
Guides - Non-Classroom	1
Numerical/Quantitative Data	1
Speeches/Meeting Papers	1

Education Level

Higher Education	3
Adult Education	1
Elementary Secondary Education	1

Audience

Researchers

Location

Laws, Policies, & Programs

Assessments and Surveys

National Assessment of…

What Works Clearinghouse Rating

Showing all 11 results Save | Export

Impact of Design Effects in Large-Scale District and State Assessments

Peer reviewed

Direct link

Phillips, Gary W. – Applied Measurement in Education, 2015

This article proposes that sampling design effects have potentially huge unrecognized impacts on the results reported by large-scale district and state assessments in the United States. When design effects are unrecognized and unaccounted for they lead to underestimating the sampling error in item and test statistics. Underestimating the sampling…

Descriptors: State Programs, Sampling, Research Design, Error of Measurement

Measurement, Sampling, and Equating Errors in Large-Scale Assessments

Peer reviewed

Direct link

Wu, Margaret – Educational Measurement: Issues and Practice, 2010

In large-scale assessments, such as state-wide testing programs, national sample-based assessments, and international comparative studies, there are many steps involved in the measurement and reporting of student achievement. There are always sources of inaccuracies in each of the steps. It is of interest to identify the source and magnitude of…

Descriptors: Testing Programs, Educational Assessment, Measures (Individuals), Program Effectiveness

A Discussion of Population Invariance

Peer reviewed

Direct link

Brennan, Robert L. – Applied Psychological Measurement, 2008

The discussion here covers five articles that are linked in the sense that they all treat population invariance. This discussion of population invariance is a somewhat broader treatment of the subject than simply a discussion of these five articles. In particular, occasional reference is made to publications other than those in this issue. The…

Descriptors: Advanced Placement, Law Schools, Science Achievement, Achievement Tests

A Discussion of Population Invariance of Equating

Peer reviewed

Direct link

Petersen, Nancy S. – Applied Psychological Measurement, 2008

This article discusses the five studies included in this issue. Each article addressed the same topic, population invariance of equating. They all used data from major standardized testing programs, and they all used essentially the same statistics to evaluate their results, namely, the root mean square difference and root expected mean square…

Descriptors: Testing Programs, Standardized Tests, Equated Scores, Evaluation Methods

Guidelines for the Management of Performance Assessments in Large-Scale Assessment Programs.

Download full text

Roeber, Edward D. – 1996

This paper is based on guidelines developed in 1989 for training workshops for state and local educators to demonstrate the processes by which performance assessments could be created, validated, and used in statewide assessment programs. These guidelines are based on work with the National Assessment of Educational Progress and several statewide…

Descriptors: Evaluation Methods, Performance Based Assessment, Sampling, Scoring

Large-Sample Test Intercorrelations. Technical Report 1983-2.

Daniel, Mark – 1983

The correlations of each of the 22 tests in the Johnson O'Connor Research Foundation battery with all other tests in the battery are listed. Four fairly large samples are used, each including cases of one sex and a narrow age range. These cases come from a file of 3,555 examinees tested between June 1981 and the fall of 1982. The purpose of…

Descriptors: Adults, Age Differences, Aptitude Tests, Comparative Analysis

Why Should All Those Students Take All Those Tests? (Every-Student Testing or Sampling of Selected Groups?).

Download full text

National Education Association, Washington, DC. – 1975

The National Education Association's Task Force on Testing has stated its opinion that standardized tests are overused. The task force suggests that the application of sampling techniques and a variety of alternatives to current testing practices would accomplish the same purposes. Representatives of the testing industry have indicated that the…

Descriptors: Accountability, Alternative Assessment, Cost Effectiveness, Educational Testing

Download full text

Cook, Linda L.; Petersen, Nancy S. – 1986

This paper examines how various equating methods are affected by: (1) sampling error; (2) sample characteristics; and (3) characteristics of anchor test items. It reviews empirical studies that investigated the invariance of equating transformations, and it discusses empirical and simulation studies that focus on how the properties of anchor tests…

Descriptors: Educational Research, Equated Scores, Error of Measurement, Evaluation Methods

Purposes of Assessment.

Peer reviewed

Shepard, Lorrie – Studies in Educational Evaluation, 1979

Assessment generally refers to large-scale, system-wide measurement programs for pupil diagnosis; pupil certification; program evaluation; research; accountability; resource allocations; or teacher evaluation. The purpose of assessment should determine the test content, construction, administration, and examinees sampled. Assessment methods for…

Descriptors: Accountability, Diagnostic Tests, Educational Assessment, Educational Research

Assessing National Goals: Some Measurement Dilemmas.

Download full text

Porter, Andrew C. – 1990

The measurement dilemmas involved in assessing the national educational goals established by the President and governors at the 1989 education summit are discussed. The first and most important choice is what to assess and whether to align assessment to the vision of curriculum reform or to the curriculum that students are actually experiencing.…

Descriptors: Academic Achievement, Accountability, Criterion Referenced Tests, Educational Assessment

Aspects of Educational Assessment.

Download full text

Educational Testing Service, Princeton, NJ. Center for Statewide Educational Assessment. – 1975

Six research papers that have been published by the staff of the Center for Statewide Educational Assessment at Educational Testing Service are presented. In "A Selection of Self Concept Measures," Joan Knapp explores questions of defining and measuring self concept. In another paper, she examines some problems of measuring attitudes toward…

Descriptors: Academic Achievement, Attitude Measures, Data Analysis, Data Collection