Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 0 |
Since 2006 (last 20 years) | 11 |
Descriptor
Source
Author
Beaton, Albert E. | 2 |
Cai, Li | 2 |
Chromy, James R. | 2 |
Petersen, Nancy S. | 2 |
Roeber, Edward D. | 2 |
Yang, Ji Seung | 2 |
Alexander, Lamar | 1 |
Arenson, Ethan A. | 1 |
August, Diane, Ed. | 1 |
Barone, John L. | 1 |
Barton, Paul E. | 1 |
More ▼ |
Publication Type
Audience
Researchers | 6 |
Practitioners | 1 |
Students | 1 |
Location
Alaska | 1 |
Arizona | 1 |
Arkansas | 1 |
Australia | 1 |
California | 1 |
Indiana | 1 |
Missouri | 1 |
Netherlands | 1 |
Rhode Island | 1 |
Texas | 1 |
United States | 1 |
More ▼ |
Laws, Policies, & Programs
Elementary and Secondary… | 1 |
Individuals with Disabilities… | 1 |
No Child Left Behind Act 2001 | 1 |
Perkins Loan Program | 1 |
Assessments and Surveys
National Assessment of… | 25 |
Program for International… | 2 |
SAT (College Admission Test) | 2 |
Alabama High School… | 1 |
Comprehensive Tests of Basic… | 1 |
Early Childhood Longitudinal… | 1 |
National Longitudinal Study… | 1 |
What Works Clearinghouse Rating
Yang, Ji Seung; Cai, Li – Journal of Educational and Behavioral Statistics, 2014
The main purpose of this study is to improve estimation efficiency in obtaining maximum marginal likelihood estimates of contextual effects in the framework of nonlinear multilevel latent variable model by adopting the Metropolis-Hastings Robbins-Monro algorithm (MH-RM). Results indicate that the MH-RM algorithm can produce estimates and standard…
Descriptors: Computation, Hierarchical Linear Modeling, Mathematics, Context Effect
Yang, Ji Seung; Cai, Li – Grantee Submission, 2014
The main purpose of this study is to improve estimation efficiency in obtaining maximum marginal likelihood estimates of contextual effects in the framework of nonlinear multilevel latent variable model by adopting the Metropolis-Hastings Robbins-Monro algorithm (MH-RM; Cai, 2008, 2010a, 2010b). Results indicate that the MH-RM algorithm can…
Descriptors: Computation, Hierarchical Linear Modeling, Mathematics, Context Effect
Phillips, Gary W. – Applied Measurement in Education, 2015
This article proposes that sampling design effects have potentially huge unrecognized impacts on the results reported by large-scale district and state assessments in the United States. When design effects are unrecognized and unaccounted for they lead to underestimating the sampling error in item and test statistics. Underestimating the sampling…
Descriptors: State Programs, Sampling, Research Design, Error of Measurement
Haberman, Shelby J. – Educational Testing Service, 2010
Sampling errors limit the accuracy with which forms can be linked. Limitations on accuracy are especially important in testing programs in which a very large number of forms are employed. Standard inequalities in mathematical statistics may be used to establish lower bounds on the achievable inking accuracy. To illustrate results, a variety of…
Descriptors: Testing Programs, Equated Scores, Sampling, Accuracy
Doorey, Nancy A. – Council of Chief State School Officers, 2011
The work reported in this paper reflects a collaborative effort of many individuals representing multiple organizations. It began during a session at the October 2008 meeting of TILSA when a representative of a member state asked the group if any of their programs had experienced unexpected fluctuations in the annual state assessment scores, and…
Descriptors: Testing, Sampling, Expertise, Testing Programs
Olsen, Robert B.; Unlu, Fatih; Price, Cristofer; Jaciw, Andrew P. – National Center for Education Evaluation and Regional Assistance, 2011
This report examines the differences in impact estimates and standard errors that arise when these are derived using state achievement tests only (as pre-tests and post-tests), study-administered tests only, or some combination of state- and study-administered tests. State tests may yield different evaluation results relative to a test that is…
Descriptors: Achievement Tests, Standardized Tests, State Standards, Reading Achievement
Hart, Ray; Casserly, Michael; Uzzell, Renata; Palacios, Moses; Corcoran, Amanda; Spurgeon, Liz – Council of the Great City Schools, 2015
There has been little data collected on how much testing actually goes on in America's schools and how the results are used. So in the Spring of 2014, the Council staff developed and launched a survey of assessment practices. This report presents the findings from that survey and subsequent Council analysis and review of the data. It also offers…
Descriptors: Urban Schools, Student Evaluation, Testing Programs, Testing
Wu, Margaret – Educational Measurement: Issues and Practice, 2010
In large-scale assessments, such as state-wide testing programs, national sample-based assessments, and international comparative studies, there are many steps involved in the measurement and reporting of student achievement. There are always sources of inaccuracies in each of the steps. It is of interest to identify the source and magnitude of…
Descriptors: Testing Programs, Educational Assessment, Measures (Individuals), Program Effectiveness
Brennan, Robert L. – Applied Psychological Measurement, 2008
The discussion here covers five articles that are linked in the sense that they all treat population invariance. This discussion of population invariance is a somewhat broader treatment of the subject than simply a discussion of these five articles. In particular, occasional reference is made to publications other than those in this issue. The…
Descriptors: Advanced Placement, Law Schools, Science Achievement, Achievement Tests
Bovaird, James A., Ed.; Geisinger, Kurt F., Ed.; Buckendahl, Chad W., Ed. – APA Books, 2011
Educational assessment and, more broadly, educational research in the United States have entered into an era characterized by a dramatic increase in the prevalence and importance of test score use in accountability systems. This volume covers a selection of contemporary issues about testing science and practice that impact the nation's public…
Descriptors: Graduate Students, Test Use, Student Placement, Educational Research
Petersen, Nancy S. – Applied Psychological Measurement, 2008
This article discusses the five studies included in this issue. Each article addressed the same topic, population invariance of equating. They all used data from major standardized testing programs, and they all used essentially the same statistics to evaluate their results, namely, the root mean square difference and root expected mean square…
Descriptors: Testing Programs, Standardized Tests, Equated Scores, Evaluation Methods
Jaeger, Richard M. – 1973
While school systems most often use achievement test results for individual appraisals, increasing attention to program evaluation and accountability requires that test results be used for institutional appraisals as well. When institutional test results are desired--that is, results for schools or school districts--not all pupils need be tested.…
Descriptors: Accountability, Program Evaluation, Sampling, Speeches
Kolen, Michael J. – 1984
Large sample standard errors for the Tucker method of linear equating under the common item nonrandom groups design are derived under normality assumptions as well as under less restrictive assumptions. Standard errors of Tucker equating are estimated using the bootstrap method described by Efron. The results from different methods are compared…
Descriptors: Certification, Comparative Analysis, Equated Scores, Error of Measurement

Gohmann, Stephen F. – Journal of Educational Measurement, 1988
One method to correct for selection bias in comparing Scholastic Aptitude Test (SAT) scores among states is presented, which is a modification of J. J. Heckman's Selection Bias Correction (1976, 1979). Empirical results suggest that sample selection bias is present in SAT score regressions. (SLD)
Descriptors: Regression (Statistics), Sampling, Scoring, Selection
Benrud, C. H.; And Others – 1981
Sampling activities for Year 11 of the National Assessment of Educational Progress began in 1977 when plans were begun to Years 11-14. In March 1979 the sample was selected and allocated. In-school secondary sample selection activities were carried out during May through August, 1979, and in-school package assignment and field support activities…
Descriptors: Computer Oriented Programs, Educational Assessment, Elementary Secondary Education, Methods