ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	0
Since 2006 (last 20 years)	3

Descriptor

Error of Measurement	11
Sampling	11
Testing Problems	11
Test Interpretation	5
Equated Scores	4
Item Analysis	4
Test Reliability	4
Measurement	3
Research Methodology	3
Test Validity	3
Testing Programs	3
Educational Assessment	2
Educational Testing	2
Estimation (Mathematics)	2
Evaluation Methods	2
Evaluation Problems	2
High Schools	2
Item Response Theory	2
Mathematical Models	2
National Surveys	2
Sample Size	2
Test Construction	2
Test Items	2
Weighted Scores	2
Accuracy	1
More ▼

Source

Journal of Educational…	2
Applied Measurement in…	1
Educational Testing Service	1
International Journal of…	1

Author

Angoff, William H.	1
Cowell, William R.	1
Foster, Jeff L.	1
Haberman, Shelby J.	1
Hambleton, Ronald K.	1
Kolstad, Andrew	1
Lord, Frederic M.	1
Meyer, Kevin D.	1
Phillips, Gary W.	1
Shoemaker, David M.	1
Spencer, Bruce D.	1
Whitely, Susan E.	1
Wright, Benjamin D.	1
More ▼

Publication Type

Reports - Research	5
Journal Articles	2
Reports - Evaluative	2
Speeches/Meeting Papers	2
Opinion Papers	1
Reports - Descriptive	1

Education Level

Audience

Researchers

Location

Laws, Policies, & Programs

Assessments and Surveys

National Assessment of…	2
Graduate Record Examinations	1

What Works Clearinghouse Rating

Showing all 11 results Save | Export

Impact of Design Effects in Large-Scale District and State Assessments

Peer reviewed

Direct link

Phillips, Gary W. – Applied Measurement in Education, 2015

This article proposes that sampling design effects have potentially huge unrecognized impacts on the results reported by large-scale district and state assessments in the United States. When design effects are unrecognized and unaccounted for they lead to underestimating the sampling error in item and test statistics. Underestimating the sampling…

Descriptors: State Programs, Sampling, Research Design, Error of Measurement

Limits on the Accuracy of Linking. Research Report. ETS RR-10-22

Download full text

Haberman, Shelby J. – Educational Testing Service, 2010

Sampling errors limit the accuracy with which forms can be linked. Limitations on accuracy are especially important in testing programs in which a very large number of forms are employed. Standard inequalities in mathematical statistics may be used to establish lower bounds on the achievable inking accuracy. To illustrate results, a variety of…

Descriptors: Testing Programs, Equated Scores, Sampling, Accuracy

A Note on Allocating Items to Subtests in Multiple Matrix Sampling.

Download full text

Shoemaker, David M. – 1972

Investigated empirically through post mortem item-examinee sampling were the relative merits of two alternative procedures for allocating items to subtests in multiple matrix sampling and the feasibility of using the jackknife in approximating standard errors of estimate. The results indicate clearly that a partially balanced incomplete block…

Descriptors: Error of Measurement, Item Sampling, Matrices, Sampling

The Standard Error of Equipercentile Equating.

Download full text

Lord, Frederic M. – 1981

Transformations or equating of raw test scores on two or more forms of the same test are made interchangeable by empirical procedures deriving the standard error of an equipercentile equating for four different situations. Some numerical results are checked by Monte Carlo methods. Numerical standard errors are computed for two sets of real data.…

Descriptors: Educational Testing, Equated Scores, Error of Measurement, Mathematical Formulas

Influence of Item Parameter Errors in Test Development.

Download full text

Hambleton, Ronald K.; And Others – 1990

Item response theory (IRT) model parameter estimates have considerable merit and open up new directions for test development, but misleading results are often obtained because of errors in the item parameter estimates. The problem of the effects of item parameter estimation errors on the test development process is discussed, and the seriousness…

Descriptors: Error of Measurement, Estimation (Mathematics), Item Response Theory, Sampling

Models, Meanings and Misunderstandings: Some Issues in Applying Rasch's Theory

Peer reviewed

Whitely, Susan E. – Journal of Educational Measurement, 1977

A debate concerning specific issues and the general usefulness of the Rasch latent trait test model is continued. Methods of estimation, necessary sample size, and the applicability of the model are discussed. (JKS)

Descriptors: Error of Measurement, Item Analysis, Mathematical Models, Measurement

Misunderstanding the Rasch Model

Peer reviewed

Wright, Benjamin D. – Journal of Educational Measurement, 1977

Statements made in a previous article of this journal concerning the Rasch latent trait test model are questioned. Methods of estimation, necessary sample sizes, several formuli, and the general usefulness of the Rasch model are discussed. (JKS)

Descriptors: Computers, Error of Measurement, Item Analysis, Mathematical Models

The Impact of Clustering in the Sample Design of the 1987 High School Transcript Study on Estimates of Sampling Variability.

Download full text

Kolstad, Andrew – 1990

Use of clustering methods with data from the 1987 High School Transcript (HST) component of the 1986 National Assessment of Educational Progress (NAEP) is discussed in an attempt to persuade data analysts and researchers that they should be interested in problems traditionally turned over to samplers. Nearly all surveys used by the National Center…

Descriptors: Cluster Analysis, Educational Assessment, Error of Measurement, Estimation (Mathematics)

Considerations for Creating Multi-Language Personality Norms: A Three-Component Model of Error

Peer reviewed

Direct link

Meyer, Kevin D.; Foster, Jeff L. – International Journal of Testing, 2008

With the increasing globalization of human resources practices, a commensurate increase in demand has occurred for multi-language ("global") personality norms for use in selection and development efforts. The combination of data from multiple translations of a personality assessment into a single norm engenders error from multiple sources. This…

Descriptors: Global Approach, Cultural Differences, Norms, Human Resources

Efficient Methods for Sampling Out-of-School Seventeen-Year-Olds in the National Assessment of Educational Progress.

Download full text

Spencer, Bruce D. – 1986

The National Assessment of Educational Progress (NAEP) currently tests seventeen-year-old students enrolled in public and private secondary schools, but it does not test "out-of-school" seventeen-year-olds who have either graduated or dropped out. Estimating that one of five seventeen-year-olds is out of school, the interpretability of…

Descriptors: Adolescents, Cohort Analysis, Dropouts, Educational Assessment

An Examination of the Assumption that the Equating of Parallel Forms is Population-Independent.

Download full text

Angoff, William H.; Cowell, William R. – 1985

Linear and equipercentile equating conversions were developed for two forms of the Graduate Record Examinations (GRE) quantitative test and the verbal-plus-quantitative test. From a very large sample of students taking the GRE in October 1981, subpopulations were selected with respect to race, sex, field of study, and level of performance (defined…

Descriptors: Aptitude Tests, College Entrance Examinations, Equated Scores, Error of Measurement