ERIC - Search Results

Showing 1 to 15 of 22 results

TIMSS 2015: Illustrating Advancements in Large-Scale International Assessments

Peer reviewed

Martin, Michael O.; Mullis, Ina V. S. – Journal of Educational and Behavioral Statistics, 2019

International large-scale assessments of student achievement such as International Association for the Evaluation of Educational Achievement's Trends in International Mathematics and Science Study (TIMSS) and Progress in International Reading Literacy Study and Organization for Economic Cooperation and Development's Program for International…

Descriptors: Achievement Tests, International Assessment, Mathematics Tests, Science Achievement

Sensitivity of Achievement Estimation to Conditioning Model Misspecification

Peer reviewed

Rutkowski, Leslie – Applied Measurement in Education, 2014

Large-scale assessment programs such as the National Assessment of Educational Progress (NAEP), Trends in International Mathematics and Science Study (TIMSS), and Programme for International Student Assessment (PISA) use a sophisticated assessment administration design called matrix sampling that minimizes the testing burden on individual…

Descriptors: Measurement, Testing, Item Sampling, Computation

An Alternative Data Collection Design for Equating with Very Small Samples. Research Report. ETS RR-08-11

Peer reviewed

Puhan, Gautam; Moses, Tim; Grant, Mary; McHale, Fred – ETS Research Report Series, 2008

A single group (SG) equating design with nearly equivalent test forms (SiGNET) design was developed by Grant (2006) to equate small volume tests. The basis of this design is that examinees take two largely overlapping test forms within a single administration. The scored items for the operational form are divided into mini-tests called testlets.…

Descriptors: Data Collection, Equated Scores, Item Sampling, Sample Size

The Effects of Item Discrimination on the Standard Errors of Estimate Associated with Item-Examinee Sampling Procedures

Peer reviewed

Barcikowski, Robert S. – Educational and Psychological Measurement, 1974

Descriptors: Error of Measurement, Item Sampling, Testing Problems

Reliability: A Rasch Perspective

Peer reviewed

Schumacker, Randall E.; Smith, Everett V., Jr. – Educational and Psychological Measurement, 2007

Measurement error is a common theme in classical measurement models used in testing and assessment. In classical measurement models, the definition of measurement error and the subsequent reliability coefficients differ on the basis of the test administration design. Internal consistency reliability specifies error due primarily to poor item…

Descriptors: Measurement Techniques, Error of Measurement, Item Sampling, Item Response Theory

Estimating Moments of Universe Scores and Associated Standard Errors in Multiple Matrix Sampling for All Item-Scoring Procedures

Peer reviewed

Pandey, Tej N.; Shoemaker, David M. – Educational and Psychological Measurement, 1975

Described herein are formulas and computational procedures for estimating the mean and second through fourth central moments of universe scores through multiple matrix sampling. Additionally, procedures are given for approximating the standard error associated with each estimate. All procedures are applicable when items are scored either…

Descriptors: Error of Measurement, Item Sampling, Matrices, Scoring Formulas

Estimating the Standard Error of the Mean in Multiple Matrix Sampling When Items are Sampled With and Without Replacement.

Pandey, Tej N. – 1975

Standard errors of the pooled mean estimate in multiple matrix sampling were compared for two procedures. The data were from tests in which items were sampled with and without replacement. The two procedures involve the formulations of Madow and of Lord and Novick; the former permits sampling of items with or without replacement, whereas the latter is to be used…

Descriptors: Comparative Analysis, Error of Measurement, Item Sampling, Matrices

A General Procedure for Approximating Standard Errors of Estimate in Multiple Matrix Sampling.

Shoemaker, David M. – 1972

Investigated empirically through post mortem item-examinee sampling was the feasibility of the jackknife as a procedure for approximating standard errors of estimate in multiple matrix sampling. The parameters estimated were the mean test score, second through fourth central moments of the test score distribution, and the variance of the item…

Descriptors: Error of Measurement, Error Patterns, Item Sampling, Matrices

A Note on Allocating Items to Subtests in Multiple Matrix Sampling.

Shoemaker, David M. – 1972

Investigated empirically through post mortem item-examinee sampling were the relative merits of two alternative procedures for allocating items to subtests in multiple matrix sampling and the feasibility of using the jackknife in approximating standard errors of estimate. The results indicate clearly that a partially balanced incomplete block…

Descriptors: Error of Measurement, Item Sampling, Matrices, Sampling

A Note on Allocating Items to Subtests in Multiple Matrix Sampling and Approximating Standard Errors of Estimate with the Jackknife

Peer reviewed

Shoemaker, David M. – Journal of Educational Measurement, 1973

Investigated empirically through post mortem item-examinee sampling were the relative merits of two alternative procedures for allocating items to subtests in multiple matrix sampling and the feasibility of using the jackknife in approximating standard errors of estimate. (Editor)

Descriptors: Databases, Error of Measurement, Item Sampling, Research Design

Best Linear Prediction of Composite Universe Scores.

Peer reviewed

Jarjoura, David – Psychometrika, 1983

The problem of predicting universe scores for samples of examinees based on their responses to samples of items is treated. The measurement model categorizes items according to the cells of a table of test specifications, and the linear function derived for minimizing error variance in prediction uses responses to these categories. (Author/JKS)

Descriptors: Error of Measurement, Generalizability Theory, Item Sampling, Prediction

Standard Errors of Estimate in Item-Examinee Sampling as a Function of Test Reliability, Variation in Item Difficulty Indices and Degree of Skewness in the Normative Distribution

Peer reviewed

Shoemaker, David M. – Educational and Psychological Measurement, 1972

Descriptors: Difficulty Level, Error of Measurement, Item Sampling, Simulation

Reliability as a Measurement Design Effect

Peer reviewed

Adams, Raymond J. – Studies in Educational Evaluation, 2005

Test reliability is a concept central to classical test theory and it is commonly stated as a requirement that a test attain a certain level of reliability before it be considered of sufficient quality for practical use. This article discusses the role of reliability in item response theory, and in particular the role of reliability in contexts…

Descriptors: Test Reliability, Error of Measurement, Item Sampling, Item Response Theory

Item Sampling: Optimum Number of People and Items.

Moy, Mabel L. Y.; Barcikowski, Robert S. – 1973

Using a computer-based Monte Carlo approach to generate item responses, the results of this study indicate that, when item discrimination indices are considered, item-examinee sampling procedures having the same number of observations have different standard errors in estimating both test mean and test variance. With certain types of tests, a…

Descriptors: Error of Measurement, Evaluation Methods, Item Sampling, Monte Carlo Methods

Estimating the Imputed Social Cost of Errors of Measurement.

Peer reviewed

Lord, Frederic M. – Psychometrika, 1985

Given a loss function, an asymptotic method for optimal test design for a specified target population of examinees is presented. Also, of more practical use, given an existing unidimensional test and target population, a way is presented to find the loss function for which the test is optimal. (NSF)

Descriptors: Error of Measurement, Higher Education, Item Sampling, Latent Trait Theory
