Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 0 |
Since 2006 (last 20 years) | 2 |
Descriptor
Statistical Inference | 3 |
Test Items | 3 |
Equated Scores | 2 |
Sampling | 2 |
Computation | 1 |
Cutting Scores | 1 |
Difficulty Level | 1 |
Error of Measurement | 1 |
Females | 1 |
Grade 8 | 1 |
Item Bias | 1 |
More ▼ |
Source
Applied Measurement in… | 3 |
Author
Haertel, Edward H. | 1 |
Kim, Sooyeon | 1 |
Mazzeo, John | 1 |
Michaelides, Michalis P. | 1 |
Thayer, Dorothy T. | 1 |
Walker, Michael | 1 |
Zwick, Rebecca | 1 |
Publication Type
Journal Articles | 3 |
Reports - Research | 2 |
Reports - Evaluative | 1 |
Education Level
Grade 8 | 1 |
Junior High Schools | 1 |
Middle Schools | 1 |
Secondary Education | 1 |
Audience
Location
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Michaelides, Michalis P.; Haertel, Edward H. – Applied Measurement in Education, 2014
The standard error of equating quantifies the variability in the estimation of an equating function. Because common items for deriving equated scores are treated as fixed, the only source of variability typically considered arises from the estimation of common-item parameters from responses of samples of examinees. Use of alternative, equally…
Descriptors: Equated Scores, Test Items, Sampling, Statistical Inference
Kim, Sooyeon; Walker, Michael – Applied Measurement in Education, 2012
This study examined the appropriateness of the anchor composition in a mixed-format test, which includes both multiple-choice (MC) and constructed-response (CR) items, using subpopulation invariance indices. Linking functions were derived in the nonequivalent groups with anchor test (NEAT) design using two types of anchor sets: (a) MC only and (b)…
Descriptors: Multiple Choice Tests, Test Format, Test Items, Equated Scores

Zwick, Rebecca; Thayer, Dorothy T.; Mazzeo, John – Applied Measurement in Education, 1997
Differential item functioning (DIF) assessment procedures for items with more than two ordered score categories, referred to as polytomous items, were evaluated. Three descriptive statistics (standardized mean difference and two procedures based on the SIBTEST computer program) and five inferential procedures were used. Conditions under which the…
Descriptors: Item Bias, Research Methodology, Statistical Inference, Test Construction