ERIC - Search Results

Showing all 8 results

Applying Multiphase Sampling to Selecting Testlets with Constraints on Item Blocks. Research Report. ETS RR-19-03

Peer reviewed

Qian, Jiahe; Gu, Lixiong; Li, Shuhong – ETS Research Report Series, 2019

In assembling testlets (i.e., test forms) with a pool of new and used item blocks, test security is one of the main issues of concern. Strict constraints are often imposed on repeated usage of the same item blocks. Nevertheless, for an assessment administering multiple testlets, a goal is to select as large a sample of testlets as possible. In…

Descriptors: Test Construction, Sampling, Test Items, Mathematics

Model Adequacy Checking for Applying Harmonic Regression to Assessment Quality Control. Research Report. ETS RR-21-13

Peer reviewed

Qian, Jiahe; Li, Shuhong – ETS Research Report Series, 2021

In recent years, harmonic regression models have been applied to implement quality control for educational assessment data consisting of multiple administrations and displaying seasonality. As with other types of regression models, it is imperative that model adequacy checking and model fit be appropriately conducted. However, there has been no…

Descriptors: Models, Regression (Statistics), Language Tests, Quality Control

Variance Estimation with Complex Data and Finite Population Correction--A Paradigm for Comparing Jackknife and Formula-Based Methods for Variance Estimation. Research Report. ETS RR-20-11

Peer reviewed

Qian, Jiahe – ETS Research Report Series, 2020

The finite population correction (FPC) factor is often used to adjust variance estimators for survey data sampled from a finite population without replacement. As a replicated resampling approach, the jackknife approach is usually implemented without the FPC factor incorporated in its variance estimates. A paradigm is proposed to compare the…

Descriptors: Computation, Sampling, Data, Statistical Analysis

Applying the Hájek Approach in Formula-Based Variance Estimation. Research Report. ETS RR-17-24

Peer reviewed

Qian, Jiahe – ETS Research Report Series, 2017

The variance formula derived for a two-stage sampling design without replacement employs the joint inclusion probabilities in the first-stage selection of clusters. One of the difficulties encountered in data analysis is the lack of information about such joint inclusion probabilities. One way to solve this issue is by applying Hájek's…

Descriptors: Mathematical Formulas, Computation, Sampling, Research Design

Weighting Test Samples in IRT Linking and Equating: Toward an Improved Sampling Design for Complex Equating. Research Report. ETS RR-13-39

Peer reviewed

Qian, Jiahe; Jiang, Yanming; von Davier, Alina A. – ETS Research Report Series, 2013

Several factors could cause variability in item response theory (IRT) linking and equating procedures, such as the variability across examinee samples and/or test items, seasonality, regional differences, native language diversity, gender, and other demographic variables. Hence, the following question arises: Is it possible to select optimal…

Descriptors: Item Response Theory, Test Items, Sampling, True Scores

Jackknifing Techniques for Evaluation of Equating Accuracy. Research Report. ETS RR-09-39

Peer reviewed

Haberman, Shelby J.; Lee, Yi-Hsuan; Qian, Jiahe – ETS Research Report Series, 2009

Grouped jackknifing may be used to evaluate the stability of equating procedures with respect to sampling error and with respect to changes in anchor selection. Properties of grouped jackknifing are reviewed for simple-random and stratified sampling, and its use is described for comparisons of anchor sets. Application is made to examples of item…

Descriptors: Equated Scores, Accuracy, Sampling, Statistical Analysis

Mapping State Standards to the NAEP Scale. Research Report. ETS RR-08-57

Peer reviewed

Braun, Henry; Qian, Jiahe – ETS Research Report Series, 2008

This report describes the derivation and evaluation of a method for comparing the performance standards for public school students set by different states. It is based on an approach proposed by McLaughlin and associates, which constituted an innovative attempt to resolve the confusion and concern that occurs when very different proportions of…

Descriptors: State Standards, Comparative Analysis, Public Schools, National Competency Tests

Weighting Procedures and the Cluster Forming Algorithm for Delete-k Jackknife Variance Estimation for Institutional Surveys. Research Report. ETS RR-06-15

Peer reviewed

Qian, Jiahe – ETS Research Report Series, 2006

Weighting and variance estimation are two statistical issues involved in survey data analysis for large-scale assessment programs such as the Higher Education Information and Communication Technology (ICT) Literacy Assessment. Because survey data are always acquired by probability sampling, to draw unbiased or almost unbiased inferences for the…

Descriptors: Weighted Scores, Sampling, Statistical Analysis, Higher Education