ERIC - Search Results

Showing all 6 results

The Role of Distributional Overlap on the Precision Gain of Bounds for Generalization

Peer reviewed

Chan, Wendy – American Journal of Evaluation, 2022

Over the past ten years, propensity score methods have made an important contribution to improving generalizations from studies that do not select samples randomly from a population of inference. However, these methods require assumptions and recent work has considered the role of bounding approaches that provide a range of treatment impact…

Descriptors: Probability, Scores, Scoring, Generalization

Applications of Small Area Estimation to Generalization with Subclassification by Propensity Scores

Peer reviewed

Chan, Wendy – Journal of Educational and Behavioral Statistics, 2018

Policymakers have grown increasingly interested in how experimental results may generalize to a larger population. However, recently developed propensity score-based methods are limited by small sample sizes, where the experimental study is generalized to a population that is at least 20 times larger. This is particularly problematic for methods…

Descriptors: Computation, Generalization, Probability, Sample Size

U.S. PIRLS and ePIRLS 2016 Technical Report and User's Guide. NCES 2019-113

Peer reviewed

Herget, Debbie; Dalton, Ben; Kinney, Saki; Smith, W. Zachary; Wilson, David; Rogers, Jim – National Center for Education Statistics, 2019

The Progress in International Reading Literacy Study (PIRLS) is an international comparative study of student performance in reading literacy at the fourth grade. PIRLS 2016 marks the fourth iteration of the study, which has been conducted every 5 years since 2001. New to the PIRLS assessment in 2016, ePIRLS provides a computer-based extension to…

Descriptors: Achievement Tests, Grade 4, Reading Achievement, Foreign Countries

Methods of Linking with Small Samples in a Common-Item Design: An Empirical Comparison. Research Report. ETS RR-09-38

Peer reviewed

Kim, Sooyeon; Livingston, Samuel A. – ETS Research Report Series, 2009

A series of resampling studies was conducted to compare the accuracy of equating in a common item design using four different methods: chained equipercentile equating of smoothed distributions, chained linear equating, chained mean equating, and the circle-arc method. Four operational test forms, each containing more than 100 items, were used for…

Descriptors: Sampling, Sample Size, Accuracy, Test Items

PIAAC Technical Standards and Guidelines

OECD Publishing, 2014

The Programme for International Assessment of Adult Competencies (PIAAC) will establish technical standards and guidelines to ensure that the survey design and implementation processes of PIAAC yield high-quality and internationally comparable data. This document provides a revised version of the technical standards and guidelines originally…

Descriptors: Adults, International Assessment, Adult Literacy, Competence

When Inter-Rater Reliability Is Obtained from Only Part of a Sample.

Fan, Xitao; Chen, Michael – 1999

It is erroneous to extend or generalize the inter-rater reliability coefficient estimated from only a (small) proportion of the sample to the rest of the sample data where only one rater is used for scoring, although such generalization is often made implicitly in practice. It is shown that if inter-rater reliability estimate from part of a sample…

Descriptors: Estimation (Mathematics), Generalizability Theory, Interrater Reliability, Sample Size