ERIC - Search Results

Showing 1 to 15 of 77 results

The Role of Distributional Overlap on the Precision Gain of Bounds for Generalization

Peer reviewed

Chan, Wendy – American Journal of Evaluation, 2022

Over the past ten years, propensity score methods have made an important contribution to improving generalizations from studies that do not select samples randomly from a population of inference. However, these methods require assumptions and recent work has considered the role of bounding approaches that provide a range of treatment impact…

Descriptors: Probability, Scores, Scoring, Generalization

New Tests of Rater Drift in Trend Scoring

Peer reviewed

John R. Donoghue; Carol Eckerly – Applied Measurement in Education, 2024

Trend scoring constructed response items (i.e., rescoring Time A responses at Time B) gives rise to two-way data that follow a product multinomial distribution rather than the multinomial distribution that is usually assumed. Recent work has shown that the difference in sampling model can have profound negative effects on statistics usually used to…

Descriptors: Scoring, Error of Measurement, Reliability, Scoring Rubrics

Investigating Constructed-Response Scoring over Time: The Effects of Study Design on Trend Rescore Statistics. Research Report. ETS RR-22-15

Peer reviewed

Donoghue, John R.; McClellan, Catherine A.; Hess, Melinda R. – ETS Research Report Series, 2022

When constructed-response items are administered for a second time, it is necessary to evaluate whether the current Time B administration's raters have drifted from the scoring of the original administration at Time A. To study this, Time A papers are sampled and rescored by Time B scorers. Commonly the scores are compared using the proportion of…

Descriptors: Item Response Theory, Test Construction, Scoring, Testing

Methodological Reflections on PISA's Creative Thinking Assessment

Peer reviewed

Leslie Rutkowski; David Rutkowski – Journal of Creative Behavior, 2025

The Programme for International Student Assessment (PISA) introduced creative thinking as an innovative domain in 2022. This paper examines the unique methodological issues in international assessments and the implications of measuring creative thinking within PISA's framework, including stratified sampling, rotated form designs, and a distinct…

Descriptors: Creativity, Creative Thinking, Measurement, Sampling

Toward Education Quality Improvement in China: A Brief Overview of the National Assessment of Education Quality

Peer reviewed

Jiang, Yu; Zhang, Jiahui; Xin, Tao – Journal of Educational and Behavioral Statistics, 2019

This article is an overview of the National Assessment of Education Quality (NAEQ) of China in reading, mathematics, sciences, arts, physical education, and moral education at Grades 4 and 8. After a review of the background and history of NAEQ, we present the assessment framework with students' holistic development at the core and the design for…

Descriptors: Foreign Countries, Educational Quality, Educational Improvement, National Competency Tests

Applications of Small Area Estimation to Generalization with Subclassification by Propensity Scores

Peer reviewed

Chan, Wendy – Journal of Educational and Behavioral Statistics, 2018

Policymakers have grown increasingly interested in how experimental results may generalize to a larger population. However, recently developed propensity score-based methods are limited by small sample sizes, where the experimental study is generalized to a population that is at least 20 times larger. This is particularly problematic for methods…

Descriptors: Computation, Generalization, Probability, Sample Size

U.S. Technical Report and User Guide for the 2019 Trends in International Mathematics and Science Study (TIMSS). Part 1. NCES 2022-049

Peer reviewed

Egan, Laura; Tang, Judy H.; Ferraro, David; Erberber, Ebru; Tsokodayi, Yemurai; Stearns, Pat – National Center for Education Statistics, 2022

Trends in International Mathematics and Science Study (TIMSS) is an international comparative study designed to measure trends in mathematics and science achievement at grades 4 and 8, as well as to collect information about educational contexts (such as students' schools, teachers, and homes) that may be related to student achievement. TIMSS has…

Descriptors: Achievement Tests, Mathematics Achievement, International Assessment, Foreign Countries

How Flexible Is Your Data? A Comparative Analysis of Scoring Methodologies across Learning Platforms in the Context of Group Differentiation

Peer reviewed

Ostrow, Korinn S.; Wang, Yan; Heffernan, Neil T. – Journal of Learning Analytics, 2017

Data is flexible in that it is molded by not only the features and variables available to a researcher for analysis and interpretation, but also by how those features and variables are recorded and processed prior to evaluation. "Big Data" from online learning platforms and intelligent tutoring systems is no different. The work presented…

Descriptors: Data, Comparative Analysis, Scoring, Mathematics Skills

How Flexible Is Your Data? A Comparative Analysis of Scoring Methodologies across Learning Platforms in the Context of Group Differentiation

Peer reviewed

Ostrow, Korinn S.; Wang, Yan; Heffernan, Neil T. – Grantee Submission, 2017

Descriptors: Data, Comparative Analysis, Scoring, Mathematics Skills

Dependability of Data Derived from Time Sampling Methods with Multiple Observation Targets

Peer reviewed

Johnson, Austin H.; Chafouleas, Sandra M.; Briesch, Amy M. – School Psychology Quarterly, 2017

In this study, generalizability theory was used to examine the extent to which (a) time-sampling methodology, (b) number of simultaneous behavior targets, and (c) individual raters influenced variance in ratings of academic engagement for an elementary-aged student. Ten graduate-student raters, with an average of 7.20 hr of previous training in…

Descriptors: Generalizability Theory, Sampling, Elementary School Students, Learner Engagement

U.S. PIRLS and ePIRLS 2016 Technical Report and User's Guide. NCES 2019-113

Peer reviewed

Herget, Debbie; Dalton, Ben; Kinney, Saki; Smith, W. Zachary; Wilson, David; Rogers, Jim – National Center for Education Statistics, 2019

The Progress in International Reading Literacy Study (PIRLS) is an international comparative study of student performance in reading literacy at the fourth grade. PIRLS 2016 marks the fourth iteration of the study, which has been conducted every 5 years since 2001. New to the PIRLS assessment in 2016, ePIRLS provides a computer-based extension to…

Descriptors: Achievement Tests, Grade 4, Reading Achievement, Foreign Countries

Assessing Methods for Generalizing Experimental Impact Estimates to Target Populations

Peer reviewed

Kern, Holger L.; Stuart, Elizabeth A.; Hill, Jennifer; Green, Donald P. – Journal of Research on Educational Effectiveness, 2016

Randomized experiments are considered the gold standard for causal inference because they can provide unbiased estimates of treatment effects for the experimental participants. However, researchers and policymakers are often interested in using a specific experiment to inform decisions about other target populations. In education research,…

Descriptors: Educational Research, Generalization, Sampling, Participant Characteristics

Choice of Target Population Weights in Rater Comparability Scoring and Equating. Research Report. ETS RR-13-03

Peer reviewed

Puhan, Gautam – ETS Research Report Series, 2013

The purpose of this study was to demonstrate that the choice of sample weights when defining the target population under poststratification equating can be a critical factor in determining the accuracy of the equating results under a unique equating scenario, known as "rater comparability scoring and equating." The nature of data…

Descriptors: Scoring, Equated Scores, Sampling, Accuracy

Psychometrics in Support of a Valid Assessment of Linguistic Minorities: Implications for the Test and Sampling Designs

Peer reviewed

Oliveri, María Elena; von Davier, Alina A. – International Journal of Testing, 2016

In this study, we propose that the unique needs and characteristics of linguistic minorities should be considered throughout the test development process. Unlike most measurement invariance investigations in the assessment of linguistic minorities, which typically are conducted after test administration, we propose strategies that focus on the…

Descriptors: Psychometrics, Linguistics, Test Construction, Testing

The Impact of Sampling Approach on Population Invariance in Automated Scoring of Essays. Research Report. ETS RR-13-18

Peer reviewed

Zhang, Mo – ETS Research Report Series, 2013

Many testing programs use automated scoring to grade essays. One issue in automated essay scoring that has not been examined adequately is population invariance and its causes. The primary purpose of this study was to investigate the impact of sampling in model calibration on population invariance of automated scores. This study analyzed scores…

Descriptors: Automation, Scoring, Essay Tests, Sampling
