ERIC - Search Results

Publication Date

In 2025	1
Since 2024	1
Since 2021 (last 5 years)	8
Since 2016 (last 10 years)	27
Since 2006 (last 20 years)	44

Descriptor

Achievement Tests	104
Scaling	104
Scores	33
Academic Achievement	30
Elementary Secondary Education	30
Item Response Theory	26
Foreign Countries	24
Test Construction	24
Scoring	23
Test Items	21
Test Validity	21
Testing Programs	21
Mathematics Tests	20
Equated Scores	18
International Assessment	18
Standardized Tests	18
Latent Trait Theory	17
Test Interpretation	16
Item Analysis	14
Grade 4	13
Reading Tests	13
Statistical Analysis	13
Comparative Analysis	12
Educational Assessment	12
Psychometrics	12

Publication Type

Journal Articles	44
Reports - Research	40
Reports - Evaluative	30
Speeches/Meeting Papers	23
Reports - Descriptive	15
Numerical/Quantitative Data	13
Opinion Papers	9
Tests/Questionnaires	5
Collected Works - General	3
Books	2
Guides - Non-Classroom	2
Information Analyses	2
Dissertations/Theses -…	1
Guides - General	1

Education Level

Secondary Education	20
Elementary Secondary Education	15
Elementary Education	14
Grade 4	14
Grade 8	10
Intermediate Grades	10
Middle Schools	8
Junior High Schools	7
Grade 3	6
Grade 6	6
Grade 7	6
Grade 5	5
Early Childhood Education	4
Primary Education	4
Higher Education	2
Postsecondary Education	2
Grade 1	1
Grade 10	1
Grade 2	1
Grade 9	1
High Schools	1

Audience

Researchers	6
Practitioners	1

Location

New York	4
Australia	3
Asia	2
Europe	2
Florida	2
Michigan	2
United States	2
Canada	1
China	1
Germany	1
Illinois	1
Japan	1
Macau	1
New Zealand	1
North America	1
Oregon (Portland)	1
South America	1
Tennessee	1
Texas	1
Trinidad and Tobago	1

Laws, Policies, & Programs

Education Consolidation…	1
Individuals with Disabilities…	1

What Works Clearinghouse Rating

Does not meet standards

Showing 1 to 15 of 104 results

Mean Comparisons of Many Groups in the Presence of DIF: An Evaluation of Linking and Concurrent Scaling Approaches

Peer reviewed

Robitzsch, Alexander; Lüdtke, Oliver – Journal of Educational and Behavioral Statistics, 2022

One of the primary goals of international large-scale assessments in education is the comparison of country means in student achievement. This article introduces a framework for discussing differential item functioning (DIF) for such mean comparisons. We compare three different linking methods: concurrent scaling based on full invariance,…

Descriptors: Test Bias, International Assessment, Scaling, Comparative Analysis

Reassessing Weights in Large-Scale Assessments and Multilevel Models

Peer reviewed

Umut Atasever; Francis L. Huang; Leslie Rutkowski – Large-scale Assessments in Education, 2025

When analyzing large-scale assessments (LSAs) that use complex sampling designs, it is important to account for probability sampling using weights. However, the use of these weights in multilevel models has been widely debated, particularly regarding their application at different levels of the model. Yet, no consensus has been reached on the best…

Descriptors: Mathematics Tests, International Assessment, Elementary Secondary Education, Foreign Countries

Practical Significance of Item Misfit and Its Manifestations in Constructs Assessed in Large-Scale Studies

Peer reviewed

Fährmann, Katharina; Köhler, Carmen; Hartig, Johannes; Heine, Jörg-Henrik – Large-scale Assessments in Education, 2022

When scaling psychological tests with methods of item response theory it is necessary to investigate to what extent the responses correspond to the model predictions. In addition to the statistical evaluation of item misfit, the question arises as to its practical significance. Although item removal is undesirable for several reasons, its…

Descriptors: Psychological Testing, Scaling, Test Items, Item Response Theory

Sampling Weights in Multilevel Modelling: An Investigation Using PISA Sampling Structures

Peer reviewed

Mang, Julia; Küchenhoff, Helmut; Meinck, Sabine; Prenzel, Manfred – Large-scale Assessments in Education, 2021

Background: Standard methods for analysing data from large-scale assessments (LSA) cannot merely be adopted if hierarchical (or multilevel) regression modelling should be applied. Currently various approaches exist; they all follow generally a design-based model of estimation using the pseudo maximum likelihood method and adjusted weights for the…

Descriptors: Sampling, Hierarchical Linear Modeling, Simulation, Scaling

A Comparison of Methodologies for Scaling Longitudinal Social-Emotional Survey Responses

Peer reviewed

Soland, James; Kuhfeld, Megan; Register, Brennan – Educational Assessment, 2023

Much of what we know about how children develop is based on survey data. In order to estimate growth across time and, thereby, better understand that development, short survey scales are typically administered at repeated timepoints. Before estimating growth, those repeated measures must be put onto the same scale. Yet, little research examines…

Descriptors: Comparative Analysis, Social Emotional Learning, Scaling, Effect Size

Comparing Different Trend Estimation Approaches in Country Means and Standard Deviations in International Large-Scale Assessment Studies

Peer reviewed

Robitzsch, Alexander; Lüdtke, Oliver – Large-scale Assessments in Education, 2023

One major aim of international large-scale assessments (ILSA) like PISA is to monitor changes in student performance over time. To accomplish this task, a set of common items (i.e., link items) is repeatedly administered in each assessment. Linking methods based on item response theory (IRT) models are used to align the results from the different…

Descriptors: Educational Trends, Trend Analysis, International Assessment, Achievement Tests

Conditioning: How Background Variables Can Influence PISA Scores

Peer reviewed

Zieger, Laura Raffaella; Jerrim, J.; Anders, J.; Shure, N. – Assessment in Education: Principles, Policy & Practice, 2022

The OECD's Programme for International Student Assessment (PISA) has become one of the key studies for evidence-based education policymaking across the globe. PISA has however received a lot of methodological criticism, including how the test scores are created. The aim of this paper is to investigate the so-called 'conditioning model', where…

Descriptors: Foreign Countries, Achievement Tests, International Assessment, Secondary School Students

Validation Methods for Aggregate-Level Test Scale Linking: A Case Study Mapping School District Test Score Distributions to a Common Scale. CEPA Working Paper No. 16-09

Reardon, Sean F.; Ho, Andrew D.; Kalogrides, Demetra – Stanford Center for Education Policy Analysis, 2019

Linking score scales across different tests is considered speculative and fraught, even at the aggregate level (Feuer et al., 1999; Thissen, 2007). We introduce and illustrate validation methods for aggregate linkages, using the challenge of linking U.S. school district average test scores across states as a motivating example. We show that…

Descriptors: Test Validity, Evaluation Methods, School Districts, Scores

Reconsidering the Psychometrics of the GRS-S: Evidence for Parsimony in Measurement

Peer reviewed

Petscher, Yaacov; Pfeiffer, Steven I. – Assessment for Effective Intervention, 2020

The authors evaluated measurement-level, factor-level, item-level, and scale-level revisions to the "Gifted Rating Scales-School Form" (GRS-S). Measurement-level considerations tested the extent to which treating the Likert-type scale rating as categorical or continuous produced different fit across unidimensional, correlated trait, and…

Descriptors: Psychometrics, Academically Gifted, Rating Scales, Factor Structure

Does Early Tracking Affect Learning Inequalities? Revisiting Difference-in-Differences Modeling Strategies with International Assessments

Peer reviewed

Contini, Dalit; Cugnata, Federica – Large-scale Assessments in Education, 2020

The development of international surveys on children's learning like PISA, PIRLS and TIMSS--delivering comparable achievement measures across educational systems--has revealed large cross-country variability in average performance and in the degree of inequality across social groups. A key question is whether and how institutional differences…

Descriptors: International Assessment, Achievement Tests, Scores, Family Characteristics

U.S. Technical Report and User Guide for the 2019 Trends in International Mathematics and Science Study (TIMSS). Part 1. NCES 2022-049

Peer reviewed

Egan, Laura; Tang, Judy H.; Ferraro, David; Erberber, Ebru; Tsokodayi, Yemurai; Stearns, Pat – National Center for Education Statistics, 2022

Trends in International Mathematics and Science Study (TIMSS) is an international comparative study designed to measure trends in mathematics and science achievement at grades 4 and 8, as well as to collect information about educational contexts (such as students' schools, teachers, and homes) that may be related to student achievement. TIMSS has…

Descriptors: Achievement Tests, Mathematics Achievement, International Assessment, Foreign Countries

How Robust Are Cross-Country Comparisons of PISA Scores to the Scaling Model Used?

Peer reviewed

Jerrim, John; Parker, Philip; Choi, Alvaro; Chmielewski, Anna Katyn; Sälzer, Christine; Shure, Nikki – Educational Measurement: Issues and Practice, 2018

The Programme for International Student Assessment (PISA) is an important international study of 15-year-olds' knowledge and skills. New results are released every 3 years, and have a substantial impact upon education policy. Yet, despite its influence, the methodology underpinning PISA has received significant criticism. Much of this criticism has…

Descriptors: Educational Assessment, Comparative Education, Achievement Tests, Foreign Countries

Distributional Properties of the PIRLS-Home Resource for Learning Scale and Observed Effects on Reading Achievement: Are Measurements of Educational Inequalities by Latent Indices without Bias?

Peer reviewed

Walzebug, Anke; Kasper, Daniel – Assessment in Education: Principles, Policy & Practice, 2018

In "Progress in International Reading Literacy Study" (PIRLS) educational inequalities are measured, amongst others, through the relationship between students' reading achievements and the home resource for learning (HRL) scale. By applying the partial credit model and using the WLE estimates for the person parameters it is accepted that…

Descriptors: Grade 4, Achievement Tests, Foreign Countries, International Assessment

U.S. PIRLS and ePIRLS 2016 Technical Report and User's Guide. NCES 2019-113

Peer reviewed

Herget, Debbie; Dalton, Ben; Kinney, Saki; Smith, W. Zachary; Wilson, David; Rogers, Jim – National Center for Education Statistics, 2019

The Progress in International Reading Literacy Study (PIRLS) is an international comparative study of student performance in reading literacy at the fourth grade. PIRLS 2016 marks the fourth iteration of the study, which has been conducted every 5 years since 2001. New to the PIRLS assessment in 2016, ePIRLS provides a computer-based extension to…

Descriptors: Achievement Tests, Grade 4, Reading Achievement, Foreign Countries

Cross-Cultural Comparability of Noncognitive Constructs in TIMSS and PISA

Peer reviewed

He, Jia; Barrera-Pedemonte, Fabián; Buchholz, Janine – Assessment in Education: Principles, Policy & Practice, 2019

Noncognitive assessments in Programme for International Student Assessment (PISA) and Trends in International Mathematics and Science Study share certain similarities and provide complementary information, yet their comparability is seldom checked and convergence not sought. We made use of student self-report data of Instrumental Motivation,…

Descriptors: Foreign Countries, Secondary School Students, International Assessment, Elementary Secondary Education

Source

Educational Measurement:…	9
Journal of Educational…	6
Journal of Educational and…	5
Large-scale Assessments in…	5
Applied Measurement in…	4
Assessment in Education:…	3
International Association for…	3
New York State Education…	3
Educational and Psychological…	2
National Center for Education…	2
Online Submission	2
ACT, Inc.	1
Assessment for Effective…	1
Consortium on Chicago School…	1
Educational Assessment	1
Evaluation in Education: An…	1
Journal of Psychoeducational…	1
Journal of Research in…	1
Language Assessment Quarterly	1
Language Testing	1
Measurement:…	1
Ministerial Council on…	1
National Center for Education…	1
OECD Publishing (NJ1)	1
ProQuest LLC	1

Author

Yen, Wendy M.	5
Hoover, H. D.	3
Ainley, John	2
Canner, Jane M.	2
Choppin, Bruce	2
Cook, Linda L.	2
Forster, Fred	2
Forsyth, Robert A.	2
Fraillon, Julian	2
Jiao, Hong	2
Lenke, Joanne M.	2
Linn, Robert L.	2
Lüdtke, Oliver	2
Martin, Michael O.	2
Martin, Michael O., Ed.	2
Mullis, Ina V. S.	2
Mullis, Ina V. S., Ed.	2
Phillips, S. E.	2
Price, Gary G.	2
Robitzsch, Alexander	2
Wang, Shudong	2
Abedi, Jamal	1
Allensworth, Elaine M.	1
Anders, J.	1

Assessments and Surveys

Program for International…	12
National Assessment of…	9
Comprehensive Tests of Basic…	6
Trends in International…	6
Progress in International…	5
Stanford Achievement Tests	5
Iowa Tests of Basic Skills	4
SAT (College Admission Test)	4
College Board Achievement…	3
Metropolitan Achievement Tests	3
Florida Comprehensive…	2
North Carolina End of Course…	2
ACT Assessment	1
ACT Interest Inventory	1
Graduate Record Examinations	1
Iowa Tests of Educational…	1
Kaufman Test of Educational…	1
Sequential Tests of…	1
Stanford Diagnostic Reading…	1
Stanford Early School…	1
Texas Assessment of Academic…	1
Texas Educational Assessment…	1
Wechsler Individual…	1
Wechsler Intelligence Scale…	1