Robitzsch, Alexander; Lüdtke, Oliver – Journal of Educational and Behavioral Statistics, 2022
One of the primary goals of international large-scale assessments in education is the comparison of country means in student achievement. This article introduces a framework for discussing differential item functioning (DIF) for such mean comparisons. We compare three different linking methods: concurrent scaling based on full invariance,…
Descriptors: Test Bias, International Assessment, Scaling, Comparative Analysis
Umut Atasever; Francis L. Huang; Leslie Rutkowski – Large-scale Assessments in Education, 2025
When analyzing large-scale assessments (LSAs) that use complex sampling designs, it is important to account for probability sampling using weights. However, the use of these weights in multilevel models has been widely debated, particularly regarding their application at different levels of the model. Yet, no consensus has been reached on the best…
Descriptors: Mathematics Tests, International Assessment, Elementary Secondary Education, Foreign Countries
Fährmann, Katharina; Köhler, Carmen; Hartig, Johannes; Heine, Jörg-Henrik – Large-scale Assessments in Education, 2022
When scaling psychological tests with methods of item response theory it is necessary to investigate to what extent the responses correspond to the model predictions. In addition to the statistical evaluation of item misfit, the question arises as to its practical significance. Although item removal is undesirable for several reasons, its…
Descriptors: Psychological Testing, Scaling, Test Items, Item Response Theory
Mang, Julia; Küchenhoff, Helmut; Meinck, Sabine; Prenzel, Manfred – Large-scale Assessments in Education, 2021
Background: Standard methods for analysing data from large-scale assessments (LSA) cannot merely be adopted if hierarchical (or multilevel) regression modelling should be applied. Currently various approaches exist; they all follow generally a design-based model of estimation using the pseudo maximum likelihood method and adjusted weights for the…
Descriptors: Sampling, Hierarchical Linear Modeling, Simulation, Scaling
Soland, James; Kuhfeld, Megan; Register, Brennan – Educational Assessment, 2023
Much of what we know about how children develop is based on survey data. In order to estimate growth across time and, thereby, better understand that development, short survey scales are typically administered at repeated timepoints. Before estimating growth, those repeated measures must be put onto the same scale. Yet, little research examines…
Descriptors: Comparative Analysis, Social Emotional Learning, Scaling, Effect Size
Robitzsch, Alexander; Lüdtke, Oliver – Large-scale Assessments in Education, 2023
One major aim of international large-scale assessments (ILSA) like PISA is to monitor changes in student performance over time. To accomplish this task, a set of common items (i.e., link items) is repeatedly administered in each assessment. Linking methods based on item response theory (IRT) models are used to align the results from the different…
Descriptors: Educational Trends, Trend Analysis, International Assessment, Achievement Tests
Zieger, Laura Raffaella; Jerrim, J.; Anders, J.; Shure, N. – Assessment in Education: Principles, Policy & Practice, 2022
The OECD's Programme for International Student Assessment (PISA) has become one of the key studies for evidence-based education policymaking across the globe. PISA has, however, received a lot of methodological criticism, including how the test scores are created. The aim of this paper is to investigate the so-called 'conditioning model', where…
Descriptors: Foreign Countries, Achievement Tests, International Assessment, Secondary School Students
Reardon, Sean F.; Ho, Andrew D.; Kalogrides, Demetra – Stanford Center for Education Policy Analysis, 2019
Linking score scales across different tests is considered speculative and fraught, even at the aggregate level (Feuer et al., 1999; Thissen, 2007). We introduce and illustrate validation methods for aggregate linkages, using the challenge of linking U.S. school district average test scores across states as a motivating example. We show that…
Descriptors: Test Validity, Evaluation Methods, School Districts, Scores
Petscher, Yaacov; Pfeiffer, Steven I. – Assessment for Effective Intervention, 2020
The authors evaluated measurement-level, factor-level, item-level, and scale-level revisions to the "Gifted Rating Scales-School Form" (GRS-S). Measurement-level considerations tested the extent to which treating the Likert-type scale rating as categorical or continuous produced different fit across unidimensional, correlated trait, and…
Descriptors: Psychometrics, Academically Gifted, Rating Scales, Factor Structure
Contini, Dalit; Cugnata, Federica – Large-scale Assessments in Education, 2020
The development of international surveys on children's learning like PISA, PIRLS and TIMSS--delivering comparable achievement measures across educational systems--has revealed large cross-country variability in average performance and in the degree of inequality across social groups. A key question is whether and how institutional differences…
Descriptors: International Assessment, Achievement Tests, Scores, Family Characteristics
Egan, Laura; Tang, Judy H.; Ferraro, David; Erberber, Ebru; Tsokodayi, Yemurai; Stearns, Pat – National Center for Education Statistics, 2022
Trends in International Mathematics and Science Study (TIMSS) is an international comparative study designed to measure trends in mathematics and science achievement at grades 4 and 8, as well as to collect information about educational contexts (such as students' schools, teachers, and homes) that may be related to student achievement. TIMSS has…
Descriptors: Achievement Tests, Mathematics Achievement, International Assessment, Foreign Countries
Jerrim, John; Parker, Philip; Choi, Alvaro; Chmielewski, Anna Katyn; Sälzer, Christine; Shure, Nikki – Educational Measurement: Issues and Practice, 2018
The Programme for International Student Assessment (PISA) is an important international study of 15-year-olds' knowledge and skills. New results are released every 3 years, and have a substantial impact upon education policy. Yet, despite its influence, the methodology underpinning PISA has received significant criticism. Much of this criticism has…
Descriptors: Educational Assessment, Comparative Education, Achievement Tests, Foreign Countries
Walzebug, Anke; Kasper, Daniel – Assessment in Education: Principles, Policy & Practice, 2018
In "Progress in International Reading Literacy Study" (PIRLS) educational inequalities are measured, among other ways, through the relationship between students' reading achievements and the home resource for learning (HRL) scale. By applying the partial credit model and using the WLE estimates for the person parameters it is accepted that…
Descriptors: Grade 4, Achievement Tests, Foreign Countries, International Assessment
Herget, Debbie; Dalton, Ben; Kinney, Saki; Smith, W. Zachary; Wilson, David; Rogers, Jim – National Center for Education Statistics, 2019
The Progress in International Reading Literacy Study (PIRLS) is an international comparative study of student performance in reading literacy at the fourth grade. PIRLS 2016 marks the fourth iteration of the study, which has been conducted every 5 years since 2001. New to the PIRLS assessment in 2016, ePIRLS provides a computer-based extension to…
Descriptors: Achievement Tests, Grade 4, Reading Achievement, Foreign Countries
He, Jia; Barrera-Pedemonte, Fabián; Buchholz, Janine – Assessment in Education: Principles, Policy & Practice, 2019
Noncognitive assessments in Programme for International Student Assessment (PISA) and Trends in International Mathematics and Science Study share certain similarities and provide complementary information, yet their comparability is seldom checked and convergence not sought. We made use of student self-report data of Instrumental Motivation,…
Descriptors: Foreign Countries, Secondary School Students, International Assessment, Elementary Secondary Education