Showing all 11 results
Peer reviewed
Maritza Casas; Stephen G. Sireci – International Journal of Testing, 2025
In this study, we take a critical look at the degree to which the measurement of bullying and sense of belonging at school is invariant across groups of students defined by immigrant status. Our study focuses on the invariance of these constructs as measured on a recent PISA administration and includes a discussion of two statistical methods for…
Descriptors: Error of Measurement, Immigrants, Peer Groups, Bullying
Peer reviewed
Wang, Ze – Large-scale Assessments in Education, 2022
In educational and psychological research, it is common to use latent factors to represent constructs and then to examine covariate effects on these latent factors. Using empirical data, this study applied three approaches to covariate effects on latent factors: the multiple-indicator multiple-cause (MIMIC) approach, multiple group confirmatory…
Descriptors: Comparative Analysis, Evaluation Methods, Grade 8, Mathematics Achievement
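The MIMIC (multiple-indicator multiple-cause) approach named in this abstract is normally fit with SEM software; as a rough numpy-only sketch, the idea can be illustrated on simulated data by forming a crude factor-score proxy from the indicators and regressing it on the covariate. All numbers below are simulated, and the two-step shortcut is a stand-in for full MIMIC estimation, not the study's method:

```python
import numpy as np

rng = np.random.default_rng(0)
n = 5000

# Simulate MIMIC-style data: a covariate (e.g. group membership) shifts a
# latent factor, which in turn drives three observed indicators.
x = rng.binomial(1, 0.5, n)            # covariate
eta = 0.5 * x + rng.normal(0, 1, n)    # latent factor; true covariate effect = 0.5
loadings = np.array([0.8, 0.7, 0.6])
indicators = eta[:, None] * loadings + rng.normal(0, 0.5, (n, 3))

# Crude two-step stand-in for the structural part of a MIMIC model:
# average the standardized indicators into a factor-score proxy, then
# regress the proxy on the covariate with OLS.
z = (indicators - indicators.mean(0)) / indicators.std(0)
proxy = z.mean(axis=1)
X = np.column_stack([np.ones(n), x])
beta = np.linalg.lstsq(X, proxy, rcond=None)[0]
print(beta[1])  # attenuated estimate of the 0.5 covariate effect
```

The attenuation visible here (the proxy regression recovers less than the true 0.5) is exactly why the abstract's comparison of MIMIC against multiple-group CFA matters: a proper latent-variable model separates the covariate effect from indicator measurement error.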
Peer reviewed
Robert Meyer; Sara Hu; Michael Christian – Society for Research on Educational Effectiveness, 2022
This paper develops models to measure growth in student achievement, with a focus on the possibility of differential growth in achievement for low- and high-achieving students. We consider a gap-closing model that evaluates the degree to which students in a target group -- students in the bottom quartile of measured achievement -- perform better…
Descriptors: Academic Achievement, Achievement Gap, Models, Measurement Techniques
Gulsah Gurkan – ProQuest LLC, 2021
Secondary analyses of international large-scale assessments (ILSA) commonly characterize relationships between variables of interest using correlations. However, the accuracy of correlation estimates is impaired by artefacts such as measurement error and clustering. Despite advancements in methodology, conventional correlation estimates or…
Descriptors: Secondary School Students, Achievement Tests, International Assessment, Foreign Countries
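The measurement-error artefact this abstract refers to is the classical attenuation of correlations, for which Spearman's disattenuation correction is the textbook remedy. A minimal simulated illustration (all reliabilities and numbers hypothetical):

```python
import numpy as np

def disattenuate(r_obs, rel_x, rel_y):
    """Spearman's correction for attenuation: divide the observed
    correlation by the square root of the product of the reliabilities."""
    return r_obs / np.sqrt(rel_x * rel_y)

# Simulate the artefact: add measurement error to two correlated true scores.
rng = np.random.default_rng(1)
n = 200_000
true_x = rng.normal(0, 1, n)
true_y = 0.6 * true_x + rng.normal(0, np.sqrt(1 - 0.36), n)  # true r = 0.6
obs_x = true_x + rng.normal(0, 0.5, n)  # error var 0.25 -> reliability 0.8
obs_y = true_y + rng.normal(0, 0.5, n)

r_obs = np.corrcoef(obs_x, obs_y)[0, 1]        # ~ 0.6 * 0.8 = 0.48, attenuated
r_corrected = disattenuate(r_obs, 0.8, 0.8)    # ~ 0.6, the true correlation
print(round(r_obs, 2), round(r_corrected, 2))
```

The correction assumes the reliabilities are known; the abstract's point is that in ILSA secondary analyses such artefacts (plus clustering) are often left uncorrected.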
Reardon, Sean F.; Ho, Andrew D.; Kalogrides, Demetra – Stanford Center for Education Policy Analysis, 2019
Linking score scales across different tests is considered speculative and fraught, even at the aggregate level (Feuer et al., 1999; Thissen, 2007). We introduce and illustrate validation methods for aggregate linkages, using the challenge of linking U.S. school district average test scores across states as a motivating example. We show that…
Descriptors: Test Validity, Evaluation Methods, School Districts, Scores
Peer reviewed
Grund, Simon; Lüdtke, Oliver; Robitzsch, Alexander – Journal of Educational and Behavioral Statistics, 2021
Large-scale assessments (LSAs) use Mislevy's "plausible value" (PV) approach to relate student proficiency to noncognitive variables administered in a background questionnaire. This method requires background variables to be completely observed, a requirement that is seldom fulfilled. In this article, we evaluate and compare the…
Descriptors: Data Analysis, Error of Measurement, Research Problems, Statistical Inference
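The plausible-value workflow this abstract builds on runs each analysis once per plausible value and then pools the results with Rubin's combining rules. A minimal sketch of the pooling step (the coefficient and variance inputs below are made up for illustration):

```python
import numpy as np

def pool_plausible_values(estimates, variances):
    """Combine per-PV point estimates and sampling variances with Rubin's
    rules: the pooled estimate is the mean, and the total variance adds
    the between-imputation variance inflated by (1 + 1/M)."""
    estimates = np.asarray(estimates, dtype=float)
    variances = np.asarray(variances, dtype=float)
    m = len(estimates)
    qbar = estimates.mean()        # pooled point estimate
    ubar = variances.mean()        # average within-imputation variance
    b = estimates.var(ddof=1)      # between-imputation variance
    total_var = ubar + (1 + 1 / m) * b
    return qbar, np.sqrt(total_var)

# Hypothetical regression coefficient estimated once per plausible value.
est, se = pool_plausible_values([0.42, 0.45, 0.40, 0.44, 0.43],
                                [0.010, 0.011, 0.010, 0.012, 0.011])
print(round(est, 3), round(se, 3))  # -> 0.428 0.106
```

Note that this pooling step presupposes the PVs were drawn conditional on fully observed background variables, which is precisely the requirement the article examines relaxing.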
Peer reviewed
Pokropek, Artur – Sociological Methods & Research, 2015
This article combines statistical and applied research perspectives to show problems that can arise when measurement error is ignored in multilevel compositional effects analysis. It focuses on data in which the independent variables are constructed measures. Simulation studies are conducted evaluating methods that could overcome the…
Descriptors: Error of Measurement, Hierarchical Linear Modeling, Simulation, Evaluation Methods
Reardon, Sean F.; Kalogrides, Demetra; Ho, Andrew D. – Stanford Center for Education Policy Analysis, 2017
There is no comprehensive database of U.S. district-level test scores that is comparable across states. We describe and evaluate a method for constructing such a database. First, we estimate linear, reliability-adjusted linking transformations from state test score scales to the scale of the National Assessment of Educational Progress (NAEP). We…
Descriptors: School Districts, Scores, Statistical Distributions, Database Design
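The linear, reliability-adjusted linking described in this abstract can be sketched as a moment-matching transformation in which each scale's standard deviation is shrunk by the square root of its reliability before the slope is formed. This is one hedged reading of "reliability-adjusted," with hypothetical moments, not the exact SEDA procedure:

```python
import numpy as np

def link_scores(state_scores, state_mean, state_sd, state_rel,
                naep_mean, naep_sd, naep_rel=1.0):
    """Hypothetical linear linking of state-scale scores onto the NAEP
    scale. Each scale's SD is multiplied by the square root of its
    reliability so the transformation matches the moments of the
    (error-free) true scores rather than the error-inflated observed ones."""
    a = (naep_sd * np.sqrt(naep_rel)) / (state_sd * np.sqrt(state_rel))
    b = naep_mean - a * state_mean
    return a * np.asarray(state_scores, dtype=float) + b

# Illustrative moments only: a state scale centered at 500 (SD 50,
# reliability 0.9) mapped onto a NAEP-like scale centered at 250 (SD 35).
linked = link_scores([450.0, 500.0, 550.0],
                     state_mean=500.0, state_sd=50.0, state_rel=0.9,
                     naep_mean=250.0, naep_sd=35.0)
print(linked)  # the state mean of 500 maps exactly to the NAEP mean of 250
```

Without the reliability adjustment, measurement error in the state scale would inflate `state_sd` and flatten the slope, compressing the linked score distribution.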
Peer reviewed
Sachse, Karoline A.; Roppelt, Alexander; Haag, Nicole – Journal of Educational Measurement, 2016
Trend estimation in international comparative large-scale assessments relies on measurement invariance between countries. However, cross-national differential item functioning (DIF) has been repeatedly documented. We ran a simulation study using national item parameters, which required trends to be computed separately for each country, to compare…
Descriptors: Comparative Analysis, Measurement, Test Bias, Simulation
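The cross-national DIF at issue in this abstract can be made concrete with a small Rasch-style simulation in which one item is harder in one country than in the other. The screening statistic below (centered log-odds of a correct response, compared across countries) is a crude stand-in for a proper IRT-based DIF analysis, and all parameters are invented:

```python
import numpy as np

rng = np.random.default_rng(2)
n, n_items = 20_000, 10

# Rasch-style simulation: country B sees item 0 as 0.8 logits harder
# (cross-national DIF); the other item difficulties are shared.
diff_a = np.linspace(-1.0, 1.0, n_items)
diff_b = diff_a.copy()
diff_b[0] += 0.8

def simulate(difficulties):
    theta = rng.normal(0, 1, n)  # abilities
    p = 1 / (1 + np.exp(-(theta[:, None] - difficulties)))
    return (rng.random((n, n_items)) < p).astype(int)

resp_a, resp_b = simulate(diff_a), simulate(diff_b)

# Crude DIF screen: compare each item's centered log-odds of a correct
# response across the two countries.
def centered_logit(resp):
    p = resp.mean(axis=0)
    logit = np.log(p / (1 - p))
    return logit - logit.mean()

gap = centered_logit(resp_a) - centered_logit(resp_b)
print(int(np.argmax(np.abs(gap))))  # flags item 0 as the DIF item
```

If trends were computed with shared international item parameters despite such DIF, the shift in item 0 would leak into the country's apparent proficiency trend; estimating national item parameters per country, as in the simulation study, is one way to absorb it.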
Ackerman, Matthew; Egalite, Anna J. – Program on Education Policy and Governance, 2015
When lotteries are infeasible, researchers must rely on observational methods to estimate charter effectiveness at raising student test scores. Considerable attention has been paid to observational studies by the Stanford Center for Research on Education Outcomes (CREDO), which have analyzed charter performance in 27 states. However, the…
Descriptors: Charter Schools, Observation, Special Education, Lunch Programs
Peer reviewed
Papay, John P. – American Educational Research Journal, 2011
Recently, educational researchers and practitioners have turned to value-added models to evaluate teacher performance. Although value-added estimates depend on the assessment used to measure student achievement, the importance of outcome selection has received scant attention in the literature. Using data from a large, urban school district, I…
Descriptors: Urban Schools, Teacher Effectiveness, Reading Achievement, Achievement Tests
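The outcome-selection sensitivity this abstract highlights can be demonstrated with a bare-bones simulated value-added model: regress each of two outcome tests on prior achievement, take classroom-mean residuals as teacher effects, and note that the two sets of teacher effects disagree even though both tests measure the same achievement. Everything below is simulated, and the residual-mean estimator is a simplification of the value-added models used in practice:

```python
import numpy as np

rng = np.random.default_rng(3)
n_teachers, per_class = 100, 50
teacher = np.repeat(np.arange(n_teachers), per_class)

# Each teacher has a true effect; two outcome tests measure the same
# end-of-year achievement with independent measurement error.
true_effect = rng.normal(0, 0.2, n_teachers)
prior = rng.normal(0, 1, n_teachers * per_class)
achievement = 0.7 * prior + true_effect[teacher] + rng.normal(0, 0.5, prior.size)
test_1 = achievement + rng.normal(0, 0.4, prior.size)
test_2 = achievement + rng.normal(0, 0.4, prior.size)

def value_added(outcome):
    """Teacher effect = classroom-mean residual from an OLS regression of
    the outcome on prior achievement (a bare-bones value-added model)."""
    X = np.column_stack([np.ones(prior.size), prior])
    resid = outcome - X @ np.linalg.lstsq(X, outcome, rcond=None)[0]
    return np.array([resid[teacher == t].mean() for t in range(n_teachers)])

va_1, va_2 = value_added(test_1), value_added(test_2)
print(np.corrcoef(va_1, va_2)[0, 1])  # below 1.0: rankings shift with the test
```

Even with identical true teacher effects and generous class sizes, measurement error alone keeps the two sets of estimates from agreeing perfectly, which is the mechanism behind the paper's point that evaluations depend on the assessment chosen.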