ERIC - Search Results

Publication Date

In 2025	1
Since 2024	2
Since 2021 (last 5 years)	3
Since 2016 (last 10 years)	7
Since 2006 (last 20 years)	12

Descriptor

Evaluation Methods	12
Hierarchical Linear Modeling	12
Scores	12
Correlation	5
Comparative Analysis	4
Statistical Analysis	4
Academic Achievement	3
Effect Size	3
Elementary Secondary Education	3
Sample Size	3
Student Characteristics	3
Test Validity	3
Achievement Gains	2
Achievement Tests	2
Computation	2
Control Groups	2
Educational Assessment	2
Experimental Groups	2
Foreign Countries	2
Grade 8	2
Intervention	2
Longitudinal Studies	2
Mathematics Tests	2
Measurement	2
Models	2
More ▼

Source

ProQuest LLC	4
Grantee Submission	2
American Journal of Evaluation	1
Assessment for Effective…	1
Assessment in Education:…	1
ETS Research Report Series	1
Journal of Educational and…	1
Stanford Center for Education…	1

Publication Type

Reports - Research	8
Journal Articles	5
Dissertations/Theses -…	4
Speeches/Meeting Papers	1
Tests/Questionnaires	1

Education Level

Elementary Education	5
Junior High Schools	4
Middle Schools	4
Secondary Education	4
Elementary Secondary Education	3
Grade 8	3
Grade 10	2
Grade 11	2
Grade 5	2
Grade 6	2
Grade 7	2
Intermediate Grades	2
Grade 12	1
Grade 4	1
Grade 9	1
High Schools	1
Higher Education	1
Postsecondary Education	1
More ▼

Audience

Location

Kentucky (Louisville)	1
Maine	1
Qatar	1
Texas	1
Uganda	1

Laws, Policies, & Programs

Assessments and Surveys

National Assessment of…

What Works Clearinghouse Rating

Showing all 12 results Save | Export

Building Validity Evidence for the Use of Aggregate Scores in Accountability

Direct link

Karen Blackburn Hoeve – ProQuest LLC, 2021

High stakes test-based accountability systems primarily rely on aggregates and derivatives of scores from tests that were originally developed to measure individual student mastery of content specifications. Current validity models do not explicitly address this use of aggregate scores to measure the performance of teachers, administrators, and…

Descriptors: Accountability, Test Validity, High Stakes Tests, Hierarchical Linear Modeling

Re-Examining Measurement Invariance of School Climate Surveys across Race/Ethnicity

Peer reviewed

Direct link

Stephen M. Leach; Jason C. Immekus; Jeffrey C. Valentine; Prathiba Batley; Dena Dossett; Tamara Lewis; Thomas Reece – Assessment for Effective Intervention, 2025

Educators commonly use school climate survey scores to inform and evaluate interventions for equitably improving learning and reducing educational disparities. Unfortunately, validity evidence to support these (and other) score uses often falls short. In response, Whitehouse et al. proposed a collaborative, two-part validity testing framework for…

Descriptors: School Surveys, Measurement, Hierarchical Linear Modeling, Educational Environment

A Latent State Trait Model for Multilevel Mediation Analysis with Multiple Timepoints

Direct link

Lydia Bradford – ProQuest LLC, 2024

In randomized control trials (RCT), the recent focus has shifted to how an intervention yields positive results on its intended outcome. This aligns with the recent push of implementation science in healthcare (Bauer et al., 2015) but goes beyond this. RCTs have moved to evaluating the theoretical framing of the intervention as well as differing…

Descriptors: Hierarchical Linear Modeling, Mediation Theory, Randomized Controlled Trials, Research Design

Validation Methods for Aggregate-Level Test Scale Linking: A Case Study Mapping School District Test Score Distributions to a Common Scale. CEPA Working Paper No. 16-09

Download full text

Reardon, Sean F.; Ho, Andrew D.; Kalogrides, Demetra – Stanford Center for Education Policy Analysis, 2019

Linking score scales across different tests is considered speculative and fraught, even at the aggregate level (Feuer et al., 1999; Thissen, 2007). We introduce and illustrate validation methods for aggregate linkages, using the challenge of linking U.S. school district average test scores across states as a motivating example. We show that…

Descriptors: Test Validity, Evaluation Methods, School Districts, Scores

Comparability of Computer-Based and Paper-Based Science Assessments

Peer reviewed
PDF on ERIC

Download full text

Herrmann-Abell, Cari F.; Hardcastle, Joseph; DeBoer, George E. – Grantee Submission, 2018

We compared students' performance on a paper-based test (PBT) and three computer-based tests (CBTs). The three computer-based tests used different test navigation and answer selection features, allowing us to examine how these features affect student performance. The study sample consisted of 9,698 fourth through twelfth grade students from across…

Descriptors: Evaluation Methods, Tests, Computer Assisted Testing, Scores

How Big Is That? Reporting the Effect Size and Cost of ASSISTments in the Maine Homework Efficacy Study

Download full text

Roschelle, Jeremy; Murphy, Robert; Feng, Mingyu; Bakia, Marianne – Grantee Submission, 2017

In a rigorous evaluation of ASSISTments as an online homework support conducted in the state of Maine, SRI International reported that "the intervention significantly increased student scores on an end-of-the-year standardized mathematics assessment as compared with a control group that continued with existing homework practices."…

Descriptors: Homework, Program Effectiveness, Effect Size, Cost Effectiveness

Finding Efficiency in the Design of Large Multisite Evaluations: Estimating Variances for Science Achievement Studies

Peer reviewed

Direct link

Westine, Carl D. – American Journal of Evaluation, 2016

Little is known empirically about intraclass correlations (ICCs) for multisite cluster randomized trial (MSCRT) designs, particularly in science education. In this study, ICCs suitable for science achievement studies using a three-level (students in schools in districts) MSCRT design that block on district are estimated and examined. Estimates of…

Descriptors: Efficiency, Evaluation Methods, Science Achievement, Correlation

A Longitudinal Study on State Mathematics and Reading Assessments: Comparisons of Growth Models on Students' Achievement Scores

Direct link

Chiu, Pui Chi – ProQuest LLC, 2012

This study examines student growth on mathematics and reading assessments across academic years (Spring 2006 through Spring 2009) using three different growth models: hierarchical linear model (HLM), value-added model (VAM), and student growth percentile model (SGP). Comparisons across these three growth models were conducted to investigate the…

Descriptors: Longitudinal Studies, Mathematics Tests, Reading Tests, Educational Assessment

Investigating the Dynamics of Formative Assessment: Relationships between Teacher Knowledge, Assessment Practice and Learning

Peer reviewed

Direct link

Herman, Joan; Osmundson, Ellen; Dai, Yunyun; Ringstaff, Cathy; Timms, Michael – Assessment in Education: Principles, Policy & Practice, 2015

This exploratory study of elementary school science examines questions central to policy, practice and research on formative assessment: What is the quality of teachers' content-pedagogical and assessment knowledge? What is the relationship between teacher knowledge and assessment practice? What is the relationship between teacher knowledge,…

Descriptors: Formative Evaluation, Elementary School Science, Student Evaluation, Evaluation Methods

Peer reviewed

Direct link

Karl, Andrew T.; Yang, Yan; Lohr, Sharon L. – Journal of Educational and Behavioral Statistics, 2013

Value-added models have been widely used to assess the contributions of individual teachers and schools to students' academic growth based on longitudinal student achievement outcomes. There is concern, however, that ignoring the presence of missing values, which are common in longitudinal studies, can bias teachers' value-added scores.…

Descriptors: Evaluation Methods, Teacher Effectiveness, Academic Achievement, Achievement Gains

Pupil, Teacher, and School Factors That Influence Student Achievement on the Primary Leaving Examination in Uganda: Measure Development and Multilevel Modeling

Direct link

Ochwo, Pius – ProQuest LLC, 2013

This study examined the multilevel factors that influence mathematics and English performance on the Primary Leaving Examinations (PLEs) among primary seven pupils (i.e., equivalent to the United States [U.S.] 7th graders) in Uganda. Existing student state test data from the Wakiso District were obtained. In addition, a newly created Teacher…

Descriptors: Foreign Countries, Teacher Characteristics, Student Characteristics, Institutional Characteristics

Evaluating Academic Progress without a Vertical Scale. Research Report. ETS RR-12-07

Peer reviewed
PDF on ERIC

Download full text

Yen, Wendy M.; Lall, Venessa F.; Monfils, Lora – ETS Research Report Series, 2012

Alternatives to vertical scales are compared for measuring longitudinal academic growth and for producing school-level growth measures. The alternatives examined were empirical cross-grade regression, ordinary least squares and logistic regression, and multilevel models. The student data used for the comparisons were Arabic Grades 4 to 10 in…

Descriptors: Foreign Countries, Scaling, Item Response Theory, Test Interpretation

Bakia, Marianne	1
Chiu, Pui Chi	1
Dai, Yunyun	1
DeBoer, George E.	1
Dena Dossett	1
Feng, Mingyu	1
Hardcastle, Joseph	1
Herman, Joan	1
Herrmann-Abell, Cari F.	1
Ho, Andrew D.	1
Jason C. Immekus	1
Jeffrey C. Valentine	1
Kalogrides, Demetra	1
Karen Blackburn Hoeve	1
Karl, Andrew T.	1
Lall, Venessa F.	1
Lohr, Sharon L.	1
Lydia Bradford	1
Monfils, Lora	1
Murphy, Robert	1
Ochwo, Pius	1
Osmundson, Ellen	1
Prathiba Batley	1
Reardon, Sean F.	1
Ringstaff, Cathy	1
More ▼