Mingya Huang; David Kaplan – Journal of Educational and Behavioral Statistics, 2025
The issue of model uncertainty has been gaining interest in education and the social sciences community over the years, and the dominant methods for handling model uncertainty are based on Bayesian inference, particularly, Bayesian model averaging. However, Bayesian model averaging assumes that the true data-generating model is within the…
Descriptors: Bayesian Statistics, Hierarchical Linear Modeling, Statistical Inference, Predictor Variables
Carmen Köhler; Lale Khorramdel; Artur Pokropek; Johannes Hartig – Journal of Educational Measurement, 2024
For assessment scales applied to different groups (e.g., students from different states; patients in different countries), multigroup differential item functioning (MG-DIF) needs to be evaluated in order to ensure that respondents with the same trait level but from different groups have equal response probabilities on a particular item. The…
Descriptors: Measures (Individuals), Test Bias, Models, Item Response Theory
Reardon, Sean F.; Ho, Andrew D.; Kalogrides, Demetra – Stanford Center for Education Policy Analysis, 2019
Linking score scales across different tests is considered speculative and fraught, even at the aggregate level (Feuer et al., 1999; Thissen, 2007). We introduce and illustrate validation methods for aggregate linkages, using the challenge of linking U.S. school district average test scores across states as a motivating example. We show that…
Descriptors: Test Validity, Evaluation Methods, School Districts, Scores
van Hek, Margriet; Kraaykamp, Gerbert; Pelzer, Ben – School Effectiveness and School Improvement, 2018
Few studies on male-female inequalities in education have elaborated on whether school characteristics affect girls' and boys' educational performance differently. This study investigated how school resources, being schools' socioeconomic composition, proportion of girls, and proportion of highly educated teachers, and school practices, being…
Descriptors: Gender Differences, Reading Achievement, Institutional Characteristics, Educational Resources
Westine, Carl D. – American Journal of Evaluation, 2016
Little is known empirically about intraclass correlations (ICCs) for multisite cluster randomized trial (MSCRT) designs, particularly in science education. In this study, ICCs suitable for science achievement studies using a three-level (students in schools in districts) MSCRT design that block on district are estimated and examined. Estimates of…
Descriptors: Efficiency, Evaluation Methods, Science Achievement, Correlation
Pokropek, Artur – Sociological Methods & Research, 2015
This article combines statistical and applied research perspective showing problems that might arise when measurement error in multilevel compositional effects analysis is ignored. This article focuses on data where independent variables are constructed measures. Simulation studies are conducted evaluating methods that could overcome the…
Descriptors: Error of Measurement, Hierarchical Linear Modeling, Simulation, Evaluation Methods
Wiseman, Alexander W.; Al-bakr, Fawziah – Prospects: Quarterly Review of Comparative Education, 2013
In national education systems worldwide, teacher quality has become synonymous with education reform efforts, but a more elusive goal is empirically measuring teacher quality. One proposed measure of teacher quality, teacher licensing, also known as certification, is an increasingly ubiquitous component of national education systems and…
Descriptors: Comparative Analysis, Comparative Education, Academic Achievement, Teacher Certification