NotesFAQContact Us
Collection
Advanced
Search Tips
Audience
Laws, Policies, & Programs
Individuals with Disabilities…1
What Works Clearinghouse Rating
Showing all 14 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Youmi Suk – Journal of Educational and Behavioral Statistics, 2024
Machine learning (ML) methods for causal inference have gained popularity due to their flexibility to predict the outcome model and the propensity score. In this article, we provide a within-group approach for ML-based causal inference methods in order to robustly estimate average treatment effects in multilevel studies when there is cluster-level…
Descriptors: Artificial Intelligence, Causal Models, Statistical Inference, Maximum Likelihood Statistics
Peer reviewed Peer reviewed
Direct linkDirect link
Anderson, Daniel; Kahn, Joshua D.; Tindal, Gerald – Applied Measurement in Education, 2017
Unidimensionality and local independence are two common assumptions of item response theory. The former implies that all items measure a common latent trait, while the latter implies that responses are independent, conditional on respondents' location on the latent trait. Yet, few tests are truly unidimensional. Unmodeled dimensions may result in…
Descriptors: Robustness (Statistics), Item Response Theory, Mathematics Tests, Grade 6
Peer reviewed Peer reviewed
Direct linkDirect link
Youmi Suk; Peter M. Steiner; Jee-Seon Kim; Hyunseung Kang – Society for Research on Educational Effectiveness, 2021
Background/Context: Regression discontinuity (RD) designs are used for policy and program evaluation where subjects' eligibility into a program or policy is determined by whether an assignment variable (i.e., running variable) exceeds a pre-defined cutoff. Under a standard RD design with a continuous assignment variable, the average treatment…
Descriptors: Educational Policy, Eligibility, Cutting Scores, Testing Accommodations
Peer reviewed Peer reviewed
Direct linkDirect link
Lomos, Catalina – School Effectiveness and School Improvement, 2017
Within comparative school effectiveness research facilitated by large-scale data across countries, this article presents the results of the testing for measurement invariance of the latent concept of Professional Community (PC) across 23 European countries and more than 35,000 teachers in secondary schools. The newly proposed Multiple-Group Factor…
Descriptors: Foreign Countries, Teacher Characteristics, Comparative Education, School Effectiveness
Nese, Joseph F. T.; Stevens, Joseph J.; Schulte, Ann C.; Tindal, Gerald; Elliott, Stephen N. – Journal of Special Education, 2017
Our purpose was to examine different approaches to modeling the time-varying nature of exceptionality classification. Using longitudinal data from one state's mathematics achievement test for 28,829 students in Grades 3 to 8, we describe the reclassification rate within special education and between general and special education, and compare four…
Descriptors: Classification, Achievement Gains, Special Needs Students, Mathematics Achievement
Peer reviewed Peer reviewed
Direct linkDirect link
Long, Mark C. – Journal of Research on Educational Effectiveness, 2016
Using a "naïve" specification, this paper estimates the relationship between 36 high school characteristics and 24 student outcomes controlling for students' pre-high school characteristics. The goal of this exploration is not to generate casual estimates, but rather to: (a) compare the size of the relationships to determine which inputs…
Descriptors: Hypothesis Testing, Effect Size, High School Students, Student Characteristics
Peer reviewed Peer reviewed
Direct linkDirect link
Tong, Xin; Zhang, Zhiyong – Multivariate Behavioral Research, 2012
Growth curve models with different types of distributions of random effects and of intraindividual measurement errors for robust analysis are compared. After demonstrating the influence of distribution specification on parameter estimation, 3 methods for diagnosing the distributions for both random effects and intraindividual measurement errors…
Descriptors: Models, Robustness (Statistics), Statistical Analysis, Error of Measurement
Yin, Liqun – ProQuest LLC, 2013
In recent years, many states have adopted Item Response Theory (IRT) based vertically scaled tests due to their compelling features in a growth-based accountability context. However, selection of a practical and effective calibration/scaling method and proper understanding of issues with possible multidimensionality in the test data is critical to…
Descriptors: Item Response Theory, Scaling, Robustness (Statistics), Monte Carlo Methods
Peer reviewed Peer reviewed
Direct linkDirect link
Raufelder, Diana; Hoferichter, Frances – International Journal of School & Educational Psychology, 2015
The current study presents a newly developed measurement: the TEMO (Teacher and Motivation) scale, which assesses adolescent students' perception of liked and disliked teachers and the resulting impact on their academic motivation. A total of 1,088 students from secondary schools in Germany participated in this study. To explore the underlying…
Descriptors: Foreign Countries, Measures (Individuals), Likert Scales, Student Evaluation of Teacher Performance
Peer reviewed Peer reviewed
Direct linkDirect link
Zhang, Zhiyong; Lai, Keke; Lu, Zhenqiu; Tong, Xin – Structural Equation Modeling: A Multidisciplinary Journal, 2013
Despite the widespread popularity of growth curve analysis, few studies have investigated robust growth curve models. In this article, the "t" distribution is applied to model heavy-tailed data and contaminated normal data with outliers for growth curve analysis. The derived robust growth curve models are estimated through Bayesian…
Descriptors: Structural Equation Models, Bayesian Statistics, Statistical Inference, Statistical Distributions
Peer reviewed Peer reviewed
Direct linkDirect link
Wang, Jianjun – School Science and Mathematics, 2011
As the largest international study ever taken in history, the Trend in Mathematics and Science Study (TIMSS) has been held as a benchmark to measure U.S. student performance in the global context. In-depth analyses of the TIMSS project are conducted in this study to examine key issues of the comparative investigation: (1) item flaws in mathematics…
Descriptors: Test Items, Figurative Language, Item Response Theory, Benchmarking
Peer reviewed Peer reviewed
Direct linkDirect link
Curcic, Svjetlana; Johnstone, Robin S. – Computers in the Schools, 2016
This study examined the effects of an intervention in writing with digital interactive books. To improve the writing skills of seventh- and eighth-grade students with a learning disability in reading, we conducted a quasi-experimental study in which the students read interactive digital books (i-books), took notes, wrote summaries, and acted as…
Descriptors: Intervention, Writing Skills, Learning Disabilities, Cartoons
Lavy, Victor – Centre for the Economics of Education (NJ1), 2010
There are large differences across countries in instructional time in public schooling institutions. For example, among European countries such as Belgium, France and Greece, pupils aged 15 have an average of over a thousand hours per year of total compulsory classroom instruction while in England, Luxembourg and Sweden the average is only 750…
Descriptors: Time on Task, Time Factors (Learning), Academic Achievement, Achievement Gap
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Aoyama, Kazuhiro; Stephens, Max – Mathematics Education Research Journal, 2003
Many educators and researchers are trying to define statistical literacy for the 21st century. Kimura, a Japanese science educator, has suggested that a key task of statistical literacy is the ability to extract qualitative information from quantitative information, and/or to create new information from qualitative and quantitative information.…
Descriptors: Foreign Countries, Questionnaires, Program Validation, Item Analysis