Publication Date
| In 2026 | 0 |
| Since 2025 | 59 |
| Since 2022 (last 5 years) | 416 |
| Since 2017 (last 10 years) | 919 |
| Since 2007 (last 20 years) | 1970 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Researchers | 93 |
| Practitioners | 23 |
| Teachers | 22 |
| Policymakers | 10 |
| Administrators | 5 |
| Students | 4 |
| Counselors | 2 |
| Parents | 2 |
| Community | 1 |
Location
| United States | 47 |
| Germany | 42 |
| Australia | 34 |
| Canada | 27 |
| Turkey | 27 |
| California | 22 |
| United Kingdom (England) | 20 |
| Netherlands | 18 |
| China | 17 |
| New York | 15 |
| United Kingdom | 15 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Does not meet standards | 1 |
Wang, Ze – Educational Psychology, 2015
Using data from the Trends in International Mathematics and Science Study (TIMSS) 2007, this study examined the big-fish-little-pond-effects (BFLPEs) in 49 countries. In this study, the effect of math ability on math self-concept was decomposed into a within- and a between-level components using implicit mean centring and the complex data…
Descriptors: Nonverbal Ability, Mathematics, Self Concept, Hierarchical Linear Modeling
Tang, Yang; Cook, Thomas D.; Kisbu-Sakarya, Yasemin – Society for Research on Educational Effectiveness, 2015
Regression discontinuity design (RD) has been widely used to produce reliable causal estimates. Researchers have validated the accuracy of RD design using within study comparisons (Cook, Shadish & Wong, 2008; Cook & Steiner, 2010; Shadish et al, 2011). Within study comparisons examines the validity of a quasi-experiment by comparing its…
Descriptors: Pretests Posttests, Statistical Bias, Accuracy, Regression (Statistics)
Zhang, Jinming; Li, Jie – Journal of Educational Measurement, 2016
An IRT-based sequential procedure is developed to monitor items for enhancing test security. The procedure uses a series of statistical hypothesis tests to examine whether the statistical characteristics of each item under inspection have changed significantly during CAT administration. This procedure is compared with a previously developed…
Descriptors: Computer Assisted Testing, Test Items, Difficulty Level, Item Response Theory
Levy, Roy – Educational Psychologist, 2016
In this article, I provide a conceptually oriented overview of Bayesian approaches to statistical inference and contrast them with frequentist approaches that currently dominate conventional practice in educational research. The features and advantages of Bayesian approaches are illustrated with examples spanning several statistical modeling…
Descriptors: Bayesian Statistics, Models, Educational Research, Innovation
Hampf, Franziska; Wiederhold, Simon; Woessmann, Ludger – Large-scale Assessments in Education, 2017
Ample evidence indicates that a person's human capital is important for success on the labor market in terms of both wages and employment prospects. However, unlike the efforts to identify the impact of school attainment on labor-market outcomes, the literature on returns to cognitive skills has not yet provided convincing evidence that the…
Descriptors: Outcomes of Education, Human Capital, Labor Market, Income
Wittrock, Jill; Kimmel, Linda; Hunscher, Brian; Le, Kien Trung – International Journal of Social Research Methodology, 2017
Proxy reporting is a common practice during survey data collection to increase response rates while reducing fieldwork costs, and agreement between proxies and self-reports is critical to make reliable and valid inferences. This study is the first to unpack what influences proxy accuracy in a non-Western setting using data from the 2012 Qatar…
Descriptors: Research Methodology, Surveys, Data Collection, Instructional Program Divisions
Pinder, Jonathan P. – Decision Sciences Journal of Innovative Education, 2014
Business analytics courses, such as marketing research, data mining, forecasting, and advanced financial modeling, have substantial predictive modeling components. The predictive modeling in these courses requires students to estimate and test many linear regressions. As a result, false positive variable selection ("type I errors") is…
Descriptors: Data Collection, Data Analysis, Regression (Statistics), Predictive Measurement
Rhemtulla, Mijke; Jia, Fan; Wu, Wei; Little, Todd D. – International Journal of Behavioral Development, 2014
We examine the performance of planned missing (PM) designs for correlated latent growth curve models. Using simulated data from a model where latent growth curves are fitted to two constructs over five time points, we apply three kinds of planned missingness. The first is item-level planned missingness using a three-form design at each wave such…
Descriptors: Data Analysis, Error of Measurement, Models, Longitudinal Studies
Henson, Robin K.; Natesan, Prathiba; Axelson, Erika D. – Journal of Experimental Education, 2014
The authors examined the distributional properties of 3 improvement-over-chance, I, effect sizes each derived from linear and quadratic predictive discriminant analysis and from logistic regression analysis for the 2-group univariate classification. These 3 classification methods (3 levels) were studied under varying levels of data conditions,…
Descriptors: Effect Size, Probability, Comparative Analysis, Classification
Phillips, Gary W. – American Institutes for Research, 2014
This paper describes a statistical linking between the 2011 National Assessment of Educational Progress (NAEP) in Grade 4 reading and the 2011 Progress in International Reading Literacy Study (PIRLS) in Grade 4 reading. The primary purpose of the linking study is to obtain a statistical comparison between NAEP (a national assessment) and PIRLS (an…
Descriptors: National Competency Tests, Reading Achievement, Comparative Analysis, Measures (Individuals)
Hansen, Michael; Lemke, Mariann; Sorensen, Nicholas – National Center for Analysis of Longitudinal Data in Education Research (CALDER), 2014
Teacher and principal evaluation systems now emerging in response to federal, state and/or local policy initiatives typically require that a component of teacher evaluation be based on multiple performance metrics, which must be combined to produce summative ratings of teacher effectiveness. Districts have utilized three common approaches to…
Descriptors: Teacher Evaluation, Measures (Individuals), Error of Measurement, Teacher Effectiveness
Tipton, Elizabeth – Society for Research on Educational Effectiveness, 2014
Replication studies allow for making comparisons and generalizations regarding the effectiveness of an intervention across different populations, versions of a treatment, settings and contexts, and outcomes. One method for making these comparisons across many replication studies is through the use of meta-analysis. A recent innovation in…
Descriptors: Replication (Evaluation), Robustness (Statistics), Meta Analysis, Regression (Statistics)
Deke, John; Chiang, Hanley – Society for Research on Educational Effectiveness, 2014
Meeting the What Works Clearinghouse (WWC) attrition standard (or one of the attrition standards based on the WWC standard) is now an important consideration for researchers conducting studies that could potentially be reviewed by the WWC (or other evidence reviews). Understanding the basis of this standard is valuable for anyone seeking to meet…
Descriptors: Attrition (Research Studies), Student Attrition, Randomized Controlled Trials, Standards
New York State Education Department, 2018
This technical report provides detailed information regarding the technical, statistical, and measurement attributes of the New York State Testing Program (NYSTP) for the Grades 3-8 English Language Arts (ELA) and Mathematics 2018 Operational Tests. This report includes information about test content and test development, item (i.e., individual…
Descriptors: English, Language Arts, Language Tests, Mathematics Tests
Thissen, David – Journal of Educational and Behavioral Statistics, 2016
David Thissen, a professor in the Department of Psychology and Neuroscience, Quantitative Program at the University of North Carolina, has consulted and served on technical advisory committees for assessment programs that use item response theory (IRT) over the past couple decades. He has come to the conclusion that there are usually two purposes…
Descriptors: Item Response Theory, Test Construction, Testing Problems, Student Evaluation

Peer reviewed
Direct link
