NotesFAQContact Us
Collection
Advanced
Search Tips
Audience
Laws, Policies, & Programs
Assessments and Surveys
Massachusetts Comprehensive…1
What Works Clearinghouse Rating
Showing all 9 results Save | Export
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Kubsch, Marcus; Stamer, Insa; Steiner, Mara; Neumann, Knut; Parchmann, Ilka – Practical Assessment, Research & Evaluation, 2021
In light of the replication crisis in psychology, null-hypothesis significance testing (NHST) and "p"-values have been heavily criticized and various alternatives have been proposed, ranging from slight modifications of the current paradigm to banning "p"-values from journals. Since the physics education research community…
Descriptors: Data Analysis, Bayesian Statistics, Educational Research, Science Education
Peer reviewed Peer reviewed
Direct linkDirect link
Wiens, Stefan; Nilsson, Mats E. – Educational and Psychological Measurement, 2017
Because of the continuing debates about statistics, many researchers may feel confused about how to analyze and interpret data. Current guidelines in psychology advocate the use of effect sizes and confidence intervals (CIs). However, researchers may be unsure about how to extract effect sizes from factorial designs. Contrast analysis is helpful…
Descriptors: Data Analysis, Effect Size, Computation, Statistical Analysis
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Gardner, Josh; Brooks, Christopher – Journal of Learning Analytics, 2018
Model evaluation -- the process of making inferences about the performance of predictive models -- is a critical component of predictive modelling research in learning analytics. We survey the state of the practice with respect to model evaluation in learning analytics, which overwhelmingly uses only naïve methods for model evaluation or…
Descriptors: Prediction, Models, Evaluation, Evaluation Methods
Peer reviewed Peer reviewed
Direct linkDirect link
Pan, Tianshu; Yin, Yue – Applied Measurement in Education, 2017
In this article, we propose using the Bayes factors (BF) to evaluate person fit in item response theory models under the framework of Bayesian evaluation of an informative diagnostic hypothesis. We first discuss the theoretical foundation for this application and how to analyze person fit using BF. To demonstrate the feasibility of this approach,…
Descriptors: Bayesian Statistics, Goodness of Fit, Item Response Theory, Monte Carlo Methods
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Waters, Andrew; Studer, Christoph; Baraniuk, Richard – Journal of Educational Data Mining, 2014
Identifying collaboration between learners in a course is an important challenge in education for two reasons: First, depending on the courses rules, collaboration can be considered a form of cheating. Second, it helps one to more accurately evaluate each learners competence. While such collaboration identification is already challenging in…
Descriptors: Cooperation, Large Group Instruction, Online Courses, Probability
Peer reviewed Peer reviewed
Direct linkDirect link
Klugkist, Irene; van Wesel, Floryt; Bullens, Jessie – International Journal of Behavioral Development, 2011
Null hypothesis testing (NHT) is the most commonly used tool in empirical psychological research even though it has several known limitations. It is argued that since the hypotheses evaluated with NHT do not reflect the research-question or theory of the researchers, conclusions from NHT must be formulated with great modesty, that is, they cannot…
Descriptors: Psychological Studies, Hypothesis Testing, Researchers, Evaluation Methods
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Schochet, Peter Z. – National Center for Education Evaluation and Regional Assistance, 2008
This report presents guidelines for addressing the multiple comparisons problem in impact evaluations in the education area. The problem occurs due to the large number of hypothesis tests that are typically conducted across outcomes and subgroups in these studies, which can lead to spurious statistically significant impact findings. The…
Descriptors: Guidelines, Testing, Hypothesis Testing, Statistical Significance
Peer reviewed Peer reviewed
Rubin, Donald B. – Journal of Educational Statistics, 1981
The use of Bayesian and empirical Bayesian techniques to summarize results from parallel randomized experiments is illustrated using the results of eight such experiments from an SAT coaching study. Graphical techniques, simulation techniques, and methods for monitoring the adequacy of model specification are highlighted. (Author/JKS)
Descriptors: Bayesian Statistics, Data Analysis, Educational Experiments, Goodness of Fit
Barnes, Tiffany, Ed.; Desmarais, Michel, Ed.; Romero, Cristobal, Ed.; Ventura, Sebastian, Ed. – International Working Group on Educational Data Mining, 2009
The Second International Conference on Educational Data Mining (EDM2009) was held at the University of Cordoba, Spain, on July 1-3, 2009. EDM brings together researchers from computer science, education, psychology, psychometrics, and statistics to analyze large data sets to answer educational research questions. The increase in instrumented…
Descriptors: Data Analysis, Educational Research, Conferences (Gatherings), Foreign Countries