Publication Date
In 2025 | 1 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 2 |
Since 2016 (last 10 years) | 11 |
Since 2006 (last 20 years) | 34 |
Descriptor
Comparative Analysis | 40 |
Evaluation Methods | 40 |
Probability | 40 |
Models | 11 |
Statistical Analysis | 11 |
Validity | 7 |
Computation | 6 |
Data Analysis | 6 |
Foreign Countries | 6 |
Scores | 6 |
Educational Research | 5 |
More ▼ |
Source
Author
Akbari, Alireza | 1 |
Alexander D. Latham | 1 |
Amemiya, Yasuo | 1 |
Anderson, Kaitlin | 1 |
Barr, James | 1 |
Benjamin, Aaron S. | 1 |
Berry, Kenneth J. | 1 |
Bockman, John F. | 1 |
Bos, Wilfried | 1 |
Bosch, Nigel | 1 |
Brusilovsky, Peter | 1 |
More ▼ |
Publication Type
Journal Articles | 27 |
Reports - Research | 21 |
Reports - Evaluative | 7 |
Dissertations/Theses -… | 4 |
Reports - Descriptive | 4 |
Speeches/Meeting Papers | 3 |
Book/Product Reviews | 1 |
Collected Works - Proceedings | 1 |
Opinion Papers | 1 |
Education Level
Higher Education | 7 |
Elementary Secondary Education | 5 |
Postsecondary Education | 4 |
Elementary Education | 2 |
Early Childhood Education | 1 |
Grade 11 | 1 |
Grade 3 | 1 |
Grade 4 | 1 |
Grade 5 | 1 |
High Schools | 1 |
Intermediate Grades | 1 |
More ▼ |
Audience
Location
Canada | 2 |
Turkey | 2 |
United Kingdom | 2 |
Arizona | 1 |
Asia | 1 |
Australia | 1 |
Brazil | 1 |
Connecticut | 1 |
Denmark | 1 |
Egypt | 1 |
Estonia | 1 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
Program for International… | 1 |
Trends in International… | 1 |
What Works Clearinghouse Rating
Kolarec, Biserka; Nincevic, Marina – International Society for Technology, Education, and Science, 2022
The object of research is a statistics exam that contains problem tasks. One examiner performed two exam evaluation methods to repeatedly evaluate the exam. The goal was to compare the methods for objectivity. One of the two exam evaluation methods we call a serial evaluation method. The serial evaluation method assumes evaluation of all exam…
Descriptors: Statistics Education, Mathematics Tests, Evaluation Methods, Test Construction
Carly Oddleifson; Stephen Kilgus; David A. Klingbeil; Alexander D. Latham; Jessica S. Kim; Ishan N. Vengurlekar – Grantee Submission, 2025
The purpose of this study was to conduct a conceptual replication of Pendergast et al.'s (2018) study that examined the diagnostic accuracy of a nomogram procedure, also known as a naive Bayesian approach. The specific naive Bayesian approach combined academic and social-emotional and behavioral (SEB) screening data to predict student performance…
Descriptors: Bayesian Statistics, Accuracy, Social Emotional Learning, Diagnostic Tests
Bosch, Nigel; Paquette, Luc – Journal of Learning Analytics, 2018
Metrics including Cohen's kappa, precision, recall, and F[subscript 1] are common measures of performance for models of discrete student states, such as a student's affect or behaviour. This study examined discrete model metrics for previously published student model examples to identify situations where metrics provided differing perspectives on…
Descriptors: Models, Comparative Analysis, Prediction, Probability
Akbari, Alireza; Shahnazari, Mohammadtaghi – Language Testing in Asia, 2019
The present research paper introduces a translation evaluation method called Calibrated Parsing Items Evaluation (CPIE hereafter). This evaluation method maximizes translators' performance through identifying the parsing items with an optimal p-docimology and d-index (item discrimination). This method checks all the possible parses (annotations)…
Descriptors: Test Items, Translation, Computer Software, Evaluators
Ueno, Maomi; Miyazawa, Yoshimitsu – IEEE Transactions on Learning Technologies, 2018
Over the past few decades, many studies conducted in the field of learning science have described that scaffolding plays an important role in human learning. To scaffold a learner efficiently, a teacher should predict how much support a learner must have to complete tasks and then decide the optimal degree of assistance to support the learner's…
Descriptors: Scaffolding (Teaching Technique), Prediction, Probability, Comparative Analysis
Kim, Yongnam; Steiner, Peter – Educational Psychologist, 2016
When randomized experiments are infeasible, quasi-experimental designs can be exploited to evaluate causal treatment effects. The strongest quasi-experimental designs for causal inference are regression discontinuity designs, instrumental variable designs, matching and propensity score designs, and comparative interrupted time series designs. This…
Descriptors: Quasiexperimental Design, Causal Models, Statistical Inference, Randomized Controlled Trials
Steiner, Peter M.; Wong, Vivian – Society for Research on Educational Effectiveness, 2016
Despite recent emphasis on the use of randomized control trials (RCTs) for evaluating education interventions, in most areas of education research, observational methods remain the dominant approach for assessing program effects. Over the last three decades, the within-study comparison (WSC) design has emerged as a method for evaluating the…
Descriptors: Randomized Controlled Trials, Comparative Analysis, Research Design, Evaluation Methods
Jacovidis, Jessica N.; Foelber, Kelly J.; Horst, S. Jeanne – Journal of Experimental Education, 2017
Often program administrators are interested in knowing how students benefit from participation in programs compared to students who do not participate. Such comparisons may be sullied by the fact that participants self-select into programs, resulting in differences between groups prior to programming. By controlling for…
Descriptors: Probability, Scores, Statistical Analysis, Student Evaluation
Solomon, Benjamin G.; Forsberg, Ole J. – School Psychology Quarterly, 2017
Bayesian techniques have become increasingly present in the social sciences, fueled by advances in computer speed and the development of user-friendly software. In this paper, we forward the use of Bayesian Asymmetric Regression (BAR) to monitor intervention responsiveness when using Curriculum-Based Measurement (CBM) to assess oral reading…
Descriptors: Bayesian Statistics, Regression (Statistics), Least Squares Statistics, Evaluation Methods
Huang, Yun; González-Brenes, José P.; Kumar, Rohit; Brusilovsky, Peter – International Educational Data Mining Society, 2015
Latent variable models, such as the popular Knowledge Tracing method, are often used to enable adaptive tutoring systems to personalize education. However, finding optimal model parameters is usually a difficult non-convex optimization problem when considering latent variable models. Prior work has reported that latent variable models obtained…
Descriptors: Guidelines, Models, Prediction, Evaluation Methods
Liu, Yan; Zumbo, Bruno D.; Gustafson, Paul; Huang, Yi; Kroc, Edward; Wu, Amery D. – Practical Assessment, Research & Evaluation, 2016
A variety of differential item functioning (DIF) methods have been proposed and used for ensuring that a test is fair to all test takers in a target population in the situations of, for example, a test being translated to other languages. However, once a method flags an item as DIF, it is difficult to conclude that the grouping variable (e.g.,…
Descriptors: Test Items, Test Bias, Probability, Scores
Zamarro, Gema; Anderson, Kaitlin; Steele, Jennifer; Miller, Trey – Society for Research on Educational Effectiveness, 2016
The purpose of this study is to study the performance of different methods (inverse probability weighting and estimation of informative bounds) to control for differential attrition by comparing the results of different methods using two datasets: an original dataset from Portland Public Schools (PPS) subject to high rates of differential…
Descriptors: Data Analysis, Student Attrition, Evaluation Methods, Evaluation Research
Callister Everson, Kimberlee; Feinauer, Erika; Sudweeks, Richard R. – Harvard Educational Review, 2013
In this article, the authors provide a methodological critique of the current standard of value-added modeling forwarded in educational policy contexts as a means of measuring teacher effectiveness. Conventional value-added estimates of teacher quality are attempts to determine to what degree a teacher would theoretically contribute, on average,…
Descriptors: Teacher Evaluation, Teacher Effectiveness, Evaluation Methods, Accountability
Taylor, Lauren Christine – ProQuest LLC, 2013
Considering the amount of funding that is distributed to educational research each year, leaders and policymakers have a vested interest in finding scientifically based evidence that answers causal questions regarding program effectiveness. The importance of program evaluation has long been recognized in many fields of research; however, the most…
Descriptors: Program Evaluation, Evaluation Methods, Comparative Analysis, Multivariate Analysis
Hansen, Ben B.; Fredrickson, Mark M. – Society for Research on Educational Effectiveness, 2014
The goal of this research is to make sensitivity analysis accessible not only to empirical researchers but also to the various stakeholders for whom educational evaluations are conducted. To do this it derives anchors for the omitted variable (OV)-program participation association intrinsically, using the Love plot to present a wide range of…
Descriptors: Research Methodology, Quasiexperimental Design, Evaluation Methods, Comparative Analysis