Chan, Wendy – Asia Pacific Education Review, 2024
As evidence from evaluation and experimental studies continues to influence decision and policymaking, applied researchers and practitioners require tools to derive valid and credible inferences. Over the past several decades, research in causal inference has progressed with the development and application of propensity scores. Since their…
Descriptors: Probability, Scores, Causal Models, Statistical Inference
Porter, Kristin E. – Society for Research on Educational Effectiveness, 2016
In recent years, there has been increasing focus on the issue of multiple hypotheses testing in education evaluation studies. In these studies, researchers are typically interested in testing the effectiveness of an intervention on multiple outcomes, for multiple subgroups, at multiple points in time or across multiple treatment groups. When…
Descriptors: Hypothesis Testing, Intervention, Error Patterns, Evaluation Methods
Shieh, Gwowen – Journal of Experimental Education, 2015
Analysis of variance is one of the most frequently used statistical analyses in the behavioral, educational, and social sciences, and special attention has been paid to the selection and use of an appropriate effect size measure of association in analysis of variance. This article presents the sample size procedures for precise interval estimation…
Descriptors: Statistical Analysis, Sample Size, Computation, Effect Size
Ferrando, Pere J. – Psicologica: International Journal of Methodology and Experimental Psychology, 2015
Test-retest studies for assessing stability and change are widely used in different domains and allow improved or additional individual estimates of interest to be obtained. However, if these estimates are to be validly interpreted the responses given at Time-2 must be free of retest effects, and the fulfilment of this assumption must be…
Descriptors: Item Response Theory, Evaluation Methods, Responses, Testing
Ostrow, Korinn; Donnelly, Chistopher; Heffernan, Neil – International Educational Data Mining Society, 2015
As adaptive tutoring systems grow increasingly popular for the completion of classwork and homework, it is crucial to assess the manner in which students are scored within these platforms. The majority of systems, including ASSISTments, return the binary correctness of a student's first attempt at solving each problem. Yet for many teachers,…
Descriptors: Intelligent Tutoring Systems, Scoring, Testing, Credits
Orcan, Fatih – ProQuest LLC, 2013
Parceling refers to a procedure for computing sums or average scores across multiple items. Parcels instead of individual items are then used as indicators of latent factors in the structural equation modeling analysis (Bandalos 2002, 2008; Little et al., 2002; Yang, Nay, & Hoyle, 2010). Item parceling may be applied to alleviate some…
Descriptors: Structural Equation Models, Evaluation Methods, Simulation, Sample Size
Zamarro, Gema; Anderson, Kaitlin; Steele, Jennifer; Miller, Trey – Society for Research on Educational Effectiveness, 2016
This study examines the performance of different methods (inverse probability weighting and estimation of informative bounds) for controlling differential attrition by comparing their results on two datasets: an original dataset from Portland Public Schools (PPS) subject to high rates of differential…
Descriptors: Data Analysis, Student Attrition, Evaluation Methods, Evaluation Research
Köhler, Carmen; Pohl, Steffi; Carstensen, Claus H. – Educational and Psychological Measurement, 2015
When competence tests are administered, subjects frequently omit items. These missing responses pose a threat to correctly estimating the proficiency level. Newer model-based approaches aim to take nonignorable missing data processes into account by incorporating a latent missing propensity into the measurement model. Two assumptions are typically…
Descriptors: Competence, Tests, Evaluation Methods, Adults
Wall, Melanie M.; Guo, Jia; Amemiya, Yasuo – Multivariate Behavioral Research, 2012
Mixture factor analysis is examined as a means of flexibly estimating nonnormally distributed continuous latent factors in the presence of both continuous and dichotomous observed variables. A simulation study compares mixture factor analysis with normal maximum likelihood (ML) latent factor modeling. Different results emerge for continuous versus…
Descriptors: Sample Size, Simulation, Form Classes (Languages), Diseases
Cai, Li; Monroe, Scott – National Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2014
We propose a new limited-information goodness of fit test statistic $C_2$ for ordinal IRT models. The construction of the new statistic lies formally between the $M_2$ statistic of Maydeu-Olivares and Joe (2006), which utilizes first and second order marginal probabilities, and the $M_2^*$ statistic of Cai and Hansen…
Descriptors: Item Response Theory, Models, Goodness of Fit, Probability
Keselman, H. J.; Miller, Charles W.; Holland, Burt – Psychological Methods, 2011
There have been many discussions of how Type I errors should be controlled when many hypotheses are tested (e.g., all possible comparisons of means, correlations, proportions, the coefficients in hierarchical models, etc.). By and large, researchers have adopted familywise (FWER) control, though this practice certainly is not universal. Familywise…
Descriptors: Validity, Statistical Significance, Probability, Computation
Hourihan, Kathleen L.; Benjamin, Aaron S. – Journal of Experimental Psychology: Learning, Memory, and Cognition, 2010
Recently, Vul and Pashler (2008) demonstrated that the average of 2 responses from a single subject to general knowledge questions was more accurate than either single estimate. Importantly, this reveals that each guess contributes unique evidence relevant to the decision, contrary to views that eschew probabilistic representations of the…
Descriptors: Memory, Task Analysis, Cognitive Processes, Undergraduate Students
Kreiner, Svend – Applied Psychological Measurement, 2011
To rule out the need for a two-parameter item response theory (IRT) model during item analysis by Rasch models, it is important to check the Rasch model's assumption that all items have the same item discrimination. Biserial and polyserial correlation coefficients measuring the association between items and restscores are often used in an informal…
Descriptors: Item Analysis, Correlation, Item Response Theory, Models
VanDerHeyden, Amanda M. – Exceptional Children, 2011
Perhaps the greatest value of response to intervention (RTI) as a decision framework is that it brings attention to variables (e.g., mastery of prerequisite skills, frequency of instructional corrective feedback, reinforcement schedules for correct responding) that if changed might make a meaningful difference for students (e.g., child rate of…
Descriptors: Feedback (Response), Intervention, Classification, Response to Intervention
Lecoutre, Bruno; Lecoutre, Marie-Paule; Poitevineau, Jacques – Psychological Methods, 2010
P. R. Killeen's (2005a) probability of replication ($p_{\text{rep}}$) of an experimental result is the fiducial Bayesian predictive probability of finding a same-sign effect in a replication of an experiment. $p_{\text{rep}}$ is now routinely reported in "Psychological Science" and has also begun to appear in…
Descriptors: Research Methodology, Guidelines, Probability, Computation