Publication Date
| In 2026 | 0 |
| Since 2025 | 2 |
| Since 2022 (last 5 years) | 33 |
| Since 2017 (last 10 years) | 901 |
| Since 2007 (last 20 years) | 2732 |
Descriptor
| Statistical Analysis | 3988 |
| Hypothesis Testing | 2382 |
| Foreign Countries | 1378 |
| Correlation | 766 |
| Questionnaires | 756 |
| Comparative Analysis | 730 |
| Scores | 548 |
| Testing | 514 |
| College Students | 447 |
| Computer Assisted Testing | 439 |
| Student Attitudes | 425 |
| More ▼ | |
Source
Author
| Tindal, Gerald | 12 |
| Alonzo, Julie | 10 |
| Lord, Frederic M. | 10 |
| Sinharay, Sandip | 10 |
| Lai, Cheng-Fei | 9 |
| Teo, Timothy | 8 |
| Wilcox, Rand R. | 8 |
| Algina, James | 7 |
| Games, Paul A. | 7 |
| Kim, Sooyeon | 7 |
| Marascuilo, Leonard A. | 7 |
| More ▼ | |
Publication Type
Education Level
Audience
| Researchers | 65 |
| Practitioners | 21 |
| Teachers | 20 |
| Students | 6 |
| Administrators | 5 |
| Policymakers | 4 |
| Counselors | 1 |
| Media Staff | 1 |
| Parents | 1 |
Location
| Nigeria | 160 |
| Germany | 80 |
| Australia | 65 |
| Turkey | 64 |
| India | 62 |
| Canada | 59 |
| Iran | 51 |
| Netherlands | 51 |
| China | 47 |
| Taiwan | 47 |
| Texas | 45 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 1 |
Keller, Bryan – Journal of Educational and Behavioral Statistics, 2020
Widespread availability of rich educational databases facilitates the use of conditioning strategies to estimate causal effects with nonexperimental data. With dozens, hundreds, or more potential predictors, variable selection can be useful for practical reasons related to communicating results and for statistical reasons related to improving the…
Descriptors: Nonparametric Statistics, Computation, Testing, Causal Models
Luke G. Eglington; Philip I. Pavlik – Grantee Submission, 2020
Decades of research has shown that spacing practice trials over time can improve later memory, but there are few concrete recommendations concerning how to optimally space practice. We show that existing recommendations are inherently suboptimal due to their insensitivity to time costs and individual- and item-level differences. We introduce an…
Descriptors: Scheduling, Drills (Practice), Memory, Testing
Luke G. Eglington; Philip I. Pavlik Jr. – npj Science of Learning, 2020
Decades of research has shown that spacing practice trials over time can improve later memory, but there are few concrete recommendations concerning how to optimally space practice. We show that existing recommendations are inherently suboptimal due to their insensitivity to time costs and individual- and item-level differences. We introduce an…
Descriptors: Scheduling, Drills (Practice), Memory, Testing
Akansha Singh; Germaine Uwimpuhwe; Dimitrios Vallis; Nasima Akhter; Tahani Coolen-Maturi; Steve Higgins; Jochen Einbeck; Martin Culliney; Sean Demack – Education Endowment Foundation, 2023
The aim of this study was to investigate and empirically derive parameters commonly used for statistical power and sample size calculations to better inform future trial design. Towards achieving this aim, the research project leveraged the richness of the National Pupil Database (NPD) and the Education Endowment Foundation (EEF) Archive to: (1)…
Descriptors: Foreign Countries, Statistical Analysis, Sample Size, Educational Research
Daffin, Lee William, Jr.; Jones, Ashley A. – Online Learning, 2018
As online education becomes a more popular and permanent option for obtaining an education after high school, it also raises questions as to the academic rigor of such classes and the academic integrity of the students taking the classes. The purpose of the current study is to explore the integrity issue and to investigate student performance on…
Descriptors: College Students, Online Courses, Psychology, Computer Assisted Testing
Lin, Chih-Kai; Zhang, Jinming – Journal of Educational Measurement, 2018
Under the generalizability-theory (G-theory) framework, the estimation precision of variance components (VCs) is of significant importance in that they serve as the foundation of estimating reliability. Zhang and Lin advanced the discussion of nonadditivity in data from a theoretical perspective and showed the adverse effects of nonadditivity on…
Descriptors: Generalizability Theory, Reliability, Computation, Statistical Analysis
Porter, Kristin E. – Society for Research on Educational Effectiveness, 2016
In recent years, there has been increasing focus on the issue of multiple hypotheses testing in education evaluation studies. In these studies, researchers are typically interested in testing the effectiveness of an intervention on multiple outcomes, for multiple subgroups, at multiple points in time or across multiple treatment groups. When…
Descriptors: Hypothesis Testing, Intervention, Error Patterns, Evaluation Methods
Haberman, Shelby J.; Lee, Yi-Hsuan – ETS Research Report Series, 2017
In investigations of unusual testing behavior, a common question is whether a specific pattern of responses occurs unusually often within a group of examinees. In many current tests, modern communication techniques can permit quite large numbers of examinees to share keys, or common response patterns, to the entire test. To address this issue,…
Descriptors: Student Evaluation, Testing, Item Response Theory, Maximum Likelihood Statistics
Sturz, Bradley R.; Bell, Z. Kade; Bodily, Kent D. – Journal of Experimental Psychology: Learning, Memory, and Cognition, 2018
During spatial reorientation, the use of local geometric cues (e.g., corner angles) and global geometric cues (e.g., principal axis) is differentially influenced by enclosure size. Local geometric cues exert more influence in large enclosures compared to small enclosures, whereas the use of global geometric cues is not influenced by changes in…
Descriptors: Spatial Ability, Comparative Analysis, Testing, Classification
Timothy Lycurgus; Ben B. Hansen; Mark White – Grantee Submission, 2022
We present an aggregation scheme that increases power in randomized controlled trials and quasi-experiments when the intervention possesses a robust and well-articulated theory of change. Intervention studies using longitudinal data often include multiple observations on individuals, some of which may be more likely to manifest a treatment effect…
Descriptors: Statistical Analysis, Randomized Controlled Trials, Quasiexperimental Design, Intervention
Häggström, Olle – Educational and Psychological Measurement, 2017
Null hypothesis significance testing (NHST) provides an important statistical toolbox, but there are a number of ways in which it is often abused and misinterpreted, with bad consequences for the reliability and progress of science. Parts of contemporary NHST debate, especially in the psychological sciences, is reviewed, and a suggestion is made…
Descriptors: Hypothesis Testing, Statistical Analysis, Psychological Studies, Taxonomy
Ranger, Jochen; Kuhn, Jörg Tobias; Ortner, Tuulia M. – Educational and Psychological Measurement, 2020
The hierarchical model of van der Linden is the most popular model for responses and response times in tests. It is composed of two separate submodels--one for the responses and one for the response times--that are joined at a higher level. The submodel for the response times is based on the lognormal distribution. The lognormal distribution is a…
Descriptors: Reaction Time, Tests, Statistical Distributions, Models
Debelak, Rudolf; Strobl, Carolin – Educational and Psychological Measurement, 2019
M-fluctuation tests are a recently proposed method for detecting differential item functioning in Rasch models. This article discusses a generalization of this method to two additional item response theory models: the two-parametric logistic model and the three-parametric logistic model with a common guessing parameter. The Type I error rate and…
Descriptors: Test Bias, Item Response Theory, Statistical Analysis, Maximum Likelihood Statistics
Raimundo, João A. G.; Turnes, Tiago; de Aguiar, Rafael A.; Lisbôa, Felipe D.; Loch, Thiago; Ribeiro, Guilherme; Caputo, Fabrizio – Research Quarterly for Exercise and Sport, 2019
Purpose: Metabolic perturbation and VO[subscript 2] on-kinetics are potential modifiers of fatigue and vary in importance depending on the exercise task. Thus, performance fatigability during high-intensity exercise seems to be exercise mode dependent, affecting tolerance in the severe domain. However, the effects of exercise mode on severe domain…
Descriptors: Exercise, Comparative Analysis, Physical Activities, Testing
Thompson, W. Burt – Teaching of Psychology, 2019
When a psychologist announces a new research finding, it is often based on a rejected null hypothesis. However, if that hypothesis is true, the claim is a false alarm. Many students mistakenly believe that the probability of committing a false alarm equals alpha, the criterion for statistical significance, which is typically set at 5%. Instructors…
Descriptors: Statistical Analysis, Hypothesis Testing, Misconceptions, Data Interpretation

Peer reviewed
Direct link
