Shunji Wang; Katerina M. Marcoulides; Jiashan Tang; Ke-Hai Yuan – Structural Equation Modeling: A Multidisciplinary Journal, 2024
A necessary step in applying bi-factor models is to evaluate the need for domain factors with a general factor in place. Conventional null hypothesis testing (NHT) has commonly been used for this purpose. However, conventional NHT runs into difficulty when the domain loadings are weak or the sample size is insufficient. This article proposes…
Descriptors: Hypothesis Testing, Error of Measurement, Comparative Analysis, Monte Carlo Methods
Jeffry White – Journal of Educational Research and Practice, 2024
Violations of normality and homogeneity are common in educational data. When this occurs, the use of parametric statistics may be inappropriate. A generalized form of nonparametric analyses based on the Puri and Sen L statistic provides an alternative approach. Using a chi-square distribution, this technique is easy to apply and has significant…
Descriptors: Nonparametric Statistics, Learning Analytics, Evaluation Methods, Guidance
Silber, Henning; Roßmann, Joss; Gummer, Tobias – International Journal of Social Research Methodology, 2018
In this article, we present the results of three question design experiments on inter-item correlations, which tested a grid design against a single-item design. The first and second experiments examined the inter-item correlations of a set with five and seven items, respectively, and the third experiment examined the impact of the question design…
Descriptors: Foreign Countries, Online Surveys, Experiments, Correlation
Spencer, Bryden – ProQuest LLC, 2016
Value-added models are a class of growth models used in education to assign responsibility for student growth to teachers or schools. For value-added models to be used fairly, sufficient statistical precision is necessary for accurate teacher classification. Previous research indicated precision below practical limits. An alternative approach has…
Descriptors: Monte Carlo Methods, Comparative Analysis, Accuracy, High Stakes Tests
Ravesloot, C. J.; Van der Schaaf, M. F.; Muijtjens, A. M. M.; Haaring, C.; Kruitwagen, C. L. J. J.; Beek, F. J. A.; Bakker, J.; Van Schaik, J.P.J.; Ten Cate, Th. J. – Advances in Health Sciences Education, 2015
Formula scoring (FS) is the use of a don't know option (DKO) with subtraction of points for wrong answers. Its effect on the construct validity and reliability of progress test scores is a subject of discussion. Choosing a DKO may be affected not only by knowledge level but also by risk-taking tendency, and may thus introduce construct-irrelevant…
Descriptors: Scoring Formulas, Tests, Scores, Construct Validity
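To make the scoring rule in the abstract above concrete, here is a minimal sketch of the classical correction-for-guessing form of formula scoring. The abstract does not state the penalty used in the study, so the penalty of 1/(k - 1) points per wrong answer on a k-option item is an assumption, and the function name `formula_score` is hypothetical.

```python
def formula_score(num_right: int, num_wrong: int, options_per_item: int) -> float:
    """Classical formula score: right answers minus a penalty for wrong ones.

    Assumption (not from the abstract): each wrong answer on an item with
    `options_per_item` choices costs 1 / (options_per_item - 1) points.
    Items answered "don't know" are simply not counted, so they score zero.
    """
    if options_per_item < 2:
        raise ValueError("an item needs at least two answer options")
    return num_right - num_wrong / (options_per_item - 1)


# 30 right, 8 wrong, 12 "don't know" on a 50-item, five-option test.
print(formula_score(30, 8, 5))  # 28.0
```

Under this rule a pure guesser gains nothing on average, which is why the choice to use the don't know option can reflect risk attitude as well as knowledge.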
Zhang, Jinming; Li, Jie – Journal of Educational Measurement, 2016
An IRT-based sequential procedure is developed to monitor items for enhancing test security. The procedure uses a series of statistical hypothesis tests to examine whether the statistical characteristics of each item under inspection have changed significantly during CAT administration. This procedure is compared with a previously developed…
Descriptors: Computer Assisted Testing, Test Items, Difficulty Level, Item Response Theory
Casleton, Emily; Beyler, Amy; Genschel, Ulrike; Wilson, Alyson – Journal of Statistics Education, 2014
Undergraduate students who have just completed an introductory statistics course often lack deep understanding of variability and enthusiasm for the field of statistics. This paper argues that by introducing the commonly underemphasized concept of measurement error, students will have a better chance of attaining both. We further present lecture…
Descriptors: Undergraduate Students, Statistics, Measurement Techniques, Error of Measurement
Warachan, Boonyasit – ProQuest LLC, 2011
The objective of this research was to determine the robustness and statistical power of three different methods for testing the hypothesis that ordinal samples of five and seven Likert categories come from equal populations. The three methods are the two sample t-test with equal variances, the Mann-Whitney test, and the Kolmogorov-Smirnov test. In…
Descriptors: Statistical Analysis, Likert Scales, Hypothesis Testing, Data
Schochet, Peter Z. – Evaluation Review, 2009
In social policy evaluations, the multiple testing problem occurs due to the many hypothesis tests that are typically conducted across multiple outcomes and subgroups, which can lead to spurious impact findings. This article discusses a framework for addressing this problem that balances Types I and II errors. The framework involves specifying…
Descriptors: Policy, Evaluation, Testing Problems, Hypothesis Testing
Kim, Seonghoon; Feldt, Leonard S. – Journal of Educational Measurement, 2008
This article extends the Bonett (2003a) approach to testing the equality of alpha coefficients from two independent samples to the case of m ≥ 2 independent samples. The extended Fisher-Bonett test and its competitor, the Hakstian-Whalen (1976) test, are illustrated with numerical examples of both hypothesis testing and power…
Descriptors: Tests, Comparative Analysis, Hypothesis Testing, Error of Measurement
Thompson, Bruce – 1990
The use of multiple comparisons in analysis of variance (ANOVA) is discussed. It is argued that experimentwise Type I error rate inflation can be serious and that its influences are often unnoticed in ANOVA applications. Both classical balanced omnibus and orthogonal planned contrast tests inflate experimentwise error to an identifiable maximum.…
Descriptors: Analysis of Variance, Comparative Analysis, Error of Measurement, Hypothesis Testing
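The inflation of experimentwise Type I error described in the abstract above can be illustrated with a standard textbook relationship. This is a sketch under the assumption of k independent tests, each run at the same per-test alpha; it is not taken from the paper, and the function name `experimentwise_error` is hypothetical.

```python
def experimentwise_error(alpha: float, num_tests: int) -> float:
    """Probability of at least one Type I error across independent tests.

    Assumes every test is run at the same per-test alpha and that the
    tests are independent, giving 1 - (1 - alpha) ** num_tests.
    """
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must lie in [0, 1]")
    if num_tests < 1:
        raise ValueError("num_tests must be at least 1")
    return 1.0 - (1.0 - alpha) ** num_tests


for k in (1, 5, 10):
    print(k, round(experimentwise_error(0.05, k), 3))
# 1 0.05
# 5 0.226
# 10 0.401
```

With ten independent tests at alpha = 0.05, the chance of at least one spurious rejection is already about 40 percent.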
Sotaridona, Leonardo S.; van der Linden, Wim J.; Meijer, Rob R. – Applied Psychological Measurement, 2006
A statistical test for detecting answer copying on multiple-choice tests based on Cohen's kappa is proposed. The test is free of any assumptions on the response processes of the examinees suspected of copying and having served as the source, except for the usual assumption that these processes are probabilistic. Because the asymptotic null and…
Descriptors: Cheating, Test Items, Simulation, Statistical Analysis
Alqurashi, Fahad – Online Submission, 2008
This paper reports the findings of an experiment that investigated the reactions of Saudi college students to collaborative learning techniques introduced in two modalities: face-to-face and web-based learning. Quantitative data were collected with a questionnaire that examined the changes of three constructs: attitudes toward collaboration,…
Descriptors: Foreign Countries, College Students, Student Attitudes, Cooperative Learning
Hough, Susan L.; Hall, Bruce W. – 1991
The meta-analytic techniques of G. V. Glass (1976) and J. E. Hunter and F. L. Schmidt (1977) were compared through their application to three meta-analytic studies from education literature. The following hypotheses were explored: (1) the overall mean effect size would be larger in a Hunter-Schmidt meta-analysis (HSMA) than in a Glass…
Descriptors: Comparative Analysis, Educational Research, Effect Size, Error of Measurement
Kristof, Walter – 1971
We concern ourselves with the hypothesis that two variables have a perfect disattenuated correlation, hence measure the same trait except for errors of measurement. This hypothesis is equivalent to saying, within the adopted model, that true scores of two psychological tests satisfy a linear relation. Statistical tests of this hypothesis are…
Descriptors: Analysis of Covariance, Comparative Analysis, Correlation, Error of Measurement
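The disattenuated correlation mentioned in the abstract above is usually computed with Spearman's correction for attenuation. The sketch below assumes that standard formula, the observed correlation divided by the square root of the product of the two reliabilities; the abstract does not give the paper's own test statistics, and the function name is hypothetical.

```python
import math


def disattenuated_correlation(r_xy: float, rel_x: float, rel_y: float) -> float:
    """Spearman's correction for attenuation.

    r_xy  : observed correlation between the two test scores
    rel_x : reliability of test X (must be positive)
    rel_y : reliability of test Y (must be positive)
    """
    if rel_x <= 0.0 or rel_y <= 0.0:
        raise ValueError("reliabilities must be positive")
    return r_xy / math.sqrt(rel_x * rel_y)


# Observed r = 0.6 with reliabilities 0.8 and 0.9.
print(round(disattenuated_correlation(0.6, 0.8, 0.9), 4))  # 0.7071
```

Under this formula the disattenuated correlation equals 1 exactly when the observed correlation equals the square root of the product of the reliabilities, which is the situation the hypothesis in the abstract describes.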