Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 0 |
Since 2006 (last 20 years) | 2 |
Descriptor
Reliability | 15 |
Statistical Distributions | 15 |
Sample Size | 4 |
Sampling | 4 |
Scaling | 4 |
Test Construction | 4 |
Comparative Analysis | 3 |
Error of Measurement | 3 |
Mathematical Models | 3 |
Monte Carlo Methods | 3 |
Scores | 3 |
More ▼ |
Source
Author
Publication Type
Reports - Evaluative | 15 |
Journal Articles | 11 |
Speeches/Meeting Papers | 4 |
Education Level
Audience
Location
United Kingdom | 1 |
Laws, Policies, & Programs
Assessments and Surveys
Work Keys (ACT) | 1 |
What Works Clearinghouse Rating
Shu, Lianghua; Schwarz, Richard D. – Journal of Educational Measurement, 2014
As a global measure of precision, item response theory (IRT) estimated reliability is derived for four coefficients (Cronbach's a, Feldt-Raju, stratified a, and marginal reliability). Models with different underlying assumptions concerning test-part similarity are discussed. A detailed computational example is presented for the targeted…
Descriptors: Item Response Theory, Reliability, Models, Computation
Rodriguez, Michael C.; Maeda, Yukiko – Psychological Methods, 2006
The meta-analysis of coefficient alpha across many studies is becoming more common in psychology by a methodology labeled reliability generalization. Existing reliability generalization studies have not used the sampling distribution of coefficient alpha for precision weighting and other common meta-analytic procedures. A framework is provided for…
Descriptors: Generalization, Sampling, Reliability, Meta Analysis
Dolenz, Beverly – 1992
The correlation coefficient is an integral part of many other statistical techniques (analysis of variance, t-tests, etc.), since all analytic methods are actually correlational (G. V. Glass and K. D. Hopkins, 1984). The correlation coefficient is a statistical summary that represents the degree and direction of relationship between two variables.…
Descriptors: Analysis of Variance, Correlation, Heuristics, Relationship

Enders, Craig K.; Bandalos, Deborah L. – Applied Measurement in Education, 1999
Examined the degree to which coefficient alpha is affected by including items with different distribution shapes within a unidimensional scale. Computer simulation results indicate that reliability does not increase dramatically as a result of using differentially shaped items within a scale. Discusses implications for test construction. (SLD)
Descriptors: Computer Simulation, Reliability, Scaling, Statistical Distributions

Blair, R. Clifford; Higgins, James J. – Journal of Educational Statistics, 1985
This study was concerned with the effects of reliability of observations, sample size, magnitudes of treatment effects, and the shape of the sampled population on the relative power of the paired samples rank transform statistic and Wilcoxon's signed ranks statistic. (Author/LMO)
Descriptors: Effect Size, Hypothesis Testing, Power (Statistics), Reliability

Samejima, Fumiko – Applied Psychological Measurement, 1994
The reliability coefficient is predicted from the test information function (TIF) or two modified TIF formulas and a specific trait distribution. Examples illustrate the variability of the reliability coefficient across different trait distributions, and results are compared with empirical reliability coefficients. (SLD)
Descriptors: Adaptive Testing, Error of Measurement, Estimation (Mathematics), Reliability

Alsawalmeh, Yousef M.; Feldt, Leonard S. – Applied Psychological Measurement, 1992
An approximate statistical test is derived for the hypothesis that the intraclass reliability coefficients associated with two measurement procedures are equal. Control of Type 1 error is investigated by comparing empirical sampling distributions of the test statistic with its derived theoretical distribution. A numerical illustration is…
Descriptors: Equations (Mathematics), Hypothesis Testing, Mathematical Models, Measurement Techniques

Bramley, Tom – Evaluation & Research in Education, 2001
Analyzed data from a session of the General Certificate of Secondary Education (GCSE) mathematics examination to identify items displaying a bi-modal expected score distribution, try to explain the bi-modality, rescore the items to remove under-used middle categories, and determine the effect on test reliability of rescoring the data. Discusses…
Descriptors: Foreign Countries, Mathematics Tests, Reliability, Scores

Cornwell, John M. – Educational and Psychological Measurement, 1993
A comparison is made of the power and actual alpha levels of three tests of homogeneity for independent product-moment correlation coefficients using Monte Carlo methods while selectively studying sample size and varying the number of correlation reliabilities. How robust these are in applied work is discussed. (SLD)
Descriptors: Comparative Analysis, Correlation, Error of Measurement, Monte Carlo Methods
Wang, Tianyou; And Others – 1996
M. J. Kolen, B. A. Hanson, and R. L. Brennan (1992) presented a procedure for assessing the conditional standard error of measurement (CSEM) of scale scores using a strong true-score model. They also investigated the ways of using nonlinear transformation from number-correct raw score to scale score to equalize the conditional standard error along…
Descriptors: Ability, Classification, Error of Measurement, Goodness of Fit

Luecht, Richard M.; Hirsch, Thomas M. – Applied Psychological Measurement, 1992
Derivations of several item selection algorithms for use in fitting test items to target information functions (IFs) are described. These algorithms, which use an average growth approximation of target IFs, were tested by generating six test forms and were found to provide reliable fit. (SLD)
Descriptors: Algorithms, Computer Assisted Testing, Equations (Mathematics), Goodness of Fit

Kiger, Jack E.; Wise, Kenneth – College and Research Libraries, 1993
Describes the of attribute sampling to estimate characteristics of library collections and operations. The nature of statistical sampling and making a statistical inference are covered, and examples from library situations are given. Tables of determination of sample size and evaluation of results are included. (Contains six references.) (EAM)
Descriptors: Expectancy Tables, Library Administration, Library Collections, Methods
Johnson, Colleen Cook – 1993
The purpose of this study is to help define the precise nature and limits of the tolerable range in which a researcher may be relatively confident about the statistical validity of his or her research findings, focusing specifically on the statistical validity of results when violating the assumptions associated with the one-way, fixed-effects…
Descriptors: Analysis of Covariance, Analysis of Variance, Comparative Analysis, Computer Simulation

Schiel, Jeffrey L.; Shaw, Dale G. – Applied Measurement in Education, 1992
Changes in information retention resulting from changes in reliability and number of intervals in scale construction were studied to provide quantitative information to help in decisions about choosing intervals. Information retention reached a maximum when the number of intervals was about 8 or more and reliability was near 1.0. (SLD)
Descriptors: Decision Making, Knowledge Level, Mathematical Models, Monte Carlo Methods
Reckase, Mark D. – 1993
In this non-experimental study, a model was developed for portfolio assessment based on definitions and applications in the assessment literature. This model describes portfolio components, scores to be computed, and uses to be made of the scores. The literature was then reviewed to find examples of actual applications that would provide realistic…
Descriptors: Educational Assessment, Estimation (Mathematics), Grade 12, High School Students