Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 3 |
Since 2006 (last 20 years) | 7 |
Descriptor
Evaluation Methods | 8 |
Monte Carlo Methods | 8 |
Reliability | 8 |
Effect Size | 2 |
Error of Measurement | 2 |
Generalization | 2 |
Meta Analysis | 2 |
Sample Size | 2 |
Simulation | 2 |
Statistical Analysis | 2 |
Test Items | 2 |
More ▼ |
Source
Applied Psychological… | 1 |
Educational and Psychological… | 1 |
Grantee Submission | 1 |
Journal of Experimental… | 1 |
Language Testing | 1 |
Measurement:… | 1 |
ProQuest LLC | 1 |
Author
Allam, Reynald | 1 |
Brannick, Michael T. | 1 |
Douglas, Jeff | 1 |
He, Xuming | 1 |
Henson, Robert | 1 |
Kromrey, Jeffrey D. | 1 |
Lin, Chih-Kai | 1 |
Mason, Corinne | 1 |
Novak, Josip | 1 |
Owens, Corina M. | 1 |
Perlman, Carole L. | 1 |
More ▼ |
Publication Type
Journal Articles | 6 |
Reports - Research | 5 |
Reports - Evaluative | 2 |
Dissertations/Theses -… | 1 |
Speeches/Meeting Papers | 1 |
Education Level
Audience
Location
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Novak, Josip; Rebernjak, Blaž – Measurement: Interdisciplinary Research and Perspectives, 2023
A Monte Carlo simulation study was conducted to examine the performance of [alpha], [lambda]2, [lambda][subscript 4], [lambda][subscript 2], [omega][subscript T], GLB[subscript MRFA], and GLB[subscript Algebraic] coefficients. Population reliability, distribution shape, sample size, test length, and number of response categories were varied…
Descriptors: Monte Carlo Methods, Evaluation Methods, Reliability, Simulation
Lin, Chih-Kai – Language Testing, 2017
Sparse-rated data are common in operational performance-based language tests, as an inevitable result of assigning examinee responses to a fraction of available raters. The current study investigates the precision of two generalizability-theory methods (i.e., the rating method and the subdividing method) specifically designed to accommodate the…
Descriptors: Data Analysis, Language Tests, Generalizability Theory, Accuracy
Yuan, Ke-Hai; Zhang, Zhiyong; Zhao, Yanyun – Grantee Submission, 2017
The normal-distribution-based likelihood ratio statistic T[subscript ml] = nF[subscript ml] is widely used for power analysis in structural Equation modeling (SEM). In such an analysis, power and sample size are computed by assuming that T[subscript ml] follows a central chi-square distribution under H[subscript 0] and a noncentral chi-square…
Descriptors: Statistical Analysis, Evaluation Methods, Structural Equation Models, Reliability
Smith, Julie M. – ProQuest LLC, 2011
This study examines the proposed Reliability Generalization (RG) method for studying reliability. RG employs the application of meta-analytic techniques similar to those used in validity generalization studies to examine reliability coefficients. This study explains why RG does not provide a proper research method for the study of reliability,…
Descriptors: Reliability, Generalization, Sampling, Research Methodology
Romano, Jeanine L.; Kromrey, Jeffrey D.; Owens, Corina M.; Scott, Heather M. – Journal of Experimental Education, 2011
In this study, the authors aimed to examine 8 of the different methods for computing confidence intervals around alpha that have been proposed to determine which of these, if any, is the most accurate and precise. Monte Carlo methods were used to simulate samples under known and controlled population conditions wherein the underlying item…
Descriptors: Intervals, Monte Carlo Methods, Rating Scales, Computation
Mason, Corinne; Allam, Reynald; Brannick, Michael T. – Educational and Psychological Measurement, 2007
Reliability generalization studies have provided estimates of the mean reliability coefficients and examined factors that explain the variability in the reliability estimates across studies for many different tests and measures. Different authors have used different data analyses to do such meta-analyses, and little research has addressed whether…
Descriptors: Reliability, Monte Carlo Methods, Meta Analysis, Generalization
Henson, Robert; Roussos, Louis; Douglas, Jeff; He, Xuming – Applied Psychological Measurement, 2008
Cognitive diagnostic models (CDMs) model the probability of correctly answering an item as a function of an examinee's attribute mastery pattern. Because estimation of the mastery pattern involves more than a continuous measure of ability, reliability concepts introduced by classical test theory and item response theory do not apply. The cognitive…
Descriptors: Diagnostic Tests, Classification, Probability, Item Response Theory
Perlman, Carole L. – 1982
The value-added model is presented as a method of assessing the impact of treatments in pretest-posttest quasi-experimental designs. The model yields a projection of treatment group performance at posttest time with no special treatment. The difference between the expected result and the actual outcome is "the value added," a measure of…
Descriptors: Achievement Gains, Age Differences, Early Childhood Education, Evaluation Methods