Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 2 |
Since 2006 (last 20 years) | 4 |
Descriptor
Error of Measurement | 15 |
Statistical Analysis | 15 |
Statistical Studies | 15 |
Research Design | 5 |
Evaluation Methods | 4 |
Mathematical Models | 4 |
Nonparametric Statistics | 4 |
Sample Size | 4 |
Sampling | 4 |
Analysis of Variance | 3 |
Data Analysis | 3 |
More ▼ |
Source
Evaluation in Education:… | 1 |
International Journal of… | 1 |
Journal of Educational… | 1 |
Journal of Educational and… | 1 |
Journal of Experimental… | 1 |
National Center for Research… | 1 |
ProQuest LLC | 1 |
Author
Algina, James | 1 |
Bernstein, Lawrence | 1 |
Bloom, Howard S. | 1 |
Bradshaw, Stephen C. | 1 |
Burstein, Nancy | 1 |
Cai, Li | 1 |
Hart, Roland J. | 1 |
Hedges, Larry V. | 1 |
Hill, Carolyn J. | 1 |
Jinjin Huang | 1 |
Lei, Ying | 1 |
More ▼ |
Publication Type
Reports - Research | 9 |
Speeches/Meeting Papers | 5 |
Journal Articles | 4 |
Reports - Evaluative | 3 |
Dissertations/Theses -… | 1 |
ERIC Publications | 1 |
Guides - Non-Classroom | 1 |
Education Level
Elementary Secondary Education | 1 |
Audience
Researchers | 4 |
Location
Australia | 1 |
Laws, Policies, & Programs
Assessments and Surveys
California Psychological… | 1 |
What Works Clearinghouse Rating
Simsek, Ahmet Salih – International Journal of Assessment Tools in Education, 2023
Likert-type item is the most popular response format for collecting data in social, educational, and psychological studies through scales or questionnaires. However, there is no consensus on whether parametric or non-parametric tests should be preferred when analyzing Likert-type data. This study examined the statistical power of parametric and…
Descriptors: Error of Measurement, Likert Scales, Nonparametric Statistics, Statistical Analysis
Jinjin Huang – ProQuest LLC, 2020
Measurement invariance is crucial for an effective and valid measure of a construct. Invariance holds when the latent trait varies consistently across subgroups; in other words, the mean differences among subgroups are only due to true latent ability differences. Differential item functioning (DIF) occurs when measurement invariance is violated.…
Descriptors: Robustness (Statistics), Item Response Theory, Test Items, Item Analysis
Cai, Li; Monroe, Scott – National Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2014
We propose a new limited-information goodness of fit test statistic C[subscript 2] for ordinal IRT models. The construction of the new statistic lies formally between the M[subscript 2] statistic of Maydeu-Olivares and Joe (2006), which utilizes first and second order marginal probabilities, and the M*[subscript 2] statistic of Cai and Hansen…
Descriptors: Item Response Theory, Models, Goodness of Fit, Probability
Longford, Nicholas T. – Journal of Educational and Behavioral Statistics, 2012
Statistical modeling of school effectiveness data was originally motivated by the dissatisfaction with the analysis of (school-leaving) examination results that took no account of the background of the students or regarded each school as an isolated unit of analysis. The application of multilevel analysis was generally regarded as a breakthrough,…
Descriptors: School Effectiveness, Data Analysis, Statistical Analysis, Statistical Studies

McMorris, Robert F. – Journal of Educational Measurement, 1972
Approximations were compared with exact statistics obtained on 85 different classroom tests constructed and administered by professors in a variety of fields; means and standard deviation of the resulting differences supported the use of approximations in practical situations. (Author)
Descriptors: Error of Measurement, Measurement Instruments, Reliability, Statistical Analysis
Hedges, Larry V. – 1982
Meta-analysis has become an important supplement to traditional methods of research reviewing, although many problems must be addressed by the reviewer who carries out a meta-analysis. These problems include identifying and obtaining appropriate studies, extracting estimates of effect size from the studies, coding or classifying studies, analyzing…
Descriptors: Analysis of Variance, Correlation, Error of Measurement, Mathematical Models
Ross, Kenneth N. – Evaluation in Education: International Progress, 1978
Student's empirical sampling approach is used to assess the magnitude of the sampling errors of statistics describing a recursive causal model. The data were gathered with four complex sample designs commonly used in educational surveys. Jackknife and half-sample error estimates are applied to the data. (Author/CTM)
Descriptors: Error of Measurement, Foreign Countries, Probability, Research Design

Zimmerman, Donald W. – Journal of Experimental Education, 1987
A program obtained random samples from known populations, some of which violated the homogeneity assumption. Student t tests and Mann-Whitney U Tests were performed on the sample value. Where the t test led to incorrect decisions, the use of Mann-Whitney U test in its place led to poorer results. (JAZ)
Descriptors: Computer Software, Error of Measurement, Monte Carlo Methods, Nonparametric Statistics
Skaggs, Gary; Lissitz, Robert W. – 1985
This study examined how four commonly used test equating procedures (linear, equipercentile, Rasch Model, and three-parameter) would respond to situations in which the properties or the two tests being equated were different. Data for two tests plus an external anchor test were generated from a three parameter model in which mean test differences…
Descriptors: Computer Simulation, Equated Scores, Error of Measurement, Goodness of Fit
Olejnik, Stephen F.; Algina, James – 1986
Sampling distributions for ten tests for comparing population variances in a two group design were generated for several combinations of equal and unequal sample sizes, population means, and group variances when distributional forms differed. The ten procedures included: (1) O'Brien's (OB); (2) O'Brien's with adjusted degrees of freedom; (3)…
Descriptors: Error of Measurement, Evaluation Methods, Measurement Techniques, Nonparametric Statistics

Misanchuk, Earl R. – 1978
Multiple matrix sampling of three subscales of the California Psychological Inventory was used to investigate the effects of four variables on error estimates of the mean (EEM) and variance (EEV). The four variables were examinee population size (600, 450, 300, 150, 100, and 75); number of subtests, (2, 3, 4, 5, 6, and 7), hence the number of…
Descriptors: Adults, Analysis of Variance, Error of Measurement, Item Sampling
Livingston, Samuel A. – 1986
This paper deals with test fairness regarding a test consisting of two parts: (1) a "common" section, taken by all students; and (2) a "variable" section, in which some students may answer a different set of questions from other students. For example, a test taken by several thousand students each year contains a common multiple-choice portion and…
Descriptors: Difficulty Level, Error of Measurement, Essay Tests, Mathematical Models
Bernstein, Lawrence; Burstein, Nancy – 1994
The inherent methodological problem in conducting research at multiple sites is how to best derive an overall estimate of program impact across multiple sites, best being the estimate that minimizes the mean square error, that is, the square of the difference between the observed and true values. An empirical example illustrates the use of the…
Descriptors: Bias, Comprehensive Programs, Data Analysis, Data Collection
Hart, Roland J.; Bradshaw, Stephen C. – 1981
This report provides the statistical tools necessary to measure the extent of error that exists in organizational record data and group survey data. It is felt that traditional methods of measuring error are inappropriate or incomplete when applied to organizational groups, especially in studies of organizational change when the same variables are…
Descriptors: Adults, Analysis of Variance, Error of Measurement, Mathematical Formulas
Bloom, Howard S.; Michalopoulos, Charles; Hill, Carolyn J.; Lei, Ying – 2002
A study explored which nonexperimental comparison group methods provide the most accurate estimates of the impacts of mandatory welfare-to-work programs and whether the best methods work well enough to substitute for random assignment experiments. Findings were compared for nonexperimental comparison groups and statistical adjustment procedures…
Descriptors: Adult Education, Comparative Analysis, Control Groups, Error of Measurement