Blackwell, Matthew; Honaker, James; King, Gary – Sociological Methods & Research, 2017
We extend a unified and easy-to-use approach to measurement error and missing data. In our companion article, Blackwell, Honaker, and King give an intuitive overview of the new technique, along with practical suggestions and empirical applications. Here, we offer more precise technical details, more sophisticated measurement error model…
Descriptors: Error of Measurement, Correlation, Simulation, Bayesian Statistics
McCaffrey, Daniel F.; Casabianca, Jodi M. – Society for Research on Educational Effectiveness, 2013
As the education reform movement increasingly focuses on teachers and teaching, educators, policy-makers, and researchers need valid and reliable measures that can be used to evaluate individual teachers, provide guidance for improving teaching performance, and support research in ways that advance instruction and classroom dialog and practice. A…
Descriptors: Urban Schools, Classroom Observation Techniques, Video Technology, Observation
del Pino, Guido; San Martin, Ernesto; Gonzalez, Jorge; De Boeck, Paul – Psychometrika, 2008
This paper analyzes the sum score based (SSB) formulation of the Rasch model, where items and sum scores of persons are considered as factors in a logit model. After reviewing the evolution leading to the equality between their maximum likelihood estimates, the SSB model is then discussed from the point of view of pseudo-likelihood and of…
Descriptors: Computation, Models, Scores, Evaluation Methods
Eid, Michael; Nussbeck, Fridtjof W.; Geiser, Christian; Cole, David A.; Gollwitzer, Mario; Lischetzke, Tanja – Psychological Methods, 2008
The question as to which structural equation model should be selected when multitrait-multimethod (MTMM) data are analyzed is of interest to many researchers. In the past, attempts to find a well-fitting model have often been data-driven and highly arbitrary. In the present article, the authors argue that the measurement design (type of methods…
Descriptors: Structural Equation Models, Multitrait Multimethod Techniques, Statistical Analysis, Error of Measurement
Hartig, Johannes; Holzel, Britta; Moosbrugger, Helfried – Multivariate Behavioral Research, 2007
Numerous studies have shown increasing item reliabilities as an effect of the item position in personality scales. Traditionally, these context effects are analyzed based on item-total correlations. This approach neglects that trends in item reliabilities can be caused either by an increase in true score variance or by a decrease in error…
Descriptors: True Scores, Error of Measurement, Structural Equation Models, Simulation
Wang, Zhongmiao; Thompson, Bruce – Journal of Experimental Education, 2007
In this study the authors investigated the use of 5 (i.e., Claudy, Ezekiel, Olkin-Pratt, Pratt, and Smith) R[squared] correction formulas with the Pearson r[squared]. The authors estimated adjustment bias and precision under 6 x 3 x 6 conditions (i.e., population [rho] values of 0.0, 0.1, 0.3, 0.5, 0.7, and 0.9; population shapes normal, skewness…
Descriptors: Effect Size, Correlation, Mathematical Formulas, Monte Carlo Methods
Kish, Leslie – 1989
A brief, practical overview of "design effects" (DEFFs) is presented for users of the results of sample surveys. The overview is intended to help such users to determine how and when to use DEFFs and to compute them correctly. DEFFs are needed only for inferential statistics, not for descriptive statistics. When the selections for…
Descriptors: Computer Software, Error of Measurement, Mathematical Models, Research Design
Neel, John H. – 1987
Determination of statistical power for analysis of variance procedures requires five elements: (1) significance level; (2) effect size; (3) number of means; (4) error variance; and (5) sample size. Significance levels are traditionally chosen to be .05, .01, or .001. Effect size is not discussed in this paper. The number of means is determined by…
Descriptors: Analysis of Variance, Error of Measurement, Mathematical Models, Power (Statistics)
Robey, Randall R.; Barcikowski, Robert S. – 1987
The mixed model analysis of variance assumes a mathematical property known as sphericity. Several preliminary tests have been proposed to detect departures from the sphericity assumption. The logic of the preliminary testing procedure is to conduct the mixed model analysis of variance if the preliminary test suggests that the sphericity assumption…
Descriptors: Analysis of Variance, Error of Measurement, Hypothesis Testing, Mathematical Models
Wainer, Howard; Thissen, David – 1985
Using simulated item response data, the performance of several "robust" and conventional schemes for ability estimation was evaluated in conjunction with logistic item response theory models (one, two, and three parameter models). The simulated item response data were generated using a model that is more complex than are the usual…
Descriptors: Adaptive Testing, Adults, Computer Assisted Testing, Error of Measurement
Interpreting the Results of Weighted Least-Squares Regression: Caveats for the Statistical Consumer.
Willett, John B.; Singer, Judith D. – 1987
In research, data sets often occur in which the variance of the distribution of the dependent variable at given levels of the predictors is a function of the values of the predictors. In this situation, the use of weighted least-squares (WLS) techniques is required. Weights suitable for use in a WLS regression analysis must be estimated. A…
Descriptors: Error of Measurement, Estimation (Mathematics), Goodness of Fit, Least Squares Statistics

Silverman, B. W.; Wilson, J. D. – Journal of Documentation, 1987
This study estimates the proportion of academic and public library acquisitions in the United Kingdom for which a UK MARC record is available at the time of cataloging. A beta-binomial model is used to attribute standard errors to each estimate and to compare differences between the two types of libraries. (Author/LRW)
Descriptors: Academic Libraries, Cataloging, Comparative Analysis, Developed Nations

Yen, Wendy M. – Journal of Educational Measurement, 1984
A procedure for obtaining maximum likelihood trait estimates from number-correct (NC) scores for the three-parameter logistic model is presented. It produces an NC score to trait estimate conversion table. Analyses in the estimated true score metric confirm the conclusions made in the trait metric. (Author/DWH)
Descriptors: Achievement Tests, Error of Measurement, Estimation (Mathematics), Latent Trait Theory
Kulik, James A.; Kulik, Chen-Lin C. – 1986
Statistical methodologists have sometimes criticized the use of conventional statistics in meta-analysis, and in recent years a number of them have advocated the use of a special new statistical methodology for research synthesis. An examination of recent books describing this methodology shows that it is seriously limited in its applicability to…
Descriptors: Effect Size, Error of Measurement, Estimation (Mathematics), Mathematical Models
Skaggs, Gary; Lissitz, Robert W. – 1985
This study examined how four commonly used test equating procedures (linear, equipercentile, Rasch Model, and three-parameter) would respond to situations in which the properties of the two tests being equated were different. Data for two tests plus an external anchor test were generated from a three-parameter model in which mean test differences…
Descriptors: Computer Simulation, Equated Scores, Error of Measurement, Goodness of Fit