NotesFAQContact Us
Collection
Advanced
Search Tips
What Works Clearinghouse Rating
Does not meet standards1
Showing 1,411 to 1,425 of 3,316 results Save | Export
Puhan, Gautam – Educational Testing Service, 2010
This study used real data to construct testing conditions for comparing results of chained linear, Tucker, and Levine-observed score equatings. The comparisons were made under conditions where the new- and old-form samples were similar in ability and when they differed in ability. The length of the anchor test was also varied to enable examination…
Descriptors: Equated Scores, Comparative Analysis, Statistical Analysis, Statistical Bias
Lee, Taehun – ProQuest LLC, 2010
In this dissertation, an Expectation-Maximization (EM) algorithm is developed and implemented to obtain maximum likelihood estimates of the parameters and the associated standard error estimates characterizing temporal flows for the latent variable time series following stationary vector ARMA processes, as well as the parameters defining the…
Descriptors: Maximum Likelihood Statistics, Computation, Mathematics, Factor Analysis
Peer reviewed Peer reviewed
Direct linkDirect link
Savalei, Victoria – Psychological Methods, 2010
Maximum likelihood is the most common estimation method in structural equation modeling. Standard errors for maximum likelihood estimates are obtained from the associated information matrix, which can be estimated from the sample using either expected or observed information. It is known that, with complete data, estimates based on observed or…
Descriptors: Structural Equation Models, Computation, Error of Measurement, Data
Peer reviewed Peer reviewed
Direct linkDirect link
Draxler, Clemens – Psychometrika, 2010
This paper is concerned with supplementing statistical tests for the Rasch model so that additionally to the probability of the error of the first kind (Type I probability) the probability of the error of the second kind (Type II probability) can be controlled at a predetermined level by basing the test on the appropriate number of observations.…
Descriptors: Statistical Analysis, Probability, Sample Size, Error of Measurement
Peer reviewed Peer reviewed
Direct linkDirect link
Wang, Wen-Chung; Shih, Ching-Lin – Applied Psychological Measurement, 2010
Three multiple indicators-multiple causes (MIMIC) methods, namely, the standard MIMIC method (M-ST), the MIMIC method with scale purification (M-SP), and the MIMIC method with a pure anchor (M-PA), were developed to assess differential item functioning (DIF) in polytomous items. In a series of simulations, it appeared that all three methods…
Descriptors: Methods, Test Bias, Test Items, Error of Measurement
Peer reviewed Peer reviewed
Direct linkDirect link
Zavorsky, Gerald S. – Measurement in Physical Education and Exercise Science, 2010
Measurement error is a common problem in several fields of research such as medicine, physiology, and exercise science. The standard deviation of repeated measurements on the same person is the measurement error. One way of presenting measurement error is called the repeatability, which is 2.77 multiplied by the within subject standard deviation.…
Descriptors: Physiology, Exercise Physiology, Medicine, Error of Measurement
Peer reviewed Peer reviewed
Direct linkDirect link
Guo, Hongwen – Psychometrika, 2010
After many equatings have been conducted in a testing program, equating errors can accumulate to a degree that is not negligible compared to the standard error of measurement. In this paper, the author investigates the asymptotic accumulative standard error of equating (ASEE) for linear equating methods, including chained linear, Tucker, and…
Descriptors: Testing Programs, Testing, Error of Measurement, Equated Scores
Peer reviewed Peer reviewed
Direct linkDirect link
Paek, Insu – Applied Psychological Measurement, 2010
Conservative bias in rejection of a null hypothesis from using the continuity correction in the Mantel-Haenszel (MH) procedure was examined through simulation in a differential item functioning (DIF) investigation context in which statistical testing uses a prespecified level [alpha] for the decision on an item with respect to DIF. The standard MH…
Descriptors: Test Bias, Statistical Analysis, Sample Size, Error of Measurement
Peer reviewed Peer reviewed
Direct linkDirect link
Kim, Young-Joo – Education Economics, 2013
This paper studies the effect of the Head Start program on children's achievements in reading and math tests during their first 4 years of schooling after completing the program. Using nationally representative data from the Early Childhood Longitudinal Study, I found large measurement error in the parental reports of Head Start attendance, which…
Descriptors: Preschool Education, Graduate Surveys, Preschool Evaluation, Mathematics Achievement
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Williams, Matt N.; Gomez Grajales, Carlos Alberto; Kurkiewicz, Dason – Practical Assessment, Research & Evaluation, 2013
In 2002, an article entitled "Four assumptions of multiple regression that researchers should always test" by Osborne and Waters was published in "PARE." This article has gone on to be viewed more than 275,000 times (as of August 2013), and it is one of the first results displayed in a Google search for "regression…
Descriptors: Multiple Regression Analysis, Misconceptions, Reader Response, Predictor Variables
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Abu-Hamour, Bashir – International Journal of Special Education, 2013
This study examined the applicability of the Arabic version of the Curriculum Based Measurement Maze (CBM Maze) for Jordanian students. A sample of 150 students was recruited from two public primary schools in Jordan. The students were ranked into high, moderate, and low achievers in terms of their performance in the Arabic course. Then all of…
Descriptors: Foreign Countries, Elementary School Students, Semitic Languages, Grade Point Average
Khawand, Christopher – Society for Research on Educational Effectiveness, 2012
Instrumental variables (IV) methods allow for consistent estimation of causal effects, but suffer from poor finite-sample properties and data availability constraints. IV estimates also tend to have relatively large standard errors, often inhibiting the interpretability of differences between IV and non-IV point estimates. Lastly, instrumental…
Descriptors: Least Squares Statistics, Labor Supply, Measurement Techniques, Error of Measurement
Kenney McCulloch, Susan – ProQuest LLC, 2012
Many telephone surveys require interviewers to observe and record respondents' gender based solely on respondents' voice. Researchers may rely on these observations to: (1) screen for study eligibility; (2) determine skip patterns; (3) foster interviewer tailoring strategies; (4) contribute to nonresponse assessment and adjustments; (5)…
Descriptors: Telephone Surveys, Gender Differences, Acoustics, Observation
Stephens, Christopher Neil – ProQuest LLC, 2012
Augmentation procedures are designed to provide better estimates for a given test or subtest through the use of collateral information. The main purpose of this dissertation was to use Haberman's and Wainer's augmentation procedures on a large-scale, standardized achievement test to understand the relationship between reliability and…
Descriptors: Psychometrics, Error of Measurement, Scores, Reliability
Denbleyker, John Nickolas – ProQuest LLC, 2012
The shortcomings of the proportion above cut (PAC) statistic used so prominently in the educational landscape renders it a very problematic measure for making correct inferences with student test data. The limitations of PAC-based statistics are more pronounced with cross-test comparisons due to their dependency on cut-score locations. A better…
Descriptors: Achievement Gap, Bayesian Statistics, Inferences, Trend Analysis
Pages: 1  |  ...  |  91  |  92  |  93  |  94  |  95  |  96  |  97  |  98  |  99  |  ...  |  222