Publication Date
In 2025 | 39 |
Since 2024 | 192 |
Since 2021 (last 5 years) | 495 |
Since 2016 (last 10 years) | 996 |
Since 2006 (last 20 years) | 2028 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
Researchers | 93 |
Practitioners | 23 |
Teachers | 22 |
Policymakers | 10 |
Administrators | 5 |
Students | 4 |
Counselors | 2 |
Parents | 2 |
Community | 1 |
Location
United States | 47 |
Germany | 42 |
Australia | 34 |
Canada | 27 |
Turkey | 27 |
California | 22 |
United Kingdom (England) | 20 |
Netherlands | 18 |
China | 16 |
New York | 15 |
United Kingdom | 15 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Does not meet standards | 1 |
Puhan, Gautam – Educational Testing Service, 2010
This study used real data to construct testing conditions for comparing results of chained linear, Tucker, and Levine-observed score equatings. The comparisons were made under conditions where the new- and old-form samples were similar in ability and when they differed in ability. The length of the anchor test was also varied to enable examination…
Descriptors: Equated Scores, Comparative Analysis, Statistical Analysis, Statistical Bias
Lee, Taehun – ProQuest LLC, 2010
In this dissertation, an Expectation-Maximization (EM) algorithm is developed and implemented to obtain maximum likelihood estimates of the parameters and the associated standard error estimates characterizing temporal flows for the latent variable time series following stationary vector ARMA processes, as well as the parameters defining the…
Descriptors: Maximum Likelihood Statistics, Computation, Mathematics, Factor Analysis
Savalei, Victoria – Psychological Methods, 2010
Maximum likelihood is the most common estimation method in structural equation modeling. Standard errors for maximum likelihood estimates are obtained from the associated information matrix, which can be estimated from the sample using either expected or observed information. It is known that, with complete data, estimates based on observed or…
Descriptors: Structural Equation Models, Computation, Error of Measurement, Data
Draxler, Clemens – Psychometrika, 2010
This paper is concerned with supplementing statistical tests for the Rasch model so that additionally to the probability of the error of the first kind (Type I probability) the probability of the error of the second kind (Type II probability) can be controlled at a predetermined level by basing the test on the appropriate number of observations.…
Descriptors: Statistical Analysis, Probability, Sample Size, Error of Measurement
Wang, Wen-Chung; Shih, Ching-Lin – Applied Psychological Measurement, 2010
Three multiple indicators-multiple causes (MIMIC) methods, namely, the standard MIMIC method (M-ST), the MIMIC method with scale purification (M-SP), and the MIMIC method with a pure anchor (M-PA), were developed to assess differential item functioning (DIF) in polytomous items. In a series of simulations, it appeared that all three methods…
Descriptors: Methods, Test Bias, Test Items, Error of Measurement
Zavorsky, Gerald S. – Measurement in Physical Education and Exercise Science, 2010
Measurement error is a common problem in several fields of research such as medicine, physiology, and exercise science. The standard deviation of repeated measurements on the same person is the measurement error. One way of presenting measurement error is called the repeatability, which is 2.77 multiplied by the within subject standard deviation.…
Descriptors: Physiology, Exercise Physiology, Medicine, Error of Measurement
Guo, Hongwen – Psychometrika, 2010
After many equatings have been conducted in a testing program, equating errors can accumulate to a degree that is not negligible compared to the standard error of measurement. In this paper, the author investigates the asymptotic accumulative standard error of equating (ASEE) for linear equating methods, including chained linear, Tucker, and…
Descriptors: Testing Programs, Testing, Error of Measurement, Equated Scores
Paek, Insu – Applied Psychological Measurement, 2010
Conservative bias in rejection of a null hypothesis from using the continuity correction in the Mantel-Haenszel (MH) procedure was examined through simulation in a differential item functioning (DIF) investigation context in which statistical testing uses a prespecified level [alpha] for the decision on an item with respect to DIF. The standard MH…
Descriptors: Test Bias, Statistical Analysis, Sample Size, Error of Measurement
Osborne, Jason W. – Practical Assessment, Research & Evaluation, 2011
Large surveys often use probability sampling in order to obtain representative samples, and these data sets are valuable tools for researchers in all areas of science. Yet many researchers are not formally prepared to appropriately utilize these resources. Indeed, users of one popular dataset were generally found "not" to have modeled…
Descriptors: Best Practices, Sampling, Sample Size, Data Analysis
De Witte, Kristof; Rogge, Nicky – Economics of Education Review, 2011
Students' evaluations of teacher performance (SETs) are increasingly used by universities. However, SETs are controversial mainly due to two issues: (1) teachers value various aspects of excellent teaching differently, and (2) SETs should not be determined on exogenous influences. Therefore, this paper constructs SETs using a tailored version of…
Descriptors: Student Evaluation of Teacher Performance, College Students, College Faculty, Error of Measurement
Quas, Jodi A. – Journal of Cognition and Development, 2011
In this article the author describes challenges associated with integrating physiological measures of stress into developmental research, especially in the domains of memory and cognition. An initial critical challenge concerns how to define stress, which can refer to one or a series of events, a response, the consequence of that response, an…
Descriptors: Expertise, Stress Management, Physiology, Measures (Individuals)
Gagnon, Robert; Lubarsky, Stuart; Lambert, Carole; Charlin, Bernard – Advances in Health Sciences Education, 2011
The Script Concordance Test (SCT) uses a panel-based, aggregate scoring method that aims to capture the variability of responses of experienced practitioners to particular clinical situations. The use of this type of scoring method is a key determinant of the tool's discriminatory power, but deviant answers could potentially diminish the…
Descriptors: Expertise, Oncology, Scoring, Error of Measurement
Cole, David A.; Cai, Li; Martin, Nina C.; Findling, Robert L.; Youngstrom, Eric A.; Garber, Judy; Curry, John F.; Hyde, Janet S.; Essex, Marilyn J.; Compas, Bruce E.; Goodyer, Ian M.; Rohde, Paul; Stark, Kevin D.; Slattery, Marcia J.; Forehand, Rex – Psychological Assessment, 2011
Our goals in this article were to use item response theory (IRT) to assess the relation of depressive symptoms to the underlying dimension of depression and to demonstrate how IRT-based measurement strategies can yield more reliable data about depression severity than conventional symptom counts. Participants were 3,403 children and adolescents…
Descriptors: Schizophrenia, Measurement, Error of Measurement, Severity (of Disability)
Collins, Rebecca L.; Martino, Steven C.; Elliott, Marc N. – Developmental Psychology, 2011
Longitudinal research has demonstrated a link between exposure to sexual content in media and subsequent changes in adolescent sexual behavior, including initiation of intercourse and various noncoital sexual activities. Based on a reanalysis of one of the data sets involved, Steinberg and Monahan (2011) have challenged these findings. However,…
Descriptors: Sexuality, Mass Media Effects, Adolescents, Evaluation Methods
Williams, Justin H. G.; Casey, Jackie M.; Braadbaart, Lieke; Culmer, Peter R.; Mon-Williams, Mark – Journal of Cognition and Development, 2014
We sought to develop a method for measuring imitation accuracy objectively in primary school children. Children imitated a model drawing shapes on the same computer-tablet interface they saw used in video clips, allowing kinematics of model and observers' actions to be directly compared. Imitation accuracy was reported as a correlation reflecting…
Descriptors: Imitation, Elementary School Students, Fidelity, Accuracy