Showing 1,201 to 1,215 of 3,295 results
Peer reviewed
Bramley, Tom; Dhawan, Vikas – Research Papers in Education, 2013
This paper discusses the issues involved in calculating indices of composite reliability for "modular" or "unitised" assessments of the kind used in GCSEs, AS and A level examinations in England. The increasingly widespread use of on-screen marking has meant that the item-level data required for calculating indices of…
Descriptors: Foreign Countries, Exit Examinations, Secondary Education, Test Reliability
Peer reviewed
Keller, Bryan S. B.; Kim, Jee-Seon; Steiner, Peter M. – Society for Research on Educational Effectiveness, 2013
Propensity score analysis (PSA) is a methodological technique which may correct for selection bias in a quasi-experiment by modeling the selection process using observed covariates. Because logistic regression is well understood by researchers in a variety of fields and easy to implement in a number of popular software packages, it has…
Descriptors: Probability, Scores, Statistical Analysis, Statistical Bias
Peer reviewed
Burt, Keith B.; Obradovic, Jelena – Developmental Review, 2013
The purpose of this paper is to review major statistical and psychometric issues impacting the study of psychophysiological reactivity and discuss their implications for applied developmental researchers. We first cover traditional approaches such as the observed difference score (DS) and the observed residual score (RS), including a review of…
Descriptors: Measurement Techniques, Psychometrics, Data Analysis, Researchers
Peer reviewed
Kline, Rex B. – Educational Research and Evaluation, 2013
Test fairness and test bias are not synonymous concepts. Test bias refers to statistical evidence that the psychometrics or interpretation of test scores depends on group membership, such as gender or race, when such differences are not expected. A test that is grossly biased may be judged to be unfair, but test fairness concerns the broader, more…
Descriptors: Factor Analysis, Social Justice, Psychometrics, Test Bias
Spinella, Sarah – Online Submission, 2011
As result replicability is essential to science and difficult to achieve through external replicability, the present paper notes the insufficiency of null hypothesis statistical significance testing (NHSST) and explains the bootstrap as a plausible alternative, with a heuristic example to illustrate the bootstrap method. The bootstrap relies on…
Descriptors: Sampling, Statistical Inference, Statistical Significance, Error of Measurement
Peer reviewed
Liu, Xiaofeng Steven – Evaluation Review, 2011
Covariate adjustment can increase the precision of estimates by removing unexplained variance from the error in randomized experiments, although chance covariate imbalance tends to counteract the improvement in precision. The author develops an easy measure to examine chance covariate imbalance in randomization by standardizing the average…
Descriptors: Measurement Techniques, Statistical Analysis, Experiments, Research Design
Peer reviewed
Baldwin, Scott A.; Bauer, Daniel J.; Stice, Eric; Rohde, Paul – Psychological Methods, 2011
Partially clustered designs, where clustering occurs in some conditions and not others, are common in psychology, particularly in prevention and intervention trials. This article reports results from a simulation comparing 5 approaches to analyzing partially clustered data, including Type I errors, parameter bias, efficiency, and power. Results…
Descriptors: Multivariate Analysis, Error of Measurement, Statistical Analysis, Statistical Bias
Peer reviewed
Smits-Engelsman, Bouwien C. M.; Niemeijer, Anuschka S.; van Waelvelde, Hilde – Research in Developmental Disabilities: A Multidisciplinary Journal, 2011
Formal testing of 3-year-old children is a new feature in the revised version of the Movement Assessment Battery for Children (Movement ABC-2). Our study evaluated the reliability and explored the clinical applicability of the Movement ABC-2 Test in this young age group. A total of 50 typically developing children were given two trials of the test within a…
Descriptors: Measures (Individuals), Young Children, Psychomotor Skills, Reliability
Peer reviewed
Meyer, J. Patrick; Cash, Anne H.; Mashburn, Andrew – Educational Assessment, 2011
Student-teacher interactions are dynamic relationships that change and evolve over the course of a school year. Measuring classroom quality through observations that focus on these interactions presents challenges when observations are conducted throughout the school year. Variability in observed scores could reflect true changes in the quality of…
Descriptors: Observation, Reliability, Teacher Student Relationship, Error of Measurement
Peer reviewed
Steiner, Peter M.; Cook, Thomas D.; Shadish, William R. – Journal of Educational and Behavioral Statistics, 2011
The effect of unreliability of measurement on propensity score (PS) adjusted treatment effects has not been previously studied. The authors report on a study simulating different degrees of unreliability in the multiple covariates that were used to estimate the PS. The simulation uses the same data as two prior studies. Shadish, Clark, and Steiner…
Descriptors: Statistical Bias, Reliability, Measurement, Scores
Peer reviewed
Roberts, Ros; Johnson, Philip – Curriculum Journal, 2015
Recent school science curriculum developments in many countries emphasise that scientists derive evidence for their claims through different approaches; that such practices are bound up with disciplinary knowledge; and that the quality of data should be appreciated. This position paper presents an understanding of the validity of data as a set of…
Descriptors: Educational Quality, Data, Concept Mapping, Scientific Concepts
Peer reviewed
Dong, Nianbo – American Journal of Evaluation, 2015
Researchers have become increasingly interested in programs' main and interaction effects of two variables (A and B, e.g., two treatment variables or one treatment variable and one moderator) on outcomes. A challenge for estimating main and interaction effects is to eliminate selection bias across A-by-B groups. I introduce Rubin's causal model to…
Descriptors: Probability, Statistical Analysis, Research Design, Causal Models
Peer reviewed
Pereira, Nielsen; Bakhiet, Salaheldin Farah; Gentry, Marcia; Balhmar, Tahani Abdulrahman; Hakami, Sultan Mohammed – Journal of Advanced Academics, 2017
This study examined the psychometric properties and measurement invariance of the Arabic version of "My Class Activities" (MCA), an instrument designed to measure students' perceptions of interest, challenge, choice, and enjoyment in classrooms. Scores of 3,516 Sudanese students in Grades 2 to 8 were used. Confirmatory factor analysis…
Descriptors: Student Attitudes, Factor Analysis, Comparative Analysis, Gifted
Peer reviewed
Rastegar, Behnaz; Safari, Fatemeh – International Journal of Education and Literacy Studies, 2017
Language learners' productive role in teaching and learning processes has recently been the focus of attention. Therefore, this study aimed at investigating the effect of oral vs. written output-based instruction on English as a foreign language (EFL) learners' vocabulary learning with a focus on reflective vs. impulsive learning styles. To this…
Descriptors: Cognitive Style, English (Second Language), Second Language Learning, Foreign Countries
Peer reviewed
Lin, Johnny; Bentler, Peter M. – Multivariate Behavioral Research, 2012
Goodness-of-fit testing in factor analysis is based on the assumption that the test statistic is asymptotically chi-square, but this property may not hold in small samples even when the factors and errors are normally distributed in the population. Robust methods such as Browne's (1984) asymptotically distribution-free method and Satorra-Bentler's…
Descriptors: Factor Analysis, Statistical Analysis, Scaling, Sample Size