Showing 1,066 to 1,080 of 3,295 results
Peer reviewed
Lindstromberg, Seth – Language Teaching Research, 2016
This article reviews all (quasi)experimental studies appearing in the first 19 volumes (1997-2015) of "Language Teaching Research" (LTR). Specifically, it provides an overview of how statistical analyses were conducted in these studies and of how the analyses were reported. The overall conclusion is that there has been a tight adherence…
Descriptors: Meta Analysis, Second Language Learning, Second Language Instruction, Guidelines
Peer reviewed
Davin, Kristin J. – Modern Language Journal, 2016
This article explores the implementation of dynamic assessment (DA) in an elementary school foreign language classroom by considering its theoretical basis and its applicability to second language (L2) teaching, learning, and development. In existing applications of L2 classroom DA, errors serve as a window into learners' instructional needs and…
Descriptors: Alternative Assessment, Elementary School Students, Second Language Learning, Second Language Instruction
Peer reviewed
Zapata-Rivera, Diego; Zwick, Rebecca; Vezzu, Margaret – Educational Assessment, 2016
The goal of this study was to explore the effectiveness of a short web-based tutorial in helping teachers to better understand the portrayal of measurement error in test score reports. The short video tutorial included both verbal and graphical representations of measurement error. Results showed a significant difference in comprehension scores…
Descriptors: Error of Measurement, Tutorial Programs, Instructional Effectiveness, Web Based Instruction
Peer reviewed
Pampaka, Maria; Hutcheson, Graeme; Williams, Julian – International Journal of Research & Method in Education, 2016
Missing data is endemic in much educational research. However, practices such as step-wise regression common in the educational research literature have been shown to be dangerous when significant data are missing, and multiple imputation (MI) is generally recommended by statisticians. In this paper, we provide a review of these advances and their…
Descriptors: Data Analysis, Statistical Inference, Error of Measurement, Computation
Peer reviewed
Reeger, Adam; Gaasedelen, Owen; Welch, Catherine; Dunbar, Stephen – AERA Online Paper Repository, 2016
Student Growth Percentiles (SGPs) are increasingly being used in evaluations of teacher effectiveness. This study investigates two properties of SGPs: 1) SGP sensitivity to reference group characteristics such as sample size, free and reduced lunch (FRL) status, and English language learner (ELL) status; and 2) variation in score changes across…
Descriptors: Teacher Effectiveness, Teacher Evaluation, Accountability, Sample Size
Peer reviewed
Seo, Dong Gi; Weiss, David J. – Educational and Psychological Measurement, 2015
Most computerized adaptive tests (CATs) have been studied using the framework of unidimensional item response theory. However, many psychological variables are multidimensional and might benefit from using a multidimensional approach to CATs. This study investigated the accuracy, fidelity, and efficiency of a fully multidimensional CAT algorithm…
Descriptors: Computer Assisted Testing, Adaptive Testing, Accuracy, Fidelity
Peer reviewed
Methe, Scott A.; Briesch, Amy M.; Hulac, David – Assessment for Effective Intervention, 2015
At present, it is unclear whether math curriculum-based measurement (M-CBM) procedures provide a dependable measure of student progress in math computation because support for its technical properties is based largely upon a body of correlational research. Recent investigations into the dependability of M-CBM scores have found that evaluating…
Descriptors: Measurement Techniques, Error of Measurement, Mathematics Curriculum, Curriculum Based Assessment
Peer reviewed
Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2015
Person-fit assessment may help the researcher to obtain additional information regarding the answering behavior of persons. Although several researchers examined person fit, there is a lack of research on person-fit assessment for mixed-format tests. In this article, the lz statistic and the χ² statistic, both of which have been used for tests…
Descriptors: Test Format, Goodness of Fit, Item Response Theory, Bayesian Statistics
Peer reviewed
Rosen, Brittany N.; Lee, Brian K.; Lee, Nora L.; Yang, Yunwen; Burstyn, Igor – Journal of Autism and Developmental Disorders, 2015
We conducted a meta-analysis of 15 studies on maternal prenatal smoking and ASD risk in offspring. Using a random-effects model, we found no evidence of an association (summary OR 1.02, 95% CI 0.93-1.12). Stratifying by study design, birth year, type of healthcare system, and adjustment for socioeconomic status or psychiatric history did not alter…
Descriptors: Smoking, Mothers, Prenatal Influences, Pervasive Developmental Disorders
Cho, Sun-Joo; Preacher, Kristopher J.; Bottge, Brian A. – Grantee Submission, 2015
Multilevel modeling (MLM) is frequently used to detect group differences, such as an intervention effect in a pre-test–post-test cluster-randomized design. Group differences on the post-test scores are detected by controlling for pre-test scores as a proxy variable for unobserved factors that predict future attributes. The pre-test and post-test…
Descriptors: Structural Equation Models, Hierarchical Linear Modeling, Intervention, Program Effectiveness
Peer reviewed
Veroniki, Areti Angeliki; Pavlides, Marios; Patsopoulos, Nikolaos A.; Salanti, Georgia – Research Synthesis Methods, 2013
A problem that is frequently encountered during the systematic review process is when studies that meet the inclusion criteria do not provide the appropriate numerical estimates to include in a meta-analysis. For dichotomous outcomes, a method has been suggested by Di Pietrantonj for reconstructing the 2 × 2 table when the Odds Ratio…
Descriptors: Meta Analysis, Tables (Data), Statistical Analysis, Error of Measurement
Peer reviewed
Gómez-Benito, Juana; Hidalgo, Maria Dolores; Zumbo, Bruno D. – Educational and Psychological Measurement, 2013
The objective of this article was to find an optimal decision rule for identifying polytomous items with large or moderate amounts of differential functioning. The effectiveness of combining statistical tests with effect size measures was assessed using logistic discriminant function analysis and two effect size measures: R[superscript 2] and…
Descriptors: Item Analysis, Test Items, Effect Size, Statistical Analysis
Peer reviewed
de la Torre, Jimmy; Lee, Young-Sun – Journal of Educational Measurement, 2013
This article used the Wald test to evaluate the item-level fit of a saturated cognitive diagnosis model (CDM) relative to the fits of the reduced models it subsumes. A simulation study was carried out to examine the Type I error and power of the Wald test in the context of the G-DINA model. Results show that when the sample size is small and a…
Descriptors: Statistical Analysis, Test Items, Goodness of Fit, Error of Measurement
Peer reviewed
Kim, Jihye; Oshima, T. C. – Educational and Psychological Measurement, 2013
In a typical differential item functioning (DIF) analysis, a significance test is conducted for each item. As a test consists of multiple items, such multiple testing may increase the possibility of making a Type I error at least once. The goal of this study was to investigate how to control a Type I error rate and power using adjustment…
Descriptors: Test Bias, Test Items, Statistical Analysis, Error of Measurement
Peer reviewed
Chen, Chia-ling; Shen, I-hsuan; Chen, Chung-yao; Wu, Ching-yi; Liu, Wen-Yu; Chung, Chia-ying – Research in Developmental Disabilities: A Multidisciplinary Journal, 2013
This study examined criterion-related validity and clinimetric properties of the pediatric balance scale ("PBS") in children with cerebral palsy (CP). Forty-five children with CP (age range: 19-77 months) and their parents participated in this study. At baseline and at follow up, Pearson correlation coefficients were used to determine…
Descriptors: Measurement, Measures (Individuals), Correlation, Cerebral Palsy