NotesFAQContact Us
Collection
Advanced
Search Tips
Audience
Researchers2
Laws, Policies, & Programs
Elementary and Secondary…2
What Works Clearinghouse Rating
Showing 1 to 15 of 58 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Schamberger, Tamara; Schuberth, Florian; Henseler, Jörg – International Journal of Behavioral Development, 2023
Research in human development often relies on composites, that is, composed variables such as indices. Their composite nature renders these variables inaccessible to conventional factor-centric psychometric validation techniques such as confirmatory factor analysis (CFA). In the context of human development research, there is currently no…
Descriptors: Individual Development, Factor Analysis, Statistical Analysis, Structural Equation Models
Bonifay, Wes – Grantee Submission, 2022
Traditional statistical model evaluation typically relies on goodness-of-fit testing and quantifying model complexity by counting parameters. Both of these practices may result in overfitting and have thereby contributed to the generalizability crisis. The information-theoretic principle of minimum description length addresses both of these…
Descriptors: Statistical Analysis, Models, Goodness of Fit, Evaluation Methods
Peer reviewed Peer reviewed
Direct linkDirect link
Bonifay, Wes; Depaoli, Sarah – Prevention Science, 2023
Statistical analysis of categorical data often relies on multiway contingency tables; yet, as the number of categories and/or variables increases, the number of table cells with few (or zero) observations also increases. Unfortunately, sparse contingency tables invalidate the use of standard goodness-of-fit statistics. Limited-information fit…
Descriptors: Bayesian Statistics, Programming Languages, Psychopathology, Classification
Peer reviewed Peer reviewed
Direct linkDirect link
Haberman, Shelby J. – Journal of Educational Measurement, 2020
Examples of the impact of statistical theory on assessment practice are provided from the perspective of a statistician trained in theoretical statistics who began to work on assessments. Goodness of fit of item-response models is examined in terms of restricted likelihood-ratio tests and generalized residuals. Minimum discriminant information…
Descriptors: Statistics, Goodness of Fit, Item Response Theory, Statistical Analysis
Peer reviewed Peer reviewed
Direct linkDirect link
Kovalchik, Stephanie A.; Martino, Steven C.; Collins, Rebecca L.; Shadel, William G.; D'Amico, Elizabeth J.; Becker, Kirsten – Journal of Educational and Behavioral Statistics, 2018
Ecological momentary assessment (EMA) is a popular assessment method in psychology that aims to capture events, emotions, and cognitions in real time, usually repeatedly throughout the day. Because EMA typically involves more intensive monitoring than traditional assessment methods, missing data are commonly an issue and this missingness may bias…
Descriptors: Probability, Statistical Bias, Holistic Approach, Evaluation Methods
Peer reviewed Peer reviewed
Direct linkDirect link
Lamprianou, Iasonas – Educational and Psychological Measurement, 2018
It is common practice for assessment programs to organize qualifying sessions during which the raters (often known as "markers" or "judges") demonstrate their consistency before operational rating commences. Because of the high-stakes nature of many rating activities, the research community tends to continuously explore new…
Descriptors: Social Networks, Network Analysis, Comparative Analysis, Innovation
Peer reviewed Peer reviewed
Direct linkDirect link
You, Hye Sun – Journal of Science Teacher Education, 2016
Growing evidence from recent curriculum documents and previous research suggests that reform-oriented science teaching practices promote students' conceptual understanding, levels of achievement, and motivation to learn, especially when students are actively engaged in constructing their ideas through scientific inquiries. However, it is difficult…
Descriptors: Educational Change, Science Instruction, Psychometrics, Teaching Methods
Peer reviewed Peer reviewed
Direct linkDirect link
Heemsoth, Tim; Retelsdorf, Jan – Measurement in Physical Education and Exercise Science, 2018
Educational research emphasizes the advantages of multimethod designs. However, if the design comprises different perspectives, the question of construct validity emerges. We related this question to student and teacher ratings of student-student relations, which are of high interest in research on physical education. In our study, 2,160 students…
Descriptors: Educational Research, Factor Analysis, Peer Relationship, Physical Education Teachers
Peer reviewed Peer reviewed
Direct linkDirect link
Todd, Amber; Romine, William L.; Cook Whitt, Katahdin – Science Education, 2017
We describe the development, validation, and use of the "Learning Progression-Based Assessment of Modern Genetics" (LPA-MG) in a high school biology context. Items were constructed based on a current learning progression framework for genetics (Shea & Duncan, 2013; Todd & Kenyon, 2015). The 34-item instrument, which was tied to…
Descriptors: Genetics, Science Instruction, High School Students, Evaluation Methods
Peer reviewed Peer reviewed
Direct linkDirect link
Köse, Alper – Educational Research and Reviews, 2014
The primary objective of this study was to examine the effect of missing data on goodness of fit statistics in confirmatory factor analysis (CFA). For this aim, four missing data handling methods; listwise deletion, full information maximum likelihood, regression imputation and expectation maximization (EM) imputation were examined in terms of…
Descriptors: Data Analysis, Data Collection, Statistical Analysis, Evaluation Methods
Peer reviewed Peer reviewed
Direct linkDirect link
Onchiri, Sureiman – Educational Research and Reviews, 2013
Whenever you think you have an idea of how something works, you have a mental model. That is, in effect, a layman's way of talking about having an hypothesis. The hypothesis needs to be tested for how closely it fits reality--and reality is the data collected from an experiment. So the data is collected on the few and compared with a few…
Descriptors: Statistical Analysis, Goodness of Fit, Data Analysis, Statistical Distributions
Peer reviewed Peer reviewed
Direct linkDirect link
Fuller, Matthew B.; Skidmore, Susan T.; Bustamante, Rebecca M.; Holzweiss, Peggy C. – Review of Higher Education, 2016
Although touted as beneficial to student learning, cultures of assessment have not been examined adequately using validated instruments. Using data collected from a stratified, random sample (N = 370) of U.S. institutional research and assessment directors, the models tested in this study provide empirical support for the value of using the…
Descriptors: Higher Education, Administrators, Evaluation Methods, Attitude Measures
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Park, Siwon – Journal of Pan-Pacific Association of Applied Linguistics, 2017
This paper examines how different test methods may tap different aspects of second language knowledge. It employs multiple-choice (MC) and constructed response (CR) items which yield distinct or convergent information in the computer delivered testing of English in its presentation of this factor. In order to examine the effects of test method, a…
Descriptors: Evaluation Methods, Second Language Learning, English (Second Language), Computer Assisted Testing
Peer reviewed Peer reviewed
Direct linkDirect link
Hoelscher, Michael – Research in Comparative and International Education, 2017
This article argues that strong interrelations between methodological and theoretical advances exist. Progress in, especially comparative, methods may have important impacts on theory evaluation. By using the example of the "Varieties of Capitalism" approach and an international comparison of higher education systems, it can be shown…
Descriptors: Higher Education, Comparative Education, Research Methodology, Cross Cultural Studies
Peer reviewed Peer reviewed
Direct linkDirect link
Gordon, Rachel A.; Hofer, Kerry G.; Fujimoto, Ken A.; Risk, Nicole; Kaestner, Robert; Korenman, Sanders – Early Education and Development, 2015
Research Findings: The Early Childhood Environment Rating Scale-Revised (ECERS-R) is widely used, often to evaluate whether preschool programs are of sufficient quality to improve children's school readiness. We examined the validity of the measure for this purpose. Item response theory (IRT) analyses revealed that many items did not fit together…
Descriptors: Educational Quality, Preschool Education, Item Response Theory, School Readiness
Previous Page | Next Page »
Pages: 1  |  2  |  3  |  4