A. R. Georgeson – Structural Equation Modeling: A Multidisciplinary Journal, 2025
There is increasing interest in using factor scores in structural equation models and there have been numerous methodological papers on the topic. Nevertheless, sum scores, which are computed from adding up item responses, continue to be ubiquitous in practice. It is therefore important to compare simulation results involving factor scores to…
Descriptors: Structural Equation Models, Scores, Factor Analysis, Statistical Bias
Raykov, Tenko; Anthony, James C.; Menold, Natalja – Educational and Psychological Measurement, 2023
The population relationship between coefficient alpha and scale reliability is studied in the widely used setting of unidimensional multicomponent measuring instruments. It is demonstrated that for any set of component loadings on the common factor, regardless of the extent of their inequality, the discrepancy between alpha and reliability can be…
Descriptors: Correlation, Evaluation Research, Reliability, Measurement Techniques
Li, Hongli; Hunter, Charles Vincent; Bialo, Jacquelyn Anne – Language Assessment Quarterly, 2022
The purpose of this study is to review the status of differential item functioning (DIF) research in language testing, particularly as it relates to the investigation of sources (or causes) of DIF, which is a defining characteristic of the third generation DIF. This review included 110 DIF studies of language tests dated from 1985 to 2019. We…
Descriptors: Test Bias, Language Tests, Statistical Analysis, Evaluation Research
Wind, Stefanie A.; Peterson, Meghan E. – Language Testing, 2018
The use of assessments that require rater judgment (i.e., rater-mediated assessments) has become increasingly popular in high-stakes language assessments worldwide. Using a systematic literature review, the purpose of this study is to identify and explore the dominant methods for evaluating rating quality within the context of research on…
Descriptors: Language Tests, Evaluators, Evaluation Methods, Interrater Reliability
Rahman, Md Shidur – Journal of Education and Learning, 2017
Researchers in various disciplines often use qualitative and quantitative research methods and approaches in their studies. Some of these researchers like to be known as qualitative researchers; others like to be regarded as quantitative researchers. The researchers are thus sharply polarised, and they engage in a competition of pointing…
Descriptors: Qualitative Research, Statistical Analysis, Literature Reviews, Research Methodology
Hosp, John L.; Ford, Jeremy W.; Huddle, Sally M.; Hensley, Kiersten K. – Assessment for Effective Intervention, 2018
Replication is a foundation of the development of a knowledge base in an evidence-based field such as education. This study includes two direct replications of Hosp, Hensley, Huddle, and Ford which found evidence of criterion-related validity of curriculum-based measurement (CBM) for reading and mathematics with postsecondary students with…
Descriptors: Replication (Evaluation), Evaluation Research, Curriculum Based Assessment, Developmental Disabilities
Bloom, Howard S.; Spybrook, Jessaca – Journal of Research on Educational Effectiveness, 2017
Multisite trials, which are being used with increasing frequency in education and evaluation research, provide an exciting opportunity for learning about how the effects of interventions or programs are distributed across sites. In particular, these studies can produce rigorous estimates of a cross-site mean effect of program assignment…
Descriptors: Program Effectiveness, Program Evaluation, Sample Size, Evaluation Research
Ogange, Betty Obura; Agak, John O.; Okelo, Kevin Odhiambo; Kiprotich, Peter – Open Praxis, 2018
Assessment is an integral part of the teaching-learning process in both conventional and distance education contexts. Literature suggests that with the increase in the use of Information and Communications Technology in the delivery of learning, a number of institutions are resorting to formative assessment practices that are mediated by…
Descriptors: Formative Evaluation, Online Courses, Student Attitudes, Questionnaires
Andrews, Martin; Brown, Rachael; Mesher, Lynne – Practitioner Research in Higher Education, 2018
Within the Higher Education sector in the UK, it is acknowledged that the area of 'Assessment and Feedback' receives consistently poor levels of satisfaction from students when they complete module level feedback, course level feedback and the National Student Survey (NSS). There is evidence to suggest that this problem is pronounced within…
Descriptors: Feedback (Response), Student Evaluation, Learner Engagement, Case Studies
Bergsmann, Evelyn; Klug, Julia; Burger, Christoph; Först, Nora; Spiel, Christiane – Assessment & Evaluation in Higher Education, 2018
There is a lively discussion on how to evaluate competence-based higher education in both evaluation and competence research. The instruments used are often limited to course evaluation or specific competences, taking a rather narrow perspective. Furthermore, the instruments often comprise predetermined competences that cannot be adapted to higher…
Descriptors: Questionnaires, Minimum Competency Testing, Screening Tests, Higher Education
Munter, Charles; Cobb, Paul; Shekell, Calli – American Journal of Evaluation, 2016
We examined the extent to which mathematics program evaluations that have been conducted according to methodologically rigorous standards have attended to the theories underlying the programs being evaluated. Our analysis focused on the 37 reports of K-12 mathematics program evaluations in the last two decades that have met standards for inclusion…
Descriptors: Evaluation Research, Clearinghouses, Standards, Mathematics Education
Garvey, Jason C. – Journal of College Student Development, 2017
The purpose of this article is to clarify the discrepancy in the use of "queer" as a sexual identity classification in education survey research. This study extends the work completed by Dugan and Yurman (2011), who empirically demonstrated problems with treating LGB students as a homogenous population through collapsing all respondents…
Descriptors: Classification, Educational Research, Sexual Identity, Subcultures
Westlund, Erik; Stuart, Elizabeth A. – American Journal of Evaluation, 2017
This article discusses the nonuse, misuse, and proper use of pilot studies in experimental evaluation research. The authors first show that there is little theoretical, practical, or empirical guidance available to researchers who seek to incorporate pilot studies into experimental evaluation research designs. The authors then discuss how pilot…
Descriptors: Use Studies, Pilot Projects, Evaluation Research, Experiments
Alderman, Lyn – Quality in Higher Education, 2016
In Australia, a review of the higher education sector is usually triggered by a change in government leadership, followed by the development and implementation of the government's response in the form of a reform package to enact change. The aim of this study was to conduct an independent evaluation of a large-scale national government policy…
Descriptors: Federal Legislation, Educational Legislation, Higher Education, Leadership
Bloom, Howard S.; Raudenbush, Stephen W.; Weiss, Michael J.; Porter, Kristin – Journal of Research on Educational Effectiveness, 2017
The present article considers a fundamental question in evaluation research: "By how much do program effects vary across sites?" The article first presents a theoretical model of cross-site impact variation and a related estimation model with a random treatment coefficient and fixed site-specific intercepts. This approach eliminates…
Descriptors: Evaluation Research, Program Evaluation, Welfare Services, Employment