Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 7 |
Since 2006 (last 20 years) | 37 |
Descriptor
Evaluation Methods | 64 |
Evaluation Research | 64 |
Statistical Analysis | 64 |
Research Methodology | 19 |
Foreign Countries | 10 |
Qualitative Research | 10 |
Program Evaluation | 9 |
Student Evaluation | 9 |
Higher Education | 8 |
Effect Size | 7 |
Test Validity | 7 |
More ▼ |
Source
Author
Publication Type
Education Level
Elementary Secondary Education | 7 |
Higher Education | 7 |
Postsecondary Education | 5 |
Adult Education | 3 |
Elementary Education | 2 |
Grade 8 | 2 |
Grade 3 | 1 |
Grade 4 | 1 |
Grade 5 | 1 |
Junior High Schools | 1 |
Middle Schools | 1 |
More ▼ |
Audience
Researchers | 4 |
Practitioners | 1 |
Teachers | 1 |
Location
Australia | 2 |
Florida | 2 |
United Kingdom (England) | 2 |
California | 1 |
Germany | 1 |
Greece | 1 |
Kenya | 1 |
Ohio | 1 |
Romania | 1 |
United Kingdom | 1 |
United States | 1 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
Iowa Tests of Basic Skills | 1 |
Raven Progressive Matrices | 1 |
Trends in International… | 1 |
What Works Clearinghouse Rating
Wind, Stefanie A.; Peterson, Meghan E. – Language Testing, 2018
The use of assessments that require rater judgment (i.e., rater-mediated assessments) has become increasingly popular in high-stakes language assessments worldwide. Using a systematic literature review, the purpose of this study is to identify and explore the dominant methods for evaluating rating quality within the context of research on…
Descriptors: Language Tests, Evaluators, Evaluation Methods, Interrater Reliability
Bloom, Howard S.; Spybrook, Jessaca – Journal of Research on Educational Effectiveness, 2017
Multisite trials, which are being used with increasing frequency in education and evaluation research, provide an exciting opportunity for learning about how the effects of interventions or programs are distributed across sites. In particular, these studies can produce rigorous estimates of a cross-site mean effect of program assignment…
Descriptors: Program Effectiveness, Program Evaluation, Sample Size, Evaluation Research
Ogange, Betty Obura; Agak, John O.; Okelo, Kevin Odhiambo; Kiprotich, Peter – Open Praxis, 2018
Assessment is an integral part of the teaching-learning process in both conventional and distance education contexts. Literature suggests that with the increase in the use of Information and Communications Technology in the delivery of learning, a number of institutions are resorting to formative assessment practices that are mediated by…
Descriptors: Formative Evaluation, Online Courses, Student Attitudes, Questionnaires
Westlund, Erik; Stuart, Elizabeth A. – American Journal of Evaluation, 2017
This article discusses the nonuse, misuse, and proper use of pilot studies in experimental evaluation research. The authors first show that there is little theoretical, practical, or empirical guidance available to researchers who seek to incorporate pilot studies into experimental evaluation research designs. The authors then discuss how pilot…
Descriptors: Use Studies, Pilot Projects, Evaluation Research, Experiments
Alderman, Lyn – Quality in Higher Education, 2016
In Australia, a review of the higher education sector is usually triggered by a change in government leadership, followed by the development and implementation of the government's response in the form of a reform package to enact change. The aim of this study was to conduct an independent evaluation of a large-scale national government policy…
Descriptors: Federal Legislation, Educational Legislation, Higher Education, Leadership
Bloom, Howard S.; Raudenbush, Stephen W.; Weiss, Michael J.; Porter, Kristin – Journal of Research on Educational Effectiveness, 2017
The present article considers a fundamental question in evaluation research: "By how much do program effects vary across sites?" The article first presents a theoretical model of cross-site impact variation and a related estimation model with a random treatment coefficient and fixed site-specific intercepts. This approach eliminates…
Descriptors: Evaluation Research, Program Evaluation, Welfare Services, Employment
Debelak, Rudolf; Arendasy, Martin – Educational and Psychological Measurement, 2012
A new approach to identify item clusters fitting the Rasch model is described and evaluated using simulated and real data. The proposed method is based on hierarchical cluster analysis and constructs clusters of items that show a good fit to the Rasch model. It thus gives an estimate of the number of independent scales satisfying the postulates of…
Descriptors: Test Items, Factor Analysis, Evaluation Methods, Simulation
Öztürk-Gübes, Nese; Kelecioglu, Hülya – Educational Sciences: Theory and Practice, 2016
The purpose of this study was to examine the impact of dimensionality, common-item set format, and different scale linking methods on preserving equity property with mixed-format test equating. Item response theory (IRT) true-score equating (TSE) and IRT observed-score equating (OSE) methods were used under common-item nonequivalent groups design.…
Descriptors: Test Format, Item Response Theory, True Scores, Equated Scores
Kim, Eun Sook; Yoon, Myeongsun; Lee, Taehun – Educational and Psychological Measurement, 2012
Multiple-indicators multiple-causes (MIMIC) modeling is often used to test a latent group mean difference while assuming the equivalence of factor loadings and intercepts over groups. However, this study demonstrated that MIMIC was insensitive to the presence of factor loading noninvariance, which implies that factor loading invariance should be…
Descriptors: Test Items, Simulation, Testing, Statistical Analysis
Häkkinen, P. – Journal of Computer Assisted Learning, 2013
Several studies have analysed and assessed online performance and discourse using quantitative and qualitative methods. Quantitative measures have typically included the analysis of participation rates and learning outcomes in terms of grades. Qualitative measures of postings, discussions and context features aim to give insights into the nature…
Descriptors: Computer Mediated Communication, Educational Technology, Technology Uses in Education, Statistical Analysis
Iannone, Paola; Simpson, Adrian – Research in Mathematics Education, 2013
A consistent message emerges from research on undergraduate students' perceptions of assessment which describes traditional assessment as detrimental to learning. However this literature has not included students in the pure sciences. Mathematics education literature advocates the introduction of innovative assessment at university. In this…
Descriptors: Undergraduate Students, Student Attitudes, Mathematics Tests, Alternative Assessment
MacDonald, Craig Matthew – ProQuest LLC, 2012
The concept of usefulness has implicitly played a pivotal role in evaluation research, but the meaning of usefulness has changed over time from system reliability to user performance and learnability/ease of use for non-experts. Despite massive technical and social changes, usability remains the "gold standard" for system evaluation.…
Descriptors: Educational Technology, Laboratory Experiments, Usability, Aesthetics
Liu, Yan; Zumbo, Bruno D. – Educational and Psychological Measurement, 2012
There is a lack of research on the effects of outliers on the decisions about the number of factors to retain in an exploratory factor analysis, especially for outliers arising from unintended and unknowingly included subpopulations. The purpose of the present research was to investigate how outliers from an unintended and unknowingly included…
Descriptors: Factor Analysis, Factor Structure, Evaluation Research, Evaluation Methods
Longford, Nicholas T. – Journal of Educational and Behavioral Statistics, 2012
Statistical modeling of school effectiveness data was originally motivated by the dissatisfaction with the analysis of (school-leaving) examination results that took no account of the background of the students or regarded each school as an isolated unit of analysis. The application of multilevel analysis was generally regarded as a breakthrough,…
Descriptors: School Effectiveness, Data Analysis, Statistical Analysis, Statistical Studies
Haardorfer, Regine; Gagne, Phill – Focus on Autism and Other Developmental Disabilities, 2010
Some researchers have argued for the use of or have attempted to make use of randomization tests in single-subject research. To address this tide of interest, the authors of this article describe randomization tests, discuss the theoretical rationale for applying them to single-subject research, and provide an overview of the methodological…
Descriptors: Research Design, Researchers, Evaluation Methods, Research Methodology