Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 16 |
Since 2006 (last 20 years) | 46 |
Descriptor
Evaluation Methods | 56 |
Statistical Analysis | 56 |
Classification | 55 |
Foreign Countries | 16 |
Models | 14 |
Comparative Analysis | 13 |
Research Methodology | 9 |
Data Analysis | 7 |
Scores | 7 |
Educational Research | 6 |
Item Response Theory | 6 |
More ▼ |
Source
Author
Ackerman, Matthew | 1 |
Ali, Md Ramjan | 1 |
Andrich, David | 1 |
Bae, Sungwon | 1 |
Bahreini, Kiavash | 1 |
Barnes, Tiffany, Ed. | 1 |
Battaglia, Onofrio Rosario | 1 |
Beguin, A. A. | 1 |
Bellmore, Amy D. | 1 |
Bender, M. Lionel | 1 |
Berk, Ronald A. | 1 |
More ▼ |
Publication Type
Education Level
Higher Education | 10 |
Postsecondary Education | 9 |
Secondary Education | 7 |
High Schools | 4 |
Elementary Secondary Education | 3 |
Junior High Schools | 3 |
Middle Schools | 3 |
Elementary Education | 1 |
Grade 10 | 1 |
Grade 6 | 1 |
Grade 8 | 1 |
More ▼ |
Audience
Location
Netherlands | 3 |
United States | 2 |
Afghanistan | 1 |
Africa | 1 |
Bangladesh | 1 |
California (Stanford) | 1 |
Canada | 1 |
Canada (Montreal) | 1 |
China | 1 |
Finland | 1 |
Florida | 1 |
More ▼ |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
Child Behavior Checklist | 1 |
Florida Comprehensive… | 1 |
Program for International… | 1 |
What Works Clearinghouse Rating
Bonifay, Wes; Depaoli, Sarah – Prevention Science, 2023
Statistical analysis of categorical data often relies on multiway contingency tables; yet, as the number of categories and/or variables increases, the number of table cells with few (or zero) observations also increases. Unfortunately, sparse contingency tables invalidate the use of standard goodness-of-fit statistics. Limited-information fit…
Descriptors: Bayesian Statistics, Programming Languages, Psychopathology, Classification
Cousineau, Denis; Laurencelle, Louis – Educational and Psychological Measurement, 2017
Assessing global interrater agreement is difficult as most published indices are affected by the presence of mixtures of agreements and disagreements. A previously proposed method was shown to be specifically sensitive to global agreement, excluding mixtures, but also negatively biased. Here, we propose two alternatives in an attempt to find what…
Descriptors: Interrater Reliability, Evaluation Methods, Statistical Bias, Accuracy
Rajagopal, Prabha; Ravana, Sri Devi – Information Research: An International Electronic Journal, 2017
Introduction: The use of averaged topic-level scores can result in the loss of valuable data and can cause misinterpretation of the effectiveness of system performance. This study aims to use the scores of each document to evaluate document retrieval systems in a pairwise system evaluation. Method: The chosen evaluation metrics are document-level…
Descriptors: Information Retrieval, Documentation, Scores, Information Systems
Lamprianou, Iasonas – Educational and Psychological Measurement, 2018
It is common practice for assessment programs to organize qualifying sessions during which the raters (often known as "markers" or "judges") demonstrate their consistency before operational rating commences. Because of the high-stakes nature of many rating activities, the research community tends to continuously explore new…
Descriptors: Social Networks, Network Analysis, Comparative Analysis, Innovation
Sérandour, Guillaume; Illanes, Alfredo; Maturana, Jorge; Cádiz, Janet – Assessment & Evaluation in Higher Education, 2016
Assessment is a notorious source of preoccupation for faculty and university governing bodies, especially when an institution initiates curricular reforms which shift the programme learning outcomes for knowledge to competencies. One obstacle to acceptance arises from a culture of quantitative assessment (often represented by a single mark), which…
Descriptors: College Outcomes Assessment, Competency Based Education, Arithmetic, Statistical Analysis
Huang, Zuqing; Qiu, Robin G. – Quality in Higher Education, 2016
University ranking or higher education assessment in general has been attracting more and more public attention over the years. However, the subjectivity-based evaluation index and indicator selections and weights that are widely adopted in most existing ranking systems have been called into question. In other words, the objectivity and…
Descriptors: Higher Education, Educational Quality, Liberal Arts, Reputation
Non-Hierarchical Clustering as a Method to Analyse an Open-Ended Questionnaire on Algebraic Thinking
Di Paola, Benedetto; Battaglia, Onofrio Rosario; Fazio, Claudio – South African Journal of Education, 2016
The problem of taking a data set and separating it into subgroups, where the members of each subgroup are more similar to each other than they are to members outside the subgroup, has been extensively studied in science and mathematics education research. Student responses to written questions and multiple-choice tests have been characterised and…
Descriptors: Foreign Countries, Grade 10, Questionnaires, Algebra
Washington-Ottombre, Camille; Bigalke, Siiri – International Journal of Sustainability in Higher Education, 2018
Purpose: This paper aims to compose a systematic understanding of campus sustainability innovations and unpack the complex drivers behind the elaboration of specific innovations. More precisely, the authors ask two fundamental questions: What are the topics and modes of implementation of campus sustainability innovations? What are the external and…
Descriptors: Higher Education, Educational Innovation, Sustainability, Program Implementation
Zwaal, Wichard; Otting, Hans – Journal of Problem Based Learning in Higher Education, 2016
The study focuses on the seven-step procedure (SSP) in problem-based learning (PBL). The way students apply the seven-step procedure will help us understand how students work in a problem-based learning curriculum. So far, little is known about how students rate the performance and importance of the different steps, the amount of time they spend…
Descriptors: Management Development, Hospitality Occupations, Problem Based Learning, Teaching Methods
Kazanidis, Ioannis; Theodosiou, Theodosios; Petasakis, Ioannis; Valsamidis, Stavros – Interactive Learning Environments, 2016
Database files and additional log files of Learning Management Systems (LMSs) contain an enormous volume of data which usually remain unexploited. A new methodology is proposed in order to analyse these data both on the level of both the courses and the learners. Specifically, "regression analysis" is proposed as a first step in the…
Descriptors: Foreign Countries, Online Courses, Course Evaluation, Electronic Learning
Gogia, Laura Park – ProQuest LLC, 2016
Virginia Commonwealth University (VCU) is implementing a large scale exploration of digital pedagogies, including connected learning and open education, in an effort to promote digital fluency and integrative thinking among students. The purpose of this study was to develop a classroom assessment toolkit for faculty who wish to document student…
Descriptors: Educational Technology, Technology Uses in Education, Technological Literacy, Evaluation Methods
Fuller, Matthew B.; Skidmore, Susan T.; Bustamante, Rebecca M.; Holzweiss, Peggy C. – Review of Higher Education, 2016
Although touted as beneficial to student learning, cultures of assessment have not been examined adequately using validated instruments. Using data collected from a stratified, random sample (N = 370) of U.S. institutional research and assessment directors, the models tested in this study provide empirical support for the value of using the…
Descriptors: Higher Education, Administrators, Evaluation Methods, Attitude Measures
Andrich, David – Educational and Psychological Measurement, 2013
Assessments in response formats with ordered categories are ubiquitous in the social and health sciences. Although the assumption that the ordering of the categories is working as intended is central to any interpretation that arises from such assessments, testing that this assumption is valid is not standard in psychometrics. This is surprising…
Descriptors: Item Response Theory, Classification, Statistical Analysis, Models
Martínez, José Felipe; Schweig, Jonathan; Goldschmidt, Pete – Educational Evaluation and Policy Analysis, 2016
A key question facing teacher evaluation systems is how to combine multiple measures of complex constructs into composite indicators of performance. We use data from the Measures of Effective Teaching (MET) study to investigate the measurement properties of composite indicators obtained under various conjunctive, disjunctive (or complementary),…
Descriptors: Teacher Evaluation, Outcome Measures, Evaluation Methods, Educational Policy
González-Brenes, José P.; Huang, Yun – International Educational Data Mining Society, 2015
Classification evaluation metrics are often used to evaluate adaptive tutoring systems-- programs that teach and adapt to humans. Unfortunately, it is not clear how intuitive these metrics are for practitioners with little machine learning background. Moreover, our experiments suggest that existing convention for evaluating tutoring systems may…
Descriptors: Intelligent Tutoring Systems, Evaluation Methods, Program Evaluation, Student Behavior