Publication Date
In 2025 | 2 |
Since 2024 | 4 |
Since 2021 (last 5 years) | 11 |
Since 2016 (last 10 years) | 17 |
Since 2006 (last 20 years) | 22 |
Descriptor
Accuracy | 22 |
Evaluation Methods | 22 |
Artificial Intelligence | 4 |
Classification | 4 |
Decision Making | 3 |
Educational Policy | 3 |
Educational Quality | 3 |
Error of Measurement | 3 |
Evidence | 3 |
Foreign Countries | 3 |
Reliability | 3 |
Author
Arnold Y. L. Wong | 1 |
Bolden, Benjamin | 1 |
Briesch, Amy M. | 1 |
Chahna Gonsalves | 1 |
Cheng, Ying | 1 |
Cochran-Smith, Marilyn | 1 |
Cousineau, Denis | 1 |
Curtis C. H. Yu | 1 |
DeLuca, Christopher | 1 |
Dilin Liu | 1 |
Dino Samartzis | 1 |
Publication Type
Reports - Evaluative | 22 |
Journal Articles | 16 |
Information Analyses | 2 |
Speeches/Meeting Papers | 2 |
Opinion Papers | 1 |
Education Level
Higher Education | 4 |
Postsecondary Education | 4 |
Elementary Secondary Education | 2 |
Early Childhood Education | 1 |
Elementary Education | 1 |
Kindergarten | 1 |
Primary Education | 1 |
Secondary Education | 1 |
Audience
Policymakers | 1 |
Location
Connecticut | 1 |
Delaware | 1 |
District of Columbia | 1 |
Florida | 1 |
India | 1 |
New York | 1 |
Pennsylvania | 1 |
Rhode Island | 1 |
Tennessee | 1 |
Texas | 1 |
Turkey | 1 |
Assessments and Surveys
International English… | 1 |
Wechsler Individual… | 1 |
Wechsler Intelligence Scale… | 1 |
Wide Range Achievement Test | 1 |
Woodcock Johnson Tests of… | 1 |
Woodcock Johnson Tests of… | 1 |
Jie Qin; Dilin Liu – Applied Linguistics, 2025
In response to calls for an assessment tool that provides a separate performance dimension from the linguistic quality-oriented measures of complexity, accuracy, and fluency (CAF) and guided by systemic functional linguistic (SFL) theories, this study introduces a set of fine-grained objective measures of communication/content/function…
Descriptors: Second Language Learning, English (Second Language), Language Tests, Linguistic Theory
Wang, Yufeng; Fang, Hui; Jin, Qun; Ma, Jianhua – Interactive Learning Environments, 2022
Peer assessment has become a primary solution to the challenge of evaluating a large number of students in Massive Open Online Courses (MOOCs). In peer assessment, all students need to evaluate a subset of other students' assignments, and then these peer grades are aggregated to predict a final score for each student. Unfortunately, due to the…
Descriptors: Supervision, Peer Evaluation, Student Evaluation, Large Group Instruction
Chahna Gonsalves – Journal of Learning Development in Higher Education, 2025
Generative AI (GenAI) is transforming higher education. It has already challenged the validity of traditional assessment methods and revealed concerns about the authenticity and reliability of conventional approaches. This opinion piece proposes an expanded theoretical framework for contextual learning, incorporating practical, situational,…
Descriptors: Artificial Intelligence, Higher Education, Evaluation Methods, Technology Uses in Education
Tahereh Firoozi; Okan Bulut; Mark J. Gierl – International Journal of Assessment Tools in Education, 2023
The proliferation of large language models represents a paradigm shift in the landscape of automated essay scoring (AES) systems, fundamentally elevating their accuracy and efficacy. This study presents an extensive examination of large language models, with a particular emphasis on the transformative influence of transformer-based models, such as…
Descriptors: Turkish, Writing Evaluation, Essays, Accuracy
Ghosh, Krishnendu – Education and Information Technologies, 2022
The paper presents a method for recommending augmentations against conceptual gaps in textbooks. Question Answer (QA) pairs from community question-answering (cQA) forums are noted to offer precise and comprehensive illustrations of concepts. Our proposed method retrieves QA pairs for a target concept to suggest two types of augmentations: basic…
Descriptors: Foreign Countries, Textbooks, Textbook Content, Discourse Analysis
Jae Q. J. Liu; Kelvin T. K. Hui; Fadi Al Zoubi; Zing Z. X. Zhou; Dino Samartzis; Curtis C. H. Yu; Jeremy R. Chang; Arnold Y. L. Wong – International Journal for Educational Integrity, 2024
The application of artificial intelligence (AI) in academic writing has raised concerns regarding accuracy, ethics, and scientific rigour. Some AI content detectors may not accurately identify AI-generated texts, especially those that have undergone paraphrasing. Therefore, there is a pressing need for efficacious approaches or guidelines to…
Descriptors: Artificial Intelligence, Investigations, Identification, Human Factors Engineering
Kayyali, Mustafa – Online Submission, 2023
University rankings have a growing impact on how people view the academic excellence of higher education. The complicated relationship between rankings and academic excellence is explored in this essay along with how it may affect higher education policy and practice. The importance of rankings and their influence on institutional decision-making…
Descriptors: Correlation, Reputation, Educational Quality, Institutional Characteristics
Vitello, Sylvia; Leech, Tony – Cambridge University Press & Assessment, 2022
In summer 2021, as exams could not take place, GCSE, AS and A level grades in England were awarded by teachers, in accordance with relatively broad official guidance. This guidance stressed that grades had to be based on evidence of candidate work, though what this was, how much was needed or where/when it should come from were not tightly…
Descriptors: Grading, Foreign Countries, Exit Examinations, Secondary Education
Jacob M. Schauer; Kaitlyn G. Fitzgerald; Sarah Peko-Spicer; Mena C. R. Whalen; Rrita Zejnullahi; Larry V. Hedges – Grantee Submission, 2021
Several programs of research have sought to assess the replicability of scientific findings in different fields, including economics and psychology. These programs attempt to replicate several findings and use the results to say something about large-scale patterns of replicability in a field. However, little work has been done to understand the…
Descriptors: Statistical Analysis, Research Methodology, Evaluation Methods, Replication (Evaluation)
Elaine Chapman; Jian Zhao; Peyman G. P. Sabet – Education Research and Perspectives, 2024
Effective assessments guide student learning, refine teaching practices, ensure curriculum alignment, and foster workforce readiness. However, the emergence of generative artificial intelligence (GenAI) tools, such as ChatGPT, has significantly disrupted traditional assessment processes, raising concerns about academic integrity and necessitating…
Descriptors: Artificial Intelligence, Evaluation Methods, Influence of Technology, Integrity
Mbaye, Baba – International Association for Development of the Information Society, 2018
The significant amount of information available on the web has led to difficulties for the learner to find useful information and relevant resources to carry out their training. The recommender systems have achieved significant success in the area of e-commerce, they still have difficulties in formulating relevant recommendations on e-learning…
Descriptors: Information Systems, Electronic Learning, Referral, Information Sources
Cousineau, Denis; Laurencelle, Louis – Educational and Psychological Measurement, 2017
Assessing global interrater agreement is difficult as most published indices are affected by the presence of mixtures of agreements and disagreements. A previously proposed method was shown to be specifically sensitive to global agreement, excluding mixtures, but also negatively biased. Here, we propose two alternatives in an attempt to find what…
Descriptors: Interrater Reliability, Evaluation Methods, Statistical Bias, Accuracy
Volpe, Robert J.; Briesch, Amy M. – School Psychology Review, 2018
The purpose of this commentary is to discuss articles in the special issue focused on improving procedures for universal screening for social-emotional and behavioral problems. Based on work by Hunsley and Mash (2007), we examine factors to consider for those seeking to establish evidence-based behavioral screening practices in school settings.…
Descriptors: Evidence Based Practice, Screening Tests, Behavior Problems, Student Behavior
Cochran-Smith, Marilyn; Reagan, Emilie M. – National Academy of Education, 2021
In 2010, as the result of a congressionally mandated study, the National Research Council (NRC) published the report "Preparing Teachers: Building Evidence for Sound Policy" (NRC, 2010). Reflecting the unprecedented attention to teacher quality that had emerged internationally in response to the exigencies of the "global knowledge…
Descriptors: Best Practices, Teacher Education Programs, Program Evaluation, Knowledge Economy
Flanagan, Dawn P.; Schneider, W. Joel – International Journal of School & Educational Psychology, 2016
When education works, it creates productive, innovative citizens eager to contribute to a well-functioning democracy. In contrast, educational failure has lifelong consequences, with some individuals experiencing decades of preventable hardship. Dawn Flanagan and Joel Schneider write in this response that, like Kranzler, Floyd, Benson, Zabowski,…
Descriptors: Learning Disabilities, Identification, Diagnostic Tests, Criticism