ERIC - Search Results

Showing 1 to 15 of 129 results

Accuracy and Reliability of Large Language Models in Assessing Learning Outcomes Achievement across Cognitive Domains

Peer reviewed

Swapna Haresh Teckwani; Amanda Huee-Ping Wong; Nathasha Vihangi Luke; Ivan Cherh Chiet Low – Advances in Physiology Education, 2024

The advent of artificial intelligence (AI), particularly large language models (LLMs) like ChatGPT and Gemini, has significantly impacted the educational landscape, offering unique opportunities for learning and assessment. In the realm of written assessment grading, traditionally viewed as a laborious and subjective process, this study sought to…

Descriptors: Accuracy, Reliability, Computational Linguistics, Standards

Interpretable Cognitive State Prediction via Temporal Fuzzy Cognitive Map

Peer reviewed

Yuang Wei; Bo Jiang – IEEE Transactions on Learning Technologies, 2024

Understanding student cognitive states is essential for assessing human learning. The deep neural networks (DNN)-inspired cognitive state prediction method improved prediction performance significantly; however, the lack of explainability with DNNs and the unitary scoring approach fail to reveal the factors influencing human learning. Identifying…

Descriptors: Cognitive Mapping, Models, Prediction, Short Term Memory

A Comparison of Latent Semantic Analysis and Latent Dirichlet Allocation in Educational Measurement

Peer reviewed

Jordan M. Wheeler; Allan S. Cohen; Shiyu Wang – Journal of Educational and Behavioral Statistics, 2024

Topic models are mathematical and statistical models used to analyze textual data. The objective of topic models is to gain information about the latent semantic space of a set of related textual data. The semantic space of a set of textual data contains the relationship between documents and words and how they are used. Topic models are becoming…

Descriptors: Semantics, Educational Assessment, Evaluators, Reliability

Rater Connections and the Detection of Bias in Performance Assessment

Peer reviewed

Wind, Stefanie A. – Measurement: Interdisciplinary Research and Perspectives, 2022

In many performance assessments, one or two raters from the complete rater pool score each performance, resulting in a sparse rating design, where there are limited observations of each rater relative to the complete sample of students. Although sparse rating designs can be constructed to facilitate estimation of student achievement, the…

Descriptors: Evaluators, Bias, Identification, Performance Based Assessment

Comparison of Classical Test Theory vs. Multi-Facet Rasch Theory

Peer reviewed

Polat, Murat; Turhan, Nihan S.; Toraman, Cetin – Pegem Journal of Education and Instruction, 2022

Testing English writing skills could be multi-dimensional; thus, the study aimed to compare students' writing scores calculated according to Classical Test Theory (CTT) and Multi-Facet Rasch Model (MFRM). The research was carried out in 2019 with 100 university students studying at a foreign language preparatory class and four experienced…

Descriptors: Comparative Analysis, Test Theory, Item Response Theory, Student Evaluation

The Correlation between Motivation and Achievement: Goals in an AP Classroom

Clarice A. Calhoun – Online Submission, 2024

The present study investigated the correlation between achievement and motivation in high school advanced placement students. This study looked into the gap of how much motivation an AP student needs to reach achievement because of increased student involvement in an AP classroom. This study analyzes this correlation with a qualitative interview…

Descriptors: Correlation, Academic Achievement, Advanced Placement, Honors Curriculum

Meta-Analysis of Inter-Rater Agreement and Discrepancy Between Human and Automated English Essay Scoring

Peer reviewed

Jiyeo Yun – English Teaching, 2023

Studies on automatic scoring systems in writing assessments have also evaluated the relationship between human and machine scores for the reliability of automated essay scoring systems. This study investigated the magnitudes of indices for inter-rater agreement and discrepancy, especially regarding human and machine scoring, in writing assessment.…

Descriptors: Meta Analysis, Interrater Reliability, Essays, Scoring

A Comparison of Student and Research-Based Evaluations of Explanation Quality in an Introductory Physics Course for Engineers

Joe Olsen – ProQuest LLC, 2023

Instructional explanations are a ubiquitous component of classroom instruction, but are relatively neglected in science education when compared to other facets of teaching and learning. The ubiquity of instructional explanations and their potential to stimulate learning in students suggests that they should garner more attention from science…

Descriptors: Physics, Comparative Analysis, Student Attitudes, Educational Quality

Reliability and Stability of the Metrical Stress Effect on Segmental Production Accuracy in Persons with Apraxia of Speech

Peer reviewed

Bailey, Dallin J.; Bunker, Lisa; Mauszycki, Shannon; Wambaugh, Julie L. – International Journal of Language & Communication Disorders, 2019

Background: Acquired apraxia of speech (AOS) involves speech-production deficits on both the segmental and suprasegmental levels. Recent research has identified a non-linear interaction between the metrical structure of bisyllabic words and word-production accuracy in German speakers with AOS, with trochaic words (strong-weak stress) being…

Descriptors: Accuracy, Suprasegmentals, Phonology, German

Binding Costs in Processing Efficiency as Determinants of Cognitive Ability

Peer reviewed

Goecke, Benjamin; Schmitz, Florian; Wilhelm, Oliver – Journal of Intelligence, 2021

Performance in elementary cognitive tasks is moderately correlated with fluid intelligence and working memory capacity. These correlations are higher for more complex tasks, presumably due to increased demands on working memory capacity. In accordance with the binding hypothesis, which states that working memory capacity reflects the limit of a…

Descriptors: Intelligence, Cognitive Processes, Short Term Memory, Reaction Time

Students' Use of Formalisations for Improved Logical Reasoning

Peer reviewed

Bronkhorst, Hugo; Roorda, Gerrit; Suhre, Cor; Goedhart, Martin – Research in Mathematics Education, 2022

Logical reasoning as part of critical thinking is becoming more and more important to prepare students for their future life in society, work, and study. This article presents the results of a quasi-experimental study with a pre-test-post-test control group design focusing on the effective use of formalisations to support logical reasoning. The…

Descriptors: Mathematics Instruction, Teaching Methods, Logical Thinking, Critical Thinking

Evidence-Based Decision about Test Scoring Rules in Clinical Anatomy Multiple-Choice Examinations

Peer reviewed

Severo, Milton; Gaio, A. Rita; Povo, Ana; Silva-Pereira, Fernanda; Ferreira, Maria Amélia – Anatomical Sciences Education, 2015

In theory the formula scoring methods increase the reliability of multiple-choice tests in comparison with number-right scoring. This study aimed to evaluate the impact of the formula scoring method in clinical anatomy multiple-choice examinations, and to compare it with that from the number-right scoring method, hoping to achieve an…

Descriptors: Anatomy, Multiple Choice Tests, Scoring, Decision Making

Validation of an Automated Procedure for Calculating Core Lexicon from Transcripts

Peer reviewed

Dalton, Sarah Grace; Stark, Brielle C.; Fromm, Davida; Apple, Kristen; MacWhinney, Brian; Rensch, Amanda; Rowedder, Madyson – Journal of Speech, Language, and Hearing Research, 2022

Purpose: The aim of this study was to advance the use of structured, monologic discourse analysis by validating an automated scoring procedure for core lexicon (CoreLex) using transcripts. Method: Forty-nine transcripts from persons with aphasia and 48 transcripts from persons with no brain injury were retrieved from the AphasiaBank database. Five…

Descriptors: Validity, Discourse Analysis, Databases, Scoring

Exploring Differences in Measurement and Reporting of Classroom Observation Inter-Rater Reliability

Peer reviewed

Wilhelm, Anne Garrison; Gillespie Rouse, Amy; Jones, Francesca – Practical Assessment, Research & Evaluation, 2018

Although inter-rater reliability is an important aspect of using observational instruments, it has received little theoretical attention. In this article, we offer some guidance for practitioners and consumers of classroom observations so that they can make decisions about inter-rater reliability, both for study design and in the reporting of data…

Descriptors: Interrater Reliability, Measurement, Observation, Educational Research

Wise Crowd Content Assessment and Educational Rubrics

Peer reviewed

Passonneau, Rebecca J.; Poddar, Ananya; Gite, Gaurav; Krivokapic, Alisa; Yang, Qian; Perin, Dolores – International Journal of Artificial Intelligence in Education, 2018

Development of reliable rubrics for educational intervention studies that address reading and writing skills is labor-intensive, and could benefit from an automated approach. We compare a main ideas rubric used in a successful writing intervention study to a highly reliable wise-crowd content assessment method developed to evaluate…

Descriptors: Computer Assisted Testing, Writing Evaluation, Content Analysis, Scoring Rubrics
