Showing 1 to 15 of 3,752 results
Peer reviewed
Direct link
Louise Badham – Oxford Review of Education, 2025
Different sources of assessment evidence are reviewed during International Baccalaureate (IB) grade awarding to convert marks into grades and ensure fair results for students. Qualitative and quantitative evidence are analysed to determine grade boundaries, with statistical evidence weighed against examiner judgement and teachers' feedback on…
Descriptors: Advanced Placement Programs, Grading, Interrater Reliability, Evaluative Thinking
Peer reviewed
Direct link
Jae-Sang Han; Hyun-Joo Kim – Journal of Science Education and Technology, 2025
This study explores the potential to enhance the performance of convolutional neural networks (CNNs) for automated scoring of kinematic graph answers through data augmentation using Deep Convolutional Generative Adversarial Networks (DCGANs). By developing and fine-tuning a DCGAN model to generate high-quality graph images, we explored its…
Descriptors: Performance, Automation, Scoring, Models
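The augmentation strategy described in this abstract has a simple skeleton: a trained generator produces synthetic images that are mixed into the CNN's training set. The study's actual model is a DCGAN trained on kinematic graph images; in this sketch the generator is stubbed with random noise (a loud assumption) purely to show the data-flow, and all names are illustrative:

```python
import random

def stub_generator(n_samples, height=8, width=8, seed=0):
    """Stand-in for a trained DCGAN generator: emits random 'graph
    images' (nested lists of floats in [0, 1]). A real generator
    maps latent noise vectors to realistic kinematic graphs."""
    rng = random.Random(seed)
    return [[[rng.random() for _ in range(width)] for _ in range(height)]
            for _ in range(n_samples)]

def augment_training_set(real_images, real_labels, n_synthetic, synthetic_label):
    """Append generator output to the real training data; every
    synthetic image carries the class label its generator was trained on."""
    fake_images = stub_generator(n_synthetic)
    return (real_images + fake_images,
            real_labels + [synthetic_label] * n_synthetic)

# Toy usage: 100 real images of class 1, augmented with 50 synthetic ones.
real_X = [[[0.0] * 8 for _ in range(8)] for _ in range(100)]
real_y = [1] * 100
X, y = augment_training_set(real_X, real_y, n_synthetic=50, synthetic_label=1)
print(len(X), len(y))  # 150 150
```

The CNN is then trained on the combined set exactly as it would be on real data alone; the GAN's only role is to enlarge and balance the per-class training pools.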
Saenz, David Arron – Online Submission, 2023
A vast body of literature documents the positive impact of rater training and calibration sessions on inter-rater reliability; research indicates that several factors, including their frequency and timing, play crucial roles in ensuring it. Additionally, an increasing amount of research indicates possible links in…
Descriptors: Interrater Reliability, Scoring, Training, Scoring Rubrics
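The inter-rater reliability that calibration sessions aim to improve is commonly quantified with Cohen's kappa, which corrects raw agreement for agreement expected by chance. A minimal pure-Python sketch of the standard formula (not code from the paper):

```python
from collections import Counter

def cohens_kappa(rater_a, rater_b):
    """Cohen's kappa for two raters scoring the same items:
    (observed agreement - chance agreement) / (1 - chance agreement)."""
    assert len(rater_a) == len(rater_b)
    n = len(rater_a)
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    counts_a = Counter(rater_a)
    counts_b = Counter(rater_b)
    # Chance agreement from the product of the raters' marginal rates.
    chance = sum(counts_a[c] * counts_b.get(c, 0) for c in counts_a) / n**2
    return (observed - chance) / (1 - chance)

a = [1, 2, 3, 3, 2, 1, 1, 2]
b = [1, 2, 3, 2, 2, 1, 3, 2]
print(round(cohens_kappa(a, b), 3))  # 0.619
```

Values near 1 indicate agreement well beyond chance; values near 0 indicate agreement no better than chance, which is the baseline rater training tries to move raters away from.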
Peer reviewed
Direct link
Keshav Panray Jungbadoor; Xi Hong; Liu Liu; Yunan Zhu; Xinni Huang; Viraiyan Teeroovengadum; Gwilym Croucher; Angel Calderon; Sara Bice; Hamish Coates – Tertiary Education and Management, 2025
This paper reports on a multiyear program of international collaborative research delivered with the aim of conceptualising, validating and prototyping rubrics for evaluating and reporting university activities and outcomes relevant to the UN SDGs. The paper sets foundations by building on earlier analysis of research on university engagement with…
Descriptors: Higher Education, Universities, Sustainable Development, Scoring Rubrics
Peer reviewed
Direct link
David DiSabito; Lisa Hansen; Thomas Mennella; Josephine Rodriguez – New Directions for Teaching and Learning, 2025
This chapter investigates the integration of generative AI (GenAI), specifically ChatGPT, into institutional and course-level assessment at Western New England University. It explores the potential of GenAI to streamline the assessment process, making it more efficient, equitable, and objective. Through the development of a proprietary GenAI tool,…
Descriptors: Artificial Intelligence, Technology Uses in Education, Man Machine Systems, Educational Assessment
Peer reviewed
Direct link
Casabianca, Jodi M.; Donoghue, John R.; Shin, Hyo Jeong; Chao, Szu-Fu; Choi, Ikkyu – Journal of Educational Measurement, 2023
Using item response theory to model rater effects provides an alternative solution for rater monitoring and diagnosis, compared to using standard performance metrics. To fit such models, however, the ratings data must be sufficiently connected for the rater effects to be estimable. Due to popular rating designs used in large-scale testing scenarios,…
Descriptors: Item Response Theory, Alternative Assessment, Evaluators, Research Problems
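The connectivity requirement mentioned in this abstract can be made concrete: every rater must be linked to every other rater through chains of commonly rated responses, which amounts to checking that the rater-by-response bipartite graph has a single connected component. A hedged, stdlib-only sketch (the function and data names are illustrative, not from the paper):

```python
def is_connected_design(assignments):
    """assignments: iterable of (rater, response) pairs.
    Returns True if all raters and responses form one connected
    component, i.e. rater effects are jointly estimable (up to the
    usual identification constraints). Uses union-find."""
    parent = {}

    def find(x):
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    def union(x, y):
        parent[find(x)] = find(y)

    for rater, response in assignments:
        union(("rater", rater), ("response", response))

    roots = {find(node) for node in parent}
    return len(roots) <= 1

# Connected: rater B rates responses from both groups, linking A and C.
print(is_connected_design([("A", 1), ("B", 1), ("B", 2), ("C", 2)]))  # True
# Disconnected: raters A and C share no response, directly or indirectly.
print(is_connected_design([("A", 1), ("C", 2)]))  # False
```

In a disconnected design, the rater severity parameters of the separate components cannot be placed on a common scale, which is exactly the estimation problem the abstract alludes to.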
Peer reviewed
Direct link
Dhini, Bachriah Fatwa; Girsang, Abba Suganda; Sufandi, Unggul Utan; Kurniawati, Heny – Asian Association of Open Universities Journal, 2023
Purpose: The authors constructed an automatic essay scoring (AES) model for a discussion forum and compared its results with scores given by human evaluators. This research proposes an essay-scoring approach based on two parameters, semantic and keyword similarity, using a SentenceTransformers pre-trained model that can construct the…
Descriptors: Computer Assisted Testing, Scoring, Writing Evaluation, Essays
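The two-parameter scheme this abstract describes (semantic similarity plus keyword similarity) can be sketched in a few lines. The paper uses a SentenceTransformers pre-trained model for embeddings; here the encoder is stubbed with bag-of-words vectors, and the 0.7/0.3 blend weights are illustrative assumptions, not values from the paper:

```python
import math
from collections import Counter

def embed(text):
    """Stand-in for a SentenceTransformers encoder: a bag-of-words
    count vector. The real model returns dense sentence embeddings."""
    return Counter(text.lower().split())

def cosine(u, v):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(u[w] * v.get(w, 0) for w in u)
    norm = (math.sqrt(sum(c * c for c in u.values()))
            * math.sqrt(sum(c * c for c in v.values())))
    return dot / norm if norm else 0.0

def jaccard(keys_a, keys_b):
    """Keyword overlap as Jaccard similarity of two keyword sets."""
    a, b = set(keys_a), set(keys_b)
    return len(a & b) / len(a | b) if a | b else 0.0

def essay_score(student, reference, keywords, w_sem=0.7, w_key=0.3):
    """Weighted blend of semantic and keyword similarity, scaled to 0-100."""
    sem = cosine(embed(student), embed(reference))
    key = jaccard([k for k in keywords if k in student.lower()], keywords)
    return 100 * (w_sem * sem + w_key * key)

ref = "photosynthesis converts light energy into chemical energy"
ans = "plants use photosynthesis to turn light into chemical energy"
print(round(essay_score(ans, ref, ["photosynthesis", "light", "energy"]), 1))
```

Swapping the stub `embed` for a real sentence encoder changes only the semantic term; the blending logic stays the same.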
Peer reviewed
PDF on ERIC Download full text
Makiko Kato – Journal of Education and Learning, 2025
This study aims to examine whether differences exist in the factors influencing the difficulty of scoring English summaries and determining scores based on the raters' attributes, and to collect candid opinions, considerations, and tentative suggestions for future improvements to the analytic rubric of summary writing for English learners. In this…
Descriptors: Writing Evaluation, Scoring, Writing Skills, English (Second Language)
Peer reviewed
Direct link
Ariely, Moriah; Nazaretsky, Tanya; Alexandron, Giora – International Journal of Artificial Intelligence in Education, 2023
Machine learning algorithms that automatically score scientific explanations can be used to measure students' conceptual understanding, identify gaps in their reasoning, and provide them with timely and individualized feedback. This paper presents the results of a study that uses Hebrew NLP to automatically score student explanations in Biology…
Descriptors: Artificial Intelligence, Algorithms, Natural Language Processing, Hebrew
Peer reviewed
Direct link
Heather D. Hussey; Tara Lehan; Kate McConnell – Learning Assistance Review, 2024
Rubrics (e.g., Valid Assessment of Learning in Undergraduate Education (VALUE) rubrics) that measure specific skills exist, and researchers have demonstrated their benefits; however, most of them were designed for use with undergraduate students. Although some rubrics have been created to assess dissertations and oral defenses, few have been…
Descriptors: Scoring Rubrics, Doctoral Programs, Doctoral Dissertations, Online Courses
Peer reviewed
Direct link
Rodgers, Emily; D'Agostino, Jerome V.; Berenbon, Rebecca; Johnson, Tracy; Winkler, Christa – Journal of Early Childhood Literacy, 2023
Running Records are thought to be an excellent formative assessment tool because they generate results that educators can use to make their teaching more responsive. Despite the technical nature of scoring Running Records and the kinds of important decisions that are attached to their analysis, few studies have investigated assessor accuracy. We…
Descriptors: Formative Evaluation, Scoring, Accuracy, Difficulty Level
Yun-Kyung Kim; Li Cai – National Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2025
This paper introduces an application of cross-classified item response theory (IRT) modeling to an assessment utilizing the embedded standard setting (ESS) method (Lewis & Cook). The cross-classified IRT model is used to treat both item and person effects as random, where the item effects are regressed on the target performance levels (target…
Descriptors: Standard Setting (Scoring), Item Response Theory, Test Items, Difficulty Level
Peer reviewed
Direct link
Roduta Roberts, Mary; Gotch, Chad M.; Cook, Megan; Werther, Karin; Chao, Iris C. I. – Measurement: Interdisciplinary Research and Perspectives, 2022
Performance-based assessment is a common approach to assess the development and acquisition of practice competencies among health professions students. Judgments related to the quality of performance are typically operationalized as ratings against success criteria specified within a rubric. The extent to which the rubric is understood,…
Descriptors: Protocol Analysis, Scoring Rubrics, Interviews, Performance Based Assessment
Peer reviewed
Direct link
Bamdev, Pakhi; Grover, Manraj Singh; Singla, Yaman Kumar; Vafaee, Payman; Hama, Mika; Shah, Rajiv Ratn – International Journal of Artificial Intelligence in Education, 2023
English proficiency assessments have become a necessary metric for filtering and selecting prospective candidates for both academia and industry. With the rise in demand for such assessments, it has become increasingly necessary to have automated, human-interpretable results to prevent inconsistencies and ensure meaningful feedback to the…
Descriptors: Language Proficiency, Automation, Scoring, Speech Tests
Peer reviewed
PDF on ERIC Download full text
Doewes, Afrizal; Pechenizkiy, Mykola – International Educational Data Mining Society, 2021
Scoring essays is generally an exhausting and time-consuming task for teachers. Automated Essay Scoring (AES) facilitates the scoring process to be faster and more consistent. The most logical way to assess the performance of an automated scorer is by measuring the score agreement with the human raters. However, we provide empirical evidence that…
Descriptors: Man Machine Systems, Automation, Computer Assisted Testing, Scoring
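The human-machine score agreement this abstract discusses is conventionally measured with quadratic weighted kappa (QWK), which penalizes disagreements by the squared distance between the two scores. A minimal pure-Python implementation of the standard formula (not the authors' code; score range parameters are illustrative):

```python
def quadratic_weighted_kappa(human, machine, min_score, max_score):
    """QWK: 1 - (weighted observed disagreement / weighted chance
    disagreement), with quadratic weights w_ij = (i - j)^2."""
    n_cats = max_score - min_score + 1
    n = len(human)
    # Observed joint counts and per-rater marginal histograms.
    obs = [[0.0] * n_cats for _ in range(n_cats)]
    hist_h = [0.0] * n_cats
    hist_m = [0.0] * n_cats
    for h, m in zip(human, machine):
        obs[h - min_score][m - min_score] += 1
        hist_h[h - min_score] += 1
        hist_m[m - min_score] += 1
    num = den = 0.0
    for i in range(n_cats):
        for j in range(n_cats):
            w = (i - j) ** 2
            num += w * obs[i][j] / n
            den += w * hist_h[i] * hist_m[j] / (n * n)
    return 1.0 - num / den

print(quadratic_weighted_kappa([1, 2, 3, 4], [1, 2, 3, 4], 1, 4))  # 1.0
```

QWK of 1.0 means perfect agreement; the paper's point is that high agreement with human raters alone can be a misleading performance criterion for an automated scorer.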