ERIC - Search Results

Showing 1 to 15 of 96 results

Evaluating the Consistency and Reliability of Attribution Methods in Automated Short Answer Grading (ASAG) Systems: Toward an Explainable Scoring System

Peer reviewed

Wallace N. Pinto Jr.; Jinnie Shin – Journal of Educational Measurement, 2025

In recent years, the application of explainability techniques to automated essay scoring and automated short-answer grading (ASAG) models, particularly those based on transformer architectures, has gained significant attention. However, the reliability and consistency of these techniques remain underexplored. This study systematically investigates…

Descriptors: Automation, Grading, Computer Assisted Testing, Scoring

Rubric Development and Validation for Assessing Tasks' Solving via AI Chatbots

Peer reviewed

Mohammad Hmoud; Hadeel Swaity; Eman Anjass; Eva María Aguaded-Ramírez – Electronic Journal of e-Learning, 2024

This research aimed to develop and validate a rubric to assess Artificial Intelligence (AI) chatbots' effectiveness in accomplishing tasks, particularly within educational contexts. Given the rapidly growing integration of AI in various sectors, including education, a systematic and robust tool for evaluating AI chatbot performance is essential.…

Descriptors: Artificial Intelligence, Man Machine Systems, Natural Language Processing, Test Construction

Comparison of the Results of the Generalizability Theory with the Inter-Rater Agreement Coefficients

Peer reviewed

Eser, Mehmet Taha; Aksu, Gökhan – International Journal of Curriculum and Instruction, 2022

The agreement between raters is examined within the scope of the concept of "inter-rater reliability". Although there are clear definitions of the concepts of agreement between raters and reliability between raters, there is no clear information about the conditions under which agreement and reliability level methods are appropriate to…

Descriptors: Generalizability Theory, Interrater Reliability, Evaluation Methods, Test Theory

A Note on the Use of Categorical Subscores

Peer reviewed

Kylie Gorney; Sandip Sinharay – Journal of Educational Measurement, 2025

Although there exists an extensive amount of research on subscores and their properties, limited research has been conducted on categorical subscores and their interpretations. In this paper, we focus on the claim of Feinberg and von Davier that categorical subscores are useful for remediation and instructional purposes. We investigate this claim…

Descriptors: Tests, Scores, Test Interpretation, Alternative Assessment

Measuring Student and Educator Digital Competence beyond Self-Assessment: Developing and Validating Two Rubric-Based Frameworks

Peer reviewed

Flor de Lis González-Mujico – Education and Information Technologies, 2024

Over the past decade, self-assessment tools have garnered significant attention in the interest of measuring the skillset required by educators and students to function productively and ethically in digitally mediated environments, particularly in relation to education policy implementation. Since stated beliefs do not always align with actual…

Descriptors: Technological Literacy, Evaluation Methods, Test Validity, Test Construction

Design of a Simple Rubric to Peer-Evaluate the Teamwork Skills of Engineering Students

Peer reviewed

Swapneel Thite; Jayashri Ravishankar; Inmaculada Tomeo-Reyes; Araceli Martinez Ortiz – European Journal of Engineering Education, 2024

Effectively working in an engineering workplace requires strong teamwork skills, yet the existing literature within various disciplines reveals discrepancies in evaluating these skills. This complicates the design of a generic teamwork peer evaluation tool for engineering students. This study aims to address this gap by introducing the DRIVE…

Descriptors: Scoring Rubrics, Evaluation Methods, Peer Evaluation, Teamwork

Assessing Digital Maturity of Schools: Framework and Instrument

Peer reviewed

Begicevic Redjep, Nina; Balaban, Igor; Zugec, Bojan – Technology, Pedagogy and Education, 2021

The European Commission emphasises the need for educational institutions to integrate digital technologies in their teaching, learning and organisational practices. This study contributes to the field of digital transformation of schools by proposing and validating a Framework for Digitally Mature Schools (FDMS) and an instrument for assessing the…

Descriptors: Technology Integration, Information Technology, Program Evaluation, Educational Assessment

Students' Perceptions of Fairness in Groupwork Assessment: Validity Evidence for Peer Assessment Fairness Instrument

Peer reviewed

Amirhossein Rasooli; Jim Turner; Tünde Varga-Atkins; Edd Pitt; Shaghayegh Asgari; Will Moindrot – Assessment & Evaluation in Higher Education, 2025

Groupwork is a crucial aspect of work contexts and a key twenty-first-century skill. Assessment of groupwork poses a persistent challenge for educators in university contexts, with students reporting experiences of unfairness from their peers during groupwork. This study developed a novel Peer Assessment Fairness Instrument to explore factors…

Descriptors: Foreign Countries, Undergraduate Students, Student Attitudes, College Faculty

A Rubric Study for the Evaluation of Caricature Creation Building Skills of 6th Grade Students

Peer reviewed

Çifci, Musa; Kaplan, Kadir – Journal of Language and Linguistic Studies, 2020

This study aimed to develop a "Caricature Creation Rubric" that can be used to evaluate the products created by 6th grade students at the end of their caricature creation process, and to carry out validity and reliability studies of the rubric. The criteria in the graded key were determined by using the "Caricature Literacy Module" prepared by…

Descriptors: Cartoons, Scoring Rubrics, Evaluation Methods, Student Evaluation

Exploring Rating Quality in the Context of High-Stakes Rater-Mediated Educational Assessments

Wenjing Guo – ProQuest LLC, 2021

Constructed response (CR) items are widely used in large-scale testing programs, including the National Assessment of Educational Progress (NAEP) and many district and state-level assessments in the United States. One unique feature of CR items is that they depend on human raters to assess the quality of examinees' work. The judgment of human…

Descriptors: National Competency Tests, Responses, Interrater Reliability, Error of Measurement

Knowing and Doing: The Development of Information Literacy Measures to Assess Knowledge and Practice

Peer reviewed

Nierenberg, Ellen; Låg, Torstein; Dahl, Tove Irene – Journal of Information Literacy, 2021

This study touches upon three major themes in the field of information literacy (IL): the assessment of IL, the association between IL knowledge and skills, and the dimensionality of the IL construct. Three quantitative measures were developed and tested with several samples of university students to assess knowledge and skills for core facets of…

Descriptors: Information Literacy, College Students, Evaluation Methods, Knowledge Level

Reconsidering the Assessment Policy: Practical Use of Liberal Multiple-Choice Tests (SAC Method)

Peer reviewed

Cesur, Kursat – Educational Policy Analysis and Strategic Research, 2019

Examinees' performances are assessed using a wide variety of techniques, and multiple-choice (MC) tests are among the most frequently used. Nearly all standardized achievement tests make use of MC test items, and there are a variety of ways to score these tests. The study compares number right and liberal scoring (SAC) methods. Mixed…

Descriptors: Multiple Choice Tests, Scoring, Evaluation Methods, Guessing (Tests)

Test Assembly Implications for Providing Reliable and Valid Subscores

Peer reviewed

Lee, Minji K.; Sweeney, Kevin; Melican, Gerald J. – Educational Assessment, 2017

This study investigates the relationships among factor correlations, inter-item correlations, and the reliability estimates of subscores, providing a guideline with respect to psychometric properties of useful subscores. In addition, it compares subscore estimation methods with respect to reliability and distinctness. The subscore estimation…

Descriptors: Scores, Test Construction, Test Reliability, Test Validity

The End of Points

Feldman, Jo – Educational Leadership, 2018

Have teachers become too dependent on points? This article explores educators' dependency on their points systems, and the ways that points can distract teachers from really analyzing students' capabilities and achievements. Feldman argues that using a more subjective grading system can help illuminate crucial information about students and what…

Descriptors: Grading, Evaluation Methods, Evaluation Criteria, Achievement Rating

Developing a High Performance Digital Education Ecosystem: Institutional Self-Assessment Instruments

Volungeviciene, Airina; Brown, Mark; Greenspon, Rasa; Gaebel, Michael; Morrisroe, Alison – European University Association, 2021

Digitally enhanced learning and teaching is widely used across the European Higher Education Area, with general acceptance growing over the years and institutions widely acknowledging the benefits it brings to the student experience. The strategic focus being placed on digitally enhanced learning and teaching has increased, undoubtedly accelerated…

Descriptors: Educational Technology, Technology Uses in Education, Program Evaluation, Self Evaluation (Groups)
