ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	8
Since 2016 (last 10 years)	12
Since 2006 (last 20 years)	23

Descriptor

Interrater Reliability	272
Scoring	67
Evaluation Methods	64
Higher Education	64
Test Reliability	59
Evaluators	55
Test Construction	44
Elementary Secondary Education	43
Performance Based Assessment	41
Test Validity	38
Writing Evaluation	37
Comparative Analysis	35
Measurement Techniques	35
Rating Scales	35
Test Items	33
Evaluation Criteria	28
Educational Assessment	27
Scores	27
Standard Setting (Scoring)	27
Generalizability Theory	25
Student Evaluation	24
Essay Tests	23
Language Tests	22
Foreign Countries	21
Standards	21
More ▼

Source

Online Submission	14
Grantee Submission	4
International Educational…	3
AERA Online Paper Repository	2
Academic Medicine	2
Applied Measurement in…	2
Center for Educational…	1
Educational Measurement:…	1
Educational and Psychological…	1
Evaluation and Program…	1
International Association for…	1
Journal of Communication…	1
Journal of Speech, Language,…	1
Multivariate Behavioral…	1
North American Chapter of the…	1
More ▼

Publication Type

Speeches/Meeting Papers	272
Reports - Research	177
Reports - Evaluative	75
Information Analyses	13
Tests/Questionnaires	11
Journal Articles	9
Reports - Descriptive	6
Opinion Papers	5
Collected Works - Serials	1
Dissertations/Theses	1
Guides - Non-Classroom	1
More ▼

Education Level

Higher Education	10
Postsecondary Education	7
Elementary Secondary Education	4
Secondary Education	3
Grade 4	2
Grade 6	2
Grade 8	2
High Schools	2
Elementary Education	1
Grade 10	1

Audience

Researchers	58
Practitioners	7
Teachers	3
Administrators	2
Counselors	1

Location

Australia	3
California	3
Nevada	3
Illinois	2
Netherlands	2
Pennsylvania	2
California (Berkeley)	1
Canada	1
Cuba	1
Denmark	1
Egypt	1
Georgia	1
India	1
Israel	1
Japan	1
Louisiana	1
New Jersey	1
North Carolina	1
Ohio	1
Rhode Island	1
Sweden	1
Texas (Houston)	1
Thailand	1
United Kingdom	1
Vietnam	1
More ▼

Laws, Policies, & Programs

Education Consolidation…

What Works Clearinghouse Rating

Showing 1 to 15 of 272 results Save | Export

Examining Inter-Rater Reliability of Evaluators Judging Teacher Performance: Proposing an Alternative to Cohen's Kappa. CEME Technical Report. CEMETR-2021-06

Download full text

Lambert, Richard G.; Holcomb, T. Scott; Bottoms, Bryndle L. – Center for Educational Measurement and Evaluation, 2021

The validity of the Kappa coefficient of chance-corrected agreement has been questioned when the prevalence of specific rating scale categories is low and agreement between raters is high. The researchers proposed the Lambda Coefficient of Rater-Mediated Agreement as an alternative to Kappa to address these concerns. Lambda corrects for chance…

Descriptors: Interrater Reliability, Teacher Evaluation, Test Validity, Evaluation Methods

A Comparison of Manual versus Automated Quantitative Production Analysis of Connected Speech

Peer reviewed

Direct link

Fromm, Davida; Katta, Saketh; Paccione, Mason; Hecht, Sophia; Greenhouse, Joel; MacWhinney, Brian; Schnur, Tatiana T. – Journal of Speech, Language, and Hearing Research, 2021

Purpose: Analysis of connected speech in the field of adult neurogenic communication disorders is essential for research and clinical purposes, yet time and expertise are often cited as limiting factors. The purpose of this project was to create and evaluate an automated program to score and compute the measures from the Quantitative Production…

Descriptors: Speech, Automation, Statistical Analysis, Adults

Evaluating Quadratic Weighted Kappa as the Standard Performance Metric for Automated Essay Scoring

Peer reviewed
PDF on ERIC

Download full text

Doewes, Afrizal; Kurdhi, Nughthoh Arfawi; Saxena, Akrati – International Educational Data Mining Society, 2023

Automated Essay Scoring (AES) tools aim to improve the efficiency and consistency of essay scoring by using machine learning algorithms. In the existing research work on this topic, most researchers agree that human-automated score agreement remains the benchmark for assessing the accuracy of machine-generated scores. To measure the performance of…

Descriptors: Essays, Writing Evaluation, Evaluators, Accuracy

Quantified Qualitative Analysis: Rubric Development and Inter-Rater Reliability as Iterative Design

Peer reviewed
PDF on ERIC

Download full text

McCarthy, Kathryn S.; Magliano, Joseph P.; Snyder, Jacob O.; Kenney, Elizabeth A.; Newton, Natalie N.; Perret, Cecile A.; Knezevic, Melanie; Allen, Laura K.; McNamara, Danielle S. – Grantee Submission, 2021

The objective in the current paper is to examine the processes of how our research team negotiated meaning using an iterative design approach as we established, developed, and refined a rubric to capture comprehension processes and strategies evident in students' verbal protocols. The overarching project comprises multiple data sets, multiple…

Descriptors: Scoring Rubrics, Interrater Reliability, Design, Learning Processes

Computer-Programmed Decision Trees for Assessing Teacher Noticing

Peer reviewed

Direct link

Schack, Edna O.; Dueber, David; Thomas, Jonathan Norris; Fisher, Molly H.; Jong, Cindy – AERA Online Paper Repository, 2019

Scoring of teachers' noticing responses is typically burdened with rater bias and reliance upon interrater consensus. The authors sought to make the scoring process more objective, equitable, and generalizable. The development process began with a description of response characteristics for each professional noticing component disconnected from…

Descriptors: Models, Teacher Evaluation, Observation, Bias

Developing a Tool for Measuring Student Orientations with Respect to Understanding in Mathematical Learning

Peer reviewed
PDF on ERIC

Download full text

Siqi Huang – North American Chapter of the International Group for the Psychology of Mathematics Education, 2023

The goal of this paper is twofold. First, the paper clarifies and elaborates on an important theoretical construct called orientation with respect to understanding in mathematics, which denotes the degree to which students exhibit an inclination towards and demonstrate an earnest concern for understanding in mathematical learning. Second, the…

Descriptors: Mathematics Instruction, Teaching Methods, Problem Solving, Reliability

The AI Teacher Test: Measuring the Pedagogical Ability of Blender and GPT-3 in Educational Dialogues

Peer reviewed
PDF on ERIC

Download full text

Tack, Anaïs; Piech, Chris – International Educational Data Mining Society, 2022

How can we test whether state-of-the-art generative models, such as Blender and GPT-3, are good AI teachers, capable of replying to a student in an educational dialogue? Designing an AI teacher test is challenging: although evaluation methods are much-needed, there is no off-the-shelf solution to measuring pedagogical ability. This paper reports…

Descriptors: Artificial Intelligence, Dialogs (Language), Bayesian Statistics, Decision Making

Contextual Definition Generation

Peer reviewed
PDF on ERIC

Download full text

Direct link

Yarbro, Jeffrey T.; Olney, Andrew M. – Grantee Submission, 2021

This paper explores the concept of dynamically generating definitions using a deep-learning model. We do this by creating a dataset that contains definition entries and contexts associated with each definition. We then fine-tune a GPT-2 based model on the dataset to allow the model to generate contextual definitions. We evaluate our model with…

Descriptors: Definitions, Learning Processes, Models, Context Effect

Examining the Validity and Reliability of a University's Teacher Performance Assessment (TPA)

Download full text

Klecker, Beverly M. – Online Submission, 2018

The Council for the Accreditation of Educator Preparation Programs (CAEP), required evidence of reliability and validity of measures used in a university's Educator Preparation Program (EPP). This paper describes processes that provided this evidence for the Teacher Performance Assessment (TPA). Literature examined included Messick (1989), Linn…

Descriptors: College Faculty, Teacher Evaluation, Performance Based Assessment, Test Validity

"Hello, [REDACTED]": Protecting Student Privacy in Analyses of Online Discussion Forums

Peer reviewed
PDF on ERIC

Download full text

Bosch, Nigel; Crues, R. Wes; Shaik, Najmuddin; Paquette, Luc – Grantee Submission, 2020

Online courses often include discussion forums, which provide a rich source of data to better understand and improve students' learning experiences. However, forum messages frequently contain private information that prevents researchers from analyzing these data. We present a method for discovering and redacting private information including…

Descriptors: Privacy, Discussion Groups, Asynchronous Communication, Methods

Modeling Creativity in Visual Programming: From Theory to Practice

Peer reviewed
PDF on ERIC

Download full text

Kovalkov, Anastasia; Paassen, Benjamin; Segal, Avi; Gal, Kobi; Pinkwart, Niels – International Educational Data Mining Society, 2021

Promoting creativity is considered an important goal of education, but creativity is notoriously hard to define and measure. In this paper, we make the journey from defining a formal creativity and applying the measure in a practical domain. The measure relies on core theoretical concepts in creativity theory, namely fluency, flexibility, and…

Descriptors: Creativity, Theory Practice Relationship, Evaluators, Specialists

Improving Teacher Education through Assessment of Portfolio Reviews: The Role of Inter-Rater Reliability

Peer reviewed

Direct link

McGough, David J. – AERA Online Paper Repository, 2017

This paper describes the implementation of an inter-rater reliability measure for assessing portfolio scores in a teacher education program. The reliability coefficient for the portfolio scores from completers of a newly revised program were compared with the reliability coefficient of the scores from a second set of reviewers who discussed the…

Descriptors: Interrater Reliability, Teacher Education Programs, Program Evaluation, Portfolio Assessment

Predicting Misalignment between Teachers' and Students' Essay Scores Using Natural Language Processing Tools

Peer reviewed
PDF on ERIC

Download full text

Allen, Laura K.; Crossley, Scott A.; McNamara, Danielle S. – Grantee Submission, 2015

We investigated linguistic factors that relate to misalignment between students' and teachers' ratings of essay quality. Students (n = 126) wrote essays and rated the quality of their work. Teachers then provided their own ratings of the essays. Results revealed that students who were less accurate in their self-assessments produced essays that…

Descriptors: Essays, Scores, Natural Language Processing, Interrater Reliability

Validity Research on Teacher Evaluation Systems Based on the Framework for Teaching

Download full text

Milanowski, Anthony T. – Online Submission, 2011

After decades of disinterest, evaluation of the performance of elementary and secondary teachers in the United States has become an important educational policy issue. As U.S. states and districts have tried to upgrade their evaluation processes, one of the models that has been increasingly used is the Framework for Teaching. This paper summarizes…

Descriptors: Evidence, Teacher Effectiveness, Teacher Evaluation, Observation

The Impact of ICT as Another Route to Overcome Learning Barriers for Students with SEN: A Case Study in an Egyptian Context

Download full text

Al-Gawhary, Wedad; Kambouri, Maria – International Association for Development of the Information Society, 2012

The purpose of this case study was to measure the impact of using ICT in Individual Learning Programmes of students with learning disabilities. Twenty five students and thirteen teachers took part in the research which was based on classroom observations. The Kappa coefficient was employed as a measure to statistically quantify the students'…

Descriptors: Foreign Countries, Special Needs Students, Down Syndrome, Autism

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 19

Littlefield, John H.	5
Lunz, Mary E.	5
Cason, Carolyn L.	4
Cason, Gerald J.	4
Jaeger, Richard M.	4
Capie, William	3
Engelhard, George, Jr.	3
Michaels, Hillary	3
Moffett, David W.	3
Myford, Carol M.	3
O'Neill, Thomas R.	3
Plake, Barbara S.	3
Reid, Barbara K.	3
Allen, Laura K.	2
Anderson, Judith A.	2
Busch, John Christian	2
Chang, Lei	2
Cope, Ronald T.	2
Crehan, Kevin D.	2
De Champlain, Andre F.	2
Du, Yi	2
Ferrara, Steven F.	2
Friedman, Greg	2
Halpin, Glennelle	2
More ▼

National Assessment of…	4
Teacher Performance…	4
Test of English as a Foreign…	4
Early Childhood Environment…	2
Medical College Admission Test	2
Praxis Series	2
ACTFL Oral Proficiency…	1
Alabama High School…	1
Behavioral and Emotional…	1
Child Behavior Checklist	1
Cognitive Abilities Test	1
Communication and Symbolic…	1
General Educational…	1
Graduate Management Admission…	1
Graduate Record Examinations	1
Group Assessment of Logical…	1
International English…	1
Iowa Tests of Basic Skills	1
Minnesota Multiphasic…	1
National Teacher Examinations	1
Peabody Individual…	1
Self Directed Search	1
Strong Campbell Interest…	1
Texas Assessment of Academic…	1
Torrance Tests of Creative…	1
More ▼