ERIC - Search Results

Publication Date

In 2026	0
Since 2025	1
Since 2022 (last 5 years)	4
Since 2017 (last 10 years)	13
Since 2007 (last 20 years)	25

Descriptor

Evaluators	53
Test Reliability	53
Test Validity	25
Interrater Reliability	21
Evaluation Methods	16
Scoring	16
Test Construction	13
Language Tests	11
Scores	10
Comparative Analysis	9
Foreign Countries	9
English (Second Language)	8
Evaluation Criteria	8
Rating Scales	8
Higher Education	7
Performance Based Assessment	7
Second Language Learning	7
Student Evaluation	7
Educational Assessment	6
Elementary Secondary Education	6
Test Items	6
Writing Tests	6
Correlation	5
Language Proficiency	5
Computer Software	4
More ▼

Publication Type

Reports - Research	32
Journal Articles	29
Speeches/Meeting Papers	15
Reports - Evaluative	10
Tests/Questionnaires	8
Reports - Descriptive	7
Guides - Non-Classroom	2
Dissertations/Theses -…	1
Guides - Classroom - Teacher	1
Numerical/Quantitative Data	1
Opinion Papers	1
More ▼

Education Level

Elementary Secondary Education	3
Higher Education	3
Postsecondary Education	3
Secondary Education	3
Elementary Education	2
Adult Education	1
Early Childhood Education	1
Grade 6	1
Grade 7	1
Grade 8	1
High Schools	1
Middle Schools	1
More ▼

Audience

Practitioners	2
Teachers	2
Administrators	1
Researchers	1

Location

Iran	2
United Kingdom	2
United States	2
Europe	1
Hawaii	1
Hong Kong	1
Illinois	1
Indonesia	1
Israel	1
New Jersey	1
South Africa	1
Texas	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

National Assessment of…	2
Test of English as a Foreign…	2
Alabama High School…	1
New Jersey High School…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 53 results Save | Export

The Value of Expanding Perspectives on Assessment

Peer reviewed

Direct link

Janice Kinghorn; Katherine McGuire; Bethany L. Miller; Aaron Zimmerman – Assessment Update, 2024

In this article, the authors share their reflections on how different experiences and paradigms have broadened their understanding of the work of assessment in higher education. As they collaborated to create a panel for the 2024 International Conference on Assessing Quality in Higher Education, they recognized that they, as assessment…

Descriptors: Higher Education, Assessment Literacy, Evaluation Criteria, Evaluation Methods

Examining the Effect of Item Difficulty and Rater Leniency on Iranian Test Takers' Performance on WDCT and DSAT: A Comparative Study

Peer reviewed
PDF on ERIC

Download full text

Reza Shahi; Hamdollah Ravand; Golam Reza Rohani – International Journal of Language Testing, 2025

The current paper intends to exploit the Many Facet Rasch Model to investigate and compare the impact of situations (items) and raters on test takers' performance on the Written Discourse Completion Test (WDCT) and Discourse Self-Assessment Tests (DSAT). In this study, the participants were 110 English as a Foreign Language (EFL) students at…

Descriptors: Comparative Analysis, English (Second Language), Second Language Learning, Second Language Instruction

Making Each Point Count: Revising a Local Adaptation of the Jacobs et al.'s (1981) ESL COMPOSITION PROFILE Rubric

Peer reviewed

Direct link

Yu-Tzu Chang; Ann Tai Choe; Daniel Holden; Daniel R. Isbell – Language Testing, 2024

In this Brief Report, we describe an evaluation of and revisions to a rubric adapted from the Jacobs et al.'s (1981) ESL COMPOSITION PROFILE, with four rubric categories and 20-point rating scales, in the context of an intensive English program writing placement test. Analysis of 4 years of rating data (2016-2021, including 434 essays) using…

Descriptors: Language Tests, Rating Scales, Second Language Learning, English (Second Language)

Inter-Rater Agreement in Assigning Cognitive Demand to Life Sciences Examination Questions

Peer reviewed

Direct link

Dempster, Edith R.; Kirby, Nicola F. – Perspectives in Education, 2018

Taxonomies of cognitive demand are frequently used to ensure that assessment tasks include questions ranging from low to high cognitive demand. This paper investigates inter-rater agreement among four evaluators on the cognitive demand of the South African National Senior Certificate Life Sciences examinations after training, practice and…

Descriptors: Interrater Reliability, Biological Sciences, Cognitive Processes, Test Items

Operationalizing the Reading-into-Writing Construct in Analytic Rating Scales: Effects of Different Approaches on Rating

Peer reviewed

Direct link

Lestari, Santi B.; Brunfaut, Tineke – Language Testing, 2023

Assessing integrated reading-into-writing task performances is known to be challenging, and analytic rating scales have been found to better facilitate the scoring of these performances than other common types of rating scales. However, little is known about how specific operationalizations of the reading-into-writing construct in analytic rating…

Descriptors: Reading Writing Relationship, Writing Tests, Rating Scales, Writing Processes

Autism at a Glance: A Pilot Study Optimizing Thin-Slice Observations

Peer reviewed

Direct link

Hampton, Lauren H.; Curtis, Philip R.; Roberts, Megan Y. – Autism: The International Journal of Research and Practice, 2019

Borrowing from a clinical psychology observational methodology, thin-slice observations were used to assess autism characteristics in toddlers. Thin-slices are short observations taken from a longer behavior stream which are assigned ratings by multiple raters using a 5-point scale. The raters' observations are averaged together to assign a…

Descriptors: Autism, Pervasive Developmental Disorders, Observation, Toddlers

Rater Certification Tests: A Psychometric Approach

Peer reviewed

Direct link

Attali, Yigal – Educational Measurement: Issues and Practice, 2019

Rater training is an important part of developing and conducting large-scale constructed-response assessments. As part of this process, candidate raters have to pass a certification test to confirm that they are able to score consistently and accurately before they begin scoring operationally. Moreover, many assessment programs require raters to…

Descriptors: Evaluators, Certification, High Stakes Tests, Scoring

Evaluating Students' Performance in Responding to Art: The Development and Validation of an Art Criticism Assessment Rubric

Peer reviewed

Direct link

Tam, Cheung On – International Journal of Art & Design Education, 2018

This article reports on the development and validation of a rubric for assessing students' written responses to artworks. Since the implementation of the Hong Kong New Senior Secondary Curriculum in 2009, art educators have seen responding to artworks as increasingly important. In this context, the Art Criticism Assessment Rubric (ACAR) was…

Descriptors: Foreign Countries, Art Education, Art Appreciation, Student Evaluation

The Effects of Primacy on Rater Cognition: An Eye-Tracking Study

Direct link

Ballard, Laura – ProQuest LLC, 2017

Rater scoring has an impact on writing test reliability and validity. Thus, there has been a continued call for researchers to investigate issues related to rating (Crusan, 2015). Investigating the scoring process and understanding how raters arrive at particular scores are critical "because the score is ultimately what will be used in making…

Descriptors: Evaluators, Schemata (Cognition), Eye Movements, Scoring Rubrics

Construct Exploration of Teacher Readiness as an Assessor of Vocational High School Competency Test

Peer reviewed
PDF on ERIC

Download full text

Cahyono, Sulistio Mukti; Kartawagiran, Badrun; Mahmudah, Fitri Nur – European Journal of Educational Research, 2021

Teachers who can adapt and be ready for all changes will also be able to provide a balance to increase the competence of vocational high school students. This is also not denied when teachers become assessors in student competency tests. The objectives of this study were to produce an instrument for the readiness of teachers as assessors; to…

Descriptors: Readiness, Vocational Education Teachers, Vocational High Schools, High School Students

Assessing L2 English Speaking Using Automated Scoring Technology: Examining Automarker Reliability

Peer reviewed

Direct link

Xu, Jing; Jones, Edmund; Laxton, Victoria; Galaczi, Evelina – Assessment in Education: Principles, Policy & Practice, 2021

Recent advances in machine learning have made automated scoring of learner speech widespread, and yet validation research that provides support for applying automated scoring technology to assessment is still in its infancy. Both the educational measurement and language assessment communities have called for greater transparency in describing…

Descriptors: Second Language Learning, Second Language Instruction, English (Second Language), Computer Software

Multiple Mini-Interviews in the Age of the Internet: Does Preparation Help Applicants to Medical School?

Peer reviewed

Direct link

Moshinsky, Avital; Ziegler, David; Gafni, Naomi – International Journal of Testing, 2017

Many medical schools have adopted multiple mini-interviews (MMI) as an advanced selection tool. MMIs are expensive and used to test only a few dozen candidates per day, making it infeasible to develop a different test version for each test administration. Therefore, some items are reused both within and across years. This study investigated the…

Descriptors: Interviews, Medical Schools, Test Validity, Test Reliability

A Study of the Danielson Framework for Teaching for Evaluating Early Childhood Teachers

Peer reviewed

Direct link

Hood, Lisa; Rodriguez, Sarai Coba; Rosa, Pamela Reimer; Hunt, Erika Lee – AERA Online Paper Repository, 2016

Teacher-evaluation has become a key measure for determining teacher effectiveness and the examination of the evaluation process at early childhood levels is a critical area of study. Responding to federal policies, many states have reformed their teacher evaluation systems. One of the more recent developments in state policy is the inclusion of…

Descriptors: Early Childhood Teachers, Teacher Evaluation, Evaluation Methods, Test Reliability

The Examining Evaluator Feedback Survey. REL 2016-100

Peer reviewed
PDF on ERIC

Download full text

Cherasaro, Trudy L.; Brodersen, R. Marc; Yanoski, David C.; Welp, Laura C.; Reale, Marianne L. – Regional Educational Laboratory Central, 2015

This report presents a survey tool, developed by REL Central at Marzano Research, designed to gather information from teachers about their perceptions of and responses to evaluator feedback. District or state administrators can use this survey to systematically collect teacher perceptions on five key aspects of evaluation feedback: (1) feedback…

Descriptors: Teacher Surveys, Evaluators, Teacher Attitudes, Feedback (Response)

Marking as Judgment

Peer reviewed

Direct link

Brooks, Val – Research Papers in Education, 2012

An aspect of assessment which has received little attention compared with perennial concerns, such as standards or reliability, is the role of judgment in marking. This paper explores marking as an act of judgment, paying particular attention to the nature of judgment and the processes involved. It brings together studies which have explored…

Descriptors: Educational Assessment, Test Reliability, Test Validity, Value Judgment

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4

Language Testing	3
AERA Online Paper Repository	1
American Journal of Evaluation	1
Assessment & Evaluation in…	1
Assessment Update	1
Assessment in Education:…	1
Autism: The International…	1
Canadian Modern Language…	1
Cogent Education	1
ETS Research Report Series	1
Educational Measurement:…	1
Educational Research Quarterly	1
Educational and Psychological…	1
European Journal of…	1
Evaluation & Research in…	1
International Journal of Art…	1
International Journal of…	1
International Journal of…	1
Journal of Autism and…	1
Journal of Continuing…	1
Journal of Educational…	1
Journal of Vocational…	1
Multivariate Behavioral…	1
National Center for Education…	1
Perspectives in Education	1
More ▼

Aaron Zimmerman	1
Abedi, Jamal	1
Ahrari, Ramin	1
Angoff, William H.	1
Ann Tai Choe	1
Apache, R. R.	1
Arnold, Voiza	1
Attali, Yigal	1
Ballard, Laura	1
Barth, Amy E.	1
Barwell, Fred	1
Bejar, Isaac I.	1
Bethany L. Miller	1
Bloom, Diane S.	1
Boser, Judith A.	1
Brodersen, R. Marc	1
Brooks, Val	1
Brunfaut, Tineke	1
Busenbark, Lynn	1
Bölte, Sven	1
Cahyono, Sulistio Mukti	1
Carifio, James	1
Cherasaro, Trudy L.	1
Choque Olsson, Nora	1
More ▼