Tong Wu; Stella Y. Kim; Carl Westine; Michelle Boyer – Journal of Educational Measurement, 2025
While significant attention has been given to test equating to ensure score comparability, limited research has explored equating methods for rater-mediated assessments, where human raters inherently introduce error. If not properly addressed, these errors can undermine score interchangeability and test validity. This study proposes an equating…
Descriptors: Item Response Theory, Evaluators, Error of Measurement, Test Validity
Kelly Edwards; James Soland – Educational Assessment, 2024
Classroom observational protocols, in which raters observe and score the quality of teachers' instructional practices, are often used to evaluate teachers for consequential purposes despite evidence that scores from such protocols are frequently driven by factors, such as rater and temporal effects, that have little to do with teacher quality. In…
Descriptors: Classroom Observation Techniques, Teacher Evaluation, Accuracy, Scores
Song, Yoon Ah; Lee, Won-Chan – Applied Measurement in Education, 2022
This article presents the performance of item response theory (IRT) models when double ratings are used as item scores over single ratings when rater effects are present. Study 1 examined the influence of the number of ratings on the accuracy of proficiency estimation in the generalized partial credit model (GPCM). Study 2 compared the accuracy of…
Descriptors: Item Response Theory, Item Analysis, Scores, Accuracy
Erman Aslanoglu, Aslihan; Sata, Mehmet – Participatory Educational Research, 2021
When students present writing tasks that require higher order thinking skills to work, one of the most important problems is scoring these writing tasks objectively. The fact that raters give scores below or above their performance based on several environmental factors affects the consistency of the measurements. Inconsistencies in scoring…
Descriptors: Interrater Reliability, Evaluators, Error of Measurement, Writing Evaluation
Investigating the Impact of Rater Training on Rater Errors in the Process of Assessing Writing Skill
Sata, Mehmet; Karakaya, Ismail – International Journal of Assessment Tools in Education, 2022
In the process of measuring and assessing high-level cognitive skills, interference of rater errors in measurements brings about a constant concern and low objectivity. The main purpose of this study was to investigate the impact of rater training on rater errors in the process of assessing individual performance. The study was conducted with a…
Descriptors: Evaluators, Training, Comparative Analysis, Academic Language