ERIC - Search Results

Publication Date

In 2025	0
Since 2024	2
Since 2021 (last 5 years)	22
Since 2016 (last 10 years)	54
Since 2006 (last 20 years)	88

Descriptor

Interrater Reliability	115
Item Response Theory	115
Foreign Countries	36
Scoring	33
Scoring Rubrics	23
Evaluators	19
Rating Scales	19
Test Items	19
Scores	17
Test Construction	16
Psychometrics	15
Test Reliability	15
Performance Based Assessment	14
Student Evaluation	14
Correlation	13
Language Tests	13
English (Second Language)	12
Generalizability Theory	12
Models	12
Validity	12
Error of Measurement	11
Evaluation Methods	11
Test Validity	11
Writing Evaluation	11
Comparative Analysis	9
More ▼

Publication Type

Journal Articles	81
Reports - Research	80
Reports - Evaluative	21
Speeches/Meeting Papers	18
Reports - Descriptive	9
Dissertations/Theses -…	5
Tests/Questionnaires	5
Numerical/Quantitative Data	2
Collected Works - Serials	1

Education Level

Higher Education	22
Elementary Education	17
Postsecondary Education	17
Intermediate Grades	8
Middle Schools	8
Elementary Secondary Education	7
Secondary Education	7
Grade 4	5
Grade 6	5
Junior High Schools	5
Grade 5	3
High Schools	3
Grade 8	2
Kindergarten	2
Early Childhood Education	1
Grade 11	1
Grade 2	1
Grade 7	1
Primary Education	1
More ▼

Audience

Researchers	2
Practitioners	1

Location

Turkey	9
Taiwan	4
South Korea	3
Australia	2
Canada	2
Finland	2
Hong Kong	2
Netherlands	2
New Mexico	2
United Kingdom	2
California (Berkeley)	1
China	1
Georgia	1
Iran	1
Japan	1
Kuwait	1
Missouri	1
Oregon	1
Sweden	1
Taiwan (Taipei)	1
Turkey (Ankara)	1
United Kingdom (England)	1
United States	1
More ▼

Laws, Policies, & Programs

American Recovery and…	1
Elementary and Secondary…	1

Assessments and Surveys

Home Observation for…	1
International English…	1
Peabody Picture Vocabulary…	1
Strengths and Difficulties…	1
Test of English as a Foreign…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 115 results Save | Export

Communal Factors in Rater Severity and Consistency over Time in High-Stakes Oral Assessment

Peer reviewed

Direct link

Reeta Neittaanmäki; Iasonas Lamprianou – Language Testing, 2024

This article focuses on rater severity and consistency and their relation to major changes in the rating system in a high-stakes testing context. The study is based on longitudinal data collected from 2009 to 2019 from the second language (L2) Finnish speaking subtest in the National Certificates of Language Proficiency in Finland. We investigated…

Descriptors: Foreign Countries, Interrater Reliability, Evaluators, Item Response Theory

Developing a Tool for Measuring Student Orientations with Respect to Understanding in Mathematical Learning

Peer reviewed
PDF on ERIC

Download full text

Siqi Huang – North American Chapter of the International Group for the Psychology of Mathematics Education, 2023

The goal of this paper is twofold. First, the paper clarifies and elaborates on an important theoretical construct called orientation with respect to understanding in mathematics, which denotes the degree to which students exhibit an inclination towards and demonstrate an earnest concern for understanding in mathematical learning. Second, the…

Descriptors: Mathematics Instruction, Teaching Methods, Problem Solving, Reliability

The Relationship of Special Education Teacher Performance on Observation Instruments with Student Outcomes

Peer reviewed
PDF on ERIC

Download full text

Direct link

Johnson, Evelyn S.; Zheng, Yuzhu; Crawford, Angela R.; Moylan, Laura A. – Journal of Learning Disabilities, 2021

In this study, we examined the relationship of special education teachers' performance on the Recognizing Effective Special Education Teachers (RESET) Explicit Instruction observation protocol with student growth on academic measures. Special education teachers provided video-recorded observations of three instructional lessons along with data…

Descriptors: Special Education Teachers, Teacher Effectiveness, Teacher Evaluation, Direct Instruction

Development of Gazi Functional Vision Assessment Instrument

Peer reviewed
PDF on ERIC

Download full text

Safak, Pinar; Cakmak, Salih; Karakoc, Tamer; Aydin O'Dwyer, Pinar – European Journal of Educational Research, 2021

This study aimed to develop a valid and reliable instrument that measures the functional vision of students with low vision. Thus, an assessment tool and performance activities were developed for three vision skill groups (near vision skills, distance vision skills, and visual field) that include functional vision skills. The universe was 1485…

Descriptors: Foreign Countries, Vision Tests, Diagnostic Tests, Vision

Investigating Musical Aptitude Examination with a Many-Facet Rasch Model

Peer reviewed
PDF on ERIC

Download full text

Gübes, Nese Öztürk – Participatory Educational Research, 2021

The aim of this study is to show how a many-facet Rasch measurement model (MFRM) can be used for quality control whilst monitoring a musical aptitude examination. The data used in this study was gathered from a musical aptitude examination which was applied in 2019-2020 academic year for selecting teacher candidates to a music education department…

Descriptors: Foreign Countries, Music Education, Teacher Education Programs, Preservice Teacher Education

Posterior Predictive Model Checking of the Hierarchical Rater Model

Direct link

Nnamdi Chika Ezike – ProQuest LLC, 2022

Fitting wrongly specified models to observed data may lead to invalid inferences about the model parameters of interest. The current study investigated the performance of the posterior predictive model checking (PPMC) approach in detecting model-data misfit of the hierarchical rater model (HRM). The HRM is a rater-mediated model that incorporates…

Descriptors: Prediction, Models, Interrater Reliability, Item Response Theory

Investigating Constructed-Response Scoring over Time: The Effects of Study Design on Trend Rescore Statistics. Research Report. ETS RR-22-15

Peer reviewed
PDF on ERIC

Download full text

Donoghue, John R.; McClellan, Catherine A.; Hess, Melinda R. – ETS Research Report Series, 2022

When constructed-response items are administered for a second time, it is necessary to evaluate whether the current Time B administration's raters have drifted from the scoring of the original administration at Time A. To study this, Time A papers are sampled and rescored by Time B scorers. Commonly the scores are compared using the proportion of…

Descriptors: Item Response Theory, Test Construction, Scoring, Testing

Using Many-Facet Rasch Measurement and Generalizability Theory to Explore Rater Effects for Direct Behavior Rating--Multi-Item Scales

Peer reviewed

Direct link

Anthony, Christopher J.; Styck, Kara M.; Volpe, Robert J.; Robert, Christopher R. – School Psychology, 2023

Although originally conceived of as a marriage of direct behavioral observation and indirect behavior rating scales, recent research has indicated that Direct Behavior Ratings (DBRs) are affected by rater idiosyncrasies (rater effects) similar to other indirect forms of behavioral assessment. Most of this research has been conducted using…

Descriptors: Item Response Theory, Generalizability Theory, Interrater Reliability, Behavior Rating Scales

Examining Rating Quality in Rater-Mediated Activities for Standard-Item Alignment Research

Direct link

Yvette Jackson – ProQuest LLC, 2023

Rater-mediated activities in educational research occur when an expert judge or rater utilizes an instrument to judge persons or items and generates scale scores. Scale scores are from a subjective judgment and must undergo a quality control measure called rating quality. Rating quality in this study is broadly defined as the extent to which…

Descriptors: Educational Research, Evaluators, Test Theory, Item Response Theory

Resolving and Re-Scoring Constructed Response Items in Mixed-Format Assessments: An Exploration of Three Approaches

Peer reviewed

Direct link

Stefanie A. Wind; Yangmeng Xu – Educational Assessment, 2024

We explored three approaches to resolving or re-scoring constructed-response items in mixed-format assessments: rater agreement, person fit, and targeted double scoring (TDS). We used a simulation study to consider how the three approaches impact the psychometric properties of student achievement estimates, with an emphasis on person fit. We found…

Descriptors: Interrater Reliability, Error of Measurement, Evaluation Methods, Examiners

Examination of Map Reading Skills with Orienteering Activity: An Example of Many Facet Rasch Model

Peer reviewed
PDF on ERIC

Download full text

Uyar, Seyma; Yayla, Onur; Zunber, Hidayet – International Journal of Assessment Tools in Education, 2022

The purpose of the current study is to examine the map reading skills of Social Studies pre-service teachers with orienteering, which is an activity-based and more active practice. To this end, a total of 10 students attending the Department of Social Studies Teaching in the Education Faculty of Burdur Mehmet Akif Ersoy University and taking the…

Descriptors: Map Skills, Navigation, Item Response Theory, Social Studies

To What Extent Are Item Discrimination Values Realistic? A New Index for Two-Dimensional Structures

Peer reviewed
PDF on ERIC

Download full text

Kilic, Abdullah Faruk; Uysal, Ibrahim – International Journal of Assessment Tools in Education, 2022

Most researchers investigate the corrected item-total correlation of items when analyzing item discrimination in multi-dimensional structures under the Classical Test Theory, which might lead to underestimating item discrimination, thereby removing items from the test. Researchers might investigate the corrected item-total correlation with the…

Descriptors: Item Analysis, Correlation, Item Response Theory, Test Items

Development and Validation of a Survey Instrument for Measuring Pre-Service Teachers' Pedagogical Content Knowledge

Peer reviewed

Direct link

Martin, David; Jamieson-Proctor, Romina – International Journal of Research & Method in Education, 2020

In Australia, one of the key findings of the Teacher Education Ministerial Advisory Group was that not all graduating pre-service teachers possess adequate pedagogical content knowledge (PCK) to teach effectively. The concern is that higher education providers working with pre-service teachers are using pedagogical practices and assessments which…

Descriptors: Test Construction, Preservice Teachers, Pedagogical Content Knowledge, Foreign Countries

The Relationship of Special Education Teacher Performance on Observation Instruments with Student Outcomes

Peer reviewed
PDF on ERIC

Download full text

Direct link

Johnson, Evelyn S.; Zheng, Yuzhu; Crawford, Angela R.; Moylan, Laura A. – Grantee Submission, 2020

In this study, we examined the relationship of special education teachers' performance on the RESET Explicit Instruction observation protocol with student growth on academic measures. Special education teachers provided video recorded observations of three instructional lessons along with data from standardized, curriculum-based academic measures…

Descriptors: Special Education Teachers, Teacher Effectiveness, Teacher Evaluation, Direct Instruction

Evaluating an Explicit Instruction Teacher Observation Protocol through a Validity Argument Approach

Peer reviewed

Direct link

Johnson, Evelyn S.; Zheng, Yuzhu; Crawford, Angela R.; Moylan, Laura A. – Journal of Experimental Education, 2022

In this study, we examined the scoring and generalizability assumptions of an explicit instruction (EI) special education teacher observation protocol using many-faceted Rasch measurement (MFRM). Video observations of classroom instruction from 48 special education teachers across four states were collected. External raters (n = 20) were trained…

Descriptors: Direct Instruction, Teacher Education, Classroom Observation Techniques, Validity

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8

Language Testing	7
International Journal of…	5
Journal of Educational…	5
ProQuest LLC	5
Grantee Submission	4
Online Submission	4
Educational and Psychological…	3
Eurasian Journal of…	3
Journal of Outcome Measurement	3
Language Assessment Quarterly	3
Measurement in Physical…	3
Society for Research on…	3
Applied Measurement in…	2
Applied Psychological…	2
Assessment in Education:…	2
Educational Assessment	2
International Journal of…	2
New Mexico Public Education…	2
SAGE Open	2
Adapted Physical Activity…	1
Assessing Writing	1
Canadian Journal of Applied…	1
Creativity Research Journal	1
ETS Research Report Series	1
Educational Policy Analysis…	1
More ▼

Johnson, Evelyn S.	6
Moylan, Laura A.	6
Zheng, Yuzhu	6
Crawford, Angela R.	5
Lunz, Mary E.	5
Engelhard, George, Jr.	4
Wind, Stefanie A.	4
Karakaya, Ismail	3
O'Neill, Thomas R.	3
Wyse, Adam E.	3
Avery, Marybell	2
Dyson, Ben	2
Fisette, Jennifer L.	2
Fox, Connie	2
Franck, Marian	2
Friedman, Greg	2
Graber, Kim C.	2
Güler, Nese	2
Hsieh, Mingchuan	2
Kang, Minsoo	2
Michaels, Hillary	2
Newman, Larry S.	2
Ochieng, Charles	2
Park, Youngsik	2
More ▼