Publication Date
| In 2026 | 0 |
| Since 2025 | 60 |
| Since 2022 (last 5 years) | 286 |
| Since 2017 (last 10 years) | 782 |
| Since 2007 (last 20 years) | 2044 |
Descriptor
| Interrater Reliability | 3126 |
| Foreign Countries | 655 |
| Test Reliability | 504 |
| Evaluation Methods | 503 |
| Test Validity | 411 |
| Correlation | 401 |
| Scoring | 347 |
| Comparative Analysis | 327 |
| Scores | 324 |
| Validity | 310 |
| Student Evaluation | 308 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 130 |
| Practitioners | 42 |
| Teachers | 22 |
| Administrators | 11 |
| Counselors | 3 |
| Policymakers | 2 |
Location
| Australia | 56 |
| Turkey | 53 |
| United Kingdom | 46 |
| Canada | 45 |
| Netherlands | 40 |
| China | 38 |
| California | 37 |
| United States | 30 |
| United Kingdom (England) | 25 |
| Taiwan | 23 |
| Germany | 22 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 3 |
| Meets WWC Standards with or without Reservations | 3 |
| Does not meet standards | 3 |
Ratner, Nan Bernstein – Language, Speech, and Hearing Services in Schools, 2018
Purpose: The purpose of the present clinical forum is to compare how 2 clinicians might select among therapy options for a preschool-aged child who presents with stuttering close to onset. Method: I discuss approaches to full evaluation of the child's profile, advisement of evidence-based practice options open to the family, the need for…
Descriptors: Outcomes of Treatment, Progress Monitoring, Evidence Based Practice, Preschool Children
Haug, Tobias; Ebling, Sarah; Braem, Penny Boyes; Tissi, Katja; Sidler-Miserez, Sandra – Language Education & Assessment, 2019
In German Switzerland the learning and assessment of Swiss German Sign Language ("Deutschschweizerische Gebärdensprache," DSGS) takes place in different contexts, for example, in tertiary education or in continuous education courses. By way of the still ongoing implementation of the Common European Framework of Reference for DSGS,…
Descriptors: German, Sign Language, Language Tests, Test Items
Desstya, Anatri; Prasetyo, Zuhdan Kun; Suyanta; Susila, Ihwan; Irwanto – International Journal of Instruction, 2019
This study aims to report the development an instrument that is standardized (reviewed by validity, reliability, and difficulty index) to detect science misconception in an elementary school teacher. This study used a 4-D model; defining, designing, developing, and disseminating. First, it was prepared with 47 opened-ended questions, and then it…
Descriptors: Elementary School Teachers, Misconceptions, Evaluation Methods, Teacher Evaluation
Ramon-Casas, Marta; Nuño, Neus; Pons, Ferran; Cunillera, Toni – Assessment & Evaluation in Higher Education, 2019
This article presents an empirical evaluation of the validity and reliability of a peer-assessment activity to improve academic writing competences. Specifically, we explored a large group of psychology undergraduate students with different initial writing skills. Participants (n = 365) produced two different essays, which were evaluated by their…
Descriptors: Peer Evaluation, Validity, Reliability, Writing Skills
Davis, Larry; Norris, John – ETS Research Report Series, 2021
The elicited imitation task (EIT), in which language learners listen to a series of spoken sentences and repeat each one verbatim, is a commonly used measure of language proficiency in second language acquisition research. The "TOEFL® Essentials"™ test includes an EIT as a holistic measure of speaking proficiency, referred to as the…
Descriptors: Task Analysis, Language Proficiency, Speech Communication, Language Tests
Ghadyani, Fariba; Tahririan, Mohammad Hassan; Afzali, Katayoon – Language Teaching Research Quarterly, 2022
There is a dearth of research on hope in studies of second or foreign (L2) language learning. Therefore, the present research contributes conceptually to a deep understanding of hope for learning English as a foreign language and the ways it may be developed. To do so, an exploratory mixed-methods design was employed. Using in-depth interviews,…
Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, Psychological Patterns
A Meta-Analytic Review of the Relations between Motivation and Reading Achievement for K-12 Students
Toste, Jessica R.; Didion, Lisa; Peng, Peng; Filderman, Marissa J.; McClelland, Amanda M. – Review of Educational Research, 2020
The purpose of this meta-analytic review was to investigate the relation between motivation and reading achievement among students in kindergarten through 12th grade. A comprehensive search of peer-reviewed published research resulted in 132 articles with 185 independent samples and 1,154 reported effect sizes (Pearson's r). Results of our…
Descriptors: Meta Analysis, Reading Achievement, Reading Motivation, Kindergarten
Guo, Xiuyan; Lei, Pui-Wa – International Journal of Testing, 2020
Little research has been done on the effects of peer raters' quality characteristics on peer rating qualities. This study aims to address this gap and investigate the effects of key variables related to peer raters' qualities, including content knowledge, previous rating experience, training on rating tasks, and rating motivation. In an experiment…
Descriptors: Peer Evaluation, Error Patterns, Correlation, Knowledge Level
Park, Mi Sun – Language Assessment Quarterly, 2020
In the present study, I examined the effects of rater characteristics, in particular, raters' familiarity with a foreign accent, on the assessment of second language (L2) pronunciation. Forty-three native English-speaking teachers were divided into three groups according to their reported types of familiarity with Korean accents: heritage,…
Descriptors: Evaluators, Familiarity, Second Language Learning, English (Second Language)
Gingerich, Andrea; Ramlo, Susan E.; van der Vleuten, Cees P. M.; Eva, Kevin W.; Regehr, Glenn – Advances in Health Sciences Education, 2017
Whenever multiple observers provide ratings, even of the same performance, inter-rater variation is prevalent. The resulting "idiosyncratic rater variance" is considered to be unusable error of measurement in psychometric models and is a threat to the defensibility of our assessments. Prior studies of inter-rater variation in clinical…
Descriptors: Interrater Reliability, Error of Measurement, Psychometrics, Q Methodology
Gillam, Sandra Laing; Gillam, Ronald B.; Fargo, Jamison D.; Olszewski, Abbie; Segura, Hugo – Communication Disorders Quarterly, 2017
The purpose of this study was to assess the basic psychometric properties of a progress-monitoring tool designed to measure narrative discourse skills in school-age children with language impairments (LI). A sample of 109 children with LI between the ages of 5 years 7 months and 9 years 9 months completed the "Test of Narrative Language"…
Descriptors: Progress Monitoring, Language Impairments, Children, Story Telling
Johnson, Austin H.; Chafouleas, Sandra M.; Briesch, Amy M. – School Psychology Quarterly, 2017
In this study, generalizability theory was used to examine the extent to which (a) time-sampling methodology, (b) number of simultaneous behavior targets, and (c) individual raters influenced variance in ratings of academic engagement for an elementary-aged student. Ten graduate-student raters, with an average of 7.20 hr of previous training in…
Descriptors: Generalizability Theory, Sampling, Elementary School Students, Learner Engagement
Taylor, Lauren J.; Eapen, Valsamma; Maybery, Murray; Midford, Sue; Paynter, Jessica; Quarmby, Lyndsay; Smith, Timothy; Williams, Katrina; Whitehouse, Andrew J. – Journal of Autism and Developmental Disorders, 2017
Previous research shows inconsistency in clinician-assigned diagnoses of Autism Spectrum Disorder (ASD). We conducted an exploratory study that examined the concordance of diagnoses between a multidisciplinary assessment team and a range of independent clinicians throughout Australia. Nine video-taped Autism Diagnostic Observation Schedule (ADOS)…
Descriptors: Autism, Pervasive Developmental Disorders, Clinical Diagnosis, Foreign Countries
Beers, Jason Ronald – ProQuest LLC, 2017
Purpose. The purpose of this study was to identify technology-related strategies used by educational leaders to increase prosocial behavior in K-12 schools. Information and communication technology (ICT) is developing at a rapid rate and is becoming more ubiquitous among students. Discovering and understanding common technology-related strategies…
Descriptors: Technology Uses in Education, Educational Strategies, Prosocial Behavior, Elementary Secondary Education
Albert M. Jimenez; Sally J. Zepeda – Sage Research Methods Cases, 2017
The work presented in this case study results from a study conducted in 2012-2014 examining a newly created teacher evaluation system to determine the inter-rater reliability of the classroom observation instrument. The teacher evaluation system was the result of a partnership between the school district and the university in the same city…
Descriptors: Case Studies, Interrater Reliability, Teacher Evaluation, Observation

Peer reviewed
Direct link
