ERIC - Search Results

Showing 1 to 15 of 25 results

Resolving and Re-Scoring Constructed Response Items in Mixed-Format Assessments: An Exploration of Three Approaches

Peer reviewed

Stefanie A. Wind; Yangmeng Xu – Educational Assessment, 2024

We explored three approaches to resolving or re-scoring constructed-response items in mixed-format assessments: rater agreement, person fit, and targeted double scoring (TDS). We used a simulation study to consider how the three approaches impact the psychometric properties of student achievement estimates, with an emphasis on person fit. We found…

Descriptors: Interrater Reliability, Error of Measurement, Evaluation Methods, Examiners

Online Administration of the Test of Narrative Language--Second Edition: Psychometrics and Considerations for Remote Assessment

Peer reviewed

Beula M. Magimairaj; Philip Capin; Sandra L. Gillam; Sharon Vaughn; Greg Roberts; Anna-Maria Fall; Ronald B. Gillam – Grantee Submission, 2022

Purpose: Our aim was to evaluate the psychometric properties of the online administered format of the Test of Narrative Language--Second Edition (TNL-2; Gillam & Pearson, 2017), given the importance of assessing children's narrative ability and considerable absence of psychometric studies of spoken language assessments administered online.…

Descriptors: Computer Assisted Testing, Language Tests, Story Telling, Language Impairments

Online Administration of the Test of Narrative Language--Second Edition: Psychometrics and Considerations for Remote Assessment

Peer reviewed

Beula M. Magimairaj; Philip Capin; Sandra L. Gillam; Sharon Vaughn; Greg Roberts; Anna-Maria Fall; Ronald B. Gillam – Language, Speech, and Hearing Services in Schools, 2022

Descriptors: Computer Assisted Testing, Language Tests, Story Telling, Language Impairments

Estimating Hazard Ratios from Published Kaplan-Meier Survival Curves: A Methods Validation Study

Peer reviewed

Saluja, Ronak; Cheng, Sierra; delos Santos, Keemo Althea; Chan, Kelvin K. W. – Research Synthesis Methods, 2019

Objective: Various statistical methods have been developed to estimate hazard ratios (HRs) from published Kaplan-Meier (KM) curves for the purpose of performing meta-analyses. The objective of this study was to determine the reliability, accuracy, and precision of four commonly used methods by Guyot, Williamson, Parmar, and Hoyle and Henley.…

Descriptors: Meta Analysis, Reliability, Accuracy, Randomized Controlled Trials

Intra- and Inter-Rater Reliability of the Rate of Force Development of Hip Abductor Muscles Measured by Hand-Held Dynamometer

Peer reviewed

Takeda, Kazuya; Tanabe, Shigeo; Koyama, Soichiro; Nagai, Tomoko; Sakurai, Hiroaki; Kanada, Yoshikiyo; Shomoto, Koji – Measurement in Physical Education and Exercise Science, 2018

The aim of this study was to clarify the intra- and inter-rater reliability of the rate of force development in hip abductor muscle force measurements using a hand-held dynamometer. Thirty healthy adults were separately assessed by two independent raters on two separate days. Rate of force development was calculated from the slope of the…

Descriptors: Interrater Reliability, Human Body, Measurement Equipment, Handheld Devices

The Miscalculation of Interrater Reliability: A Case Study Involving the AAC&U VALUE Rubrics

Peer reviewed

Szafran, Robert F. – Practical Assessment, Research & Evaluation, 2017

Institutional assessment of student learning objectives has become a fact-of-life in American higher education and the Association of American Colleges and Universities' (AAC&U) VALUE Rubrics have become a widely adopted evaluation and scoring tool for student work. As faculty from a variety of disciplines, some less familiar with the…

Descriptors: Interrater Reliability, Case Studies, Scoring Rubrics, Behavioral Objectives

Updated Technical Manual for the IDEA Feedback System for Administrators. IDEA Technical Report No. 20

Benton, Stephen L.; Li, Dan – IDEA Center, Inc., 2018

This technical report describes the results of analyses performed on data collected from 2013 to 2017, using the IDEA Feedback System for Administrators (FSA). The FSA is used to gather impressions from core constituents about an administrator's performance of relevant administrative roles, as well as her/his leadership style, interpersonal…

Descriptors: Feedback (Response), Administrators, Administrator Attitudes, Administrator Role

Inter-Rater and Test-Retest (Between-Sessions) Reliability of the 4-Skills Scan for Dutch Elementary School Children

Peer reviewed

van Kernebeek, Willem G.; de Schipper, Antoine W.; Savelsbergh, Geert J. P.; Toussaint, Huub M. – Measurement in Physical Education and Exercise Science, 2018

In The Netherlands, the 4-Skills Scan is an instrument for physical education teachers to assess gross motor skills of elementary school children. Little is known about its reliability. Therefore, in this study the test-retest and inter-rater reliability was determined. Respectively, 624 and 557 Dutch 6- to 12-year-old children were analyzed for…

Descriptors: Foreign Countries, Interrater Reliability, Pretests Posttests, Psychomotor Skills

Reliability of Entry-Level Athletic Trainers' Palpation Skills of Bony Anatomical Landmarks in the Lumbopelvic Region

Peer reviewed

Schultz, Sarah M.; Jacobs, Michelle M.; Gorgos, Kara S.; Wasylyk, Nicole T.; Hanrahan, Sean; Van Lunen, Bonnie L. – Athletic Training Education Journal, 2015

Context: Accuracy of locating various lumbopelvic landmarks for novice athletic trainers has not been examined. Objective: To examine reliability of novice athletic trainers for identification of the L4 spinous process and right and left posterior superior iliac spine (PSIS). Design: Cross-sectional reliability. Setting: Laboratory. Patients or…

Descriptors: Athletics, Allied Health Personnel, Entry Workers, Reliability

Investigation of Coefficient of Individual Agreement in Terms of Sample Size, Random and Monotone Missing Ratio, and Number of Repeated Measures

Peer reviewed

Temel, Gülhan Orekici; Erdogan, Semra; Selvi, Hüseyin; Kaya, Irem Ersöz – Educational Sciences: Theory and Practice, 2016

Studies based on longitudinal data focus on the change and development of the situation being investigated and allow for examining cases regarding education, individual development, cultural change, and socioeconomic improvement in time. However, as these studies require taking repeated measures in different time periods, they may include various…

Descriptors: Investigations, Sample Size, Longitudinal Studies, Interrater Reliability

Reliability of the Test of Integrated Language and Literacy Skills (TILLS)

Peer reviewed

Mailend, Marja-Liisa; Plante, Elena; Anderson, Michele A.; Applegate, E. Brooks; Nelson, Nickola W. – International Journal of Language & Communication Disorders, 2016

Background: As new standardized tests become commercially available, it is critical that clinicians have access to the information about a test's psychometric properties, including aspects of reliability. Aims: The purpose of the three studies reported in this article was to investigate the reliability of a new test, the Test of Integrated…

Descriptors: Standardized Tests, Psychometrics, Reliability, Language Skills

High Intra- and Inter-Rater Chance Variation of the Movement Assessment Battery for Children 2, Ageband 2

Peer reviewed

Holm, Inger; Tveter, Anne Therese; Aulie, Vibeke Smith; Stuge, Britt – Research in Developmental Disabilities: A Multidisciplinary Journal, 2013

The aim of the present study was to evaluate the intra- and inter-tester reliability of the movement assessment battery for children-second edition (MABC-2), ageband 2. We wanted to analyze the collected data, with adequate statistical methods, to provide relevant recommendations for physical therapists who are interpreting changes in the context…

Descriptors: Physical Therapy, Correlation, Scores, Error of Measurement

Observed Sensitivity during Family Interactions and Cumulative Risk: A Study of Multiple Dyads per Family

Peer reviewed

Browne, Dillon T.; Leckie, George; Prime, Heather; Perlman, Michal; Jenkins, Jennifer M. – Developmental Psychology, 2016

The present study sought to investigate the family, individual, and dyad-specific contributions to observed cognitive sensitivity during family interactions. Moreover, the influence of cumulative risk on sensitivity at the aforementioned levels of the family was examined. Mothers and 2 children per family were observed interacting in a round robin…

Descriptors: Family Relationship, Family (Sociological Unit), Sibling Relationship, Siblings

A Clinical Tool to Measure Trunk Control in Children with Cerebral Palsy: The Trunk Control Measurement Scale

Peer reviewed

Heyrman, Lieve; Molenaers, Guy; Desloovere, Kaat; Verheyden, Geert; De Cat, Jos; Monbaliu, Elegast; Feys, Hilde – Research in Developmental Disabilities: A Multidisciplinary Journal, 2011

In this study the psychometric properties of the Trunk Control Measurement Scale (TCMS) in children with cerebral palsy (CP) were examined. Twenty-six children with spastic CP (mean age 11 years 3 months, range 8-15 years; Gross Motor Function Classification System level I n = 11, level II n = 5, level III n = 10) were included in this study. To…

Descriptors: Construct Validity, Cerebral Palsy, Test Validity, Interrater Reliability

Validity Research on Teacher Evaluation Systems Based on the Framework for Teaching

Milanowski, Anthony T. – Online Submission, 2011

After decades of disinterest, evaluation of the performance of elementary and secondary teachers in the United States has become an important educational policy issue. As U.S. states and districts have tried to upgrade their evaluation processes, one of the models that has been increasingly used is the Framework for Teaching. This paper summarizes…

Descriptors: Evidence, Teacher Effectiveness, Teacher Evaluation, Observation
