Showing 1 to 15 of 124 results
Peer reviewed
John R. Donoghue; Carol Eckerly – Applied Measurement in Education, 2024
Trend scoring of constructed-response items (i.e., rescoring Time A responses at Time B) gives rise to two-way data that follow a product multinomial distribution rather than the multinomial distribution that is usually assumed. Recent work has shown that the difference in sampling model can have profound negative effects on statistics usually used to…
Descriptors: Scoring, Error of Measurement, Reliability, Scoring Rubrics
Peer reviewed
A. E. Ades; Nicky J. Welton; Sofia Dias; David M. Phillippo; Deborah M. Caldwell – Research Synthesis Methods, 2024
Network meta-analysis (NMA) is an extension of pairwise meta-analysis (PMA) which combines evidence from trials on multiple treatments in connected networks. NMA delivers internally consistent estimates of relative treatment efficacy, needed for rational decision making. Over its first 20 years NMA's use has grown exponentially, with applications…
Descriptors: Network Analysis, Meta Analysis, Medicine, Clinical Experience
Peer reviewed
Crompvoets, Elise A. V.; Béguin, Anton A.; Sijtsma, Klaas – Journal of Educational and Behavioral Statistics, 2020
Pairwise comparison is becoming increasingly popular as a holistic measurement method in education. Unfortunately, many comparisons are required for reliable measurement. To reduce the number of required comparisons, we developed an adaptive selection algorithm (ASA) that selects the most informative comparisons while taking the uncertainty of the…
Descriptors: Comparative Analysis, Statistical Analysis, Mathematics, Measurement
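The entry above concerns selecting informative pairwise comparisons. As a generic illustration (not the authors' ASA, which also weights the uncertainty of the estimates), a greedy selector under a Bradley-Terry model can pick the not-yet-compared pair whose comparison carries the most Fisher information, which is highest for closely matched objects:

```python
import math
from itertools import combinations

def pair_information(theta_i, theta_j):
    """Fisher information of one Bradley-Terry comparison between
    objects with current ability estimates theta_i and theta_j.
    Information p*(1-p) peaks when the win probability p is near
    0.5, i.e. for closely matched objects."""
    p = 1 / (1 + math.exp(-(theta_i - theta_j)))
    return p * (1 - p)

def most_informative_pair(thetas, compared):
    """Greedy adaptive selection sketch: among pairs not yet
    compared, return the pair maximizing comparison information."""
    candidates = [p for p in combinations(range(len(thetas)), 2)
                  if p not in compared]
    return max(candidates,
               key=lambda p: pair_information(thetas[p[0]], thetas[p[1]]))

thetas = [0.0, 0.1, 1.5, -2.0]   # illustrative ability estimates
print(most_informative_pair(thetas, compared={(0, 1)}))
```

With pair (0, 1) already used, the selector picks the closest remaining match; the published algorithm differs in how it trades off information against estimation uncertainty.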
Peer reviewed
Saluja, Ronak; Cheng, Sierra; delos Santos, Keemo Althea; Chan, Kelvin K. W. – Research Synthesis Methods, 2019
Objective: Various statistical methods have been developed to estimate hazard ratios (HRs) from published Kaplan-Meier (KM) curves for the purpose of performing meta-analyses. The objective of this study was to determine the reliability, accuracy, and precision of four commonly used methods by Guyot, Williamson, Parmar, and Hoyle and Henley.…
Descriptors: Meta Analysis, Reliability, Accuracy, Randomized Controlled Trials
Peer reviewed
De Raadt, Alexandra; Warrens, Matthijs J.; Bosker, Roel J.; Kiers, Henk A. L. – Educational and Psychological Measurement, 2019
Cohen's kappa coefficient is commonly used for assessing agreement between classifications of two raters on a nominal scale. Three variants of Cohen's kappa that can handle missing data are presented. Data are considered missing if one or both ratings of a unit are missing. We study how well the variants estimate the kappa value for complete data…
Descriptors: Interrater Reliability, Data, Statistical Analysis, Statistical Bias
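The kappa variants studied above are not reproduced here, but the baseline statistic they extend is easy to sketch. A minimal Cohen's kappa that handles missing ratings by simple listwise deletion (one obvious approach, not necessarily one of the three variants presented):

```python
from collections import Counter

def cohens_kappa(r1, r2):
    """Cohen's kappa for two raters' nominal ratings.

    Units where either rating is None are dropped (listwise
    deletion) -- an assumption for illustration, not the
    article's missing-data variants.
    """
    pairs = [(a, b) for a, b in zip(r1, r2)
             if a is not None and b is not None]
    n = len(pairs)
    categories = {c for pair in pairs for c in pair}
    # Observed agreement: proportion of units rated identically.
    p_o = sum(a == b for a, b in pairs) / n
    # Chance agreement under independence, from the raters' marginals.
    m1 = Counter(a for a, _ in pairs)
    m2 = Counter(b for _, b in pairs)
    p_e = sum(m1[c] * m2[c] for c in categories) / n**2
    return (p_o - p_e) / (1 - p_e)

# Two raters classify 8 units; unit 6 has a missing rating.
r1 = ["A", "A", "B", "B", "A", None, "B", "A"]
r2 = ["A", "B", "B", "B", "A", "A",  "B", "A"]
print(round(cohens_kappa(r1, r2), 3))
```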
Peer reviewed
van Kernebeek, Willem G.; de Schipper, Antoine W.; Savelsbergh, Geert J. P.; Toussaint, Huub M. – Measurement in Physical Education and Exercise Science, 2018
In The Netherlands, the 4-Skills Scan is an instrument for physical education teachers to assess the gross motor skills of elementary school children. Little is known about its reliability; therefore, in this study the test-retest and inter-rater reliability were determined. Respectively, 624 and 557 Dutch 6- to 12-year-old children were analyzed for…
Descriptors: Foreign Countries, Interrater Reliability, Pretests Posttests, Psychomotor Skills
Peer reviewed
Nicewander, W. Alan – Educational and Psychological Measurement, 2018
Spearman's correction for attenuation (measurement error) corrects a correlation coefficient for measurement errors in either or both of two variables, and follows from the assumptions of classical test theory. Spearman's equation removes all measurement error from a correlation coefficient, which translates into "increasing the reliability of…
Descriptors: Error of Measurement, Correlation, Sample Size, Computation
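Spearman's correction itself is a one-line formula from classical test theory: the disattenuated correlation divides the observed correlation by the square root of the product of the two reliabilities. A minimal sketch with illustrative numbers:

```python
import math

def disattenuate(r_xy, r_xx, r_yy):
    """Spearman's correction for attenuation: estimate the
    correlation between true scores from the observed correlation
    r_xy and the reliabilities r_xx and r_yy of the two measures."""
    return r_xy / math.sqrt(r_xx * r_yy)

# An observed r of .42 with reliabilities .70 and .80
# (illustrative values) yields a corrected correlation of about .56.
print(round(disattenuate(0.42, 0.70, 0.80), 3))
```

The correction grows as the reliabilities shrink, which is why, as the article discusses, applying it amounts to asking what the correlation would be with perfectly reliable measures.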
Peer reviewed
Soysal, Sumeyra; Karaman, Haydar; Dogan, Nuri – Eurasian Journal of Educational Research, 2018
Purpose of the Study: Missing data are a common problem encountered while implementing measurement instruments. Yet the extent to which reliability, validity, average discrimination and difficulty of the test results are affected by the missing data has not been studied much. Since it is inevitable that missing data have an impact on the…
Descriptors: Sample Size, Data Analysis, Research Problems, Error of Measurement
Peer reviewed
Zhang, Zhiyong; Yuan, Ke-Hai – Educational and Psychological Measurement, 2016
Cronbach's coefficient alpha is a widely used reliability measure in social, behavioral, and education sciences. It is reported in nearly every study that involves measuring a construct through multiple items. With non-tau-equivalent items, McDonald's omega has been used as a popular alternative to alpha in the literature. Traditional estimation…
Descriptors: Computation, Statistical Analysis, Robustness (Statistics), Error of Measurement
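The coefficient alpha discussed above has a standard closed form: alpha = k/(k-1) * (1 - sum of item variances / variance of total scores). A self-contained sketch on made-up data (the article's robust estimation methods for alpha and omega are not reproduced here):

```python
def cronbach_alpha(items):
    """Cronbach's alpha from a list of item columns, each a list
    of scores with one entry per respondent."""
    k = len(items)
    n = len(items[0])

    def var(xs):  # population variance
        m = sum(xs) / len(xs)
        return sum((x - m) ** 2 for x in xs) / len(xs)

    # Total score per respondent across the k items.
    totals = [sum(col[i] for col in items) for i in range(n)]
    return k / (k - 1) * (1 - sum(var(col) for col in items) / var(totals))

# Four items scored by five respondents (illustrative data).
items = [
    [3, 4, 3, 5, 2],
    [3, 5, 4, 4, 2],
    [2, 4, 4, 5, 3],
    [3, 5, 3, 4, 2],
]
print(round(cronbach_alpha(items), 3))
```

Alpha equals omega only under tau-equivalence (equal item loadings), which is exactly why, as the abstract notes, McDonald's omega is the preferred alternative when items are non-tau-equivalent.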
Peer reviewed
Zhang, Zhiyong; Yuan, Ke-Hai – Grantee Submission, 2016
Cronbach's coefficient alpha is a widely used reliability measure in social, behavioral, and education sciences. It is reported in nearly every study that involves measuring a construct through multiple items. With non-tau-equivalent items, McDonald's omega has been used as a popular alternative to alpha in the literature. Traditional estimation…
Descriptors: Computation, Error of Measurement, Robustness (Statistics), Statistical Analysis
Peer reviewed
Conger, Anthony J. – Educational and Psychological Measurement, 2017
Drawing parallels to classical test theory, this article clarifies the difference between rater accuracy and reliability and demonstrates how category marginal frequencies affect rater agreement and Cohen's kappa. Category assignment paradigms are developed: comparing raters to a standard (index) versus comparing two raters to one another…
Descriptors: Interrater Reliability, Evaluators, Accuracy, Statistical Analysis
Peer reviewed
Cousineau, Denis; Laurencelle, Louis – Educational and Psychological Measurement, 2017
Assessing global interrater agreement is difficult as most published indices are affected by the presence of mixtures of agreements and disagreements. A previously proposed method was shown to be specifically sensitive to global agreement, excluding mixtures, but also negatively biased. Here, we propose two alternatives in an attempt to find what…
Descriptors: Interrater Reliability, Evaluation Methods, Statistical Bias, Accuracy
Peer reviewed
Driller, Matthew; Brophy-Williams, Ned; Walker, Anthony – Measurement in Physical Education and Exercise Science, 2017
The purpose of the present study was to determine the reliability of a 5-km run test on a motorized treadmill. Over three consecutive weeks, 12 well-trained runners completed three 5-km time trials on a treadmill following a standardized warm-up. Runners were partially blinded to their running speed and distance covered. Total time to complete the…
Descriptors: Athletics, Physical Activities, Athletes, Test Reliability
Peer reviewed
Takeda, Kazuya; Tanabe, Shigeo; Koyama, Soichiro; Nagai, Tomoko; Sakurai, Hiroaki; Kanada, Yoshikiyo; Shomoto, Koji – Measurement in Physical Education and Exercise Science, 2018
The aim of this study was to clarify the intra- and inter-rater reliability of the rate of force development in hip abductor muscle force measurements using a hand-held dynamometer. Thirty healthy adults were separately assessed by two independent raters on two separate days. Rate of force development was calculated from the slope of the…
Descriptors: Interrater Reliability, Human Body, Measurement Equipment, Handheld Devices
Peer reviewed
Menéndez-Varela, José-Luis; Gregori-Giralt, Eva – Assessment & Evaluation in Higher Education, 2018
Rubrics are widely used in higher education to assess performance in project-based learning environments. To date, the sources of error that may affect their reliability have not been studied in depth. Using generalisability theory as its starting-point, this article analyses the influence of the assessors and the criteria of the rubrics on the…
Descriptors: Scoring Rubrics, Student Projects, Active Learning, Reliability