ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	8
Since 2006 (last 20 years)	12

Descriptor

Error of Measurement	54
Statistical Analysis	54
Test Reliability	54
Mathematical Models	18
Comparative Analysis	12
True Scores	11
Scores	10
Correlation	9
Test Interpretation	9
Test Validity	9
Measurement Techniques	8
Analysis of Variance	7
Raw Scores	7
Item Analysis	6
Simulation	6
Academic Achievement	5
Criterion Referenced Tests	5
Goodness of Fit	5
Reading Tests	5
Sampling	5
Statistical Significance	5
Test Construction	5
Test Theory	5
Tests	5
Achievement Gains	4
More ▼

Source

Educational and Psychological…	6
Journal of Educational…	5
ETS Research Report Series	2
Grantee Submission	2
Measurement in Physical…	2
Psychometrika	2
ACT, Inc.	1
Applied Measurement in…	1
Applied Psychological…	1
Audio-Visual Language Journal	1
Behavioral Research and…	1
Brookings Papers on Education…	1
Canadian Journal of School…	1
Educ Psychol Meas	1
Educational Assessment	1
Journal of Speech and Hearing…	1
Measurement and Evaluation in…	1
Measurement and Evaluation in…	1
More ▼

Publication Type

Reports - Research	33
Journal Articles	17
Speeches/Meeting Papers	5
Numerical/Quantitative Data	3
Reports - Evaluative	3
Guides - General	2
Non-Print Media	1
Reference Materials -…	1
Reports - Descriptive	1

Education Level

Higher Education	3
Elementary Education	2
Elementary Secondary Education	2
Grade 5	2
Middle Schools	2
Grade 10	1
Grade 4	1
Grade 7	1
Grade 8	1
High Schools	1
Intermediate Grades	1
Junior High Schools	1
Postsecondary Education	1
Secondary Education	1
More ▼

Audience

Researchers

Location

Australia	1
California	1
Netherlands (Amsterdam)	1
North Carolina	1
South Carolina	1

Laws, Policies, & Programs

Assessments and Surveys

ACT Assessment	2
Advanced Placement…	1
Comprehensive Tests of Basic…	1
Early Childhood Longitudinal…	1
Metropolitan Achievement Tests	1
Wechsler Intelligence Scale…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 54 results Save | Export

Robust Coefficients Alpha and Omega and Confidence Intervals with Outlying Observations and Missing Data: Methods and Software

Peer reviewed

Direct link

Zhang, Zhiyong; Yuan, Ke-Hai – Educational and Psychological Measurement, 2016

Cronbach's coefficient alpha is a widely used reliability measure in social, behavioral, and education sciences. It is reported in nearly every study that involves measuring a construct through multiple items. With non-tau-equivalent items, McDonald's omega has been used as a popular alternative to alpha in the literature. Traditional estimation…

Descriptors: Computation, Statistical Analysis, Robustness (Statistics), Error of Measurement

Robust Coefficients Alpha and Omega and Confidence Intervals with Outlying Observations and Missing Data Methods and Software

Peer reviewed
PDF on ERIC

Download full text

Zhang, Zhiyong; Yuan, Ke-Hai – Grantee Submission, 2016

Descriptors: Computation, Error of Measurement, Robustness (Statistics), Statistical Analysis

The Reliability of a 5km Run Test on a Motorized Treadmill

Peer reviewed

Direct link

Driller, Matthew; Brophy-Williams, Ned; Walker, Anthony – Measurement in Physical Education and Exercise Science, 2017

The purpose of the present study was to determine the reliability of a 5km run test on a motorized treadmill. Over three consecutive weeks, 12 well-trained runners completed three 5km time trials on a treadmill following a standardized warm-up. Runners were partially-blinded to their running speed and distance covered. Total time to complete the…

Descriptors: Athletics, Physical Activities, Athletes, Test Reliability

Accuracy of a Classical Test Theory-Based Procedure for Estimating the Reliability of a Multistage Test. Research Report. ETS RR-17-02

Peer reviewed
PDF on ERIC

Download full text

Kim, Sooyeon; Livingston, Samuel A. – ETS Research Report Series, 2017

The purpose of this simulation study was to assess the accuracy of a classical test theory (CTT)-based procedure for estimating the alternate-forms reliability of scores on a multistage test (MST) having 3 stages. We generated item difficulty and discrimination parameters for 10 parallel, nonoverlapping forms of the complete 3-stage test and…

Descriptors: Accuracy, Test Theory, Test Reliability, Adaptive Testing

Inter-Rater and Test-Retest (Between-Sessions) Reliability of the 4-Skills Scan for Dutch Elementary School Children

Peer reviewed

Direct link

van Kernebeek, Willem G.; de Schipper, Antoine W.; Savelsbergh, Geert J. P.; Toussaint, Huub M. – Measurement in Physical Education and Exercise Science, 2018

In The Netherlands, the 4-Skills Scan is an instrument for physical education teachers to assess gross motor skills of elementary school children. Little is known about its reliability. Therefore, in this study the test-retest and inter-rater reliability was determined. Respectively, 624 and 557 Dutch 6- to 12-year-old children were analyzed for…

Descriptors: Foreign Countries, Interrater Reliability, Pretests Posttests, Psychomotor Skills

Reliability Estimates for Undergraduate Grade Point Average

Peer reviewed

Direct link

Westrick, Paul A. – Educational Assessment, 2017

Undergraduate grade point average (GPA) is a commonly employed measure in educational research, serving as a criterion or as a predictor depending on the research question. Over the decades, researchers have used a variety of reliability coefficients to estimate the reliability of undergraduate GPA, which suggests that there has been no consensus…

Descriptors: Undergraduate Students, Test Reliability, College Entrance Examinations, Longitudinal Studies

ACT Reporting Category Interpretation Guide: Version 1.0. ACT Working Paper 2016 (05)

Download full text

Powers, Sonya; Li, Dongmei; Suh, Hongwook; Harris, Deborah J. – ACT, Inc., 2016

ACT reporting categories and ACT Readiness Ranges are new features added to the ACT score reports starting in fall 2016. For each reporting category, the number correct score, the maximum points possible, the percent correct, and the ACT Readiness Range, along with an indicator of whether the reporting category score falls within the Readiness…

Descriptors: Scores, Classification, College Entrance Examinations, Error of Measurement

The Role of Multiple-Group Measurement Invariance in Family Psychology Research

Peer reviewed
PDF on ERIC

Download full text

Direct link

Kern, Justin L.; McBride, Brent A.; Laxman, Daniel J.; Dyer, W. Justin; Santos, Rosa M.; Jeans, Laurie M. – Grantee Submission, 2016

Measurement invariance (MI) is a property of measurement that is often implicitly assumed, but in many cases, not tested. When the assumption of MI is tested, it generally involves determining if the measurement holds longitudinally or cross-culturally. A growing literature shows that other groupings can, and should, be considered as well.…

Descriptors: Psychology, Measurement, Error of Measurement, Measurement Objectives

An Application of Generalizability Theory to Evaluate the Technical Quality of an Alternate Assessment

Peer reviewed

Direct link

Taylor, Melinda Ann; Pastor, Dena A. – Applied Measurement in Education, 2013

Although federal regulations require testing students with severe cognitive disabilities, there is little guidance regarding how technical quality should be established. It is known that challenges exist with documentation of the reliability of scores for alternate assessments. Typical measures of reliability do little in modeling multiple sources…

Descriptors: Generalizability Theory, Alternative Assessment, Test Reliability, Scores

Analyzing the Reliability of the easyCBM Reading Comprehension Measures: Grade 7. Technical Report #1206

Download full text

Irvin, P. Shawn; Alonzo, Julie; Lai, Cheng-Fei; Park, Bitnara Jasmine; Tindal, Gerald – Behavioral Research and Teaching, 2012

In this technical report, we present the results of a reliability study of the seventh-grade multiple choice reading comprehension measures available on the easyCBM learning system conducted in the spring of 2011. Analyses include split-half reliability, alternate form reliability, person and item reliability as derived from Rasch analysis,…

Descriptors: Reading Comprehension, Testing Programs, Statistical Analysis, Grade 7

Administration and Scoring Errors of Graduate Students Learning the WISC-IV: Issues and Controversies

Peer reviewed

Direct link

Mrazik, Martin; Janzen, Troy M.; Dombrowski, Stefan C.; Barford, Sean W.; Krawchuk, Lindsey L. – Canadian Journal of School Psychology, 2012

A total of 19 graduate students enrolled in a graduate course conducted 6 consecutive administrations of the Wechsler Intelligence Scale for Children, 4th edition (WISC-IV, Canadian version). Test protocols were examined to obtain data describing the frequency of examiner errors, including administration and scoring errors. Results identified 511…

Descriptors: Intelligence Tests, Intelligence, Statistical Analysis, Scoring

Three Approximations of Standard Error of Measurement: An Empirical Approach.

PDF pending restoration

Garvin, Alfred D. – 1976

Three successively simpler formulas for approximating the standard error of measurement were derived by applying successively more simplifying assumptions to the standard formula based on the standard deviation and the Kuder-Richardson formula 20 estimate of reliability. The accuracy of each of these three formulas, with respect to the standard…

Descriptors: Error of Measurement, Statistical Analysis, Test Reliability

Test Length and the Standard Error of Measurement

Peer reviewed

Gardner, P. L. – Journal of Educational Measurement, 1970

Descriptors: Error of Measurement, Mathematical Models, Statistical Analysis, Test Reliability

The Accuracy of Three Approximations for Test Reliability.

Download full text

Kleinke, David J. – 1976

Data from 200 college-level tests were used to compare three reliability approximations (two of Saupe and one of Cureton) to Kuder-Richardson Formula 20 (KR20). While the approximations correlated highly (about .9) with the reliability estimate, they tended to be underapproximations. The explanation lies in an apparent bias of Lord's approximation…

Descriptors: Comparative Analysis, Correlation, Error of Measurement, Statistical Analysis

Properties of a Proposed Approximation to the Standard Error of Measurement.

Download full text

Nitko, Anthony J. – 1982

An approximation formula for the standard error of measurement was recently proposed by Garvin. The properties of this approximation to the standard error of measurement are described in this paper and illustrated with hypothetical data. It is concluded that the approximation is a systematic overestimate of the standard error of measurement…

Descriptors: Error of Measurement, Estimation (Mathematics), Mathematical Formulas, Statistical Analysis

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4

Bashaw, W. L.	2
Cureton, Edward E.	2
Feldt, Leonard S.	2
Harris, Chester W.	2
Huynh, Huynh	2
Rentz, R. Robert	2
Shoemaker, David M.	2
Yuan, Ke-Hai	2
Zhang, Zhiyong	2
Alonzo, Julie	1
Barford, Sean W.	1
Barker, Pierce	1
Belfry, M. Joan	1
Benson, Jeri	1
Brennan, Robert L.	1
Bridgeman, Brent	1
Brophy-Williams, Ned	1
Charter, Richard A.	1
Crocker, A. C.	1
Cummings, Oliver W.	1
Dombrowski, Stefan C.	1
Driller, Matthew	1
Dunivant, Noel	1
Dyer, W. Justin	1
More ▼