ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	6
Since 2006 (last 20 years)	12

Descriptor

Computation	15
Error of Measurement	15
Test Reliability	15
Scores	6
Test Items	5
Intervals	4
Item Response Theory	4
Measurement Techniques	4
Test Construction	4
Data Analysis	3
Interrater Reliability	3
Psychometrics	3
Academic Standards	2
Achievement Gap	2
Achievement Rating	2
Cutting Scores	2
Definitions	2
Educational Practices	2
English	2
Essay Tests	2
Goodness of Fit	2
Mathematical Applications	2
Mathematics Achievement	2
Measures (Individuals)	2
Nonparametric Statistics	2
More ▼

Source

Educational and Psychological…	3
Grantee Submission	2
New Mexico Public Education…	2
International Journal of…	1
Journal of Chemical Education	1
Journal of Educational and…	1
Measurement and Evaluation in…	1
Measurement:…	1
Psychological Methods	1
Research Papers in Education	1

Author

Ho, Andrew D.	2
Reardon, Sean F.	2
Yuan, Ke-Hai	2
Zhang, Zhiyong	2
Bardhoshi, Gerta	1
Bramley, Tom	1
Dhawan, Vikas	1
Enders, Craig K.	1
Erford, Bradley T.	1
Griph, Gerald W.	1
Harshman, Jordan	1
Lee, Yi-Hsuan	1
Marcoulides, George A.	1
Rae, Gordon	1
Raykov, Tenko	1
Rentz, R. Robert	1
Yezierski, Ellen	1
Zhang, Jinming	1
More ▼

Publication Type

Journal Articles	12
Reports - Research	8
Reports - Descriptive	4
Numerical/Quantitative Data	2
Reports - Evaluative	2
Guides - Non-Classroom	1

Education Level

Elementary Secondary Education	2
Secondary Education	2
High Schools	1

Audience

Location

New Mexico	2
Georgia	1
United Kingdom (England)	1

Laws, Policies, & Programs

Assessments and Surveys

New Jersey College Basic…

What Works Clearinghouse Rating

Showing all 15 results Save | Export

Evaluating the Discrepancy between Scale Reliability and Cronbach's Coefficient Alpha Using Latent Variable Modeling

Peer reviewed

Direct link

Raykov, Tenko; Marcoulides, George A. – Measurement: Interdisciplinary Research and Perspectives, 2023

This article outlines a readily applicable procedure for point and interval estimation of the population discrepancy between reliability and the popular Cronbach's coefficient alpha for unidimensional multi-component measuring instruments with uncorrelated errors, which are widely used in behavioral and social research. The method is developed…

Descriptors: Measurement, Test Reliability, Measurement Techniques, Error of Measurement

Robust Coefficients Alpha and Omega and Confidence Intervals with Outlying Observations and Missing Data: Methods and Software

Peer reviewed

Direct link

Zhang, Zhiyong; Yuan, Ke-Hai – Educational and Psychological Measurement, 2016

Cronbach's coefficient alpha is a widely used reliability measure in social, behavioral, and education sciences. It is reported in nearly every study that involves measuring a construct through multiple items. With non-tau-equivalent items, McDonald's omega has been used as a popular alternative to alpha in the literature. Traditional estimation…

Descriptors: Computation, Statistical Analysis, Robustness (Statistics), Error of Measurement

Robust Coefficients Alpha and Omega and Confidence Intervals with Outlying Observations and Missing Data Methods and Software

Peer reviewed
PDF on ERIC

Download full text

Zhang, Zhiyong; Yuan, Ke-Hai – Grantee Submission, 2016

Descriptors: Computation, Error of Measurement, Robustness (Statistics), Statistical Analysis

Processes and Procedures for Estimating Score Reliability and Precision

Peer reviewed

Direct link

Bardhoshi, Gerta; Erford, Bradley T. – Measurement and Evaluation in Counseling and Development, 2017

Precision is a key facet of test development, with score reliability determined primarily according to the types of error one wants to approximate and demonstrate. This article identifies and discusses several primary forms of reliability estimation: internal consistency (i.e., split-half, KR-20, a), test-retest, alternate forms, interscorer, and…

Descriptors: Scores, Test Reliability, Accuracy, Pretests Posttests

Effects of Differential Item Functioning on Examinees' Test Performance and Reliability of Test

Peer reviewed

Direct link

Lee, Yi-Hsuan; Zhang, Jinming – International Journal of Testing, 2017

Simulations were conducted to examine the effect of differential item functioning (DIF) on measurement consequences such as total scores, item response theory (IRT) ability estimates, and test reliability in terms of the ratio of true-score variance to observed-score variance and the standard error of estimation for the IRT ability parameter. The…

Descriptors: Test Bias, Test Reliability, Performance, Scores

Test-Retest Reliability of the Adaptive Chemistry Assessment Survey for Teachers: Measurement Error and Alternatives to Correlation

Peer reviewed

Direct link

Harshman, Jordan; Yezierski, Ellen – Journal of Chemical Education, 2016

Determining the error of measurement is a necessity for researchers engaged in bench chemistry, chemistry education research (CER), and a multitude of other fields. Discussions regarding what constructs measurement error entails and how to best measure them have occurred, but the critiques about traditional measures have yielded few alternatives.…

Descriptors: Science Instruction, Chemistry, Error of Measurement, Psychometrics

Practical Issues in Estimating Achievement Gaps from Coarsened Data

Peer reviewed

Direct link

Reardon, Sean F.; Ho, Andrew D. – Journal of Educational and Behavioral Statistics, 2015

In an earlier paper, we presented methods for estimating achievement gaps when test scores are coarsened into a small number of ordered categories, preventing fine-grained distinctions between individual scores. We demonstrated that gaps can nonetheless be estimated with minimal bias across a broad range of simulated and real coarsened data…

Descriptors: Achievement Gap, Performance Factors, Educational Practices, Scores

Practical Issues in Estimating Achievement Gaps from Coarsened Data

Peer reviewed
PDF on ERIC

Download full text

Direct link

Reardon, Sean F.; Ho, Andrew D. – Grantee Submission, 2015

Ho and Reardon (2012) present methods for estimating achievement gaps when test scores are coarsened into a small number of ordered categories, preventing fine-grained distinctions between individual scores. They demonstrate that gaps can nonetheless be estimated with minimal bias across a broad range of simulated and real coarsened data…

Descriptors: Achievement Gap, Performance Factors, Educational Practices, Scores

Problems in Estimating Composite Reliability of "Unitised" Assessments

Peer reviewed

Direct link

Bramley, Tom; Dhawan, Vikas – Research Papers in Education, 2013

This paper discusses the issues involved in calculating indices of composite reliability for "modular" or "unitised" assessments of the kind used in GCSEs, AS and A level examinations in England. The increasingly widespread use of on-screen marking has meant that the item-level data required for calculating indices of…

Descriptors: Foreign Countries, Exit Examinations, Secondary Education, Test Reliability

A Note on Using Stratified Alpha to Estimate the Composite Reliability of a Test Composed of Interrelated Nonhomogeneous Items

Peer reviewed

Direct link

Rae, Gordon – Psychological Methods, 2007

The relationship between stratified alpha (alpha-sub(s)) and the reliability of a test composed of interrelated nonhomogeneous items is examined. It is mathematically demonstrated that when there is congeneric equivalence within the strata or subtests, the difference between the coefficients is a function of the variances of the loadings within…

Descriptors: Test Reliability, Test Items, Computation, Error of Measurement

Rules of Thumb for Estimating Reliability Coefficients Using Generalizability Theory.

Peer reviewed

Rentz, R. Robert – Educational and Psychological Measurement, 1980

This paper elaborates on the work of Cardinet, and others, by clarifying some points regarding calculations, specifically with reference to existing computer programs, and by presenting illustrative examples of the calculation and interpretation of several generalizability coefficients from a complex six-facet (factor) design. (Author/RL)

Descriptors: Analysis of Variance, Computation, Computer Programs, Error of Measurement

The Impact of Missing Data on Sample Reliability Estimates: Implications for Reliability Reporting Practices

Peer reviewed

Direct link

Enders, Craig K. – Educational and Psychological Measurement, 2004

A method for incorporating maximum likelihood (ML) estimation into reliability analyses with item-level missing data is outlined. An ML estimate of the covariance matrix is first obtained using the expectation maximization (EM) algorithm, and coefficient alpha is subsequently computed using standard formulae. A simulation study demonstrated that…

Descriptors: Intervals, Simulation, Test Reliability, Computation

New Mexico Standards-Based Assessment Technical Report: Spring 2007 Administration

Download full text

New Mexico Public Education Department, 2007

The purpose of the NMSBA technical report is to provide users and other interested parties with a general overview of and technical characteristics of the 2007 NMSBA. The 2007 technical report contains the following information: (1) Test development; (2) Scoring procedures; (3) Summary of student performance; (4) Statistical analyses of item and…

Descriptors: Interrater Reliability, Standard Setting, Measures (Individuals), Scoring

Interpreting Scores on the New Jersey College Basic Skills Placement Test.

New Jersey Basic Skills Council, Trenton. – 1983

The New Jersey College Basic Skills Placement Test (NJCBSPT) is designed to measure basic reading, writing, and mathematics skills of students entering New Jersey colleges. The test consists of five sections: Essay, Reading Comprehension, Sentence Sense, Mathematical Computation, and Elementary Algebra. The test is intended to answer the question…

Descriptors: Algebra, Basic Skills, College Entrance Examinations, Computation

New Mexico Standards Based Assessment (NMSBA) Technical Report: 2006 Spring Administration

Download full text

Griph, Gerald W. – New Mexico Public Education Department, 2006

The purpose of the NMSBA technical report is to provide users and other interested parties with a general overview of and technical characteristics of the 2006 NMSBA. The 2006 technical report contains the following information: (1) Test development; (2) Scoring procedures; (3) Calibration, scaling, and equating procedures; (4) Standard setting;…

Descriptors: Interrater Reliability, Standard Setting, Measures (Individuals), Scoring