ERIC - Search Results

Publication Date

In 2025	1
Since 2024	3
Since 2021 (last 5 years)	3
Since 2016 (last 10 years)	3
Since 2006 (last 20 years)	12

Descriptor

Error of Measurement	35
Reliability	35
Sampling	35
Statistical Analysis	13
Research Methodology	12
Correlation	9
Validity	9
Analysis of Variance	8
Research Design	8
Scores	7
Data Analysis	6
Sample Size	6
Surveys	6
Comparative Analysis	5
Data Collection	5
Hypothesis Testing	5
True Scores	5
Evaluation Methods	4
Higher Education	4
Item Analysis	4
Measurement Techniques	4
Statistical Bias	4
Computation	3
Factor Analysis	3
Generalization	3
More ▼

Source

Applied Measurement in…	3
Applied Psychological…	3
Educational and Psychological…	3
Assessment	1
Association for Institutional…	1
Canadian Journal of Program…	1
Developmental Psychology	1
Educational Testing Service	1
Journal of Experimental…	1
Multivariate Behavioral…	1
National Assessment Governing…	1
Online Submission	1
ProQuest LLC	1
Research Papers in Education	1
Research Quarterly	1
Teaching of Psychology	1
More ▼

Publication Type

Journal Articles	14
Reports - Research	14
Reports - Evaluative	8
Reports - Descriptive	4
Speeches/Meeting Papers	3
Tests/Questionnaires	3
Guides - Non-Classroom	2
Books	1
Collected Works - Serial	1
Dissertations/Theses -…	1

Education Level

Higher Education	2
Elementary Education	1
High Schools	1
Postsecondary Education	1
Secondary Education	1

Audience

Researchers	2
Students	1

Location

United States	2
Canada	1
New York	1
United Kingdom (England)	1
United Kingdom (Great Britain)	1

Laws, Policies, & Programs

Assessments and Surveys

Iowa Tests of Basic Skills	1
National Household Education…	1
Texas Assessment of Academic…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 35 results Save | Export

Brief Research Report: Effects of Sampling Error and Categorization on Estimation of Measure of Sampling Adequacy

Peer reviewed

Direct link

Hsin-Yun Lee; You-Lin Chen; Li-Jen Weng – Journal of Experimental Education, 2024

The second version of Kaiser's Measure of Sampling Adequacy (MSA[subscript 2]) has been widely applied to assess the factorability of data in psychological research. The MSA[subscript 2] is developed in the population and little is known about its behavior in finite samples. If estimated MSA[subscript 2]s are biased due to sampling errors,…

Descriptors: Error of Measurement, Reliability, Sampling, Statistical Bias

New Tests of Rater Drift in Trend Scoring

Peer reviewed

Direct link

John R. Donoghue; Carol Eckerly – Applied Measurement in Education, 2024

Trend scoring constructed response items (i.e. rescoring Time A responses at Time B) gives rise to two-way data that follow a product multinomial distribution rather than the multinomial distribution that is usually assumed. Recent work has shown that the difference in sampling model can have profound negative effects on statistics usually used to…

Descriptors: Scoring, Error of Measurement, Reliability, Scoring Rubrics

Linear and Nonlinear Indices of Score Accuracy and Item Effectiveness for Measures That Contain Locally Dependent Items

Peer reviewed

Direct link

Pere J. Ferrando; David Navarro-González; Fabia Morales-Vives – Educational and Psychological Measurement, 2025

The problem of local item dependencies (LIDs) is very common in personality and attitude measures, particularly in those that measure narrow-bandwidth dimensions. At the structural level, these dependencies can be modeled by using extended factor analytic (FA) solutions that include correlated residuals. However, the effects that LIDs have on the…

Descriptors: Scores, Accuracy, Evaluation Methods, Factor Analysis

Evaluating the Consistency of Angoff-Based Cut Scores Using Subsets of Items within a Generalizability Theory Framework

Peer reviewed

Direct link

Kannan, Priya; Sgammato, Adrienne; Tannenbaum, Richard J.; Katz, Irvin R. – Applied Measurement in Education, 2015

The Angoff method requires experts to view every item on the test and make a probability judgment. This can be time consuming when there are large numbers of items on the test. In this study, a G-theory framework was used to determine if a subset of items can be used to make generalizable cut-score recommendations. Angoff ratings (i.e.,…

Descriptors: Reliability, Standard Setting (Scoring), Cutting Scores, Test Items

An Investigation of Measurement Invariance of the Key Stage 2 National Curriculum Science Sampling Test in England

Peer reviewed

Direct link

He, Qingping; Anwyll, Steve; Glanville, Matthew; Opposs, Dennis – Research Papers in Education, 2014

Since 2010, the whole national cohort Key Stage 2 (KS2) National Curriculum test in science in England has been replaced with a sampling test taken by pupils at the age of 11 from a nationally representative sample of schools annually. The study reported in this paper compares the performance of different subgroups of the samples (classified by…

Descriptors: National Curriculum, Sampling, Foreign Countries, Factor Analysis

Correlation Attenuation Due to Measurement Error: A New Approach Using the Bootstrap Procedure

Peer reviewed

Direct link

Padilla, Miguel A.; Veprinsky, Anna – Educational and Psychological Measurement, 2012

Issues with correlation attenuation due to measurement error are well documented. More than a century ago, Spearman proposed a correction for attenuation. However, this correction has seen very little use since it can potentially inflate the true correlation beyond one. In addition, very little confidence interval (CI) research has been done for…

Descriptors: Correlation, Error of Measurement, Sampling, Statistical Inference

Sources of Score Scale Inconsistency. Research Report. ETS RR-11-10

Download full text

Haberman, Shelby J.; Dorans, Neil J. – Educational Testing Service, 2011

For testing programs that administer multiple forms within a year and across years, score equating is used to ensure that scores can be used interchangeably. In an ideal world, samples sizes are large and representative of populations that hardly change over time, and very reliable alternate test forms are built with nearly identical psychometric…

Descriptors: Scores, Reliability, Equated Scores, Test Construction

A Survey Data Quality Strategy: The Institutional Research Perspective. IR Applications, Volume 34

Download full text

Liu, Qin – Association for Institutional Research, 2012

This discussion constructs a survey data quality strategy for institutional researchers in higher education in light of total survey error theory. It starts with describing the characteristics of institutional research and identifying the gaps in literature regarding survey data quality issues in institutional research and then introduces the…

Descriptors: Institutional Research, Higher Education, Quality Control, Researchers

Reliability Generalization: An Examination of the Positive Affect and Negative Affect Schedule

Peer reviewed

Direct link

Leue, Anja; Lange, Sebastian – Assessment, 2011

The assessment of positive affect (PA) and negative affect (NA) by means of the Positive Affect and Negative Affect Schedule has received a remarkable popularity in the social sciences. Using a meta-analytic tool--namely, reliability generalization (RG)--population reliability scores of both scales have been investigated on the basis of a random…

Descriptors: Social Sciences, True Scores, Generalization, Affective Behavior

Attenuation of the Squared Canonical Correlation Coefficient under Varying Estimates of Score Reliability

Direct link

Wilson, Celia M. – ProQuest LLC, 2010

Research pertaining to the distortion of the squared canonical correlation coefficient has traditionally been limited to the effects of sampling error and associated correction formulas. The purpose of this study was to compare the degree of attenuation of the squared canonical correlation coefficient under varying conditions of score reliability.…

Descriptors: Monte Carlo Methods, Measurement, Multivariate Analysis, Error of Measurement

A Survey Data Quality Strategy: The Institutional Research Perspective

Download full text

Liu, Qin – Online Submission, 2009

This paper intends to construct a survey data quality strategy for institutional researchers in higher education in light of total survey error theory. It starts with describing the characteristics of institutional research and identifying the gaps in literature regarding survey data quality issues in institutional research. Then it is followed by…

Descriptors: Higher Education, Institutional Research, Quality Control, Researchers

Correction for Attenuation with Biased Reliability Estimates and Correlated Errors in Populations and Samples

Peer reviewed

Direct link

Zimmerman, Donald W. – Educational and Psychological Measurement, 2007

Properties of the Spearman correction for attenuation were investigated using Monte Carlo methods, under conditions where correlations between error scores exist as a population parameter and also where correlated errors arise by chance in random sampling. Equations allowing for all possible dependence among true and error scores on two tests at…

Descriptors: Monte Carlo Methods, Correlation, Sampling, Data Analysis

Scale Reliability, Cronbach's Coefficient Alpha, and Violations of Essential Tau-Equivalence with Fixed Congeneric Components.

Peer reviewed

Raykov, Tenko – Multivariate Behavioral Research, 1997

The population discrepancy between Cronbach's Coefficient Alpha (L. Cronbach, 1951) and scale reliability with fixed congeneric measure, uncorrelated errors, and sampling of subjects was studied. The difference is expressed in terms of the individual component violations of the assumption of equal tau-equivalence that is necessary and sufficient…

Descriptors: Error of Measurement, Reliability, Sampling, Scaling

Sample Characteristics and Measurement Reliability: An Empirical Exploration.

Download full text

Fan, Xitao; Yin, Ping – 2001

The literature on measurement reliability shows the consensus that group heterogeneity with regard to the trait being measured is a factor that affects the sample measurement reliability, but the degree of such effect is not entirely clear. Sample performance also has the potential to affect measurement reliability because of its effect on the…

Descriptors: Error of Measurement, Measurement Techniques, Reliability, Sample Size

A Nomogram to Assist in Planning Surveys of Small (N .tl. 2,000) Populations.

King, Harry A. – Research Quarterly, 1978

Some statistical considerations in applying survey sampling methods to small populations are explored. (DS)

Descriptors: Error of Measurement, Program Development, Reliability, Sampling

Previous Page | Next Page »

Pages: 1 | 2 | 3

Forsyth, Robert A.	2
Liu, Qin	2
Anwyll, Steve	1
Bartz, Albert E.	1
Bradshaw, Stephen C.	1
Brennan, Robert L.	1
Carol Eckerly	1
Conley, Valerie	1
David Navarro-González	1
Dorans, Neil J.	1
Evans, Brian	1
Fabia Morales-Vives	1
Fan, Xitao	1
Fink, Arlene	1
Fink, Steven	1
Gao, Xiaohong	1
Glanville, Matthew	1
Haberman, Shelby J.	1
Haertel, Edward H.	1
Hart, Roland J.	1
He, Qingping	1
Hedley, R. Alan	1
Hill, Susan	1
Hsin-Yun Lee	1
More ▼