Showing all 15 results
Peer reviewed
Direct link
Jiaying Xiao; Chun Wang; Gongjun Xu – Grantee Submission, 2024
Accurate item parameters and standard errors (SEs) are crucial for many multidimensional item response theory (MIRT) applications. A recent study proposed the Gaussian Variational Expectation Maximization (GVEM) algorithm to improve computational efficiency and estimation accuracy (Cho et al., 2021). However, the SE estimation procedure has yet to…
Descriptors: Error of Measurement, Models, Evaluation Methods, Item Analysis
Peer reviewed
Direct link
Pere J. Ferrando; David Navarro-González; Fabia Morales-Vives – Educational and Psychological Measurement, 2025
The problem of local item dependencies (LIDs) is very common in personality and attitude measures, particularly in those that measure narrow-bandwidth dimensions. At the structural level, these dependencies can be modeled by using extended factor analytic (FA) solutions that include correlated residuals. However, the effects that LIDs have on the…
Descriptors: Scores, Accuracy, Evaluation Methods, Factor Analysis
Peer reviewed
Direct link
Pavlov, Goran; Maydeu-Olivares, Alberto; Shi, Dexin – Educational and Psychological Measurement, 2021
We examine the accuracy of p values obtained using the asymptotic mean and variance (MV) correction to the distribution of the sample standardized root mean squared residual (SRMR) proposed by Maydeu-Olivares to assess the exact fit of SEM models. In a simulation study, we found that under normality, the MV-corrected SRMR statistic provides…
Descriptors: Structural Equation Models, Goodness of Fit, Simulation, Error of Measurement
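The SRMR itself is simple to compute once observed and model-implied correlation matrices are in hand. The sketch below shows only the basic statistic, with made-up matrices, and does not implement the asymptotic mean-and-variance (MV) correction evaluated in the article.

```python
# Minimal sketch: sample SRMR from observed vs. model-implied correlations.
# Matrix values are illustrative, not from the study.
import numpy as np

observed = np.array([[1.00, 0.45, 0.30],
                     [0.45, 1.00, 0.55],
                     [0.30, 0.55, 1.00]])
implied = np.array([[1.00, 0.50, 0.35],
                    [0.50, 1.00, 0.50],
                    [0.35, 0.50, 1.00]])

# Residuals over the unique (lower-triangular, off-diagonal) correlations;
# when covariances are analyzed, the diagonal is usually included as well.
idx = np.tril_indices_from(observed, k=-1)
residuals = observed[idx] - implied[idx]
srmr = np.sqrt(np.mean(residuals ** 2))
print(f"SRMR = {srmr:.4f}")
```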
Peer reviewed
Direct link
Kopp, Jason P.; Jones, Andrew T. – Applied Measurement in Education, 2020
Traditional psychometric guidelines suggest that at least several hundred respondents are needed to obtain accurate parameter estimates under the Rasch model. However, recent research indicates that Rasch equating results in accurate parameter estimates with sample sizes as small as 25. Item parameter drift under the Rasch model has been…
Descriptors: Item Response Theory, Psychometrics, Sample Size, Sampling
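For context, small-sample Rasch equating is often carried out as a simple mean shift on the common-item difficulties. The sketch below illustrates that mean/mean idea with made-up calibrations; it does not reproduce the drift conditions examined in the article.

```python
# Minimal sketch of Rasch common-item equating via a mean shift.
# Item difficulties are illustrative; in practice they come from Rasch calibrations.
import numpy as np

b_old_common = np.array([-0.8, 0.1, 0.6, 1.2])   # common items, old-form calibration
b_new_common = np.array([-0.5, 0.4, 0.9, 1.5])   # same items, new-form calibration

# Equating constant: difference in mean difficulty of the common items.
shift = np.mean(b_old_common) - np.mean(b_new_common)

b_new_all = np.array([-1.0, -0.5, 0.4, 0.9, 1.5, 2.0])
b_new_on_old_scale = b_new_all + shift           # new-form items placed on the old scale
print(f"shift = {shift:.3f}", b_new_on_old_scale)
```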
Bramley, Tom – Research Matters, 2020
The aim of this study was to compare, by simulation, the accuracy of mapping a cut-score from one test to another by expert judgement (using the Angoff method) versus by a small-sample equating method (chained linear equating). As expected, the standard-setting method resulted in more accurate equating when we assumed a higher level…
Descriptors: Cutting Scores, Standard Setting (Scoring), Equated Scores, Accuracy
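A hedged sketch of the equating side of that comparison, using placeholder summary statistics rather than anything from the study: form X is linked linearly to the anchor in the group that took X, then the anchor is linked to form Y in the group that took Y, and the two links are chained to map a cut-score.

```python
# Minimal sketch of chained linear equating through an anchor test.
# All summary statistics are illustrative placeholders.
def linear_link(x, mu_from, sd_from, mu_to, sd_to):
    """Linear equating function: match means and standard deviations."""
    return mu_to + (sd_to / sd_from) * (x - mu_from)

# Group 1 takes new form X and anchor A; group 2 takes old form Y and anchor A.
mu_X, sd_X, mu_A1, sd_A1 = 30.0, 6.0, 15.0, 3.0
mu_A2, sd_A2, mu_Y, sd_Y = 14.0, 3.2, 28.0, 5.5

def chained_linear(x):
    a = linear_link(x, mu_X, sd_X, mu_A1, sd_A1)      # X -> anchor scale (group 1)
    return linear_link(a, mu_A2, sd_A2, mu_Y, sd_Y)   # anchor -> Y scale (group 2)

cut_on_X = 33
print(f"cut-score mapped to form Y: {chained_linear(cut_on_X):.2f}")
```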
Peer reviewed
PDF on ERIC (download full text)
Suero, Manuel; Privado, Jesús; Botella, Juan – Psicologica: International Journal of Methodology and Experimental Psychology, 2017
A simulation study is presented to evaluate and compare three methods for estimating the variance of the estimates of the signal detection theory (SDT) parameters d' and c. Several methods have been proposed to calculate the variance of their estimators, "d'" and "c." Those methods have been mostly assessed by…
Descriptors: Evaluation Methods, Theories, Simulation, Statistical Analysis
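The point estimates themselves are straightforward. The sketch below computes d' and c from illustrative hit and false-alarm counts; it does not implement any of the three variance estimators compared in the study.

```python
# Minimal sketch: point estimates of d' and c from hit and false-alarm rates.
# Trial counts are illustrative.
from scipy.stats import norm

hits, misses = 45, 5                 # signal trials
false_alarms, correct_rej = 12, 38   # noise trials

H = hits / (hits + misses)           # hit rate
F = false_alarms / (false_alarms + correct_rej)  # false-alarm rate

d_prime = norm.ppf(H) - norm.ppf(F)          # sensitivity
c = -0.5 * (norm.ppf(H) + norm.ppf(F))       # response criterion
print(f"d' = {d_prime:.3f}, c = {c:.3f}")
```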
Peer reviewed
Direct link
Finch, William Holmes; Hernandez Finch, Maria E. – AERA Online Paper Repository, 2017
High-dimensional multivariate data, where the number of variables approaches or exceeds the sample size, are an increasingly common occurrence for social scientists. Several tools exist for dealing with such data in the context of univariate regression, including regularization methods such as the lasso, elastic net, and ridge regression, as well as the…
Descriptors: Multivariate Analysis, Regression (Statistics), Sampling, Sample Size
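A minimal scikit-learn sketch of the regularization methods named in the abstract, fitted to simulated data with more predictors than observations; the penalty settings and data-generating model are illustrative assumptions, not values from the paper.

```python
# Minimal sketch: lasso, ridge, and elastic net on a p > n regression problem.
import numpy as np
from sklearn.linear_model import Lasso, Ridge, ElasticNet

rng = np.random.default_rng(0)
n, p = 50, 200                       # more predictors than observations
X = rng.normal(size=(n, p))
beta = np.zeros(p)
beta[:5] = 1.0                       # only 5 predictors truly matter
y = X @ beta + rng.normal(scale=0.5, size=n)

for model in (Lasso(alpha=0.1), Ridge(alpha=1.0), ElasticNet(alpha=0.1, l1_ratio=0.5)):
    model.fit(X, y)
    nonzero = np.sum(np.abs(model.coef_) > 1e-6)
    print(f"{type(model).__name__}: {nonzero} nonzero coefficients")
```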
Peer reviewed
Direct link
Zakszeski, Brittany N.; Hojnoski, Robin L.; Wood, Brenna K. – Topics in Early Childhood Special Education, 2017
Classroom engagement is important to young children's academic and social development. Accurate methods of capturing this behavior are needed to inform and evaluate intervention efforts. This study compared the accuracy of interval durations (i.e., 5 s, 10 s, 15 s, 20 s, 30 s, and 60 s) of momentary time sampling (MTS) in approximating the…
Descriptors: Intervals, Time, Sampling, Learner Engagement
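A small simulation sketch of the kind of comparison described: a bout-structured engagement stream is sampled with MTS at each interval length listed, and the estimate is compared with the true proportion of time engaged. The transition probabilities and session length are assumptions, not values from the study.

```python
# Minimal sketch: MTS estimates of engagement at several interval lengths.
import numpy as np

rng = np.random.default_rng(1)
seconds = 1800                       # a 30-minute observation
engaged, stream = True, []
for _ in range(seconds):             # simple two-state process so behavior occurs in bouts
    stream.append(engaged)
    engaged = (rng.random() > 0.02) if engaged else (rng.random() < 0.05)
stream = np.array(stream)
true_prop = stream.mean()

for interval in (5, 10, 15, 20, 30, 60):
    sample_points = np.arange(interval - 1, seconds, interval)  # final second of each interval
    mts_estimate = stream[sample_points].mean()
    print(f"{interval:>2}s MTS: {mts_estimate:.3f}  (true proportion {true_prop:.3f})")
```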
Peer reviewed
Direct link
Ledford, Jennifer R.; Ayres, Kevin M.; Lane, Justin D.; Lam, Man Fung – Journal of Special Education, 2015
Momentary time sampling (MTS), whole interval recording (WIR), and partial interval recording (PIR) are commonly used in applied research. We discuss potential difficulties with analyzing data when these systems are used and present results from a pilot simulation study designed to determine the extent to which these issues are likely to be…
Descriptors: Intervals, Research Methodology, Sampling, Time
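For readers unfamiliar with the three systems, the sketch below scores the same simulated behavior stream under each rule using 10-second intervals. It illustrates the usual pattern (whole-interval recording underestimates, partial-interval recording overestimates, MTS is roughly unbiased) and is not the authors' pilot simulation.

```python
# Minimal sketch of the three interval-scoring rules on one behavior stream.
import numpy as np

rng = np.random.default_rng(2)
stream = rng.random(600) < 0.4       # True = behavior occurring that second (illustrative)
interval = 10
blocks = stream.reshape(-1, interval)

mts = blocks[:, -1].mean()           # MTS: behavior at the final moment of each interval
wir = blocks.all(axis=1).mean()      # WIR: behavior throughout the entire interval
pir = blocks.any(axis=1).mean()      # PIR: behavior at any point in the interval

print(f"true = {stream.mean():.2f}, MTS = {mts:.2f}, WIR = {wir:.2f}, PIR = {pir:.2f}")
```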
Peer reviewed
Direct link
Goedeme, Tim – Social Indicators Research, 2013
If estimates are based on samples, they should be accompanied by appropriate standard errors and confidence intervals. This is true for scientific research in general, and is even more important if estimates are used to inform and evaluate policy measures such as those aimed at attaining the Europe 2020 poverty reduction target. In this article I…
Descriptors: Foreign Countries, Poverty, Social Isolation, Social Indicators
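A minimal sketch of the point being made: a proportion estimated from a sample should be reported with a standard error and confidence interval, and complex survey designs typically inflate the simple-random-sampling SE. The estimate, sample size, and design effect below are assumptions for illustration only.

```python
# Minimal sketch: SE and 95% CI for an estimated poverty rate.
import math

p_hat, n = 0.17, 12000               # illustrative estimate and sample size
se_srs = math.sqrt(p_hat * (1 - p_hat) / n)   # SE under simple random sampling

deff = 2.0                           # assumed design effect for clustering and weighting
se_complex = se_srs * math.sqrt(deff)

z = 1.96
ci = (p_hat - z * se_complex, p_hat + z * se_complex)
print(f"SE = {se_complex:.4f}, 95% CI = ({ci[0]:.3f}, {ci[1]:.3f})")
```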
Whiteley, Sonia – Online Submission, 2014
Total Survey Error (TSE) is a component of Total Survey Quality (TSQ) that supports the assessment of the extent to which a survey is "fit-for-purpose". While TSQ looks at a number of dimensions, such as relevance, credibility and accessibility, TSE has a more operational focus on accuracy and minimising errors. Mitigating survey…
Descriptors: Surveys, Accuracy, Institutional Research, Case Studies
Peer reviewed
PDF on ERIC (download full text)
Citkowicz, Martyna; Hedges, Larry V. – Society for Research on Educational Effectiveness, 2013
In some instances, intentionally or not, study designs are such that there is clustering in one group but not in the other. This paper describes methods for computing effect size estimates and their variances when there is clustering in only one group and the analysis has not taken that clustering into account. The authors provide the effect size…
Descriptors: Multivariate Analysis, Effect Size, Sampling, Sample Size
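Very roughly, the issue is that the variance of a standardized mean difference must be inflated when one group's observations are clustered. The sketch below applies a textbook design-effect inflation to the clustered arm only, as a stand-in for the general idea; it does not reproduce the authors' effect-size and variance formulas, and every number in it is an assumption.

```python
# Rough illustrative sketch (not the authors' formulas): standardized mean
# difference with a design-effect variance inflation for the clustered group only.
import math

m_t, m_c = 105.0, 100.0              # illustrative group means
s_pooled = 15.0
n_t, n_c = 200, 200
icc, cluster_size = 0.10, 20         # assumed intraclass correlation and cluster size

d = (m_t - m_c) / s_pooled

deff_t = 1 + (cluster_size - 1) * icc          # design effect for the clustered arm
var_d = deff_t / n_t + 1 / n_c + d**2 / (2 * (n_t + n_c))
print(f"d = {d:.3f}, SE = {math.sqrt(var_d):.3f}")
```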
Haberman, Shelby J. – Educational Testing Service, 2010
Sampling errors limit the accuracy with which forms can be linked. Limitations on accuracy are especially important in testing programs in which a very large number of forms are employed. Standard inequalities in mathematical statistics may be used to establish lower bounds on the achievable linking accuracy. To illustrate results, a variety of…
Descriptors: Testing Programs, Equated Scores, Sampling, Accuracy
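One concrete version of such a bound: under a random-groups design, a mean-equating constant is a difference of two sample means, so its standard error cannot fall below the usual two-sample expression for the given sample sizes. The values below are placeholders, and the sketch is not the report's derivation.

```python
# Minimal sketch: sampling-error floor on a mean-equating constant.
# Score standard deviations and sample sizes are illustrative.
import math

sd_x, sd_y = 10.0, 10.0
n_x, n_y = 500, 500
se_constant = math.sqrt(sd_x**2 / n_x + sd_y**2 / n_y)
print(f"SE of the equating constant is about {se_constant:.3f} score points")
```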
Peer reviewed
PDF on ERIC (download full text)
Oranje, Andreas; Li, Deping; Kandathil, Mathew – ETS Research Report Series, 2009
Several complex sample standard error estimators based on linearization and resampling for the latent regression model of the National Assessment of Educational Progress (NAEP) are studied with respect to design choices such as number of items, number of regressors, and the efficiency of the sample. This paper provides an evaluation of the extent…
Descriptors: Error of Measurement, Computation, Regression (Statistics), National Competency Tests
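As a generic illustration of the resampling side of that comparison, the sketch below computes a delete-one jackknife standard error for an ordinary regression slope on simulated data. NAEP's replicate-weight jackknife and latent regression model are far more involved and are not reproduced here.

```python
# Minimal sketch: delete-one jackknife SE for a regression slope.
import numpy as np

rng = np.random.default_rng(3)
n = 100
x = rng.normal(size=n)
y = 0.5 * x + rng.normal(size=n)

def slope(x, y):
    return np.polyfit(x, y, 1)[0]    # slope of the least-squares line

full = slope(x, y)
reps = np.array([slope(np.delete(x, i), np.delete(y, i)) for i in range(n)])
se_jack = np.sqrt((n - 1) / n * np.sum((reps - reps.mean()) ** 2))
print(f"slope = {full:.3f}, jackknife SE = {se_jack:.3f}")
```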
Peer reviewed
PDF on ERIC (download full text)
Kim, Sooyeon; Livingston, Samuel A. – ETS Research Report Series, 2009
A series of resampling studies was conducted to compare the accuracy of equating in a common item design using four different methods: chained equipercentile equating of smoothed distributions, chained linear equating, chained mean equating, and the circle-arc method. Four operational test forms, each containing more than 100 items, were used for…
Descriptors: Sampling, Sample Size, Accuracy, Test Items
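Chained mean equating, the simplest of the four methods compared, is sketched below with placeholder means. A resampling study of the kind described would repeat such an equating on many small samples drawn from the operational data and compare the results with a large-sample criterion equating; that loop is not shown here.

```python
# Minimal sketch of chained mean equating through an anchor test.
# Means are illustrative placeholders.
import numpy as np

def mean_link(x, mu_from, mu_to):
    """Mean equating: shift scores by the difference in means."""
    return x + (mu_to - mu_from)

# Group 1 takes new form X and anchor A; group 2 takes old form Y and anchor A.
mu_X, mu_A1 = 62.0, 31.0
mu_A2, mu_Y = 30.0, 60.0

def chained_mean(x):
    a = mean_link(x, mu_X, mu_A1)     # X -> anchor scale (group 1)
    return mean_link(a, mu_A2, mu_Y)  # anchor -> Y scale (group 2)

print(chained_mean(np.array([50, 62, 75])))
```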