ERIC - Search Results

Publication Date

In 2025	3
Since 2024	8
Since 2021 (last 5 years)	11
Since 2016 (last 10 years)	19
Since 2006 (last 20 years)	33

Descriptor

Error of Measurement	39
Evaluation Methods	39
Statistical Bias	39
Simulation	17
Sample Size	9
Statistical Analysis	8
Accuracy	7
Item Response Theory	7
Research Methodology	7
Research Problems	7
Sampling	7
Comparative Analysis	6
Achievement Gains	5
Classification	5
Educational Assessment	5
Models	5
Research Reports	5
Statistical Inference	5
Academic Achievement	4
Correlation	4
Educational Policy	4
Evaluation Problems	4
Maximum Likelihood Statistics	4
Monte Carlo Methods	4
Predictor Variables	4
More ▼

Publication Type

Journal Articles	25
Reports - Research	22
Reports - Evaluative	13
Speeches/Meeting Papers	3
Dissertations/Theses -…	2
Reports - Descriptive	2
Opinion Papers	1
Tests/Questionnaires	1

Education Level

Elementary Secondary Education	3
Higher Education	3
Adult Education	1
Elementary Education	1
High Schools	1
Postsecondary Education	1
Secondary Education	1

Audience

Practitioners

Location

Asia	1
Australia	1
Netherlands	1
New York	1
Ohio	1

Laws, Policies, & Programs

Elementary and Secondary…

Assessments and Surveys

Program for International…	1
Schools and Staffing Survey…	1
Trends in International…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 39 results Save | Export

Estimation of Finite Population Variance under Stratified Sampling in the Presence of Measurement Errors

Peer reviewed

Direct link

Abdul Haq; Muhammad Usman; Manzoor Khan – Measurement: Interdisciplinary Research and Perspectives, 2024

Measurement errors may significantly distort the properties of an estimator. In this paper, estimators of the finite population variance using the information on first and second raw moments of the study variable are developed under stratified random sampling that incorporate the variance of a measurement error component. Additionally, combined…

Descriptors: Sampling, Error of Measurement, Evaluation Methods, Statistical Bias

The Impact of "Negligible" Cross-Loadings in Investigations of Measurement Invariance with MGCFA and MGESEM

Peer reviewed

Direct link

Timothy R. Konold; Elizabeth A. Sanders; Kelvin Afolabi – Structural Equation Modeling: A Multidisciplinary Journal, 2025

Measurement invariance (MI) is an essential part of validity evidence concerned with ensuring that tests function similarly across groups, contexts, and time. Most evaluations of MI involve multigroup confirmatory factor analyses (MGCFA) that assume simple structure. However, recent research has shown that constraining non-target indicators to…

Descriptors: Evaluation Methods, Error of Measurement, Validity, Monte Carlo Methods

Bias-Adjusted Three-Step Multilevel Latent Class Modeling with Covariates

Peer reviewed

Direct link

Johan Lyrvall; Zsuzsa Bakk; Jennifer Oser; Roberto Di Mari – Structural Equation Modeling: A Multidisciplinary Journal, 2024

We present a bias-adjusted three-step estimation approach for multilevel latent class models (LC) with covariates. The proposed approach involves (1) fitting a single-level measurement model while ignoring the multilevel structure, (2) assigning units to latent classes, and (3) fitting the multilevel model with the covariates while controlling for…

Descriptors: Hierarchical Linear Modeling, Statistical Bias, Error of Measurement, Simulation

Evaluating Imputation-Based Fit Statistics in Structural Equation Modeling with Ordinal Data: The Mi2S Approach

Peer reviewed

Direct link

Suppanut Sriutaisuk; Yu Liu; Seungwon Chung; Hanjoe Kim; Fei Gu – Educational and Psychological Measurement, 2025

The multiple imputation two-stage (MI2S) approach holds promise for evaluating the model fit of structural equation models for ordinal variables with multiply imputed data. However, previous studies only examined the performance of MI2S-based residual-based test statistics. This study extends previous research by examining the performance of two…

Descriptors: Structural Equation Models, Error of Measurement, Programming Languages, Goodness of Fit

Causal Mediation Analysis for an Ordinal Outcome with Multiple Mediators

Peer reviewed

Direct link

Yuejin Zhou; Wenwu Wang; Tao Hu; Tiejun Tong; Zhonghua Liu – Structural Equation Modeling: A Multidisciplinary Journal, 2024

Causal mediation analysis is a popular approach for investigating whether the effect of an exposure on an outcome is through a mediator to better understand the underlying causal mechanism. In recent literature, mediation analysis with multiple mediators has been proposed for continuous and dichotomous outcomes. In contrast, methods for mediation…

Descriptors: Regression (Statistics), Causal Models, Evaluation Methods, Vignettes

Towards Representation Learning for Weighting Problems in Design-Based Causal Inference

Peer reviewed

Direct link

Oscar Clivio; Avi Feller; Chris Holmes – Grantee Submission, 2024

Reweighting a distribution to minimize a distance to a target distribution is a powerful and flexible strategy for estimating a wide range of causal effects, but can be challenging in practice because optimal weights typically depend on knowledge of the underlying data generating process. In this paper, we focus on design-based weights, which do…

Descriptors: Evaluation Methods, Causal Models, Error of Measurement, Guidelines

Comparing Factor Score Approaches to SEM in Multigroup Models with Small Samples

Peer reviewed

Direct link

Emma Somer; Carl Falk; Milica Miocevic – Structural Equation Modeling: A Multidisciplinary Journal, 2024

Factor Score Regression (FSR) is increasingly employed as an alternative to structural equation modeling (SEM) in small samples. Despite its popularity in psychology, the performance of FSR in multigroup models with small samples remains relatively unknown. The goal of this study was to examine the performance of FSR, namely Croon's correction and…

Descriptors: Scores, Structural Equation Models, Comparative Analysis, Sample Size

Evidence-Based Evaluation of Student and Marker Performances in Assessment and Examination

Peer reviewed

Direct link

Ole J. Kemi – Advances in Physiology Education, 2025

Students are assessed by coursework and/or exams, all of which are marked by assessors (markers). Student and marker performances are then subject to end-of-session board of examiner handling and analysis. This occurs annually and is the basis for evaluating students but also the wider learning and teaching efficiency of an academic institution.…

Descriptors: Undergraduate Students, Evaluation Methods, Evaluation Criteria, Academic Standards

Classification Consistency and Accuracy with Atypical Score Distributions

Peer reviewed

Direct link

Kim, Stella Y.; Lee, Won-Chan – Journal of Educational Measurement, 2020

The current study aims to evaluate the performance of three non-IRT procedures (i.e., normal approximation, Livingston-Lewis, and compound multinomial) for estimating classification indices when the observed score distribution shows atypical patterns: (a) bimodality, (b) structural (i.e., systematic) bumpiness, or (c) structural zeros (i.e., no…

Descriptors: Classification, Accuracy, Scores, Cutting Scores

Some Methods and Evaluation for Linking and Equating with Small Samples

Peer reviewed

Direct link

Peabody, Michael R. – Applied Measurement in Education, 2020

The purpose of the current article is to introduce the equating and evaluation methods used in this special issue. Although a comprehensive review of all existing models and methodologies would be impractical given the format, a brief introduction to some of the more popular models will be provided. A brief discussion of the conditions required…

Descriptors: Evaluation Methods, Equated Scores, Sample Size, Item Response Theory

Equating with Small and Unbalanced Samples

Peer reviewed

Direct link

Goodman, Joshua T.; Dallas, Andrew D.; Fan, Fen – Applied Measurement in Education, 2020

Recent research has suggested that re-setting the standard for each administration of a small sample examination, in addition to the high cost, does not adequately maintain similar performance expectations year after year. Small-sample equating methods have shown promise with samples between 20 and 30. For groups that have fewer than 20 students,…

Descriptors: Equated Scores, Sample Size, Sampling, Weighted Scores

Evaluation of Structure Complexity Magnitude, Degree of Cross-Loading on Secondary Dimension and Model Specification on MIRT Parameter Estimation

Direct link

Hosseinzadeh, Mostafa – ProQuest LLC, 2021

In real-world situations, multidimensional data may appear on large-scale tests or attitudinal surveys. A simple structure, multidimensional model may be used to evaluate the items, ignoring the cross-loading of some items on the secondary dimension. The purpose of this study was to investigate the influence of structure complexity magnitude of…

Descriptors: Item Response Theory, Models, Simulation, Evaluation Methods

Differential Item Functioning Effect Size from the Multigroup Confirmatory Factor Analysis for a Meta-Analysis: A Simulation Study

Peer reviewed

Direct link

Park, Sung Eun; Ahn, Soyeon; Zopluoglu, Cengiz – Educational and Psychological Measurement, 2021

This study presents a new approach to synthesizing differential item functioning (DIF) effect size: First, using correlation matrices from each study, we perform a multigroup confirmatory factor analysis (MGCFA) that examines measurement invariance of a test item between two subgroups (i.e., focal and reference groups). Then we synthesize, across…

Descriptors: Item Analysis, Effect Size, Difficulty Level, Monte Carlo Methods

Testing Autocorrelation and Partial Autocorrelation: Asymptotic Methods versus Resampling Techniques

Peer reviewed
PDF on ERIC

Download full text

Direct link

Ke, Zijun; Zhang, Zhiyong – Grantee Submission, 2018

Autocorrelation and partial autocorrelation, which provide a mathematical tool to understand repeating patterns in time series data, are often used to facilitate the identification of model orders of time series models (e.g., moving average and autoregressive models). Asymptotic methods for testing autocorrelation and partial autocorrelation such…

Descriptors: Correlation, Mathematical Formulas, Sampling, Monte Carlo Methods

An Unbiased Estimate of Global Interrater Agreement

Peer reviewed

Direct link

Cousineau, Denis; Laurencelle, Louis – Educational and Psychological Measurement, 2017

Assessing global interrater agreement is difficult as most published indices are affected by the presence of mixtures of agreements and disagreements. A previously proposed method was shown to be specifically sensitive to global agreement, excluding mixtures, but also negatively biased. Here, we propose two alternatives in an attempt to find what…

Descriptors: Interrater Reliability, Evaluation Methods, Statistical Bias, Accuracy

Previous Page | Next Page »

Pages: 1 | 2 | 3

Structural Equation Modeling:…	4
Educational and Psychological…	3
Applied Measurement in…	2
Applied Psychological…	2
Education and the Public…	2
Grantee Submission	2
Journal of Educational…	2
ProQuest LLC	2
Advances in Physiology…	1
Carnegie Foundation for the…	1
Evaluation Review	1
Group of Eight (NJ1)	1
International Journal of…	1
Journal of Chemical Education	1
Journal of Educational and…	1
Journal of Research on…	1
Measurement:…	1
Multivariate Behavioral…	1
National Education Policy…	1
New Directions for…	1
Psicologica: International…	1
Psychological Methods	1
Society for Research on…	1
Sociological Methods &…	1
More ▼

Reardon, Sean F.	2
Woods, Carol M.	2
Abdul Haq	1
Ahn, Soyeon	1
Alcala-Quintana, Rocio	1
Avi Feller	1
Bamezai, Anil	1
Bloom, Howard S.	1
Botella, Juan	1
Brandriet, Alexandra	1
Carl Falk	1
Chris Holmes	1
Cimpian, Joseph R.	1
Cousineau, Denis	1
Dallas, Andrew D.	1
Diakow, Ronli Phyllis	1
Dorn, Sherman	1
Echternacht, Gary	1
Elizabeth A. Sanders	1
Emma Somer	1
Fan, Fen	1
Fei Gu	1
Garcia-Perez, Miguel A.	1
Goodman, Joshua T.	1
More ▼