Publication Date
In 2025 | 39 |
Since 2024 | 192 |
Since 2021 (last 5 years) | 495 |
Since 2016 (last 10 years) | 996 |
Since 2006 (last 20 years) | 2028 |
Descriptor
Error of Measurement | 3295 |
Statistical Analysis | 599 |
Scores | 504 |
Item Response Theory | 445 |
Correlation | 434 |
Comparative Analysis | 422 |
Foreign Countries | 415 |
Test Reliability | 408 |
Computation | 404 |
Simulation | 370 |
Reliability | 355 |
More ▼ |
Source
Author
Publication Type
Education Level
Audience
Researchers | 93 |
Practitioners | 23 |
Teachers | 22 |
Policymakers | 10 |
Administrators | 5 |
Students | 4 |
Counselors | 2 |
Parents | 2 |
Community | 1 |
Location
United States | 47 |
Germany | 42 |
Australia | 34 |
Canada | 27 |
Turkey | 27 |
California | 22 |
United Kingdom (England) | 20 |
Netherlands | 18 |
China | 16 |
New York | 15 |
United Kingdom | 15 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Does not meet standards | 1 |
Julian Schuessler; Peter Selb – Sociological Methods & Research, 2025
Directed acyclic graphs (DAGs) are now a popular tool to inform causal inferences. We discuss how DAGs can also be used to encode theoretical assumptions about nonprobability samples and survey nonresponse and to determine whether population quantities including conditional distributions and regressions can be identified. We describe sources of…
Descriptors: Data Collection, Graphs, Error of Measurement, Statistical Bias
Xiaowen Liu – International Journal of Testing, 2024
Differential item functioning (DIF) often arises from multiple sources. Within the context of multidimensional item response theory, this study examined DIF items with varying secondary dimensions using the three DIF methods: SIBTEST, Mantel-Haenszel, and logistic regression. The effect of the number of secondary dimensions on DIF detection rates…
Descriptors: Item Analysis, Test Items, Item Response Theory, Correlation
Gökhan Iskifoglu – Turkish Online Journal of Educational Technology - TOJET, 2024
This research paper investigated the importance of conducting measurement invariance analysis in developing measurement tools for assessing differences between and among study variables. Most of the studies, which tended to develop an inventory to assess the existence of an attitude, behavior, belief, IQ, or an intuition in a person's…
Descriptors: Testing, Testing Problems, Error of Measurement, Attitude Measures
Qian Zhang; Qi Wang – Structural Equation Modeling: A Multidisciplinary Journal, 2024
In the article, we focused on the issues of measurement error and omitted confounders while conducting mediation analysis under experimental studies. Depending on informativeness of the confounders between the mediator (M) and outcome (Y), we described two approaches. When researchers are confident that primary confounders are included (e.g.,…
Descriptors: Error of Measurement, Research and Development, Mediation Theory, Causal Models
Mark White; Matt Ronfeldt – Educational Assessment, 2024
Standardized observation systems seek to reliably measure a specific conceptualization of teaching quality, managing rater error through mechanisms such as certification, calibration, validation, and double-scoring. These mechanisms both support high quality scoring and generate the empirical evidence used to support the scoring inference (i.e.,…
Descriptors: Interrater Reliability, Quality Control, Teacher Effectiveness, Error Patterns
Hwanggyu Lim; Danqi Zhu; Edison M. Choe; Kyung T. Han – Journal of Educational Measurement, 2024
This study presents a generalized version of the residual differential item functioning (RDIF) detection framework in item response theory, named GRDIF, to analyze differential item functioning (DIF) in multiple groups. The GRDIF framework retains the advantages of the original RDIF framework, such as computational efficiency and ease of…
Descriptors: Item Response Theory, Test Bias, Test Reliability, Test Construction
Johan Lyrvall; Zsuzsa Bakk; Jennifer Oser; Roberto Di Mari – Structural Equation Modeling: A Multidisciplinary Journal, 2024
We present a bias-adjusted three-step estimation approach for multilevel latent class models (LC) with covariates. The proposed approach involves (1) fitting a single-level measurement model while ignoring the multilevel structure, (2) assigning units to latent classes, and (3) fitting the multilevel model with the covariates while controlling for…
Descriptors: Hierarchical Linear Modeling, Statistical Bias, Error of Measurement, Simulation
The Design and Optimality of Survey Counts: A Unified Framework via the Fisher Information Maximizer
Xin Guo; Qiang Fu – Sociological Methods & Research, 2024
Grouped and right-censored (GRC) counts have been used in a wide range of attitudinal and behavioural surveys yet they cannot be readily analyzed or assessed by conventional statistical models. This study develops a unified regression framework for the design and optimality of GRC counts in surveys. To process infinitely many grouping schemes for…
Descriptors: Attitude Measures, Surveys, Research Design, Research Methodology
Steffen Erickson – Society for Research on Educational Effectiveness, 2024
Background: Structural Equation Modeling (SEM) is a powerful and broadly utilized statistical framework. Researchers employ these models to dissect relationships into direct, indirect, and total effects (Bollen, 1989). These models unpack the "black box" issues within cause-and-effect studies by examining the underlying theoretical…
Descriptors: Structural Equation Models, Causal Models, Research Methodology, Error of Measurement
Gorney, Kylie; Wollack, James A. – Journal of Educational Measurement, 2023
In order to detect a wide range of aberrant behaviors, it can be useful to incorporate information beyond the dichotomous item scores. In this paper, we extend the l[subscript z] and l*[subscript z] person-fit statistics so that unusual behavior in item scores and unusual behavior in item distractors can be used as indicators of aberrance. Through…
Descriptors: Test Items, Scores, Goodness of Fit, Statistics
Suppanut Sriutaisuk; Yu Liu; Seungwon Chung; Hanjoe Kim; Fei Gu – Educational and Psychological Measurement, 2025
The multiple imputation two-stage (MI2S) approach holds promise for evaluating the model fit of structural equation models for ordinal variables with multiply imputed data. However, previous studies only examined the performance of MI2S-based residual-based test statistics. This study extends previous research by examining the performance of two…
Descriptors: Structural Equation Models, Error of Measurement, Programming Languages, Goodness of Fit
Dan Wei; Peida Zhan; Hongyun Liu – Structural Equation Modeling: A Multidisciplinary Journal, 2024
In latent growth curve modeling (LGCM), overall fit indices have garnered increased disputation for model selection, and model fit evaluation based on the mean structure has becoming popularity. The present study developed a versatile fit index, named Weighted Root Mean Squared Errors (WRMSE), based on individual case residuals (ICRs) with the aim…
Descriptors: Structural Equation Models, Goodness of Fit, Error of Measurement, Computation
Jiaying Xiao; Chun Wang; Gongjun Xu – Grantee Submission, 2024
Accurate item parameters and standard errors (SEs) are crucial for many multidimensional item response theory (MIRT) applications. A recent study proposed the Gaussian Variational Expectation Maximization (GVEM) algorithm to improve computational efficiency and estimation accuracy (Cho et al., 2021). However, the SE estimation procedure has yet to…
Descriptors: Error of Measurement, Models, Evaluation Methods, Item Analysis
Chunhua Cao; Xinya Liang – Structural Equation Modeling: A Multidisciplinary Journal, 2024
Cross-loadings are common in multiple-factor confirmatory factor analysis (CFA) but often ignored in measurement invariance testing. This study examined the impact of ignoring cross-loadings on the sensitivity of fit measures (CFI, RMSEA, SRMR, SRMRu, AIC, BIC, SaBIC, LRT) to measurement noninvariance. The manipulated design factors included the…
Descriptors: Goodness of Fit, Error of Measurement, Sample Size, Factor Analysis
Tenko Raykov – Structural Equation Modeling: A Multidisciplinary Journal, 2024
This note demonstrates that measurement invariance does not guarantee meaningful and valid group comparisons in multiple-population settings. The article follows on a recent critical discussion by Robitzsch and Lüdtke, who argued that measurement invariance was not a pre-requisite for such comparisons. Within the framework of common factor…
Descriptors: Error of Measurement, Prerequisites, Factor Analysis, Evaluation Methods