Hsin-Yun Lee; You-Lin Chen; Li-Jen Weng – Journal of Experimental Education, 2024
The second version of Kaiser's Measure of Sampling Adequacy (MSA₂) has been widely applied to assess the factorability of data in psychological research. The MSA₂ is developed in the population and little is known about its behavior in finite samples. If estimated MSA₂s are biased due to sampling errors,…
Descriptors: Error of Measurement, Reliability, Sampling, Statistical Bias
William C. M. Belzak; Daniel J. Bauer – Journal of Educational and Behavioral Statistics, 2024
Testing for differential item functioning (DIF) has undergone rapid statistical developments recently. Moderated nonlinear factor analysis (MNLFA) allows for simultaneous testing of DIF among multiple categorical and continuous covariates (e.g., sex, age, ethnicity, etc.), and regularization has shown promising results for identifying DIF among…
Descriptors: Test Bias, Algorithms, Factor Analysis, Error of Measurement
Jonas Flodén – British Educational Research Journal, 2025
This study compares how the generative AI (GenAI) large language model (LLM) ChatGPT performs in grading university exams compared to human teachers. Aspects investigated include consistency, large discrepancies and length of answer. Implications for higher education, including the role of teachers and ethics, are also discussed. Three…
Descriptors: College Faculty, Artificial Intelligence, Comparative Testing, Scoring
Stephanie M. Bell; R. Philip Chalmers; David B. Flora – Educational and Psychological Measurement, 2024
Coefficient omega indices are model-based composite reliability estimates that have become increasingly popular. A coefficient omega index estimates how reliably an observed composite score measures a target construct as represented by a factor in a factor-analysis model; as such, the accuracy of omega estimates is likely to depend on correct…
Descriptors: Influences, Models, Measurement Techniques, Reliability
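As a rough illustration of the kind of index the abstract above describes, the following sketch computes coefficient omega for a single-factor model. It is not taken from the article: it assumes a one-factor model with the factor variance fixed to 1 and uncorrelated residuals, and the function name `coefficient_omega` and the loading values are hypothetical.

```python
def coefficient_omega(loadings, residual_variances):
    """Coefficient omega under an assumed one-factor model.

    Assumes the factor variance is fixed to 1 and the residuals are
    uncorrelated, so composite variance = (sum of loadings)^2 + sum of
    residual variances.
    """
    if len(loadings) != len(residual_variances):
        raise ValueError("need one residual variance per loading")
    # Variance in the composite attributable to the common factor
    factor_variance = sum(loadings) ** 2
    # Variance in the composite attributable to item-specific error
    error_variance = sum(residual_variances)
    return factor_variance / (factor_variance + error_variance)


# Hypothetical standardized loadings for a four-item scale
loadings = [0.7, 0.6, 0.8, 0.5]
# With standardized items, each residual variance is 1 - loading^2
residuals = [1 - value ** 2 for value in loadings]

omega = coefficient_omega(loadings, residuals)
print(round(omega, 3))
```

Because the formula relies on the factor model being correctly specified, omitted cross-loadings or correlated residuals would change the estimate, which is the dependence the abstract points to.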
Pornphan Sureeyatanapas; Panitas Sureeyatanapas; Uthumporn Panitanarak; Jittima Kraisriwattana; Patchanan Sarootyanapat; Daniel O'Connell – Language Testing in Asia, 2024
Ensuring consistent and reliable scoring is paramount in education, especially in performance-based assessments. This study delves into the critical issue of marking consistency, focusing on speaking proficiency tests in English language learning, which often face greater reliability challenges. While existing literature has explored various…
Descriptors: Foreign Countries, Students, English Language Learners, Speech
Phillip K. Wood – Structural Equation Modeling: A Multidisciplinary Journal, 2024
The logistic and confined exponential curves are frequently used in studies of growth and learning. These models, which are nonlinear in their parameters, can be estimated using structural equation modeling software. This paper proposes a single combined model, a weighted combination of both models. Mplus, Proc Calis, and lavaan code for the model…
Descriptors: Structural Equation Models, Computation, Computer Software, Weighted Scores
Yan Xia; Selim Havan – Educational and Psychological Measurement, 2024
Although parallel analysis has been found to be an accurate method for determining the number of factors in many conditions with complete data, its application under missing data is limited. The existing literature recommends that, after using an appropriate multiple imputation method, researchers either apply parallel analysis to every imputed…
Descriptors: Data Interpretation, Factor Analysis, Statistical Inference, Research Problems
Vispoel, Walter P.; Lee, Hyeryung; Xu, Guanlan; Hong, Hyeri – Journal of Experimental Education, 2023
Although generalizability theory (GT) designs have traditionally been analyzed within an ANOVA framework, identical results can be obtained with structural equation models (SEMs) but extended to represent multiple sources of both systematic and measurement error variance, include estimation methods less likely to produce negative variance…
Descriptors: Generalizability Theory, Structural Equation Models, Programming Languages, Scores
Mohammad Mehdi Latifi; Dariush Tahmasebi Aghbelaghi; Sajad Khani Pordanjani – European Journal of Education, 2025
The present study sought to assess the psychometric properties of the Iranian adaptation of the Vietnam Teacher Resilience Scale for Asia (VITRS), referred to as the Iranian Teachers' Resilience Scale (ITRS) and to examine its measurement invariance across middle and high school teachers in Iran. In total, 700 participants completed the…
Descriptors: Resilience (Psychology), Error of Measurement, Factor Analysis, Teacher Attitudes
Najera, Hector – Measurement: Interdisciplinary Research and Perspectives, 2023
Measurement error affects the quality of population orderings of an index and, hence, increases the misclassification of the poor and the non-poor groups and affects statistical inferences from binary regression models. Hence, the conclusions about the extent, profile, and distribution of poverty are likely to be misleading. However, the size and…
Descriptors: Poverty, Error of Measurement, Classification, Statistical Inference
Zachary del Rosario – Journal of Statistics and Data Science Education, 2024
Variability is underemphasized in domains such as engineering. Statistics and data science education research offers a variety of frameworks for understanding variability, but new frameworks for domain applications are necessary. This study investigated the professional practices of working engineers to develop such a framework. The Neglected,…
Descriptors: Foreign Countries, Engineering Education, Engineering, Technical Occupations
Cristian Zanon; Nan Zhao; Nursel Topkaya; Ertugrul Sahin; David L. Vogel; Melissa M. Ertl; Samineh Sanatkar; Hsin-Ya Liao; Mark Rubin; Makilim N. Baptista; Winnie W. S. Mak; Fatima Rashed Al-Darmaki; Georg Schomerus; Ying-Fen Wang; Dalia Nasvytiene – International Journal of Testing, 2025
Examinations of the internal structure of the Depression, Anxiety, and Stress Scale-21 (DASS-21) have yielded inconsistent conclusions within and across cultural contexts. This study examined the dimensionality and reliability of the DASS-21 across three theoretically plausible factor structures (i.e., unidimensional, oblique three-factor, and…
Descriptors: Anxiety, Depression (Psychology), Psychometrics, Cultural Context
Pere J. Ferrando; David Navarro-González; Fabia Morales-Vives – Educational and Psychological Measurement, 2025
The problem of local item dependencies (LIDs) is very common in personality and attitude measures, particularly in those that measure narrow-bandwidth dimensions. At the structural level, these dependencies can be modeled by using extended factor analytic (FA) solutions that include correlated residuals. However, the effects that LIDs have on the…
Descriptors: Scores, Accuracy, Evaluation Methods, Factor Analysis
Grant, Chris; Beach, Tyson A. C.; Hogg-Johnson, Sheilah; Chivers, Michael; Howarth, Samuel J. – Measurement in Physical Education and Exercise Science, 2020
This study evaluated whether real-time applied load feedback and a predefined applied load limit improved inter-session reliability and measurement error of passive glenohumeral rotation range-of-motion measurements. Twenty-one male recreational overhead athletes completed two data collection sessions, approximately 1 week apart. Measurements of…
Descriptors: Reliability, Measurement, Males, Athletes
Fu, Yuanshu; Wen, Zhonglin; Wang, Yang – Educational and Psychological Measurement, 2022
Composite reliability, or coefficient omega, can be estimated using structural equation modeling. Composite reliability is usually estimated under the basic independent clusters model of confirmatory factor analysis (ICM-CFA). However, due to the existence of cross-loadings, the model fit of the exploratory structural equation model (ESEM) is…
Descriptors: Comparative Analysis, Structural Equation Models, Factor Analysis, Reliability