Publication Date
In 2025 | 39 |
Descriptor
Source
Author
Kim De Roover | 2 |
Alyssa M. Merbler | 1 |
Amanda Timmerman | 1 |
Amy E. Ramage | 1 |
Anders Hjorth-Trolle | 1 |
Anders Holm | 1 |
Andrea Chirico | 1 |
Andrea L. B. Ford | 1 |
Audrey Linden | 1 |
Bang Quan Zheng | 1 |
Ben Van Dusen | 1 |
More ▼ |
Publication Type
Journal Articles | 38 |
Reports - Research | 34 |
Reports - Evaluative | 3 |
Reports - Descriptive | 2 |
Information Analyses | 1 |
Education Level
Secondary Education | 6 |
Elementary Education | 5 |
Higher Education | 5 |
Postsecondary Education | 5 |
High Schools | 2 |
Early Childhood Education | 1 |
Grade 10 | 1 |
Junior High Schools | 1 |
Middle Schools | 1 |
Preschool Education | 1 |
Audience
Laws, Policies, & Programs
Assessments and Surveys
Program for International… | 2 |
Depression Anxiety and Stress… | 1 |
Force Concept Inventory | 1 |
NEO Personality Inventory | 1 |
What Works Clearinghouse Rating
Stefanie A. Wind; Benjamin Lugu; Yurou Wang – International Journal of Testing, 2025
Mokken Scale Analysis (MSA) is a nonparametric approach that offers exploratory tools for understanding the nature of item responses while emphasizing invariance requirements. MSA is often discussed as it relates to Rasch measurement theory, which also emphasizes invariance, but uses parametric models. Researchers who have compared and combined…
Descriptors: Item Response Theory, Scaling, Surveys, Evaluation Methods
Susan K. Johnsen – Gifted Child Today, 2025
The author provides information about reliability and areas that educators should examine in determining if an assessment is consistent and trustworthy for use, and how it should be interpreted in making decisions about students. Reliability areas that are discussed in the column include internal consistency, test-retest or stability, inter-scorer…
Descriptors: Test Reliability, Academically Gifted, Student Evaluation, Error of Measurement
Ádám Stefkovics – International Journal of Social Research Methodology, 2025
Interviewer effects in telephone surveys on political topics are likely to occur. The literature has yielded considerable evidence about the impact of basic interviewer characteristics, but research is lacking on how interviewers' beliefs may shape responses. This study is aimed at assessing the association between the interviewers' party…
Descriptors: Interviews, Political Attitudes, Telephone Surveys, Political Issues
Timothy R. Konold; Elizabeth A. Sanders; Kelvin Afolabi – Structural Equation Modeling: A Multidisciplinary Journal, 2025
Measurement invariance (MI) is an essential part of validity evidence concerned with ensuring that tests function similarly across groups, contexts, and time. Most evaluations of MI involve multigroup confirmatory factor analyses (MGCFA) that assume simple structure. However, recent research has shown that constraining non-target indicators to…
Descriptors: Evaluation Methods, Error of Measurement, Validity, Monte Carlo Methods
Julian Schuessler; Peter Selb – Sociological Methods & Research, 2025
Directed acyclic graphs (DAGs) are now a popular tool to inform causal inferences. We discuss how DAGs can also be used to encode theoretical assumptions about nonprobability samples and survey nonresponse and to determine whether population quantities including conditional distributions and regressions can be identified. We describe sources of…
Descriptors: Data Collection, Graphs, Error of Measurement, Statistical Bias
Suppanut Sriutaisuk; Yu Liu; Seungwon Chung; Hanjoe Kim; Fei Gu – Educational and Psychological Measurement, 2025
The multiple imputation two-stage (MI2S) approach holds promise for evaluating the model fit of structural equation models for ordinal variables with multiply imputed data. However, previous studies only examined the performance of MI2S-based residual-based test statistics. This study extends previous research by examining the performance of two…
Descriptors: Structural Equation Models, Error of Measurement, Programming Languages, Goodness of Fit
M. Van Harskamp; S. De Maeyer; W. Sass; P. Van Petegem; J. Boeve-de Pauw – Environmental Education Research, 2025
There is a need for valid and reliable instruments to assess learning outcomes in education for sustainable development (ESD). Measurement invariance (MI) needs to be established before results of these instruments can be validly compared between groups. Despite its importance, establishing MI is an often overlooked validation step. To provide an…
Descriptors: Measurement, Sustainable Development, Error of Measurement, Questionnaires
Philipp Sterner; Kim De Roover; David Goretzko – Structural Equation Modeling: A Multidisciplinary Journal, 2025
When comparing relations and means of latent variables, it is important to establish measurement invariance (MI). Most methods to assess MI are based on confirmatory factor analysis (CFA). Recently, new methods have been developed based on exploratory factor analysis (EFA); most notably, as extensions of multi-group EFA, researchers introduced…
Descriptors: Error of Measurement, Measurement Techniques, Factor Analysis, Structural Equation Models
Hung-Yu Huang – Educational and Psychological Measurement, 2025
The use of discrete categorical formats to assess psychological traits has a long-standing tradition that is deeply embedded in item response theory models. The increasing prevalence and endorsement of computer- or web-based testing has led to greater focus on continuous response formats, which offer numerous advantages in both respondent…
Descriptors: Response Style (Tests), Psychological Characteristics, Item Response Theory, Test Reliability
Jonas Flodén – British Educational Research Journal, 2025
This study compares how the generative AI (GenAI) large language model (LLM) ChatGPT performs in grading university exams compared to human teachers. Aspects investigated include consistency, large discrepancies and length of answer. Implications for higher education, including the role of teachers and ethics, are also discussed. Three…
Descriptors: College Faculty, Artificial Intelligence, Comparative Testing, Scoring
Tong Wu; Stella Y. Kim; Carl Westine; Michelle Boyer – Journal of Educational Measurement, 2025
While significant attention has been given to test equating to ensure score comparability, limited research has explored equating methods for rater-mediated assessments, where human raters inherently introduce error. If not properly addressed, these errors can undermine score interchangeability and test validity. This study proposes an equating…
Descriptors: Item Response Theory, Evaluators, Error of Measurement, Test Validity
Myoung-jae Lee; Goeun Lee; Jin-young Choi – Sociological Methods & Research, 2025
A linear model is often used to find the effect of a binary treatment D on a noncontinuous outcome Y with covariates X. Particularly, a binary Y gives the popular "linear probability model (LPM)," but the linear model is untenable if X contains a continuous regressor. This raises the question: what kind of treatment effect does the…
Descriptors: Probability, Least Squares Statistics, Regression (Statistics), Causal Models
Duane Knudson – Measurement in Physical Education and Exercise Science, 2025
Small sample sizes contribute to several problems in research and knowledge advancement. This conceptual replication study confirmed and extended the inflation of type II errors and confidence intervals in correlation analyses of small sample sizes common in kinesiology/exercise science. Current population data (N = 18, 230, & 464) on four…
Descriptors: Kinesiology, Exercise, Biomechanics, Movement Education
Tamás Hoffmann; Bence Basa; László Bernáth; Katalin N. Kollár – Canadian Journal of School Psychology, 2025
Children live complex social lives that has various aspects, including intimate friendships, peer-acceptance and bullying dynamics which is usually studied separately in research. This study aims to investigate the interplays of these three important fields by analyzing the moderating effects of number of friendships on the relation between…
Descriptors: Foreign Countries, Elementary School Students, Friendship, Peer Acceptance
Esra Sözer Boz – Education and Information Technologies, 2025
International large-scale assessments provide cross-national data on students' cognitive and non-cognitive characteristics. A critical methodological issue that often arises in comparing data from cross-national studies is ensuring measurement invariance, indicating that the construct under investigation is the same across the compared groups.…
Descriptors: Achievement Tests, International Assessment, Foreign Countries, Secondary School Students