ERIC - Search Results


Showing 1 to 15 of 39 results

Combining Mokken Scale Analysis with Rasch Measurement Theory to Explore Differences in Measurement Quality between Subgroups

Peer reviewed

Stefanie A. Wind; Benjamin Lugu; Yurou Wang – International Journal of Testing, 2025

Mokken Scale Analysis (MSA) is a nonparametric approach that offers exploratory tools for understanding the nature of item responses while emphasizing invariance requirements. MSA is often discussed as it relates to Rasch measurement theory, which also emphasizes invariance, but uses parametric models. Researchers who have compared and combined…

Descriptors: Item Response Theory, Scaling, Surveys, Evaluation Methods

Technical Adequacy-Reliability

Peer reviewed

Susan K. Johnsen – Gifted Child Today, 2025

The author provides information about reliability and areas that educators should examine in determining if an assessment is consistent and trustworthy for use, and how it should be interpreted in making decisions about students. Reliability areas that are discussed in the column include internal consistency, test-retest or stability, inter-scorer…

Descriptors: Test Reliability, Academically Gifted, Student Evaluation, Error of Measurement

The Association between the Interviewers' and the Respondents' Political Attitudes in a Telephone Survey

Peer reviewed

Ádám Stefkovics – International Journal of Social Research Methodology, 2025

Interviewer effects in telephone surveys on political topics are likely to occur. The literature has yielded considerable evidence about the impact of basic interviewer characteristics, but research is lacking on how interviewers' beliefs may shape responses. This study is aimed at assessing the association between the interviewers' party…

Descriptors: Interviews, Political Attitudes, Telephone Surveys, Political Issues

The Impact of "Negligible" Cross-Loadings in Investigations of Measurement Invariance with MGCFA and MGESEM

Peer reviewed

Timothy R. Konold; Elizabeth A. Sanders; Kelvin Afolabi – Structural Equation Modeling: A Multidisciplinary Journal, 2025

Measurement invariance (MI) is an essential part of validity evidence concerned with ensuring that tests function similarly across groups, contexts, and time. Most evaluations of MI involve multigroup confirmatory factor analyses (MGCFA) that assume simple structure. However, recent research has shown that constraining non-target indicators to…

Descriptors: Evaluation Methods, Error of Measurement, Validity, Monte Carlo Methods

Graphical Causal Models for Survey Inference

Peer reviewed

Julian Schuessler; Peter Selb – Sociological Methods & Research, 2025

Directed acyclic graphs (DAGs) are now a popular tool to inform causal inferences. We discuss how DAGs can also be used to encode theoretical assumptions about nonprobability samples and survey nonresponse and to determine whether population quantities including conditional distributions and regressions can be identified. We describe sources of…

Descriptors: Data Collection, Graphs, Error of Measurement, Statistical Bias

Evaluating Imputation-Based Fit Statistics in Structural Equation Modeling with Ordinal Data: The Mi2S Approach

Peer reviewed

Suppanut Sriutaisuk; Yu Liu; Seungwon Chung; Hanjoe Kim; Fei Gu – Educational and Psychological Measurement, 2025

The multiple imputation two-stage (MI2S) approach holds promise for evaluating the model fit of structural equation models for ordinal variables with multiply imputed data. However, previous studies only examined the performance of MI2S-based residual-based test statistics. This study extends previous research by examining the performance of two…

Descriptors: Structural Equation Models, Error of Measurement, Programming Languages, Goodness of Fit

Measurement Invariance of the Action Competence in Sustainable Development Questionnaire: Can We Compare between Groups?

Peer reviewed

M. Van Harskamp; S. De Maeyer; W. Sass; P. Van Petegem; J. Boeve-de Pauw – Environmental Education Research, 2025

There is a need for valid and reliable instruments to assess learning outcomes in education for sustainable development (ESD). Measurement invariance (MI) needs to be established before results of these instruments can be validly compared between groups. Despite its importance, establishing MI is an often overlooked validation step. To provide an…

Descriptors: Measurement, Sustainable Development, Error of Measurement, Questionnaires

New Developments in Measurement Invariance Testing: An Overview and Comparison of EFA-Based Approaches

Peer reviewed

Philipp Sterner; Kim De Roover; David Goretzko – Structural Equation Modeling: A Multidisciplinary Journal, 2025

When comparing relations and means of latent variables, it is important to establish measurement invariance (MI). Most methods to assess MI are based on confirmatory factor analysis (CFA). Recently, new methods have been developed based on exploratory factor analysis (EFA); most notably, as extensions of multi-group EFA, researchers introduced…

Descriptors: Error of Measurement, Measurement Techniques, Factor Analysis, Structural Equation Models

Exploring the Influence of Response Styles on Continuous Scale Assessments: Insights from a Novel Modeling Approach

Peer reviewed

Hung-Yu Huang – Educational and Psychological Measurement, 2025

The use of discrete categorical formats to assess psychological traits has a long-standing tradition that is deeply embedded in item response theory models. The increasing prevalence and endorsement of computer- or web-based testing has led to greater focus on continuous response formats, which offer numerous advantages in both respondent…

Descriptors: Response Style (Tests), Psychological Characteristics, Item Response Theory, Test Reliability

Grading Exams Using Large Language Models: A Comparison between Human and AI Grading of Exams in Higher Education Using ChatGPT

Peer reviewed

Jonas Flodén – British Educational Research Journal, 2025

This study compares how the generative AI (GenAI) large language model (LLM) ChatGPT performs in grading university exams compared to human teachers. Aspects investigated include consistency, large discrepancies and length of answer. Implications for higher education, including the role of teachers and ethics, are also discussed. Three…

Descriptors: College Faculty, Artificial Intelligence, Comparative Testing, Scoring

IRT Observed-Score Equating for Rater-Mediated Assessments Using a Hierarchical Rater Model

Peer reviewed

Tong Wu; Stella Y. Kim; Carl Westine; Michelle Boyer – Journal of Educational Measurement, 2025

While significant attention has been given to test equating to ensure score comparability, limited research has explored equating methods for rater-mediated assessments, where human raters inherently introduce error. If not properly addressed, these errors can undermine score interchangeability and test validity. This study proposes an equating…

Descriptors: Item Response Theory, Evaluators, Error of Measurement, Test Validity

Linear Probability Model Revisited: Why It Works and How It Should Be Specified

Peer reviewed

Myoung-jae Lee; Goeun Lee; Jin-young Choi – Sociological Methods & Research, 2025

A linear model is often used to find the effect of a binary treatment D on a noncontinuous outcome Y with covariates X. Particularly, a binary Y gives the popular "linear probability model (LPM)," but the linear model is untenable if X contains a continuous regressor. This raises the question: what kind of treatment effect does the…

Descriptors: Probability, Least Squares Statistics, Regression (Statistics), Causal Models

Confirming Increased Statistical Errors in Testing Correlations from Small Sample Sizes

Peer reviewed

Duane Knudson – Measurement in Physical Education and Exercise Science, 2025

Small sample sizes contribute to several problems in research and knowledge advancement. This conceptual replication study confirmed and extended the inflation of type II errors and confidence intervals in correlation analyses of small sample sizes common in kinesiology/exercise science. Current population data (N = 18, 230, & 464) on four…

Descriptors: Kinesiology, Exercise, Biomechanics, Movement Education

The Role of Reciprocated Friendships in the Behavioral Correlates of Sociometric Categories

Peer reviewed

Tamás Hoffmann; Bence Basa; László Bernáth; Katalin N. Kollár – Canadian Journal of School Psychology, 2025

Children live complex social lives that have various aspects, including intimate friendships, peer acceptance and bullying dynamics, which are usually studied separately in research. This study aims to investigate the interplay of these three important fields by analyzing the moderating effects of number of friendships on the relation between…

Descriptors: Foreign Countries, Elementary School Students, Friendship, Peer Acceptance

Evaluating Measurement Invariance of Students' Practices Regarding Online Information Questionnaire in PISA 2022: A Comparative Study Using MGCFA and Alignment Method

Peer reviewed

Esra Sözer Boz – Education and Information Technologies, 2025

International large-scale assessments provide cross-national data on students' cognitive and non-cognitive characteristics. A critical methodological issue that often arises in comparing data from cross-national studies is ensuring measurement invariance, indicating that the construct under investigation is the same across the compared groups.…

Descriptors: Achievement Tests, International Assessment, Foreign Countries, Secondary School Students
