ERIC - Search Results

Publication Date

In 2025	0
Since 2024	3
Since 2021 (last 5 years)	8

Descriptor

Error of Measurement	8
Guidelines	8
Comparative Analysis	3
Sample Size	3
Item Analysis	2
Item Response Theory	2
Networks	2
Simulation	2
Social Science Research	2
Statistical Bias	2
Test Items	2
Test Length	2
Accuracy	1
Acquired Immunodeficiency…	1
Algorithms	1
Artificial Intelligence	1
At Risk Persons	1
Bayesian Statistics	1
Causal Models	1
Computer Software	1
Conflict	1
Construct Validity	1
Data Analysis	1
Decision Making	1
Drug Abuse	1

Source

Sociological Methods &…	2
Educational and Psychological…	1
Grantee Submission	1
Journal of Educational…	1
Journal of Educational and…	1
Language Assessment Quarterly	1
Practical Assessment,…	1

Publication Type

Journal Articles	7
Reports - Research	5
Reports - Descriptive	2
Reports - Evaluative	1
Speeches/Meeting Papers	1
Tests/Questionnaires	1

Education Level

Higher Education	1
Postsecondary Education	1

Assessments and Surveys

International English…	1
Test of English as a Foreign…	1

Showing all 8 results

Frequentist and Bayesian Factorial Invariance Using R

Peer reviewed

Teck Kiang Tan – Practical Assessment, Research & Evaluation, 2024

The procedures of carrying out factorial invariance to validate a construct were well developed to ensure the reliability of the construct that can be used across groups for comparison and analysis, yet mainly restricted to the frequentist approach. This motivates an update to incorporate the growing Bayesian approach for carrying out the Bayesian…

Descriptors: Bayesian Statistics, Factor Analysis, Programming Languages, Reliability

Towards Representation Learning for Weighting Problems in Design-Based Causal Inference

Peer reviewed

Oscar Clivio; Avi Feller; Chris Holmes – Grantee Submission, 2024

Reweighting a distribution to minimize a distance to a target distribution is a powerful and flexible strategy for estimating a wide range of causal effects, but can be challenging in practice because optimal weights typically depend on knowledge of the underlying data generating process. In this paper, we focus on design-based weights, which do…

Descriptors: Evaluation Methods, Causal Models, Error of Measurement, Guidelines

Model Misspecification and Robustness of Observed-Score Test Equating Using Propensity Scores

Peer reviewed

Wallin, Gabriel; Wiberg, Marie – Journal of Educational and Behavioral Statistics, 2023

This study explores the usefulness of covariates on equating test scores from nonequivalent test groups. The covariates are captured by an estimated propensity score, which is used as a proxy for latent ability to balance the test groups. The objective is to assess the sensitivity of the equated scores to various misspecifications in the…

Descriptors: Models, Error of Measurement, Robustness (Statistics), Equated Scores

How Events Enter (or Not) Data Sets: The Pitfalls and Guidelines of Using Newspapers in the Study of Conflict

Peer reviewed

Demarest, Leila; Langer, Arnim – Sociological Methods & Research, 2022

While conflict event data sets are increasingly used in contemporary conflict research, important concerns persist regarding the quality of the collected data. Such concerns are not necessarily new. Yet, because the methodological debate and evidence on potential errors remains scattered across different subdisciplines of social sciences, there is…

Descriptors: Guidelines, Research Methodology, Conflict, Social Science Research

A Sample Size Formula for Network Scale-Up Studies

Peer reviewed

Nathaniel Josephs; Dennis M. Feehan; Forrest W. Crawford – Sociological Methods & Research, 2024

The network scale-up method (NSUM) is a survey-based method for estimating the number of individuals in a hidden or hard-to-reach subgroup of a general population. In NSUM surveys, sampled individuals report how many others they know in the subpopulation of interest (e.g. "How many sex workers do you know?") and how many others they know…

Descriptors: Sample Size, Surveys, Population Groups, Epidemiology

Two IRT Characteristic Curve Linking Methods Weighted by Information

Peer reviewed

Wang, Shaojie; Zhang, Minqiang; Lee, Won-Chan; Huang, Feifei; Li, Zonglong; Li, Yixing; Yu, Sufang – Journal of Educational Measurement, 2022

Traditional IRT characteristic curve linking methods ignore parameter estimation errors, which may undermine the accuracy of estimated linking constants. Two new linking methods are proposed that take into account parameter estimation errors. The item- (IWCC) and test-information-weighted characteristic curve (TWCC) methods employ weighting…

Descriptors: Item Response Theory, Error of Measurement, Accuracy, Monte Carlo Methods

A Regression Discontinuity Design Framework for Controlling Selection Bias in Evaluations of Differential Item Functioning

Peer reviewed

Koziol, Natalie A.; Goodrich, J. Marc; Yoon, HyeonJin – Educational and Psychological Measurement, 2022

Differential item functioning (DIF) is often used to examine validity evidence of alternate form test accommodations. Unfortunately, traditional approaches for evaluating DIF are prone to selection bias. This article proposes a novel DIF framework that capitalizes on regression discontinuity design analysis to control for selection bias. A…

Descriptors: Regression (Statistics), Item Analysis, Validity, Testing Accommodations

Evaluation of Language and Teaching Skill Domains for International Teaching Assistants: An Approach Based on Invariant Measurement

Peer reviewed

Chang, Heesun – Language Assessment Quarterly, 2022

Drawing on the framework of invariant measurement from Rasch measurement theory, the purpose of this study is to psychometrically evaluate the 20 language and teaching skill domains of the International Teaching Assistant (ITA) Test using the many-facet Rasch model and to empirically explore performance differences between females and males in…

Descriptors: Teaching Assistants, Grammar, Second Language Learning, Second Language Instruction

Author

Avi Feller	1
Chang, Heesun	1
Chris Holmes	1
Demarest, Leila	1
Dennis M. Feehan	1
Forrest W. Crawford	1
Goodrich, J. Marc	1
Huang, Feifei	1
Koziol, Natalie A.	1
Langer, Arnim	1
Lee, Won-Chan	1
Li, Yixing	1
Li, Zonglong	1
Nathaniel Josephs	1
Oscar Clivio	1
Teck Kiang Tan	1
Wallin, Gabriel	1
Wang, Shaojie	1
Wiberg, Marie	1
Yoon, HyeonJin	1
Yu, Sufang	1
Zhang, Minqiang	1