Teck Kiang Tan – Practical Assessment, Research & Evaluation, 2024
Procedures for testing factorial invariance to validate a construct are well developed and ensure that the construct can be used reliably across groups for comparison and analysis, yet they are mainly restricted to the frequentist approach. This motivates an update to incorporate the growing Bayesian approach for carrying out the Bayesian…
Descriptors: Bayesian Statistics, Factor Analysis, Programming Languages, Reliability
Oscar Clivio; Avi Feller; Chris Holmes – Grantee Submission, 2024
Reweighting a distribution to minimize a distance to a target distribution is a powerful and flexible strategy for estimating a wide range of causal effects, but can be challenging in practice because optimal weights typically depend on knowledge of the underlying data generating process. In this paper, we focus on design-based weights, which do…
Descriptors: Evaluation Methods, Causal Models, Error of Measurement, Guidelines
Wallin, Gabriel; Wiberg, Marie – Journal of Educational and Behavioral Statistics, 2023
This study explores the usefulness of covariates on equating test scores from nonequivalent test groups. The covariates are captured by an estimated propensity score, which is used as a proxy for latent ability to balance the test groups. The objective is to assess the sensitivity of the equated scores to various misspecifications in the…
Descriptors: Models, Error of Measurement, Robustness (Statistics), Equated Scores
Demarest, Leila; Langer, Arnim – Sociological Methods & Research, 2022
While conflict event data sets are increasingly used in contemporary conflict research, important concerns persist regarding the quality of the collected data. Such concerns are not necessarily new. Yet, because the methodological debate and evidence on potential errors remain scattered across different subdisciplines of the social sciences, there is…
Descriptors: Guidelines, Research Methodology, Conflict, Social Science Research
Nathaniel Josephs; Dennis M. Feehan; Forrest W. Crawford – Sociological Methods & Research, 2024
The network scale-up method (NSUM) is a survey-based method for estimating the number of individuals in a hidden or hard-to-reach subgroup of a general population. In NSUM surveys, sampled individuals report how many others they know in the subpopulation of interest (e.g. "How many sex workers do you know?") and how many others they know…
Descriptors: Sample Size, Surveys, Population Groups, Epidemiology
Wang, Shaojie; Zhang, Minqiang; Lee, Won-Chan; Huang, Feifei; Li, Zonglong; Li, Yixing; Yu, Sufang – Journal of Educational Measurement, 2022
Traditional IRT characteristic curve linking methods ignore parameter estimation errors, which may undermine the accuracy of estimated linking constants. Two new linking methods are proposed that take into account parameter estimation errors. The item- (IWCC) and test-information-weighted characteristic curve (TWCC) methods employ weighting…
Descriptors: Item Response Theory, Error of Measurement, Accuracy, Monte Carlo Methods
Koziol, Natalie A.; Goodrich, J. Marc; Yoon, HyeonJin – Educational and Psychological Measurement, 2022
Differential item functioning (DIF) is often used to examine validity evidence of alternate form test accommodations. Unfortunately, traditional approaches for evaluating DIF are prone to selection bias. This article proposes a novel DIF framework that capitalizes on regression discontinuity design analysis to control for selection bias. A…
Descriptors: Regression (Statistics), Item Analysis, Validity, Testing Accommodations
Chang, Heesun – Language Assessment Quarterly, 2022
Drawing on the framework of invariant measurement from Rasch measurement theory, the purpose of this study is to psychometrically evaluate the 20 language and teaching skill domains of the International Teaching Assistant (ITA) Test using the many-facet Rasch model and to empirically explore performance differences between females and males in…
Descriptors: Teaching Assistants, Grammar, Second Language Learning, Second Language Instruction