Publication Date
In 2025 | 0 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 3 |
Since 2016 (last 10 years) | 6 |
Since 2006 (last 20 years) | 8 |
Descriptor
Error of Measurement | 8 |
Guidelines | 8 |
Sample Size | 8 |
Simulation | 6 |
Test Items | 4 |
Comparative Analysis | 3 |
Effect Size | 3 |
Item Analysis | 3 |
Item Response Theory | 3 |
Models | 3 |
Test Length | 3 |
More ▼ |
Source
Educational and Psychological… | 2 |
Journal of Educational… | 2 |
Journal of Speech, Language,… | 1 |
Language Teaching Research | 1 |
Measurement and Evaluation in… | 1 |
Sociological Methods &… | 1 |
Author
Andersson, Björn | 1 |
Dennis M. Feehan | 1 |
Forrest W. Crawford | 1 |
French, Brian F. | 1 |
Gebski, Val | 1 |
Goodrich, J. Marc | 1 |
Huang, Feifei | 1 |
Jones, Mark | 1 |
Koziol, Natalie A. | 1 |
Lee, Won-Chan | 1 |
Li, Yixing | 1 |
More ▼ |
Publication Type
Journal Articles | 8 |
Reports - Research | 6 |
Information Analyses | 1 |
Reports - Descriptive | 1 |
Reports - Evaluative | 1 |
Education Level
Audience
Location
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Nathaniel Josephs; Dennis M. Feehan; Forrest W. Crawford – Sociological Methods & Research, 2024
The network scale-up method (NSUM) is a survey-based method for estimating the number of individuals in a hidden or hard-to-reach subgroup of a general population. In NSUM surveys, sampled individuals report how many others they know in the subpopulation of interest (e.g. "How many sex workers do you know?") and how many others they know…
Descriptors: Sample Size, Surveys, Population Groups, Epidemiology
Wang, Shaojie; Zhang, Minqiang; Lee, Won-Chan; Huang, Feifei; Li, Zonglong; Li, Yixing; Yu, Sufang – Journal of Educational Measurement, 2022
Traditional IRT characteristic curve linking methods ignore parameter estimation errors, which may undermine the accuracy of estimated linking constants. Two new linking methods are proposed that take into account parameter estimation errors. The item- (IWCC) and test-information-weighted characteristic curve (TWCC) methods employ weighting…
Descriptors: Item Response Theory, Error of Measurement, Accuracy, Monte Carlo Methods
Koziol, Natalie A.; Goodrich, J. Marc; Yoon, HyeonJin – Educational and Psychological Measurement, 2022
Differential item functioning (DIF) is often used to examine validity evidence of alternate form test accommodations. Unfortunately, traditional approaches for evaluating DIF are prone to selection bias. This article proposes a novel DIF framework that capitalizes on regression discontinuity design analysis to control for selection bias. A…
Descriptors: Regression (Statistics), Item Analysis, Validity, Testing Accommodations
Willse, John T. – Measurement and Evaluation in Counseling and Development, 2017
This article provides a brief introduction to the Rasch model. Motivation for using Rasch analyses is provided. Important Rasch model concepts and key aspects of result interpretation are introduced, with major points reinforced using a simulation demonstration. Concrete guidelines are provided regarding sample size and the evaluation of items.
Descriptors: Item Response Theory, Test Results, Test Interpretation, Simulation
Andersson, Björn – Journal of Educational Measurement, 2016
In observed-score equipercentile equating, the goal is to make scores on two scales or tests measuring the same construct comparable by matching the percentiles of the respective score distributions. If the tests consist of different items with multiple categories for each item, a suitable model for the responses is a polytomous item response…
Descriptors: Equated Scores, Item Response Theory, Error of Measurement, Tests
Lindstromberg, Seth – Language Teaching Research, 2016
This article reviews all (quasi)experimental studies appearing in the first 19 volumes (1997-2015) of "Language Teaching Research" (LTR). Specifically, it provides an overview of how statistical analyses were conducted in these studies and of how the analyses were reported. The overall conclusion is that there has been a tight adherence…
Descriptors: Meta Analysis, Second Language Learning, Second Language Instruction, Guidelines
French, Brian F.; Maller, Susan J. – Educational and Psychological Measurement, 2007
Two unresolved implementation issues with logistic regression (LR) for differential item functioning (DIF) detection include ability purification and effect size use. Purification is suggested to control inaccuracies in DIF detection as a result of DIF items in the ability estimate. Additionally, effect size use may be beneficial in controlling…
Descriptors: Effect Size, Test Bias, Guidelines, Error of Measurement
Jones, Mark; Onslow, Mark; Packman, Ann; Gebski, Val – Journal of Speech, Language, and Hearing Research, 2006
Purpose: The purpose of this study was to develop guidelines for the statistical analysis of percentage of syllables stuttered (%SS) data in stuttering research. Method; Data on %SS from various independent sources were used to develop a statistical model to describe this type of data. On the basis of this model, %SS data were simulated with…
Descriptors: Statistical Analysis, Guidelines, Sample Size, Stuttering