Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 4 |
Since 2016 (last 10 years) | 12 |
Since 2006 (last 20 years) | 22 |
Descriptor
Error of Measurement | 24 |
Probability | 24 |
Simulation | 24 |
Bayesian Statistics | 7 |
Computation | 7 |
Statistical Analysis | 7 |
Evaluation Methods | 6 |
Item Response Theory | 6 |
Models | 6 |
Comparative Analysis | 5 |
Monte Carlo Methods | 5 |
More ▼ |
Source
Author
Blackwell, Matthew | 2 |
Honaker, James | 2 |
King, Gary | 2 |
Sijtsma, Klaas | 2 |
Tijmstra, Jesper | 2 |
Andersson, Björn | 1 |
Austin, Peter C. | 1 |
Barr, James | 1 |
Beretvas, S. Natasha | 1 |
Bixi Zhang | 1 |
Bolsinova, Maria | 1 |
More ▼ |
Publication Type
Journal Articles | 14 |
Reports - Research | 14 |
Reports - Evaluative | 4 |
Dissertations/Theses -… | 3 |
Reports - Descriptive | 2 |
Guides - Non-Classroom | 1 |
Speeches/Meeting Papers | 1 |
Education Level
Secondary Education | 1 |
Audience
Researchers | 3 |
Location
Laws, Policies, & Programs
Assessments and Surveys
Program for International… | 1 |
What Works Clearinghouse Rating
Bixi Zhang; Spyros Konstantopoulos – Society for Research on Educational Effectiveness, 2022
Background: Meta-analysis refers to the statistical methods employed to combine results of several empirical studies in a topic of interest (Hedges & Olkin, 1985). Meta-analysis is often included in literature review studies to quantitatively analyze data from a collection of studies (Valentine et al., 2010). The statistical power of a…
Descriptors: Meta Analysis, Probability, Effect Size, Research Methodology
Kim, Stella Yun; Lee, Won-Chan – Applied Measurement in Education, 2023
This study evaluates various scoring methods including number-correct scoring, IRT theta scoring, and hybrid scoring in terms of scale-score stability over time. A simulation study was conducted to examine the relative performance of five scoring methods in terms of preserving the first two moments of scale scores for a population in a chain of…
Descriptors: Scoring, Comparative Analysis, Item Response Theory, Simulation
Yongyun Shin; Stephen W. Raudenbush – Grantee Submission, 2023
We consider two-level models where a continuous response R and continuous covariates C are assumed missing at random. Inferences based on maximum likelihood or Bayes are routinely made by estimating their joint normal distribution from observed data R[subscript obs] and C[subscript obs]. However, if the model for R given C includes random…
Descriptors: Maximum Likelihood Statistics, Hierarchical Linear Modeling, Error of Measurement, Statistical Distributions
Deke, John; Finucane, Mariel; Thal, Daniel – National Center for Education Evaluation and Regional Assistance, 2022
BASIE is a framework for interpreting impact estimates from evaluations. It is an alternative to null hypothesis significance testing. This guide walks researchers through the key steps of applying BASIE, including selecting prior evidence, reporting impact estimates, interpreting impact estimates, and conducting sensitivity analyses. The guide…
Descriptors: Bayesian Statistics, Educational Research, Data Interpretation, Hypothesis Testing
Greifer, Noah – ProQuest LLC, 2018
There has been some research in the use of propensity scores in the context of measurement error in the confounding variables; one recommended method is to generate estimates of the mis-measured covariate using a latent variable model, and to use those estimates (i.e., factor scores) in place of the covariate. I describe a simulation study…
Descriptors: Evaluation Methods, Probability, Scores, Statistical Analysis
Monroe, Scott – Journal of Educational and Behavioral Statistics, 2019
In item response theory (IRT) modeling, the Fisher information matrix is used for numerous inferential procedures such as estimating parameter standard errors, constructing test statistics, and facilitating test scoring. In principal, these procedures may be carried out using either the expected information or the observed information. However, in…
Descriptors: Item Response Theory, Error of Measurement, Scoring, Inferences
Tijmstra, Jesper; Bolsinova, Maria; Liaw, Yuan-Ling; Rutkowski, Leslie; Rutkowski, David – Journal of Educational Measurement, 2020
Although the root-mean squared deviation (RMSD) is a popular statistical measure for evaluating country-specific item-level misfit (i.e., differential item functioning [DIF]) in international large-scale assessment, this paper shows that its sensitivity to detect misfit may depend strongly on the proficiency distribution of the considered…
Descriptors: Test Items, Goodness of Fit, Probability, Accuracy
Blackwell, Matthew; Honaker, James; King, Gary – Sociological Methods & Research, 2017
Although social scientists devote considerable effort to mitigating measurement error during data collection, they often ignore the issue during data analysis. And although many statistical methods have been proposed for reducing measurement error-induced biases, few have been widely used because of implausible assumptions, high levels of model…
Descriptors: Error of Measurement, Monte Carlo Methods, Data Collection, Simulation
Andersson, Björn – Journal of Educational Measurement, 2016
In observed-score equipercentile equating, the goal is to make scores on two scales or tests measuring the same construct comparable by matching the percentiles of the respective score distributions. If the tests consist of different items with multiple categories for each item, a suitable model for the responses is a polytomous item response…
Descriptors: Equated Scores, Item Response Theory, Error of Measurement, Tests
Miratrix, Luke; Feller, Avi; Pillai, Natesh; Pati, Debdeep – Society for Research on Educational Effectiveness, 2016
Modeling the distribution of site level effects is an important problem, but it is also an incredibly difficult one. Current methods rely on distributional assumptions in multilevel models for estimation. There it is hoped that the partial pooling of site level estimates with overall estimates, designed to take into account individual variation as…
Descriptors: Probability, Models, Statistical Distributions, Bayesian Statistics
Hong, Hwanhee; Chu, Haitao; Zhang, Jing; Carlin, Bradley P. – Research Synthesis Methods, 2016
Bayesian statistical approaches to mixed treatment comparisons (MTCs) are becoming more popular because of their flexibility and interpretability. Many randomized clinical trials report multiple outcomes with possible inherent correlations. Moreover, MTC data are typically sparse (although richer than standard meta-analysis, comparing only two…
Descriptors: Bayesian Statistics, Meta Analysis, Outcomes of Treatment, Comparative Analysis
Blackwell, Matthew; Honaker, James; King, Gary – Sociological Methods & Research, 2017
We extend a unified and easy-to-use approach to measurement error and missing data. In our companion article, Blackwell, Honaker, and King give an intuitive overview of the new technique, along with practical suggestions and empirical applications. Here, we offer more precise technical details, more sophisticated measurement error model…
Descriptors: Error of Measurement, Correlation, Simulation, Bayesian Statistics
Tipton, Elizabeth – Society for Research on Educational Effectiveness, 2014
Replication studies allow for making comparisons and generalizations regarding the effectiveness of an intervention across different populations, versions of a treatment, settings and contexts, and outcomes. One method for making these comparisons across many replication studies is through the use of meta-analysis. A recent innovation in…
Descriptors: Replication (Evaluation), Robustness (Statistics), Meta Analysis, Regression (Statistics)
Mashiku, Alinda K. – ProQuest LLC, 2013
The current Situational Space Awareness (SSA) is faced with a huge task of tracking the increasing number of space objects. The tracking of space objects requires frequent and accurate monitoring for orbit maintenance and collision avoidance using methods for statistical orbit determination. Statistical orbit determination enables us to obtain…
Descriptors: Statistical Analysis, Space Sciences, Probability, Prediction
Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2015
Person-fit assessment may help the researcher to obtain additional information regarding the answering behavior of persons. Although several researchers examined person fit, there is a lack of research on person-fit assessment for mixed-format tests. In this article, the lz statistic and the ?2 statistic, both of which have been used for tests…
Descriptors: Test Format, Goodness of Fit, Item Response Theory, Bayesian Statistics
Previous Page | Next Page »
Pages: 1 | 2