ERIC - Search Results

Publication Date

In 2025	1
Since 2024	1
Since 2021 (last 5 years)	5
Since 2016 (last 10 years)	13
Since 2006 (last 20 years)	25

Descriptor

Error of Measurement	27
Models	27
Probability	27
Computation	8
Comparative Analysis	7
Item Response Theory	7
Measurement	7
Statistical Analysis	7
Bayesian Statistics	6
Simulation	6
Regression (Statistics)	5
Statistical Distributions	5
Foreign Countries	4
Maximum Likelihood Statistics	4
Tests	4
Academic Achievement	3
Correlation	3
Data Analysis	3
Evaluation Methods	3
Goodness of Fit	3
Guidelines	3
Inferences	3
Measurement Techniques	3
Prediction	3
Predictor Variables	3
More ▼

Source

Sociological Methods &…	5
Educational and Psychological…	4
Journal of Educational and…	3
Psychometrika	2
Advances in Physiology…	1
Grantee Submission	1
International Journal of…	1
Journal of Educational Data…	1
Journal of Educational…	1
National Center for Research…	1
Open Learning	1
Psicologica: International…	1
Psychological Reports	1
Psychological Review	1
Research Synthesis Methods	1
Social Indicators Research	1
Society for Research on…	1
More ▼

Publication Type

Journal Articles	24
Reports - Research	17
Reports - Evaluative	6
Reports - Descriptive	3
Opinion Papers	1

Education Level

Elementary Secondary Education	1
Higher Education	1
Postsecondary Education	1

Audience

Researchers

Location

United Kingdom	2
Europe	1
United States	1

Laws, Policies, & Programs

Assessments and Surveys

British Household Panel Survey	1
Early Childhood Longitudinal…	1
Schools and Staffing Survey…	1
Trends in International…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 27 results Save | Export

Linear Probability Model Revisited: Why It Works and How It Should Be Specified

Peer reviewed

Direct link

Myoung-jae Lee; Goeun Lee; Jin-young Choi – Sociological Methods & Research, 2025

A linear model is often used to find the effect of a binary treatment D on a noncontinuous outcome Y with covariates X. Particularly, a binary Y gives the popular "linear probability model (LPM)," but the linear model is untenable if X contains a continuous regressor. This raises the question: what kind of treatment effect does the…

Descriptors: Probability, Least Squares Statistics, Regression (Statistics), Causal Models

Model Misspecification and Robustness of Observed-Score Test Equating Using Propensity Scores

Peer reviewed

Direct link

Wallin, Gabriel; Wiberg, Marie – Journal of Educational and Behavioral Statistics, 2023

This study explores the usefulness of covariates on equating test scores from nonequivalent test groups. The covariates are captured by an estimated propensity score, which is used as a proxy for latent ability to balance the test groups. The objective is to assess the sensitivity of the equated scores to various misspecifications in the…

Descriptors: Models, Error of Measurement, Robustness (Statistics), Equated Scores

A Note on a Reformulation of the KHB Method

Peer reviewed

Direct link

Breen, Richard; Bernt Karlson, Kristian; Holm, Anders – Sociological Methods & Research, 2021

The Karlson-Holm-Breen (KHB) method has rapidly become popular as a way of separating the impact of confounding from rescaling when comparing conditional and unconditional parameter estimates in nonlinear probability models such as the logit and probit. In this note, we show that the same estimates can be obtained in a somewhat different way to…

Descriptors: Probability, Models, Computation, Comparative Analysis

A Simple Model to Determine the Efficient Duration of Exams

Peer reviewed

Direct link

Ellis, Jules L. – Educational and Psychological Measurement, 2021

This study develops a theoretical model for the costs of an exam as a function of its duration. Two kind of costs are distinguished: (1) the costs of measurement errors and (2) the costs of the measurement. Both costs are expressed in time of the student. Based on a classical test theory model, enriched with assumptions on the context, the costs…

Descriptors: Test Length, Models, Error of Measurement, Measurement

BIC Extensions for Order-Constrained Model Selection

Peer reviewed

Direct link

Mulder, J.; Raftery, A. E. – Sociological Methods & Research, 2022

The Schwarz or Bayesian information criterion (BIC) is one of the most widely used tools for model comparison in social science research. The BIC, however, is not suitable for evaluating models with order constraints on the parameters of interest. This article explores two extensions of the BIC for evaluating order-constrained models, one where a…

Descriptors: Models, Social Science Research, Programming Languages, Bayesian Statistics

An Updated Dynamic Bayesian Forecasting Model for the US Presidential Election

Peer reviewed
PDF on ERIC

Download full text

Direct link

Heidemanns, Merlin; Gelman, Andrew; Morris, G. Elliott – Grantee Submission, 2020

During modern general election cycles, information to forecast the electoral outcome is plentiful. So-called fundamentals like economic growth provide information early in the cycle. Trial-heat polls become informative closer to Election Day. Our model builds on (Linzer, 2013) and is implemented in Stan (Team, 2020). We improve on the estimation…

Descriptors: Evaluation, Bayesian Statistics, Elections, Presidents

Estimation of Expected Fisher Information for IRT Models

Peer reviewed

Direct link

Monroe, Scott – Journal of Educational and Behavioral Statistics, 2019

In item response theory (IRT) modeling, the Fisher information matrix is used for numerous inferential procedures such as estimating parameter standard errors, constructing test statistics, and facilitating test scoring. In principal, these procedures may be carried out using either the expected information or the observed information. However, in…

Descriptors: Item Response Theory, Error of Measurement, Scoring, Inferences

Kappa and Rater Accuracy: Paradigms and Parameters

Peer reviewed

Direct link

Conger, Anthony J. – Educational and Psychological Measurement, 2017

Drawing parallels to classical test theory, this article clarifies the difference between rater accuracy and reliability and demonstrates how category marginal frequencies affect rater agreement and Cohen's kappa. Category assignment paradigms are developed: comparing raters to a standard (index) versus comparing two raters to one another…

Descriptors: Interrater Reliability, Evaluators, Accuracy, Statistical Analysis

Asymptotic Standard Errors of Observed-Score Equating with Polytomous IRT Models

Peer reviewed

Direct link

Andersson, Björn – Journal of Educational Measurement, 2016

In observed-score equipercentile equating, the goal is to make scores on two scales or tests measuring the same construct comparable by matching the percentiles of the respective score distributions. If the tests consist of different items with multiple categories for each item, a suitable model for the responses is a polytomous item response…

Descriptors: Equated Scores, Item Response Theory, Error of Measurement, Tests

Using Dirichlet Processes for Modeling Heterogeneous Treatment Effects across Sites

Peer reviewed
PDF on ERIC

Download full text

Miratrix, Luke; Feller, Avi; Pillai, Natesh; Pati, Debdeep – Society for Research on Educational Effectiveness, 2016

Modeling the distribution of site level effects is an important problem, but it is also an incredibly difficult one. Current methods rely on distributional assumptions in multilevel models for estimation. There it is hoped that the partial pooling of site level estimates with overall estimates, designed to take into account individual variation as…

Descriptors: Probability, Models, Statistical Distributions, Bayesian Statistics

A Bayesian Missing Data Framework for Generalized Multiple Outcome Mixed Treatment Comparisons

Peer reviewed

Direct link

Hong, Hwanhee; Chu, Haitao; Zhang, Jing; Carlin, Bradley P. – Research Synthesis Methods, 2016

Bayesian statistical approaches to mixed treatment comparisons (MTCs) are becoming more popular because of their flexibility and interpretability. Many randomized clinical trials report multiple outcomes with possible inherent correlations. Moreover, MTC data are typically sparse (although richer than standard meta-analysis, comparing only two…

Descriptors: Bayesian Statistics, Meta Analysis, Outcomes of Treatment, Comparative Analysis

A Unified Approach to Measurement Error and Missing Data: Details and Extensions

Peer reviewed

Direct link

Blackwell, Matthew; Honaker, James; King, Gary – Sociological Methods & Research, 2017

We extend a unified and easy-to-use approach to measurement error and missing data. In our companion article, Blackwell, Honaker, and King give an intuitive overview of the new technique, along with practical suggestions and empirical applications. Here, we offer more precise technical details, more sophisticated measurement error model…

Descriptors: Error of Measurement, Correlation, Simulation, Bayesian Statistics

Design Effects of Multilevel Estimates from National Probability Samples

Peer reviewed
PDF on ERIC

Download full text

Direct link

Stapleton, Laura M.; Kang, Yoonjeong – Sociological Methods & Research, 2018

This research empirically evaluates data sets from the National Center for Education Statistics (NCES) for design effects of ignoring the sampling design in weighted two-level analyses. Currently, researchers may ignore the sampling design beyond the levels that they model which might result in incorrect inferences regarding hypotheses due to…

Descriptors: Probability, Hierarchical Linear Modeling, Sampling, Inferences

Propensity Score Analysis with Fallible Covariates: A Note on a Latent Variable Modeling Approach

Peer reviewed

Direct link

Raykov, Tenko – Educational and Psychological Measurement, 2012

A latent variable modeling approach that permits estimation of propensity scores in observational studies containing fallible independent variables is outlined, with subsequent examination of treatment effect. When at least one covariate is measured with error, it is indicated that the conventional propensity score need not possess the desirable…

Descriptors: Computation, Probability, Error of Measurement, Observation

Assessment of Person Fit for Mixed-Format Tests

Peer reviewed

Direct link

Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2015

Person-fit assessment may help the researcher to obtain additional information regarding the answering behavior of persons. Although several researchers examined person fit, there is a lack of research on person-fit assessment for mixed-format tests. In this article, the lz statistic and the ?2 statistic, both of which have been used for tests…

Descriptors: Test Format, Goodness of Fit, Item Response Theory, Bayesian Statistics

Previous Page | Next Page »

Pages: 1 | 2

Monroe, Scott	2
Andersson, Björn	1
Bernt Karlson, Kristian	1
Birnbaum, Michael H.	1
Blackwell, Matthew	1
Breen, Richard	1
Cai, Li	1
Calvert, Carol Elaine	1
Carlin, Bradley P.	1
Chu, Haitao	1
Conger, Anthony J.	1
Culpepper, Steven Andrew	1
Curran-Everett, Douglas	1
Dirkzwager, Arie	1
Draxler, Clemens	1
Ellis, Jules L.	1
Feller, Avi	1
Ferrando, Pere J.	1
Gelman, Andrew	1
Goeun Lee	1
Heidemanns, Merlin	1
Holm, Anders	1
Honaker, James	1
Hong, Hwanhee	1
Jin-young Choi	1
More ▼