Home  >>  Archives  >>  Volume 15 Number 2  >>  st0387

The Stata Journal
Volume 15 Number 2: pp. 437-456



Subscribe to the Stata Journal
cover

Multiple imputation of covariates by substantive-model compatible fully conditional specification

Jonathan W. Bartlett
Department of Medical Statistics
London School of Hygiene and Tropical Medicine
London, UK
[email protected]
Tim P. Morris
MRC Clinical Trials Unit at UCL
Institute of Clinical Trials and Methodology
and
London School of Hygiene and Tropical Medicine
London, UK
[email protected]
Abstract.  Multiple imputation is a practical, principled approach to handling missing data. When used to impute missing values in covariates of regression models, imputation models may be misspecified if they are not compatible with the substantive model of interest for the outcome. In this article, we introduce the smcfcs command, which imputes covariates by substantive-model compatible fully conditional specification. This modifies the popular fully conditional specification or chained-equations approach to multiple imputation by imputing each covariate compatibly with a user-specified substantive model. We compare the smcfcs command with standard fully conditional specification imputation using mi impute chained in a simulation study and illustrative analysis of data from a study investigating time to tumor recurrence in breast cancer.
Terms of use     View this article (PDF)

View all articles by these authors: Jonathan W. Bartlett, Tim P. Morris

View all articles with these keywords: smcfcs, multiple imputation, substantive model compatible, congenial, interactions, nonlinearities

Download citation: BibTeX  RIS

Download citation and abstract: BibTeX  RIS