Missing data – Page 2 – The Stats Geek

Variance estimation for reference-based multiple imputation – the debate continues

April 18, 2024 by Jonathan Bartlett

Reference-based imputation methods have become a popular approach to handling missing data in clinical trials after patients experience what is nowadays referred to as an intercurrent event. Roughly speaking, these approaches impute such missing data in one treatment group (e.g. those in the active treatment group) based to some extent on estimates of parameters from another treatment group (e.g. the control treatment group). The approach was proposed in a paper by my colleague James Carpenter and others in 2013.

Multiple imputation and its application – 2nd edition published

September 7, 2023 by Jonathan Bartlett

I am delighted to write this blog post announcing the publication of the second edition of the book ‘Multiple Imputation and its Application’, published by Wiley, and which I am a co-author along with colleagues James Carpenter, Tim Morris, Angela Wood, Matteo Quartagno, and Mike Kenward.

Key additions in the second edition are:

in depth discussion of congeniality and compatibility, and the practical implications of the theory for these for data analysts
an updated chapter on performing imputation with derived variables, such as interactions, non-linear effects, sum scores, splines
expanded chapter on MI with survival data, including imputing missing covariates in Cox models and MI for case-cohort and nested case-control studies
new chapters on multiple imputation for / in the context of:
- prognostic models
- measurement error and misclassification
- causal inference
- using MI in practice
practical and theoretical exercises in each chapter

We hope it will be useful for those handling missing data by multiple imputation in their analyses, particularly in regards to thinking about how to use it in a way which accommodates the various complexities that are often present in statistical analyses.

The book should now be available “in all good bookshops”, as they say. You can find it at [amazon text=&asin=1119756081&text=Amazon] (please note I may receive a commission if you subsequently purchase from Amazon after clicking this link).

Multiple imputation for missing baseline covariates in discrete time survival analysis

January 10, 2023 by Jonathan Bartlett

A while ago I got involved in a project led by Anna-Carolina Haensch and Bernd Weiß investigating multiple imputation methods for baseline covariates in discrete time survival analysis. The work has recently been published open access in the journal Sociological Methods & Research. The paper investigates a variety of different multiple imputation approaches. My main contribution was the extension of the substantive model compatible fully conditional specification (smcfcs) approach for multiple imputation to the discrete time survival model setting, and extending the functionality of the smcfcs package in R to incorporate this. In this short post I’ll give a quick demonstration of this functionality.