Auxiliary variables and congeniality in multiple imputation

Meng’s concept of congeniality in multiple imputation (MI) is, I think, a tricky one (for me anyway!). Loosely speaking, congeniality is about whether the imputation and analysis models make different assumptions about the data. Meng gave a definition in his 1994 paper, but I prefer the one given in a more recent paper by Xie and Meng, which is what Rachael Hughes and I used in our paper this year on different methods of combining bootstrapping with MI. In words (see the papers for the same in equations; a rough sketch in notation also follows the list below), it is that there exists a Bayesian model for the data such that:

  • given complete/full data, the posterior mean of the parameter of interest matches the point estimate given by fitting our analysis model of interest to that data, and the posterior variance matches the variance estimator calculated by our analysis model fit.
  • the conditional distribution of the missing data given the observed data in this Bayesian model matches that used by our imputation model.
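
Very roughly, and in my own notation rather than Xie and Meng’s: writing Y = (Y_obs, Y_mis) for the full data, theta-hat(Y) and V-hat(Y) for the analysis model’s point and variance estimators, and f_imp for the imputation model’s conditional distribution, the two conditions amount to something like:

    % Rough sketch of the congeniality conditions (my notation, not Xie and Meng's),
    % where subscript B denotes the assumed-to-exist Bayesian model:
    E_B(\theta \mid Y) = \hat{\theta}(Y), \qquad
    \mathrm{Var}_B(\theta \mid Y) = \hat{V}(Y), \qquad
    f_B(Y_{\mathrm{mis}} \mid Y_{\mathrm{obs}}) = f_{\mathrm{imp}}(Y_{\mathrm{mis}} \mid Y_{\mathrm{obs}})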

If the imputation and analysis models are congenial and correctly specified, Rubin’s variance estimator is (asymptotically) unbiased for the true repeated sampling variance of the MI point estimator(s).
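
For reference, if theta-hat_m and V-hat_m denote the point estimate and its variance estimate from the m-th of M imputed datasets, Rubin’s rules combine them (for a scalar parameter) as:

    % Rubin's rules: MI point estimate and total variance
    \hat{\theta}_{MI} = \frac{1}{M}\sum_{m=1}^{M}\hat{\theta}_m, \qquad
    T = \bar{W} + \left(1 + \frac{1}{M}\right)B,
    % where \bar{W} = \frac{1}{M}\sum_{m=1}^{M}\hat{V}_m is the within-imputation variance
    % and B = \frac{1}{M-1}\sum_{m=1}^{M}(\hat{\theta}_m - \hat{\theta}_{MI})^2 is the
    % between-imputation variance.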

One of the potentially useful features of MI is that we can include variables at the imputation stage which we then don’t use in the analysis model. Including such auxiliary variables in the imputation model can make the MAR assumption more plausible when the auxiliary variable is associated with the probability of missingness, and can increase efficiency, depending on how strongly it is correlated with the variable(s) being imputed. A nice paper (among many) on the potential of including auxiliary variables in MI is Hardt et al 2012. In this post, I’ll consider whether including auxiliary variables in the imputation model leads to uncongeniality. The post was prompted by a discussion earlier in the year with my colleague Paul von Hippel.
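
To make this concrete, here is a minimal sketch in R using the mice package and simulated data (the variable names and data generating mechanism are purely illustrative): the auxiliary variable z is used when imputing y, but does not appear in the analysis model.

    # Minimal sketch: auxiliary variable z in the imputation model only.
    # Simulated data; variable names and mechanism are illustrative.
    library(mice)
    set.seed(6224)
    n <- 500
    x <- rnorm(n)
    z <- rnorm(n)                        # auxiliary variable, correlated with y
    y <- x + z + rnorm(n)
    y[runif(n) < plogis(z)] <- NA        # y is MAR: missingness depends on z
    dat <- data.frame(y = y, x = x, z = z)

    imp <- mice(dat, m = 10, printFlag = FALSE)   # z is used when imputing y
    fits <- with(imp, lm(y ~ x))                  # analysis model omits z
    summary(pool(fits))                           # combine using Rubin's rules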


Convergence plots for smcfcs in R

The smcfcs package in R imputes missing covariate values compatibly (congenially) with the user’s specified outcome or substantive model. Just like the regular chained equations (fully conditional specification) multiple imputation method, smcfcs is an iterative procedure, and users should check that they have run enough iterations for the process to have (hopefully) converged to its stationary distribution. The smcfcs package returns the parameter estimates of the outcome/substantive model from each imputed dataset and each iteration within these, but it leaves the user to figure out how to plot these appropriately.
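
As an illustration, here is a minimal sketch in R with simulated data. It assumes the fitted object’s smCoefIter element is an array indexed by imputation, parameter and iteration (check this against the package documentation); the data generating mechanism is purely illustrative.

    # Minimal sketch: trace plots of the substantive model parameters across
    # iterations, one line per imputation. Simulated data for illustration.
    library(smcfcs)
    set.seed(7251)
    n <- 1000
    z <- rnorm(n)
    x <- z + rnorm(n)
    y <- x + z + rnorm(n)
    x[runif(n) < 0.5] <- NA              # make x partially observed
    dat <- data.frame(y = y, z = z, x = x)

    # impute x compatibly with the substantive model y ~ x + z
    fit <- smcfcs(dat, smtype = "lm", smformula = "y~x+z",
                  method = c("", "", "norm"), m = 5, numit = 20)

    # assumed structure: smCoefIter indexed by (imputation, parameter, iteration)
    coefs <- fit$smCoefIter
    for (p in seq_len(dim(coefs)[2])) {
      matplot(t(coefs[, p, ]), type = "l", lty = 1,
              xlab = "Iteration", ylab = paste("Parameter", p))
    }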


What does correlation in a Bland-Altman plot mean?

The Bland-Altman plot is a very popular approach for analysing data from a method agreement study, which I was teaching students about today. We have measurements of a sample of subjects taken using one measurement technique or method, and a second measurement on each subject taken using a new technique or method. The objective is to see how closely the measurements from the two methods agree. If they agree closely, we could use the new method, which may be cheaper, easier or less invasive, rather than the old method. The Bland-Altman plot plots the pairwise differences between the measurements against the pairwise averages. Sometimes one sees a correlation between the pairwise differences and averages. What is the interpretation of such a correlation?
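
As an illustration, here is a minimal sketch in base R with simulated data, in which the new method measures on a different scale to the old one, which is one way such a correlation can arise (the data generating mechanism is purely illustrative):

    # Minimal sketch: Bland-Altman plot and the correlation between the
    # pairwise differences and averages. Simulated data for illustration.
    set.seed(321)
    n <- 100
    truth <- rnorm(n, mean = 100, sd = 15)
    method1 <- truth + rnorm(n, sd = 5)             # old method
    method2 <- 0.8 * truth + 20 + rnorm(n, sd = 5)  # new method, different scale

    averages <- (method1 + method2) / 2
    differences <- method1 - method2

    plot(averages, differences,
         xlab = "Average of the two measurements",
         ylab = "Difference (method 1 minus method 2)")
    abline(h = mean(differences), lty = 2)          # mean difference (bias)
    abline(h = mean(differences) + c(-1.96, 1.96) * sd(differences),
           lty = 3)                                 # 95% limits of agreement
    cor(differences, averages)                      # the correlation in question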
