# Causal interpretation of the hazard ratio from RCTs when proportional hazards holds

In 2015 I wrote a post about the causal interpretation of hazard ratios estimated in randomised trials, following a paper by Aalen and colleagues. One of the arguments made in that paper was that the hazard ratio does not have a valid interpretation as a causal effect in this setting, even when the proportional hazards assumption holds:

This makes it unclear what the hazard ratio computed for a randomized survival study really means. Note, that this has nothing to do with the fit of the Cox model. The model may fit perfectly in the marginal case with X as the only covariate, but the present problem remains.

With recent discussions on estimands in light of the estimand addendum to ICH E9, I have been thinking more on the argument/claim by Aalen et al.

Suppose we have a randomised trial where we know (somehow) that the ratio of the hazards in the two groups is constant over time - that is the proportional hazards assumption holds. Suppose that the true hazard ratio comparing treatment 1 to treatment 0 is $\theta$. Then due to the relationship between the survival function and hazard, as is well known, it follows that:

$S_{1}(t) = S_{0}(t)^{\theta}$

where $S_{1}(t)$ and $S_{0}(t)$ are the marginal survival functions in the two treatment groups. This can then be re-expressed as:

$\theta = \frac{\log(S_{1}(t))}{\log(S_{0}(t))}$

Thus under proportional hazards, at any time, the hazard ratio is equal to the ratio of the log of the survival probabilities under the two treatments to this time. The survival functions can be expressed as the population means of $S_{1}(t)=E(1(T_{1}>t))$ where $T_{1}$ denotes the potential failure time under treatment 1 for a randomly selected individual from the population (and similarly for treatment 0). Thus the hazard ratio can be expressed as a functional of the potential outcomes under the two treatments.

Admittedly the interpretation is not that nice, but I would argue the hazard ratio nevertheless does have a causal interpretation (assuming proportional hazards holds). If at a particular time t the survival probabilities are both close to 1, the preceding expression can be approximated to show the hazard ratio is approximately the relative risk of failure under the two treatments, something which has also been written about before.

I am certainly not a causal inference expert, so I do not know what is required to claim that a quantity is a valid causal effect measure. But it would seem to me that if one views the hazard ratio as the previously described ratio of (logged) marginal probabilities, at least when proportional hazards holds, it is a valid causal effect measure. If any causal inference or survival analysis people can help, or point out that I am missing something obvious, please add a comment!

### 7 thoughts on “Causal interpretation of the hazard ratio from RCTs when proportional hazards holds”

1. Rhian Daniel

This is exactly Stijn Vansteelandt's opinion on this, I believe!

• Thank you Rhian!

2. Odd Aalen replied via email to the post:

Thanks, Jonathan, for comments on our paper. You define a hazard ratio based on comparing two potential survival times and assume the hazard ratio is constant. My response would be that this hazard ratio is not a treatment effect. There will almost always be unmeasured frailty effects. Typically this will mean that the surviving members of the group with the best treatment will get a different mix of frailties when time is running compared with the group with inferior treatment. So your hazard ratio will measure the direct effect of treatment for an individual plus an effect coming from a different mix in the two groups. Initial randomization does not change this issue.

So the hazard ratio is not a pure biological effect of treatment. It is also dependent on the heterogeneity of the treatment groups, and on the extent to which one can adjust for various factors. Typically, this will result in an underestimation of the real treatment effect.

This is closely related to the issue of survival collider bias which is an important concern in epidemiology and survival analysis. In a paper from our group we study this issue both from a frailty point of view and also using causal DAGs. We show how well known paradoxes like false protectivity and the obesity paradox can be understood in terms of frailty models. Reference: Stensrud, M. J., Valberg, M., Røysland, K., & Aalen, O. O. (2017). Exploring selection bias by causal frailty models: the magnitude matters. Epidemiology, 28(3), 379-386.

Still, of course, the Cox model is very important and useful. No method is perfect. What we are doing is to take a hard look at what the results mean, and then it is not so clear as people might think. This is important because the method is used in thousands of clinical trials, for instance, and so it has a tremendous impact. Therefore one should understand the shortcoming of the results as well. It should be noticed also that the assumption of proportional hazards is often not fulfilled. This is seen even in top clinical journals, without any comments being made on the lack of proportionality. So there is a world of statistics where we discuss the principles, and a world out there where they apply our methods to large issues without too much insight into the basis of the methods. We should be concerned about that one as well.