How to interpret hazard ratios

Survival analysis of time-to-event outcomes is very commonly performed using Cox’s famous proportional hazards model. The model estimates hazard ratios for the ‘effects’ of covariates. Starting with HernĂ¡n’s ‘Hazard of Hazard Ratios’ paper, hazard ratios have been investigated and critiqued from a causal inference perspective. Following this, Aalen wrote an important paper on whether’s analysis of a randomised trial using Cox’s model yields a causal effect, and there have been a number of more recent papers investigating the issue further. The criticisms and complexity arise due to the definition of the hazard and the presence of so-called frailty factors – unmeasured variables which influence when someone has the event of interest.

I had briefly blogged about this topic before, in particular about the causal interpretation of the hazard ratio when the proportional hazards assumption holds. I’m really pleased to have now (finally!) finished a short expositional article with colleagues Dominic Magirr and Tim Morris about how we think hazard ratios should be interpreted. Using a simple example we review the key issue arising from the effects of frailty, articulate how we think hazard ratios ought to be interpreted, and argue that it should be viewed as a causal quantity. A pre-print of our article is available now on arXiv.

Does the log rank test assume proportional hazards?

A student asked me recently whether the log rank test for time to event data assumes that the hazard ratio between the two groups is constant over time, as is assumed in Cox’s famous proportional hazards model. The BMJ ‘Statistics at square one’ Survival Analysis article for example says the test assumes:

That the risk of an event in one group relative to the other does not change with time. Thus if linoleic acid reduces the risk of death in patients with colorectal cancer, then this risk reduction does not change with time (the so called proportional hazards assumption ).

https://www.bmj.com/about-bmj/resources-readers/publications/statistics-square-one/12-survival-analysis

Personally I would not say the log rank test assumes proportional hazards. Under the null hypothesis that the (true) survival curves in the two groups are the same, or equivalently that the hazard functions are identical in the two groups, the log rank test would only wrongly reject 5% of the time. Of course under this null the hazards are proportional (indeed identical).

When this null does not hold, if the hazard ratio is constant over time, the log rank test is the most powerful test. When it is not constant over time it is not optimal in terms of power, but the non-constant hazard ratio does not invalidate the test per se. It just means that there may be alternative methods of analysis that might be preferable (see my recent PSI event slides here).

Non-proportional hazards – an introduction to their possible causes and interpretation

I had the pleasure today to participate in a PSI event on non-proportional hazards and applications in immuno-oncology. Non-proportional hazards are increasingly encountered in clinical trials, and there remain important questions about how to analyse trials when non-proportional hazards could occur. These include questions about how to formulate an appropriate hypothesis test for assessing evidence of benefit of the new treatment over the control and how to best quantify the treatment effect. The talks were really very interesting, and lead me to believe there is still lots of important work to be done in this area.

Here are the slides of my talk in case they are of interest, where I discuss some of the subtleties involved in interpreting changes in hazards and hazard ratios over time, which are complicated by the ubiquitous presence of frailty effects. I’ve posted on this topic previously quite a lot – for those interested see the related posts below.