The Low Hanging Fruit of Low Volatility Backtests

Ed. Originally written in 2011. Published in 2013

“Look ma, I have skill!”
An idea that would have been regarded as heresy in the 1990s has gained acceptance and respectability: the idea that investors are not rewarded for risk, systematic or otherwise.

Thanks to the long-term performance dominance of low volatility assets over the past few decades, producing an apparently informative backtest has become easy. Just make sure your ranking process favors lower volatility assets and your strategy has a nice tailwind.

Prior to the Fama/French litany it had been just as easy to find attractive backtest results by favoring Value and Small Caps. (And eventually, Momentum). That’s why regressing against Fama/French factors has become standard in any kind of investment anomaly or strategy analysis hoping to be taken seriously.

While Small Caps performed better than Large Caps in the long run, they have endured cycles of being out of favor for entire decades. Value and Growth have seen pronounced cycles too, and by construction the stocks favored by Momentum today might be quite different from the stocks favored by Momentum a year ago or a year from now.

Further, stocks can conceivably cross over categories. Small Caps can become Large Caps; Value can become Growth, and today’s Momentum stock might not be one in a year.

Stickiness of Low Volatility Means Low Volatility Strategies Enjoy Low Turnover
But when you rank stocks by volatility, you’re ranking stocks by something more permanent than Momentum, Capitalization, or Valuation. Stocks ranked today as Volatile are likely to have been ranked as Volatile a year ago, and probably will be in another year (if they’re still around).

The table below displays the rank correlation of the constituents of the SPY ETF as of July 29, 2011, sorted by trailing 1-year standard deviation of daily log returns, to their ranks the following year. The following year they’re sorted both by standard deviation and downside risk.

Year-endCorrelation of this year’s Standard Deviation with
Next year’s
Correlation of this year’s Standard Deviation with
Next year’s Downside Risk
20090.840.84
20080.880.89
20070.720.76
20060.670.61
20050.870.85
20040.830.8
20030.770.77
20020.860.87
20010.770.82
20000.770.77
19990.830.78
19980.840.81
19970.830.79
19960.840.82
19950.870.85
19940.890.85
19930.860.79
19920.850.84
19910.860.82
19900.860.82
19890.770.76
19880.730.75
19870.710.82
19860.590.71
19850.690.74
19840.840.79

Source:PortfolioWizards

Notice two things: first, today’s volatile stocks tend to be next year’s; second, whether you rank by standard deviation or by downside risk doesn’t seem to make much difference.

While this data set is survivorship-biased, even in data sets that aren’t subject to survivorship bias this pattern remains. Which means if you want to manage a Low Volatility portfolio, chances are it won’t have to have high turnover.

Lake Woebegone Backtests
We’ve now endured a few decades when systematic risk has mostly been punished, which is one reason why you’re seeing so many “low volatility” and “minimum volatility” strategies appearing. All these back-tested results are above average! Whether the return to systematic risk turns out to have been a myth remains an open question to many, but when you’re exploring new investment products and strategies, keep in mind that we’ve just lived through an era when almost any strategy with lower risk than cap weighted indices will have outperformed those indices. Look for low volatility bias. Is low volatility an explicit part of the author’s strategy, or was it an unintended consequence that happened to be in favor?

Given the dominance of the superior performance of Low Volatility over the past several decades, superior backtest results you see today are likely to have another feature in addition to exposure to the Fama/French factors: a Low Volatility tilt.

Not that there’s anything wrong with Low Volatility – just make sure that whenever you’re evaluating the performance of a manager, an ETF, or a vendor, you’re aware of how much of a Low Vol effect there is in the data.