SSRN-id2502613

•

UNB

0

Breno Brito

29/07/2020

E aí, curtiu este material?

Ajude a incentivar outros estudantes a melhorar o conteúdo

Gostou desse material? Compartilhe! 🧡

Finanças

9.871 Materiais compartilhados

Baixe o app para aproveitar ainda mais

Leia os materiais offline, sem usar a internet. Além de vários outros recursos!

Prévia do material em texto

Electronic copy available at: http://ssrn.com/abstract=2502613
Marcos López de Prado
Lawrence Berkeley National Laboratory
Computational Research Division
OPTIMAL TRADING RULES
WITHOUT BACKTESTING
Electronic copy available at: http://ssrn.com/abstract=2502613
Key Points
2
• Calibrating a trading rule using historical simulations leads to
backtest overfitting.
• Big Data and High Performance Computing make backtest
overfitting a major source of in-sample performance inflation.
• In this presentation we describe a procedure for determining the
optimal trading rule (OTR) without running alternative model
configurations through a backtest engine.
• We present empirical evidence of the existence of such optimal
solutions for the case of prices following a discrete Ornstein-
Uhlenbeck process, and show how they can be computed
numerically.
Electronic copy available at: http://ssrn.com/abstract=2502613
SECTION I
Trading Rules
What is a Trading Rule?
4
• Each investment strategy requires an implementation
tactic, often referred to as trading rules.
• While strategies can be very heterogeneous in nature,
tactics are relatively homogeneous.

• For example, a position may be:
– entered when the strategy’s signal reaches a certain value.
– exited when a profit-taking or stop-loss threshold is exceeded.
• These entry and exit rules rely on parameters that are
usually calibrated via historical simulations (backtests).
Trading rules provide the algorithm that
must be followed to enter and exit a strategy’s position.
Overfitting of a Trading Rule
5
• Bailey et al. [2014] have pointed out the
dangers of calibrating a strategy through
backtests.
– For a large enough number of trials, a
backtest can always be fit to any desired
performance over a given sample.
– Standard statistical techniques designed to
prevent regression overfitting, such as hold-
out, are inaccurate in the context of backtest
evaluation.
– Under memory effects, overfitting leads to
systematic losses, not noise.
“Pseudo-Mathematics and
Financial Charlatanism”
explains why most historical
simulations published in the
academic & practitioners’
financial journals are likely
to be false positives.
A very real problem…
6
• Lawrence Berkeley National Laboratory has
developed a website that finds “optimal
investment strategies” in random series.

• Because the series are random, there is no
signal in the data one could profit from.
• It is therefore obvious that such investment
strategies are purely the result of overfitting.
• The implication is that overfitting is hard to
avoid when calibrating a strategy or trading
rule on historical data.
http://datagrid.lbl.gov/backtest/index.php
In-Sample performance
Out-of-Sample performance
http://www.lbl.gov/about/
http://datagrid.lbl.gov/backtest/index.php
http://datagrid.lbl.gov/backtest/index.php
SECTION II
The Problem
The Problem (1/4)
8
• Suppose an investment strategy S that invests in 𝑖 = 1,… 𝐼
opportunities or bets.
• At each opportunity i, S takes a position of 𝑚𝑖 units of security
X, where 𝑚𝑖 ∈ −∞, ∞ .
• The transaction that entered such opportunity was priced at a
value 𝑚𝑖𝑃𝑖,0, where 𝑃𝑖,0 is the average price per unit at which
the 𝑚𝑖 securities were transacted.
• As other market participants transact security X, we can mark-
to-market (MtM) the value of that opportunity i after t
observed transactions as 𝑚𝑖𝑃𝑖,𝑡.
• This represents the value of opportunity i if it were liquidated
at the price observed in the market after t transactions.
The Problem (2/4)
9
• Accordingly, we can compute the MtM profit/loss of
opportunity i after t transactions as 𝜋𝑖,𝑡 = 𝑚𝑖 𝑃𝑖,𝑡 − 𝑃𝑖,0 .
• A standard trading rule provides the logic for exiting
opportunity i at 𝑡 = 𝑇𝑖. This occurs when one of two
conditions is verified:
 𝜋𝑖,𝑇𝑖 ≥ 𝜋𝑖, where 𝜋𝑖 > 0 is the profit-taking threshold for opportunity
i.
 𝜋𝑖,𝑇𝑖 ≤ 𝜋𝑖, where 𝜋𝑖 < 0 is the stop-loss threshold for opportunity i.
• Because 𝜋𝑖 < 𝜋𝑖, only one of the two exit conditions can
trigger the exit from opportunity i.
• Assuming that opportunity i can be exited at 𝑇𝑖, its final
profit/loss is 𝜋𝑖,𝑇𝑖.
The Problem (3/4)
10
• At the onset of each opportunity, the goal is to realize an
expected profit 𝐸0 𝜋𝑖,𝑇𝑖 = 𝑚𝑖 𝐸0 𝑃𝑖,𝑇𝑖 − 𝑃𝑖,0 , where
𝐸0 𝑃𝑖,𝑇𝑖 is the forecasted price and 𝑃𝑖,0 is the entry level of
opportunity i.

• One way to calibrate the trading rule is to overfit:
– Define a set of alternative values of R, Ω ≔ 𝑅 .
– Simulate historically (also called backtest) the performance of S under
alternative values of 𝑅 ∈ Ω.
– Select the optimal 𝑅∗.

DEFINITION 1 (Trading Rule): A trading rule for strategy S is
defined by the set of parameters 𝑅 ≔ 𝜋𝑖 , 𝜋𝑖 , 𝑖 = 1, … 𝐼.
The Problem (4/4)
11
• More formally, 𝑅∗ = arg max
𝑅∈Ω
𝑆𝑅𝑅, 𝑆𝑅𝑅 =
𝐸 𝜋𝑖,𝑇𝑖 𝑅
𝜎 𝜋𝑖,𝑇𝑖 𝑅
, where
𝐸 . and 𝜎 . are respectively the expected value and standard
deviation of 𝜋𝑖,𝑇𝑖, conditional on trading rule R, over 𝑖 = 1, … 𝐼.
This typically leads to overfitting.

• Intuitively, an optimal in-sample (IS) trading rule 𝑅∗ is overfit
when it is expected to underperform the median of alternative
trading rules 𝑅 ∈ Ω out-of-sample (OOS).
DEFINITION 2 (Overfit Trading Rule): 𝑅∗ is overfit if
𝐸
𝐸 𝜋𝑗,𝑇𝑗 𝑅
∗
𝜎 𝜋𝑗,𝑇𝑗 𝑅
∗
< 𝑀𝑒Ω 𝐸
𝐸 𝜋𝑗,𝑇𝑗 𝑅
𝜎 𝜋𝑗,𝑇𝑗 𝑅
, where 𝑗 = 𝐼 + 1,… 𝐽.
SECTION III
Our Framework
The Framework
13
• We are interested in providing an OTR for those scenarios
where overfitting would be most damaging, such as when 𝜋𝑖,𝑡
exhibits serial correlation.
• In particular, suppose a discrete Ornstein-Uhlenbeck (O-U)
process on prices
𝑃𝑖,𝑡 = 1 − 𝜑 𝐸0 𝑃𝑖,𝑇𝑖 + 𝜑𝑃𝑖,𝑡−1 + 𝜎𝜀𝑖,𝑡
such that the random shocks are IID distributed 𝜀𝑖,𝑡~𝑁 0,1 :
– The seed value for this process is 𝑃𝑖,0.
– The level targeted by opportunity i is 𝐸0 𝑃𝑖,𝑇𝑖 .
– 𝜑 determines the speed at which 𝑃𝑖,0 converges towards 𝐸0 𝑃𝑖,𝑇𝑖 .
𝜋𝑖,𝑡~𝑁 𝑚𝑖 1 − 𝜑 𝐸0 𝑃𝑖,𝑇𝑖 𝜑
𝑗
𝑡−1
𝑗=0
− 𝑃𝑖,0 , 𝑚𝑖
2𝜎2 𝜑2𝑗
𝑡−1
𝑗=0

Numerical Solution (1/2)
14
1. Estimate the input parameters 𝜎, 𝜑 from
𝑃𝑖,𝑡 = 𝐸0 𝑃𝑖,𝑇𝑖 + 𝜑 𝑃𝑖,𝑡−1 − 𝐸0 𝑃𝑖,𝑇𝑖 + 𝜉𝑡
2. Construct a mesh of stop-loss and profit-taking pairs, 𝜋𝑖 , 𝜋𝑖 .
3. Generate a large number of paths (e.g., 100,000) for 𝜋𝑖,𝑡
applying our estimates 𝜎 , 𝜑 .
4. Apply the 100,000 paths generated in Step 3 on each node of
the mesh 𝜋𝑖 , 𝜋𝑖 generated in Step 2.
 For each node, we apply the stop-loss and profit-taking logic, giving us
100,000 values of 𝜋𝑖,𝑇𝑖.
 For each node we compute the Sharpe ratio associated with that
trading rule.

Numerical Solution (2/2)
15
5. Compute the solution:
 We determine the pair 𝜋𝑖 , 𝜋𝑖 within the mesh of trading rules that
is optimal, given the input parameters 𝜎 , 𝜑 and the observed initial
conditions 𝑃𝑖,0, 𝐸0 𝑃𝑖,𝑇𝑖 .
 If strategy S provides a profit target 𝜋𝑖 for a particular opportunity i,
we can use that information in conjunction with the results in Step 4
to determine the optimal stop-loss, 𝜋𝑖.
 If the trader has a maximum stop-loss 𝜋𝑖 imposed by the fund’s
management, we can use that information in conjunction with the
results in Step 4 to determine the optimal profit taking 𝜋𝑖 within the
range of stop-losses 0, 𝜋𝑖 .

SECTION IV
Some Cases
Cases with Zero Long-Run Equilibrium
17
• From left to right, these figures show the Sharpe ratios for parameter combinations
𝜇, 𝜏, 𝜎 = 0,5,1 , 𝜇, 𝜏, 𝜎 = 0,50,1 , 𝜇, 𝜏, 𝜎 = 0,100,1 .
• For small half-life, performance is maximized in a narrow range of combinations of small
profit-taking with large stop-losses: The optimal trading ruleis to hold an inventory long
enough until a small profit arises, even at the expense of experiencing 5 or 7-fold losses.
• This is in fact what many market-makers do in practice, and is consistent with the
“asymmetric payoff dilemma” described in Easley et al. [2011].
• The worst possible trading rule in this setting would be to combine a short stop-loss with
large profit-taking threshold, a situation that market-makers avoid in practice.
Cases with Positive Long-Run Equilibrium
18
• From left to right, these figures show the Sharpe ratios for parameter combinations
𝜇, 𝜏, 𝜎 = 5,5,1 , 𝜇, 𝜏, 𝜎 = 5,50,1 , 𝜇, 𝜏, 𝜎 = 5,100,1 .
• Because positions tend to make money, the optimal profit-taking is higher than in the
previous cases.
• As we increase the half-life, the range of optimal profit-taking widens, while the range of
optimal stop-losses narrows, shaping the initial rectangular area closer to a square and
then a semi-circle.
• Again, a larger half-life brings the process closer to a random walk.
Cases with Negative Long-Run Equilibrium
19
• From left to right, these figures show the Sharpe ratios for parameter combinations
𝜇, 𝜏, 𝜎 = −5,5,1 , 𝜇, 𝜏, 𝜎 = −5,50,1 , 𝜇, 𝜏, 𝜎 = −5,100,1 .
• Results appear to be rotated complementaries of what we obtained in the Positive case
(like rotated photographic negatives).
• The reason is, that the profit in the previous figures translates into a loss in these
figures, and vice versa: One case is an image of the other, just as a gambler’s loss is the
house’s gain.
SECTION V
The Carr - de Prado Conjecture
A Conjecture Worth A Lot Of Money…
21
• While in this paper we do not derive the closed-form solution
to the optimal trading strategies problem, our experimental
results seem to support the following OTR conjecture:

• We believe that solving this conjecture would have substantial
economic value in a trading world where a few milliseconds
separate winners from losers.
“Given a financial instrument’s price characterized by a
discrete O-U process, there is a unique optimal trading rule
in terms of a combination of profit-taking and stop-loss
that maximizes the rule’s Sharpe ratio.”
http://arxiv.org/abs/1408.1159
SECTION VI
Conclusions
Conclusions
23
• We have shown how to determine experimentally the optimal
trading strategy associated with prices following a discrete
Ornstein-Uhlenbeck process.
• Because the derivation of such trading strategy is not the result
of a historical simulation, our procedure avoids the risks
associated with backtest overfitting.
• Depending on factors such as the frequency at which trading
takes place, the holding period, etc., the time it takes to run our
numerical procedure may be too lengthy.
• It would be beneficial to count with a closed-form solution that
computes the Sharpe ratio of every combination 𝜋𝑖 , 𝜋𝑖 , which
we could then optimize analytically to determine the optimal 𝑅𝑖
∗.
THANKS FOR YOUR ATTENTION!

24
SECTION VII
The stuff nobody reads
Bibliography
• Bailey, D. and M. López de Prado (2012): “The Sharpe Ratio Efficient Frontier,” Journal of
Risk, 15(2), pp. 3-44. Available at http://ssrn.com/abstract=1821643.
• Bailey, D. and M. López de Prado (2013): “Stop-Out Under Serial Correlation,” Working
paper. Available at http://ssrn.com/abstract=2201302.
• Bailey, D., J. Borwein, M. López de Prado and J. Zhu (2014): “Pseudo-Mathematics and
Financial Charlatanism: The Effects of Backtest Overfitting on Out-Of-Sample
Performance,” Notices of the American Mathematical Society, 61(5), pp. 458-471.
Available at http://ssrn.com/abstract=2308659.
• Bailey, D., J. Borwein, M. López de Prado and J. Zhu (2013): “The Probability of Backtest
Overfitting.” Working paper. Available at http://ssrn.com/abstract=2326253.
• Bertram, W. (2009): “Analytic Solutions for Optimal Statistical Arbitrage Trading.”
Working paper. Available at http://ssrn.com/abstract=1505073.
• Carr, P. and M. López de Prado (2014): “Determining Optimal Trading Rules Without
Backtesting”, Working Paper. Available at http://arxiv.org/abs/1408.1159
• Easley, D., M. Lopez de Prado and M. O’Hara (2011): “The Exchange of Flow-Toxicity.”
Journal of Trading, Vol. 6(2), pp. 8-13, Spring. Available at
http://ssrn.com/abstract=1748633.
26
http://ssrn.com/abstract=1821643
http://ssrn.com/abstract=1821643
http://ssrn.com/abstract=2201302
http://ssrn.com/abstract=2201302
http://ssrn.com/abstract=2308659
http://ssrn.com/abstract=2308659
http://ssrn.com/abstract=2326253
http://ssrn.com/abstract=2326253
http://ssrn.com/abstract=2326253
http://ssrn.com/abstract=1505073
http://ssrn.com/abstract=1505073
http://arxiv.org/abs/1408.1159
http://arxiv.org/abs/1408.1159
http://ssrn.com/abstract=1748633
http://ssrn.com/abstract=1748633
Bio
Marcos López de Prado is Senior Managing Director at Guggenheim Partners. He is also a Research
Affiliate at Lawrence Berkeley National Laboratory's Computational Research Division (U.S. Department
of Energy’s Office of Science).

Before that, Marcos was Head of Quantitative Trading & Research at Hess Energy Trading Company (the
trading arm of Hess Corporation, a Fortune 100 company) and Head of Global Quantitative Research at
Tudor Investment Corporation. In addition to his 15+ years of trading and investment management
experience at some of the largest corporations, he has received several academic appointments,
including Postdoctoral Research Fellow of RCC at Harvard University and Visiting Scholar at Cornell
University. Marcos earned a Ph.D. in Financial Economics (2003), a second Ph.D. in Mathematical
Finance (2011) from Complutense University, is a recipient of the National Award for Excellence in
Academic Performance by the Government of Spain (National Valedictorian, 1998) among other awards,
and was admitted into American Mensa with a perfect test score.

Marcos is the co-inventor of four international patent applications on High Frequency Trading. He has
collaborated with ~30 leading academics, resulting in some of the most read papers in Finance (SSRN),
three textbooks, publications in the top Mathematical Finance journals, etc. Marcos has an Erdös #2 and
an Einstein #4 according to the American Mathematical Society.
27
Disclaimer
• The views expressed in this document are the authors’
and do not necessarily reflect those of the
organizations he is affiliated with.
• No investment decision or particular course of action is
recommended by this presentation.
• All Rights Reserved.
28
Notice:

The research contained in this presentation is the result of a
continuing collaboration with

Dr. Peter P. Carr
Morgan Stanley & Courant Institute (NYU)

The full paper is available at:
http://arxiv.org/abs/1408.1159

For additional details, please visit:
http://ssrn.com/author=434076
www.QuantResearch.info
http://arxiv.org/abs/1408.1159
http://arxiv.org/abs/1408.1159
http://ssrn.com/author=434076
http://www.quantresearch.info/