SlideShare une entreprise Scribd logo
1  sur  44
Télécharger pour lire hors ligne
active learning in trading algorithms
J.P. Morgan
Linear Quantitative Research | David Fellah
November 10, 2016
QuantCon - Singapore, 2016
0. | Introduction
Section 1: Market response
1. Review of the algorithmic trading problem: forecasting and feedback mechanisms
2. Impact modeling
2. Empirical examples - real world trading is a complex, dynamical problem
Section 2: Machine learning in forecasting
1. Forecasting: Random Forests for high-frequency price direction prediction
2. Empirical results
Section 3: Reinforcement Learning
1. Putting it all together: Reinforcement Learning
2. Problem setup
3. Learning
Please note:
This material is provided for information only and is not intended as a recommendation or an offer or solicitation
for the purchase or sale of any security or financial instrument.
1
1. | Microstructure review
Market response
1. 60-80% of trading occurs algorithmically - most trading occurs between machines.
2. Auto-correlation is a measure of concordance with previous observations: how likely do we observe another buyer-initiated trade?
3. Factors that contribute to the autocorrelation of signed trade-flow Parent order splitting, market fragmentation.
4. Joint information sets: Left plot - information due to trading activity. Right plot - information due to limit order placement activity.
1 10 100
lag
0.00010
0.00100
0.01000
0.10000
1.00000
acf
eMini-S&P
Gold
FTSE
-1500 -1000 -500 0 500 1000 1500
x
-0.010
-0.005
0.000
0.005
0.010
f(x)
Observations
Prediction
95% µ conf int
2
1. | Microstructure review
Limit orders cluster at varying price levels - reveals latent supply.
Buyer/seller-initiated orders generate trade flow - reveals intent.
Price patterns reveal competing agents - typical of algorithms exhibiting passive bias.
Cancellation patterns reveal the transient nature of liquidity.
Impact is dynamic - a model is needed for training, intra-trade planning, post-trade attribution.
3
1. | Microstructure review
A transient model of market impact provides a consistent transform between orders executed at the "micro"-scale and the aggregate,
"parent" or "meta" order trading costs. Practically speaking, this enables the formulation of a class or family of models that operate at the
threshold between signals and scheduling, depending on the amount of information available at the point of decision.
Micro-formulation is suitable for trading
decisions at the micro-scale (order placement).
f models the immediate market response,
following a transaction.
G models the subsequent decay or memory
effect.
pt = p0 × (1 +
t
0
impulse
f(.)
decay
G(t − s)− + (.)ds)
(1)
Description f(.) G(.) (.)
MI.1 (t − s) 0.5spread
MI.2 (t − s)1− spread
HFMI s (t − s)1− spreads
1 2 3 4 5 T
Time
0
5
10
15
20
PriceImpact
0 100 200 300 400 500 600 700 800
Market Trade Count
0.0
0.5
1.0
1.5
2.0
2.5
3.0
3.5
4.0
PriceImpact(ticks)
5%
10%
15%
20%
25%
30%
35%
4
1. | Microstructure review
Low-frequency market impact
A transient model of market impact provides a consistent transform between orders executed at the "micro"-scale and the aggregate,
"parent" or "meta" order trading costs. Practically speaking, this enables the formulation of a class or family of models that operate at the
threshold between signals and scheduling, depending on the amount of information available at the point of decision.
Macro-formulation is suitable for
scheduling/allocation decisions.
pt = p0×(1+
t
0
impulse
s
decay
(t − s)− ds)+ t+ ×
(2)
where
: Scale parameter
: Volatility parameter
: Trade Rate parameter
: Impact Decay parameter
: Spread parameter
t : Noise term, t
H0∼ IID(0, )
Region
NA 488 0.89 .663 1-0.325 0.118
APAC 389 0.857 0.573 1-0.393 0.102
EMEA 349 0.734 0.65 1-0.39 0.066
06:56:40 08:20:00 09:43:20 11:06:40 12:30:00 13:53:20 15:16:40 16:40:00
Time
150
100
50
0
50
100
Returns(centredonarrivaltime)
Returns/Impact Profile Example
0
2
4
6
8
10
bps(impact)
Est Impact
06:56:40 08:20:00 09:43:20 11:06:40 12:30:00 13:53:20 15:16:40 16:40:00
Time
5000
4000
3000
2000
1000
0
1000
2000
Filled
Execution Profile Example
Lit
Dark
5
1. | Microstructure review
Impact accrual over multiple submissions
1. Consider a single parent order, with several lit and dark executions occurring sequentially.
2. Impact decays as a power-law over volume time.
3. The decay of impact is accounted in optimal scheduling.
06:56:40 08:20:00 09:43:20 11:06:40 12:30:00 13:53:20 15:16:40 16:40:00
Time
15000
10000
5000
0
5000
10000
Filled
Execution Profile 2014.10.17
Lit
Dark
06:56:40 08:20:00 09:43:20 11:06:40 12:30:00 13:53:20 15:16:40 16:40:00
Time
4.90
4.95
5.00
5.05
5.10
5.15
5.20
Price
Returns/Impact Profile 2014.10.17
2
0
2
4
6
8
10
12
14
bps(impact)
Buy Est Impact
6
1. | Microstructure review
Basket Impact Profiles. Slippage is struck from arrival. The blue series represents the buy-side and sells are red; each group is a
dollar-weighted basket. Prior to order arrival sells exceed buys versus arrival, and the area is shaded green. As orders arrival and trading
begins, impact moves sells lower and buys higher. The impact estimate is on the second (right) y-axis. For example, buys move by +15bps
and the estimated impact is roughly 10. The darker red/blue lines are the accumulated impact curves and subsequent decay as orders
complete. Completion time would roughly correspond to the peak of impact. After this point, in this sample - buys revert and sells continue
on their path downwards. (b) represents the total dollars bought and sold in lit and dark venues. Generally they are correlated but it is a
matter of further research to measure their indiviudal and joint contributions to cost.
11:40:00 12:13:20 12:46:40 13:20:00 13:53:20 14:26:40 15:00:00 15:33:20
Time
7
6
5
4
3
2
1
0
1
2
bps(returns)
Returns/Impact Profile
0.2
0.4
0.6
0.8
1.0
bps(impact)
Buy Est Impact
Sell Est Impact
(a)
11:40:00 12:13:20 12:46:40 13:20:00 13:53:20 14:26:40 15:00:00 15:33:20
Time
15
10
5
0
5
10
bps(returns)
Returns/Impact Profile
0.2
0.4
0.6
0.8
1.0
1.2
1.4
1.6
bps(impact)
Buy Est Impact
Sell Est Impact
(b)
11:40:00 12:13:20 12:46:40 13:20:00 13:53:20 14:26:40 15:00:00 15:33:20
Time
10
8
6
4
2
0
2
4
6
bps(returns)
Returns/Impact Profile
0.0
0.5
1.0
1.5
2.0
2.5
bps(impact)
Buy Est Impact
Sell Est Impact
(c)
11:40:00 12:13:20 12:46:40 13:20:00 13:53:20 14:26:40 15:00:00 15:33:20
Time
0
500000
1000000
1500000
2000000
2500000
3000000
3500000
4000000
$filled
Execution Profile
11:40:00 12:13:20 12:46:40 13:20:00 13:53:20 14:26:40 15:00:00 15:33:20
Time
0
500000
1000000
1500000
2000000
2500000
$filled
Execution Profile
11:40:00 12:13:20 12:46:40 13:20:00 13:53:20 14:26:40 15:00:00 15:33:20
Time
0
500000
1000000
1500000
2000000
2500000
$filled
Execution Profile
7
1. | Microstructure review
1. Model is used for simulating market response due
to both passive and aggressive orders.
2. Midpoint price reconstruction through order/trade
flows provide indicative fair value.
3. Changes in book prices reflect tick/spread
constraints.
6 4 2 0 2 4 6
φt
0.010
0.005
0.000
0.005
rt
pearsonr = 0.68; p = 5.3e-08
1500 1000 500 0 500 1000 1500
ψt
0.005
0.000
0.005
rt
pearsonr = 0.27; p = 0.053
8
1. | The Trading Problem
VWAP slippage minimisation
VWAPslippage =
m
j=1
j (Mej − Pej ) +
m
j=1
( j (
lj
k=1 Pm
j
k
j
k
lj
k=1
j
k
− Mej ) +
m
j=1
(
lj
k=1
j
k − j )
lj
k=1 Pm
j
k
j
k
lj
k=1
j
k
)) (3)
9
1. | The Trading Problem
VWAP slippage minimisation
different trading style required depending on rate of trading targeted
10
1. | The Trading Problem
Optimal execution is path-dependent
1. In this simulation, the trading algorithm is measuring the impact of its activities jointly with market activity, considering both
passive and aggressive placements.
2. Top series indicates price time series and market trades; middle chart charts net traded volume (due to executions) and net order
flow (due to quote changes).
3. Impact is a path-dependent process - it depends on individual transactions considering market state at the time of execution, and
how latent terms propagate over volume-time.
4. The optimisation problem is to minimise the difference between the series in the bottom chart.
5. In this context, rewards that can be attributed to individual transactions are only known at the end of the order - i.e. rewards are
delayed.
0 50 100 150 200 250 300 350 400 450 500 550 600 650 700 750 800 850 900 9500.5
1.0
1.5
2.0
2.5
3.0
3.5
4.0
price
+6.768e3
bid price
ask price
weighted mid point
market trades
0 50 100 150 200 250 300 350 400 450 500 550 600 650 700 750 800 850 900 950
-1.0
-0.5
0.0
0.5
1.0
Signal(normalized)
net traded volume
net order flow
0 50 100 150 200 250 300 350 400 450 500 550 600 650 700 750 800 850 900 950
time (quotes)
0
1
2
3
4
5
6
estimatedpriceimpact
estimated price
estimated price (+2.5%)
11
2. | Machine learning in forecasting
Section 2: Machine learning in forecasting
1. Review: Understanding model performance through bias-variance decomposition
2. Forecasting: Random Forests for high-frequency price direction prediction
3. Performance measures of classifiers
4. Trading with signals
12
2. | ML Forecasting - review
Bias-variance decomposition and overfitting
Bias reflects the error between the forecast, ˆf(x) and observed, target variable f(x) - how far is the prediction from reality?
Variance is the variability of the model for a given data point - how does the prediction vary over examples?
Noise is the irriducible error.
The objective of most machine learning algorithms is to minimise loss with respect to some penalty function subject to a balance
between bias and variance.
Theorem. For the squared error loss, the bias-variance decomposition of
the expected generalization error at X = x is
Err(x) = Bias2 + Variance + Noise (4)
where
Bias2 : á[ ˆf(x)] − f(x)
2
Variance : á ˆf(x) − á[ ˆf(x)]
2
Noise : 2
e
13
2. | ML Forecasting - review
Random Forests
Error: (Residual error: Lowest achievable error, independent of L.)
Bias: Decision trees usually have low bias.
Variance:They often suffer from high variance - i.e. don’t generalise well due to overfitting.
𝒙
𝑝 𝜑1
(𝑌 = 𝑐|𝑋 = 𝒙)
𝜑1 𝜑 𝑀
…
𝑝 𝜑 𝑚
(𝑌 = 𝑐|𝑋 = 𝒙)
∑
𝑝 𝜓(𝑌 = 𝑐|𝑋 = 𝒙)
14
2. | ML Forecasting - Random Forest Classifiers
Directional classifiers are designed to predict large positive price moves, large negative price moves and flat price moves. The correctly
classified instances for large positive price moves are shown in blue. The instances that were incorrectly classified as large positive price
moves are shown in red (so we predicted the price would move up, but it did not). The green points are the instances that we predicted as
being either flat or negative price categories.
1. The x-axis is the same signal as shown in the
regression plots, being limit order flow over the
past 100 ticks.
2. The presence of blue instances with negative order
flow indicates that other information can be used
to predict price moves. The prediction probability
is seen to increase with higher values of order flow.
3. The larger, darker points indicate a higher
prediction probability. We are more certain of the
result with a higher the prediction probability,
which is additional, useful information.
-3 -2 -1 0 1 2 3
order flow
-1.0
-0.5
0.0
0.5
1.0
priceresponse(ticks)
Classification Random Forest :: Class 1 :: Precision=60%, Recall=64%
Ensemble classifiers combine multiple signals, in this case around 15, that are weighted to give the best prediction power.
15
2. | ML Forecasting - Measuring Predictive Power
Ensemble learning techniques are used to combine multiple, highly correlated signals into a composite vote that performs better than
the individual voters.
Branches on a tree can capture performant signal compositions in different regimes, such as momentum, low-spread, etc.
Every transaction across the entire market is fed through a network of signals which are multi-tick horizon and also are accompanied
with probabilities.
During training, feature imbalances were taken into account through re-sampling of the data.
Feature importance map for the random decision tree, by signal name.
sellVolumeAtAskPriceOrLower10
BIDSIZE
sellCountAtAskPriceOrLower10
stVolImb10
sellTradedVol10
prevAvgBidPrice10
sellCountAtAskPriceOrLower20
ordFlowSum10
ordFlowSum20
ordFlowAvg20
stVolImb40
stVolImbNorm10
spread10
prevAvgAskPrice40
ordFlowAvg10
ordAskFlowSum10
prevAvgWMPPPrice40
prevAvgBidPrice20
stVolImb20
sellTradedVol20
stVolImbNorm20
ordAskFlowSum20
prevAvgWMPPPrice10
prevWMPPPrice10
prevBidPrice10
ordBidFlowSum10
ordFlowAvg40
spread20
buyCountAtAskPriceOrHigher10
minDBidPrice10
buyVolumeAtAskPriceOrHigher10
prevVWAPWMPPPrice40
obImb40
ordBidFlowSum20
buyTradedVol10
buyCountAtAskPriceOrHigher20
sellVolumeAtAskPriceOrLower20
prevAvgAskPrice20
buyTradedVol20
prevVWAPWMPPPrice20
spread40
buyVolumeAtAskPriceOrHigher20
minDBidPrice20
ordAskFlowSum40
prevWMPPPrice20
prevAvgBidPrice40
prevAvgWMPPPrice20
maxDAskPrice40
ordBidFlowSum40
stVolImbNorm40
prevBidPrice40
prevAskPrice40
obImb20
trdVol20
obImbNorm20
prevVWAPWMPPPrice10
trdVol10
prevBidPrice20
prevWMPPPrice40
ordFlowSum40
buyTradedVol40
prevAskPrice20
ASKSIZE
prevAvgAskPrice10
sellVolumeAtAskPriceOrLower40
obImbNorm40
obImb10
minDAskPrice40
sellTradedVol40
sellCountAtAskPriceOrLower40
trdVol40
prevAskPrice10
buyVolumeAtAskPriceOrHigher40
buyCountAtAskPriceOrHigher40
maxDBidPrice10
maxDBidPrice20
maxDAskPrice20
minDBidPrice40
maxDBidPrice40
obImbNorm10
minDAskPrice20
maxDAskPrice10
minDAskPrice10
0.00
0.02
0.04
0.06
0.08
0.10
0.12 Feature importances
16
2. | Measuring Predictive Power
Prediction Accuracy
Horizon (ticks) BAYGn.DE CCH.L DAIGn.DE ISP.MI SAPG.DE SASY.PA TOTF.PA VOD.L
10 0.797 0.777 0.662 0.877 0.712 0.773 0.748 0.791
20 0.749 0.731 0.638 0.832 0.678 0.728 0.708 0.739
50 0.691 0.679 0.620 0.763 0.642 0.671 0.656 0.669
100 0.662 0.649 0.612 0.714 0.623 0.645 0.639 0.648
200 0.644 0.625 0.603 0.675 0.612 0.631 0.625 0.631
800 1000 1200 1400 1600 1800 2000
time (ticks)
241.6
241.8
242.0
242.2
242.4
242.6
242.8
243.0
price
VOD.L :: Bid and Ask Short-term Price Prediction 07:09:46 to 07:15:44
bid price
ask price
bid price predict down
ask price predict up
0 5 10 15 20 25 30 35 40 45
time (ticks)
0.00
0.02
0.04
0.06
0.08
price
+9.828e1 SASY.PA :: Short-term Bid Price Prediction 07:00:28 to 07:00:33
bid price
ask price
Bid Price Prediction
Market Sell
Market Buy
17
2. | ML Forecasting - Trading with Signals
The simulator is then used to test if signal classification is useful over the time frames of the limit order submissions.
1. The upper plot shows an example of Crude Oil placement and filling using parameters similar to Production. Note the micro-structure
changes over the timescales relevant to limit order placement.
2. The middle plot shows the prediction probabilities into up, down or flat price moves.
3. The lower plot shows periods when we could apply these predictions. The blue circles are periods where we predict the price to rise.
The red circles are when we predict the price to fall.
0 200 400 600 8000.01
0.02
0.03
0.04
0.05
0.06
0.07
0.08
0.09
price
+9.239e1
bid
ask
buy
sell
submit
fill
cancel
0 200 400 600 8000.0
0.2
0.4
0.6
0.8
1.0
predictionprobability
Predict Down
Predict Flat
Predict Up
0 200 400 600 800
tick time
0.01
0.02
0.03
0.04
0.05
0.06
0.07
0.08
0.09
price
+9.239e1
bid
ask
weighted mpp
Predict Up
Predict Down
18
2. | ML Forecasting - Trading with Signals
Impact reduces signal effectiveness
1. To test our assumptions, we superimposed signals over child order submissions and studied the execution/slippage profile.
2. The first thing we noticed is a bias in the signal distribution themselves, due to the fact that any given number of orders would
survive longer if the signal was trending away (not shown).
3. This observation has important implications for simulation: behaviour bias needs to be accounted for by calibration - calibrate fill
probabilities using historical transaction data.
4. The black line is the unconditional move predicted as a function of probability (without the systematic presence of our child orders).
5. The lines above that represent order types (1-5) in terms of urgency; the difference between these conditional observations reflect
the impact of child orders.
6. Clearly it is important to consider the joint effect of candidate child orders in optimal order placement, insofar as how they effect
signals the algorithm uses.
7. The next section discusses how signals, impact on signals (impact), and opportunity costs are combined into an optimisation
framework.
0.2 0.3 0.4 0.5 0.6 0.7 0.8
prediction probability of downward mpp move
-0.10
-0.05
0.00
0.05
0.10
0.15
avgmidpointmove
1
2
3
4
5
19
2. | Order Queue Dynamics
The plots below show the distribution of the bootstrapped mean for passively filled quantity. The upper plots show the filled quantity on
the first price level (so near touch), the lower plots the secondary price level (so one tick away from the near touch). Each distribution
contains 10,000 bootstrap samples.
1. The grey distribution is all samples.
2. The blue and orange distributions (named class A
and B respectively) are the same points classified
ex-post, using the prediction classification used in
the previous slides. Class A is when we predict the
price to remain flat or move away, Class B is when
we predict the price to move in our favour.
3. There is a clear indication that we can predict a
higher rate of passive filling based on the signals.
2.2 2.4 2.6 2.8 3.0 3.2 3.4
bootstrapped mean filled qty PRICE LEVEL 1 (contracts)
0
500
1000
1500
2000
numberofsamples
Mean Passive Filled Qty (Simulations) :: Conditioned on Signals :: Crude Oil
Class A & B
Class A
Class B
2.5 3.0 3.5 4.0
bootstrapped mean filled qty PRICE LEVEL 2 (contracts)
0
100
200
300
400
500
600
700
800
numberofsamples
Class A & B
Class A
Class B
20
2. | Alpha Decay & System Latencies
Each signal is characterized by a CDF that provides the fraction of observations that fall under a given prediction horizon. This can be
used to tune/understand the effectiveness of the system overall as latencies are obtained for each leg of the process, from signal
origination to execution. Furthermore, in real-time - each signal carries this information so that stale conditions are handled properly.
0 500 1000 1500 2000
prediction horizon (ms)
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1.0
fractionofinstances
CDF :: Prediction Horizon Time
21
3. | Reinforcement learning
Deep learning and reinforcement learning used in various fields
22
3. | Reinforcement Learning
So how do we put the pieces together? How do we combine signals, market impact, order placement, scheduling, simulation to create a
coherent strategy that follows an optimal solution hierarchically from scheduling to order placement? The aim is to choose optimal prices,
sizes and durations for all orders based on both the prevailing and projected market conditions. The solution outlined here solves for
these issues jointly as part of a stochastic optimal control problem.
The learned value function, Q(s,a) = T
t s,a is
parametrised by t and the feature vector s,a at
time t.
t+1 = t+ (rt+1+ Qt(st+1,at+1)−Qt(st,at))
(5)
where
t+1 : updated parameter vector
t : parameter vector at time t
: feature vector
s : is the current state
a : the action space
rt+1 : reward for action at in state st
Qt(st,at) : value function
Agent
Environment
Action atNew state st+1 Reward rt+1
23
3. | Reinforcement Learning
Supervised Learning: learning by example
Regression
Chess game
State
Response
For a chess game:
xi = state of the board at time i.
yi = f(xi) optimal move when in state xi.
But how can we obtain samples yi = f(xi) ?
24
3. | Reinforcement Learning
Reinforcement Learning: value function
The value function V is the memory/brain of the agent:
Instead of learning directly the optimal action we learn the value
of a pair (xi,ai).
V maps every state/action to a value: xi,ai → vi ∈ .
The value of (xi,ai) is defined as the sum of expected future
rewards when starting from xi and taking first action ai.
V(xi,ai) = á T
t=i Rt|ai
Every time we visit a state, take on action and receive a reward we
update the function V:
1. Record (xi → ai → ri → xi+1).
2. Update V
The central question: how do we perform the update? 25
3. | Reinforcement Learning
Reinforcement Learning: value function
After having observed (xi,ai,ri,xi+1) we obtain:
V(xi,ai) = á


T
t=i
Rt|ai

 = ri + á


T
t=i+1
Rt

 =: yi
yi defined above is our sample!
But what about the expectation á part in yi?
Start with a random initialisation for the mapping V and improve
gradually.
26
3. | Reinforcement Learning
Definition (State-value function)
The function
V (s) = E {Rt|st = s} = E



∞
k=0
k
rt+k+1 st = s



(6)
is called the state-value function for policy . E =expected value
assuming the agent follows the policy with an arbitrary time step t.
Definition (Optimal state-value function)
The function
V∗
(s) = maxV (s)∀s ∈ S (7)
is called optimal state-value function.
27
3. | Reinforcement Learning
Definition (Action-value function)
The function
Q (s,a) = E {Rt|st = s,at = a} = E



∞
k=0
k
rt+k+1 st = s,at = a



(8)
is called the action-value function for policy .
Definition (Optimal action-value function)
The function
Q∗
(s,a) = maxQ (s,a)∀s ∈ S and a ∈ A(s) (9)
is called optimal action-value function.
28
3. | The Q-Learning Algorithm
Algorithm 1 Q-Learning
1: 1. Initialisation:
Load a simulation environment: price series, fill probability;
Initialise the value function V0 and set the parameters: , ;
2: 2. Optimisation:
3: for episode = 1,2,3... do
4: for t = 1,2,3...T do
5: Observe current state st;
6: Take an action at(Qt,st, );
7: Observe new state st+1;
8: Receive reward rt(st,at,st+1);
9: Update value function using rt and current estimate Qt:
a) compute yt = rt + maxa Qt(st+1,a)
b) update the function Qt with target yt
10: end for
11: end for 29
3. | Reinforcement learning
Problem setup
Objective : VWAP minimisation
Optimising on deterministic part (spread cost) which will mainly affect mean of slippage distribution. Profile
slippage is reduced by ensuring orders are spread enough throughout the episode (e.g. risk of being filled too
quickly entirely passively for small quantities). Rewards are made of 2 parts:
Immediate rewards (for each iteration):
increment of spread cost (on executions we had since last iteration)
Terminal rewards (end of episode):
completion penalty
order duration penalty
market impact penalty
30
3. | Reinforcement learning
State space
States which are common to every placement:
target historical pov rate
%progress (= %timeElapsed - %inventoryTraded)
%timeElapsed
alpha profile
For each placement/action evaluated, we have those specific states:
expected spread cost
fill probability
size placed (as % of episode quantity)
31
3. | Reinforcement learning
Fill probabilities
32
3. | Reinforcement learning
Placement evaluations
33
3. | Reinforcement learning
Neural network architecture
34
3. | Reinforcement Learning
Learning price momentum - arrival objective
q q
q
q
q
q
q
q q
q
q
q
q
q
q
q
q q
q
q q q
q
q
q
q
q
q
q
q
q
q
q
q q q
q q
q
q
q
q
q
q
q
q
q
q q
q
q q
q
q
q
q
q
q
q
q
q
q q
q
q q
q
q
q
q
q q
q
q q
q
q
q q
q
q
q q
q
q
q
q q
q
q
q
q
q
q q q
q q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q q
q
q
q
q
q q
q
q
q
q q
q
q
q
q
q
q
q
q
q q
q
q
q q
q q q
q q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q q
q
q
q q q
q
q
q q
q
q
q
q
q q
q
q
q
q q
q
q q q
q
q
q q
q
q
q q
q
q
q
q
q
q
q q q q
q
q
q
q
q
q
q q
q
q
q
q
q
q
q
q q
q
q
q
q
q
q
q
q
q
q
q q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q q
q
q
q
q
q
q
q q
q
q
q
q
q q
q
q
q
q
q
q
q
q
q
q q
q
q
q
q
q
q
q
q
q
q
q
q
q q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q q
q q
q
q
q
q
q q
q
q
q
q
q
q
q
q
q q
q
q q
q
q
q q q
q
q q
q
q
q
q
q
q
q
q
q q
q
q
q
q
q
q
q
q
q
q
q
q q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q q
q
q
q
q q
q
q
q
q
q
q q q
q
q
q q q
q
q
q
q
q
q
q
q
q q
q q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q q
q q
q
q
q
q
q
q q
q q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q q
q
q
q
q
q
q
q
q
q q
q
q
q
q
q
q
q
q
q
q
q q
q
q
q
q q
q
q
q
q
q q
q
q
q
q
q
q
q
q
q
q
q
q
q q
q q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q q q
q q q
q
q
q
q
q
q
q
q
q
q
q
q
q q q
q
q
q
q
q
q q
q
q
q
q
q
q q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q q
q
q
q
q q
q
q q q q
q
q
q
q
q
q
q
q q q
q
q
q
q
q
q q
q
q
q
q
q q
q
q
q
q q
q
q
q
q
q
q
q
q q
q
q
q
q
q q
q
q
q
q
q
q
q
q
q
q
q
q
q q
q
q
q
q
q
q
q
q
q
q q
q
q
q
q
q
q
q
q
q
q q
q q
q
q
q
q
q
q
q q q q
q
q q
q
q
q
q
q
q
q
q
q
q q q
q q q q
q
q
q
q
q
q
q
q q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q q
q q
q
q
q
q
q
q
q
q
q q
q q
q
q
q
q
q q
q
q q
q q
q
q
q
q
q
q
q
q q
q q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q q q
q
q
q
q
q
q
q
q q q q
q
q
q
q
q q
q
q
q
q
q q
q
q
q q
q
q
q
q
q q
q
q
q
q
q
q
q
q
q
q q
q
q
q
q
q
q
q q
q
q
q
q
q
q
q
q
q
q
q
q q
q q
q
q
q
q
q q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q q
q
q
q
q
q
q q
q q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q q q
q q
q q
q
q
q q
q
q q
q
q
q q q
q
q
q q
q
q q
q
q q
q
q
q q
q
q
q
q
q
q
q
q
q
q
q q
q
q
q
q
q
q
q
q
q
q
q
q
q q q
q
q
q
q
q
q
q
q q
q
q q q
q
q
q
q
q
q q
q
q
q
q
q
q
q q
q
q
q q
q
q
q
q
q
q
q
q
q
q q
q
q q
q q
q
q
q q q
q
q
q
q
q
q
q
q
q q
q
q
q
q q
q q q q
q
q
q
q
q q
q
q q
q
q
q
q
q
q
q
q
q
q
q q
q
q
q
q q
q
q
q q
q
q
qq
q
q
q
q
q
q
q
q q
q
q
q
q
q
q
q
q
q q
q
q
q
q
q
q
q
q
q
q
q
q q q
q
q
q
q q
q
q
q
q
q
q
q
q
q
q
q
q
q q
q
q
q
q
q
q
q
q q
q
q
q
q
q
q
q
q q
q
−0.25
0.00
0.25
5 10 15
iteration
pricereturn
25
50
75
100
episode
q
q
q
q
q
q
q
q
q q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q q
q
q
q
q q
q
q
q
q
q
q
q
q
q
q
q
q
q q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q q
q
q
q
q q
q
q
q
q
q
q
q
q
q
q q
q
q
q
q
q
q
q
q q q
q
q
q
q q q
q
q q
q
q
q
q
q
q q q
q
q
q
q
q
q
q
q q
q
q
q q
q
q
q
q
q
q
q
q
q
q q
q
q
q
q
q
q
q
q
q
q
q q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q q
q q q q
q
q q
q q
q
q q
q
q
q
q
q
q
q
q
q
q
q
q q
q
q
q
q
q
q
q
q
q
q
q
q q
q
q
q
q
q
q
q
q
q
q q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q q
q
q
q
q
q
q q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q q
q
q
q
q
q
q
q
q
q
q
q
q q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q q
q q
q
q
q
q
q
q
q q
q q
q
q
q
q
q
q
q
q
q
q
q
q
q q
q
q
q
q q
q
q
q
q
q
q
q
q
q
q
q
q
q
q q
q
q
q
q
q
q
q
q
q
q q
q
q
q
q
q
q
q
q
q
q
q
q
q q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q q
q
q
q
q q
q
q
q
q
q
q
q
q
q
q
q
q q
q
q
q
q q
q
q
q
q q
q
q
q q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q q
q
q q
q q
q
q q
q
q
q
q
q
q
q
q
q
q
q q
q q
q
q
q
q
q
q q
q
q
q
q
q
q q
q
q
q
q
q
q
q
q
q q
q
q
q
q
q
q
q
q
q
q q
q
q
q
q
q
q
q
q q
q
q
q
q
q
q
q q
q
q
q
q
q
q
q
q
q q q
q
q
q
q
q
q q
q
q
q
q
q
q
q
q
q q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q q
q
q
q
q
q
q
q
q
q
q q
q
q
q
q
q
q
q
q
q q
q
q
q
q
q
q
q q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q q
q
q
q
q
q
−0.25
0.00
0.25
5 10 15
iteration
pricereturn 818000
818025
818050
818075
818100
episode
0
1
2
3
4
0
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
iteration
inventory
0
1000
2000
3000
4000
0
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
iteration
count
0
1000
2000
3000
4000
0
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
iteration
count
0
1
2
3
−0.50 −0.25 0.00 0.25
price return
density
35
3. | Reinforcement Learning
Learning fill probabilities
A surface plot state space explored for either (a) Passive or (b) Aggressive for two variables, time (horizontal axis) and inventory (vertical
axis). Red implies that this choice is preferred, showing a preference to trade passively at the beginning of an order and then aggress near
the end. There is an additional aggressive order at the beginning of this particular scenario, which is a result of the overall low probability
of passively filling the entire order. The dark blue region in the lower left is not explored, which greatly reduces the state space
exploration. The upper right is also only weakly explored, due to the optimal path of filling throughout the episode. The surface plots were
generated from 20,000 simulations.
0 2 4 6 8 10
time
0
1
2
3
4
5
6
inventoryremaining
Exploration :: Action=PassiveSubmit :: Signal=0
0 2 4 6 8 10
time
0
1
2
3
4
5
6
inventoryremaining
Exploration :: Action=AggressiveSubmit :: Signal=0
36
3. | Empirical results
Convergence in training
37
3. | Empirical results
Backtest statistics
38
3. | Reinforcement learning
Evolution of trading
Machine Learning approach:
avoids hand crafting of logic / manual overfitting
allows for strategies to be optimised around characteristics of individual stocks
but this is not just about data and compute : domain expertise critical for applications in finance
39
3. | Infrastructure
Hardware
640 CPU cores
4 x 2880 GPU cores
2.5 TB RAM
Training time
16 hours for 700 tickers
40
3. | Architecture: production
41
Final remarks - Multi-agent systems
Further items to research and consider
In addition to being able to train a single agent operating in a system of noisy and informed agents - we are also studying the interaction
of multi-agents in systems specifically crafted to encourage co-evolution: how do agents co-adapt by leveraging their respective
advantages?
Auctions - single-auction price setting, sequential price setting scenarios.
Reservoir discovery - Micro-strategies can take advantage of temporary conditions by latching-onto markers, for example to capture
liquidity under a price limit, then resume normal operation when the opportunity disappears.
Limit-aware tactics - Parent limit orders are private information; implicitly changing behaviour at or near the limit is a signal: account
for constraints during decision making.
Arbitrageurs - Train agents for specialized interaction where optimal actions co-evolve.
Counterpary Simulations - Stress-test conditions and examine situations where counterparty diversity narrows to a particular set
(trading against faster opponents, for example).
Herding - Study behaviour when correlated motives result in coordinated behaviour, as compared to correlated information.
Theoretical Simulations - Similar to the case in Game Theory where the optimal strategy is conditioned on the player’s skill in
identifying the game she is in fact playing.
42
Disclaimer
This material (Material) is written by the Electronic Client Solutions Quantitative Solutions Team for distribution to you, our J.P. Morgans client, only. It may not be forwarded to any party without
obtaining the express written consent of J.P. Morgan. This Material is not available for distribution to persons regarded as private customers (or equivalent) in their home jurisdiction. It is not a
product of J.P. Morgans research department. Opinions expressed herein are solely those of the individual author and may differ from the views or opinions expressed by other areas of J.P. Morgan,
including the research department and its affiliates. The Material is provided for your information only and does not constitute: (i) research or a product of the J.P. Morgan research department, (ii)
an offer to sell, a solicitation of an offer to buy, or a recommendation for any investment product or strategy, or (iii) any investment, legal or tax advice. You are solely responsible for deciding
whether any investment product or strategy is appropriate for you based upon your investment goals, financial situation and tolerance for risk.
J.P. Morgan disclaims all representations and warranties in the information contained in this Material, whether express or implied, including, without limitation, any warranty of satisfactory quality,
completeness, accuracy, fitness for a particular purpose or non-infringement. The information contained herein is as of the date and time referenced in the Material and J.P. Morgan does not
undertake any obligation to update such information. All content, data, statements and other information are not warranted as to completeness or accuracy and are subject to change without
notice. J.P. Morgan disclaims any responsibility or liability, whether in contract, tort (including, without limitation, negligence), equity or otherwise, for the quality, accuracy or completeness of the
information contained in this Material, and for any reliance on, or uses to which, this Material, is put, and you are solely responsible for any use to which you put such information. Without limiting
any of the foregoing, to the fullest extent permitted by applicable law, in no event shall J.P. Morgan have any liability for any special, punitive, indirect, or consequential damages (including lost profits
or lost opportunity), in connection with the information contained in this Material, even if notified of the possibility of such damages. J.P. Morgan and its affiliates may have positions (long or short),
effect transactions or make markets in securities or financial instruments mentioned herein (or options with respect thereto), or provide advice or loans to or participate in the underwriting or
restructuring of the obligations of, issuers mentioned herein. This Material is proprietary and confidential to J.P. Morgan. Any comments or statements made herein do not necessarily reflect those
of J.P. Morgan, its subsidiaries or its affiliates. Clients should contact their salespersons at, and execute transactions through, a J.P. Morgan entity qualified in their home jurisdiction unless
governing law permits otherwise. Any unauthorized use, dissemination, distribution or copying of this Material, in whole or in part, is strictly prohibited.
Copyright 2016 JPMorgan Chase & Co. All rights reserved. J.P. Morgan is the global brand name for JPMorgan Chase & Co. and its subsidiaries and affiliates worldwide. J.P. Morgan Cazenove is a
marketing name for the U.K. investment banking businesses and EMEA cash equities and equity research businesses of JPMorgan Chase & Co. and its subsidiaries, conducted primarily through J.P.
Morgan Securities plc. Execution services are offered through J.P. Morgan Securities LLC (member of FINRA, NYSE and SIPC), J.P. Morgan Securities plc (a member of the London Stock Exchange and
authorized by the Prudential Regulation Authority and regulated by the Financial Conduct Authority and the Prudential Regulation Authority. Registered in England & Wales No. 2711006. Registered
Office: 25 Bank Street, Canary Wharf, London E14 5JP), J.P. Morgan Securities (Asia Pacific) Limited (CE number AAJ321) (regulated by the Hong Kong Monetary Authority and the Securities and
Futures Commission in Hong Kong), J.P. Morgan Securities Australia Limited (authorised and regulated by the Australian Securities & Investments Commission), J.P. Morgan Securities Japan Co., Ltd.
(authorised and regulated by the Financial Services Agency and the Japan Securities Dealers Association) and other investment banking affiliates and subsidiaries of J.P. Morgan in other
jurisdictions worldwide and registered with local authorities as appropriate. Please consult http://www.jpmorgan.com/pages/jpmorgan/investbk/global for more information. Product names,
company names and logos mentioned herein are trademarks or registered trademarks of their respective owners.
To the extent that you are a U.S. Institutional Investor or a “Major U.S. Institutional Investor” (collectively, “U.S.-Based Clients”) as those terms are defined in Rule 15a-6 (“Rule 15a-6”) of the U.S.
Securities Exchange Act of 1934 (as amended from time to time) you acknowledge that where a non-US affiliate of J.P. Morgan enters into a transaction with you, you acknowledge and understand
that J.P. Morgan performs such brokerage services with the U.S.- Based Clients pursuant to the terms of Rule 15a-6. Any transactions with U.S.-Based Clients are intermediated in accordance with
the terms of Rule 15a-6 by J.P. Morgans U.S registered broker-dealer affiliate, J.P. Morgan Securities LLC.
43

Contenu connexe

Tendances

"Quantitative Trading as a Mathematical Science" by Dr. Haksun Li, Founder an...
"Quantitative Trading as a Mathematical Science" by Dr. Haksun Li, Founder an..."Quantitative Trading as a Mathematical Science" by Dr. Haksun Li, Founder an...
"Quantitative Trading as a Mathematical Science" by Dr. Haksun Li, Founder an...
Quantopian
 
Backtesting And Live Trading With Interactive Brokers Using Python With Dr. H...
Backtesting And Live Trading With Interactive Brokers Using Python With Dr. H...Backtesting And Live Trading With Interactive Brokers Using Python With Dr. H...
Backtesting And Live Trading With Interactive Brokers Using Python With Dr. H...
QuantInsti
 

Tendances (20)

Algo trading(Minor Project) strategy EMA with Ipython
Algo trading(Minor Project) strategy EMA with IpythonAlgo trading(Minor Project) strategy EMA with Ipython
Algo trading(Minor Project) strategy EMA with Ipython
 
"Quant Trading for a Living – Lessons from a Life in the Trenches" by Andreas...
"Quant Trading for a Living – Lessons from a Life in the Trenches" by Andreas..."Quant Trading for a Living – Lessons from a Life in the Trenches" by Andreas...
"Quant Trading for a Living – Lessons from a Life in the Trenches" by Andreas...
 
QuantConnect - Introduction to Pairs Trading
QuantConnect - Introduction to Pairs TradingQuantConnect - Introduction to Pairs Trading
QuantConnect - Introduction to Pairs Trading
 
A Guided Tour of Machine Learning for Traders by Tucker Balch at QuantCon 2016
A Guided Tour of Machine Learning for Traders by Tucker Balch at QuantCon 2016A Guided Tour of Machine Learning for Traders by Tucker Balch at QuantCon 2016
A Guided Tour of Machine Learning for Traders by Tucker Balch at QuantCon 2016
 
"A Framework-Based Approach to Building Quantitative Trading Systems" by Dr. ...
"A Framework-Based Approach to Building Quantitative Trading Systems" by Dr. ..."A Framework-Based Approach to Building Quantitative Trading Systems" by Dr. ...
"A Framework-Based Approach to Building Quantitative Trading Systems" by Dr. ...
 
"Trading Strategies That Are Designed Not Fitted" by Robert Carver, Independe...
"Trading Strategies That Are Designed Not Fitted" by Robert Carver, Independe..."Trading Strategies That Are Designed Not Fitted" by Robert Carver, Independe...
"Trading Strategies That Are Designed Not Fitted" by Robert Carver, Independe...
 
Algorithmic & High-Frequency Trading
Algorithmic & High-Frequency TradingAlgorithmic & High-Frequency Trading
Algorithmic & High-Frequency Trading
 
"Quantitative Trading as a Mathematical Science" by Dr. Haksun Li, Founder an...
"Quantitative Trading as a Mathematical Science" by Dr. Haksun Li, Founder an..."Quantitative Trading as a Mathematical Science" by Dr. Haksun Li, Founder an...
"Quantitative Trading as a Mathematical Science" by Dr. Haksun Li, Founder an...
 
"Fundamental Forecasts: Methods and Timing" by Vinesh Jha, CEO of ExtractAlpha
"Fundamental Forecasts: Methods and Timing" by Vinesh Jha, CEO of ExtractAlpha"Fundamental Forecasts: Methods and Timing" by Vinesh Jha, CEO of ExtractAlpha
"Fundamental Forecasts: Methods and Timing" by Vinesh Jha, CEO of ExtractAlpha
 
"How to Run a Quantitative Trading Business in China with Python" by Xiaoyou ...
"How to Run a Quantitative Trading Business in China with Python" by Xiaoyou ..."How to Run a Quantitative Trading Business in China with Python" by Xiaoyou ...
"How to Run a Quantitative Trading Business in China with Python" by Xiaoyou ...
 
Should You Build Your Own Backtester? by Michael Halls-Moore at QuantCon 2016
Should You Build Your Own Backtester? by Michael Halls-Moore at QuantCon 2016Should You Build Your Own Backtester? by Michael Halls-Moore at QuantCon 2016
Should You Build Your Own Backtester? by Michael Halls-Moore at QuantCon 2016
 
DIY Quant Strategies on Quantopian
DIY Quant Strategies on QuantopianDIY Quant Strategies on Quantopian
DIY Quant Strategies on Quantopian
 
Modeling Transaction Costs for Algorithmic Strategies
Modeling Transaction Costs for Algorithmic StrategiesModeling Transaction Costs for Algorithmic Strategies
Modeling Transaction Costs for Algorithmic Strategies
 
Statistics - The Missing Link Between Technical Analysis and Algorithmic Trad...
Statistics - The Missing Link Between Technical Analysis and Algorithmic Trad...Statistics - The Missing Link Between Technical Analysis and Algorithmic Trad...
Statistics - The Missing Link Between Technical Analysis and Algorithmic Trad...
 
Paradigms of trading strategies formulation
Paradigms of trading strategies formulationParadigms of trading strategies formulation
Paradigms of trading strategies formulation
 
Order book dynamics in high frequency trading
Order book dynamics in high frequency tradingOrder book dynamics in high frequency trading
Order book dynamics in high frequency trading
 
Backtesting And Live Trading With Interactive Brokers Using Python With Dr. H...
Backtesting And Live Trading With Interactive Brokers Using Python With Dr. H...Backtesting And Live Trading With Interactive Brokers Using Python With Dr. H...
Backtesting And Live Trading With Interactive Brokers Using Python With Dr. H...
 
Decision tree and random forest
Decision tree and random forestDecision tree and random forest
Decision tree and random forest
 
Classification of quantitative trading strategies webinar ppt
Classification of quantitative trading strategies webinar pptClassification of quantitative trading strategies webinar ppt
Classification of quantitative trading strategies webinar ppt
 
Machine Learning for Finance Master Class
Machine Learning for Finance Master Class Machine Learning for Finance Master Class
Machine Learning for Finance Master Class
 

En vedette

The QuantCon Keynote: "Counter Trend Trading – Threat or Complement to Trend ...
The QuantCon Keynote: "Counter Trend Trading – Threat or Complement to Trend ...The QuantCon Keynote: "Counter Trend Trading – Threat or Complement to Trend ...
The QuantCon Keynote: "Counter Trend Trading – Threat or Complement to Trend ...
Quantopian
 

En vedette (13)

"Artificial Intelligence Powered By Crowdsourcing - The Next Evolution in Qua...
"Artificial Intelligence Powered By Crowdsourcing - The Next Evolution in Qua..."Artificial Intelligence Powered By Crowdsourcing - The Next Evolution in Qua...
"Artificial Intelligence Powered By Crowdsourcing - The Next Evolution in Qua...
 
"Applying Deep Learning Techniques to Financial Time Series" by Scott Treloar...
"Applying Deep Learning Techniques to Financial Time Series" by Scott Treloar..."Applying Deep Learning Techniques to Financial Time Series" by Scott Treloar...
"Applying Deep Learning Techniques to Financial Time Series" by Scott Treloar...
 
“Real Time Machine Learning Architecture and Sentiment Analysis Applied to Fi...
“Real Time Machine Learning Architecture and Sentiment Analysis Applied to Fi...“Real Time Machine Learning Architecture and Sentiment Analysis Applied to Fi...
“Real Time Machine Learning Architecture and Sentiment Analysis Applied to Fi...
 
“The Sum of The Parts Must Be Greater Than the Whole; Is There or What Is A C...
“The Sum of The Parts Must Be Greater Than the Whole; Is There or What Is A C...“The Sum of The Parts Must Be Greater Than the Whole; Is There or What Is A C...
“The Sum of The Parts Must Be Greater Than the Whole; Is There or What Is A C...
 
Risk Systems That Read by Nick Wade, Director of APAC marketing, Northfield I...
Risk Systems That Read by Nick Wade, Director of APAC marketing, Northfield I...Risk Systems That Read by Nick Wade, Director of APAC marketing, Northfield I...
Risk Systems That Read by Nick Wade, Director of APAC marketing, Northfield I...
 
Automated Selection and Robustness for Systematic Trading Strategies by Dr. T...
Automated Selection and Robustness for Systematic Trading Strategies by Dr. T...Automated Selection and Robustness for Systematic Trading Strategies by Dr. T...
Automated Selection and Robustness for Systematic Trading Strategies by Dr. T...
 
Intra-Day De Mark Plus Order Flow Indicator by Dr. Christopher Ting, SMU
  Intra-Day De Mark Plus Order Flow Indicator by Dr. Christopher Ting, SMU  Intra-Day De Mark Plus Order Flow Indicator by Dr. Christopher Ting, SMU
Intra-Day De Mark Plus Order Flow Indicator by Dr. Christopher Ting, SMU
 
"The 6 Stages of a Quant Equity Workflow" by Dr. Jessica Stauth, Vice Preside...
"The 6 Stages of a Quant Equity Workflow" by Dr. Jessica Stauth, Vice Preside..."The 6 Stages of a Quant Equity Workflow" by Dr. Jessica Stauth, Vice Preside...
"The 6 Stages of a Quant Equity Workflow" by Dr. Jessica Stauth, Vice Preside...
 
"A Framework for Developing Trading Models Based on Machine Learning" by Kris...
"A Framework for Developing Trading Models Based on Machine Learning" by Kris..."A Framework for Developing Trading Models Based on Machine Learning" by Kris...
"A Framework for Developing Trading Models Based on Machine Learning" by Kris...
 
The QuantCon Keynote: "Counter Trend Trading – Threat or Complement to Trend ...
The QuantCon Keynote: "Counter Trend Trading – Threat or Complement to Trend ...The QuantCon Keynote: "Counter Trend Trading – Threat or Complement to Trend ...
The QuantCon Keynote: "Counter Trend Trading – Threat or Complement to Trend ...
 
"The Hunt For Alpha Among Alternative Data Sources" by Dr. Michael Halls-Moor...
"The Hunt For Alpha Among Alternative Data Sources" by Dr. Michael Halls-Moor..."The Hunt For Alpha Among Alternative Data Sources" by Dr. Michael Halls-Moor...
"The Hunt For Alpha Among Alternative Data Sources" by Dr. Michael Halls-Moor...
 
"Is Momentum Still Relevant for Today’s Markets?" by Anthony Ng, Senior Lecturer
"Is Momentum Still Relevant for Today’s Markets?" by Anthony Ng, Senior Lecturer"Is Momentum Still Relevant for Today’s Markets?" by Anthony Ng, Senior Lecturer
"Is Momentum Still Relevant for Today’s Markets?" by Anthony Ng, Senior Lecturer
 
The Sustainable Active Investing Framework: Simple, but Not Easy by Wesley Gr...
The Sustainable Active Investing Framework: Simple, but Not Easy by Wesley Gr...The Sustainable Active Investing Framework: Simple, but Not Easy by Wesley Gr...
The Sustainable Active Investing Framework: Simple, but Not Easy by Wesley Gr...
 

Similaire à "Active Learning in Trading Algorithms" by David Fellah, Head of the EMEA Linear Quant Research Group at J.P. Morgan

Investigation of Frequent Batch Auctions using Agent Based Model
Investigation of Frequent Batch Auctions using Agent Based ModelInvestigation of Frequent Batch Auctions using Agent Based Model
Investigation of Frequent Batch Auctions using Agent Based Model
Takanobu Mizuta
 
Quant congressusa2011algotradinglast
Quant congressusa2011algotradinglastQuant congressusa2011algotradinglast
Quant congressusa2011algotradinglast
Tomasz Waszczyk
 
2022CIFEr
2022CIFEr2022CIFEr
2022CIFEr
Takanobu Mizuta
 
MRET Case Studies 5.16(2)-4
MRET Case Studies 5.16(2)-4MRET Case Studies 5.16(2)-4
MRET Case Studies 5.16(2)-4
Prashant Bhuyan
 
Market ultimate profitability - ENEC 2010 proceedings
Market ultimate profitability - ENEC 2010 proceedingsMarket ultimate profitability - ENEC 2010 proceedings
Market ultimate profitability - ENEC 2010 proceedings
Alexei Kazakov
 
1100163YifanGuo
1100163YifanGuo1100163YifanGuo
1100163YifanGuo
Yifan Guo
 

Similaire à "Active Learning in Trading Algorithms" by David Fellah, Head of the EMEA Linear Quant Research Group at J.P. Morgan (20)

Investigation of Frequent Batch Auctions using Agent Based Model
Investigation of Frequent Batch Auctions using Agent Based ModelInvestigation of Frequent Batch Auctions using Agent Based Model
Investigation of Frequent Batch Auctions using Agent Based Model
 
How Many Orders does a Spoofer Need? - Investigation by Agent-Based Model -
How Many Orders does a Spoofer Need?  - Investigation by Agent-Based Model -How Many Orders does a Spoofer Need?  - Investigation by Agent-Based Model -
How Many Orders does a Spoofer Need? - Investigation by Agent-Based Model -
 
Summary jpx-wpenno2
Summary jpx-wpenno2Summary jpx-wpenno2
Summary jpx-wpenno2
 
How to survive in a High Frequency World
How to survive in a High Frequency WorldHow to survive in a High Frequency World
How to survive in a High Frequency World
 
gatheral_financialrisks_paris2014.pdf
gatheral_financialrisks_paris2014.pdfgatheral_financialrisks_paris2014.pdf
gatheral_financialrisks_paris2014.pdf
 
Affecting Market Efficiency by Increasing Speed of Order Matching Systems on ...
Affecting Market Efficiency by Increasing Speed of Order Matching Systems on ...Affecting Market Efficiency by Increasing Speed of Order Matching Systems on ...
Affecting Market Efficiency by Increasing Speed of Order Matching Systems on ...
 
Value at risk
Value at riskValue at risk
Value at risk
 
Cifer2017
Cifer2017Cifer2017
Cifer2017
 
Quant congressusa2011algotradinglast
Quant congressusa2011algotradinglastQuant congressusa2011algotradinglast
Quant congressusa2011algotradinglast
 
Algorithmic and high-frequency_trading 2011
Algorithmic and high-frequency_trading 2011Algorithmic and high-frequency_trading 2011
Algorithmic and high-frequency_trading 2011
 
2022CIFEr
2022CIFEr2022CIFEr
2022CIFEr
 
MRET Case Studies 5.16(2)-4
MRET Case Studies 5.16(2)-4MRET Case Studies 5.16(2)-4
MRET Case Studies 5.16(2)-4
 
Iof4 b32ublh72kcx2cg3
Iof4 b32ublh72kcx2cg3Iof4 b32ublh72kcx2cg3
Iof4 b32ublh72kcx2cg3
 
Market ultimate profitability - ENEC 2010 proceedings
Market ultimate profitability - ENEC 2010 proceedingsMarket ultimate profitability - ENEC 2010 proceedings
Market ultimate profitability - ENEC 2010 proceedings
 
Summary jpx wp_en_no9
Summary jpx wp_en_no9Summary jpx wp_en_no9
Summary jpx wp_en_no9
 
Automated Trading
Automated TradingAutomated Trading
Automated Trading
 
1100163YifanGuo
1100163YifanGuo1100163YifanGuo
1100163YifanGuo
 
Wealth Accumulation under Equity Trading; A Rational Approach
Wealth Accumulation under Equity Trading; A Rational ApproachWealth Accumulation under Equity Trading; A Rational Approach
Wealth Accumulation under Equity Trading; A Rational Approach
 
Why do Active Funds that Trade Infrequently Make a Market more Efficient? --...
Why do Active Funds that Trade Infrequently Make a Market more Efficient?  --...Why do Active Funds that Trade Infrequently Make a Market more Efficient?  --...
Why do Active Funds that Trade Infrequently Make a Market more Efficient? --...
 
Limit Order Market Modeling with Double Auction
Limit Order Market Modeling with Double AuctionLimit Order Market Modeling with Double Auction
Limit Order Market Modeling with Double Auction
 

Plus de Quantopian

"From Insufficient Economic data to Economic Big Data – How Trade Data is red...
"From Insufficient Economic data to Economic Big Data – How Trade Data is red..."From Insufficient Economic data to Economic Big Data – How Trade Data is red...
"From Insufficient Economic data to Economic Big Data – How Trade Data is red...
Quantopian
 

Plus de Quantopian (20)

Being open (source) in the traditionally secretive field of quant finance.
Being open (source) in the traditionally secretive field of quant finance.Being open (source) in the traditionally secretive field of quant finance.
Being open (source) in the traditionally secretive field of quant finance.
 
Stauth common pitfalls_stock_market_modeling_pqtc_fall2018
Stauth common pitfalls_stock_market_modeling_pqtc_fall2018Stauth common pitfalls_stock_market_modeling_pqtc_fall2018
Stauth common pitfalls_stock_market_modeling_pqtc_fall2018
 
Tearsheet feedback webinar 10.10.18
Tearsheet feedback webinar 10.10.18Tearsheet feedback webinar 10.10.18
Tearsheet feedback webinar 10.10.18
 
"Three Dimensional Time: Working with Alternative Data" by Kathryn Glowinski,...
"Three Dimensional Time: Working with Alternative Data" by Kathryn Glowinski,..."Three Dimensional Time: Working with Alternative Data" by Kathryn Glowinski,...
"Three Dimensional Time: Working with Alternative Data" by Kathryn Glowinski,...
 
"Alpha from Alternative Data" by Emmett Kilduff, Founder and CEO of Eagle Alpha
"Alpha from Alternative Data" by Emmett Kilduff,  Founder and CEO of Eagle Alpha"Alpha from Alternative Data" by Emmett Kilduff,  Founder and CEO of Eagle Alpha
"Alpha from Alternative Data" by Emmett Kilduff, Founder and CEO of Eagle Alpha
 
"Supply Chain Earnings Diffusion" by Josh Holcroft, Head of Quantitative Rese...
"Supply Chain Earnings Diffusion" by Josh Holcroft, Head of Quantitative Rese..."Supply Chain Earnings Diffusion" by Josh Holcroft, Head of Quantitative Rese...
"Supply Chain Earnings Diffusion" by Josh Holcroft, Head of Quantitative Rese...
 
"Portfolio Optimisation When You Don’t Know the Future (or the Past)" by Rob...
"Portfolio Optimisation When You Don’t Know the Future (or the Past)" by Rob..."Portfolio Optimisation When You Don’t Know the Future (or the Past)" by Rob...
"Portfolio Optimisation When You Don’t Know the Future (or the Past)" by Rob...
 
“Real Time Machine Learning Architecture and Sentiment Analysis Applied to Fi...
“Real Time Machine Learning Architecture and Sentiment Analysis Applied to Fi...“Real Time Machine Learning Architecture and Sentiment Analysis Applied to Fi...
“Real Time Machine Learning Architecture and Sentiment Analysis Applied to Fi...
 
“Market Insights Through the Lens of a Risk Model” by Olivier d'Assier, Head ...
“Market Insights Through the Lens of a Risk Model” by Olivier d'Assier, Head ...“Market Insights Through the Lens of a Risk Model” by Olivier d'Assier, Head ...
“Market Insights Through the Lens of a Risk Model” by Olivier d'Assier, Head ...
 
"Maximize Alpha with Systematic Factor Testing" by Cheng Peng, Software Engin...
"Maximize Alpha with Systematic Factor Testing" by Cheng Peng, Software Engin..."Maximize Alpha with Systematic Factor Testing" by Cheng Peng, Software Engin...
"Maximize Alpha with Systematic Factor Testing" by Cheng Peng, Software Engin...
 
"From Alpha Discovery to Portfolio Construction: Pitfalls and Solutions" by D...
"From Alpha Discovery to Portfolio Construction: Pitfalls and Solutions" by D..."From Alpha Discovery to Portfolio Construction: Pitfalls and Solutions" by D...
"From Alpha Discovery to Portfolio Construction: Pitfalls and Solutions" by D...
 
"Deep Reinforcement Learning for Optimal Order Placement in a Limit Order Boo...
"Deep Reinforcement Learning for Optimal Order Placement in a Limit Order Boo..."Deep Reinforcement Learning for Optimal Order Placement in a Limit Order Boo...
"Deep Reinforcement Learning for Optimal Order Placement in a Limit Order Boo...
 
"Making the Grade: A Look Inside the Algorithm Evaluation Process" by Dr. Jes...
"Making the Grade: A Look Inside the Algorithm Evaluation Process" by Dr. Jes..."Making the Grade: A Look Inside the Algorithm Evaluation Process" by Dr. Jes...
"Making the Grade: A Look Inside the Algorithm Evaluation Process" by Dr. Jes...
 
"Building Diversified Portfolios that Outperform Out-of-Sample" by Dr. Marcos...
"Building Diversified Portfolios that Outperform Out-of-Sample" by Dr. Marcos..."Building Diversified Portfolios that Outperform Out-of-Sample" by Dr. Marcos...
"Building Diversified Portfolios that Outperform Out-of-Sample" by Dr. Marcos...
 
"From Insufficient Economic data to Economic Big Data – How Trade Data is red...
"From Insufficient Economic data to Economic Big Data – How Trade Data is red..."From Insufficient Economic data to Economic Big Data – How Trade Data is red...
"From Insufficient Economic data to Economic Big Data – How Trade Data is red...
 
"Machine Learning Approaches to Regime-aware Portfolio Management" by Michael...
"Machine Learning Approaches to Regime-aware Portfolio Management" by Michael..."Machine Learning Approaches to Regime-aware Portfolio Management" by Michael...
"Machine Learning Approaches to Regime-aware Portfolio Management" by Michael...
 
"Don't Lose Your Shirt Trading Mean-Reversion" by Edith Mandel, Principal at ...
"Don't Lose Your Shirt Trading Mean-Reversion" by Edith Mandel, Principal at ..."Don't Lose Your Shirt Trading Mean-Reversion" by Edith Mandel, Principal at ...
"Don't Lose Your Shirt Trading Mean-Reversion" by Edith Mandel, Principal at ...
 
"Deep Q-Learning for Trading" by Dr. Tucker Balch, Professor of Interactive C...
"Deep Q-Learning for Trading" by Dr. Tucker Balch, Professor of Interactive C..."Deep Q-Learning for Trading" by Dr. Tucker Balch, Professor of Interactive C...
"Deep Q-Learning for Trading" by Dr. Tucker Balch, Professor of Interactive C...
 
"Quantum Hierarchical Risk Parity - A Quantum-Inspired Approach to Portfolio ...
"Quantum Hierarchical Risk Parity - A Quantum-Inspired Approach to Portfolio ..."Quantum Hierarchical Risk Parity - A Quantum-Inspired Approach to Portfolio ...
"Quantum Hierarchical Risk Parity - A Quantum-Inspired Approach to Portfolio ...
 
"Snake Oil, Swamp Land, and Factor-Based Investing" by Gary Antonacci, author...
"Snake Oil, Swamp Land, and Factor-Based Investing" by Gary Antonacci, author..."Snake Oil, Swamp Land, and Factor-Based Investing" by Gary Antonacci, author...
"Snake Oil, Swamp Land, and Factor-Based Investing" by Gary Antonacci, author...
 

Dernier

VIP Call Girl in Mumbai Central 💧 9920725232 ( Call Me ) Get A New Crush Ever...
VIP Call Girl in Mumbai Central 💧 9920725232 ( Call Me ) Get A New Crush Ever...VIP Call Girl in Mumbai Central 💧 9920725232 ( Call Me ) Get A New Crush Ever...
VIP Call Girl in Mumbai Central 💧 9920725232 ( Call Me ) Get A New Crush Ever...
dipikadinghjn ( Why You Choose Us? ) Escorts
 
VIP Independent Call Girls in Mira Bhayandar 🌹 9920725232 ( Call Me ) Mumbai ...
VIP Independent Call Girls in Mira Bhayandar 🌹 9920725232 ( Call Me ) Mumbai ...VIP Independent Call Girls in Mira Bhayandar 🌹 9920725232 ( Call Me ) Mumbai ...
VIP Independent Call Girls in Mira Bhayandar 🌹 9920725232 ( Call Me ) Mumbai ...
dipikadinghjn ( Why You Choose Us? ) Escorts
 
From Luxury Escort Service Kamathipura : 9352852248 Make on-demand Arrangemen...
From Luxury Escort Service Kamathipura : 9352852248 Make on-demand Arrangemen...From Luxury Escort Service Kamathipura : 9352852248 Make on-demand Arrangemen...
From Luxury Escort Service Kamathipura : 9352852248 Make on-demand Arrangemen...
From Luxury Escort : 9352852248 Make on-demand Arrangements Near yOU
 

Dernier (20)

Gurley shaw Theory of Monetary Economics.
Gurley shaw Theory of Monetary Economics.Gurley shaw Theory of Monetary Economics.
Gurley shaw Theory of Monetary Economics.
 
VIP Call Girl in Mumbai Central 💧 9920725232 ( Call Me ) Get A New Crush Ever...
VIP Call Girl in Mumbai Central 💧 9920725232 ( Call Me ) Get A New Crush Ever...VIP Call Girl in Mumbai Central 💧 9920725232 ( Call Me ) Get A New Crush Ever...
VIP Call Girl in Mumbai Central 💧 9920725232 ( Call Me ) Get A New Crush Ever...
 
Top Rated Pune Call Girls Sinhagad Road ⟟ 6297143586 ⟟ Call Me For Genuine S...
Top Rated  Pune Call Girls Sinhagad Road ⟟ 6297143586 ⟟ Call Me For Genuine S...Top Rated  Pune Call Girls Sinhagad Road ⟟ 6297143586 ⟟ Call Me For Genuine S...
Top Rated Pune Call Girls Sinhagad Road ⟟ 6297143586 ⟟ Call Me For Genuine S...
 
TEST BANK For Corporate Finance, 13th Edition By Stephen Ross, Randolph Weste...
TEST BANK For Corporate Finance, 13th Edition By Stephen Ross, Randolph Weste...TEST BANK For Corporate Finance, 13th Edition By Stephen Ross, Randolph Weste...
TEST BANK For Corporate Finance, 13th Edition By Stephen Ross, Randolph Weste...
 
VIP Independent Call Girls in Mira Bhayandar 🌹 9920725232 ( Call Me ) Mumbai ...
VIP Independent Call Girls in Mira Bhayandar 🌹 9920725232 ( Call Me ) Mumbai ...VIP Independent Call Girls in Mira Bhayandar 🌹 9920725232 ( Call Me ) Mumbai ...
VIP Independent Call Girls in Mira Bhayandar 🌹 9920725232 ( Call Me ) Mumbai ...
 
Mira Road Awesome 100% Independent Call Girls NUmber-9833754194-Dahisar Inter...
Mira Road Awesome 100% Independent Call Girls NUmber-9833754194-Dahisar Inter...Mira Road Awesome 100% Independent Call Girls NUmber-9833754194-Dahisar Inter...
Mira Road Awesome 100% Independent Call Girls NUmber-9833754194-Dahisar Inter...
 
Stock Market Brief Deck (Under Pressure).pdf
Stock Market Brief Deck (Under Pressure).pdfStock Market Brief Deck (Under Pressure).pdf
Stock Market Brief Deck (Under Pressure).pdf
 
Call Girls in New Friends Colony Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escort...
Call Girls in New Friends Colony Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escort...Call Girls in New Friends Colony Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escort...
Call Girls in New Friends Colony Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escort...
 
Call Girls Service Pune ₹7.5k Pick Up & Drop With Cash Payment 9352852248 Cal...
Call Girls Service Pune ₹7.5k Pick Up & Drop With Cash Payment 9352852248 Cal...Call Girls Service Pune ₹7.5k Pick Up & Drop With Cash Payment 9352852248 Cal...
Call Girls Service Pune ₹7.5k Pick Up & Drop With Cash Payment 9352852248 Cal...
 
From Luxury Escort Service Kamathipura : 9352852248 Make on-demand Arrangemen...
From Luxury Escort Service Kamathipura : 9352852248 Make on-demand Arrangemen...From Luxury Escort Service Kamathipura : 9352852248 Make on-demand Arrangemen...
From Luxury Escort Service Kamathipura : 9352852248 Make on-demand Arrangemen...
 
Top Rated Pune Call Girls Lohegaon ⟟ 6297143586 ⟟ Call Me For Genuine Sex Se...
Top Rated  Pune Call Girls Lohegaon ⟟ 6297143586 ⟟ Call Me For Genuine Sex Se...Top Rated  Pune Call Girls Lohegaon ⟟ 6297143586 ⟟ Call Me For Genuine Sex Se...
Top Rated Pune Call Girls Lohegaon ⟟ 6297143586 ⟟ Call Me For Genuine Sex Se...
 
Top Rated Pune Call Girls Pashan ⟟ 6297143586 ⟟ Call Me For Genuine Sex Serv...
Top Rated  Pune Call Girls Pashan ⟟ 6297143586 ⟟ Call Me For Genuine Sex Serv...Top Rated  Pune Call Girls Pashan ⟟ 6297143586 ⟟ Call Me For Genuine Sex Serv...
Top Rated Pune Call Girls Pashan ⟟ 6297143586 ⟟ Call Me For Genuine Sex Serv...
 
Top Rated Pune Call Girls Dighi ⟟ 6297143586 ⟟ Call Me For Genuine Sex Servi...
Top Rated  Pune Call Girls Dighi ⟟ 6297143586 ⟟ Call Me For Genuine Sex Servi...Top Rated  Pune Call Girls Dighi ⟟ 6297143586 ⟟ Call Me For Genuine Sex Servi...
Top Rated Pune Call Girls Dighi ⟟ 6297143586 ⟟ Call Me For Genuine Sex Servi...
 
Mira Road Memorable Call Grls Number-9833754194-Bhayandar Speciallty Call Gir...
Mira Road Memorable Call Grls Number-9833754194-Bhayandar Speciallty Call Gir...Mira Road Memorable Call Grls Number-9833754194-Bhayandar Speciallty Call Gir...
Mira Road Memorable Call Grls Number-9833754194-Bhayandar Speciallty Call Gir...
 
Booking open Available Pune Call Girls Shivane 6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Shivane  6297143586 Call Hot Indian Gi...Booking open Available Pune Call Girls Shivane  6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Shivane 6297143586 Call Hot Indian Gi...
 
Solution Manual for Principles of Corporate Finance 14th Edition by Richard B...
Solution Manual for Principles of Corporate Finance 14th Edition by Richard B...Solution Manual for Principles of Corporate Finance 14th Edition by Richard B...
Solution Manual for Principles of Corporate Finance 14th Edition by Richard B...
 
(Vedika) Low Rate Call Girls in Pune Call Now 8250077686 Pune Escorts 24x7
(Vedika) Low Rate Call Girls in Pune Call Now 8250077686 Pune Escorts 24x7(Vedika) Low Rate Call Girls in Pune Call Now 8250077686 Pune Escorts 24x7
(Vedika) Low Rate Call Girls in Pune Call Now 8250077686 Pune Escorts 24x7
 
(INDIRA) Call Girl Srinagar Call Now 8617697112 Srinagar Escorts 24x7
(INDIRA) Call Girl Srinagar Call Now 8617697112 Srinagar Escorts 24x7(INDIRA) Call Girl Srinagar Call Now 8617697112 Srinagar Escorts 24x7
(INDIRA) Call Girl Srinagar Call Now 8617697112 Srinagar Escorts 24x7
 
Indore Real Estate Market Trends Report.pdf
Indore Real Estate Market Trends Report.pdfIndore Real Estate Market Trends Report.pdf
Indore Real Estate Market Trends Report.pdf
 
Business Principles, Tools, and Techniques in Participating in Various Types...
Business Principles, Tools, and Techniques  in Participating in Various Types...Business Principles, Tools, and Techniques  in Participating in Various Types...
Business Principles, Tools, and Techniques in Participating in Various Types...
 

"Active Learning in Trading Algorithms" by David Fellah, Head of the EMEA Linear Quant Research Group at J.P. Morgan

  • 1. active learning in trading algorithms J.P. Morgan Linear Quantitative Research | David Fellah November 10, 2016 QuantCon - Singapore, 2016
  • 2. 0. | Introduction Section 1: Market response 1. Review of the algorithmic trading problem: forecasting and feedback mechanisms 2. Impact modeling 2. Empirical examples - real world trading is a complex, dynamical problem Section 2: Machine learning in forecasting 1. Forecasting: Random Forests for high-frequency price direction prediction 2. Empirical results Section 3: Reinforcement Learning 1. Putting it all together: Reinforcement Learning 2. Problem setup 3. Learning Please note: This material is provided for information only and is not intended as a recommendation or an offer or solicitation for the purchase or sale of any security or financial instrument. 1
  • 3. 1. | Microstructure review Market response 1. 60-80% of trading occurs algorithmically - most trading occurs between machines. 2. Auto-correlation is a measure of concordance with previous observations: how likely do we observe another buyer-initiated trade? 3. Factors that contribute to the autocorrelation of signed trade-flow Parent order splitting, market fragmentation. 4. Joint information sets: Left plot - information due to trading activity. Right plot - information due to limit order placement activity. 1 10 100 lag 0.00010 0.00100 0.01000 0.10000 1.00000 acf eMini-S&P Gold FTSE -1500 -1000 -500 0 500 1000 1500 x -0.010 -0.005 0.000 0.005 0.010 f(x) Observations Prediction 95% µ conf int 2
  • 4. 1. | Microstructure review Limit orders cluster at varying price levels - reveals latent supply. Buyer/seller-initiated orders generate trade flow - reveals intent. Price patterns reveal competing agents - typical of algorithms exhibiting passive bias. Cancellation patterns reveal the transient nature of liquidity. Impact is dynamic - a model is needed for training, intra-trade planning, post-trade attribution. 3
  • 5. 1. | Microstructure review A transient model of market impact provides a consistent transform between orders executed at the "micro"-scale and the aggregate, "parent" or "meta" order trading costs. Practically speaking, this enables the formulation of a class or family of models that operate at the threshold between signals and scheduling, depending on the amount of information available at the point of decision. Micro-formulation is suitable for trading decisions at the micro-scale (order placement). f models the immediate market response, following a transaction. G models the subsequent decay or memory effect. pt = p0 × (1 + t 0 impulse f(.) decay G(t − s)− + (.)ds) (1) Description f(.) G(.) (.) MI.1 (t − s) 0.5spread MI.2 (t − s)1− spread HFMI s (t − s)1− spreads 1 2 3 4 5 T Time 0 5 10 15 20 PriceImpact 0 100 200 300 400 500 600 700 800 Market Trade Count 0.0 0.5 1.0 1.5 2.0 2.5 3.0 3.5 4.0 PriceImpact(ticks) 5% 10% 15% 20% 25% 30% 35% 4
  • 6. 1. | Microstructure review Low-frequency market impact A transient model of market impact provides a consistent transform between orders executed at the "micro"-scale and the aggregate, "parent" or "meta" order trading costs. Practically speaking, this enables the formulation of a class or family of models that operate at the threshold between signals and scheduling, depending on the amount of information available at the point of decision. Macro-formulation is suitable for scheduling/allocation decisions. pt = p0×(1+ t 0 impulse s decay (t − s)− ds)+ t+ × (2) where : Scale parameter : Volatility parameter : Trade Rate parameter : Impact Decay parameter : Spread parameter t : Noise term, t H0∼ IID(0, ) Region NA 488 0.89 .663 1-0.325 0.118 APAC 389 0.857 0.573 1-0.393 0.102 EMEA 349 0.734 0.65 1-0.39 0.066 06:56:40 08:20:00 09:43:20 11:06:40 12:30:00 13:53:20 15:16:40 16:40:00 Time 150 100 50 0 50 100 Returns(centredonarrivaltime) Returns/Impact Profile Example 0 2 4 6 8 10 bps(impact) Est Impact 06:56:40 08:20:00 09:43:20 11:06:40 12:30:00 13:53:20 15:16:40 16:40:00 Time 5000 4000 3000 2000 1000 0 1000 2000 Filled Execution Profile Example Lit Dark 5
  • 7. 1. | Microstructure review Impact accrual over multiple submissions 1. Consider a single parent order, with several lit and dark executions occurring sequentially. 2. Impact decays as a power-law over volume time. 3. The decay of impact is accounted in optimal scheduling. 06:56:40 08:20:00 09:43:20 11:06:40 12:30:00 13:53:20 15:16:40 16:40:00 Time 15000 10000 5000 0 5000 10000 Filled Execution Profile 2014.10.17 Lit Dark 06:56:40 08:20:00 09:43:20 11:06:40 12:30:00 13:53:20 15:16:40 16:40:00 Time 4.90 4.95 5.00 5.05 5.10 5.15 5.20 Price Returns/Impact Profile 2014.10.17 2 0 2 4 6 8 10 12 14 bps(impact) Buy Est Impact 6
  • 8. 1. | Microstructure review Basket Impact Profiles. Slippage is struck from arrival. The blue series represents the buy-side and sells are red; each group is a dollar-weighted basket. Prior to order arrival sells exceed buys versus arrival, and the area is shaded green. As orders arrival and trading begins, impact moves sells lower and buys higher. The impact estimate is on the second (right) y-axis. For example, buys move by +15bps and the estimated impact is roughly 10. The darker red/blue lines are the accumulated impact curves and subsequent decay as orders complete. Completion time would roughly correspond to the peak of impact. After this point, in this sample - buys revert and sells continue on their path downwards. (b) represents the total dollars bought and sold in lit and dark venues. Generally they are correlated but it is a matter of further research to measure their indiviudal and joint contributions to cost. 11:40:00 12:13:20 12:46:40 13:20:00 13:53:20 14:26:40 15:00:00 15:33:20 Time 7 6 5 4 3 2 1 0 1 2 bps(returns) Returns/Impact Profile 0.2 0.4 0.6 0.8 1.0 bps(impact) Buy Est Impact Sell Est Impact (a) 11:40:00 12:13:20 12:46:40 13:20:00 13:53:20 14:26:40 15:00:00 15:33:20 Time 15 10 5 0 5 10 bps(returns) Returns/Impact Profile 0.2 0.4 0.6 0.8 1.0 1.2 1.4 1.6 bps(impact) Buy Est Impact Sell Est Impact (b) 11:40:00 12:13:20 12:46:40 13:20:00 13:53:20 14:26:40 15:00:00 15:33:20 Time 10 8 6 4 2 0 2 4 6 bps(returns) Returns/Impact Profile 0.0 0.5 1.0 1.5 2.0 2.5 bps(impact) Buy Est Impact Sell Est Impact (c) 11:40:00 12:13:20 12:46:40 13:20:00 13:53:20 14:26:40 15:00:00 15:33:20 Time 0 500000 1000000 1500000 2000000 2500000 3000000 3500000 4000000 $filled Execution Profile 11:40:00 12:13:20 12:46:40 13:20:00 13:53:20 14:26:40 15:00:00 15:33:20 Time 0 500000 1000000 1500000 2000000 2500000 $filled Execution Profile 11:40:00 12:13:20 12:46:40 13:20:00 13:53:20 14:26:40 15:00:00 15:33:20 Time 0 500000 1000000 1500000 2000000 2500000 $filled Execution Profile 7
  • 9. 1. | Microstructure review 1. Model is used for simulating market response due to both passive and aggressive orders. 2. Midpoint price reconstruction through order/trade flows provide indicative fair value. 3. Changes in book prices reflect tick/spread constraints. 6 4 2 0 2 4 6 φt 0.010 0.005 0.000 0.005 rt pearsonr = 0.68; p = 5.3e-08 1500 1000 500 0 500 1000 1500 ψt 0.005 0.000 0.005 rt pearsonr = 0.27; p = 0.053 8
  • 10. 1. | The Trading Problem VWAP slippage minimisation VWAPslippage = m j=1 j (Mej − Pej ) + m j=1 ( j ( lj k=1 Pm j k j k lj k=1 j k − Mej ) + m j=1 ( lj k=1 j k − j ) lj k=1 Pm j k j k lj k=1 j k )) (3) 9
  • 11. 1. | The Trading Problem VWAP slippage minimisation different trading style required depending on rate of trading targeted 10
  • 12. 1. | The Trading Problem Optimal execution is path-dependent 1. In this simulation, the trading algorithm is measuring the impact of its activities jointly with market activity, considering both passive and aggressive placements. 2. Top series indicates price time series and market trades; middle chart charts net traded volume (due to executions) and net order flow (due to quote changes). 3. Impact is a path-dependent process - it depends on individual transactions considering market state at the time of execution, and how latent terms propagate over volume-time. 4. The optimisation problem is to minimise the difference between the series in the bottom chart. 5. In this context, rewards that can be attributed to individual transactions are only known at the end of the order - i.e. rewards are delayed. 0 50 100 150 200 250 300 350 400 450 500 550 600 650 700 750 800 850 900 9500.5 1.0 1.5 2.0 2.5 3.0 3.5 4.0 price +6.768e3 bid price ask price weighted mid point market trades 0 50 100 150 200 250 300 350 400 450 500 550 600 650 700 750 800 850 900 950 -1.0 -0.5 0.0 0.5 1.0 Signal(normalized) net traded volume net order flow 0 50 100 150 200 250 300 350 400 450 500 550 600 650 700 750 800 850 900 950 time (quotes) 0 1 2 3 4 5 6 estimatedpriceimpact estimated price estimated price (+2.5%) 11
  • 13. 2. | Machine learning in forecasting Section 2: Machine learning in forecasting 1. Review: Understanding model performance through bias-variance decomposition 2. Forecasting: Random Forests for high-frequency price direction prediction 3. Performance measures of classifiers 4. Trading with signals 12
  • 14. 2. | ML Forecasting - review Bias-variance decomposition and overfitting Bias reflects the error between the forecast, ˆf(x) and observed, target variable f(x) - how far is the prediction from reality? Variance is the variability of the model for a given data point - how does the prediction vary over examples? Noise is the irriducible error. The objective of most machine learning algorithms is to minimise loss with respect to some penalty function subject to a balance between bias and variance. Theorem. For the squared error loss, the bias-variance decomposition of the expected generalization error at X = x is Err(x) = Bias2 + Variance + Noise (4) where Bias2 : á[ ˆf(x)] − f(x) 2 Variance : á ˆf(x) − á[ ˆf(x)] 2 Noise : 2 e 13
  • 15. 2. | ML Forecasting - review Random Forests Error: (Residual error: Lowest achievable error, independent of L.) Bias: Decision trees usually have low bias. Variance:They often suffer from high variance - i.e. don’t generalise well due to overfitting. 𝒙 𝑝 𝜑1 (𝑌 = 𝑐|𝑋 = 𝒙) 𝜑1 𝜑 𝑀 … 𝑝 𝜑 𝑚 (𝑌 = 𝑐|𝑋 = 𝒙) ∑ 𝑝 𝜓(𝑌 = 𝑐|𝑋 = 𝒙) 14
  • 16. 2. | ML Forecasting - Random Forest Classifiers Directional classifiers are designed to predict large positive price moves, large negative price moves and flat price moves. The correctly classified instances for large positive price moves are shown in blue. The instances that were incorrectly classified as large positive price moves are shown in red (so we predicted the price would move up, but it did not). The green points are the instances that we predicted as being either flat or negative price categories. 1. The x-axis is the same signal as shown in the regression plots, being limit order flow over the past 100 ticks. 2. The presence of blue instances with negative order flow indicates that other information can be used to predict price moves. The prediction probability is seen to increase with higher values of order flow. 3. The larger, darker points indicate a higher prediction probability. We are more certain of the result with a higher the prediction probability, which is additional, useful information. -3 -2 -1 0 1 2 3 order flow -1.0 -0.5 0.0 0.5 1.0 priceresponse(ticks) Classification Random Forest :: Class 1 :: Precision=60%, Recall=64% Ensemble classifiers combine multiple signals, in this case around 15, that are weighted to give the best prediction power. 15
  • 17. 2. | ML Forecasting - Measuring Predictive Power Ensemble learning techniques are used to combine multiple, highly correlated signals into a composite vote that performs better than the individual voters. Branches on a tree can capture performant signal compositions in different regimes, such as momentum, low-spread, etc. Every transaction across the entire market is fed through a network of signals which are multi-tick horizon and also are accompanied with probabilities. During training, feature imbalances were taken into account through re-sampling of the data. Feature importance map for the random decision tree, by signal name. sellVolumeAtAskPriceOrLower10 BIDSIZE sellCountAtAskPriceOrLower10 stVolImb10 sellTradedVol10 prevAvgBidPrice10 sellCountAtAskPriceOrLower20 ordFlowSum10 ordFlowSum20 ordFlowAvg20 stVolImb40 stVolImbNorm10 spread10 prevAvgAskPrice40 ordFlowAvg10 ordAskFlowSum10 prevAvgWMPPPrice40 prevAvgBidPrice20 stVolImb20 sellTradedVol20 stVolImbNorm20 ordAskFlowSum20 prevAvgWMPPPrice10 prevWMPPPrice10 prevBidPrice10 ordBidFlowSum10 ordFlowAvg40 spread20 buyCountAtAskPriceOrHigher10 minDBidPrice10 buyVolumeAtAskPriceOrHigher10 prevVWAPWMPPPrice40 obImb40 ordBidFlowSum20 buyTradedVol10 buyCountAtAskPriceOrHigher20 sellVolumeAtAskPriceOrLower20 prevAvgAskPrice20 buyTradedVol20 prevVWAPWMPPPrice20 spread40 buyVolumeAtAskPriceOrHigher20 minDBidPrice20 ordAskFlowSum40 prevWMPPPrice20 prevAvgBidPrice40 prevAvgWMPPPrice20 maxDAskPrice40 ordBidFlowSum40 stVolImbNorm40 prevBidPrice40 prevAskPrice40 obImb20 trdVol20 obImbNorm20 prevVWAPWMPPPrice10 trdVol10 prevBidPrice20 prevWMPPPrice40 ordFlowSum40 buyTradedVol40 prevAskPrice20 ASKSIZE prevAvgAskPrice10 sellVolumeAtAskPriceOrLower40 obImbNorm40 obImb10 minDAskPrice40 sellTradedVol40 sellCountAtAskPriceOrLower40 trdVol40 prevAskPrice10 buyVolumeAtAskPriceOrHigher40 buyCountAtAskPriceOrHigher40 maxDBidPrice10 maxDBidPrice20 maxDAskPrice20 minDBidPrice40 maxDBidPrice40 obImbNorm10 minDAskPrice20 maxDAskPrice10 minDAskPrice10 0.00 0.02 0.04 0.06 0.08 0.10 0.12 Feature importances 16
  • 18. 2. | Measuring Predictive Power Prediction Accuracy Horizon (ticks) BAYGn.DE CCH.L DAIGn.DE ISP.MI SAPG.DE SASY.PA TOTF.PA VOD.L 10 0.797 0.777 0.662 0.877 0.712 0.773 0.748 0.791 20 0.749 0.731 0.638 0.832 0.678 0.728 0.708 0.739 50 0.691 0.679 0.620 0.763 0.642 0.671 0.656 0.669 100 0.662 0.649 0.612 0.714 0.623 0.645 0.639 0.648 200 0.644 0.625 0.603 0.675 0.612 0.631 0.625 0.631 800 1000 1200 1400 1600 1800 2000 time (ticks) 241.6 241.8 242.0 242.2 242.4 242.6 242.8 243.0 price VOD.L :: Bid and Ask Short-term Price Prediction 07:09:46 to 07:15:44 bid price ask price bid price predict down ask price predict up 0 5 10 15 20 25 30 35 40 45 time (ticks) 0.00 0.02 0.04 0.06 0.08 price +9.828e1 SASY.PA :: Short-term Bid Price Prediction 07:00:28 to 07:00:33 bid price ask price Bid Price Prediction Market Sell Market Buy 17
  • 19. 2. | ML Forecasting - Trading with Signals The simulator is then used to test if signal classification is useful over the time frames of the limit order submissions. 1. The upper plot shows an example of Crude Oil placement and filling using parameters similar to Production. Note the micro-structure changes over the timescales relevant to limit order placement. 2. The middle plot shows the prediction probabilities into up, down or flat price moves. 3. The lower plot shows periods when we could apply these predictions. The blue circles are periods where we predict the price to rise. The red circles are when we predict the price to fall. 0 200 400 600 8000.01 0.02 0.03 0.04 0.05 0.06 0.07 0.08 0.09 price +9.239e1 bid ask buy sell submit fill cancel 0 200 400 600 8000.0 0.2 0.4 0.6 0.8 1.0 predictionprobability Predict Down Predict Flat Predict Up 0 200 400 600 800 tick time 0.01 0.02 0.03 0.04 0.05 0.06 0.07 0.08 0.09 price +9.239e1 bid ask weighted mpp Predict Up Predict Down 18
  • 20. 2. | ML Forecasting - Trading with Signals Impact reduces signal effectiveness 1. To test our assumptions, we superimposed signals over child order submissions and studied the execution/slippage profile. 2. The first thing we noticed is a bias in the signal distribution themselves, due to the fact that any given number of orders would survive longer if the signal was trending away (not shown). 3. This observation has important implications for simulation: behaviour bias needs to be accounted for by calibration - calibrate fill probabilities using historical transaction data. 4. The black line is the unconditional move predicted as a function of probability (without the systematic presence of our child orders). 5. The lines above that represent order types (1-5) in terms of urgency; the difference between these conditional observations reflect the impact of child orders. 6. Clearly it is important to consider the joint effect of candidate child orders in optimal order placement, insofar as how they effect signals the algorithm uses. 7. The next section discusses how signals, impact on signals (impact), and opportunity costs are combined into an optimisation framework. 0.2 0.3 0.4 0.5 0.6 0.7 0.8 prediction probability of downward mpp move -0.10 -0.05 0.00 0.05 0.10 0.15 avgmidpointmove 1 2 3 4 5 19
  • 21. 2. | Order Queue Dynamics The plots below show the distribution of the bootstrapped mean for passively filled quantity. The upper plots show the filled quantity on the first price level (so near touch), the lower plots the secondary price level (so one tick away from the near touch). Each distribution contains 10,000 bootstrap samples. 1. The grey distribution is all samples. 2. The blue and orange distributions (named class A and B respectively) are the same points classified ex-post, using the prediction classification used in the previous slides. Class A is when we predict the price to remain flat or move away, Class B is when we predict the price to move in our favour. 3. There is a clear indication that we can predict a higher rate of passive filling based on the signals. 2.2 2.4 2.6 2.8 3.0 3.2 3.4 bootstrapped mean filled qty PRICE LEVEL 1 (contracts) 0 500 1000 1500 2000 numberofsamples Mean Passive Filled Qty (Simulations) :: Conditioned on Signals :: Crude Oil Class A & B Class A Class B 2.5 3.0 3.5 4.0 bootstrapped mean filled qty PRICE LEVEL 2 (contracts) 0 100 200 300 400 500 600 700 800 numberofsamples Class A & B Class A Class B 20
  • 22. 2. | Alpha Decay & System Latencies Each signal is characterized by a CDF that provides the fraction of observations that fall under a given prediction horizon. This can be used to tune/understand the effectiveness of the system overall as latencies are obtained for each leg of the process, from signal origination to execution. Furthermore, in real-time - each signal carries this information so that stale conditions are handled properly. 0 500 1000 1500 2000 prediction horizon (ms) 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1.0 fractionofinstances CDF :: Prediction Horizon Time 21
  • 23. 3. | Reinforcement learning Deep learning and reinforcement learning used in various fields 22
  • 24. 3. | Reinforcement Learning So how do we put the pieces together? How do we combine signals, market impact, order placement, scheduling, simulation to create a coherent strategy that follows an optimal solution hierarchically from scheduling to order placement? The aim is to choose optimal prices, sizes and durations for all orders based on both the prevailing and projected market conditions. The solution outlined here solves for these issues jointly as part of a stochastic optimal control problem. The learned value function, Q(s,a) = T t s,a is parametrised by t and the feature vector s,a at time t. t+1 = t+ (rt+1+ Qt(st+1,at+1)−Qt(st,at)) (5) where t+1 : updated parameter vector t : parameter vector at time t : feature vector s : is the current state a : the action space rt+1 : reward for action at in state st Qt(st,at) : value function Agent Environment Action atNew state st+1 Reward rt+1 23
  • 25. 3. | Reinforcement Learning Supervised Learning: learning by example Regression Chess game State Response For a chess game: xi = state of the board at time i. yi = f(xi) optimal move when in state xi. But how can we obtain samples yi = f(xi) ? 24
  • 26. 3. | Reinforcement Learning Reinforcement Learning: value function The value function V is the memory/brain of the agent: Instead of learning directly the optimal action we learn the value of a pair (xi,ai). V maps every state/action to a value: xi,ai → vi ∈ . The value of (xi,ai) is defined as the sum of expected future rewards when starting from xi and taking first action ai. V(xi,ai) = á T t=i Rt|ai Every time we visit a state, take on action and receive a reward we update the function V: 1. Record (xi → ai → ri → xi+1). 2. Update V The central question: how do we perform the update? 25
  • 27. 3. | Reinforcement Learning Reinforcement Learning: value function After having observed (xi,ai,ri,xi+1) we obtain: V(xi,ai) = á   T t=i Rt|ai   = ri + á   T t=i+1 Rt   =: yi yi defined above is our sample! But what about the expectation á part in yi? Start with a random initialisation for the mapping V and improve gradually. 26
  • 28. 3. | Reinforcement Learning Definition (State-value function) The function V (s) = E {Rt|st = s} = E    ∞ k=0 k rt+k+1 st = s    (6) is called the state-value function for policy . E =expected value assuming the agent follows the policy with an arbitrary time step t. Definition (Optimal state-value function) The function V∗ (s) = maxV (s)∀s ∈ S (7) is called optimal state-value function. 27
  • 29. 3. | Reinforcement Learning Definition (Action-value function) The function Q (s,a) = E {Rt|st = s,at = a} = E    ∞ k=0 k rt+k+1 st = s,at = a    (8) is called the action-value function for policy . Definition (Optimal action-value function) The function Q∗ (s,a) = maxQ (s,a)∀s ∈ S and a ∈ A(s) (9) is called optimal action-value function. 28
  • 30. 3. | The Q-Learning Algorithm Algorithm 1 Q-Learning 1: 1. Initialisation: Load a simulation environment: price series, fill probability; Initialise the value function V0 and set the parameters: , ; 2: 2. Optimisation: 3: for episode = 1,2,3... do 4: for t = 1,2,3...T do 5: Observe current state st; 6: Take an action at(Qt,st, ); 7: Observe new state st+1; 8: Receive reward rt(st,at,st+1); 9: Update value function using rt and current estimate Qt: a) compute yt = rt + maxa Qt(st+1,a) b) update the function Qt with target yt 10: end for 11: end for 29
  • 31. 3. | Reinforcement learning Problem setup Objective : VWAP minimisation Optimising on deterministic part (spread cost) which will mainly affect mean of slippage distribution. Profile slippage is reduced by ensuring orders are spread enough throughout the episode (e.g. risk of being filled too quickly entirely passively for small quantities). Rewards are made of 2 parts: Immediate rewards (for each iteration): increment of spread cost (on executions we had since last iteration) Terminal rewards (end of episode): completion penalty order duration penalty market impact penalty 30
  • 32. 3. | Reinforcement learning State space States which are common to every placement: target historical pov rate %progress (= %timeElapsed - %inventoryTraded) %timeElapsed alpha profile For each placement/action evaluated, we have those specific states: expected spread cost fill probability size placed (as % of episode quantity) 31
  • 33. 3. | Reinforcement learning Fill probabilities 32
  • 34. 3. | Reinforcement learning Placement evaluations 33
  • 35. 3. | Reinforcement learning Neural network architecture 34
  • 36. 3. | Reinforcement Learning Learning price momentum - arrival objective q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q qq q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q −0.25 0.00 0.25 5 10 15 iteration pricereturn 25 50 75 100 episode q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q −0.25 0.00 0.25 5 10 15 iteration pricereturn 818000 818025 818050 818075 818100 episode 0 1 2 3 4 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 iteration inventory 0 1000 2000 3000 4000 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 iteration count 0 1000 2000 3000 4000 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 iteration count 0 1 2 3 −0.50 −0.25 0.00 0.25 price return density 35
  • 37. 3. | Reinforcement Learning Learning fill probabilities A surface plot state space explored for either (a) Passive or (b) Aggressive for two variables, time (horizontal axis) and inventory (vertical axis). Red implies that this choice is preferred, showing a preference to trade passively at the beginning of an order and then aggress near the end. There is an additional aggressive order at the beginning of this particular scenario, which is a result of the overall low probability of passively filling the entire order. The dark blue region in the lower left is not explored, which greatly reduces the state space exploration. The upper right is also only weakly explored, due to the optimal path of filling throughout the episode. The surface plots were generated from 20,000 simulations. 0 2 4 6 8 10 time 0 1 2 3 4 5 6 inventoryremaining Exploration :: Action=PassiveSubmit :: Signal=0 0 2 4 6 8 10 time 0 1 2 3 4 5 6 inventoryremaining Exploration :: Action=AggressiveSubmit :: Signal=0 36
  • 38. 3. | Empirical results Convergence in training 37
  • 39. 3. | Empirical results Backtest statistics 38
  • 40. 3. | Reinforcement learning Evolution of trading Machine Learning approach: avoids hand crafting of logic / manual overfitting allows for strategies to be optimised around characteristics of individual stocks but this is not just about data and compute : domain expertise critical for applications in finance 39
  • 41. 3. | Infrastructure Hardware 640 CPU cores 4 x 2880 GPU cores 2.5 TB RAM Training time 16 hours for 700 tickers 40
  • 42. 3. | Architecture: production 41
  • 43. Final remarks - Multi-agent systems Further items to research and consider In addition to being able to train a single agent operating in a system of noisy and informed agents - we are also studying the interaction of multi-agents in systems specifically crafted to encourage co-evolution: how do agents co-adapt by leveraging their respective advantages? Auctions - single-auction price setting, sequential price setting scenarios. Reservoir discovery - Micro-strategies can take advantage of temporary conditions by latching-onto markers, for example to capture liquidity under a price limit, then resume normal operation when the opportunity disappears. Limit-aware tactics - Parent limit orders are private information; implicitly changing behaviour at or near the limit is a signal: account for constraints during decision making. Arbitrageurs - Train agents for specialized interaction where optimal actions co-evolve. Counterpary Simulations - Stress-test conditions and examine situations where counterparty diversity narrows to a particular set (trading against faster opponents, for example). Herding - Study behaviour when correlated motives result in coordinated behaviour, as compared to correlated information. Theoretical Simulations - Similar to the case in Game Theory where the optimal strategy is conditioned on the player’s skill in identifying the game she is in fact playing. 42
  • 44. Disclaimer This material (Material) is written by the Electronic Client Solutions Quantitative Solutions Team for distribution to you, our J.P. Morgans client, only. It may not be forwarded to any party without obtaining the express written consent of J.P. Morgan. This Material is not available for distribution to persons regarded as private customers (or equivalent) in their home jurisdiction. It is not a product of J.P. Morgans research department. Opinions expressed herein are solely those of the individual author and may differ from the views or opinions expressed by other areas of J.P. Morgan, including the research department and its affiliates. The Material is provided for your information only and does not constitute: (i) research or a product of the J.P. Morgan research department, (ii) an offer to sell, a solicitation of an offer to buy, or a recommendation for any investment product or strategy, or (iii) any investment, legal or tax advice. You are solely responsible for deciding whether any investment product or strategy is appropriate for you based upon your investment goals, financial situation and tolerance for risk. J.P. Morgan disclaims all representations and warranties in the information contained in this Material, whether express or implied, including, without limitation, any warranty of satisfactory quality, completeness, accuracy, fitness for a particular purpose or non-infringement. The information contained herein is as of the date and time referenced in the Material and J.P. Morgan does not undertake any obligation to update such information. All content, data, statements and other information are not warranted as to completeness or accuracy and are subject to change without notice. J.P. Morgan disclaims any responsibility or liability, whether in contract, tort (including, without limitation, negligence), equity or otherwise, for the quality, accuracy or completeness of the information contained in this Material, and for any reliance on, or uses to which, this Material, is put, and you are solely responsible for any use to which you put such information. Without limiting any of the foregoing, to the fullest extent permitted by applicable law, in no event shall J.P. Morgan have any liability for any special, punitive, indirect, or consequential damages (including lost profits or lost opportunity), in connection with the information contained in this Material, even if notified of the possibility of such damages. J.P. Morgan and its affiliates may have positions (long or short), effect transactions or make markets in securities or financial instruments mentioned herein (or options with respect thereto), or provide advice or loans to or participate in the underwriting or restructuring of the obligations of, issuers mentioned herein. This Material is proprietary and confidential to J.P. Morgan. Any comments or statements made herein do not necessarily reflect those of J.P. Morgan, its subsidiaries or its affiliates. Clients should contact their salespersons at, and execute transactions through, a J.P. Morgan entity qualified in their home jurisdiction unless governing law permits otherwise. Any unauthorized use, dissemination, distribution or copying of this Material, in whole or in part, is strictly prohibited. Copyright 2016 JPMorgan Chase & Co. All rights reserved. J.P. Morgan is the global brand name for JPMorgan Chase & Co. and its subsidiaries and affiliates worldwide. J.P. Morgan Cazenove is a marketing name for the U.K. investment banking businesses and EMEA cash equities and equity research businesses of JPMorgan Chase & Co. and its subsidiaries, conducted primarily through J.P. Morgan Securities plc. Execution services are offered through J.P. Morgan Securities LLC (member of FINRA, NYSE and SIPC), J.P. Morgan Securities plc (a member of the London Stock Exchange and authorized by the Prudential Regulation Authority and regulated by the Financial Conduct Authority and the Prudential Regulation Authority. Registered in England & Wales No. 2711006. Registered Office: 25 Bank Street, Canary Wharf, London E14 5JP), J.P. Morgan Securities (Asia Pacific) Limited (CE number AAJ321) (regulated by the Hong Kong Monetary Authority and the Securities and Futures Commission in Hong Kong), J.P. Morgan Securities Australia Limited (authorised and regulated by the Australian Securities & Investments Commission), J.P. Morgan Securities Japan Co., Ltd. (authorised and regulated by the Financial Services Agency and the Japan Securities Dealers Association) and other investment banking affiliates and subsidiaries of J.P. Morgan in other jurisdictions worldwide and registered with local authorities as appropriate. Please consult http://www.jpmorgan.com/pages/jpmorgan/investbk/global for more information. Product names, company names and logos mentioned herein are trademarks or registered trademarks of their respective owners. To the extent that you are a U.S. Institutional Investor or a “Major U.S. Institutional Investor” (collectively, “U.S.-Based Clients”) as those terms are defined in Rule 15a-6 (“Rule 15a-6”) of the U.S. Securities Exchange Act of 1934 (as amended from time to time) you acknowledge that where a non-US affiliate of J.P. Morgan enters into a transaction with you, you acknowledge and understand that J.P. Morgan performs such brokerage services with the U.S.- Based Clients pursuant to the terms of Rule 15a-6. Any transactions with U.S.-Based Clients are intermediated in accordance with the terms of Rule 15a-6 by J.P. Morgans U.S registered broker-dealer affiliate, J.P. Morgan Securities LLC. 43