SlideShare une entreprise Scribd logo
1  sur  22
Télécharger pour lire hors ligne
Machine Learning in Actuarial Science & Insurance
Arthur Charpentier
Machine learning for economists and applied social scientists, 2020
@freakonometrics freakonometrics freakonometrics.hypotheses.org 1 / 22
Arthur Charpentier
Université du Québec à Montréal
@freakonometrics
freakonometrics
freakonometrics.hypotheses.org
@freakonometrics freakonometrics freakonometrics.hypotheses.org 2 / 22
Insurance & Actuarial Science
“Insurance is the contribution of the many to the misfortune of the few ”
POLICYHOLDER
INSURANCE
COMPANY
contribution (premium)
(possible) losses
What would be a “fair contribution ” ? see O’Neill (1997)
pure actuarial fairness contributions for individual policyholders should perfectly
reflect their predicted risk levels → predictive modeling
choice-sensitive fairness contributions should take into account only risks that
result from choices - luck-egalitarianism (Cohen (1989) or Arneson (2011))
@freakonometrics freakonometrics freakonometrics.hypotheses.org 3 / 22
Agenda
General Insurance & Predictive Modeling
From Econometric Techniques to Machine Learning
Goodness of Fit & Uncertainty
Model Interpretation
Price Discrimination & Fairness
To Go Further
@freakonometrics freakonometrics freakonometrics.hypotheses.org 4 / 22
General Insurance & Predictive Modeling (1)
fraud detection & network data
see e.g. Óskarsdóttir et al. (2019)
parametric insurance & satellite pictures
see e.g. de Leeuw et al. (2014)
claims reserving
see e.g. Wüthrich (2018)
automatic reading (medical reports)
See also Denuit et al. (2019a, 2020, 2019b)
@freakonometrics freakonometrics freakonometrics.hypotheses.org 5 / 22
General Insurance & Predictive Modeling (2)
premium computation, π = EP(S)
POLICYHOLDER
INSURANCE
COMPANY
premium π = EP(S)
(possible) losses S = Y1 + · · · + YN
or, given some features X = {X1, · · · , Xp}, premium π(x) = EPX
(S) = E[S|X]
Under standard assumptions,
π(x) = E[S|X] = E[N|X]
frequency
· E[Y |X]
average cost
= E[1S>0|X]
occurence
· E[S|X, S > 0]
individual cost
@freakonometrics freakonometrics freakonometrics.hypotheses.org 6 / 22
From Econometric Techniques to Machine Learning (1)
Merging claims & underwriters databases (per policy), e.g. (ni , ei , xi ), i = 1, · · · , np
1 id exposure zone pwr agecar agedrv model gas dsty region nbr occ
2 1 3227 0.87 C 7 0 56 12 D 93 13 0 0
3 2 4115 0.72 D 5 0 45 22 R 54 13 0 0
4 3 5121 0.05 C 6 3 37 17 D 11 13 0 0
5 4 5142 0.90 C 10 10 42 7 D 93 13 0 0
6 5 6255 0.12 C 7 0 59 11 R 73 13 0 0
7 6 8486 0.83 C 5 0 75 7 R 42 13 2 1
Standard model, Poisson regression (possibly zero-inflated, possibly over-dispesersed,
etc), related to the Poisson process
Ni ∼ P(ei · exp[θi ]), where θi = α0 + α1x1,i + α2x2,i + · · · + αpxp,i = xi α
or possibly GAMs (for nonlinearities, or spatial component)
θi = α0 + s1(x1,i ) + α2x2,i + · · · + αpxp,i , s1(x) =
j
γj ϕj (x)
@freakonometrics freakonometrics freakonometrics.hypotheses.org 7 / 22
From Econometric Techniques to Machine Learning (2)
or (yi xi ), i = 1, · · · , nc
1 id no cover cost zone pwr agecar agedrv model gas dsty
2 1 1870 17219 1TP 1692.29 C 5 0 52 12 R 93
3 2 1963 16336 1TP 422.05 E 9 0 78 12 R 27
4 3 4263 17089 2MT 549.21 C 10 7 27 17 D 19
5 4 5181 17801 1TP 191.15 D 5 2 26 3 D 91
6 5 6375 17485 1TP 2031.77 B 7 4 46 6 R 48
Standard model, Gamma regression (with a log link function)
Yi ∼ G(exp[ϑi ], b), where ϑi = β0 + β1x1,i + β2x2,i + · · · + βpxp,i = xi β
possible excluding large claims (and pooling on the entiere population)
E[Y |X] = P[Y ≤ s|X]
ps
· E[Y |X, Y ≤ s]
µs (x)
+ P[Y > s|X]
1−ps
· E[Y |X, Y > s]
y
or some Tweedie regression on S (but less flexible), Tweedie (1984).
@freakonometrics freakonometrics freakonometrics.hypotheses.org 8 / 22
From Econometric Techniques to Machine Learning (3)
Probabilistic interpretations : estimation using maximum likelihood - Generalized
(Mixed) Linear Models (see McCullagh and Nelder (1989) or Jørgensen (1997))
Individual model: S = 1S>0 · S
Collective model: S =
N
i=1
Yi , N ∼ P(λ), Yi ∼ G(α, β) in the exponential family,
with variance function V (µ) = µk, where k ∈ [1, 2]
log L =
n
i=1
yi
µ1−k
i
1 − k
−
µ2−k
i
2 − k
− (yi ,µi )
, where µi = exp[xi β]
Machine learning : interpret − log L as a loss
One can also include some penalty if the number of possible covariates is too large...
@freakonometrics freakonometrics freakonometrics.hypotheses.org 9 / 22
Goodness of Fit & Uncertainty (1)
Consider a simple model: number of claims N ∈ {0, 1} and fixed cost, say $1, 000.
Simple classification problem.
Assume that N ∼ B(θ) and we have some covariate x.
Two important features:
the dispersion of the heterogeneity, i.e. the variance of θ
e.g. assume that θ has a Beta distribution on [0, 1]
the dependence between heterogeneity θ and covariate x
← low correlation high correlation →
@freakonometrics freakonometrics freakonometrics.hypotheses.org 10 / 22
Goodness of Fit & Uncertainty (1)
Consider a simple model: number of claims N ∈ {0, 1} and fixed cost, say $1, 000.
Simple classification problem.
Assume that N ∼ B(θ) and we have some covariate x.
Two important features:
the dispersion of the heterogeneity, i.e. the variance of θ
e.g. assume that θ has a Beta distribution on [0, 1]
the dependence between heterogeneity θ and covariate x
← low correlation high correlation →
@freakonometrics freakonometrics freakonometrics.hypotheses.org 11 / 22
Goodness of Fit & Uncertainty (2)
the dispersion of the heterogeneity, d = 10% or 40%
the dependence between heterogeneity θ and covariate x
(correlation of the underlying copula function)
very difficult to reach high AUC (area under the ROC curve)
More complicated for insurance premiums...
Frees et al. (2014) defined a ROC-type curve, inspired by Lorenz
curve: given observed losses si and premiums π(xi ), policyhold-
ers ordered by premiums, π(x1) ≥ π(x2) ≥ · · · ≥ π(xn), plot
{Fi , Li } with Fi =
i
n
proportion
of insured
and Li =
i
j=1 sj
n
j=1 sj
proportion
of losses
@freakonometrics freakonometrics freakonometrics.hypotheses.org 12 / 22
Goodness of Fit & Uncertainty (3)
Parallel with classical OLS models, S ∼ N(θ, σ2), with covariates X = {X1, · · · , Xp}
Var S = E Var[S|X] + Var E[S|X]
premium
E Var[S|X] = E Var[S|Θ]
perfect pricing = σ2
+ E Var E[S|Θ] X
misfit
q
q
q
q
q
q
q
q
qq
q
q
q
q
q
q
q
q
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qqqq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qq
q
q
q
q
qq
qq
q
q
qq
q
q
q
q
q
qq
q
q
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qqq
q
q
qq
q
q
q
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qqq
q
q
q
q
q
q
q
q
q
q
q
q
q
qq
q
q
q
q
q
q
q
q
qq
qq
q
q
qq
q
q
q
q
q
q
q
q
q
q
q
qq
q
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qq
q
q
q
q
q
q
q
q
q
qq
q
q
q
q
q
q
q
q
q
q
qqq
q
q
q
q
q
q
q
qqq
q
q
q
q
q
q
q
qq
q
q
q
q
q
q
q
q
q
q
q
q
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qq
q
qqq
q
q
q
q
q
q
q
qq
q
q
qq
q
q
q
q
q
q
q
q
q
q
qq
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qqq
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qq
q
q
q
q
q
q
qq
q
q
q
q
q
q
qq
q
q
q
q
q
q
q
q
qq
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qq
q
q
q
q
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qqq
q
q
q
q
qq
q
q
q
q
q
q
qq
q
q
qq
q
q
q
q
q
q
q
q
qq
q
q
q
q
q
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qq
q
q
q
q
qq
0 1 2 3 4 5
05101520
Risk Factor
Risk
q
q
q
q
q
q
q
q
qq
q
q
q
q
q
q
q
q
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qqqq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qq
q
q
q
q
qq
qq
q
q
qq
q
q
q
q
q
qq
q
q
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qqq
q
q
qq
q
q
q
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qqq
q
q
q
q
q
q
q
q
q
q
q
q
q
qq
q
q
q
q
q
q
q
q
qq
qq
q
q
qq
q
q
q
q
q
q
q
q
q
q
q
qq
q
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qq
q
q
q
q
q
q
q
q
q
qq
q
q
q
q
q
q
q
q
q
q
qqq
q
q
q
q
q
q
q
qqq
q
q
q
q
q
q
q
qq
q
q
q
q
q
q
q
q
q
q
q
q
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qq
q
qqq
q
q
q
q
q
q
q
qq
q
q
qq
q
q
q
q
q
q
q
q
q
q
qq
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qqq
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qq
q
q
q
q
q
q
qq
q
q
q
q
q
q
qq
q
q
q
q
q
q
q
q
qq
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qq
q
q
q
q
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qqq
q
q
q
q
qq
q
q
q
q
q
q
qq
q
q
qq
q
q
q
q
q
q
q
q
qq
q
q
q
q
q
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qq
q
q
q
q
qq
0 1 2 3 4 5
−10−50510
Risk Factor
Profit&Loss
q
q
Profit&Loss
−10−50510
q
q
q
q
q
q
q
q
qq
q
q
q
q
q
q
q
q
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qqqq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qq
q
q
q
q
qq
qq
q
q
qq
q
q
q
q
q
qq
q
q
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qqq
q
q
qq
q
q
q
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qqq
q
q
q
q
q
q
q
q
q
q
q
q
q
qq
q
q
q
q
q
q
q
q
qq
qq
q
q
qq
q
q
q
q
q
q
q
q
q
q
q
qq
q
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qq
q
q
q
q
q
q
q
q
q
qq
q
q
q
q
q
q
q
q
q
q
qqq
q
q
q
q
q
q
q
qqq
q
q
q
q
q
q
q
qq
q
q
q
q
q
q
q
q
q
q
q
q
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qq
q
qqq
q
q
q
q
q
q
q
qq
q
q
qq
q
q
q
q
q
q
q
q
q
q
qq
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qqq
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qq
q
q
q
q
q
q
qq
q
q
q
q
q
q
qq
q
q
q
q
q
q
q
q
qq
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qq
q
q
q
q
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qqq
q
q
q
q
qq
q
q
q
q
q
q
qq
q
q
qq
q
q
q
q
q
q
q
q
qq
q
q
q
q
q
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qq
q
q
q
q
qq
0 1 2 3 4 5
05101520
Risk Factor
Risk
q
q
q
q
q
q
q
q
qq
q
q
q
q
q
q
q
q
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qqqq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qq
q
q
q
q
qq
qq
q
q
qq
q
q
q
q
q
qq
q
q
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qqq
q
q
qq
q
q
q
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qqq
q
q
q
q
q
q
q
q
q
q
q
q
q
qq
q
q
q
q
q
q
q
q
qq
qq
q
q
qq
q
q
q
q
q
q
q
q
q
q
q
qq
q
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qq
q
q
q
q
q
q
q
q
q
qq
q
q
q
q
q
q
q
q
q
q
qqq
q
q
q
q
q
q
q
qqq
q
q
q
q
q
q
q
qq
q
q
q
q
q
q
q
q
q
q
q
q
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qq
q
qqq
q
q
q
q
q
q
q
qq
q
q
qq
q
q
q
q
q
q
q
q
q
q
qq
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qqq
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qq
q
q
q
q
q
q
qq
q
q
q
q
q
q
qq
q
q
q
q
q
q
q
q
qq
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qq
q
q
q
q
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qqq
q
q
q
q
qq
q
q
q
q
q
q
qq
q
q
qq
q
q
q
q
q
q
q
q
qq
q
q
q
q
q
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qq
q
q
q
q
qq
0 1 2 3 4 5
−10−50510
Risk Factor
Profit&Loss
q
q
Profit&Loss
−10−50510
q
q
q
q
q
q
q
q
qq
q
q
q
q
q
q
q
q
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qqqq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qq
q
q
q
q
qq
qq
q
q
qq
q
q
q
q
q
qq
q
q
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qqq
q
q
qq
q
q
q
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qqq
q
q
q
q
q
q
q
q
q
q
q
q
q
qq
q
q
q
q
q
q
q
q
qq
qq
q
q
qq
q
q
q
q
q
q
q
q
q
q
q
qq
q
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qq
q
q
q
q
q
q
q
q
q
qq
q
q
q
q
q
q
q
q
q
q
qqq
q
q
q
q
q
q
q
qqq
q
q
q
q
q
q
q
qq
q
q
q
q
q
q
q
q
q
q
q
q
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qq
q
qqq
q
q
q
q
q
q
q
qq
q
q
qq
q
q
q
q
q
q
q
q
q
q
qq
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qqq
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qq
q
q
q
q
q
q
qq
q
q
q
q
q
q
qq
q
q
q
q
q
q
q
q
qq
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qq
q
q
q
q
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qqq
q
q
q
q
qq
q
q
q
q
q
q
qq
q
q
qq
q
q
q
q
q
q
q
q
qq
q
q
q
q
q
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qq
q
q
q
q
qq
0 1 2 3 4 5
05101520
Risk Factor
Risk
q
q
q
q
q
q
q
q
qq
q
q
q
q
q
q
q
q
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qqqq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qq
q
q
qq
qq
qq
q
q
qq
q
q
q
q
q
qq
q
q
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qqq
q
q
qq
q
q
q
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qqq
q
q
q
q
q
q
q
q
q
q
q
q
q
qq
q
q
q
q
q
q
q
q
qq
qq
q
q
qq
q
q
q
q
q
q
q
q
q
q
q
qq
q
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qq
q
q
q
q
q
q
q
q
q
qq
q
q
q
q
q
q
q
q
q
q
qqq
q
q
q
q
q
q
q
q
qq
q
q
q
q
q
q
q
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qq
q
qqq
q
q
q
q
q
q
q
qq
q
q
qq
q
q
q
q
q
q
q
q
q
q
qq
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qqq
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qq
q
q
q
q
q
q
qqq
q
q
q
q
q
qq
q
q
q
q
q
q
q
q
qq
q
q
q
q
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qq
q
q
q
q
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qq
q
q
q
q
qq
q
q
q
q
q
q
q
q
q
q
qq
q
q
q
q
q
q
q
q
qq
q
q
q
q
q
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qq
0 1 2 3 4 5
−10−50510
Risk Factor
Profit&Loss
q
q
Profit&Loss
−10−50510
(1) (θi , si ) and E(Si |Xi ) (2) (θi , si − E(Si |Xi )) (3) distribution of S − E(S|X))
@freakonometrics freakonometrics freakonometrics.hypotheses.org 13 / 22
Model Interpretation & Explainability (1)
Most machine linearing algorithms are black boxes, Pasquale (2015)
“providing transparency and explanations of algorithmic outputs might serve a number
of goals, including scrutability, trust, effectiveness, persuasiveness, efficiency, and
satisfaction ” Diakopoulos (2016)
Ceteris paribus can be translated into “all other things being equal" or “holding
other factors constant"
Mutatis mutandis approximately translates as “allowing other things to change
accordingly" or “the necessary changes having been made"
Guidotti et al. (2018) for a survey on methods for explaining back boxes
explaining the model or explaining the outcome
providing a transparent design (locally or globally)
@freakonometrics freakonometrics freakonometrics.hypotheses.org 14 / 22
Model Interpretation & Explainability (2)
Various tools :
Local Interpretable Model-Agnostic Explanations (LIME) for general models,
and for regression types, partial dependence plots (PDP), Individual Conditional
Expectation (ICE), or Accumulated Local Effects (ALE)
pdp(x1) = E π(x1, X2) and pdp(x1) =
1
n
n
i=1
π(x1, xi,2)
Or define ale(x1) =
x1
−∞
E
∂π(z1, X2)
∂x1
dz1, as in Apley and Zhu (2016).
For more details, see https://christophm.github.io/interpretable-ml-book/,
or Lakkaraju et al. (2019), Samek et al. (2017), Ribeiro et al. (2016), Lipton (2016),
Lundberg and Lee (2017), Gilpin et al. (2018), etc.
@freakonometrics freakonometrics freakonometrics.hypotheses.org 15 / 22
Insurance Premium Fairness
Various concepts and definitions,
pure actuarial fairness: insurance costs for individuals should directly reflect their
level of risk
choice-sensitive equity: insurance costs for individuals should reflect only the risks
that result from individual choices (see luck-egalitarian)
equity as social justice: insurance of goods that are basic requirements of social
justice should be provided irrespective of the risks and choices of individuals
Economic and philosophical issues.... but also statistical ones.
Formally, x = (x1, x2) where only variables x2 can be used.
@freakonometrics freakonometrics freakonometrics.hypotheses.org 16 / 22
Algorithmic Fairness (for classifiers)
Feldman et al. (2015), Bonchi et al. (2017) or Corbett-Davies and Goel (2018)
Let s denote the (underlying) score of a classifier s(x) = P[Y = 1|X = x], and p the
classifier p(x) = 1[t,∞)(s(x),
anti-classification: some attributes x1 cannot be used
p(x1, x2) = p(x1, x2) whenever x1 = x1
classification parity: standard metric do not depend on x1,
e.g. for a classifier, the false positive rate should not depend on x1
P[p(X1, X2) = 1|Y = 0, X1 = x1] = P[p(X1, X2) = 1|Y = 0], ∀x1.
calibration equity:
P[Y = 1|S(X1, X2) = s, X1 = x1] = P[Y = 1|S(X1, X2) = s]
.
@freakonometrics freakonometrics freakonometrics.hypotheses.org 17 / 22
Wrap-Up & Open Challenges
See Charpentier and Denuit (2020) for a recent state-of-the-art
predictive models (econometrics or ML) are everywhere in insurance
impossible to use black boxes for ratemaking
more and more regulation towards transparency, fairness, equity
so far, only actuarial modeling, nothing about economics of insurance
(moral hazard, competition, price elasticity, etc)
how to include ex-post observations, e.g. telematics (endogeneity issues)?
how to include competition? (learning games)
to go further freakonometrics.hypotheses.org or charpentier.arthur@uqam.ca
@freakonometrics freakonometrics freakonometrics.hypotheses.org 18 / 22
References I
Apley, D. W. and Zhu, J. (2016). Visualizing the effects of predictor variables in black box supervised
learning models. arXiv:1612.08468.
Arneson, R. J. (2011). Luck egalitarianism–a primer. In Knight, C. and Stemplowska, Z., editors,
Responsibility and Distributive Justice, pages 24–50. Oxford University Press.
Bonchi, F., Hajian, S., Mishra, B., and Ramazzotti, D. (2017). Exposing the probabilistic causal
structure of discrimination. International Journal of Data Science and Analytics, 3(1):1–21.
Charpentier, A. and Denuit, M. (2020). On limits for machine learning algorithms in insurance. In
Insurance data analytics : some case studies of advanced algorithms and applications, pages
210–235. Economica.
Charpentier, A., Élie, R., and Remlinger, C. (2020). Reinforcement learning in economics and finance.
arXiv, 2003.10014.
Charpentier, A., Flachaire, E., and Ly, A. (2018). Econometrics and machine learning. Economics and
Statistics, 505-506:147–169.
Cohen, G. A. (1989). On the currency of egalitarian justice. Ethics, 99(4):906–944.
Corbett-Davies, S. and Goel, S. (2018). The measure and mismeasure of fairness: A critical review of
fair machine learning. arXiv, 1808.00023.
@freakonometrics freakonometrics freakonometrics.hypotheses.org 19 / 22
References II
de Leeuw, J., Vrieling, A., Shee, A., Atzberger, C., Hadgu, K., Biradar, C., Keah, H., and Turvey, C.
(2014). The potential and uptake of remote sensing in insurance: A review. Remote Sensing,
6(11):10888–10912.
Denuit, M., Hainault, D., and Trufin, J. (2019a). Effective Statistical Learning Methods for Actuaries I
(GLMs and Extensions). Springer Verlag.
Denuit, M., Hainault, D., and Trufin, J. (2019b). Effective Statistical Learning Methods for Actuaries
III (Neural Networks and Extensions). Springer Verlag.
Denuit, M., Hainault, D., and Trufin, J. (2020). Effective Statistical Learning Methods for Actuaries II
(Tree-based methods and Extensions). Springer Verlag.
Diakopoulos, N. (2016). Accountability in algorithmic decision making. Commununications of the
ACM, 59(2):56–62.
Domingos, P. (2012). A few useful things to know about machine learning. Communications of the
ACM, 55(10):78–87.
Feldman, M., Friedler, S. A., Moeller, J., Scheidegger, C., and Venkatasubramanian, S. (2015).
Certifying and removing disparate impact. In Proceedings of the 21th ACM SIGKDD International
Conference on Knowledge Discovery and Data Mining, KDD ’15, page 259–268, New York, NY,
USA. Association for Computing Machinery.
@freakonometrics freakonometrics freakonometrics.hypotheses.org 20 / 22
References III
Frees, E. W. J., Meyers, G., and Cummings, A. D. (2014). Insurance ratemaking and a gini index. The
Journal of Risk and Insurance, 81(2):335–366.
Gilpin, L. H., Bau, D., Yuan, B. Z., Bajwa, A., Specter, M., and Kagal, L. (2018). Explaining
explanations: An overview of interpretability of machine learning. arXiv, 1806.00069.
Guidotti, R., Monreale, A., Ruggieri, S., Turini, F., Giannotti, F., and Pedreschi, D. (2018). A survey
of methods for explaining black box models. ACM Comput. Surv., 51(5).
Jørgensen, B. (1997). The Theory of Dispersion Models. Chapman & Hall.
Lakkaraju, H., Kamar, E., Caruana, R., and Leskovec, J. (2019). Faithful and customizable
explanations of black box models. In Proceedings of the 2019 AAAI/ACM Conference on AI, Ethics,
and Society, AIES ’19, page 131–138, New York, NY, USA. Association for Computing Machinery.
Lipton, Z. C. (2016). The mythos of model interpretability. arXiv, 1606.03490.
Lundberg, S. M. and Lee, S.-I. (2017). A unified approach to interpreting model predictions. In Guyon,
I., Luxburg, U. V., Bengio, S., Wallach, H., Fergus, R., Vishwanathan, S., and Garnett, R., editors,
Advances in Neural Information Processing Systems 30, pages 4765–4774. Curran Associates, Inc.
McCullagh, P. and Nelder, J. (1989). Generalized linear models. Chapman & Hall.
O’Neill, O. (1997). Genetic information and insurance: Some ethical issues. Philosophical
Transactions: Biological Sciences, 352(1357):1087–1093.
@freakonometrics freakonometrics freakonometrics.hypotheses.org 21 / 22
References IV
Óskarsdóttir, M., Ahmed, W., Antonio, K., Bart Baesens, Rémi Dendievel, T. D., and Reynkens, T.
(2019). Social network analytics for supervised fraud detection in insurance. KUL Working Paper.
Pasquale, F. (2015). The black box society: the secret algorithms that control money and information.
Harvard University Press.
Ribeiro, M. T., Singh, S., and Guestrin, C. (2016). "why should i trust you?": Explaining the
predictions of any classifier. arXiv, 1602.04938.
Samek, W., Montavon, G., Vedaldi, A., Hansen, L. K., and Müller, K.-R. (2017). Explainable AI:
Interpreting, Explaining and Visualizing Deep Learning. Springer Verlag.
Schauer, F. (2006). Profiles, Probabilities and Stereotypes. Harvard University Press.
Tweedie, M. C. K. (1984). An index which distinguishes between some important exponential families.
Statistics: applications and new directions (Calcutta, 1981), pages 579–604.
Wüthrich, M. V. (2018). Machine learning in individual claims reserving. Scandinavian Actuarial
Journal, 2018(6):465–480.
@freakonometrics freakonometrics freakonometrics.hypotheses.org 22 / 22

Contenu connexe

Tendances (20)

Side 2019 #6
Side 2019 #6Side 2019 #6
Side 2019 #6
 
Slides econometrics-2018-graduate-2
Slides econometrics-2018-graduate-2Slides econometrics-2018-graduate-2
Slides econometrics-2018-graduate-2
 
Side 2019 #5
Side 2019 #5Side 2019 #5
Side 2019 #5
 
Slides econometrics-2018-graduate-3
Slides econometrics-2018-graduate-3Slides econometrics-2018-graduate-3
Slides econometrics-2018-graduate-3
 
Hands-On Algorithms for Predictive Modeling
Hands-On Algorithms for Predictive ModelingHands-On Algorithms for Predictive Modeling
Hands-On Algorithms for Predictive Modeling
 
Lausanne 2019 #1
Lausanne 2019 #1Lausanne 2019 #1
Lausanne 2019 #1
 
Side 2019, part 1
Side 2019, part 1Side 2019, part 1
Side 2019, part 1
 
Lausanne 2019 #4
Lausanne 2019 #4Lausanne 2019 #4
Lausanne 2019 #4
 
Side 2019 #3
Side 2019 #3Side 2019 #3
Side 2019 #3
 
Predictive Modeling in Insurance in the context of (possibly) big data
Predictive Modeling in Insurance in the context of (possibly) big dataPredictive Modeling in Insurance in the context of (possibly) big data
Predictive Modeling in Insurance in the context of (possibly) big data
 
Side 2019 #10
Side 2019 #10Side 2019 #10
Side 2019 #10
 
Actuarial Pricing Game
Actuarial Pricing GameActuarial Pricing Game
Actuarial Pricing Game
 
transformations and nonparametric inference
transformations and nonparametric inferencetransformations and nonparametric inference
transformations and nonparametric inference
 
Varese italie seminar
Varese italie seminarVarese italie seminar
Varese italie seminar
 
Side 2019 #8
Side 2019 #8Side 2019 #8
Side 2019 #8
 
Varese italie #2
Varese italie #2Varese italie #2
Varese italie #2
 
Slides econometrics-2018-graduate-1
Slides econometrics-2018-graduate-1Slides econometrics-2018-graduate-1
Slides econometrics-2018-graduate-1
 
Econ. Seminar Uqam
Econ. Seminar UqamEcon. Seminar Uqam
Econ. Seminar Uqam
 
Graduate Econometrics Course, part 4, 2017
Graduate Econometrics Course, part 4, 2017Graduate Econometrics Course, part 4, 2017
Graduate Econometrics Course, part 4, 2017
 
Slides ub-7
Slides ub-7Slides ub-7
Slides ub-7
 

Similaire à Machine Learning in Actuarial Science & Insurance

Maximum likelihood estimation of regularisation parameters in inverse problem...
Maximum likelihood estimation of regularisation parameters in inverse problem...Maximum likelihood estimation of regularisation parameters in inverse problem...
Maximum likelihood estimation of regularisation parameters in inverse problem...Valentin De Bortoli
 
Projection methods for stochastic structural dynamics
Projection methods for stochastic structural dynamicsProjection methods for stochastic structural dynamics
Projection methods for stochastic structural dynamicsUniversity of Glasgow
 
Monte Carlo Methods 2017 July Talk in Montreal
Monte Carlo Methods 2017 July Talk in MontrealMonte Carlo Methods 2017 July Talk in Montreal
Monte Carlo Methods 2017 July Talk in MontrealFred J. Hickernell
 
MCQMC 2020 talk: Importance Sampling for a Robust and Efficient Multilevel Mo...
MCQMC 2020 talk: Importance Sampling for a Robust and Efficient Multilevel Mo...MCQMC 2020 talk: Importance Sampling for a Robust and Efficient Multilevel Mo...
MCQMC 2020 talk: Importance Sampling for a Robust and Efficient Multilevel Mo...Chiheb Ben Hammouda
 
Multiple estimators for Monte Carlo approximations
Multiple estimators for Monte Carlo approximationsMultiple estimators for Monte Carlo approximations
Multiple estimators for Monte Carlo approximationsChristian Robert
 
quebec.pdf
quebec.pdfquebec.pdf
quebec.pdfykyog
 
Bayesian inference for mixed-effects models driven by SDEs and other stochast...
Bayesian inference for mixed-effects models driven by SDEs and other stochast...Bayesian inference for mixed-effects models driven by SDEs and other stochast...
Bayesian inference for mixed-effects models driven by SDEs and other stochast...Umberto Picchini
 
article_mdimagh_haddar_2012
article_mdimagh_haddar_2012article_mdimagh_haddar_2012
article_mdimagh_haddar_2012Mdimagh Ridha
 
Inference for stochastic differential equations via approximate Bayesian comp...
Inference for stochastic differential equations via approximate Bayesian comp...Inference for stochastic differential equations via approximate Bayesian comp...
Inference for stochastic differential equations via approximate Bayesian comp...Umberto Picchini
 
Spillover Dynamics for Systemic Risk Measurement Using Spatial Financial Time...
Spillover Dynamics for Systemic Risk Measurement Using Spatial Financial Time...Spillover Dynamics for Systemic Risk Measurement Using Spatial Financial Time...
Spillover Dynamics for Systemic Risk Measurement Using Spatial Financial Time...SYRTO Project
 

Similaire à Machine Learning in Actuarial Science & Insurance (20)

MUMS: Bayesian, Fiducial, and Frequentist Conference - Model Selection in the...
MUMS: Bayesian, Fiducial, and Frequentist Conference - Model Selection in the...MUMS: Bayesian, Fiducial, and Frequentist Conference - Model Selection in the...
MUMS: Bayesian, Fiducial, and Frequentist Conference - Model Selection in the...
 
Maximum likelihood estimation of regularisation parameters in inverse problem...
Maximum likelihood estimation of regularisation parameters in inverse problem...Maximum likelihood estimation of regularisation parameters in inverse problem...
Maximum likelihood estimation of regularisation parameters in inverse problem...
 
Projection methods for stochastic structural dynamics
Projection methods for stochastic structural dynamicsProjection methods for stochastic structural dynamics
Projection methods for stochastic structural dynamics
 
Slides ensae-2016-9
Slides ensae-2016-9Slides ensae-2016-9
Slides ensae-2016-9
 
Monte Carlo Methods 2017 July Talk in Montreal
Monte Carlo Methods 2017 July Talk in MontrealMonte Carlo Methods 2017 July Talk in Montreal
Monte Carlo Methods 2017 July Talk in Montreal
 
Chapter5.pdf.pdf
Chapter5.pdf.pdfChapter5.pdf.pdf
Chapter5.pdf.pdf
 
Slides ensae 9
Slides ensae 9Slides ensae 9
Slides ensae 9
 
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
 
MCQMC 2020 talk: Importance Sampling for a Robust and Efficient Multilevel Mo...
MCQMC 2020 talk: Importance Sampling for a Robust and Efficient Multilevel Mo...MCQMC 2020 talk: Importance Sampling for a Robust and Efficient Multilevel Mo...
MCQMC 2020 talk: Importance Sampling for a Robust and Efficient Multilevel Mo...
 
Multiple estimators for Monte Carlo approximations
Multiple estimators for Monte Carlo approximationsMultiple estimators for Monte Carlo approximations
Multiple estimators for Monte Carlo approximations
 
KAUST_talk_short.pdf
KAUST_talk_short.pdfKAUST_talk_short.pdf
KAUST_talk_short.pdf
 
Climate Extremes Workshop - Extreme Value Theory and the Re-assessment in th...
Climate Extremes Workshop -  Extreme Value Theory and the Re-assessment in th...Climate Extremes Workshop -  Extreme Value Theory and the Re-assessment in th...
Climate Extremes Workshop - Extreme Value Theory and the Re-assessment in th...
 
quebec.pdf
quebec.pdfquebec.pdf
quebec.pdf
 
Bayesian inference for mixed-effects models driven by SDEs and other stochast...
Bayesian inference for mixed-effects models driven by SDEs and other stochast...Bayesian inference for mixed-effects models driven by SDEs and other stochast...
Bayesian inference for mixed-effects models driven by SDEs and other stochast...
 
CLIM Fall 2017 Course: Statistics for Climate Research, Statistics of Climate...
CLIM Fall 2017 Course: Statistics for Climate Research, Statistics of Climate...CLIM Fall 2017 Course: Statistics for Climate Research, Statistics of Climate...
CLIM Fall 2017 Course: Statistics for Climate Research, Statistics of Climate...
 
article_mdimagh_haddar_2012
article_mdimagh_haddar_2012article_mdimagh_haddar_2012
article_mdimagh_haddar_2012
 
Inference for stochastic differential equations via approximate Bayesian comp...
Inference for stochastic differential equations via approximate Bayesian comp...Inference for stochastic differential equations via approximate Bayesian comp...
Inference for stochastic differential equations via approximate Bayesian comp...
 
Spillover Dynamics for Systemic Risk Measurement Using Spatial Financial Time...
Spillover Dynamics for Systemic Risk Measurement Using Spatial Financial Time...Spillover Dynamics for Systemic Risk Measurement Using Spatial Financial Time...
Spillover Dynamics for Systemic Risk Measurement Using Spatial Financial Time...
 
PhD defense talk slides
PhD  defense talk slidesPhD  defense talk slides
PhD defense talk slides
 
main
mainmain
main
 

Plus de Arthur Charpentier

Plus de Arthur Charpentier (12)

Family History and Life Insurance
Family History and Life InsuranceFamily History and Life Insurance
Family History and Life Insurance
 
ACT6100 introduction
ACT6100 introductionACT6100 introduction
ACT6100 introduction
 
Family History and Life Insurance (UConn actuarial seminar)
Family History and Life Insurance (UConn actuarial seminar)Family History and Life Insurance (UConn actuarial seminar)
Family History and Life Insurance (UConn actuarial seminar)
 
Control epidemics
Control epidemics Control epidemics
Control epidemics
 
STT5100 Automne 2020, introduction
STT5100 Automne 2020, introductionSTT5100 Automne 2020, introduction
STT5100 Automne 2020, introduction
 
Family History and Life Insurance
Family History and Life InsuranceFamily History and Life Insurance
Family History and Life Insurance
 
Optimal Control and COVID-19
Optimal Control and COVID-19Optimal Control and COVID-19
Optimal Control and COVID-19
 
Slides OICA 2020
Slides OICA 2020Slides OICA 2020
Slides OICA 2020
 
Lausanne 2019 #3
Lausanne 2019 #3Lausanne 2019 #3
Lausanne 2019 #3
 
Side 2019 #11
Side 2019 #11Side 2019 #11
Side 2019 #11
 
Pareto Models, Slides EQUINEQ
Pareto Models, Slides EQUINEQPareto Models, Slides EQUINEQ
Pareto Models, Slides EQUINEQ
 
Mutualisation et Segmentation
Mutualisation et SegmentationMutualisation et Segmentation
Mutualisation et Segmentation
 

Dernier

Mira Road Memorable Call Grls Number-9833754194-Bhayandar Speciallty Call Gir...
Mira Road Memorable Call Grls Number-9833754194-Bhayandar Speciallty Call Gir...Mira Road Memorable Call Grls Number-9833754194-Bhayandar Speciallty Call Gir...
Mira Road Memorable Call Grls Number-9833754194-Bhayandar Speciallty Call Gir...priyasharma62062
 
VIP Independent Call Girls in Bandra West 🌹 9920725232 ( Call Me ) Mumbai Esc...
VIP Independent Call Girls in Bandra West 🌹 9920725232 ( Call Me ) Mumbai Esc...VIP Independent Call Girls in Bandra West 🌹 9920725232 ( Call Me ) Mumbai Esc...
VIP Independent Call Girls in Bandra West 🌹 9920725232 ( Call Me ) Mumbai Esc...dipikadinghjn ( Why You Choose Us? ) Escorts
 
Navi Mumbai Cooperetive Housewife Call Girls-9833754194-Natural Panvel Enjoye...
Navi Mumbai Cooperetive Housewife Call Girls-9833754194-Natural Panvel Enjoye...Navi Mumbai Cooperetive Housewife Call Girls-9833754194-Natural Panvel Enjoye...
Navi Mumbai Cooperetive Housewife Call Girls-9833754194-Natural Panvel Enjoye...priyasharma62062
 
VIP Independent Call Girls in Andheri 🌹 9920725232 ( Call Me ) Mumbai Escorts...
VIP Independent Call Girls in Andheri 🌹 9920725232 ( Call Me ) Mumbai Escorts...VIP Independent Call Girls in Andheri 🌹 9920725232 ( Call Me ) Mumbai Escorts...
VIP Independent Call Girls in Andheri 🌹 9920725232 ( Call Me ) Mumbai Escorts...dipikadinghjn ( Why You Choose Us? ) Escorts
 
7 tips trading Deriv Accumulator Options
7 tips trading Deriv Accumulator Options7 tips trading Deriv Accumulator Options
7 tips trading Deriv Accumulator OptionsVince Stanzione
 
VIP Independent Call Girls in Taloja 🌹 9920725232 ( Call Me ) Mumbai Escorts ...
VIP Independent Call Girls in Taloja 🌹 9920725232 ( Call Me ) Mumbai Escorts ...VIP Independent Call Girls in Taloja 🌹 9920725232 ( Call Me ) Mumbai Escorts ...
VIP Independent Call Girls in Taloja 🌹 9920725232 ( Call Me ) Mumbai Escorts ...dipikadinghjn ( Why You Choose Us? ) Escorts
 
Vasai-Virar High Profile Model Call Girls📞9833754194-Nalasopara Satisfy Call ...
Vasai-Virar High Profile Model Call Girls📞9833754194-Nalasopara Satisfy Call ...Vasai-Virar High Profile Model Call Girls📞9833754194-Nalasopara Satisfy Call ...
Vasai-Virar High Profile Model Call Girls📞9833754194-Nalasopara Satisfy Call ...priyasharma62062
 
Mira Road Awesome 100% Independent Call Girls NUmber-9833754194-Dahisar Inter...
Mira Road Awesome 100% Independent Call Girls NUmber-9833754194-Dahisar Inter...Mira Road Awesome 100% Independent Call Girls NUmber-9833754194-Dahisar Inter...
Mira Road Awesome 100% Independent Call Girls NUmber-9833754194-Dahisar Inter...priyasharma62062
 
8377087607, Door Step Call Girls In Kalkaji (Locanto) 24/7 Available
8377087607, Door Step Call Girls In Kalkaji (Locanto) 24/7 Available8377087607, Door Step Call Girls In Kalkaji (Locanto) 24/7 Available
8377087607, Door Step Call Girls In Kalkaji (Locanto) 24/7 Availabledollysharma2066
 
Toronto dominion bank investor presentation.pdf
Toronto dominion bank investor presentation.pdfToronto dominion bank investor presentation.pdf
Toronto dominion bank investor presentation.pdfJinJiang6
 
Airport Road Best Experience Call Girls Number-📞📞9833754194 Santacruz MOst Es...
Airport Road Best Experience Call Girls Number-📞📞9833754194 Santacruz MOst Es...Airport Road Best Experience Call Girls Number-📞📞9833754194 Santacruz MOst Es...
Airport Road Best Experience Call Girls Number-📞📞9833754194 Santacruz MOst Es...priyasharma62062
 
Pension dashboards forum 1 May 2024 (1).pdf
Pension dashboards forum 1 May 2024 (1).pdfPension dashboards forum 1 May 2024 (1).pdf
Pension dashboards forum 1 May 2024 (1).pdfHenry Tapper
 
VIP Independent Call Girls in Mumbai 🌹 9920725232 ( Call Me ) Mumbai Escorts ...
VIP Independent Call Girls in Mumbai 🌹 9920725232 ( Call Me ) Mumbai Escorts ...VIP Independent Call Girls in Mumbai 🌹 9920725232 ( Call Me ) Mumbai Escorts ...
VIP Independent Call Girls in Mumbai 🌹 9920725232 ( Call Me ) Mumbai Escorts ...dipikadinghjn ( Why You Choose Us? ) Escorts
 
Webinar on E-Invoicing for Fintech Belgium
Webinar on E-Invoicing for Fintech BelgiumWebinar on E-Invoicing for Fintech Belgium
Webinar on E-Invoicing for Fintech BelgiumFinTech Belgium
 
Diva-Thane European Call Girls Number-9833754194-Diva Busty Professional Call...
Diva-Thane European Call Girls Number-9833754194-Diva Busty Professional Call...Diva-Thane European Call Girls Number-9833754194-Diva Busty Professional Call...
Diva-Thane European Call Girls Number-9833754194-Diva Busty Professional Call...priyasharma62062
 
VIP Kalyan Call Girls 🌐 9920725232 🌐 Make Your Dreams Come True With Mumbai E...
VIP Kalyan Call Girls 🌐 9920725232 🌐 Make Your Dreams Come True With Mumbai E...VIP Kalyan Call Girls 🌐 9920725232 🌐 Make Your Dreams Come True With Mumbai E...
VIP Kalyan Call Girls 🌐 9920725232 🌐 Make Your Dreams Come True With Mumbai E...roshnidevijkn ( Why You Choose Us? ) Escorts
 
Kharghar Blowjob Housewife Call Girls NUmber-9833754194-CBD Belapur Internati...
Kharghar Blowjob Housewife Call Girls NUmber-9833754194-CBD Belapur Internati...Kharghar Blowjob Housewife Call Girls NUmber-9833754194-CBD Belapur Internati...
Kharghar Blowjob Housewife Call Girls NUmber-9833754194-CBD Belapur Internati...priyasharma62062
 

Dernier (20)

Mira Road Memorable Call Grls Number-9833754194-Bhayandar Speciallty Call Gir...
Mira Road Memorable Call Grls Number-9833754194-Bhayandar Speciallty Call Gir...Mira Road Memorable Call Grls Number-9833754194-Bhayandar Speciallty Call Gir...
Mira Road Memorable Call Grls Number-9833754194-Bhayandar Speciallty Call Gir...
 
VIP Independent Call Girls in Bandra West 🌹 9920725232 ( Call Me ) Mumbai Esc...
VIP Independent Call Girls in Bandra West 🌹 9920725232 ( Call Me ) Mumbai Esc...VIP Independent Call Girls in Bandra West 🌹 9920725232 ( Call Me ) Mumbai Esc...
VIP Independent Call Girls in Bandra West 🌹 9920725232 ( Call Me ) Mumbai Esc...
 
Navi Mumbai Cooperetive Housewife Call Girls-9833754194-Natural Panvel Enjoye...
Navi Mumbai Cooperetive Housewife Call Girls-9833754194-Natural Panvel Enjoye...Navi Mumbai Cooperetive Housewife Call Girls-9833754194-Natural Panvel Enjoye...
Navi Mumbai Cooperetive Housewife Call Girls-9833754194-Natural Panvel Enjoye...
 
Call Girls in New Ashok Nagar, (delhi) call me [9953056974] escort service 24X7
Call Girls in New Ashok Nagar, (delhi) call me [9953056974] escort service 24X7Call Girls in New Ashok Nagar, (delhi) call me [9953056974] escort service 24X7
Call Girls in New Ashok Nagar, (delhi) call me [9953056974] escort service 24X7
 
From Luxury Escort Service Kamathipura : 9352852248 Make on-demand Arrangemen...
From Luxury Escort Service Kamathipura : 9352852248 Make on-demand Arrangemen...From Luxury Escort Service Kamathipura : 9352852248 Make on-demand Arrangemen...
From Luxury Escort Service Kamathipura : 9352852248 Make on-demand Arrangemen...
 
VIP Independent Call Girls in Andheri 🌹 9920725232 ( Call Me ) Mumbai Escorts...
VIP Independent Call Girls in Andheri 🌹 9920725232 ( Call Me ) Mumbai Escorts...VIP Independent Call Girls in Andheri 🌹 9920725232 ( Call Me ) Mumbai Escorts...
VIP Independent Call Girls in Andheri 🌹 9920725232 ( Call Me ) Mumbai Escorts...
 
7 tips trading Deriv Accumulator Options
7 tips trading Deriv Accumulator Options7 tips trading Deriv Accumulator Options
7 tips trading Deriv Accumulator Options
 
VIP Independent Call Girls in Taloja 🌹 9920725232 ( Call Me ) Mumbai Escorts ...
VIP Independent Call Girls in Taloja 🌹 9920725232 ( Call Me ) Mumbai Escorts ...VIP Independent Call Girls in Taloja 🌹 9920725232 ( Call Me ) Mumbai Escorts ...
VIP Independent Call Girls in Taloja 🌹 9920725232 ( Call Me ) Mumbai Escorts ...
 
Vasai-Virar High Profile Model Call Girls📞9833754194-Nalasopara Satisfy Call ...
Vasai-Virar High Profile Model Call Girls📞9833754194-Nalasopara Satisfy Call ...Vasai-Virar High Profile Model Call Girls📞9833754194-Nalasopara Satisfy Call ...
Vasai-Virar High Profile Model Call Girls📞9833754194-Nalasopara Satisfy Call ...
 
Mira Road Awesome 100% Independent Call Girls NUmber-9833754194-Dahisar Inter...
Mira Road Awesome 100% Independent Call Girls NUmber-9833754194-Dahisar Inter...Mira Road Awesome 100% Independent Call Girls NUmber-9833754194-Dahisar Inter...
Mira Road Awesome 100% Independent Call Girls NUmber-9833754194-Dahisar Inter...
 
8377087607, Door Step Call Girls In Kalkaji (Locanto) 24/7 Available
8377087607, Door Step Call Girls In Kalkaji (Locanto) 24/7 Available8377087607, Door Step Call Girls In Kalkaji (Locanto) 24/7 Available
8377087607, Door Step Call Girls In Kalkaji (Locanto) 24/7 Available
 
Toronto dominion bank investor presentation.pdf
Toronto dominion bank investor presentation.pdfToronto dominion bank investor presentation.pdf
Toronto dominion bank investor presentation.pdf
 
Airport Road Best Experience Call Girls Number-📞📞9833754194 Santacruz MOst Es...
Airport Road Best Experience Call Girls Number-📞📞9833754194 Santacruz MOst Es...Airport Road Best Experience Call Girls Number-📞📞9833754194 Santacruz MOst Es...
Airport Road Best Experience Call Girls Number-📞📞9833754194 Santacruz MOst Es...
 
Pension dashboards forum 1 May 2024 (1).pdf
Pension dashboards forum 1 May 2024 (1).pdfPension dashboards forum 1 May 2024 (1).pdf
Pension dashboards forum 1 May 2024 (1).pdf
 
VIP Independent Call Girls in Mumbai 🌹 9920725232 ( Call Me ) Mumbai Escorts ...
VIP Independent Call Girls in Mumbai 🌹 9920725232 ( Call Me ) Mumbai Escorts ...VIP Independent Call Girls in Mumbai 🌹 9920725232 ( Call Me ) Mumbai Escorts ...
VIP Independent Call Girls in Mumbai 🌹 9920725232 ( Call Me ) Mumbai Escorts ...
 
Webinar on E-Invoicing for Fintech Belgium
Webinar on E-Invoicing for Fintech BelgiumWebinar on E-Invoicing for Fintech Belgium
Webinar on E-Invoicing for Fintech Belgium
 
Diva-Thane European Call Girls Number-9833754194-Diva Busty Professional Call...
Diva-Thane European Call Girls Number-9833754194-Diva Busty Professional Call...Diva-Thane European Call Girls Number-9833754194-Diva Busty Professional Call...
Diva-Thane European Call Girls Number-9833754194-Diva Busty Professional Call...
 
(Vedika) Low Rate Call Girls in Pune Call Now 8250077686 Pune Escorts 24x7
(Vedika) Low Rate Call Girls in Pune Call Now 8250077686 Pune Escorts 24x7(Vedika) Low Rate Call Girls in Pune Call Now 8250077686 Pune Escorts 24x7
(Vedika) Low Rate Call Girls in Pune Call Now 8250077686 Pune Escorts 24x7
 
VIP Kalyan Call Girls 🌐 9920725232 🌐 Make Your Dreams Come True With Mumbai E...
VIP Kalyan Call Girls 🌐 9920725232 🌐 Make Your Dreams Come True With Mumbai E...VIP Kalyan Call Girls 🌐 9920725232 🌐 Make Your Dreams Come True With Mumbai E...
VIP Kalyan Call Girls 🌐 9920725232 🌐 Make Your Dreams Come True With Mumbai E...
 
Kharghar Blowjob Housewife Call Girls NUmber-9833754194-CBD Belapur Internati...
Kharghar Blowjob Housewife Call Girls NUmber-9833754194-CBD Belapur Internati...Kharghar Blowjob Housewife Call Girls NUmber-9833754194-CBD Belapur Internati...
Kharghar Blowjob Housewife Call Girls NUmber-9833754194-CBD Belapur Internati...
 

Machine Learning in Actuarial Science & Insurance

  • 1. Machine Learning in Actuarial Science & Insurance Arthur Charpentier Machine learning for economists and applied social scientists, 2020 @freakonometrics freakonometrics freakonometrics.hypotheses.org 1 / 22
  • 2. Arthur Charpentier Université du Québec à Montréal @freakonometrics freakonometrics freakonometrics.hypotheses.org @freakonometrics freakonometrics freakonometrics.hypotheses.org 2 / 22
  • 3. Insurance & Actuarial Science “Insurance is the contribution of the many to the misfortune of the few ” POLICYHOLDER INSURANCE COMPANY contribution (premium) (possible) losses What would be a “fair contribution ” ? see O’Neill (1997) pure actuarial fairness contributions for individual policyholders should perfectly reflect their predicted risk levels → predictive modeling choice-sensitive fairness contributions should take into account only risks that result from choices - luck-egalitarianism (Cohen (1989) or Arneson (2011)) @freakonometrics freakonometrics freakonometrics.hypotheses.org 3 / 22
  • 4. Agenda General Insurance & Predictive Modeling From Econometric Techniques to Machine Learning Goodness of Fit & Uncertainty Model Interpretation Price Discrimination & Fairness To Go Further @freakonometrics freakonometrics freakonometrics.hypotheses.org 4 / 22
  • 5. General Insurance & Predictive Modeling (1) fraud detection & network data see e.g. Óskarsdóttir et al. (2019) parametric insurance & satellite pictures see e.g. de Leeuw et al. (2014) claims reserving see e.g. Wüthrich (2018) automatic reading (medical reports) See also Denuit et al. (2019a, 2020, 2019b) @freakonometrics freakonometrics freakonometrics.hypotheses.org 5 / 22
  • 6. General Insurance & Predictive Modeling (2) premium computation, π = EP(S) POLICYHOLDER INSURANCE COMPANY premium π = EP(S) (possible) losses S = Y1 + · · · + YN or, given some features X = {X1, · · · , Xp}, premium π(x) = EPX (S) = E[S|X] Under standard assumptions, π(x) = E[S|X] = E[N|X] frequency · E[Y |X] average cost = E[1S>0|X] occurence · E[S|X, S > 0] individual cost @freakonometrics freakonometrics freakonometrics.hypotheses.org 6 / 22
  • 7. From Econometric Techniques to Machine Learning (1) Merging claims & underwriters databases (per policy), e.g. (ni , ei , xi ), i = 1, · · · , np 1 id exposure zone pwr agecar agedrv model gas dsty region nbr occ 2 1 3227 0.87 C 7 0 56 12 D 93 13 0 0 3 2 4115 0.72 D 5 0 45 22 R 54 13 0 0 4 3 5121 0.05 C 6 3 37 17 D 11 13 0 0 5 4 5142 0.90 C 10 10 42 7 D 93 13 0 0 6 5 6255 0.12 C 7 0 59 11 R 73 13 0 0 7 6 8486 0.83 C 5 0 75 7 R 42 13 2 1 Standard model, Poisson regression (possibly zero-inflated, possibly over-dispesersed, etc), related to the Poisson process Ni ∼ P(ei · exp[θi ]), where θi = α0 + α1x1,i + α2x2,i + · · · + αpxp,i = xi α or possibly GAMs (for nonlinearities, or spatial component) θi = α0 + s1(x1,i ) + α2x2,i + · · · + αpxp,i , s1(x) = j γj ϕj (x) @freakonometrics freakonometrics freakonometrics.hypotheses.org 7 / 22
  • 8. From Econometric Techniques to Machine Learning (2) or (yi xi ), i = 1, · · · , nc 1 id no cover cost zone pwr agecar agedrv model gas dsty 2 1 1870 17219 1TP 1692.29 C 5 0 52 12 R 93 3 2 1963 16336 1TP 422.05 E 9 0 78 12 R 27 4 3 4263 17089 2MT 549.21 C 10 7 27 17 D 19 5 4 5181 17801 1TP 191.15 D 5 2 26 3 D 91 6 5 6375 17485 1TP 2031.77 B 7 4 46 6 R 48 Standard model, Gamma regression (with a log link function) Yi ∼ G(exp[ϑi ], b), where ϑi = β0 + β1x1,i + β2x2,i + · · · + βpxp,i = xi β possible excluding large claims (and pooling on the entiere population) E[Y |X] = P[Y ≤ s|X] ps · E[Y |X, Y ≤ s] µs (x) + P[Y > s|X] 1−ps · E[Y |X, Y > s] y or some Tweedie regression on S (but less flexible), Tweedie (1984). @freakonometrics freakonometrics freakonometrics.hypotheses.org 8 / 22
  • 9. From Econometric Techniques to Machine Learning (3) Probabilistic interpretations : estimation using maximum likelihood - Generalized (Mixed) Linear Models (see McCullagh and Nelder (1989) or Jørgensen (1997)) Individual model: S = 1S>0 · S Collective model: S = N i=1 Yi , N ∼ P(λ), Yi ∼ G(α, β) in the exponential family, with variance function V (µ) = µk, where k ∈ [1, 2] log L = n i=1 yi µ1−k i 1 − k − µ2−k i 2 − k − (yi ,µi ) , where µi = exp[xi β] Machine learning : interpret − log L as a loss One can also include some penalty if the number of possible covariates is too large... @freakonometrics freakonometrics freakonometrics.hypotheses.org 9 / 22
  • 10. Goodness of Fit & Uncertainty (1) Consider a simple model: number of claims N ∈ {0, 1} and fixed cost, say $1, 000. Simple classification problem. Assume that N ∼ B(θ) and we have some covariate x. Two important features: the dispersion of the heterogeneity, i.e. the variance of θ e.g. assume that θ has a Beta distribution on [0, 1] the dependence between heterogeneity θ and covariate x ← low correlation high correlation → @freakonometrics freakonometrics freakonometrics.hypotheses.org 10 / 22
  • 11. Goodness of Fit & Uncertainty (1) Consider a simple model: number of claims N ∈ {0, 1} and fixed cost, say $1, 000. Simple classification problem. Assume that N ∼ B(θ) and we have some covariate x. Two important features: the dispersion of the heterogeneity, i.e. the variance of θ e.g. assume that θ has a Beta distribution on [0, 1] the dependence between heterogeneity θ and covariate x ← low correlation high correlation → @freakonometrics freakonometrics freakonometrics.hypotheses.org 11 / 22
  • 12. Goodness of Fit & Uncertainty (2) the dispersion of the heterogeneity, d = 10% or 40% the dependence between heterogeneity θ and covariate x (correlation of the underlying copula function) very difficult to reach high AUC (area under the ROC curve) More complicated for insurance premiums... Frees et al. (2014) defined a ROC-type curve, inspired by Lorenz curve: given observed losses si and premiums π(xi ), policyhold- ers ordered by premiums, π(x1) ≥ π(x2) ≥ · · · ≥ π(xn), plot {Fi , Li } with Fi = i n proportion of insured and Li = i j=1 sj n j=1 sj proportion of losses @freakonometrics freakonometrics freakonometrics.hypotheses.org 12 / 22
  • 13. Goodness of Fit & Uncertainty (3) Parallel with classical OLS models, S ∼ N(θ, σ2), with covariates X = {X1, · · · , Xp} Var S = E Var[S|X] + Var E[S|X] premium E Var[S|X] = E Var[S|Θ] perfect pricing = σ2 + E Var E[S|Θ] X misfit q q q q q q q q qq q q q q q q q q qq q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q qqqq q q q q q q q q q q q q q q qq q q q q qq qq q q qq q q q q q qq q q qq q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q qqq q q qq q q q qq q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q qqq q q q q q q q q q q q q q qq q q q q q q q q qq qq q q qq q q q q q q q q q q q qq q qq q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q qq q q q q q q q q q qq q q q q q q q q q q qqq q q q q q q q qqq q q q q q q q qq q q q q q q q q q q q q qq q q q q q q q q q q q q q q q q q q q q q q qq q qqq q q q q q q q qq q q qq q q q q q q q q q q qq qq q q q q q q q q q q q q q q q q q q q qqq qq q q q q q q q q q q q q q q q q q q q q qq q q q q q q qq q q q q q q qq q q q q q q q q qq qq q q q q q q q q q q q q q q q q q q q q q q q q q q q q qq q q q q qq q q q q q q q q q q q q q q qqq q q q q qq q q q q q q qq q q qq q q q q q q q q qq q q q q q qq q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q qq q q q q qq 0 1 2 3 4 5 05101520 Risk Factor Risk q q q q q q q q qq q q q q q q q q qq q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q qqqq q q q q q q q q q q q q q q qq q q q q qq qq q q qq q q q q q qq q q qq q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q qqq q q qq q q q qq q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q qqq q q q q q q q q q q q q q qq q q q q q q q q qq qq q q qq q q q q q q q q q q q qq q qq q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q qq q q q q q q q q q qq q q q q q q q q q q qqq q q q q q q q qqq q q q q q q q qq q q q q q q q q q q q q qq q q q q q q q q q q q q q q q q q q q q q q qq q qqq q q q q q q q qq q q qq q q q q q q q q q q qq qq q q q q q q q q q q q q q q q q q q q qqq qq q q q q q q q q q q q q q q q q q q q q qq q q q q q q qq q q q q q q qq q q q q q q q q qq qq q q q q q q q q q q q q q q q q q q q q q q q q q q q q qq q q q q qq q q q q q q q q q q q q q q qqq q q q q qq q q q q q q qq q q qq q q q q q q q q qq q q q q q qq q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q qq q q q q qq 0 1 2 3 4 5 −10−50510 Risk Factor Profit&Loss q q Profit&Loss −10−50510 q q q q q q q q qq q q q q q q q q qq q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q qqqq q q q q q q q q q q q q q q qq q q q q qq qq q q qq q q q q q qq q q qq q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q qqq q q qq q q q qq q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q qqq q q q q q q q q q q q q q qq q q q q q q q q qq qq q q qq q q q q q q q q q q q qq q qq q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q qq q q q q q q q q q qq q q q q q q q q q q qqq q q q q q q q qqq q q q q q q q qq q q q q q q q q q q q q qq q q q q q q q q q q q q q q q q q q q q q q qq q qqq q q q q q q q qq q q qq q q q q q q q q q q qq qq q q q q q q q q q q q q q q q q q q q qqq qq q q q q q q q q q q q q q q q q q q q q qq q q q q q q qq q q q q q q qq q q q q q q q q qq qq q q q q q q q q q q q q q q q q q q q q q q q q q q q q qq q q q q qq q q q q q q q q q q q q q q qqq q q q q qq q q q q q q qq q q qq q q q q q q q q qq q q q q q qq q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q qq q q q q qq 0 1 2 3 4 5 05101520 Risk Factor Risk q q q q q q q q qq q q q q q q q q qq q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q qqqq q q q q q q q q q q q q q q qq q q q q qq qq q q qq q q q q q qq q q qq q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q qqq q q qq q q q qq q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q qqq q q q q q q q q q q q q q qq q q q q q q q q qq qq q q qq q q q q q q q q q q q qq q qq q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q qq q q q q q q q q q qq q q q q q q q q q q qqq q q q q q q q qqq q q q q q q q qq q q q q q q q q q q q q qq q q q q q q q q q q q q q q q q q q q q q q qq q qqq q q q q q q q qq q q qq q q q q q q q q q q qq qq q q q q q q q q q q q q q q q q q q q qqq qq q q q q q q q q q q q q q q q q q q q q qq q q q q q q qq q q q q q q qq q q q q q q q q qq qq q q q q q q q q q q q q q q q q q q q q q q q q q q q q qq q q q q qq q q q q q q q q q q q q q q qqq q q q q qq q q q q q q qq q q qq q q q q q q q q qq q q q q q qq q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q qq q q q q qq 0 1 2 3 4 5 −10−50510 Risk Factor Profit&Loss q q Profit&Loss −10−50510 q q q q q q q q qq q q q q q q q q qq q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q qqqq q q q q q q q q q q q q q q qq q q q q qq qq q q qq q q q q q qq q q qq q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q qqq q q qq q q q qq q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q qqq q q q q q q q q q q q q q qq q q q q q q q q qq qq q q qq q q q q q q q q q q q qq q qq q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q qq q q q q q q q q q qq q q q q q q q q q q qqq q q q q q q q qqq q q q q q q q qq q q q q q q q q q q q q qq q q q q q q q q q q q q q q q q q q q q q q qq q qqq q q q q q q q qq q q qq q q q q q q q q q q qq qq q q q q q q q q q q q q q q q q q q q qqq qq q q q q q q q q q q q q q q q q q q q q qq q q q q q q qq q q q q q q qq q q q q q q q q qq qq q q q q q q q q q q q q q q q q q q q q q q q q q q q q qq q q q q qq q q q q q q q q q q q q q q qqq q q q q qq q q q q q q qq q q qq q q q q q q q q qq q q q q q qq q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q qq q q q q qq 0 1 2 3 4 5 05101520 Risk Factor Risk q q q q q q q q qq q q q q q q q q qq q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q qqqq q q q q q q q q q q q q q q qq q q qq qq qq q q qq q q q q q qq q q qq q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q qqq q q qq q q q qq q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q qqq q q q q q q q q q q q q q qq q q q q q q q q qq qq q q qq q q q q q q q q q q q qq q qq q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q qq q q q q q q q q q qq q q q q q q q q q q qqq q q q q q q q q qq q q q q q q q qq q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q qq q qqq q q q q q q q qq q q qq q q q q q q q q q q qq qq q q q q q q q q q q q q q q q q q q q qqq qq q q q q q q q q q q q q q q q q q q q q qq q q q q q q qqq q q q q q qq q q q q q q q q qq q q q q qq q q q q q q q q q q q q q q q q q q q q q q q q qq q q q q qq q q q q q q q q q q q q q q q qq q q q q qq q q q q q q q q q q qq q q q q q q q q qq q q q q q qq q q q q q q q q q q q q q q q q q qq q q q q q q q q q q q q q q q q q q q q q q q q qq 0 1 2 3 4 5 −10−50510 Risk Factor Profit&Loss q q Profit&Loss −10−50510 (1) (θi , si ) and E(Si |Xi ) (2) (θi , si − E(Si |Xi )) (3) distribution of S − E(S|X)) @freakonometrics freakonometrics freakonometrics.hypotheses.org 13 / 22
  • 14. Model Interpretation & Explainability (1) Most machine linearing algorithms are black boxes, Pasquale (2015) “providing transparency and explanations of algorithmic outputs might serve a number of goals, including scrutability, trust, effectiveness, persuasiveness, efficiency, and satisfaction ” Diakopoulos (2016) Ceteris paribus can be translated into “all other things being equal" or “holding other factors constant" Mutatis mutandis approximately translates as “allowing other things to change accordingly" or “the necessary changes having been made" Guidotti et al. (2018) for a survey on methods for explaining back boxes explaining the model or explaining the outcome providing a transparent design (locally or globally) @freakonometrics freakonometrics freakonometrics.hypotheses.org 14 / 22
  • 15. Model Interpretation & Explainability (2) Various tools : Local Interpretable Model-Agnostic Explanations (LIME) for general models, and for regression types, partial dependence plots (PDP), Individual Conditional Expectation (ICE), or Accumulated Local Effects (ALE) pdp(x1) = E π(x1, X2) and pdp(x1) = 1 n n i=1 π(x1, xi,2) Or define ale(x1) = x1 −∞ E ∂π(z1, X2) ∂x1 dz1, as in Apley and Zhu (2016). For more details, see https://christophm.github.io/interpretable-ml-book/, or Lakkaraju et al. (2019), Samek et al. (2017), Ribeiro et al. (2016), Lipton (2016), Lundberg and Lee (2017), Gilpin et al. (2018), etc. @freakonometrics freakonometrics freakonometrics.hypotheses.org 15 / 22
  • 16. Insurance Premium Fairness Various concepts and definitions, pure actuarial fairness: insurance costs for individuals should directly reflect their level of risk choice-sensitive equity: insurance costs for individuals should reflect only the risks that result from individual choices (see luck-egalitarian) equity as social justice: insurance of goods that are basic requirements of social justice should be provided irrespective of the risks and choices of individuals Economic and philosophical issues.... but also statistical ones. Formally, x = (x1, x2) where only variables x2 can be used. @freakonometrics freakonometrics freakonometrics.hypotheses.org 16 / 22
  • 17. Algorithmic Fairness (for classifiers) Feldman et al. (2015), Bonchi et al. (2017) or Corbett-Davies and Goel (2018) Let s denote the (underlying) score of a classifier s(x) = P[Y = 1|X = x], and p the classifier p(x) = 1[t,∞)(s(x), anti-classification: some attributes x1 cannot be used p(x1, x2) = p(x1, x2) whenever x1 = x1 classification parity: standard metric do not depend on x1, e.g. for a classifier, the false positive rate should not depend on x1 P[p(X1, X2) = 1|Y = 0, X1 = x1] = P[p(X1, X2) = 1|Y = 0], ∀x1. calibration equity: P[Y = 1|S(X1, X2) = s, X1 = x1] = P[Y = 1|S(X1, X2) = s] . @freakonometrics freakonometrics freakonometrics.hypotheses.org 17 / 22
  • 18. Wrap-Up & Open Challenges See Charpentier and Denuit (2020) for a recent state-of-the-art predictive models (econometrics or ML) are everywhere in insurance impossible to use black boxes for ratemaking more and more regulation towards transparency, fairness, equity so far, only actuarial modeling, nothing about economics of insurance (moral hazard, competition, price elasticity, etc) how to include ex-post observations, e.g. telematics (endogeneity issues)? how to include competition? (learning games) to go further freakonometrics.hypotheses.org or charpentier.arthur@uqam.ca @freakonometrics freakonometrics freakonometrics.hypotheses.org 18 / 22
  • 19. References I Apley, D. W. and Zhu, J. (2016). Visualizing the effects of predictor variables in black box supervised learning models. arXiv:1612.08468. Arneson, R. J. (2011). Luck egalitarianism–a primer. In Knight, C. and Stemplowska, Z., editors, Responsibility and Distributive Justice, pages 24–50. Oxford University Press. Bonchi, F., Hajian, S., Mishra, B., and Ramazzotti, D. (2017). Exposing the probabilistic causal structure of discrimination. International Journal of Data Science and Analytics, 3(1):1–21. Charpentier, A. and Denuit, M. (2020). On limits for machine learning algorithms in insurance. In Insurance data analytics : some case studies of advanced algorithms and applications, pages 210–235. Economica. Charpentier, A., Élie, R., and Remlinger, C. (2020). Reinforcement learning in economics and finance. arXiv, 2003.10014. Charpentier, A., Flachaire, E., and Ly, A. (2018). Econometrics and machine learning. Economics and Statistics, 505-506:147–169. Cohen, G. A. (1989). On the currency of egalitarian justice. Ethics, 99(4):906–944. Corbett-Davies, S. and Goel, S. (2018). The measure and mismeasure of fairness: A critical review of fair machine learning. arXiv, 1808.00023. @freakonometrics freakonometrics freakonometrics.hypotheses.org 19 / 22
  • 20. References II de Leeuw, J., Vrieling, A., Shee, A., Atzberger, C., Hadgu, K., Biradar, C., Keah, H., and Turvey, C. (2014). The potential and uptake of remote sensing in insurance: A review. Remote Sensing, 6(11):10888–10912. Denuit, M., Hainault, D., and Trufin, J. (2019a). Effective Statistical Learning Methods for Actuaries I (GLMs and Extensions). Springer Verlag. Denuit, M., Hainault, D., and Trufin, J. (2019b). Effective Statistical Learning Methods for Actuaries III (Neural Networks and Extensions). Springer Verlag. Denuit, M., Hainault, D., and Trufin, J. (2020). Effective Statistical Learning Methods for Actuaries II (Tree-based methods and Extensions). Springer Verlag. Diakopoulos, N. (2016). Accountability in algorithmic decision making. Commununications of the ACM, 59(2):56–62. Domingos, P. (2012). A few useful things to know about machine learning. Communications of the ACM, 55(10):78–87. Feldman, M., Friedler, S. A., Moeller, J., Scheidegger, C., and Venkatasubramanian, S. (2015). Certifying and removing disparate impact. In Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD ’15, page 259–268, New York, NY, USA. Association for Computing Machinery. @freakonometrics freakonometrics freakonometrics.hypotheses.org 20 / 22
  • 21. References III Frees, E. W. J., Meyers, G., and Cummings, A. D. (2014). Insurance ratemaking and a gini index. The Journal of Risk and Insurance, 81(2):335–366. Gilpin, L. H., Bau, D., Yuan, B. Z., Bajwa, A., Specter, M., and Kagal, L. (2018). Explaining explanations: An overview of interpretability of machine learning. arXiv, 1806.00069. Guidotti, R., Monreale, A., Ruggieri, S., Turini, F., Giannotti, F., and Pedreschi, D. (2018). A survey of methods for explaining black box models. ACM Comput. Surv., 51(5). Jørgensen, B. (1997). The Theory of Dispersion Models. Chapman & Hall. Lakkaraju, H., Kamar, E., Caruana, R., and Leskovec, J. (2019). Faithful and customizable explanations of black box models. In Proceedings of the 2019 AAAI/ACM Conference on AI, Ethics, and Society, AIES ’19, page 131–138, New York, NY, USA. Association for Computing Machinery. Lipton, Z. C. (2016). The mythos of model interpretability. arXiv, 1606.03490. Lundberg, S. M. and Lee, S.-I. (2017). A unified approach to interpreting model predictions. In Guyon, I., Luxburg, U. V., Bengio, S., Wallach, H., Fergus, R., Vishwanathan, S., and Garnett, R., editors, Advances in Neural Information Processing Systems 30, pages 4765–4774. Curran Associates, Inc. McCullagh, P. and Nelder, J. (1989). Generalized linear models. Chapman & Hall. O’Neill, O. (1997). Genetic information and insurance: Some ethical issues. Philosophical Transactions: Biological Sciences, 352(1357):1087–1093. @freakonometrics freakonometrics freakonometrics.hypotheses.org 21 / 22
  • 22. References IV Óskarsdóttir, M., Ahmed, W., Antonio, K., Bart Baesens, Rémi Dendievel, T. D., and Reynkens, T. (2019). Social network analytics for supervised fraud detection in insurance. KUL Working Paper. Pasquale, F. (2015). The black box society: the secret algorithms that control money and information. Harvard University Press. Ribeiro, M. T., Singh, S., and Guestrin, C. (2016). "why should i trust you?": Explaining the predictions of any classifier. arXiv, 1602.04938. Samek, W., Montavon, G., Vedaldi, A., Hansen, L. K., and Müller, K.-R. (2017). Explainable AI: Interpreting, Explaining and Visualizing Deep Learning. Springer Verlag. Schauer, F. (2006). Profiles, Probabilities and Stereotypes. Harvard University Press. Tweedie, M. C. K. (1984). An index which distinguishes between some important exponential families. Statistics: applications and new directions (Calcutta, 1981), pages 579–604. Wüthrich, M. V. (2018). Machine learning in individual claims reserving. Scandinavian Actuarial Journal, 2018(6):465–480. @freakonometrics freakonometrics freakonometrics.hypotheses.org 22 / 22