SlideShare a Scribd company logo
1 of 35
Download to read offline
MACHINE LEARNING FOR
EPIDEMIOLOGICAL MODELS
E. Meloni, A. Zugarini, A. Panizza, A. Betti, M. Gori
September 24, 2020
SAILab, University of Siena
TABLE OF CONTENTS
1. SIR Models
2. The data
3. The learning problem
4. Regularization
5. Fitting the data
1
SIR Models
WHY LEARN EPIDEMIC MODELS?
During 2020, the spread of COVID-19 infection affected the
whole world.
Some countries resorted to non-pharmaceutical interventions of
various degrees to control the spread of the virus, such as
lockdowns or imposing the use of masks.
Many countries collected vast amounts of detailed data about the
evolution of the epidemic.
Understanding the evolution of the epidemic from the data can
help us understand the efficacy of this interventions and predict
the spread of the illness if no action is taken.
2
WHY LEARN EPIDEMIC MODELS?
Deterministic epidemic models, such as SIR, usually model the
evolution through some constant parameters, such as infection
rate.
Constant parameters are ill-suited to model a dynamic
phenomenon where the spread is counter-acted by the
intervention of national authorities.
Time-varying parameters are better suited, but the parameter
space becomes quickly too big to be manually fine-tuned where
analytical solutions do not exist.
A Machine Learning approach can be used to explore the
parameter space and find interesting and useful models.
3
THE MODEL
A classical deterministic approach to model an epidemic is the
susceptible, infectious, removed (SIR) model.
A population of N individuals is divided into
• x(t) = (# of susceptible at time t)
• y(t) = (# of infectious at time t)
• z(t) = (# of removed at time t)
We assume that the population remains constant, therefore:
x(t) + y(t) + z(t) = N.
4
THE MODEL
The dynamics of the epidemics then can be described by the
following system of ODEs1:



x (t) = −βx(t)y(t)
y (t) = βx(t)y(t) − γy(t)
z (t) = γy(t)
• β > 0 → infection rate
• γ > 0 → removal rate
• R0 := β/γ → reproduction number
1
Norman TJ Bailey et al. The mathematical theory of infectious diseases and
its applications. Charles Griffin & Company Ltd, 1975.
5
SIDARTHE
S I
D
R
H
E
T
A
α, β, γ, δ
ε
η
ξ
ζ
µ
τ
ρ
λ
κ
ν
σθ
A more sophisticated epidemiological model is SIDARTHE2,
where we add more compartments to better describe the stages
of infection. (ODEs can be found in the Appendix)
2
Giulia Giordano et al. “Modelling the COVID-19 epidemic and
implementation of population-wide interventions in Italy”. In: Nature Medicine
(2020).
6
SIDARTHE
Compartments:
• S: Susceptible
• I: Asymptomatic undetected infected individuals
• D: Aymptomatic detected infected individuals
• A: Symptomatic undetected infected individuals
• R: Symptomatic detected infected individuals
• T: Acutely symptomatic infected individuals
• H: Healed individuals
• E: Deceased individuals
An important difference is that we differentiate between detected
and undetected individuals.
7
DYNAMIC PARAMETERS
The epidemiological parameters are usually considered constant in
time.
This prevents the model from describing changes due to
non-pharmaceutical interventions (lockdown, etc...).
We want to extend the model by allowing daily changes to the
parameters values.
β → β(t) τ → τ(t)
In this way we can monitor how parameters (i.e. infection rate)
change during the epidemic evolution.
8
The data
OFFICIAL DATA
We pre-processed the data3 to obtain 5 time-series which map to
5 SIDARTHE compartments:
• D: detected asymptomatic individuals
• R: detected symptomatic individuals
• T: acutely symptomatic individuals
• Hd : portion of healed individuals that were previously detected
• E: deceased individuals
We do not have access to data about the undetected infected
individuals (of course).
3
You can find the data following this link:
https://github.com/pcm-dpc/COVID-19
9
TRAIN/VALIDATION/TEST SPLIT
All time-series were split into three sets:
• Train Set (size T): 0 ≤ t ≤ T
• Validation Set (size V): T ≤ t ≤ T + V
• Test Set (size D): T + V ≤ t ≤ T + V + D
As usual, the training set will be used to fit the model, the
validation set will be used to evaluate over-fitting and the test set
will be used for the final evaluation of the model.
10
The learning problem
GENERIC STATEMENT
We call u(t) the concatenation of all parameters at time t.
u(t) = (α(t), β(t)...)
Problem
Learning u(t) from supervisions on the number of people in the
detected compartments, i.e. D, R, T, Hd and E.
11
PRECISE FORMULATION
Given the supervisions D, R, T, Hd and E
Given ˆDu, ˆRu, ˆTu, ˆHu
d and ˆEu the solutions of SIDARTHE
equations given by parameters u(t).
We want to find the set of parameters u(t) which minimizes the
MSE of the prediction against the targets on samples for
t = 0 → T.
12
GRADIENT DESCENT THROUGH ODES
The learning is done through Gradient Descent.
The SIDARTHE equations are computed through numeric
integration, using Heun method and a fixed time step.
To obtain the gradient of parameters through the integration, we
have implemented the Heun method in PyTorch to leverage its
powerful autograd framework.
Sidenote: there are strong similarities with Neural ODEs.
13
Regularization
SMOOTHNESS
In case of overfitting, the learned parameters could result in a
discontinuous and wrinkled function of time.
To avoid this problem and reduce overfitting, we augment the loss
function with a regularization term R(u).
The penalty R(u) is higher when the parameters are discontinuous
and enforces a smooth function.
14
MOMENTUM
Since the solution is computed through integration, the value of
parameters at initial time steps have a bigger impact on the
solution w.r.t. values at final time steps.
This can be seen in the gradient of each parameter: for t → T,
gradients approach 0.
Hence, we augmented the Gradient Descent update rule with a
momentum term, which manages to distribute the gradient
throughout the u function.
Important note: the commonly known momentum carries the
gradient over subsequent epochs; this momentum term carries the
gradient over subsequent time steps of the parameters u(t)
15
TARGET NORMALIZATION
D, R, T, Hd and E have very different scales of values.
E.g. max(Hd ) ≈ 200 000 and max(T) ≈ 4 000
This could cause the learning to prefer a target at the expense of
another one.
We weight the loss components for each target based on its scale
relative to the other targets.
16
Fitting the data
INFECTED INDIVIDUALS
0DU

$SU 0D -XQ -XO $XJ 6HS
GDWH





'WUDLQYDOLGDWLRQWHVW
3UHGLFWLRQWUDLQ
7DUJHWWUDLQ
3UHGLFWLRQYDO
7DUJHWYDO
3UHGLFWLRQWHVW
7DUJHWWHVW
0DU

$SU 0D -XQ -XO $XJ 6HS
GDWH







5WUDLQYDOLGDWLRQWHVW
3UHGLFWLRQWUDLQ
7DUJHWWUDLQ
3UHGLFWLRQYDO
7DUJHWYDO
3UHGLFWLRQWHVW
7DUJHWWHVW
17

More Related Content

What's hot

Chapter 09
Chapter 09Chapter 09
Chapter 09bmcfad01
 
L10 confidence intervals
L10 confidence intervalsL10 confidence intervals
L10 confidence intervalsLayal Fahad
 
Business Statistics Chapter 9
Business Statistics Chapter 9Business Statistics Chapter 9
Business Statistics Chapter 9Lux PP
 
Statistics lecture 8 (chapter 7)
Statistics lecture 8 (chapter 7)Statistics lecture 8 (chapter 7)
Statistics lecture 8 (chapter 7)jillmitchell8778
 
09 ch ken black solution
09 ch ken black solution09 ch ken black solution
09 ch ken black solutionKrunal Shah
 
Testing a Claim About a Standard Deviation or Variance
Testing a Claim About a Standard Deviation or VarianceTesting a Claim About a Standard Deviation or Variance
Testing a Claim About a Standard Deviation or VarianceLong Beach City College
 
Assessing Model Performance - Beginner's Guide
Assessing Model Performance - Beginner's GuideAssessing Model Performance - Beginner's Guide
Assessing Model Performance - Beginner's GuideMegan Verbakel
 
Statistical Estimation
Statistical Estimation Statistical Estimation
Statistical Estimation Remyagharishs
 
Quantitative Methods for Lawyers - Class #10 - Binomial Distributions, Normal...
Quantitative Methods for Lawyers - Class #10 - Binomial Distributions, Normal...Quantitative Methods for Lawyers - Class #10 - Binomial Distributions, Normal...
Quantitative Methods for Lawyers - Class #10 - Binomial Distributions, Normal...Daniel Katz
 
01 ch ken black solution
01 ch ken black solution01 ch ken black solution
01 ch ken black solutionKrunal Shah
 
Basic Statistics Presentation
Basic Statistics PresentationBasic Statistics Presentation
Basic Statistics PresentationIUBAT
 

What's hot (20)

Chapter 09
Chapter 09Chapter 09
Chapter 09
 
L10 confidence intervals
L10 confidence intervalsL10 confidence intervals
L10 confidence intervals
 
Business Statistics Chapter 9
Business Statistics Chapter 9Business Statistics Chapter 9
Business Statistics Chapter 9
 
Msb12e ppt ch06
Msb12e ppt ch06Msb12e ppt ch06
Msb12e ppt ch06
 
Chapter08
Chapter08Chapter08
Chapter08
 
Testing a Claim About a Proportion
Testing a Claim About a ProportionTesting a Claim About a Proportion
Testing a Claim About a Proportion
 
Statistics lecture 8 (chapter 7)
Statistics lecture 8 (chapter 7)Statistics lecture 8 (chapter 7)
Statistics lecture 8 (chapter 7)
 
09 ch ken black solution
09 ch ken black solution09 ch ken black solution
09 ch ken black solution
 
Testing a Claim About a Standard Deviation or Variance
Testing a Claim About a Standard Deviation or VarianceTesting a Claim About a Standard Deviation or Variance
Testing a Claim About a Standard Deviation or Variance
 
Inferences about Two Proportions
 Inferences about Two Proportions Inferences about Two Proportions
Inferences about Two Proportions
 
Chapter9
Chapter9Chapter9
Chapter9
 
Assessing Normality
Assessing NormalityAssessing Normality
Assessing Normality
 
Assessing Model Performance - Beginner's Guide
Assessing Model Performance - Beginner's GuideAssessing Model Performance - Beginner's Guide
Assessing Model Performance - Beginner's Guide
 
Chapter 8
Chapter 8Chapter 8
Chapter 8
 
Statistical Estimation
Statistical Estimation Statistical Estimation
Statistical Estimation
 
Quantitative Methods for Lawyers - Class #10 - Binomial Distributions, Normal...
Quantitative Methods for Lawyers - Class #10 - Binomial Distributions, Normal...Quantitative Methods for Lawyers - Class #10 - Binomial Distributions, Normal...
Quantitative Methods for Lawyers - Class #10 - Binomial Distributions, Normal...
 
01 ch ken black solution
01 ch ken black solution01 ch ken black solution
01 ch ken black solution
 
Basic Statistics Presentation
Basic Statistics PresentationBasic Statistics Presentation
Basic Statistics Presentation
 
Significance Tests
Significance TestsSignificance Tests
Significance Tests
 
Estimating a Population Mean
Estimating a Population MeanEstimating a Population Mean
Estimating a Population Mean
 

Similar to Machine Learning for Epidemiological Models (Enrico Meloni)

This project will investigate the SIR model and use numeric methods t.pdf
 This project will investigate the SIR model and use numeric methods t.pdf This project will investigate the SIR model and use numeric methods t.pdf
This project will investigate the SIR model and use numeric methods t.pdfjkcs20004
 
COVID-19 (Coronavirus Disease) Outbreak Prediction Using a Susceptible-Expos...
COVID-19 (Coronavirus Disease) Outbreak  Prediction Using a Susceptible-Expos...COVID-19 (Coronavirus Disease) Outbreak  Prediction Using a Susceptible-Expos...
COVID-19 (Coronavirus Disease) Outbreak Prediction Using a Susceptible-Expos...Dr. Amir Mosavi, PhD., P.Eng.
 
MATHEMATICAL MODELLING OF EPIDEMIOLOGY IN PRESENCE OF VACCINATION AND DELAY
MATHEMATICAL MODELLING OF EPIDEMIOLOGY IN PRESENCE OF VACCINATION AND DELAYMATHEMATICAL MODELLING OF EPIDEMIOLOGY IN PRESENCE OF VACCINATION AND DELAY
MATHEMATICAL MODELLING OF EPIDEMIOLOGY IN PRESENCE OF VACCINATION AND DELAYcscpconf
 
V. pacáková, d. brebera
V. pacáková, d. breberaV. pacáková, d. brebera
V. pacáková, d. breberalogyalaa
 
#2ECcourse.ppthjjkhkjhkjhjkjhkhhkhkhkhkj
#2ECcourse.ppthjjkhkjhkjhjkjhkhhkhkhkhkj#2ECcourse.ppthjjkhkjhkjhjkjhkhhkhkhkhkj
#2ECcourse.ppthjjkhkjhkjhjkjhkhhkhkhkhkjBereketRegassa1
 
#2ECcourse.pptgjdgjgfjgfjhkhkjkkjhkhkhkjhjkj
#2ECcourse.pptgjdgjgfjgfjhkhkjkkjhkhkhkjhjkj#2ECcourse.pptgjdgjgfjgfjhkhkjkkjhkhkhkjhjkj
#2ECcourse.pptgjdgjgfjgfjhkhkjkkjhkhkhkjhjkjBereketRegassa1
 
Predicting Trends of Coronavirus Disease (COVID-19) Using SIRD and Gaussian-S...
Predicting Trends of Coronavirus Disease (COVID-19) Using SIRD and Gaussian-S...Predicting Trends of Coronavirus Disease (COVID-19) Using SIRD and Gaussian-S...
Predicting Trends of Coronavirus Disease (COVID-19) Using SIRD and Gaussian-S...Dr. Amir Mosavi, PhD., P.Eng.
 
IRJET- Dengue Possibility Forecasting Model using Machine Learning Algorithms
IRJET- Dengue Possibility Forecasting Model using Machine Learning AlgorithmsIRJET- Dengue Possibility Forecasting Model using Machine Learning Algorithms
IRJET- Dengue Possibility Forecasting Model using Machine Learning AlgorithmsIRJET Journal
 
Areas In Statistics
Areas In StatisticsAreas In Statistics
Areas In Statisticsguestc94d8c
 
1608 probability and statistics in engineering
1608 probability and statistics in engineering1608 probability and statistics in engineering
1608 probability and statistics in engineeringDr Fereidoun Dejahang
 
A SEIR MODEL FOR CONTROL OF INFECTIOUS DISEASES
A SEIR MODEL FOR CONTROL OF INFECTIOUS DISEASESA SEIR MODEL FOR CONTROL OF INFECTIOUS DISEASES
A SEIR MODEL FOR CONTROL OF INFECTIOUS DISEASESSOUMYADAS835019
 
Basics of Educational Statistics (Descriptive statistics)
Basics of Educational Statistics (Descriptive statistics)Basics of Educational Statistics (Descriptive statistics)
Basics of Educational Statistics (Descriptive statistics)HennaAnsari
 
3 descritive statistics measure of central tendency variatio
3 descritive statistics measure of   central   tendency variatio3 descritive statistics measure of   central   tendency variatio
3 descritive statistics measure of central tendency variatioLama K Banna
 
Basic survival analysis
Basic survival analysisBasic survival analysis
Basic survival analysisMike LaValley
 
Forecasting COVID-19 using Polynomial Regression and Support Vector Machine
Forecasting COVID-19 using Polynomial Regression and Support Vector MachineForecasting COVID-19 using Polynomial Regression and Support Vector Machine
Forecasting COVID-19 using Polynomial Regression and Support Vector MachineIRJET Journal
 

Similar to Machine Learning for Epidemiological Models (Enrico Meloni) (20)

This project will investigate the SIR model and use numeric methods t.pdf
 This project will investigate the SIR model and use numeric methods t.pdf This project will investigate the SIR model and use numeric methods t.pdf
This project will investigate the SIR model and use numeric methods t.pdf
 
AJMS_395_22.pdf
AJMS_395_22.pdfAJMS_395_22.pdf
AJMS_395_22.pdf
 
COVID-19 (Coronavirus Disease) Outbreak Prediction Using a Susceptible-Expos...
COVID-19 (Coronavirus Disease) Outbreak  Prediction Using a Susceptible-Expos...COVID-19 (Coronavirus Disease) Outbreak  Prediction Using a Susceptible-Expos...
COVID-19 (Coronavirus Disease) Outbreak Prediction Using a Susceptible-Expos...
 
MATHEMATICAL MODELLING OF EPIDEMIOLOGY IN PRESENCE OF VACCINATION AND DELAY
MATHEMATICAL MODELLING OF EPIDEMIOLOGY IN PRESENCE OF VACCINATION AND DELAYMATHEMATICAL MODELLING OF EPIDEMIOLOGY IN PRESENCE OF VACCINATION AND DELAY
MATHEMATICAL MODELLING OF EPIDEMIOLOGY IN PRESENCE OF VACCINATION AND DELAY
 
Data science
Data scienceData science
Data science
 
V. pacáková, d. brebera
V. pacáková, d. breberaV. pacáková, d. brebera
V. pacáková, d. brebera
 
#2ECcourse.ppthjjkhkjhkjhjkjhkhhkhkhkhkj
#2ECcourse.ppthjjkhkjhkjhjkjhkhhkhkhkhkj#2ECcourse.ppthjjkhkjhkjhjkjhkhhkhkhkhkj
#2ECcourse.ppthjjkhkjhkjhjkjhkhhkhkhkhkj
 
#2ECcourse.pptgjdgjgfjgfjhkhkjkkjhkhkhkjhjkj
#2ECcourse.pptgjdgjgfjgfjhkhkjkkjhkhkhkjhjkj#2ECcourse.pptgjdgjgfjgfjhkhkjkkjhkhkhkjhjkj
#2ECcourse.pptgjdgjgfjgfjhkhkjkkjhkhkhkjhjkj
 
Predicting Trends of Coronavirus Disease (COVID-19) Using SIRD and Gaussian-S...
Predicting Trends of Coronavirus Disease (COVID-19) Using SIRD and Gaussian-S...Predicting Trends of Coronavirus Disease (COVID-19) Using SIRD and Gaussian-S...
Predicting Trends of Coronavirus Disease (COVID-19) Using SIRD and Gaussian-S...
 
Ijcatr04041015
Ijcatr04041015Ijcatr04041015
Ijcatr04041015
 
IRJET- Dengue Possibility Forecasting Model using Machine Learning Algorithms
IRJET- Dengue Possibility Forecasting Model using Machine Learning AlgorithmsIRJET- Dengue Possibility Forecasting Model using Machine Learning Algorithms
IRJET- Dengue Possibility Forecasting Model using Machine Learning Algorithms
 
Areas In Statistics
Areas In StatisticsAreas In Statistics
Areas In Statistics
 
B02308013
B02308013B02308013
B02308013
 
B02308013
B02308013B02308013
B02308013
 
1608 probability and statistics in engineering
1608 probability and statistics in engineering1608 probability and statistics in engineering
1608 probability and statistics in engineering
 
A SEIR MODEL FOR CONTROL OF INFECTIOUS DISEASES
A SEIR MODEL FOR CONTROL OF INFECTIOUS DISEASESA SEIR MODEL FOR CONTROL OF INFECTIOUS DISEASES
A SEIR MODEL FOR CONTROL OF INFECTIOUS DISEASES
 
Basics of Educational Statistics (Descriptive statistics)
Basics of Educational Statistics (Descriptive statistics)Basics of Educational Statistics (Descriptive statistics)
Basics of Educational Statistics (Descriptive statistics)
 
3 descritive statistics measure of central tendency variatio
3 descritive statistics measure of   central   tendency variatio3 descritive statistics measure of   central   tendency variatio
3 descritive statistics measure of central tendency variatio
 
Basic survival analysis
Basic survival analysisBasic survival analysis
Basic survival analysis
 
Forecasting COVID-19 using Polynomial Regression and Support Vector Machine
Forecasting COVID-19 using Polynomial Regression and Support Vector MachineForecasting COVID-19 using Polynomial Regression and Support Vector Machine
Forecasting COVID-19 using Polynomial Regression and Support Vector Machine
 

More from MeetupDataScienceRoma

Serve Davvero il Machine Learning nelle PMI? | Niccolò Annino
Serve Davvero il Machine Learning nelle PMI? | Niccolò AnninoServe Davvero il Machine Learning nelle PMI? | Niccolò Annino
Serve Davvero il Machine Learning nelle PMI? | Niccolò AnninoMeetupDataScienceRoma
 
Meta-learning through the lenses of Statistical Learning Theory (Carlo Cilibe...
Meta-learning through the lenses of Statistical Learning Theory (Carlo Cilibe...Meta-learning through the lenses of Statistical Learning Theory (Carlo Cilibe...
Meta-learning through the lenses of Statistical Learning Theory (Carlo Cilibe...MeetupDataScienceRoma
 
Claudio Gallicchio - Deep Reservoir Computing for Structured Data
Claudio Gallicchio - Deep Reservoir Computing for Structured DataClaudio Gallicchio - Deep Reservoir Computing for Structured Data
Claudio Gallicchio - Deep Reservoir Computing for Structured DataMeetupDataScienceRoma
 
Docker for Deep Learning (Andrea Panizza)
Docker for Deep Learning (Andrea Panizza)Docker for Deep Learning (Andrea Panizza)
Docker for Deep Learning (Andrea Panizza)MeetupDataScienceRoma
 
Quantum Machine Learning and QEM for Gaussian mixture models (Alessandro Luongo)
Quantum Machine Learning and QEM for Gaussian mixture models (Alessandro Luongo)Quantum Machine Learning and QEM for Gaussian mixture models (Alessandro Luongo)
Quantum Machine Learning and QEM for Gaussian mixture models (Alessandro Luongo)MeetupDataScienceRoma
 
Web Meetup #2: Modelli matematici per l'epidemiologia
Web Meetup #2: Modelli matematici per l'epidemiologiaWeb Meetup #2: Modelli matematici per l'epidemiologia
Web Meetup #2: Modelli matematici per l'epidemiologiaMeetupDataScienceRoma
 
Deep red - The environmental impact of deep learning (Paolo Caressa)
Deep red - The environmental impact of deep learning (Paolo Caressa)Deep red - The environmental impact of deep learning (Paolo Caressa)
Deep red - The environmental impact of deep learning (Paolo Caressa)MeetupDataScienceRoma
 
Paolo Galeone - Dissecting tf.function to discover auto graph strengths and s...
Paolo Galeone - Dissecting tf.function to discover auto graph strengths and s...Paolo Galeone - Dissecting tf.function to discover auto graph strengths and s...
Paolo Galeone - Dissecting tf.function to discover auto graph strengths and s...MeetupDataScienceRoma
 
Multimodal AI Approach to Provide Assistive Services (Francesco Puja)
Multimodal AI Approach to Provide Assistive Services (Francesco Puja)Multimodal AI Approach to Provide Assistive Services (Francesco Puja)
Multimodal AI Approach to Provide Assistive Services (Francesco Puja)MeetupDataScienceRoma
 
Introduzione - Meetup MLOps & Assistive AI
Introduzione - Meetup MLOps & Assistive AIIntroduzione - Meetup MLOps & Assistive AI
Introduzione - Meetup MLOps & Assistive AIMeetupDataScienceRoma
 
Zero, One, Many - Machine Learning in Produzione (Luca Palmieri)
Zero, One, Many - Machine Learning in Produzione (Luca Palmieri)Zero, One, Many - Machine Learning in Produzione (Luca Palmieri)
Zero, One, Many - Machine Learning in Produzione (Luca Palmieri)MeetupDataScienceRoma
 
Mario Incarnati - The power of data visualization
Mario Incarnati - The power of data visualizationMario Incarnati - The power of data visualization
Mario Incarnati - The power of data visualizationMeetupDataScienceRoma
 
OLIVAW: reaching superhuman strength at Othello
OLIVAW: reaching superhuman strength at OthelloOLIVAW: reaching superhuman strength at Othello
OLIVAW: reaching superhuman strength at OthelloMeetupDataScienceRoma
 
[Giovanni Galloro] How to use machine learning on Google Cloud Platform
[Giovanni Galloro] How to use machine learning on Google Cloud Platform[Giovanni Galloro] How to use machine learning on Google Cloud Platform
[Giovanni Galloro] How to use machine learning on Google Cloud PlatformMeetupDataScienceRoma
 
Bring your neural networks to the browser with TF.js - Simone Scardapane
Bring your neural networks to the browser with TF.js - Simone ScardapaneBring your neural networks to the browser with TF.js - Simone Scardapane
Bring your neural networks to the browser with TF.js - Simone ScardapaneMeetupDataScienceRoma
 
Meetup Gennaio 2019 - Slide introduttiva
Meetup Gennaio 2019 - Slide introduttivaMeetup Gennaio 2019 - Slide introduttiva
Meetup Gennaio 2019 - Slide introduttivaMeetupDataScienceRoma
 
Bruno Coletta - Data-Driven Creativity in Marketing and Advertising
Bruno Coletta - Data-Driven Creativity in Marketing and AdvertisingBruno Coletta - Data-Driven Creativity in Marketing and Advertising
Bruno Coletta - Data-Driven Creativity in Marketing and AdvertisingMeetupDataScienceRoma
 

More from MeetupDataScienceRoma (20)

Serve Davvero il Machine Learning nelle PMI? | Niccolò Annino
Serve Davvero il Machine Learning nelle PMI? | Niccolò AnninoServe Davvero il Machine Learning nelle PMI? | Niccolò Annino
Serve Davvero il Machine Learning nelle PMI? | Niccolò Annino
 
Meta-learning through the lenses of Statistical Learning Theory (Carlo Cilibe...
Meta-learning through the lenses of Statistical Learning Theory (Carlo Cilibe...Meta-learning through the lenses of Statistical Learning Theory (Carlo Cilibe...
Meta-learning through the lenses of Statistical Learning Theory (Carlo Cilibe...
 
Claudio Gallicchio - Deep Reservoir Computing for Structured Data
Claudio Gallicchio - Deep Reservoir Computing for Structured DataClaudio Gallicchio - Deep Reservoir Computing for Structured Data
Claudio Gallicchio - Deep Reservoir Computing for Structured Data
 
Docker for Deep Learning (Andrea Panizza)
Docker for Deep Learning (Andrea Panizza)Docker for Deep Learning (Andrea Panizza)
Docker for Deep Learning (Andrea Panizza)
 
Quantum Machine Learning and QEM for Gaussian mixture models (Alessandro Luongo)
Quantum Machine Learning and QEM for Gaussian mixture models (Alessandro Luongo)Quantum Machine Learning and QEM for Gaussian mixture models (Alessandro Luongo)
Quantum Machine Learning and QEM for Gaussian mixture models (Alessandro Luongo)
 
Web Meetup #2: Modelli matematici per l'epidemiologia
Web Meetup #2: Modelli matematici per l'epidemiologiaWeb Meetup #2: Modelli matematici per l'epidemiologia
Web Meetup #2: Modelli matematici per l'epidemiologia
 
Deep red - The environmental impact of deep learning (Paolo Caressa)
Deep red - The environmental impact of deep learning (Paolo Caressa)Deep red - The environmental impact of deep learning (Paolo Caressa)
Deep red - The environmental impact of deep learning (Paolo Caressa)
 
[Sponsored] C3.ai description
[Sponsored] C3.ai description[Sponsored] C3.ai description
[Sponsored] C3.ai description
 
Paolo Galeone - Dissecting tf.function to discover auto graph strengths and s...
Paolo Galeone - Dissecting tf.function to discover auto graph strengths and s...Paolo Galeone - Dissecting tf.function to discover auto graph strengths and s...
Paolo Galeone - Dissecting tf.function to discover auto graph strengths and s...
 
Multimodal AI Approach to Provide Assistive Services (Francesco Puja)
Multimodal AI Approach to Provide Assistive Services (Francesco Puja)Multimodal AI Approach to Provide Assistive Services (Francesco Puja)
Multimodal AI Approach to Provide Assistive Services (Francesco Puja)
 
Introduzione - Meetup MLOps & Assistive AI
Introduzione - Meetup MLOps & Assistive AIIntroduzione - Meetup MLOps & Assistive AI
Introduzione - Meetup MLOps & Assistive AI
 
Zero, One, Many - Machine Learning in Produzione (Luca Palmieri)
Zero, One, Many - Machine Learning in Produzione (Luca Palmieri)Zero, One, Many - Machine Learning in Produzione (Luca Palmieri)
Zero, One, Many - Machine Learning in Produzione (Luca Palmieri)
 
Mario Incarnati - The power of data visualization
Mario Incarnati - The power of data visualizationMario Incarnati - The power of data visualization
Mario Incarnati - The power of data visualization
 
Machine Learning in the AWS Cloud
Machine Learning in the AWS CloudMachine Learning in the AWS Cloud
Machine Learning in the AWS Cloud
 
OLIVAW: reaching superhuman strength at Othello
OLIVAW: reaching superhuman strength at OthelloOLIVAW: reaching superhuman strength at Othello
OLIVAW: reaching superhuman strength at Othello
 
[Giovanni Galloro] How to use machine learning on Google Cloud Platform
[Giovanni Galloro] How to use machine learning on Google Cloud Platform[Giovanni Galloro] How to use machine learning on Google Cloud Platform
[Giovanni Galloro] How to use machine learning on Google Cloud Platform
 
Bring your neural networks to the browser with TF.js - Simone Scardapane
Bring your neural networks to the browser with TF.js - Simone ScardapaneBring your neural networks to the browser with TF.js - Simone Scardapane
Bring your neural networks to the browser with TF.js - Simone Scardapane
 
Meetup Gennaio 2019 - Slide introduttiva
Meetup Gennaio 2019 - Slide introduttivaMeetup Gennaio 2019 - Slide introduttiva
Meetup Gennaio 2019 - Slide introduttiva
 
Elena Gagliardoni - Neural Chatbot
Elena Gagliardoni - Neural ChatbotElena Gagliardoni - Neural Chatbot
Elena Gagliardoni - Neural Chatbot
 
Bruno Coletta - Data-Driven Creativity in Marketing and Advertising
Bruno Coletta - Data-Driven Creativity in Marketing and AdvertisingBruno Coletta - Data-Driven Creativity in Marketing and Advertising
Bruno Coletta - Data-Driven Creativity in Marketing and Advertising
 

Recently uploaded

"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Wonjun Hwang
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostZilliz
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 

Recently uploaded (20)

"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 

Machine Learning for Epidemiological Models (Enrico Meloni)

  • 1. MACHINE LEARNING FOR EPIDEMIOLOGICAL MODELS E. Meloni, A. Zugarini, A. Panizza, A. Betti, M. Gori September 24, 2020 SAILab, University of Siena
  • 2. TABLE OF CONTENTS 1. SIR Models 2. The data 3. The learning problem 4. Regularization 5. Fitting the data 1
  • 4. WHY LEARN EPIDEMIC MODELS? During 2020, the spread of COVID-19 infection affected the whole world. Some countries resorted to non-pharmaceutical interventions of various degrees to control the spread of the virus, such as lockdowns or imposing the use of masks. Many countries collected vast amounts of detailed data about the evolution of the epidemic. Understanding the evolution of the epidemic from the data can help us understand the efficacy of this interventions and predict the spread of the illness if no action is taken. 2
  • 5. WHY LEARN EPIDEMIC MODELS? Deterministic epidemic models, such as SIR, usually model the evolution through some constant parameters, such as infection rate. Constant parameters are ill-suited to model a dynamic phenomenon where the spread is counter-acted by the intervention of national authorities. Time-varying parameters are better suited, but the parameter space becomes quickly too big to be manually fine-tuned where analytical solutions do not exist. A Machine Learning approach can be used to explore the parameter space and find interesting and useful models. 3
  • 6. THE MODEL A classical deterministic approach to model an epidemic is the susceptible, infectious, removed (SIR) model. A population of N individuals is divided into • x(t) = (# of susceptible at time t) • y(t) = (# of infectious at time t) • z(t) = (# of removed at time t) We assume that the population remains constant, therefore: x(t) + y(t) + z(t) = N. 4
  • 7. THE MODEL The dynamics of the epidemics then can be described by the following system of ODEs1:    x (t) = −βx(t)y(t) y (t) = βx(t)y(t) − γy(t) z (t) = γy(t) • β > 0 → infection rate • γ > 0 → removal rate • R0 := β/γ → reproduction number 1 Norman TJ Bailey et al. The mathematical theory of infectious diseases and its applications. Charles Griffin & Company Ltd, 1975. 5
  • 8. SIDARTHE S I D R H E T A α, β, γ, δ ε η ξ ζ µ τ ρ λ κ ν σθ A more sophisticated epidemiological model is SIDARTHE2, where we add more compartments to better describe the stages of infection. (ODEs can be found in the Appendix) 2 Giulia Giordano et al. “Modelling the COVID-19 epidemic and implementation of population-wide interventions in Italy”. In: Nature Medicine (2020). 6
  • 9. SIDARTHE Compartments: • S: Susceptible • I: Asymptomatic undetected infected individuals • D: Aymptomatic detected infected individuals • A: Symptomatic undetected infected individuals • R: Symptomatic detected infected individuals • T: Acutely symptomatic infected individuals • H: Healed individuals • E: Deceased individuals An important difference is that we differentiate between detected and undetected individuals. 7
  • 10. DYNAMIC PARAMETERS The epidemiological parameters are usually considered constant in time. This prevents the model from describing changes due to non-pharmaceutical interventions (lockdown, etc...). We want to extend the model by allowing daily changes to the parameters values. β → β(t) τ → τ(t) In this way we can monitor how parameters (i.e. infection rate) change during the epidemic evolution. 8
  • 12. OFFICIAL DATA We pre-processed the data3 to obtain 5 time-series which map to 5 SIDARTHE compartments: • D: detected asymptomatic individuals • R: detected symptomatic individuals • T: acutely symptomatic individuals • Hd : portion of healed individuals that were previously detected • E: deceased individuals We do not have access to data about the undetected infected individuals (of course). 3 You can find the data following this link: https://github.com/pcm-dpc/COVID-19 9
  • 13. TRAIN/VALIDATION/TEST SPLIT All time-series were split into three sets: • Train Set (size T): 0 ≤ t ≤ T • Validation Set (size V): T ≤ t ≤ T + V • Test Set (size D): T + V ≤ t ≤ T + V + D As usual, the training set will be used to fit the model, the validation set will be used to evaluate over-fitting and the test set will be used for the final evaluation of the model. 10
  • 15. GENERIC STATEMENT We call u(t) the concatenation of all parameters at time t. u(t) = (α(t), β(t)...) Problem Learning u(t) from supervisions on the number of people in the detected compartments, i.e. D, R, T, Hd and E. 11
  • 16. PRECISE FORMULATION Given the supervisions D, R, T, Hd and E Given ˆDu, ˆRu, ˆTu, ˆHu d and ˆEu the solutions of SIDARTHE equations given by parameters u(t). We want to find the set of parameters u(t) which minimizes the MSE of the prediction against the targets on samples for t = 0 → T. 12
  • 17. GRADIENT DESCENT THROUGH ODES The learning is done through Gradient Descent. The SIDARTHE equations are computed through numeric integration, using Heun method and a fixed time step. To obtain the gradient of parameters through the integration, we have implemented the Heun method in PyTorch to leverage its powerful autograd framework. Sidenote: there are strong similarities with Neural ODEs. 13
  • 19. SMOOTHNESS In case of overfitting, the learned parameters could result in a discontinuous and wrinkled function of time. To avoid this problem and reduce overfitting, we augment the loss function with a regularization term R(u). The penalty R(u) is higher when the parameters are discontinuous and enforces a smooth function. 14
  • 20. MOMENTUM Since the solution is computed through integration, the value of parameters at initial time steps have a bigger impact on the solution w.r.t. values at final time steps. This can be seen in the gradient of each parameter: for t → T, gradients approach 0. Hence, we augmented the Gradient Descent update rule with a momentum term, which manages to distribute the gradient throughout the u function. Important note: the commonly known momentum carries the gradient over subsequent epochs; this momentum term carries the gradient over subsequent time steps of the parameters u(t) 15
  • 21. TARGET NORMALIZATION D, R, T, Hd and E have very different scales of values. E.g. max(Hd ) ≈ 200 000 and max(T) ≈ 4 000 This could cause the learning to prefer a target at the expense of another one. We weight the loss components for each target based on its scale relative to the other targets. 16
  • 23. INFECTED INDIVIDUALS 0DU $SU 0D -XQ -XO $XJ 6HS GDWH 'WUDLQYDOLGDWLRQWHVW 3UHGLFWLRQWUDLQ
  • 29. 0DU $SU 0D -XQ -XO $XJ 6HS GDWH 5WUDLQYDOLGDWLRQWHVW 3UHGLFWLRQWUDLQ
  • 35. 17
  • 36. ACUTELY SYMPTOMATIC DECEASED 0DU $SU 0D -XQ -XO $XJ 6HS GDWH 7WUDLQYDOLGDWLRQWHVW 3UHGLFWLRQWUDLQ
  • 42. 0DU $SU 0D -XQ -XO $XJ 6HS GDWH (WUDLQYDOLGDWLRQWHVW 3UHGLFWLRQWUDLQ
  • 48. 18
  • 49. DETECTED HEALED INDIVIDUALS 0DU $SU 0D -XQ -XO $XJ 6HS GDWH +B'(7(7('WUDLQYDOLGDWLRQWHVW 3UHGLFWLRQWUDLQ
  • 55. 19
  • 56. R0 0DU $SU 0D -XQ -XO $XJ 6HS GDWH 5WUDLQYDOLGDWLRQWHVW 3UHGLFWLRQWUDLQ
  • 59. 20
  • 60. PARAMETERS 0DU $SU 0D -XQ -XO $XJ 6HS GDWH ,QIHFWLRQ5DWHV 0DU $SU 0D -XQ -XO $XJ 6HS GDWH 'HWHFWLRQ5DWHV 0DU $SU 0D -XQ -XO $XJ 6HS GDWH 'HDWK5DWHV 0DU $SU 0D -XQ -XO $XJ 6HS GDWH 6PSWRPV'HYHORSPHQW5DWHV 21
  • 61. Thank you for listening! 21
  • 62. References Bailey, Norman TJ et al. The mathematical theory of infectious diseases and its applications. Charles Griffin Company Ltd, 1975. Giordano, Giulia et al. “Modelling the COVID-19 epidemic and implementation of population-wide interventions in Italy”. In: Nature Medicine (2020). 22
  • 64. SIDARTHE EQUATIONS    ˙S(t) = −S(t) αI(t) + βD(t) + γA(t) + δR(t) ; ˙I(t) = S(t) αI(t) + βD(t) + γA(t) + δR(t) −(ε + ζ + λ)I(t); ˙D(t) = εI(t) − (η + ρ)D(t); ˙A(t) = ζI(t) − (θ + µ + κ + φ)A(t); ˙R(t) = ηD(t) + θA(t) − (ν + ξ + χ)R(t); ˙T(t) = µA(t) + νR(t) − (σ + τ)T(t); ˙H(t) = λI(t) + ρD(t) + κA(t) + ξR(t) + σT(t); ˙E(t) = φA(t) + χR(t) + τT(t)
  • 65. LOSS FUNCTION The loss function F(u) has 5 components, one for each target. Each target is weighted by the normalizing weight Wx , which takes into account the difference in scale relatively to other targets. Reminder: ˆX is the target, X u is the solution computed by the model using parameters u(t) F(u) = 1 T T t=0 WD 2 D(t) − ˆDu (t) 2 + WR 2 R(t) − ˆRu (t) 2 + WT 2 T(t)− ˆTu (t) 2 + WH 2 Hd (t)− ˆHu d (t) 2 + WE 2 E(t)− ˆEu (t) 2
  • 66. SMOOTHNESS The smoothness regularization term depends on the derivative of the parameters. It is also weighted with the hyper-parameter WR to better tune the regularizing effect of the term. R(u) = WR T T t=0 ˙u(t)2 2
  • 67. MOMENTUM The momentum term is added to the well-known update rule of Gradient Descent. uk+1 t =    uk t − tf (uk) if t = 0 uk t − tf (uk) + µt(uk+1 t−1 − uk t−1) if t 0 Where k is the epoch number, tf (uk) is the gradient of parameter u at time t and epoch k, µ(t) = sigmoid(mt) and m is a hyper-parameter.
  • 68. LINKS SAILab: https://sailab.diism.unisi.it/ Code: https://github.com/sailab-code/learning-sidarthe Data: https://github.com/pcm-dpc/COVID-19 Speaker webpage: https://enricomeloni.github.io