SlideShare une entreprise Scribd logo
1  sur  70
Télécharger pour lire hors ligne
BN learning Dynamic BN learning Relational BN learning Conclusion
Advances in Learning with Bayesian
Networks
Philippe Leray
philippe.leray@univ-nantes.fr
DUKe (Data User Knowledge) research group, LINA UMR 6241, Nantes, France
Nantes Machine Learning Meetup, July 6, 2015, Nantes, France
Philippe Leray Advances in Learning with Bayesian Networks 1/32
BN learning Dynamic BN learning Relational BN learning Conclusion
Motivations
Bayesian networks (BNs) are a powerful tool for graphical
representation of the underlying knowledge in the data and
reasoning with incomplete or imprecise observations.
BNs have been extended (or generalized) in several ways, as
for instance, causal BNs, dynamic BNs, relational BNs, ...
Grade
Letter
SAT
IntelligenceDifficulty
d1
d0
0.6 0.4
i1
i0
0.7 0.3
i0
i1
s1s0
0.95
0.2
0.05
0.8
g1
g2
g2
l1
l 0
0.1
0.4
0.99
0.9
0.6
0.01
i0,d0
i0
,d1
i0
,d0
i0
,d1
g2 g3g1
0.3
0.05
0.9
0.5
0.4
0.25
0.08
0.3
0.3
0.7
0.02
0.2
Philippe Leray Advances in Learning with Bayesian Networks 2/32
BN learning Dynamic BN learning Relational BN learning Conclusion
Motivations
Bayesian networks (BNs) are a powerful tool for graphical
representation of the underlying knowledge in the data and
reasoning with incomplete or imprecise observations.
BNs have been extended (or generalized) in several ways, as
for instance, causal BNs, dynamic BNs, relational BNs, ...
Grade
Letter
SAT
IntelligenceDifficulty
d1
d0
0.6 0.4
i1
i0
0.7 0.3
i0
i1
s1s0
0.95
0.2
0.05
0.8
g1
g2
g2
l1
l 0
0.1
0.4
0.99
0.9
0.6
0.01
i0,d0
i0
,d1
i0
,d0
i0
,d1
g2 g3g1
0.3
0.05
0.9
0.5
0.4
0.25
0.08
0.3
0.3
0.7
0.02
0.2
Philippe Leray Advances in Learning with Bayesian Networks 2/32
BN learning Dynamic BN learning Relational BN learning Conclusion
Motivations
Philippe Leray Advances in Learning with Bayesian Networks 3/32
BN learning Dynamic BN learning Relational BN learning Conclusion
Motivations
We would like to learn a BN from data... but which kind of data ?
complete
Philippe Leray Advances in Learning with Bayesian Networks 4/32
BN learning Dynamic BN learning Relational BN learning Conclusion
Motivations
We would like to learn a BN from data... but which kind of data ?
complete /incomplete [Fran¸cois et al. 06]
Philippe Leray Advances in Learning with Bayesian Networks 4/32
BN learning Dynamic BN learning Relational BN learning Conclusion
Motivations
We would like to learn a BN from data... but which kind of data ?
complete /incomplete [Fran¸cois et al. 06]
high n,
Philippe Leray Advances in Learning with Bayesian Networks 4/32
BN learning Dynamic BN learning Relational BN learning Conclusion
Motivations
We would like to learn a BN from data... but which kind of data ?
complete /incomplete [Fran¸cois et al. 06]
high n, n >> p [Ammar & Leray, 11]
Philippe Leray Advances in Learning with Bayesian Networks 4/32
BN learning Dynamic BN learning Relational BN learning Conclusion
Motivations
We would like to learn a BN from data... but which kind of data ?
complete /incomplete [Fran¸cois et al. 06]
high n, n >> p [Ammar & Leray, 11]
stream [Yasin and Leray, 13]
Philippe Leray Advances in Learning with Bayesian Networks 4/32
BN learning Dynamic BN learning Relational BN learning Conclusion
Motivations
We would like to learn a BN from data... but which kind of data ?
complete /incomplete [Fran¸cois et al. 06]
high n, n >> p [Ammar & Leray, 11]
stream [Yasin and Leray, 13]
+ prior knowledge / ontology [Ben Messaoud et al., 13]
Philippe Leray Advances in Learning with Bayesian Networks 4/32
BN learning Dynamic BN learning Relational BN learning Conclusion
Motivations
We would like to learn a BN from data... but which kind of data ?
complete /incomplete [Fran¸cois et al. 06]
high n, n >> p [Ammar & Leray, 11]
stream [Yasin and Leray, 13]
+ prior knowledge / ontology [Ben Messaoud et al., 13]
structured data [Ben Ishak, Coutant, Chulyadyo et al.]
Philippe Leray Advances in Learning with Bayesian Networks 4/32
BN learning Dynamic BN learning Relational BN learning Conclusion
Motivations
We would like to learn a BN from data... but which kind of data ?
complete /incomplete [Fran¸cois et al. 06]
high n, n >> p [Ammar & Leray, 11]
stream [Yasin and Leray, 13]
+ prior knowledge / ontology [Ben Messaoud et al., 13]
structured data [Ben Ishak, Coutant, Chulyadyo et al.]
not so structured data [Elabri et al.]
Philippe Leray Advances in Learning with Bayesian Networks 4/32
BN learning Dynamic BN learning Relational BN learning Conclusion
Motivations
Even the learning task can differ : generative
modeling P(X, Y )
no target variable
more general model
better behavior with
incomplete data
Objectives of this talk
how to learn BNs in such various contexts ?
state of the art : founding algorithms and recent ones
pointing out our contributions in this field
Philippe Leray Advances in Learning with Bayesian Networks 5/32
BN learning Dynamic BN learning Relational BN learning Conclusion
Motivations
Even the learning task can differ : generative vs. discriminative
modeling P(X, Y )
no target variable
more general model
better behavior with
incomplete data
modeling P(Y |X)
one target variable Y
dedicated model
Objectives of this talk
how to learn BNs in such various contexts ?
state of the art : founding algorithms and recent ones
pointing out our contributions in this field
Philippe Leray Advances in Learning with Bayesian Networks 5/32
BN learning Dynamic BN learning Relational BN learning Conclusion
Motivations
Even the learning task can differ : generative vs. discriminative
modeling P(X, Y )
no target variable
more general model
better behavior with
incomplete data
modeling P(Y |X)
one target variable Y
dedicated model
Objectives of this talk
how to learn BNs in such various contexts ?
state of the art : founding algorithms and recent ones
pointing out our contributions in this field
Philippe Leray Advances in Learning with Bayesian Networks 5/32
BN learning Dynamic BN learning Relational BN learning Conclusion
Outline ...
1 BN learning
Definition
Parameter learning
Structure learning
2 Dynamic BN learning
Definition
Learning
3 Relational BN learning
Definition
Learning
Graph DB ?
4 Conclusion
Last words
References
Philippe Leray Advances in Learning with Bayesian Networks 6/32
BN learning Dynamic BN learning Relational BN learning Conclusion
Bayesian network [Pearl, 1985]
Definition
G qualitative description of
conditional dependences /
independences between
variables
directed acyclic graph (DAG)
Θ quantitative description of
these dependences
conditional probability
distributions (CPDs)
Grade
Letter
SAT
IntelligenceDifficulty
d1
d0
0.6 0.4
i1
i0
0.7 0.3
i0
i1
s1s0
0.95
0.2
0.05
0.8
g1
g2
g2
l1
l 0
0.1
0.4
0.99
0.9
0.6
0.01
i0
,d0
i0,d1
i0
,d0
i0
,d1
g2 g3g1
0.3
0.05
0.9
0.5
0.4
0.25
0.08
0.3
0.3
0.7
0.02
0.2
Main property
the global model is decomposed into a set of local conditional
models
Philippe Leray Advances in Learning with Bayesian Networks 7/32
BN learning Dynamic BN learning Relational BN learning Conclusion
Bayesian network [Pearl, 1985]
Definition
G qualitative description of
conditional dependences /
independences between
variables
directed acyclic graph (DAG)
Θ quantitative description of
these dependences
conditional probability
distributions (CPDs)
Grade
Letter
SAT
IntelligenceDifficulty
d1
d0
0.6 0.4
i1
i0
0.7 0.3
i0
i1
s1s0
0.95
0.2
0.05
0.8
g1
g2
g2
l1
l 0
0.1
0.4
0.99
0.9
0.6
0.01
i0
,d0
i0,d1
i0
,d0
i0
,d1
g2 g3g1
0.3
0.05
0.9
0.5
0.4
0.25
0.08
0.3
0.3
0.7
0.02
0.2
Main property
the global model is decomposed into a set of local conditional
models
Philippe Leray Advances in Learning with Bayesian Networks 7/32
BN learning Dynamic BN learning Relational BN learning Conclusion
One model... but two learning tasks
BN = graph G and set of CPDs Θ
parameter learning / G given
structure learning
Grade
Letter
SAT
IntelligenceDifficulty
d1
d0
0.6 0.4
i1
i0
0.7 0.3
i0
i1
s1s0
0.95
0.2
0.05
0.8
g1
g2
g2
l1
l 0
0.1
0.4
0.99
0.9
0.6
0.01
i0
,d0
i0,d1
i0
,d0
i0
,d1
g2 g3g1
0.3
0.05
0.9
0.5
0.4
0.25
0.08
0.3
0.3
0.7
0.02
0.2
Philippe Leray Advances in Learning with Bayesian Networks 8/32
BN learning Dynamic BN learning Relational BN learning Conclusion
One model... but two learning tasks
BN = graph G and set of CPDs Θ
parameter learning / G given
structure learning
Grade
Letter
SAT
IntelligenceDifficulty
d1
d0 i1
i0
i0
i1
s1s0
g1
g2
g2
l1
l 0
i0
,d0
i0,d1
i0
,d0
i0
,d1
g2 g3g1
? ? ? ?
? ?
? ?
? ?
? ?
? ?
? ?
? ?
? ?
?
?
?
? ??
Philippe Leray Advances in Learning with Bayesian Networks 8/32
BN learning Dynamic BN learning Relational BN learning Conclusion
One model... but two learning tasks
BN = graph G and set of CPDs Θ
parameter learning / G given
structure learning
Grade
Letter
SAT
IntelligenceDifficulty
d
?
?
?
?
?
Philippe Leray Advances in Learning with Bayesian Networks 8/32
BN learning Dynamic BN learning Relational BN learning Conclusion
Parameter learning (generative)
Complete data D
max. of likelihood (ML) : ˆθMV = argmax P(D|θ)
closed-form solution :
ˆP(Xi = xk|Pa(Xi ) = xj ) = ˆθMV
i,j,k =
Ni,j,k
k Ni,j,k
Ni,j,k = nb of occurrences of {Xi = xk and Pa(Xi ) = xj }
Other approaches P(θ) ∼ Dirichlet(α)
max. a posteriori (MAP) : ˆθMAP = argmax P(θ|D)
expectation a posteriori (EAP) : ˆθEAP = E(P(θ|D))
ˆθMAP
i,j,k =
Ni,j,k +αi,j,k −1
k (Ni,j,k +αi,j,k −1)
ˆθEAP
i,j,k =
Ni,j,k +αi,j,k
k (Ni,j,k +αi,j,k )
Philippe Leray Advances in Learning with Bayesian Networks 9/32
BN learning Dynamic BN learning Relational BN learning Conclusion
Parameter learning (generative)
Complete data D
max. of likelihood (ML) : ˆθMV = argmax P(D|θ)
closed-form solution :
ˆP(Xi = xk|Pa(Xi ) = xj ) = ˆθMV
i,j,k =
Ni,j,k
k Ni,j,k
Ni,j,k = nb of occurrences of {Xi = xk and Pa(Xi ) = xj }
Other approaches P(θ) ∼ Dirichlet(α)
max. a posteriori (MAP) : ˆθMAP = argmax P(θ|D)
expectation a posteriori (EAP) : ˆθEAP = E(P(θ|D))
ˆθMAP
i,j,k =
Ni,j,k +αi,j,k −1
k (Ni,j,k +αi,j,k −1)
ˆθEAP
i,j,k =
Ni,j,k +αi,j,k
k (Ni,j,k +αi,j,k )
Philippe Leray Advances in Learning with Bayesian Networks 9/32
BN learning Dynamic BN learning Relational BN learning Conclusion
Parameter learning (generative)
Incomplete data
no closed-form solution
EM (iterative) algorithm [Dempster, 77],
convergence to a local optimum
Incremental data
advantages of sufficient statistics
θi,j,k =
Nold θold
i,j,k + Ni,j,k
Nold + N
this Bayesian updating can include a forgetting factor
Philippe Leray Advances in Learning with Bayesian Networks 10/32
BN learning Dynamic BN learning Relational BN learning Conclusion
Parameter learning (generative)
Incomplete data
no closed-form solution
EM (iterative) algorithm [Dempster, 77],
convergence to a local optimum
Incremental data
advantages of sufficient statistics
θi,j,k =
Nold θold
i,j,k + Ni,j,k
Nold + N
this Bayesian updating can include a forgetting factor
Philippe Leray Advances in Learning with Bayesian Networks 10/32
BN learning Dynamic BN learning Relational BN learning Conclusion
Parameter learning (discriminative)
Complete data
no closed-form
iterative algorithms such as gradient descent
Incomplete data
no closed-form
iterative algorithms + EM :-(
Philippe Leray Advances in Learning with Bayesian Networks 11/32
BN learning Dynamic BN learning Relational BN learning Conclusion
Parameter learning (discriminative)
Complete data
no closed-form
iterative algorithms such as gradient descent
Incomplete data
no closed-form
iterative algorithms + EM :-(
Philippe Leray Advances in Learning with Bayesian Networks 11/32
BN learning Dynamic BN learning Relational BN learning Conclusion
BN structure learning is a complex task
Size of the ”solution” space
the number of possible DAGs with n variables is
super-exponential w.r.t n [Robinson, 77]
NS(5) = 29281 NS(10) = 4.2 × 1018
an exhaustive search is impossible for realistic n !
One thousand millenniums = 3.2 × 1013
seconds
Identifiability
data can only help finding (conditional) dependences /
independences
Markov Equivalence : several graphs describe the same
dependence statements
causal Sufficiency : do we know all the explaining variables ?
Philippe Leray Advances in Learning with Bayesian Networks 12/32
BN learning Dynamic BN learning Relational BN learning Conclusion
BN structure learning is a complex task
Size of the ”solution” space
the number of possible DAGs with n variables is
super-exponential w.r.t n [Robinson, 77]
NS(5) = 29281 NS(10) = 4.2 × 1018
an exhaustive search is impossible for realistic n !
One thousand millenniums = 3.2 × 1013
seconds
Identifiability
data can only help finding (conditional) dependences /
independences
Markov Equivalence : several graphs describe the same
dependence statements
causal Sufficiency : do we know all the explaining variables ?
Philippe Leray Advances in Learning with Bayesian Networks 12/32
BN learning Dynamic BN learning Relational BN learning Conclusion
Structure learning (generative / complete)
Constraint-based methods
BN = independence model
⇒ find CI in data in order to build the DAG
ex : IC [Pearl & Verma, 91], PC [Spirtes et al., 93]
problem : reliability of CI statistical tests (ok for n < 100)
Score-based methods
BN = probabilistic model that must fit data as well as possible
problem : size of search space (ok for n < 1000)
Hybrid/ local search methods
local search / neighbor identification (statistical tests)
global (score) optimization
usually for scalability reasons (ok for high n)
Philippe Leray Advances in Learning with Bayesian Networks 13/32
BN learning Dynamic BN learning Relational BN learning Conclusion
Structure learning (generative / complete)
Constraint-based methods
BN = independence model
problem : reliability of CI statistical tests (ok for n < 100)
Score-based methods
BN = probabilistic model that must fit data as well as possible
⇒ search the DAG space in order to maximize a scoring
function
ex : Maximum Weighted Spanning Tree [Chow & Liu, 68],
Greedy Search [Chickering, 95], evolutionary approaches
[Larranaga et al., 96] [Wang & Yang, 10]
problem : size of search space (ok for n < 1000)
Hybrid/ local search methods
local search / neighbor identification (statistical tests)Philippe Leray Advances in Learning with Bayesian Networks 13/32
BN learning Dynamic BN learning Relational BN learning Conclusion
Structure learning (generative / complete)
Constraint-based methods
BN = independence model
problem : reliability of CI statistical tests (ok for n < 100)
Score-based methods
BN = probabilistic model that must fit data as well as possible
problem : size of search space (ok for n < 1000)
Hybrid/ local search methods
local search / neighbor identification (statistical tests)
global (score) optimization
usually for scalability reasons (ok for high n)
ex : MMHC algorithm [Tsamardinos et al., 06]
Philippe Leray Advances in Learning with Bayesian Networks 13/32
BN learning Dynamic BN learning Relational BN learning Conclusion
Structure learning (discriminative)
Specific structures
naive Bayes, augmented naive Bayes
multi-nets
...
X1
X2
X3
C
X4
X5
X1
X2
X3
C
X4
X5
Structure learning
usually, the structure is learned in a generative way
the parameters are then tuned in a discriminative way
Philippe Leray Advances in Learning with Bayesian Networks 14/32
BN learning Dynamic BN learning Relational BN learning Conclusion
Structure learning (discriminative)
Specific structures
naive Bayes, augmented naive Bayes
multi-nets
...
X1
X2
X3
C
X4
X5
X1
X2
X3
C
X4
X5
Structure learning
usually, the structure is learned in a generative way
the parameters are then tuned in a discriminative way
Philippe Leray Advances in Learning with Bayesian Networks 14/32
BN learning Dynamic BN learning Relational BN learning Conclusion
Structure learning
Incomplete data
hybridization of previous
structure learning methods
and EM
ex : Structural EM
[Friedman, 97]
Greedy Search + EM
problem : convergence
Grade
Letter
SAT
IntelligenceDifficulty
d
?
?
?
?
?
Philippe Leray Advances in Learning with Bayesian Networks 15/32
BN learning Dynamic BN learning Relational BN learning Conclusion
Structure learning
n >> p
robustness and complexity
issues
application of Perturb &
Combine principle
ex : mixture of randomly
perturbed trees
[Ammar & Leray, 11]
Philippe Leray Advances in Learning with Bayesian Networks 16/32
BN learning Dynamic BN learning Relational BN learning Conclusion
Structure learning
Incremental learning and data
streams
Bayesian updating is easy for
parameters
Bayesian updating is complex
for structure learning
and other constraints related
to data streams (limited
storage, ...)
ex : incremental MMHC
[Yasin and Leray, 13]
Philippe Leray Advances in Learning with Bayesian Networks 17/32
BN learning Dynamic BN learning Relational BN learning Conclusion
Structure learning
Integration of prior knowledge
in order to reduce search
space : white list, black list,
node ordering [Campos &
Castellano, 07]
interaction with ontologies
[Ben Messaoud et al., 13]
Philippe Leray Advances in Learning with Bayesian Networks 18/32
BN learning Dynamic BN learning Relational BN learning Conclusion
Outline ...
1 BN learning
Definition
Parameter learning
Structure learning
2 Dynamic BN learning
Definition
Learning
3 Relational BN learning
Definition
Learning
Graph DB ?
4 Conclusion
Last words
References
Philippe Leray Advances in Learning with Bayesian Networks 19/32
BN learning Dynamic BN learning Relational BN learning Conclusion
Dynamic Bayesian networks (DBNs)
k slices temporal BN (k-TBN)
[Murphy, 02]
k − 1 Markov order
prior graph G0 + transition
graph G
for example : 2-TBNs model
[Dean & Kanazawa, 89]
Simplified k-TBN
k-TBN with only temporal
edges [Dojer, 06][Vinh et al, 12]
Philippe Leray Advances in Learning with Bayesian Networks 20/32
BN learning Dynamic BN learning Relational BN learning Conclusion
Dynamic Bayesian networks (DBNs)
k slices temporal BN (k-TBN)
[Murphy, 02]
k − 1 Markov order
prior graph G0 + transition
graph G
for example : 2-TBNs model
[Dean & Kanazawa, 89]
Simplified k-TBN
k-TBN with only temporal
edges [Dojer, 06][Vinh et al, 12]
Philippe Leray Advances in Learning with Bayesian Networks 20/32
BN learning Dynamic BN learning Relational BN learning Conclusion
DBN structure learning (generative)
Score-based methods
dynamic Greedy Search [Friedman et al., 98], genetic
algorithm [Gao et al., 07], dynamic Simulated Annealing
[Hartemink, 05], ...
for k-TBN (G0 and G learning)
but not scalable (high n)
Hybrid methods
[Dojer, 06] [Vinh et al., 12] for simplified k-TBN, but often
limited to k = 2 for scalability
dynamic MMHC for ”unsimplified” 2-TBNs with high n
[Trabelsi et al., 13]
Philippe Leray Advances in Learning with Bayesian Networks 21/32
BN learning Dynamic BN learning Relational BN learning Conclusion
DBN structure learning (generative)
Score-based methods
dynamic Greedy Search [Friedman et al., 98], genetic
algorithm [Gao et al., 07], dynamic Simulated Annealing
[Hartemink, 05], ...
for k-TBN (G0 and G learning)
but not scalable (high n)
Hybrid methods
[Dojer, 06] [Vinh et al., 12] for simplified k-TBN, but often
limited to k = 2 for scalability
dynamic MMHC for ”unsimplified” 2-TBNs with high n
[Trabelsi et al., 13]
Philippe Leray Advances in Learning with Bayesian Networks 21/32
BN learning Dynamic BN learning Relational BN learning Conclusion
Outline ...
1 BN learning
Definition
Parameter learning
Structure learning
2 Dynamic BN learning
Definition
Learning
3 Relational BN learning
Definition
Learning
Graph DB ?
4 Conclusion
Last words
References
Philippe Leray Advances in Learning with Bayesian Networks 22/32
BN learning Dynamic BN learning Relational BN learning Conclusion
Relational schema
Movie
User
Vote
Movie
User
Rating
Gender
Age
Occupation
RealiseDate
Genre
A relational schema R
classes + relational variables
reference slots (e.g.,
Vote.Movie, Vote.User)
slot chain = a sequence of
reference slots
allow to walk in the relational
schema to create new variables
ex: Vote.User.User−1
.Movie: all
the movies voted by a particular
user
Philippe Leray Advances in Learning with Bayesian Networks 23/32
BN learning Dynamic BN learning Relational BN learning Conclusion
Relational schema
Movie
User
Vote
Movie
User
Rating
Gender
Age
Occupation
RealiseDate
Genre
A relational schema R
classes + relational variables
reference slots (e.g.,
Vote.Movie, Vote.User)
slot chain = a sequence of
reference slots
allow to walk in the relational
schema to create new variables
ex: Vote.User.User−1
.Movie: all
the movies voted by a particular
user
Philippe Leray Advances in Learning with Bayesian Networks 23/32
BN learning Dynamic BN learning Relational BN learning Conclusion
Relational schema
Movie
User
Vote
Movie
User
Rating
Gender
Age
Occupation
RealiseDate
Genre
A relational schema R
classes + relational variables
reference slots (e.g.,
Vote.Movie, Vote.User)
slot chain = a sequence of
reference slots
allow to walk in the relational
schema to create new variables
ex: Vote.User.User−1
.Movie: all
the movies voted by a particular
user
Philippe Leray Advances in Learning with Bayesian Networks 23/32
BN learning Dynamic BN learning Relational BN learning Conclusion
Relational schema
Movie
User
Vote
Movie
User
Rating
Gender
Age
Occupation
RealiseDate
Genre
A relational schema R
classes + relational variables
reference slots (e.g.,
Vote.Movie, Vote.User)
slot chain = a sequence of
reference slots
allow to walk in the relational
schema to create new variables
ex: Vote.User.User−1
.Movie: all
the movies voted by a particular
user
Philippe Leray Advances in Learning with Bayesian Networks 23/32
BN learning Dynamic BN learning Relational BN learning Conclusion
Probabilistic Relational Models
[Koller & Pfeffer, 98]
Definition
A PRM Π associated to R:
a qualitative dependency
structure S (with possible
long slot chains and
aggregation functions)
a set of parameters θS
Vote
Rating
Movie User
RealiseDate
Genre
AgeGender
Occupation
0.60.4
FM
User.Gender
0.40.6Comedy, F
0.50.5Comedy, M
0.10.9Horror, F
0.80.2Horror, M
0.70.3Drama, F
0.50.5Drama, M
HighLow
Votes.RatingMovie.Genre
User.Gender
Philippe Leray Advances in Learning with Bayesian Networks 24/32
BN learning Dynamic BN learning Relational BN learning Conclusion
Probabilistic Relational Models
Definition
Vote
Rating
Movie User
RealiseDate
Genre
AgeGender
Occupation
0.60.4
FM
User.Gender
0.40.6Comedy, F
0.50.5Comedy, M
0.10.9Horror, F
0.80.2Horror, M
0.70.3Drama, F
0.50.5Drama, M
HighLow
Votes.RatingMovie.Genre
User.Gender
Aggregators
Vote.User.User−1.Movie.genre → Vote.rating
movie rating from one user can be dependent with the genre
of all the movies voted by this user
how to describe the dependency with an unknown number of
parents ?
solution : using an aggregated value, e.g. γ = MODE
Philippe Leray Advances in Learning with Bayesian Networks 25/32
BN learning Dynamic BN learning Relational BN learning Conclusion
DAPER
Another probabilistic relational model
[Heckerman & Meek, 04]
Definition
Probabilistic model associated to
an Entity-Relationship model
Philippe Leray Advances in Learning with Bayesian Networks 26/32
BN learning Dynamic BN learning Relational BN learning Conclusion
Learning from a relational datatase
GBN
PRM/DAPER
learning = finding the
probabilistic
dependencies and the
probability tables
from an instantiated
database
the relational
schema/ER model is
given
Age
Rating
Age
Gender
Occupation
Age
Gender
Occupation
Gender
Occupation
Genre
RealiseDate
Genre
Genre
Genre
Genre
U1
U2
U3
M1
M2
M3
M4
M5
#U1, #M1
Rating
#U1, #M2
Rating
#U2, #M1
Rating
#U2, #M3
Rating
#U2, #M4
Rating
#U3, #M1
Rating
#U3, #M2
Rating
#U3, #M3
Rating
#U3, #M5
RealiseDate
RealiseDate
RealiseDate
RealiseDate
Philippe Leray Advances in Learning with Bayesian Networks 27/32
BN learning Dynamic BN learning Relational BN learning Conclusion
PRM/DAPER structure learning
Relational variables
finding new variables by exploring the relational schema
ex: student.reg.grade, registration.course.reg.grade,
registration.student reg.course.reg.grade, ...
⇒ adding another dimension in the search space
⇒ limitation to a given maximal slot chain length
Constraint-based methods
relational PC [Maier et al., 10] relational CD [Maier et al., 13]
don’t deal with aggregation functions
Score-based methods
Greedy search [Getoor et al., 07]
Philippe Leray Advances in Learning with Bayesian Networks 28/32
BN learning Dynamic BN learning Relational BN learning Conclusion
PRM/DAPER structure learning
Relational variables
⇒ adding another dimension in the search space
⇒ limitation to a given maximal slot chain length
Constraint-based methods
relational PC [Maier et al., 10] relational CD [Maier et al., 13]
don’t deal with aggregation functions
Score-based methods
Greedy search [Getoor et al., 07]
Hybrid methods
relational MMHC [Ben Ishak et al., 15]
Philippe Leray Advances in Learning with Bayesian Networks 28/32
BN learning Dynamic BN learning Relational BN learning Conclusion
PRM/DAPER structure learning
Relational variables
⇒ adding another dimension in the search space
⇒ limitation to a given maximal slot chain length
Constraint-based methods
relational PC [Maier et al., 10] relational CD [Maier et al., 13]
don’t deal with aggregation functions
Score-based methods
Greedy search [Getoor et al., 07]
Hybrid methods
relational MMHC [Ben Ishak et al., 15]
Philippe Leray Advances in Learning with Bayesian Networks 28/32
BN learning Dynamic BN learning Relational BN learning Conclusion
PRM/DAPER structure learning
Relational variables
⇒ adding another dimension in the search space
⇒ limitation to a given maximal slot chain length
Constraint-based methods
relational PC [Maier et al., 10] relational CD [Maier et al., 13]
don’t deal with aggregation functions
Score-based methods
Greedy search [Getoor et al., 07]
Hybrid methods
relational MMHC [Ben Ishak et al., 15]
Philippe Leray Advances in Learning with Bayesian Networks 28/32
BN learning Dynamic BN learning Relational BN learning Conclusion
Graph database
Definition
Data is organized as a
graph, with ”labelled”
nodes and
relationships
Attributes can be
associated to both.
Seems nice for ER
model but ...
Schema-free
Elabri et al., in progress
Philippe Leray Advances in Learning with Bayesian Networks 29/32
BN learning Dynamic BN learning Relational BN learning Conclusion
Graph database
Definition
Data is organized as a
graph, with ”labelled”
nodes and
relationships
Attributes can be
associated to both.
Seems nice for ER
model but ...
Schema-free
Elabri et al., in progress
Philippe Leray Advances in Learning with Bayesian Networks 29/32
BN learning Dynamic BN learning Relational BN learning Conclusion
Graph database
Definition
Data is organized as a
graph, with ”labelled”
nodes and
relationships
Attributes can be
associated to both.
Seems nice for ER
model but ...
Schema-free
Elabri et al., in progress
Philippe Leray Advances in Learning with Bayesian Networks 29/32
BN learning Dynamic BN learning Relational BN learning Conclusion
Graph database
Definition
Schema-free
Only data, no
”relational schema”
No warranty that the
data has been
”stored” by following
some meta/ER
model.
Elabri et al., in progress
Philippe Leray Advances in Learning with Bayesian Networks 29/32
BN learning Dynamic BN learning Relational BN learning Conclusion
Graph database
Definition
Schema-free
Only data, no
”relational schema”
No warranty that the
data has been
”stored” by following
some meta/ER
model.
Elabri et al., in progress
Philippe Leray Advances in Learning with Bayesian Networks 29/32
BN learning Dynamic BN learning Relational BN learning Conclusion
Graph database
Definition
Schema-free
Elabri et al., in progress
Learning a
probabilistic relational
model from a graph
DB
Extension to Markov
Logic Networks
Philippe Leray Advances in Learning with Bayesian Networks 29/32
BN learning Dynamic BN learning Relational BN learning Conclusion
Outline ...
1 BN learning
Definition
Parameter learning
Structure learning
2 Dynamic BN learning
Definition
Learning
3 Relational BN learning
Definition
Learning
Graph DB ?
4 Conclusion
Last words
References
Philippe Leray Advances in Learning with Bayesian Networks 30/32
BN learning Dynamic BN learning Relational BN learning Conclusion
Conclusion
Visible face of this talk
BNs = powerful tool for knowledge representation and
reasoning
⇒ interest in using structure learning algorithms for knowledge
discovery
BN structure learning is NP-hard, even for ”usual” BN/data
but we want to learn more and more complex models with
more and more complex data
⇒ many works in progress in order to develop such learning
algorithms
Hidden face of this talk
Philippe Leray Advances in Learning with Bayesian Networks 31/32
BN learning Dynamic BN learning Relational BN learning Conclusion
Conclusion
Visible face of this talk
BNs = powerful tool for knowledge representation and
reasoning
⇒ interest in using structure learning algorithms for knowledge
discovery
BN structure learning is NP-hard, even for ”usual” BN/data
but we want to learn more and more complex models with
more and more complex data
⇒ many works in progress in order to develop such learning
algorithms
Hidden face of this talk
Philippe Leray Advances in Learning with Bayesian Networks 31/32
BN learning Dynamic BN learning Relational BN learning Conclusion
Conclusion
Visible face of this talk
Hidden face of this talk
BN learning and causality : causal discovery
BN learning tools : no unified programming tools, often
limited to simple BN models / simple data
⇒ coming soon : PILGRIM our GPL platform in C++, dealing
with BN, DBN, RBN, incremental data, ...
BN versus other probabilistic graphical models : Qualitative
probabilistic models, Markov random fields, Conditional
random fields, Deep belief networks, ...
Philippe Leray Advances in Learning with Bayesian Networks 31/32
BN learning Dynamic BN learning Relational BN learning Conclusion
Conclusion
Visible face of this talk
Hidden face of this talk
BN learning and causality : causal discovery
BN learning tools : no unified programming tools, often
limited to simple BN models / simple data
⇒ coming soon : PILGRIM our GPL platform in C++, dealing
with BN, DBN, RBN, incremental data, ...
BN versus other probabilistic graphical models : Qualitative
probabilistic models, Markov random fields, Conditional
random fields, Deep belief networks, ...
Philippe Leray Advances in Learning with Bayesian Networks 31/32
BN learning Dynamic BN learning Relational BN learning Conclusion
Conclusion
Visible face of this talk
Hidden face of this talk
BN learning and causality : causal discovery
BN learning tools : no unified programming tools, often
limited to simple BN models / simple data
⇒ coming soon : PILGRIM our GPL platform in C++, dealing
with BN, DBN, RBN, incremental data, ...
BN versus other probabilistic graphical models : Qualitative
probabilistic models, Markov random fields, Conditional
random fields, Deep belief networks, ...
Philippe Leray Advances in Learning with Bayesian Networks 31/32
BN learning Dynamic BN learning Relational BN learning Conclusion
References
One starting point
[Koller & Friedman, 09]
Probabilistic Graphical Models:
Principles and Techniques. MIT
Press.
Our publications
http://tinyurl.com/PhLeray
Thank you for your
attention
Philippe Leray Advances in Learning with Bayesian Networks 32/32
BN learning Dynamic BN learning Relational BN learning Conclusion
References
One starting point
[Koller & Friedman, 09]
Probabilistic Graphical Models:
Principles and Techniques. MIT
Press.
Our publications
http://tinyurl.com/PhLeray
Thank you for your
attention
Philippe Leray Advances in Learning with Bayesian Networks 32/32

Contenu connexe

Tendances

French machine reading for question answering
French machine reading for question answeringFrench machine reading for question answering
French machine reading for question answeringAli Kabbadj
 
Introduction to Interpretable Machine Learning
Introduction to Interpretable Machine LearningIntroduction to Interpretable Machine Learning
Introduction to Interpretable Machine LearningNguyen Giang
 
Nlp research presentation
Nlp research presentationNlp research presentation
Nlp research presentationSurya Sg
 
Nautral Langauge Processing - Basics / Non Technical
Nautral Langauge Processing - Basics / Non Technical Nautral Langauge Processing - Basics / Non Technical
Nautral Langauge Processing - Basics / Non Technical Dhruv Gohil
 
Meta-argumentation Frameworks For Modelling Dialogues with Information from S...
Meta-argumentation Frameworks For Modelling Dialogues with Information from S...Meta-argumentation Frameworks For Modelling Dialogues with Information from S...
Meta-argumentation Frameworks For Modelling Dialogues with Information from S...Gideonbms
 
Show observe and tell giang nguyen
Show observe and tell   giang nguyenShow observe and tell   giang nguyen
Show observe and tell giang nguyenNguyen Giang
 
NLP_Project_Paper_up276_vec241
NLP_Project_Paper_up276_vec241NLP_Project_Paper_up276_vec241
NLP_Project_Paper_up276_vec241Urjit Patel
 
Machine Learning Applications in NLP.ppt
Machine Learning Applications in NLP.pptMachine Learning Applications in NLP.ppt
Machine Learning Applications in NLP.pptbutest
 
mlas06_nigam_tie_01.ppt
mlas06_nigam_tie_01.pptmlas06_nigam_tie_01.ppt
mlas06_nigam_tie_01.pptbutest
 
Practical machine learning - Part 1
Practical machine learning - Part 1Practical machine learning - Part 1
Practical machine learning - Part 1Traian Rebedea
 
Audit report[rollno 49]
Audit report[rollno 49]Audit report[rollno 49]
Audit report[rollno 49]RAHULROHAM2
 
An Approach to Automated Learning of Conceptual Graphs from Text
An Approach to Automated Learning of Conceptual Graphs from TextAn Approach to Automated Learning of Conceptual Graphs from Text
An Approach to Automated Learning of Conceptual Graphs from TextFulvio Rotella
 
Model Evaluation in the land of Deep Learning
Model Evaluation in the land of Deep LearningModel Evaluation in the land of Deep Learning
Model Evaluation in the land of Deep LearningPramit Choudhary
 
Topic Modeling for Information Retrieval and Word Sense Disambiguation tasks
Topic Modeling for Information Retrieval and Word Sense Disambiguation tasksTopic Modeling for Information Retrieval and Word Sense Disambiguation tasks
Topic Modeling for Information Retrieval and Word Sense Disambiguation tasksLeonardo Di Donato
 
C++ plus data structures, 3rd edition (2003)
C++ plus data structures, 3rd edition (2003)C++ plus data structures, 3rd edition (2003)
C++ plus data structures, 3rd edition (2003)SHC
 
Word2Vec: Learning of word representations in a vector space - Di Mitri & Her...
Word2Vec: Learning of word representations in a vector space - Di Mitri & Her...Word2Vec: Learning of word representations in a vector space - Di Mitri & Her...
Word2Vec: Learning of word representations in a vector space - Di Mitri & Her...Daniele Di Mitri
 
Explore the Effects of Emoticons on Twitter Sentiment Analysis
Explore the Effects of Emoticons on Twitter Sentiment Analysis Explore the Effects of Emoticons on Twitter Sentiment Analysis
Explore the Effects of Emoticons on Twitter Sentiment Analysis csandit
 

Tendances (20)

French machine reading for question answering
French machine reading for question answeringFrench machine reading for question answering
French machine reading for question answering
 
Introduction to Interpretable Machine Learning
Introduction to Interpretable Machine LearningIntroduction to Interpretable Machine Learning
Introduction to Interpretable Machine Learning
 
Plug play language_models
Plug play language_modelsPlug play language_models
Plug play language_models
 
Nlp research presentation
Nlp research presentationNlp research presentation
Nlp research presentation
 
Nautral Langauge Processing - Basics / Non Technical
Nautral Langauge Processing - Basics / Non Technical Nautral Langauge Processing - Basics / Non Technical
Nautral Langauge Processing - Basics / Non Technical
 
Meta-argumentation Frameworks For Modelling Dialogues with Information from S...
Meta-argumentation Frameworks For Modelling Dialogues with Information from S...Meta-argumentation Frameworks For Modelling Dialogues with Information from S...
Meta-argumentation Frameworks For Modelling Dialogues with Information from S...
 
Show observe and tell giang nguyen
Show observe and tell   giang nguyenShow observe and tell   giang nguyen
Show observe and tell giang nguyen
 
NLP_Project_Paper_up276_vec241
NLP_Project_Paper_up276_vec241NLP_Project_Paper_up276_vec241
NLP_Project_Paper_up276_vec241
 
Machine Learning Applications in NLP.ppt
Machine Learning Applications in NLP.pptMachine Learning Applications in NLP.ppt
Machine Learning Applications in NLP.ppt
 
mlas06_nigam_tie_01.ppt
mlas06_nigam_tie_01.pptmlas06_nigam_tie_01.ppt
mlas06_nigam_tie_01.ppt
 
Practical machine learning - Part 1
Practical machine learning - Part 1Practical machine learning - Part 1
Practical machine learning - Part 1
 
Audit report[rollno 49]
Audit report[rollno 49]Audit report[rollno 49]
Audit report[rollno 49]
 
Blenderbot
BlenderbotBlenderbot
Blenderbot
 
An Approach to Automated Learning of Conceptual Graphs from Text
An Approach to Automated Learning of Conceptual Graphs from TextAn Approach to Automated Learning of Conceptual Graphs from Text
An Approach to Automated Learning of Conceptual Graphs from Text
 
Model Evaluation in the land of Deep Learning
Model Evaluation in the land of Deep LearningModel Evaluation in the land of Deep Learning
Model Evaluation in the land of Deep Learning
 
What is word2vec?
What is word2vec?What is word2vec?
What is word2vec?
 
Topic Modeling for Information Retrieval and Word Sense Disambiguation tasks
Topic Modeling for Information Retrieval and Word Sense Disambiguation tasksTopic Modeling for Information Retrieval and Word Sense Disambiguation tasks
Topic Modeling for Information Retrieval and Word Sense Disambiguation tasks
 
C++ plus data structures, 3rd edition (2003)
C++ plus data structures, 3rd edition (2003)C++ plus data structures, 3rd edition (2003)
C++ plus data structures, 3rd edition (2003)
 
Word2Vec: Learning of word representations in a vector space - Di Mitri & Her...
Word2Vec: Learning of word representations in a vector space - Di Mitri & Her...Word2Vec: Learning of word representations in a vector space - Di Mitri & Her...
Word2Vec: Learning of word representations in a vector space - Di Mitri & Her...
 
Explore the Effects of Emoticons on Twitter Sentiment Analysis
Explore the Effects of Emoticons on Twitter Sentiment Analysis Explore the Effects of Emoticons on Twitter Sentiment Analysis
Explore the Effects of Emoticons on Twitter Sentiment Analysis
 

Similaire à Advances in Learning with Bayesian Networks - july 2015

Lecture 7: Learning from Massive Datasets
Lecture 7: Learning from Massive DatasetsLecture 7: Learning from Massive Datasets
Lecture 7: Learning from Massive DatasetsMarina Santini
 
Do Neural Models Learn Transitivity of Veridical Inference?
Do Neural Models Learn Transitivity of Veridical Inference?Do Neural Models Learn Transitivity of Veridical Inference?
Do Neural Models Learn Transitivity of Veridical Inference?Hitomi Yanaka
 
Gentle Introduction: Bayesian Modelling and Probabilistic Programming in R
Gentle Introduction: Bayesian Modelling and Probabilistic Programming in RGentle Introduction: Bayesian Modelling and Probabilistic Programming in R
Gentle Introduction: Bayesian Modelling and Probabilistic Programming in RMarco Wirthlin
 
Recent developments in CS education research Jul 18
Recent developments in CS education research Jul 18Recent developments in CS education research Jul 18
Recent developments in CS education research Jul 18Sue Sentance
 
IntroML_3_DataVisualisation
IntroML_3_DataVisualisationIntroML_3_DataVisualisation
IntroML_3_DataVisualisationElio Laureano
 
1066_multitask_prompted_training_en.pdf
1066_multitask_prompted_training_en.pdf1066_multitask_prompted_training_en.pdf
1066_multitask_prompted_training_en.pdfssusere320ca
 
Research Directions - Full Stack Deep Learning
Research Directions - Full Stack Deep LearningResearch Directions - Full Stack Deep Learning
Research Directions - Full Stack Deep LearningSergey Karayev
 
Bayesian Networks to Predict Reputation in Virtual Learning Communities
Bayesian Networks to Predict Reputation in Virtual Learning CommunitiesBayesian Networks to Predict Reputation in Virtual Learning Communities
Bayesian Networks to Predict Reputation in Virtual Learning CommunitiesUniversidad Nacional de Loja
 
Continual Learning: Another Step Towards Truly Intelligent Machines
Continual Learning: Another Step Towards Truly Intelligent MachinesContinual Learning: Another Step Towards Truly Intelligent Machines
Continual Learning: Another Step Towards Truly Intelligent MachinesVincenzo Lomonaco
 
Deploying Open Learning Analytics at a National Scale
Deploying Open Learning Analytics at a National ScaleDeploying Open Learning Analytics at a National Scale
Deploying Open Learning Analytics at a National Scalemichaeldwebb
 
Deploying Open Learning Analytics at a National Scale
Deploying Open Learning Analytics at a National ScaleDeploying Open Learning Analytics at a National Scale
Deploying Open Learning Analytics at a National Scalemwebbjisc
 
Learning with limited labelled data in NLP: multi-task learning and beyond
Learning with limited labelled data in NLP: multi-task learning and beyondLearning with limited labelled data in NLP: multi-task learning and beyond
Learning with limited labelled data in NLP: multi-task learning and beyondIsabelle Augenstein
 
Dialogue system②
Dialogue system②Dialogue system②
Dialogue system②Kent T
 
An Overview Of Mmpi
An Overview Of MmpiAn Overview Of Mmpi
An Overview Of Mmpivturpin
 
Show me the data! Actionable insight from open courses
Show me the data! Actionable insight from open coursesShow me the data! Actionable insight from open courses
Show me the data! Actionable insight from open coursesMartin Hawksey
 
Linked Data Generation for Adaptive Learning Analytics Systems
Linked Data Generation for Adaptive Learning Analytics SystemsLinked Data Generation for Adaptive Learning Analytics Systems
Linked Data Generation for Adaptive Learning Analytics SystemsSven Lieber
 
Distributed Models Over Distributed Data with MLflow, Pyspark, and Pandas
Distributed Models Over Distributed Data with MLflow, Pyspark, and PandasDistributed Models Over Distributed Data with MLflow, Pyspark, and Pandas
Distributed Models Over Distributed Data with MLflow, Pyspark, and PandasDatabricks
 

Similaire à Advances in Learning with Bayesian Networks - july 2015 (20)

Lecture 7: Learning from Massive Datasets
Lecture 7: Learning from Massive DatasetsLecture 7: Learning from Massive Datasets
Lecture 7: Learning from Massive Datasets
 
Do Neural Models Learn Transitivity of Veridical Inference?
Do Neural Models Learn Transitivity of Veridical Inference?Do Neural Models Learn Transitivity of Veridical Inference?
Do Neural Models Learn Transitivity of Veridical Inference?
 
Gentle Introduction: Bayesian Modelling and Probabilistic Programming in R
Gentle Introduction: Bayesian Modelling and Probabilistic Programming in RGentle Introduction: Bayesian Modelling and Probabilistic Programming in R
Gentle Introduction: Bayesian Modelling and Probabilistic Programming in R
 
MILA DL & RL summer school highlights
MILA DL & RL summer school highlights MILA DL & RL summer school highlights
MILA DL & RL summer school highlights
 
Recent developments in CS education research Jul 18
Recent developments in CS education research Jul 18Recent developments in CS education research Jul 18
Recent developments in CS education research Jul 18
 
IntroML_3_DataVisualisation
IntroML_3_DataVisualisationIntroML_3_DataVisualisation
IntroML_3_DataVisualisation
 
1066_multitask_prompted_training_en.pdf
1066_multitask_prompted_training_en.pdf1066_multitask_prompted_training_en.pdf
1066_multitask_prompted_training_en.pdf
 
Lecture20 xing
Lecture20 xingLecture20 xing
Lecture20 xing
 
Research Directions - Full Stack Deep Learning
Research Directions - Full Stack Deep LearningResearch Directions - Full Stack Deep Learning
Research Directions - Full Stack Deep Learning
 
Bayesian Networks to Predict Reputation in Virtual Learning Communities
Bayesian Networks to Predict Reputation in Virtual Learning CommunitiesBayesian Networks to Predict Reputation in Virtual Learning Communities
Bayesian Networks to Predict Reputation in Virtual Learning Communities
 
Continual Learning: Another Step Towards Truly Intelligent Machines
Continual Learning: Another Step Towards Truly Intelligent MachinesContinual Learning: Another Step Towards Truly Intelligent Machines
Continual Learning: Another Step Towards Truly Intelligent Machines
 
Deploying Open Learning Analytics at a National Scale
Deploying Open Learning Analytics at a National ScaleDeploying Open Learning Analytics at a National Scale
Deploying Open Learning Analytics at a National Scale
 
Deploying Open Learning Analytics at a National Scale
Deploying Open Learning Analytics at a National ScaleDeploying Open Learning Analytics at a National Scale
Deploying Open Learning Analytics at a National Scale
 
Learning with limited labelled data in NLP: multi-task learning and beyond
Learning with limited labelled data in NLP: multi-task learning and beyondLearning with limited labelled data in NLP: multi-task learning and beyond
Learning with limited labelled data in NLP: multi-task learning and beyond
 
Dialogue system②
Dialogue system②Dialogue system②
Dialogue system②
 
lec01.pptx
lec01.pptxlec01.pptx
lec01.pptx
 
An Overview Of Mmpi
An Overview Of MmpiAn Overview Of Mmpi
An Overview Of Mmpi
 
Show me the data! Actionable insight from open courses
Show me the data! Actionable insight from open coursesShow me the data! Actionable insight from open courses
Show me the data! Actionable insight from open courses
 
Linked Data Generation for Adaptive Learning Analytics Systems
Linked Data Generation for Adaptive Learning Analytics SystemsLinked Data Generation for Adaptive Learning Analytics Systems
Linked Data Generation for Adaptive Learning Analytics Systems
 
Distributed Models Over Distributed Data with MLflow, Pyspark, and Pandas
Distributed Models Over Distributed Data with MLflow, Pyspark, and PandasDistributed Models Over Distributed Data with MLflow, Pyspark, and Pandas
Distributed Models Over Distributed Data with MLflow, Pyspark, and Pandas
 

Plus de University of Nantes

Une introduction aux Réseaux bayésiens
Une introduction aux Réseaux bayésiensUne introduction aux Réseaux bayésiens
Une introduction aux Réseaux bayésiensUniversity of Nantes
 
Les Challenges de l'Intelligence Artificielle
Les Challenges de l'Intelligence ArtificielleLes Challenges de l'Intelligence Artificielle
Les Challenges de l'Intelligence ArtificielleUniversity of Nantes
 
Learning possibilistic networks from data: a survey
Learning possibilistic networks from data: a surveyLearning possibilistic networks from data: a survey
Learning possibilistic networks from data: a surveyUniversity of Nantes
 
Evaluation des algorithmes d’apprentissage de structure pour les réseaux Bayé...
Evaluation des algorithmes d’apprentissage de structure pour les réseaux Bayé...Evaluation des algorithmes d’apprentissage de structure pour les réseaux Bayé...
Evaluation des algorithmes d’apprentissage de structure pour les réseaux Bayé...University of Nantes
 
Introduction aux modèles graphiques probabilistes
Introduction aux modèles graphiques probabilistesIntroduction aux modèles graphiques probabilistes
Introduction aux modèles graphiques probabilistesUniversity of Nantes
 
Des modèles graphiques probabilistes aux modeles graphiques de durée
Des modèles graphiques probabilistes aux modeles graphiques de duréeDes modèles graphiques probabilistes aux modeles graphiques de durée
Des modèles graphiques probabilistes aux modeles graphiques de duréeUniversity of Nantes
 

Plus de University of Nantes (8)

SlidesTRAIL.pdf
SlidesTRAIL.pdfSlidesTRAIL.pdf
SlidesTRAIL.pdf
 
SlidesJS-PhLeray.pdf
SlidesJS-PhLeray.pdfSlidesJS-PhLeray.pdf
SlidesJS-PhLeray.pdf
 
Une introduction aux Réseaux bayésiens
Une introduction aux Réseaux bayésiensUne introduction aux Réseaux bayésiens
Une introduction aux Réseaux bayésiens
 
Les Challenges de l'Intelligence Artificielle
Les Challenges de l'Intelligence ArtificielleLes Challenges de l'Intelligence Artificielle
Les Challenges de l'Intelligence Artificielle
 
Learning possibilistic networks from data: a survey
Learning possibilistic networks from data: a surveyLearning possibilistic networks from data: a survey
Learning possibilistic networks from data: a survey
 
Evaluation des algorithmes d’apprentissage de structure pour les réseaux Bayé...
Evaluation des algorithmes d’apprentissage de structure pour les réseaux Bayé...Evaluation des algorithmes d’apprentissage de structure pour les réseaux Bayé...
Evaluation des algorithmes d’apprentissage de structure pour les réseaux Bayé...
 
Introduction aux modèles graphiques probabilistes
Introduction aux modèles graphiques probabilistesIntroduction aux modèles graphiques probabilistes
Introduction aux modèles graphiques probabilistes
 
Des modèles graphiques probabilistes aux modeles graphiques de durée
Des modèles graphiques probabilistes aux modeles graphiques de duréeDes modèles graphiques probabilistes aux modeles graphiques de durée
Des modèles graphiques probabilistes aux modeles graphiques de durée
 

Dernier

GenBio2 - Lesson 1 - Introduction to Genetics.pptx
GenBio2 - Lesson 1 - Introduction to Genetics.pptxGenBio2 - Lesson 1 - Introduction to Genetics.pptx
GenBio2 - Lesson 1 - Introduction to Genetics.pptxBerniceCayabyab1
 
trihybrid cross , test cross chi squares
trihybrid cross , test cross chi squarestrihybrid cross , test cross chi squares
trihybrid cross , test cross chi squaresusmanzain586
 
REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...
REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...
REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...Universidade Federal de Sergipe - UFS
 
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...lizamodels9
 
《Queensland毕业文凭-昆士兰大学毕业证成绩单》
《Queensland毕业文凭-昆士兰大学毕业证成绩单》《Queensland毕业文凭-昆士兰大学毕业证成绩单》
《Queensland毕业文凭-昆士兰大学毕业证成绩单》rnrncn29
 
basic entomology with insect anatomy and taxonomy
basic entomology with insect anatomy and taxonomybasic entomology with insect anatomy and taxonomy
basic entomology with insect anatomy and taxonomyDrAnita Sharma
 
CHROMATOGRAPHY PALLAVI RAWAT.pptx
CHROMATOGRAPHY  PALLAVI RAWAT.pptxCHROMATOGRAPHY  PALLAVI RAWAT.pptx
CHROMATOGRAPHY PALLAVI RAWAT.pptxpallavirawat456
 
Thermodynamics ,types of system,formulae ,gibbs free energy .pptx
Thermodynamics ,types of system,formulae ,gibbs free energy .pptxThermodynamics ,types of system,formulae ,gibbs free energy .pptx
Thermodynamics ,types of system,formulae ,gibbs free energy .pptxuniversity
 
ECG Graph Monitoring with AD8232 ECG Sensor & Arduino.pptx
ECG Graph Monitoring with AD8232 ECG Sensor & Arduino.pptxECG Graph Monitoring with AD8232 ECG Sensor & Arduino.pptx
ECG Graph Monitoring with AD8232 ECG Sensor & Arduino.pptxmaryFF1
 
ALL ABOUT MIXTURES IN GRADE 7 CLASS PPTX
ALL ABOUT MIXTURES IN GRADE 7 CLASS PPTXALL ABOUT MIXTURES IN GRADE 7 CLASS PPTX
ALL ABOUT MIXTURES IN GRADE 7 CLASS PPTXDole Philippines School
 
Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024AyushiRastogi48
 
Servosystem Theory / Cybernetic Theory by Petrovic
Servosystem Theory / Cybernetic Theory by PetrovicServosystem Theory / Cybernetic Theory by Petrovic
Servosystem Theory / Cybernetic Theory by PetrovicAditi Jain
 
User Guide: Magellan MX™ Weather Station
User Guide: Magellan MX™ Weather StationUser Guide: Magellan MX™ Weather Station
User Guide: Magellan MX™ Weather StationColumbia Weather Systems
 
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptx
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptxTHE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptx
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptxNandakishor Bhaurao Deshmukh
 
User Guide: Capricorn FLX™ Weather Station
User Guide: Capricorn FLX™ Weather StationUser Guide: Capricorn FLX™ Weather Station
User Guide: Capricorn FLX™ Weather StationColumbia Weather Systems
 
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.PraveenaKalaiselvan1
 
Speech, hearing, noise, intelligibility.pptx
Speech, hearing, noise, intelligibility.pptxSpeech, hearing, noise, intelligibility.pptx
Speech, hearing, noise, intelligibility.pptxpriyankatabhane
 
OECD bibliometric indicators: Selected highlights, April 2024
OECD bibliometric indicators: Selected highlights, April 2024OECD bibliometric indicators: Selected highlights, April 2024
OECD bibliometric indicators: Selected highlights, April 2024innovationoecd
 
Dubai Calls Girl Lisa O525547819 Lexi Call Girls In Dubai
Dubai Calls Girl Lisa O525547819 Lexi Call Girls In DubaiDubai Calls Girl Lisa O525547819 Lexi Call Girls In Dubai
Dubai Calls Girl Lisa O525547819 Lexi Call Girls In Dubaikojalkojal131
 
The dark energy paradox leads to a new structure of spacetime.pptx
The dark energy paradox leads to a new structure of spacetime.pptxThe dark energy paradox leads to a new structure of spacetime.pptx
The dark energy paradox leads to a new structure of spacetime.pptxEran Akiva Sinbar
 

Dernier (20)

GenBio2 - Lesson 1 - Introduction to Genetics.pptx
GenBio2 - Lesson 1 - Introduction to Genetics.pptxGenBio2 - Lesson 1 - Introduction to Genetics.pptx
GenBio2 - Lesson 1 - Introduction to Genetics.pptx
 
trihybrid cross , test cross chi squares
trihybrid cross , test cross chi squarestrihybrid cross , test cross chi squares
trihybrid cross , test cross chi squares
 
REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...
REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...
REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...
 
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...
 
《Queensland毕业文凭-昆士兰大学毕业证成绩单》
《Queensland毕业文凭-昆士兰大学毕业证成绩单》《Queensland毕业文凭-昆士兰大学毕业证成绩单》
《Queensland毕业文凭-昆士兰大学毕业证成绩单》
 
basic entomology with insect anatomy and taxonomy
basic entomology with insect anatomy and taxonomybasic entomology with insect anatomy and taxonomy
basic entomology with insect anatomy and taxonomy
 
CHROMATOGRAPHY PALLAVI RAWAT.pptx
CHROMATOGRAPHY  PALLAVI RAWAT.pptxCHROMATOGRAPHY  PALLAVI RAWAT.pptx
CHROMATOGRAPHY PALLAVI RAWAT.pptx
 
Thermodynamics ,types of system,formulae ,gibbs free energy .pptx
Thermodynamics ,types of system,formulae ,gibbs free energy .pptxThermodynamics ,types of system,formulae ,gibbs free energy .pptx
Thermodynamics ,types of system,formulae ,gibbs free energy .pptx
 
ECG Graph Monitoring with AD8232 ECG Sensor & Arduino.pptx
ECG Graph Monitoring with AD8232 ECG Sensor & Arduino.pptxECG Graph Monitoring with AD8232 ECG Sensor & Arduino.pptx
ECG Graph Monitoring with AD8232 ECG Sensor & Arduino.pptx
 
ALL ABOUT MIXTURES IN GRADE 7 CLASS PPTX
ALL ABOUT MIXTURES IN GRADE 7 CLASS PPTXALL ABOUT MIXTURES IN GRADE 7 CLASS PPTX
ALL ABOUT MIXTURES IN GRADE 7 CLASS PPTX
 
Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024
 
Servosystem Theory / Cybernetic Theory by Petrovic
Servosystem Theory / Cybernetic Theory by PetrovicServosystem Theory / Cybernetic Theory by Petrovic
Servosystem Theory / Cybernetic Theory by Petrovic
 
User Guide: Magellan MX™ Weather Station
User Guide: Magellan MX™ Weather StationUser Guide: Magellan MX™ Weather Station
User Guide: Magellan MX™ Weather Station
 
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptx
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptxTHE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptx
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptx
 
User Guide: Capricorn FLX™ Weather Station
User Guide: Capricorn FLX™ Weather StationUser Guide: Capricorn FLX™ Weather Station
User Guide: Capricorn FLX™ Weather Station
 
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
 
Speech, hearing, noise, intelligibility.pptx
Speech, hearing, noise, intelligibility.pptxSpeech, hearing, noise, intelligibility.pptx
Speech, hearing, noise, intelligibility.pptx
 
OECD bibliometric indicators: Selected highlights, April 2024
OECD bibliometric indicators: Selected highlights, April 2024OECD bibliometric indicators: Selected highlights, April 2024
OECD bibliometric indicators: Selected highlights, April 2024
 
Dubai Calls Girl Lisa O525547819 Lexi Call Girls In Dubai
Dubai Calls Girl Lisa O525547819 Lexi Call Girls In DubaiDubai Calls Girl Lisa O525547819 Lexi Call Girls In Dubai
Dubai Calls Girl Lisa O525547819 Lexi Call Girls In Dubai
 
The dark energy paradox leads to a new structure of spacetime.pptx
The dark energy paradox leads to a new structure of spacetime.pptxThe dark energy paradox leads to a new structure of spacetime.pptx
The dark energy paradox leads to a new structure of spacetime.pptx
 

Advances in Learning with Bayesian Networks - july 2015

  • 1. BN learning Dynamic BN learning Relational BN learning Conclusion Advances in Learning with Bayesian Networks Philippe Leray philippe.leray@univ-nantes.fr DUKe (Data User Knowledge) research group, LINA UMR 6241, Nantes, France Nantes Machine Learning Meetup, July 6, 2015, Nantes, France Philippe Leray Advances in Learning with Bayesian Networks 1/32
  • 2. BN learning Dynamic BN learning Relational BN learning Conclusion Motivations Bayesian networks (BNs) are a powerful tool for graphical representation of the underlying knowledge in the data and reasoning with incomplete or imprecise observations. BNs have been extended (or generalized) in several ways, as for instance, causal BNs, dynamic BNs, relational BNs, ... Grade Letter SAT IntelligenceDifficulty d1 d0 0.6 0.4 i1 i0 0.7 0.3 i0 i1 s1s0 0.95 0.2 0.05 0.8 g1 g2 g2 l1 l 0 0.1 0.4 0.99 0.9 0.6 0.01 i0,d0 i0 ,d1 i0 ,d0 i0 ,d1 g2 g3g1 0.3 0.05 0.9 0.5 0.4 0.25 0.08 0.3 0.3 0.7 0.02 0.2 Philippe Leray Advances in Learning with Bayesian Networks 2/32
  • 3. BN learning Dynamic BN learning Relational BN learning Conclusion Motivations Bayesian networks (BNs) are a powerful tool for graphical representation of the underlying knowledge in the data and reasoning with incomplete or imprecise observations. BNs have been extended (or generalized) in several ways, as for instance, causal BNs, dynamic BNs, relational BNs, ... Grade Letter SAT IntelligenceDifficulty d1 d0 0.6 0.4 i1 i0 0.7 0.3 i0 i1 s1s0 0.95 0.2 0.05 0.8 g1 g2 g2 l1 l 0 0.1 0.4 0.99 0.9 0.6 0.01 i0,d0 i0 ,d1 i0 ,d0 i0 ,d1 g2 g3g1 0.3 0.05 0.9 0.5 0.4 0.25 0.08 0.3 0.3 0.7 0.02 0.2 Philippe Leray Advances in Learning with Bayesian Networks 2/32
  • 4. BN learning Dynamic BN learning Relational BN learning Conclusion Motivations Philippe Leray Advances in Learning with Bayesian Networks 3/32
  • 5. BN learning Dynamic BN learning Relational BN learning Conclusion Motivations We would like to learn a BN from data... but which kind of data ? complete Philippe Leray Advances in Learning with Bayesian Networks 4/32
  • 6. BN learning Dynamic BN learning Relational BN learning Conclusion Motivations We would like to learn a BN from data... but which kind of data ? complete /incomplete [Fran¸cois et al. 06] Philippe Leray Advances in Learning with Bayesian Networks 4/32
  • 7. BN learning Dynamic BN learning Relational BN learning Conclusion Motivations We would like to learn a BN from data... but which kind of data ? complete /incomplete [Fran¸cois et al. 06] high n, Philippe Leray Advances in Learning with Bayesian Networks 4/32
  • 8. BN learning Dynamic BN learning Relational BN learning Conclusion Motivations We would like to learn a BN from data... but which kind of data ? complete /incomplete [Fran¸cois et al. 06] high n, n >> p [Ammar & Leray, 11] Philippe Leray Advances in Learning with Bayesian Networks 4/32
  • 9. BN learning Dynamic BN learning Relational BN learning Conclusion Motivations We would like to learn a BN from data... but which kind of data ? complete /incomplete [Fran¸cois et al. 06] high n, n >> p [Ammar & Leray, 11] stream [Yasin and Leray, 13] Philippe Leray Advances in Learning with Bayesian Networks 4/32
  • 10. BN learning Dynamic BN learning Relational BN learning Conclusion Motivations We would like to learn a BN from data... but which kind of data ? complete /incomplete [Fran¸cois et al. 06] high n, n >> p [Ammar & Leray, 11] stream [Yasin and Leray, 13] + prior knowledge / ontology [Ben Messaoud et al., 13] Philippe Leray Advances in Learning with Bayesian Networks 4/32
  • 11. BN learning Dynamic BN learning Relational BN learning Conclusion Motivations We would like to learn a BN from data... but which kind of data ? complete /incomplete [Fran¸cois et al. 06] high n, n >> p [Ammar & Leray, 11] stream [Yasin and Leray, 13] + prior knowledge / ontology [Ben Messaoud et al., 13] structured data [Ben Ishak, Coutant, Chulyadyo et al.] Philippe Leray Advances in Learning with Bayesian Networks 4/32
  • 12. BN learning Dynamic BN learning Relational BN learning Conclusion Motivations We would like to learn a BN from data... but which kind of data ? complete /incomplete [Fran¸cois et al. 06] high n, n >> p [Ammar & Leray, 11] stream [Yasin and Leray, 13] + prior knowledge / ontology [Ben Messaoud et al., 13] structured data [Ben Ishak, Coutant, Chulyadyo et al.] not so structured data [Elabri et al.] Philippe Leray Advances in Learning with Bayesian Networks 4/32
  • 13. BN learning Dynamic BN learning Relational BN learning Conclusion Motivations Even the learning task can differ : generative modeling P(X, Y ) no target variable more general model better behavior with incomplete data Objectives of this talk how to learn BNs in such various contexts ? state of the art : founding algorithms and recent ones pointing out our contributions in this field Philippe Leray Advances in Learning with Bayesian Networks 5/32
  • 14. BN learning Dynamic BN learning Relational BN learning Conclusion Motivations Even the learning task can differ : generative vs. discriminative modeling P(X, Y ) no target variable more general model better behavior with incomplete data modeling P(Y |X) one target variable Y dedicated model Objectives of this talk how to learn BNs in such various contexts ? state of the art : founding algorithms and recent ones pointing out our contributions in this field Philippe Leray Advances in Learning with Bayesian Networks 5/32
  • 15. BN learning Dynamic BN learning Relational BN learning Conclusion Motivations Even the learning task can differ : generative vs. discriminative modeling P(X, Y ) no target variable more general model better behavior with incomplete data modeling P(Y |X) one target variable Y dedicated model Objectives of this talk how to learn BNs in such various contexts ? state of the art : founding algorithms and recent ones pointing out our contributions in this field Philippe Leray Advances in Learning with Bayesian Networks 5/32
  • 16. BN learning Dynamic BN learning Relational BN learning Conclusion Outline ... 1 BN learning Definition Parameter learning Structure learning 2 Dynamic BN learning Definition Learning 3 Relational BN learning Definition Learning Graph DB ? 4 Conclusion Last words References Philippe Leray Advances in Learning with Bayesian Networks 6/32
  • 17. BN learning Dynamic BN learning Relational BN learning Conclusion Bayesian network [Pearl, 1985] Definition G qualitative description of conditional dependences / independences between variables directed acyclic graph (DAG) Θ quantitative description of these dependences conditional probability distributions (CPDs) Grade Letter SAT IntelligenceDifficulty d1 d0 0.6 0.4 i1 i0 0.7 0.3 i0 i1 s1s0 0.95 0.2 0.05 0.8 g1 g2 g2 l1 l 0 0.1 0.4 0.99 0.9 0.6 0.01 i0 ,d0 i0,d1 i0 ,d0 i0 ,d1 g2 g3g1 0.3 0.05 0.9 0.5 0.4 0.25 0.08 0.3 0.3 0.7 0.02 0.2 Main property the global model is decomposed into a set of local conditional models Philippe Leray Advances in Learning with Bayesian Networks 7/32
  • 18. BN learning Dynamic BN learning Relational BN learning Conclusion Bayesian network [Pearl, 1985] Definition G qualitative description of conditional dependences / independences between variables directed acyclic graph (DAG) Θ quantitative description of these dependences conditional probability distributions (CPDs) Grade Letter SAT IntelligenceDifficulty d1 d0 0.6 0.4 i1 i0 0.7 0.3 i0 i1 s1s0 0.95 0.2 0.05 0.8 g1 g2 g2 l1 l 0 0.1 0.4 0.99 0.9 0.6 0.01 i0 ,d0 i0,d1 i0 ,d0 i0 ,d1 g2 g3g1 0.3 0.05 0.9 0.5 0.4 0.25 0.08 0.3 0.3 0.7 0.02 0.2 Main property the global model is decomposed into a set of local conditional models Philippe Leray Advances in Learning with Bayesian Networks 7/32
  • 19. BN learning Dynamic BN learning Relational BN learning Conclusion One model... but two learning tasks BN = graph G and set of CPDs Θ parameter learning / G given structure learning Grade Letter SAT IntelligenceDifficulty d1 d0 0.6 0.4 i1 i0 0.7 0.3 i0 i1 s1s0 0.95 0.2 0.05 0.8 g1 g2 g2 l1 l 0 0.1 0.4 0.99 0.9 0.6 0.01 i0 ,d0 i0,d1 i0 ,d0 i0 ,d1 g2 g3g1 0.3 0.05 0.9 0.5 0.4 0.25 0.08 0.3 0.3 0.7 0.02 0.2 Philippe Leray Advances in Learning with Bayesian Networks 8/32
  • 20. BN learning Dynamic BN learning Relational BN learning Conclusion One model... but two learning tasks BN = graph G and set of CPDs Θ parameter learning / G given structure learning Grade Letter SAT IntelligenceDifficulty d1 d0 i1 i0 i0 i1 s1s0 g1 g2 g2 l1 l 0 i0 ,d0 i0,d1 i0 ,d0 i0 ,d1 g2 g3g1 ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ?? Philippe Leray Advances in Learning with Bayesian Networks 8/32
  • 21. BN learning Dynamic BN learning Relational BN learning Conclusion One model... but two learning tasks BN = graph G and set of CPDs Θ parameter learning / G given structure learning Grade Letter SAT IntelligenceDifficulty d ? ? ? ? ? Philippe Leray Advances in Learning with Bayesian Networks 8/32
  • 22. BN learning Dynamic BN learning Relational BN learning Conclusion Parameter learning (generative) Complete data D max. of likelihood (ML) : ˆθMV = argmax P(D|θ) closed-form solution : ˆP(Xi = xk|Pa(Xi ) = xj ) = ˆθMV i,j,k = Ni,j,k k Ni,j,k Ni,j,k = nb of occurrences of {Xi = xk and Pa(Xi ) = xj } Other approaches P(θ) ∼ Dirichlet(α) max. a posteriori (MAP) : ˆθMAP = argmax P(θ|D) expectation a posteriori (EAP) : ˆθEAP = E(P(θ|D)) ˆθMAP i,j,k = Ni,j,k +αi,j,k −1 k (Ni,j,k +αi,j,k −1) ˆθEAP i,j,k = Ni,j,k +αi,j,k k (Ni,j,k +αi,j,k ) Philippe Leray Advances in Learning with Bayesian Networks 9/32
  • 23. BN learning Dynamic BN learning Relational BN learning Conclusion Parameter learning (generative) Complete data D max. of likelihood (ML) : ˆθMV = argmax P(D|θ) closed-form solution : ˆP(Xi = xk|Pa(Xi ) = xj ) = ˆθMV i,j,k = Ni,j,k k Ni,j,k Ni,j,k = nb of occurrences of {Xi = xk and Pa(Xi ) = xj } Other approaches P(θ) ∼ Dirichlet(α) max. a posteriori (MAP) : ˆθMAP = argmax P(θ|D) expectation a posteriori (EAP) : ˆθEAP = E(P(θ|D)) ˆθMAP i,j,k = Ni,j,k +αi,j,k −1 k (Ni,j,k +αi,j,k −1) ˆθEAP i,j,k = Ni,j,k +αi,j,k k (Ni,j,k +αi,j,k ) Philippe Leray Advances in Learning with Bayesian Networks 9/32
  • 24. BN learning Dynamic BN learning Relational BN learning Conclusion Parameter learning (generative) Incomplete data no closed-form solution EM (iterative) algorithm [Dempster, 77], convergence to a local optimum Incremental data advantages of sufficient statistics θi,j,k = Nold θold i,j,k + Ni,j,k Nold + N this Bayesian updating can include a forgetting factor Philippe Leray Advances in Learning with Bayesian Networks 10/32
  • 25. BN learning Dynamic BN learning Relational BN learning Conclusion Parameter learning (generative) Incomplete data no closed-form solution EM (iterative) algorithm [Dempster, 77], convergence to a local optimum Incremental data advantages of sufficient statistics θi,j,k = Nold θold i,j,k + Ni,j,k Nold + N this Bayesian updating can include a forgetting factor Philippe Leray Advances in Learning with Bayesian Networks 10/32
  • 26. BN learning Dynamic BN learning Relational BN learning Conclusion Parameter learning (discriminative) Complete data no closed-form iterative algorithms such as gradient descent Incomplete data no closed-form iterative algorithms + EM :-( Philippe Leray Advances in Learning with Bayesian Networks 11/32
  • 27. BN learning Dynamic BN learning Relational BN learning Conclusion Parameter learning (discriminative) Complete data no closed-form iterative algorithms such as gradient descent Incomplete data no closed-form iterative algorithms + EM :-( Philippe Leray Advances in Learning with Bayesian Networks 11/32
  • 28. BN learning Dynamic BN learning Relational BN learning Conclusion BN structure learning is a complex task Size of the ”solution” space the number of possible DAGs with n variables is super-exponential w.r.t n [Robinson, 77] NS(5) = 29281 NS(10) = 4.2 × 1018 an exhaustive search is impossible for realistic n ! One thousand millenniums = 3.2 × 1013 seconds Identifiability data can only help finding (conditional) dependences / independences Markov Equivalence : several graphs describe the same dependence statements causal Sufficiency : do we know all the explaining variables ? Philippe Leray Advances in Learning with Bayesian Networks 12/32
  • 29. BN learning Dynamic BN learning Relational BN learning Conclusion BN structure learning is a complex task Size of the ”solution” space the number of possible DAGs with n variables is super-exponential w.r.t n [Robinson, 77] NS(5) = 29281 NS(10) = 4.2 × 1018 an exhaustive search is impossible for realistic n ! One thousand millenniums = 3.2 × 1013 seconds Identifiability data can only help finding (conditional) dependences / independences Markov Equivalence : several graphs describe the same dependence statements causal Sufficiency : do we know all the explaining variables ? Philippe Leray Advances in Learning with Bayesian Networks 12/32
  • 30. BN learning Dynamic BN learning Relational BN learning Conclusion Structure learning (generative / complete) Constraint-based methods BN = independence model ⇒ find CI in data in order to build the DAG ex : IC [Pearl & Verma, 91], PC [Spirtes et al., 93] problem : reliability of CI statistical tests (ok for n < 100) Score-based methods BN = probabilistic model that must fit data as well as possible problem : size of search space (ok for n < 1000) Hybrid/ local search methods local search / neighbor identification (statistical tests) global (score) optimization usually for scalability reasons (ok for high n) Philippe Leray Advances in Learning with Bayesian Networks 13/32
  • 31. BN learning Dynamic BN learning Relational BN learning Conclusion Structure learning (generative / complete) Constraint-based methods BN = independence model problem : reliability of CI statistical tests (ok for n < 100) Score-based methods BN = probabilistic model that must fit data as well as possible ⇒ search the DAG space in order to maximize a scoring function ex : Maximum Weighted Spanning Tree [Chow & Liu, 68], Greedy Search [Chickering, 95], evolutionary approaches [Larranaga et al., 96] [Wang & Yang, 10] problem : size of search space (ok for n < 1000) Hybrid/ local search methods local search / neighbor identification (statistical tests)Philippe Leray Advances in Learning with Bayesian Networks 13/32
  • 32. BN learning Dynamic BN learning Relational BN learning Conclusion Structure learning (generative / complete) Constraint-based methods BN = independence model problem : reliability of CI statistical tests (ok for n < 100) Score-based methods BN = probabilistic model that must fit data as well as possible problem : size of search space (ok for n < 1000) Hybrid/ local search methods local search / neighbor identification (statistical tests) global (score) optimization usually for scalability reasons (ok for high n) ex : MMHC algorithm [Tsamardinos et al., 06] Philippe Leray Advances in Learning with Bayesian Networks 13/32
  • 33. BN learning Dynamic BN learning Relational BN learning Conclusion Structure learning (discriminative) Specific structures naive Bayes, augmented naive Bayes multi-nets ... X1 X2 X3 C X4 X5 X1 X2 X3 C X4 X5 Structure learning usually, the structure is learned in a generative way the parameters are then tuned in a discriminative way Philippe Leray Advances in Learning with Bayesian Networks 14/32
  • 34. BN learning Dynamic BN learning Relational BN learning Conclusion Structure learning (discriminative) Specific structures naive Bayes, augmented naive Bayes multi-nets ... X1 X2 X3 C X4 X5 X1 X2 X3 C X4 X5 Structure learning usually, the structure is learned in a generative way the parameters are then tuned in a discriminative way Philippe Leray Advances in Learning with Bayesian Networks 14/32
  • 35. BN learning Dynamic BN learning Relational BN learning Conclusion Structure learning Incomplete data hybridization of previous structure learning methods and EM ex : Structural EM [Friedman, 97] Greedy Search + EM problem : convergence Grade Letter SAT IntelligenceDifficulty d ? ? ? ? ? Philippe Leray Advances in Learning with Bayesian Networks 15/32
  • 36. BN learning Dynamic BN learning Relational BN learning Conclusion Structure learning n >> p robustness and complexity issues application of Perturb & Combine principle ex : mixture of randomly perturbed trees [Ammar & Leray, 11] Philippe Leray Advances in Learning with Bayesian Networks 16/32
  • 37. BN learning Dynamic BN learning Relational BN learning Conclusion Structure learning Incremental learning and data streams Bayesian updating is easy for parameters Bayesian updating is complex for structure learning and other constraints related to data streams (limited storage, ...) ex : incremental MMHC [Yasin and Leray, 13] Philippe Leray Advances in Learning with Bayesian Networks 17/32
  • 38. BN learning Dynamic BN learning Relational BN learning Conclusion Structure learning Integration of prior knowledge in order to reduce search space : white list, black list, node ordering [Campos & Castellano, 07] interaction with ontologies [Ben Messaoud et al., 13] Philippe Leray Advances in Learning with Bayesian Networks 18/32
  • 39. BN learning Dynamic BN learning Relational BN learning Conclusion Outline ... 1 BN learning Definition Parameter learning Structure learning 2 Dynamic BN learning Definition Learning 3 Relational BN learning Definition Learning Graph DB ? 4 Conclusion Last words References Philippe Leray Advances in Learning with Bayesian Networks 19/32
  • 40. BN learning Dynamic BN learning Relational BN learning Conclusion Dynamic Bayesian networks (DBNs) k slices temporal BN (k-TBN) [Murphy, 02] k − 1 Markov order prior graph G0 + transition graph G for example : 2-TBNs model [Dean & Kanazawa, 89] Simplified k-TBN k-TBN with only temporal edges [Dojer, 06][Vinh et al, 12] Philippe Leray Advances in Learning with Bayesian Networks 20/32
  • 41. BN learning Dynamic BN learning Relational BN learning Conclusion Dynamic Bayesian networks (DBNs) k slices temporal BN (k-TBN) [Murphy, 02] k − 1 Markov order prior graph G0 + transition graph G for example : 2-TBNs model [Dean & Kanazawa, 89] Simplified k-TBN k-TBN with only temporal edges [Dojer, 06][Vinh et al, 12] Philippe Leray Advances in Learning with Bayesian Networks 20/32
  • 42. BN learning Dynamic BN learning Relational BN learning Conclusion DBN structure learning (generative) Score-based methods dynamic Greedy Search [Friedman et al., 98], genetic algorithm [Gao et al., 07], dynamic Simulated Annealing [Hartemink, 05], ... for k-TBN (G0 and G learning) but not scalable (high n) Hybrid methods [Dojer, 06] [Vinh et al., 12] for simplified k-TBN, but often limited to k = 2 for scalability dynamic MMHC for ”unsimplified” 2-TBNs with high n [Trabelsi et al., 13] Philippe Leray Advances in Learning with Bayesian Networks 21/32
  • 43. BN learning Dynamic BN learning Relational BN learning Conclusion DBN structure learning (generative) Score-based methods dynamic Greedy Search [Friedman et al., 98], genetic algorithm [Gao et al., 07], dynamic Simulated Annealing [Hartemink, 05], ... for k-TBN (G0 and G learning) but not scalable (high n) Hybrid methods [Dojer, 06] [Vinh et al., 12] for simplified k-TBN, but often limited to k = 2 for scalability dynamic MMHC for ”unsimplified” 2-TBNs with high n [Trabelsi et al., 13] Philippe Leray Advances in Learning with Bayesian Networks 21/32
  • 44. BN learning Dynamic BN learning Relational BN learning Conclusion Outline ... 1 BN learning Definition Parameter learning Structure learning 2 Dynamic BN learning Definition Learning 3 Relational BN learning Definition Learning Graph DB ? 4 Conclusion Last words References Philippe Leray Advances in Learning with Bayesian Networks 22/32
  • 45. BN learning Dynamic BN learning Relational BN learning Conclusion Relational schema Movie User Vote Movie User Rating Gender Age Occupation RealiseDate Genre A relational schema R classes + relational variables reference slots (e.g., Vote.Movie, Vote.User) slot chain = a sequence of reference slots allow to walk in the relational schema to create new variables ex: Vote.User.User−1 .Movie: all the movies voted by a particular user Philippe Leray Advances in Learning with Bayesian Networks 23/32
  • 46. BN learning Dynamic BN learning Relational BN learning Conclusion Relational schema Movie User Vote Movie User Rating Gender Age Occupation RealiseDate Genre A relational schema R classes + relational variables reference slots (e.g., Vote.Movie, Vote.User) slot chain = a sequence of reference slots allow to walk in the relational schema to create new variables ex: Vote.User.User−1 .Movie: all the movies voted by a particular user Philippe Leray Advances in Learning with Bayesian Networks 23/32
  • 47. BN learning Dynamic BN learning Relational BN learning Conclusion Relational schema Movie User Vote Movie User Rating Gender Age Occupation RealiseDate Genre A relational schema R classes + relational variables reference slots (e.g., Vote.Movie, Vote.User) slot chain = a sequence of reference slots allow to walk in the relational schema to create new variables ex: Vote.User.User−1 .Movie: all the movies voted by a particular user Philippe Leray Advances in Learning with Bayesian Networks 23/32
  • 48. BN learning Dynamic BN learning Relational BN learning Conclusion Relational schema Movie User Vote Movie User Rating Gender Age Occupation RealiseDate Genre A relational schema R classes + relational variables reference slots (e.g., Vote.Movie, Vote.User) slot chain = a sequence of reference slots allow to walk in the relational schema to create new variables ex: Vote.User.User−1 .Movie: all the movies voted by a particular user Philippe Leray Advances in Learning with Bayesian Networks 23/32
  • 49. BN learning Dynamic BN learning Relational BN learning Conclusion Probabilistic Relational Models [Koller & Pfeffer, 98] Definition A PRM Π associated to R: a qualitative dependency structure S (with possible long slot chains and aggregation functions) a set of parameters θS Vote Rating Movie User RealiseDate Genre AgeGender Occupation 0.60.4 FM User.Gender 0.40.6Comedy, F 0.50.5Comedy, M 0.10.9Horror, F 0.80.2Horror, M 0.70.3Drama, F 0.50.5Drama, M HighLow Votes.RatingMovie.Genre User.Gender Philippe Leray Advances in Learning with Bayesian Networks 24/32
  • 50. BN learning Dynamic BN learning Relational BN learning Conclusion Probabilistic Relational Models Definition Vote Rating Movie User RealiseDate Genre AgeGender Occupation 0.60.4 FM User.Gender 0.40.6Comedy, F 0.50.5Comedy, M 0.10.9Horror, F 0.80.2Horror, M 0.70.3Drama, F 0.50.5Drama, M HighLow Votes.RatingMovie.Genre User.Gender Aggregators Vote.User.User−1.Movie.genre → Vote.rating movie rating from one user can be dependent with the genre of all the movies voted by this user how to describe the dependency with an unknown number of parents ? solution : using an aggregated value, e.g. γ = MODE Philippe Leray Advances in Learning with Bayesian Networks 25/32
  • 51. BN learning Dynamic BN learning Relational BN learning Conclusion DAPER Another probabilistic relational model [Heckerman & Meek, 04] Definition Probabilistic model associated to an Entity-Relationship model Philippe Leray Advances in Learning with Bayesian Networks 26/32
  • 52. BN learning Dynamic BN learning Relational BN learning Conclusion Learning from a relational datatase GBN PRM/DAPER learning = finding the probabilistic dependencies and the probability tables from an instantiated database the relational schema/ER model is given Age Rating Age Gender Occupation Age Gender Occupation Gender Occupation Genre RealiseDate Genre Genre Genre Genre U1 U2 U3 M1 M2 M3 M4 M5 #U1, #M1 Rating #U1, #M2 Rating #U2, #M1 Rating #U2, #M3 Rating #U2, #M4 Rating #U3, #M1 Rating #U3, #M2 Rating #U3, #M3 Rating #U3, #M5 RealiseDate RealiseDate RealiseDate RealiseDate Philippe Leray Advances in Learning with Bayesian Networks 27/32
  • 53. BN learning Dynamic BN learning Relational BN learning Conclusion PRM/DAPER structure learning Relational variables finding new variables by exploring the relational schema ex: student.reg.grade, registration.course.reg.grade, registration.student reg.course.reg.grade, ... ⇒ adding another dimension in the search space ⇒ limitation to a given maximal slot chain length Constraint-based methods relational PC [Maier et al., 10] relational CD [Maier et al., 13] don’t deal with aggregation functions Score-based methods Greedy search [Getoor et al., 07] Philippe Leray Advances in Learning with Bayesian Networks 28/32
  • 54. BN learning Dynamic BN learning Relational BN learning Conclusion PRM/DAPER structure learning Relational variables ⇒ adding another dimension in the search space ⇒ limitation to a given maximal slot chain length Constraint-based methods relational PC [Maier et al., 10] relational CD [Maier et al., 13] don’t deal with aggregation functions Score-based methods Greedy search [Getoor et al., 07] Hybrid methods relational MMHC [Ben Ishak et al., 15] Philippe Leray Advances in Learning with Bayesian Networks 28/32
  • 55. BN learning Dynamic BN learning Relational BN learning Conclusion PRM/DAPER structure learning Relational variables ⇒ adding another dimension in the search space ⇒ limitation to a given maximal slot chain length Constraint-based methods relational PC [Maier et al., 10] relational CD [Maier et al., 13] don’t deal with aggregation functions Score-based methods Greedy search [Getoor et al., 07] Hybrid methods relational MMHC [Ben Ishak et al., 15] Philippe Leray Advances in Learning with Bayesian Networks 28/32
  • 56. BN learning Dynamic BN learning Relational BN learning Conclusion PRM/DAPER structure learning Relational variables ⇒ adding another dimension in the search space ⇒ limitation to a given maximal slot chain length Constraint-based methods relational PC [Maier et al., 10] relational CD [Maier et al., 13] don’t deal with aggregation functions Score-based methods Greedy search [Getoor et al., 07] Hybrid methods relational MMHC [Ben Ishak et al., 15] Philippe Leray Advances in Learning with Bayesian Networks 28/32
  • 57. BN learning Dynamic BN learning Relational BN learning Conclusion Graph database Definition Data is organized as a graph, with ”labelled” nodes and relationships Attributes can be associated to both. Seems nice for ER model but ... Schema-free Elabri et al., in progress Philippe Leray Advances in Learning with Bayesian Networks 29/32
  • 58. BN learning Dynamic BN learning Relational BN learning Conclusion Graph database Definition Data is organized as a graph, with ”labelled” nodes and relationships Attributes can be associated to both. Seems nice for ER model but ... Schema-free Elabri et al., in progress Philippe Leray Advances in Learning with Bayesian Networks 29/32
  • 59. BN learning Dynamic BN learning Relational BN learning Conclusion Graph database Definition Data is organized as a graph, with ”labelled” nodes and relationships Attributes can be associated to both. Seems nice for ER model but ... Schema-free Elabri et al., in progress Philippe Leray Advances in Learning with Bayesian Networks 29/32
  • 60. BN learning Dynamic BN learning Relational BN learning Conclusion Graph database Definition Schema-free Only data, no ”relational schema” No warranty that the data has been ”stored” by following some meta/ER model. Elabri et al., in progress Philippe Leray Advances in Learning with Bayesian Networks 29/32
  • 61. BN learning Dynamic BN learning Relational BN learning Conclusion Graph database Definition Schema-free Only data, no ”relational schema” No warranty that the data has been ”stored” by following some meta/ER model. Elabri et al., in progress Philippe Leray Advances in Learning with Bayesian Networks 29/32
  • 62. BN learning Dynamic BN learning Relational BN learning Conclusion Graph database Definition Schema-free Elabri et al., in progress Learning a probabilistic relational model from a graph DB Extension to Markov Logic Networks Philippe Leray Advances in Learning with Bayesian Networks 29/32
  • 63. BN learning Dynamic BN learning Relational BN learning Conclusion Outline ... 1 BN learning Definition Parameter learning Structure learning 2 Dynamic BN learning Definition Learning 3 Relational BN learning Definition Learning Graph DB ? 4 Conclusion Last words References Philippe Leray Advances in Learning with Bayesian Networks 30/32
  • 64. BN learning Dynamic BN learning Relational BN learning Conclusion Conclusion Visible face of this talk BNs = powerful tool for knowledge representation and reasoning ⇒ interest in using structure learning algorithms for knowledge discovery BN structure learning is NP-hard, even for ”usual” BN/data but we want to learn more and more complex models with more and more complex data ⇒ many works in progress in order to develop such learning algorithms Hidden face of this talk Philippe Leray Advances in Learning with Bayesian Networks 31/32
  • 65. BN learning Dynamic BN learning Relational BN learning Conclusion Conclusion Visible face of this talk BNs = powerful tool for knowledge representation and reasoning ⇒ interest in using structure learning algorithms for knowledge discovery BN structure learning is NP-hard, even for ”usual” BN/data but we want to learn more and more complex models with more and more complex data ⇒ many works in progress in order to develop such learning algorithms Hidden face of this talk Philippe Leray Advances in Learning with Bayesian Networks 31/32
  • 66. BN learning Dynamic BN learning Relational BN learning Conclusion Conclusion Visible face of this talk Hidden face of this talk BN learning and causality : causal discovery BN learning tools : no unified programming tools, often limited to simple BN models / simple data ⇒ coming soon : PILGRIM our GPL platform in C++, dealing with BN, DBN, RBN, incremental data, ... BN versus other probabilistic graphical models : Qualitative probabilistic models, Markov random fields, Conditional random fields, Deep belief networks, ... Philippe Leray Advances in Learning with Bayesian Networks 31/32
  • 67. BN learning Dynamic BN learning Relational BN learning Conclusion Conclusion Visible face of this talk Hidden face of this talk BN learning and causality : causal discovery BN learning tools : no unified programming tools, often limited to simple BN models / simple data ⇒ coming soon : PILGRIM our GPL platform in C++, dealing with BN, DBN, RBN, incremental data, ... BN versus other probabilistic graphical models : Qualitative probabilistic models, Markov random fields, Conditional random fields, Deep belief networks, ... Philippe Leray Advances in Learning with Bayesian Networks 31/32
  • 68. BN learning Dynamic BN learning Relational BN learning Conclusion Conclusion Visible face of this talk Hidden face of this talk BN learning and causality : causal discovery BN learning tools : no unified programming tools, often limited to simple BN models / simple data ⇒ coming soon : PILGRIM our GPL platform in C++, dealing with BN, DBN, RBN, incremental data, ... BN versus other probabilistic graphical models : Qualitative probabilistic models, Markov random fields, Conditional random fields, Deep belief networks, ... Philippe Leray Advances in Learning with Bayesian Networks 31/32
  • 69. BN learning Dynamic BN learning Relational BN learning Conclusion References One starting point [Koller & Friedman, 09] Probabilistic Graphical Models: Principles and Techniques. MIT Press. Our publications http://tinyurl.com/PhLeray Thank you for your attention Philippe Leray Advances in Learning with Bayesian Networks 32/32
  • 70. BN learning Dynamic BN learning Relational BN learning Conclusion References One starting point [Koller & Friedman, 09] Probabilistic Graphical Models: Principles and Techniques. MIT Press. Our publications http://tinyurl.com/PhLeray Thank you for your attention Philippe Leray Advances in Learning with Bayesian Networks 32/32