SlideShare a Scribd company logo
1 of 40
Download to read offline
Kernel Methods and Relational Learning in
                  Bioinformatics

                                  ir. Michiel Stock
                               Dr. Willem Waegeman
                             Prof. dr. Bernard De Baets

                             Faculty of Bioscience Engineering
                                     Ghent University


                                   November 2012




                                      KERMIT



ir. Michiel Stock (KERMIT)         Kernels for Bioinformatics    November 2012   1 / 40
Outline


1    Introduction

2    Kernel methods

3    Learning relations

4    Case studies
      Enzyme function prediction
      Protein-ligand interactions
      Microbial ecology

5    Conclusions



    ir. Michiel Stock (KERMIT)   Kernels for Bioinformatics   November 2012   2 / 40
Introduction


Introductory example

Problem statement
Predict protein-protein interactions based on high-throughput data.
        Based on a gold standard
        Typical features that can be
        used:
               Yeast two-hybrid
               Pfam profile
               Phylogenetic profile
               Localization
               PSI-BLAST
               Expression
               ...

  ir. Michiel Stock (KERMIT)       Kernels for Bioinformatics   November 2012   3 / 40
Introduction


Machine learning is widelyagaused in bioinformatics
      88               Larran‹ et al.




                                                                                                                    Downloaded from bib.oxfordjournals.org at Biomedische Bibliotheek o
               Figure 1: Classification of the topics where machine learning methods are applied.
 ir. Michiel Stock (KERMIT)                             Kernels for Bioinformatics                  November 2012                                                                         4 / 40
Introduction


Bioinformatics deals with complex data


Bioinformatics data is typically:
               in large dimension (e.g., microarrays or proteomics data)
            structured (e.g., gene sequences, small molecules, interaction
      networks, phylogenetic trees...)
            heterogeneous (e.g., vectors, sequences, graphs to describe
      the same protein)
            in large quantities (e.g., more than 106 known protein
      sequences)
               noisy (e.g., many features are not relevant)




  ir. Michiel Stock (KERMIT)     Kernels for Bioinformatics   November 2012   5 / 40
Kernel methods


Formal definition of a kernel

Kernels are non-linear functions defined over objects x ∈ X .
Definition
A function k : X × X → R is called a positive definite kernel if it is
symmetric, that is, k(x, x ) = k(x , x) for any two objects x, x ∈ X , and
positive semi-definite, that is,
                               N     N
                                         ci cj k(xi , xj ) ≥ 0
                               i=1 j=1

for any N > 0, any choice of N objects x1 , . . . , xN ∈ X , and any choice of
real numbers c1 , . . . , cN ∈ R.

Can be seen as generalized covariances.


  ir. Michiel Stock (KERMIT)        Kernels for Bioinformatics   November 2012   6 / 40
Kernel methods


Interpretation of kernels

    Suppose an object x has an
    implicit feature representation
    φ(x) ∈ F.
    A kernel function can be seen
    as a dot product in this
    feature space:                                                                X              F

         k(x, x ) = φ(x), φ(x )
                                                                                                     h (x), (x0 )i
                                                                                      k

    Linear models in this feature
    space F can be made:
                                                             dinsdag, 10 april 2012




                        T
          y (x) = w φ(x)
                 =            an k(xn , x)
                       n

 ir. Michiel Stock (KERMIT)               Kernels for Bioinformatics                      November 2012              7 / 40
Kernel methods


Many kernel methods exist
                                                           SVM
       Examples of popular kernel
       methods:
              Support vector machine
              (SVM)
              Regularized least squares
              (RLS)
              Kernel principal                             KPCA
              component analysis
              (KPCA)
       Learning algorithm is
       independent of the kernel
       representation!


 ir. Michiel Stock (KERMIT)       Kernels for Bioinformatics      November 2012   8 / 40
Kernel methods


Kernels for (protein) sequences

Spectrum kernel (SK)
The SK considers the number of k-mers m two sequences si and sj have in
common.


       SKk (si , sj ) =              N(m, si )∗N(m, sj )
                              m∈Σk

       with N(m, s) the number of k-mers
       m in sequence s.
              To predict structure, function...
              of DNA, RNA or proteins.
              A discriminative alternative for
              Hidden Markov Models.

 ir. Michiel Stock (KERMIT)               Kernels for Bioinformatics   November 2012   9 / 40
Kernel methods


Kernels for graphs (1)
Graph
Graphs are a set of interconnected objects, called vertices (or nodes), that
are connected through edges.

Graphs can show the structure of an object or interactions between
different objects.




                         Graph are important in bioinformatics!
  ir. Michiel Stock (KERMIT)         Kernels for Bioinformatics   November 2012   10 / 40
Kernel methods


Kernels for graphs (2)

Graph kernel
Constructing a similarity between graphs.
                                                    In chemoinformatics:


    Based on performing a
    random walk on both graphs
    and counting the number of                      In structural bioinformatics:
    matching walks.
    Usually very computationally
    demanding!



                                                                A
 ir. Michiel Stock (KERMIT)       Kernels for Bioinformatics          November 2012   11 / 40
Kernel methods


Kernels for graphs (3)

Diffusion kernel
Constructing a similarity between vertices within the same graph.

    Also based on performing a
    random walk on a graph.
    Captures the long-range
    relationships between
    vertices.
    Inspired by the heat
    equation. The kernel
    quantifies how quickly ‘heat’
    can spread from one node to
    another.


 ir. Michiel Stock (KERMIT)       Kernels for Bioinformatics   November 2012   12 / 40
Kernel methods


Kernels for fingerprints


                                                    Fingerprint representation of
    Objects that can be described                   an object:
    by a long binary vector x can
    be represented by the
    Tanimoto kernel:

      KTan (xm , xn ) =
                 xm , xn
                                 .
     xm , xm + xn , xn − xm , xn




 ir. Michiel Stock (KERMIT)       Kernels for Bioinformatics          November 2012   13 / 40
Learning relations


Kernels for pairs of objects


Problem statement
Predict the binding interaction between a given protein and a ligand
(small molecule). Learning Molecular docking.

        The problem deals with two
        types of objects:
               Proteins (graph kernel of
               structure, sequence
               kernel, fingerprints...)
               Ligand (fingerprints,
               graph kernel...)
        Label is for a pair of objects.


  ir. Michiel Stock (KERMIT)         Kernels for Bioinformatics   November 2012   14 / 40
Learning relations
ng and Ranking Algorithms for Bioinformatics
 example: pairs of objects
  Kernels for
        Applications
nomicsWillem Waegeman, Bernard De Baets
 Michiel Stock,
    Pairwise kernel
IT, Department of Mathematical Modelling, Statistics and Bioinformatics
of Combine the kernel matrices of the individual the process of druga kernel
   proteins and a database of ligands to aid objects to construct
istical model based objects.
   matrix for pairs of on a data set. Kernel methods allow for the
roductory example: chemogenomics
tein and a from individual kernels for the proteins and ligands:
   Starting ligand.
ding interactions between a set of proteins and a database of ligands to aid the process of drug
to model pairwise relations between different types of objects.
s
                                                     Data set                            Object kernels

                                       ( , )
                          By optimizing a ranking loss, our algorithms can also be used for
                                       ( , ) as shown on the right.
                          conditional ranking,
                                          ( , )
                                                                                  SVM
                          In short, our framework is ideally                suited for bioinformatics
                                                                                   RLS
                                             ...




                          challenges:
                                          ( , )
                            - efficient learning process
                                          ( , )                                      ...
                            - can handle complex objects (graphs, trees, sequences...)
                                                   Pairwise kernel
                            - ability to deal with information retrieval problems
                  Object kernels                                               Learning algorithm

 gorithms can also be used for
       ir. Michiel Stock (KERMIT)              Kernels for Bioinformatics         November 2012     15 / 40
( , )          Learning relations
                                                           SVM
  Conditional ranking (1)                                   RLS
                          ...
  Motivation( , )
  Suppose one is not )                                          ...
             ( , particularly interested in the exact value of the
  interaction but in the order of the proteins for a given ligand.
                                     Pairwise kernel
rnels                                                                                    Learning algorithm

ed for                                More relevant




                                                                                               More relevant
matics
                  Query 1                                                      Query 2




                                           Database objects
        ir. Michiel Stock (KERMIT)                    Kernels for Bioinformatics           November 2012       16 / 40
Learning relations


Conditional ranking (2)

       Based on a graph description,
       with e a pair of objects.
       Train the model:

       h(e) =< w, Φ(e) >=                   ae K Φ (e, e )
                                                       ¯
                                      e∈E

       using the algorithm:
                                                      2
       A(T ) = argmin L(h, T )+λ h                    H.
                      h∈H
                                                                       Figure 1 Example of a multi-graph. If this graph, on the left, would be used fo
                                                                       conditioned on C, then A scores better than E, which ranks higher than E, w
       Where we use a ranking loss:                                    higher than D and D ranks higher than B. There is no information about the re
                                                                       and G, respectively, our model could be used to include these two instances in
                                                                       are available. Notice that in this setting unconditional ranking of these objects
                                                                       graph is obviously intransitive. Figure reproduced from (Pahikkala et al., 2010).

       L(h, T ) =                    (ye −ye −h(e)+h(¯))2 .
                                           ¯         e
                                                                  The proposed framework is based on the Kronecker product ke
                       v ∈V e,¯∈Ev
                              e                                   implicit joint feature representations of queries and the sets of ob
                                                                  Exactly this kernel construction will allow a straightforward
                                                                  existing framework to dyadic relations and multi-task l
                                                                  (Objectives 1 and 2). It has been proposed independently by three
                                                                  modeling pairwise inputs in different application domains (Basilico
 ir. Michiel Stock (KERMIT)            Kernels for Bioinformatics et al. 2004, Ben-Hur et al. November a2012
                                                                                              2005). From different perspective, it h
                                                                                                                         17 / 40
Case studies    Enzyme function prediction


Predicting enzyme function

Problem statement
Predict the function (EC number) of an enzyme using structural
information of the active site.
      Data:                               active site of an
           1730 enzymes with 21           enzyme:
           different functions
           four different structural
           similarities
                     CavBase
                     maximum common
                     subgraph
                     labeled point cloud
                     superposition
                     fingerprints

 ir. Michiel Stock (KERMIT)         Kernels for Bioinformatics                  November 2012   18 / 40
Case studies    Enzyme function prediction


EC numbers

EC number
A functional label of an enzyme, based on the reaction that is catalyzed.

Example: EC 2.7.6.1 = ribose-phosphate diphosphokinase




  ir. Michiel Stock (KERMIT)    Kernels for Bioinformatics                  November 2012   19 / 40
Case studies       Enzyme function prediction


Defining catalytic similarity
Catalytic similarity
The catalytic similarity is the number of successive equal digits in the EC
number between two enzymes, starting from the first digit.




                                     0        EC 2.7.7.34
                                                                                  EC ?.?.?.?
                                               3                           2
                                                             0
                                          1
                       EC 4.2.3.90
                                                                            0
                                                     0
                                     0
                                                             EC 4.6.1.11

                                                                   2
                                                                                    EC 2.7.1.12
                                         EC 2.7.7.12


  ir. Michiel Stock (KERMIT)                   Kernels for Bioinformatics                      November 2012   20 / 40
Bioinformatics kernels relations
Bioinformatics kernels relations
Bioinformatics kernels relations
Bioinformatics kernels relations
Bioinformatics kernels relations
Bioinformatics kernels relations
Bioinformatics kernels relations
Bioinformatics kernels relations
Bioinformatics kernels relations
Bioinformatics kernels relations
Bioinformatics kernels relations
Bioinformatics kernels relations
Bioinformatics kernels relations
Bioinformatics kernels relations
Bioinformatics kernels relations
Bioinformatics kernels relations
Bioinformatics kernels relations
Bioinformatics kernels relations
Bioinformatics kernels relations
Bioinformatics kernels relations

More Related Content

What's hot

Multilabel Image Annotation using Multimodal Analysis
Multilabel Image Annotation using Multimodal AnalysisMultilabel Image Annotation using Multimodal Analysis
Multilabel Image Annotation using Multimodal Analysisijtsrd
 
X trepan an extended trepan for
X trepan an extended trepan forX trepan an extended trepan for
X trepan an extended trepan forijaia
 
Analysis of Neocognitron of Neural Network Method in the String Recognition
Analysis of Neocognitron of Neural Network Method in the String RecognitionAnalysis of Neocognitron of Neural Network Method in the String Recognition
Analysis of Neocognitron of Neural Network Method in the String RecognitionIDES Editor
 
A R T I F I C I A L N E U R A L N E T W O R K S J N T U M O D E L P A P ...
A R T I F I C I A L  N E U R A L  N E T W O R K S  J N T U  M O D E L  P A P ...A R T I F I C I A L  N E U R A L  N E T W O R K S  J N T U  M O D E L  P A P ...
A R T I F I C I A L N E U R A L N E T W O R K S J N T U M O D E L P A P ...guest3f9c6b
 
A simple framework for contrastive learning of visual representations
A simple framework for contrastive learning of visual representationsA simple framework for contrastive learning of visual representations
A simple framework for contrastive learning of visual representationsDevansh16
 
Spectra of Large Network
Spectra of Large NetworkSpectra of Large Network
Spectra of Large NetworkDavid Gleich
 
Computational model for artificial learning using formal concept analysis
Computational model for artificial learning using formal concept analysisComputational model for artificial learning using formal concept analysis
Computational model for artificial learning using formal concept analysisAboul Ella Hassanien
 
Extremely Low Bit Transformer Quantization for On-Device NMT
Extremely Low Bit Transformer Quantization for On-Device NMTExtremely Low Bit Transformer Quantization for On-Device NMT
Extremely Low Bit Transformer Quantization for On-Device NMTInsoo Chung
 
Random Neural Network (Erol) by Engr. Edgar Carrillo II
Random Neural Network (Erol) by Engr. Edgar Carrillo IIRandom Neural Network (Erol) by Engr. Edgar Carrillo II
Random Neural Network (Erol) by Engr. Edgar Carrillo IIEdgar Carrillo
 
Cerebellar Model Controller with new Model of Granule Cell-golgi Cell Buildi...
Cerebellar Model Controller with new Model of Granule  Cell-golgi Cell Buildi...Cerebellar Model Controller with new Model of Granule  Cell-golgi Cell Buildi...
Cerebellar Model Controller with new Model of Granule Cell-golgi Cell Buildi...IJECEIAES
 
Algorithmic Information Theory and Computational Biology
Algorithmic Information Theory and Computational BiologyAlgorithmic Information Theory and Computational Biology
Algorithmic Information Theory and Computational BiologyHector Zenil
 
SCALING THE HTM SPATIAL POOLER
SCALING THE HTM SPATIAL POOLERSCALING THE HTM SPATIAL POOLER
SCALING THE HTM SPATIAL POOLERijaia
 
Robust Ensemble Classifier Combination Based on Noise Removal with One-Class SVM
Robust Ensemble Classifier Combination Based on Noise Removal with One-Class SVMRobust Ensemble Classifier Combination Based on Noise Removal with One-Class SVM
Robust Ensemble Classifier Combination Based on Noise Removal with One-Class SVMFerhat Ozgur Catak
 
A Literature Survey: Neural Networks for object detection
A Literature Survey: Neural Networks for object detectionA Literature Survey: Neural Networks for object detection
A Literature Survey: Neural Networks for object detectionvivatechijri
 
Secure Multi-Party Computation Based Privacy Preserving Extreme Learning Mach...
Secure Multi-Party Computation Based Privacy Preserving Extreme Learning Mach...Secure Multi-Party Computation Based Privacy Preserving Extreme Learning Mach...
Secure Multi-Party Computation Based Privacy Preserving Extreme Learning Mach...Ferhat Ozgur Catak
 
X-TREPAN : A Multi Class Regression and Adapted Extraction of Comprehensible ...
X-TREPAN : A Multi Class Regression and Adapted Extraction of Comprehensible ...X-TREPAN : A Multi Class Regression and Adapted Extraction of Comprehensible ...
X-TREPAN : A Multi Class Regression and Adapted Extraction of Comprehensible ...csandit
 

What's hot (17)

Multilabel Image Annotation using Multimodal Analysis
Multilabel Image Annotation using Multimodal AnalysisMultilabel Image Annotation using Multimodal Analysis
Multilabel Image Annotation using Multimodal Analysis
 
X trepan an extended trepan for
X trepan an extended trepan forX trepan an extended trepan for
X trepan an extended trepan for
 
Analysis of Neocognitron of Neural Network Method in the String Recognition
Analysis of Neocognitron of Neural Network Method in the String RecognitionAnalysis of Neocognitron of Neural Network Method in the String Recognition
Analysis of Neocognitron of Neural Network Method in the String Recognition
 
A R T I F I C I A L N E U R A L N E T W O R K S J N T U M O D E L P A P ...
A R T I F I C I A L  N E U R A L  N E T W O R K S  J N T U  M O D E L  P A P ...A R T I F I C I A L  N E U R A L  N E T W O R K S  J N T U  M O D E L  P A P ...
A R T I F I C I A L N E U R A L N E T W O R K S J N T U M O D E L P A P ...
 
Neural Networks: Introducton
Neural Networks: IntroductonNeural Networks: Introducton
Neural Networks: Introducton
 
A simple framework for contrastive learning of visual representations
A simple framework for contrastive learning of visual representationsA simple framework for contrastive learning of visual representations
A simple framework for contrastive learning of visual representations
 
Spectra of Large Network
Spectra of Large NetworkSpectra of Large Network
Spectra of Large Network
 
Computational model for artificial learning using formal concept analysis
Computational model for artificial learning using formal concept analysisComputational model for artificial learning using formal concept analysis
Computational model for artificial learning using formal concept analysis
 
Extremely Low Bit Transformer Quantization for On-Device NMT
Extremely Low Bit Transformer Quantization for On-Device NMTExtremely Low Bit Transformer Quantization for On-Device NMT
Extremely Low Bit Transformer Quantization for On-Device NMT
 
Random Neural Network (Erol) by Engr. Edgar Carrillo II
Random Neural Network (Erol) by Engr. Edgar Carrillo IIRandom Neural Network (Erol) by Engr. Edgar Carrillo II
Random Neural Network (Erol) by Engr. Edgar Carrillo II
 
Cerebellar Model Controller with new Model of Granule Cell-golgi Cell Buildi...
Cerebellar Model Controller with new Model of Granule  Cell-golgi Cell Buildi...Cerebellar Model Controller with new Model of Granule  Cell-golgi Cell Buildi...
Cerebellar Model Controller with new Model of Granule Cell-golgi Cell Buildi...
 
Algorithmic Information Theory and Computational Biology
Algorithmic Information Theory and Computational BiologyAlgorithmic Information Theory and Computational Biology
Algorithmic Information Theory and Computational Biology
 
SCALING THE HTM SPATIAL POOLER
SCALING THE HTM SPATIAL POOLERSCALING THE HTM SPATIAL POOLER
SCALING THE HTM SPATIAL POOLER
 
Robust Ensemble Classifier Combination Based on Noise Removal with One-Class SVM
Robust Ensemble Classifier Combination Based on Noise Removal with One-Class SVMRobust Ensemble Classifier Combination Based on Noise Removal with One-Class SVM
Robust Ensemble Classifier Combination Based on Noise Removal with One-Class SVM
 
A Literature Survey: Neural Networks for object detection
A Literature Survey: Neural Networks for object detectionA Literature Survey: Neural Networks for object detection
A Literature Survey: Neural Networks for object detection
 
Secure Multi-Party Computation Based Privacy Preserving Extreme Learning Mach...
Secure Multi-Party Computation Based Privacy Preserving Extreme Learning Mach...Secure Multi-Party Computation Based Privacy Preserving Extreme Learning Mach...
Secure Multi-Party Computation Based Privacy Preserving Extreme Learning Mach...
 
X-TREPAN : A Multi Class Regression and Adapted Extraction of Comprehensible ...
X-TREPAN : A Multi Class Regression and Adapted Extraction of Comprehensible ...X-TREPAN : A Multi Class Regression and Adapted Extraction of Comprehensible ...
X-TREPAN : A Multi Class Regression and Adapted Extraction of Comprehensible ...
 

Viewers also liked

Viewers also liked (6)

Cb08 gonzález jesús
Cb08 gonzález jesúsCb08 gonzález jesús
Cb08 gonzález jesús
 
EasyGene oligo factory
EasyGene oligo factoryEasyGene oligo factory
EasyGene oligo factory
 
Bioinformatics in Gene Research
Bioinformatics in Gene ResearchBioinformatics in Gene Research
Bioinformatics in Gene Research
 
Appli bioinfo
Appli bioinfoAppli bioinfo
Appli bioinfo
 
Primer Designing
Primer DesigningPrimer Designing
Primer Designing
 
PCR Primer desining
PCR Primer desiningPCR Primer desining
PCR Primer desining
 

Similar to Bioinformatics kernels relations

Kernel Methods and Relational Learning in Computational Biology
Kernel Methods and Relational Learning in Computational BiologyKernel Methods and Relational Learning in Computational Biology
Kernel Methods and Relational Learning in Computational BiologyMichiel Stock
 
Reproducible research in molecular biophysics and structural biology
Reproducible research in molecular biophysics and structural biologyReproducible research in molecular biophysics and structural biology
Reproducible research in molecular biophysics and structural biologykhinsen
 
download
downloaddownload
downloadbutest
 
download
downloaddownload
downloadbutest
 
Xin Yao: "What can evolutionary computation do for you?"
Xin Yao: "What can evolutionary computation do for you?"Xin Yao: "What can evolutionary computation do for you?"
Xin Yao: "What can evolutionary computation do for you?"ieee_cis_cyprus
 
SBML (the Systems Biology Markup Language), model databases, and other resources
SBML (the Systems Biology Markup Language), model databases, and other resourcesSBML (the Systems Biology Markup Language), model databases, and other resources
SBML (the Systems Biology Markup Language), model databases, and other resourcesMike Hucka
 
Big Data Intelligence: from Correlation Discovery to Causal Reasoning
Big Data Intelligence: from Correlation Discovery to Causal Reasoning Big Data Intelligence: from Correlation Discovery to Causal Reasoning
Big Data Intelligence: from Correlation Discovery to Causal Reasoning Wanjin Yu
 
Presentation on Machine Learning and Data Mining
Presentation on Machine Learning and Data MiningPresentation on Machine Learning and Data Mining
Presentation on Machine Learning and Data Miningbutest
 
Deep learning: Cutting through the Myths and Hype
Deep learning: Cutting through the Myths and HypeDeep learning: Cutting through the Myths and Hype
Deep learning: Cutting through the Myths and HypeSiby Jose Plathottam
 
Molecular autoencoder
Molecular autoencoderMolecular autoencoder
Molecular autoencoderDan Elton
 
Kernel methods in machine learning
Kernel methods in machine learningKernel methods in machine learning
Kernel methods in machine learningbutest
 
Kernel methods for data integration in systems biology
Kernel methods for data integration in systems biology Kernel methods for data integration in systems biology
Kernel methods for data integration in systems biology tuxette
 
When The New Science Is In The Outliers
When The New Science Is In The OutliersWhen The New Science Is In The Outliers
When The New Science Is In The Outliersaimsnist
 
Continuous Unsupervised Training of Deep Architectures
Continuous Unsupervised Training of Deep ArchitecturesContinuous Unsupervised Training of Deep Architectures
Continuous Unsupervised Training of Deep ArchitecturesVincenzo Lomonaco
 
(Talk in Powerpoint Format)
(Talk in Powerpoint Format)(Talk in Powerpoint Format)
(Talk in Powerpoint Format)butest
 
A Survey of Deep Learning Algorithms for Malware Detection
A Survey of Deep Learning Algorithms for Malware DetectionA Survey of Deep Learning Algorithms for Malware Detection
A Survey of Deep Learning Algorithms for Malware DetectionIJCSIS Research Publications
 
CI image processing mns
CI image processing mnsCI image processing mns
CI image processing mnsMeenakshi Sood
 
Introduction to artificial intelligence technique
Introduction to artificial intelligence technique Introduction to artificial intelligence technique
Introduction to artificial intelligence technique Aishwarya Phalke
 

Similar to Bioinformatics kernels relations (20)

Kernel Methods and Relational Learning in Computational Biology
Kernel Methods and Relational Learning in Computational BiologyKernel Methods and Relational Learning in Computational Biology
Kernel Methods and Relational Learning in Computational Biology
 
Reproducible research in molecular biophysics and structural biology
Reproducible research in molecular biophysics and structural biologyReproducible research in molecular biophysics and structural biology
Reproducible research in molecular biophysics and structural biology
 
download
downloaddownload
download
 
download
downloaddownload
download
 
Xin Yao: "What can evolutionary computation do for you?"
Xin Yao: "What can evolutionary computation do for you?"Xin Yao: "What can evolutionary computation do for you?"
Xin Yao: "What can evolutionary computation do for you?"
 
SBML (the Systems Biology Markup Language), model databases, and other resources
SBML (the Systems Biology Markup Language), model databases, and other resourcesSBML (the Systems Biology Markup Language), model databases, and other resources
SBML (the Systems Biology Markup Language), model databases, and other resources
 
Big Data Intelligence: from Correlation Discovery to Causal Reasoning
Big Data Intelligence: from Correlation Discovery to Causal Reasoning Big Data Intelligence: from Correlation Discovery to Causal Reasoning
Big Data Intelligence: from Correlation Discovery to Causal Reasoning
 
Presentation on Machine Learning and Data Mining
Presentation on Machine Learning and Data MiningPresentation on Machine Learning and Data Mining
Presentation on Machine Learning and Data Mining
 
Deep learning: Cutting through the Myths and Hype
Deep learning: Cutting through the Myths and HypeDeep learning: Cutting through the Myths and Hype
Deep learning: Cutting through the Myths and Hype
 
Molecular autoencoder
Molecular autoencoderMolecular autoencoder
Molecular autoencoder
 
Kernel methods in machine learning
Kernel methods in machine learningKernel methods in machine learning
Kernel methods in machine learning
 
Kernel methods for data integration in systems biology
Kernel methods for data integration in systems biology Kernel methods for data integration in systems biology
Kernel methods for data integration in systems biology
 
When The New Science Is In The Outliers
When The New Science Is In The OutliersWhen The New Science Is In The Outliers
When The New Science Is In The Outliers
 
Continuous Unsupervised Training of Deep Architectures
Continuous Unsupervised Training of Deep ArchitecturesContinuous Unsupervised Training of Deep Architectures
Continuous Unsupervised Training of Deep Architectures
 
JOSA TechTalks - Machine Learning in Practice
JOSA TechTalks - Machine Learning in PracticeJOSA TechTalks - Machine Learning in Practice
JOSA TechTalks - Machine Learning in Practice
 
(Talk in Powerpoint Format)
(Talk in Powerpoint Format)(Talk in Powerpoint Format)
(Talk in Powerpoint Format)
 
A Survey of Deep Learning Algorithms for Malware Detection
A Survey of Deep Learning Algorithms for Malware DetectionA Survey of Deep Learning Algorithms for Malware Detection
A Survey of Deep Learning Algorithms for Malware Detection
 
CI image processing mns
CI image processing mnsCI image processing mns
CI image processing mns
 
Introduction to artificial intelligence technique
Introduction to artificial intelligence technique Introduction to artificial intelligence technique
Introduction to artificial intelligence technique
 
E04423133
E04423133E04423133
E04423133
 

More from Michiel Stock

Wiskunde voor Waterbeheer
Wiskunde voor WaterbeheerWiskunde voor Waterbeheer
Wiskunde voor WaterbeheerMichiel Stock
 
How the mathematics behind Netflix will save the world
How the mathematics behind Netflix will save the worldHow the mathematics behind Netflix will save the world
How the mathematics behind Netflix will save the worldMichiel Stock
 
Disentangling ecological networks using graph embedding methods
Disentangling ecological networks using graph embedding methodsDisentangling ecological networks using graph embedding methods
Disentangling ecological networks using graph embedding methodsMichiel Stock
 
2018 presentation montréal_handouts
2018 presentation montréal_handouts2018 presentation montréal_handouts
2018 presentation montréal_handoutsMichiel Stock
 
A tour in optimal transport
A tour in optimal transportA tour in optimal transport
A tour in optimal transportMichiel Stock
 
Pairwise Learning for Synthetic Biology
Pairwise Learning for Synthetic BiologyPairwise Learning for Synthetic Biology
Pairwise Learning for Synthetic BiologyMichiel Stock
 
PhD defence pairwise learning
PhD defence pairwise learningPhD defence pairwise learning
PhD defence pairwise learningMichiel Stock
 
Bioscience engineering together: participating at iGEM
Bioscience engineering together: participating at iGEMBioscience engineering together: participating at iGEM
Bioscience engineering together: participating at iGEMMichiel Stock
 
Exact and efficient top-K inference for multi-target prediction by querying s...
Exact and efficient top-K inference for multi-target prediction by querying s...Exact and efficient top-K inference for multi-target prediction by querying s...
Exact and efficient top-K inference for multi-target prediction by querying s...Michiel Stock
 
Poster genome engineering & Synthetic Biology 2016
Poster genome engineering & Synthetic Biology 2016Poster genome engineering & Synthetic Biology 2016
Poster genome engineering & Synthetic Biology 2016Michiel Stock
 
A two-step method to incorporate task features for large output spaces
A two-step method to incorporate task features for large output spacesA two-step method to incorporate task features for large output spaces
A two-step method to incorporate task features for large output spacesMichiel Stock
 
Enzyme Annotation using Conditional Ranking Algorithms
Enzyme Annotation using Conditional Ranking AlgorithmsEnzyme Annotation using Conditional Ranking Algorithms
Enzyme Annotation using Conditional Ranking AlgorithmsMichiel Stock
 
A Kernel Based Framework for Predicting Interactions Between Methanotrophs an...
A Kernel Based Framework for Predicting Interactions Between Methanotrophs an...A Kernel Based Framework for Predicting Interactions Between Methanotrophs an...
A Kernel Based Framework for Predicting Interactions Between Methanotrophs an...Michiel Stock
 

More from Michiel Stock (13)

Wiskunde voor Waterbeheer
Wiskunde voor WaterbeheerWiskunde voor Waterbeheer
Wiskunde voor Waterbeheer
 
How the mathematics behind Netflix will save the world
How the mathematics behind Netflix will save the worldHow the mathematics behind Netflix will save the world
How the mathematics behind Netflix will save the world
 
Disentangling ecological networks using graph embedding methods
Disentangling ecological networks using graph embedding methodsDisentangling ecological networks using graph embedding methods
Disentangling ecological networks using graph embedding methods
 
2018 presentation montréal_handouts
2018 presentation montréal_handouts2018 presentation montréal_handouts
2018 presentation montréal_handouts
 
A tour in optimal transport
A tour in optimal transportA tour in optimal transport
A tour in optimal transport
 
Pairwise Learning for Synthetic Biology
Pairwise Learning for Synthetic BiologyPairwise Learning for Synthetic Biology
Pairwise Learning for Synthetic Biology
 
PhD defence pairwise learning
PhD defence pairwise learningPhD defence pairwise learning
PhD defence pairwise learning
 
Bioscience engineering together: participating at iGEM
Bioscience engineering together: participating at iGEMBioscience engineering together: participating at iGEM
Bioscience engineering together: participating at iGEM
 
Exact and efficient top-K inference for multi-target prediction by querying s...
Exact and efficient top-K inference for multi-target prediction by querying s...Exact and efficient top-K inference for multi-target prediction by querying s...
Exact and efficient top-K inference for multi-target prediction by querying s...
 
Poster genome engineering & Synthetic Biology 2016
Poster genome engineering & Synthetic Biology 2016Poster genome engineering & Synthetic Biology 2016
Poster genome engineering & Synthetic Biology 2016
 
A two-step method to incorporate task features for large output spaces
A two-step method to incorporate task features for large output spacesA two-step method to incorporate task features for large output spaces
A two-step method to incorporate task features for large output spaces
 
Enzyme Annotation using Conditional Ranking Algorithms
Enzyme Annotation using Conditional Ranking AlgorithmsEnzyme Annotation using Conditional Ranking Algorithms
Enzyme Annotation using Conditional Ranking Algorithms
 
A Kernel Based Framework for Predicting Interactions Between Methanotrophs an...
A Kernel Based Framework for Predicting Interactions Between Methanotrophs an...A Kernel Based Framework for Predicting Interactions Between Methanotrophs an...
A Kernel Based Framework for Predicting Interactions Between Methanotrophs an...
 

Bioinformatics kernels relations

  • 1. Kernel Methods and Relational Learning in Bioinformatics ir. Michiel Stock Dr. Willem Waegeman Prof. dr. Bernard De Baets Faculty of Bioscience Engineering Ghent University November 2012 KERMIT ir. Michiel Stock (KERMIT) Kernels for Bioinformatics November 2012 1 / 40
  • 2. Outline 1 Introduction 2 Kernel methods 3 Learning relations 4 Case studies Enzyme function prediction Protein-ligand interactions Microbial ecology 5 Conclusions ir. Michiel Stock (KERMIT) Kernels for Bioinformatics November 2012 2 / 40
  • 3. Introduction Introductory example Problem statement Predict protein-protein interactions based on high-throughput data. Based on a gold standard Typical features that can be used: Yeast two-hybrid Pfam profile Phylogenetic profile Localization PSI-BLAST Expression ... ir. Michiel Stock (KERMIT) Kernels for Bioinformatics November 2012 3 / 40
  • 4. Introduction Machine learning is widelyagaused in bioinformatics 88 Larran‹ et al. Downloaded from bib.oxfordjournals.org at Biomedische Bibliotheek o Figure 1: Classification of the topics where machine learning methods are applied. ir. Michiel Stock (KERMIT) Kernels for Bioinformatics November 2012 4 / 40
  • 5. Introduction Bioinformatics deals with complex data Bioinformatics data is typically: in large dimension (e.g., microarrays or proteomics data) structured (e.g., gene sequences, small molecules, interaction networks, phylogenetic trees...) heterogeneous (e.g., vectors, sequences, graphs to describe the same protein) in large quantities (e.g., more than 106 known protein sequences) noisy (e.g., many features are not relevant) ir. Michiel Stock (KERMIT) Kernels for Bioinformatics November 2012 5 / 40
  • 6. Kernel methods Formal definition of a kernel Kernels are non-linear functions defined over objects x ∈ X . Definition A function k : X × X → R is called a positive definite kernel if it is symmetric, that is, k(x, x ) = k(x , x) for any two objects x, x ∈ X , and positive semi-definite, that is, N N ci cj k(xi , xj ) ≥ 0 i=1 j=1 for any N > 0, any choice of N objects x1 , . . . , xN ∈ X , and any choice of real numbers c1 , . . . , cN ∈ R. Can be seen as generalized covariances. ir. Michiel Stock (KERMIT) Kernels for Bioinformatics November 2012 6 / 40
  • 7. Kernel methods Interpretation of kernels Suppose an object x has an implicit feature representation φ(x) ∈ F. A kernel function can be seen as a dot product in this feature space: X F k(x, x ) = φ(x), φ(x ) h (x), (x0 )i k Linear models in this feature space F can be made: dinsdag, 10 april 2012 T y (x) = w φ(x) = an k(xn , x) n ir. Michiel Stock (KERMIT) Kernels for Bioinformatics November 2012 7 / 40
  • 8. Kernel methods Many kernel methods exist SVM Examples of popular kernel methods: Support vector machine (SVM) Regularized least squares (RLS) Kernel principal KPCA component analysis (KPCA) Learning algorithm is independent of the kernel representation! ir. Michiel Stock (KERMIT) Kernels for Bioinformatics November 2012 8 / 40
  • 9. Kernel methods Kernels for (protein) sequences Spectrum kernel (SK) The SK considers the number of k-mers m two sequences si and sj have in common. SKk (si , sj ) = N(m, si )∗N(m, sj ) m∈Σk with N(m, s) the number of k-mers m in sequence s. To predict structure, function... of DNA, RNA or proteins. A discriminative alternative for Hidden Markov Models. ir. Michiel Stock (KERMIT) Kernels for Bioinformatics November 2012 9 / 40
  • 10. Kernel methods Kernels for graphs (1) Graph Graphs are a set of interconnected objects, called vertices (or nodes), that are connected through edges. Graphs can show the structure of an object or interactions between different objects. Graph are important in bioinformatics! ir. Michiel Stock (KERMIT) Kernels for Bioinformatics November 2012 10 / 40
  • 11. Kernel methods Kernels for graphs (2) Graph kernel Constructing a similarity between graphs. In chemoinformatics: Based on performing a random walk on both graphs and counting the number of In structural bioinformatics: matching walks. Usually very computationally demanding! A ir. Michiel Stock (KERMIT) Kernels for Bioinformatics November 2012 11 / 40
  • 12. Kernel methods Kernels for graphs (3) Diffusion kernel Constructing a similarity between vertices within the same graph. Also based on performing a random walk on a graph. Captures the long-range relationships between vertices. Inspired by the heat equation. The kernel quantifies how quickly ‘heat’ can spread from one node to another. ir. Michiel Stock (KERMIT) Kernels for Bioinformatics November 2012 12 / 40
  • 13. Kernel methods Kernels for fingerprints Fingerprint representation of Objects that can be described an object: by a long binary vector x can be represented by the Tanimoto kernel: KTan (xm , xn ) = xm , xn . xm , xm + xn , xn − xm , xn ir. Michiel Stock (KERMIT) Kernels for Bioinformatics November 2012 13 / 40
  • 14. Learning relations Kernels for pairs of objects Problem statement Predict the binding interaction between a given protein and a ligand (small molecule). Learning Molecular docking. The problem deals with two types of objects: Proteins (graph kernel of structure, sequence kernel, fingerprints...) Ligand (fingerprints, graph kernel...) Label is for a pair of objects. ir. Michiel Stock (KERMIT) Kernels for Bioinformatics November 2012 14 / 40
  • 15. Learning relations ng and Ranking Algorithms for Bioinformatics example: pairs of objects Kernels for Applications nomicsWillem Waegeman, Bernard De Baets Michiel Stock, Pairwise kernel IT, Department of Mathematical Modelling, Statistics and Bioinformatics of Combine the kernel matrices of the individual the process of druga kernel proteins and a database of ligands to aid objects to construct istical model based objects. matrix for pairs of on a data set. Kernel methods allow for the roductory example: chemogenomics tein and a from individual kernels for the proteins and ligands: Starting ligand. ding interactions between a set of proteins and a database of ligands to aid the process of drug to model pairwise relations between different types of objects. s Data set Object kernels ( , ) By optimizing a ranking loss, our algorithms can also be used for ( , ) as shown on the right. conditional ranking, ( , ) SVM In short, our framework is ideally suited for bioinformatics RLS ... challenges: ( , ) - efficient learning process ( , ) ... - can handle complex objects (graphs, trees, sequences...) Pairwise kernel - ability to deal with information retrieval problems Object kernels Learning algorithm gorithms can also be used for ir. Michiel Stock (KERMIT) Kernels for Bioinformatics November 2012 15 / 40
  • 16. ( , ) Learning relations SVM Conditional ranking (1) RLS ... Motivation( , ) Suppose one is not ) ... ( , particularly interested in the exact value of the interaction but in the order of the proteins for a given ligand. Pairwise kernel rnels Learning algorithm ed for More relevant More relevant matics Query 1 Query 2 Database objects ir. Michiel Stock (KERMIT) Kernels for Bioinformatics November 2012 16 / 40
  • 17. Learning relations Conditional ranking (2) Based on a graph description, with e a pair of objects. Train the model: h(e) =< w, Φ(e) >= ae K Φ (e, e ) ¯ e∈E using the algorithm: 2 A(T ) = argmin L(h, T )+λ h H. h∈H Figure 1 Example of a multi-graph. If this graph, on the left, would be used fo conditioned on C, then A scores better than E, which ranks higher than E, w Where we use a ranking loss: higher than D and D ranks higher than B. There is no information about the re and G, respectively, our model could be used to include these two instances in are available. Notice that in this setting unconditional ranking of these objects graph is obviously intransitive. Figure reproduced from (Pahikkala et al., 2010). L(h, T ) = (ye −ye −h(e)+h(¯))2 . ¯ e The proposed framework is based on the Kronecker product ke v ∈V e,¯∈Ev e implicit joint feature representations of queries and the sets of ob Exactly this kernel construction will allow a straightforward existing framework to dyadic relations and multi-task l (Objectives 1 and 2). It has been proposed independently by three modeling pairwise inputs in different application domains (Basilico ir. Michiel Stock (KERMIT) Kernels for Bioinformatics et al. 2004, Ben-Hur et al. November a2012 2005). From different perspective, it h 17 / 40
  • 18. Case studies Enzyme function prediction Predicting enzyme function Problem statement Predict the function (EC number) of an enzyme using structural information of the active site. Data: active site of an 1730 enzymes with 21 enzyme: different functions four different structural similarities CavBase maximum common subgraph labeled point cloud superposition fingerprints ir. Michiel Stock (KERMIT) Kernels for Bioinformatics November 2012 18 / 40
  • 19. Case studies Enzyme function prediction EC numbers EC number A functional label of an enzyme, based on the reaction that is catalyzed. Example: EC 2.7.6.1 = ribose-phosphate diphosphokinase ir. Michiel Stock (KERMIT) Kernels for Bioinformatics November 2012 19 / 40
  • 20. Case studies Enzyme function prediction Defining catalytic similarity Catalytic similarity The catalytic similarity is the number of successive equal digits in the EC number between two enzymes, starting from the first digit. 0 EC 2.7.7.34 EC ?.?.?.? 3 2 0 1 EC 4.2.3.90 0 0 0 EC 4.6.1.11 2 EC 2.7.1.12 EC 2.7.7.12 ir. Michiel Stock (KERMIT) Kernels for Bioinformatics November 2012 20 / 40