SlideShare une entreprise Scribd logo
1  sur  31
Télécharger pour lire hors ligne
Understanding And
Classifying Metabolite
Space and Metabolite-
Likeness
PLoS One (in press)


   Julio E. Peironcely  @peyron
   Juliopeironcely.com
   PhD student at Leiden University and TNO
Metabolomics

   the quantitative and qualitative
      analysis of all metabolites in
     samples of cells, body fluids,
                       tissues, etc.


                  Julio E. Peironcely
Metabolomics

             Experi-                                                                 Biological
Biological                        Sample      Data       Data pre-         Data
             mental    Sampling                                                        inter-
question                        preparation acquisition processing        analysis
             design                                                                  pretation


                                                                 Metabolites




                                                                               Relevant
                                                                            biomolecules/
                                                                List of
                                      Samples     Raw data                   connectivities
                 Protocol                                       peaks/
                                                                                  &
                                                                biomolecules
                                                                                Models




                                                 Julio E. Peironcely
Metabolomics

             Experi-                                                                 Biological
Biological                        Sample      Data       Data pre-         Data
             mental    Sampling                                                        inter-
question                        preparation acquisition processing        analysis
             design                                                                  pretation


                                                                 Metabolites




                                                                               Relevant
                                                                            biomolecules/
                                                                List of
                                      Samples     Raw data                   connectivities
                 Protocol                                       peaks/
                                                                                  &
                                                                biomolecules
                                                                                Models




                                                 Julio E. Peironcely
How do metabolites
        look like?
HMDB          ZINC
 8K           21M



       Julio E. Peironcely
metabolites   non metabolites

      Water Solubility
            MW
         C Atoms
     Struc. Complexity
            PSA


               Julio E. Peironcely
PCA




      Julio E. Peironcely
PCA
Not so different
Decision Tree




                Julio E. Peironcely
Lots of candidates
         structures
Elemental
Composition




              Julio E. Peironcely
Elemental
Composition




      Structure
     Generation




                  Julio E. Peironcely
Elemental
Composition




      Structure
     Generation




              Molecules

                    Julio E. Peironcely
We are looking for
      metabolites
Elemental
Composition




      Structure       Metabolite
     Generation       Likeness




              Molecules

                    Julio E. Peironcely
Elemental
Composition
                                    Metabolites




      Structure       Metabolite
     Generation       Likeness




              Molecules

                    Julio E. Peironcely
Metabolite-likeness
Representation             + Classification
   HMDB            ZINC
    8K             21M


       Atom Counts

   Physicochemical desc.            Support Vector
                                    Machines (SVM)
     MDL Public Keys
                                 Random Forest (RF)
          FCFP_4
                                   Naïve Bayes (NB)
          ECFP_4




                             Julio E. Peironcely
Metabolite-likeness         HMDB
                             8K
                                                ZINC
                                                21M


                               Standardization


      Atom Counts            Diversity Selection
  Physicochemical desc.
    MDL Public Keys
         FCFP_4
         ECFP_4




                          Julio E. Peironcely
Metabolite-likeness           HMDB
                               8K
                                                  ZINC
                                                  21M


                                 Standardization


      Atom Counts              Diversity Selection
  Physicochemical desc.
    MDL Public Keys
         FCFP_4           Training Set              Test Set
         ECFP_4            532 + 532              6.4K + 6.4K




                            Julio E. Peironcely
Metabolite-likeness               HMDB
                                   8K
                                                      ZINC
                                                      21M


                                      Standardization


      Atom Counts                  Diversity Selection
  Physicochemical desc.
    MDL Public Keys
         FCFP_4             Training Set                Test Set
         ECFP_4              532 + 532                6.4K + 6.4K

                            5-fold CV

                          SVM    RF      BC




                                Julio E. Peironcely
Metabolite-likeness        HMDB
                            8K
                                               ZINC
                                               21M


                               Standardization


                            Diversity Selection
   3 classifiers
         X
                      Training Set               Test Set
  5 descriptions       532 + 532               6.4K + 6.4K

                      5-fold CV                Metabolite
                                                likeness
                   SVM    RF      BC




                         Julio E. Peironcely
Metabolite-likeness                          HMDB
                                              8K
                                                                 ZINC
                                                                 21M


Best = RF – MDLPublicKeys                        Standardization

Sensitivity   Specificity    AUC
                                              Diversity Selection
 99.84%        87.52%       99.20%

                                       Training Set                Test Set
      Bad BC – P_desc                   532 + 532                6.4K + 6.4K

Sensitivity   Specificity    AUC       5-fold CV                 Metabolite
                                                                  likeness
                                     SVM    RF      BC
 42.51%        86.56%       61.57%




                                           Julio E. Peironcely
Metabolite-likeness, external
validation
              HMDB
            External          DrugBank          ChEMBL
          validation set


                                          Random Selection



                           Standardization


                             Metabolite
                              likeness




                                    Julio E. Peironcely
Metabolite-likeness, external
validation




                     Julio E. Peironcely
Met-likeness + structure generation
(methylhistamine) 260K

                                          71%
     46%




                    Julio E. Peironcely
Met-likeness + structure generation
(malic acid) 8K

                                          100%

57%          77%




                    Julio E. Peironcely
Conclusions


Prediction is good, interpretation not

              Useful in different fields

                Local models needed



                      Julio E. Peironcely
Acknowledgements



   Leiden University      University of Cambridge

   Theo Reijmers          Andreas Bender
   Thomas Hankemeier


   TNO Quality of Life    HMP University of
                          Alberta
   Leon Coulier
                          David Wishart
                          Ying (Edison) Dong




                         Julio E. Peironcely

Contenu connexe

Tendances

Analyzing ligand and small molecule binding activity of solubilized myszka
Analyzing ligand and small molecule binding activity of solubilized myszkaAnalyzing ligand and small molecule binding activity of solubilized myszka
Analyzing ligand and small molecule binding activity of solubilized myszkaJohannesdedooper
 
Bio leap InnoCos Europe, Paris
Bio leap InnoCos Europe, ParisBio leap InnoCos Europe, Paris
Bio leap InnoCos Europe, ParisKGS Global
 
Lab Informatics 09 Se
Lab Informatics 09 SeLab Informatics 09 Se
Lab Informatics 09 SeSamEid
 
Team presentation min
Team presentation minTeam presentation min
Team presentation minChoo Yang
 
Pat O'Mahony, Chief Executive, Irish Medicines Board
Pat O'Mahony, Chief Executive, Irish Medicines BoardPat O'Mahony, Chief Executive, Irish Medicines Board
Pat O'Mahony, Chief Executive, Irish Medicines BoardInvestnet
 
Вычислительный эксперимент в молекулярной биофизике белков и биомембран
Вычислительный эксперимент в молекулярной биофизике белков и биомембранВычислительный эксперимент в молекулярной биофизике белков и биомембран
Вычислительный эксперимент в молекулярной биофизике белков и биомембранIlya Klabukov
 
Anovasia technology presentation nov2012 non-conf
Anovasia technology presentation nov2012 non-confAnovasia technology presentation nov2012 non-conf
Anovasia technology presentation nov2012 non-confJohn Dangerfield
 
Selective Protein Staining On Native Gel
Selective Protein Staining On Native GelSelective Protein Staining On Native Gel
Selective Protein Staining On Native GelMartina Bertsch
 
Website antibodies
Website   antibodiesWebsite   antibodies
Website antibodiesAmunix
 
Asilomar2005 Ecoli Poster
Asilomar2005 Ecoli PosterAsilomar2005 Ecoli Poster
Asilomar2005 Ecoli Posterjcruzsilva
 
Data Quality Issues That Can Impact Drug Discovery
Data Quality Issues That Can Impact Drug DiscoveryData Quality Issues That Can Impact Drug Discovery
Data Quality Issues That Can Impact Drug DiscoverySean Ekins
 
Selkoe webinar slides
Selkoe webinar slidesSelkoe webinar slides
Selkoe webinar slidesnicowef
 

Tendances (16)

Analyzing ligand and small molecule binding activity of solubilized myszka
Analyzing ligand and small molecule binding activity of solubilized myszkaAnalyzing ligand and small molecule binding activity of solubilized myszka
Analyzing ligand and small molecule binding activity of solubilized myszka
 
Poster
PosterPoster
Poster
 
Bio leap InnoCos Europe, Paris
Bio leap InnoCos Europe, ParisBio leap InnoCos Europe, Paris
Bio leap InnoCos Europe, Paris
 
Lab Informatics 09 Se
Lab Informatics 09 SeLab Informatics 09 Se
Lab Informatics 09 Se
 
Team presentation min
Team presentation minTeam presentation min
Team presentation min
 
Pat O'Mahony, Chief Executive, Irish Medicines Board
Pat O'Mahony, Chief Executive, Irish Medicines BoardPat O'Mahony, Chief Executive, Irish Medicines Board
Pat O'Mahony, Chief Executive, Irish Medicines Board
 
Вычислительный эксперимент в молекулярной биофизике белков и биомембран
Вычислительный эксперимент в молекулярной биофизике белков и биомембранВычислительный эксперимент в молекулярной биофизике белков и биомембран
Вычислительный эксперимент в молекулярной биофизике белков и биомембран
 
Chapt 10
Chapt 10Chapt 10
Chapt 10
 
Anovasia technology presentation nov2012 non-conf
Anovasia technology presentation nov2012 non-confAnovasia technology presentation nov2012 non-conf
Anovasia technology presentation nov2012 non-conf
 
Bradshaw - Bioenergy - Spring Review 2012
Bradshaw - Bioenergy - Spring Review 2012Bradshaw - Bioenergy - Spring Review 2012
Bradshaw - Bioenergy - Spring Review 2012
 
Selective Protein Staining On Native Gel
Selective Protein Staining On Native GelSelective Protein Staining On Native Gel
Selective Protein Staining On Native Gel
 
Website antibodies
Website   antibodiesWebsite   antibodies
Website antibodies
 
PhD Defense
PhD DefensePhD Defense
PhD Defense
 
Asilomar2005 Ecoli Poster
Asilomar2005 Ecoli PosterAsilomar2005 Ecoli Poster
Asilomar2005 Ecoli Poster
 
Data Quality Issues That Can Impact Drug Discovery
Data Quality Issues That Can Impact Drug DiscoveryData Quality Issues That Can Impact Drug Discovery
Data Quality Issues That Can Impact Drug Discovery
 
Selkoe webinar slides
Selkoe webinar slidesSelkoe webinar slides
Selkoe webinar slides
 

Similaire à Understanding And Classifying Metabolite Space and Metabolite-Likeness

Structure generation, metabolite space, and metabolite likeness
Structure generation, metabolite space, and metabolite likenessStructure generation, metabolite space, and metabolite likeness
Structure generation, metabolite space, and metabolite likenessVodafoneZiggo
 
Julio Peironcely @ ICCS 2011
Julio Peironcely @ ICCS 2011Julio Peironcely @ ICCS 2011
Julio Peironcely @ ICCS 2011VodafoneZiggo
 
Computational Protein Design. 1. Challenges in Protein Engineering
Computational Protein Design. 1. Challenges in Protein EngineeringComputational Protein Design. 1. Challenges in Protein Engineering
Computational Protein Design. 1. Challenges in Protein EngineeringPablo Carbonell
 
Biotechnology as Career Option 2012
Biotechnology as Career Option 2012Biotechnology as Career Option 2012
Biotechnology as Career Option 2012Reportbioinformatics
 
Year 12 biology early comm presentation intro only
Year 12 biology early comm presentation intro onlyYear 12 biology early comm presentation intro only
Year 12 biology early comm presentation intro onlyRachelCaico
 
1 introduction to_the_ebi_(katrina_pavelin)
1 introduction to_the_ebi_(katrina_pavelin)1 introduction to_the_ebi_(katrina_pavelin)
1 introduction to_the_ebi_(katrina_pavelin)phdcareers
 
Using ontologies to do integrative systems biology
Using ontologies to do integrative systems biologyUsing ontologies to do integrative systems biology
Using ontologies to do integrative systems biologyChris Evelo
 
Stephen Friend Fanconi Anemia Research Fund 2012-01-21
Stephen Friend Fanconi Anemia Research Fund 2012-01-21Stephen Friend Fanconi Anemia Research Fund 2012-01-21
Stephen Friend Fanconi Anemia Research Fund 2012-01-21Sage Base
 
OBC | Synthetic biology announcing the coming technological revolution
OBC | Synthetic biology announcing the coming technological revolutionOBC | Synthetic biology announcing the coming technological revolution
OBC | Synthetic biology announcing the coming technological revolutionOut of The Box Seminar
 
Towards Cell Scale Molecular Dyamics - K. Schulten, July 2012
Towards Cell Scale Molecular Dyamics - K. Schulten, July 2012Towards Cell Scale Molecular Dyamics - K. Schulten, July 2012
Towards Cell Scale Molecular Dyamics - K. Schulten, July 2012TCBG
 
BioDiscovery Solutions for Future
BioDiscovery Solutions for FutureBioDiscovery Solutions for Future
BioDiscovery Solutions for Futurecontactmeasif
 
Tyler functional annotation thurs 1120
Tyler functional annotation thurs 1120Tyler functional annotation thurs 1120
Tyler functional annotation thurs 1120Sucheta Tripathy
 
Proteomics course 1
Proteomics course 1Proteomics course 1
Proteomics course 1utpaltatu
 
Selection of Safer and More Effective Anti-inflammatory Kinase Inhibitors usi...
Selection of Safer and More Effective Anti-inflammatory Kinase Inhibitors usi...Selection of Safer and More Effective Anti-inflammatory Kinase Inhibitors usi...
Selection of Safer and More Effective Anti-inflammatory Kinase Inhibitors usi...BioMAP® Systems
 
Allelic Imbalance for Pre-capture Whole Exome Sequencing
Allelic Imbalance for Pre-capture Whole Exome SequencingAllelic Imbalance for Pre-capture Whole Exome Sequencing
Allelic Imbalance for Pre-capture Whole Exome SequencingDenis C. Bauer
 
Lecture 4 metabolic pathway eng
Lecture 4 metabolic pathway engLecture 4 metabolic pathway eng
Lecture 4 metabolic pathway engDr. Tan Boon Siong
 
ENVO: The Environment Ontology (Presentation at the Genomics Standards Consor...
ENVO: The Environment Ontology (Presentation at the Genomics Standards Consor...ENVO: The Environment Ontology (Presentation at the Genomics Standards Consor...
ENVO: The Environment Ontology (Presentation at the Genomics Standards Consor...Barry Smith
 

Similaire à Understanding And Classifying Metabolite Space and Metabolite-Likeness (20)

Structure generation, metabolite space, and metabolite likeness
Structure generation, metabolite space, and metabolite likenessStructure generation, metabolite space, and metabolite likeness
Structure generation, metabolite space, and metabolite likeness
 
Julio Peironcely @ ICCS 2011
Julio Peironcely @ ICCS 2011Julio Peironcely @ ICCS 2011
Julio Peironcely @ ICCS 2011
 
Computational Protein Design. 1. Challenges in Protein Engineering
Computational Protein Design. 1. Challenges in Protein EngineeringComputational Protein Design. 1. Challenges in Protein Engineering
Computational Protein Design. 1. Challenges in Protein Engineering
 
Biotechnology as Career Option 2012
Biotechnology as Career Option 2012Biotechnology as Career Option 2012
Biotechnology as Career Option 2012
 
Year 12 biology early comm presentation intro only
Year 12 biology early comm presentation intro onlyYear 12 biology early comm presentation intro only
Year 12 biology early comm presentation intro only
 
ALTRABio presents WikiBioPath: new perspectives in biological data analysis
ALTRABio presents WikiBioPath: new perspectives in biological data analysisALTRABio presents WikiBioPath: new perspectives in biological data analysis
ALTRABio presents WikiBioPath: new perspectives in biological data analysis
 
1 introduction to_the_ebi_(katrina_pavelin)
1 introduction to_the_ebi_(katrina_pavelin)1 introduction to_the_ebi_(katrina_pavelin)
1 introduction to_the_ebi_(katrina_pavelin)
 
Using ontologies to do integrative systems biology
Using ontologies to do integrative systems biologyUsing ontologies to do integrative systems biology
Using ontologies to do integrative systems biology
 
Metabolomics Data Analysis
Metabolomics Data AnalysisMetabolomics Data Analysis
Metabolomics Data Analysis
 
Stephen Friend Fanconi Anemia Research Fund 2012-01-21
Stephen Friend Fanconi Anemia Research Fund 2012-01-21Stephen Friend Fanconi Anemia Research Fund 2012-01-21
Stephen Friend Fanconi Anemia Research Fund 2012-01-21
 
OBC | Synthetic biology announcing the coming technological revolution
OBC | Synthetic biology announcing the coming technological revolutionOBC | Synthetic biology announcing the coming technological revolution
OBC | Synthetic biology announcing the coming technological revolution
 
Towards Cell Scale Molecular Dyamics - K. Schulten, July 2012
Towards Cell Scale Molecular Dyamics - K. Schulten, July 2012Towards Cell Scale Molecular Dyamics - K. Schulten, July 2012
Towards Cell Scale Molecular Dyamics - K. Schulten, July 2012
 
Bioinformatica t7-protein structure
Bioinformatica t7-protein structureBioinformatica t7-protein structure
Bioinformatica t7-protein structure
 
BioDiscovery Solutions for Future
BioDiscovery Solutions for FutureBioDiscovery Solutions for Future
BioDiscovery Solutions for Future
 
Tyler functional annotation thurs 1120
Tyler functional annotation thurs 1120Tyler functional annotation thurs 1120
Tyler functional annotation thurs 1120
 
Proteomics course 1
Proteomics course 1Proteomics course 1
Proteomics course 1
 
Selection of Safer and More Effective Anti-inflammatory Kinase Inhibitors usi...
Selection of Safer and More Effective Anti-inflammatory Kinase Inhibitors usi...Selection of Safer and More Effective Anti-inflammatory Kinase Inhibitors usi...
Selection of Safer and More Effective Anti-inflammatory Kinase Inhibitors usi...
 
Allelic Imbalance for Pre-capture Whole Exome Sequencing
Allelic Imbalance for Pre-capture Whole Exome SequencingAllelic Imbalance for Pre-capture Whole Exome Sequencing
Allelic Imbalance for Pre-capture Whole Exome Sequencing
 
Lecture 4 metabolic pathway eng
Lecture 4 metabolic pathway engLecture 4 metabolic pathway eng
Lecture 4 metabolic pathway eng
 
ENVO: The Environment Ontology (Presentation at the Genomics Standards Consor...
ENVO: The Environment Ontology (Presentation at the Genomics Standards Consor...ENVO: The Environment Ontology (Presentation at the Genomics Standards Consor...
ENVO: The Environment Ontology (Presentation at the Genomics Standards Consor...
 

Dernier

Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Google AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGGoogle AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGSujit Pal
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 

Dernier (20)

Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Google AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGGoogle AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAG
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 

Understanding And Classifying Metabolite Space and Metabolite-Likeness

  • 1. Understanding And Classifying Metabolite Space and Metabolite- Likeness PLoS One (in press) Julio E. Peironcely  @peyron Juliopeironcely.com PhD student at Leiden University and TNO
  • 2. Metabolomics the quantitative and qualitative analysis of all metabolites in samples of cells, body fluids, tissues, etc. Julio E. Peironcely
  • 3. Metabolomics Experi- Biological Biological Sample Data Data pre- Data mental Sampling inter- question preparation acquisition processing analysis design pretation Metabolites Relevant biomolecules/ List of Samples Raw data connectivities Protocol peaks/ & biomolecules Models Julio E. Peironcely
  • 4. Metabolomics Experi- Biological Biological Sample Data Data pre- Data mental Sampling inter- question preparation acquisition processing analysis design pretation Metabolites Relevant biomolecules/ List of Samples Raw data connectivities Protocol peaks/ & biomolecules Models Julio E. Peironcely
  • 5. How do metabolites look like?
  • 6. HMDB ZINC 8K 21M Julio E. Peironcely
  • 7. metabolites non metabolites Water Solubility MW C Atoms Struc. Complexity PSA Julio E. Peironcely
  • 8. PCA Julio E. Peironcely
  • 9. PCA
  • 11. Decision Tree Julio E. Peironcely
  • 12. Lots of candidates structures
  • 13. Elemental Composition Julio E. Peironcely
  • 14. Elemental Composition Structure Generation Julio E. Peironcely
  • 15. Elemental Composition Structure Generation Molecules Julio E. Peironcely
  • 16. We are looking for metabolites
  • 17. Elemental Composition Structure Metabolite Generation Likeness Molecules Julio E. Peironcely
  • 18. Elemental Composition Metabolites Structure Metabolite Generation Likeness Molecules Julio E. Peironcely
  • 19. Metabolite-likeness Representation + Classification HMDB ZINC 8K 21M Atom Counts Physicochemical desc. Support Vector Machines (SVM) MDL Public Keys Random Forest (RF) FCFP_4 Naïve Bayes (NB) ECFP_4 Julio E. Peironcely
  • 20. Metabolite-likeness HMDB 8K ZINC 21M Standardization Atom Counts Diversity Selection Physicochemical desc. MDL Public Keys FCFP_4 ECFP_4 Julio E. Peironcely
  • 21. Metabolite-likeness HMDB 8K ZINC 21M Standardization Atom Counts Diversity Selection Physicochemical desc. MDL Public Keys FCFP_4 Training Set Test Set ECFP_4 532 + 532 6.4K + 6.4K Julio E. Peironcely
  • 22. Metabolite-likeness HMDB 8K ZINC 21M Standardization Atom Counts Diversity Selection Physicochemical desc. MDL Public Keys FCFP_4 Training Set Test Set ECFP_4 532 + 532 6.4K + 6.4K 5-fold CV SVM RF BC Julio E. Peironcely
  • 23. Metabolite-likeness HMDB 8K ZINC 21M Standardization Diversity Selection 3 classifiers X Training Set Test Set 5 descriptions 532 + 532 6.4K + 6.4K 5-fold CV Metabolite likeness SVM RF BC Julio E. Peironcely
  • 24. Metabolite-likeness HMDB 8K ZINC 21M Best = RF – MDLPublicKeys Standardization Sensitivity Specificity AUC Diversity Selection 99.84% 87.52% 99.20% Training Set Test Set Bad BC – P_desc 532 + 532 6.4K + 6.4K Sensitivity Specificity AUC 5-fold CV Metabolite likeness SVM RF BC 42.51% 86.56% 61.57% Julio E. Peironcely
  • 25. Metabolite-likeness, external validation HMDB External DrugBank ChEMBL validation set Random Selection Standardization Metabolite likeness Julio E. Peironcely
  • 27.
  • 28. Met-likeness + structure generation (methylhistamine) 260K 71% 46% Julio E. Peironcely
  • 29. Met-likeness + structure generation (malic acid) 8K 100% 57% 77% Julio E. Peironcely
  • 30. Conclusions Prediction is good, interpretation not Useful in different fields Local models needed Julio E. Peironcely
  • 31. Acknowledgements Leiden University University of Cambridge Theo Reijmers Andreas Bender Thomas Hankemeier TNO Quality of Life HMP University of Alberta Leon Coulier David Wishart Ying (Edison) Dong Julio E. Peironcely