SlideShare une entreprise Scribd logo
1  sur  31
Télécharger pour lire hors ligne
Understanding And
Classifying Metabolite
Space and Metabolite-
Likeness
PLoS One (in press)


   Julio E. Peironcely  @peyron
   Juliopeironcely.com
   PhD student at Leiden University and TNO
Metabolomics

   the quantitative and qualitative
      analysis of all metabolites in
     samples of cells, body fluids,
                       tissues, etc.


                  Julio E. Peironcely
Metabolomics

             Experi-                                                                 Biological
Biological                        Sample      Data       Data pre-         Data
             mental    Sampling                                                        inter-
question                        preparation acquisition processing        analysis
             design                                                                  pretation


                                                                 Metabolites




                                                                               Relevant
                                                                            biomolecules/
                                                                List of
                                      Samples     Raw data                   connectivities
                 Protocol                                       peaks/
                                                                                  &
                                                                biomolecules
                                                                                Models




                                                 Julio E. Peironcely
Metabolomics

             Experi-                                                                 Biological
Biological                        Sample      Data       Data pre-         Data
             mental    Sampling                                                        inter-
question                        preparation acquisition processing        analysis
             design                                                                  pretation


                                                                 Metabolites




                                                                               Relevant
                                                                            biomolecules/
                                                                List of
                                      Samples     Raw data                   connectivities
                 Protocol                                       peaks/
                                                                                  &
                                                                biomolecules
                                                                                Models




                                                 Julio E. Peironcely
How do metabolites
        look like?
HMDB          ZINC
 8K           21M



       Julio E. Peironcely
metabolites   non metabolites

      Water Solubility
            MW
         C Atoms
     Struc. Complexity
            PSA


               Julio E. Peironcely
PCA




      Julio E. Peironcely
PCA
Not so different
Decision Tree




                Julio E. Peironcely
Lots of candidates
         structures
Elemental
Composition




              Julio E. Peironcely
Elemental
Composition




      Structure
     Generation




                  Julio E. Peironcely
Elemental
Composition




      Structure
     Generation




              Molecules

                    Julio E. Peironcely
We are looking for
      metabolites
Elemental
Composition




      Structure       Metabolite
     Generation       Likeness




              Molecules

                    Julio E. Peironcely
Elemental
Composition
                                    Metabolites




      Structure       Metabolite
     Generation       Likeness




              Molecules

                    Julio E. Peironcely
Metabolite-likeness
Representation             + Classification
   HMDB            ZINC
    8K             21M


       Atom Counts

   Physicochemical desc.            Support Vector
                                    Machines (SVM)
     MDL Public Keys
                                 Random Forest (RF)
          FCFP_4
                                   Naïve Bayes (NB)
          ECFP_4




                             Julio E. Peironcely
Metabolite-likeness         HMDB
                             8K
                                                ZINC
                                                21M


                               Standardization


      Atom Counts            Diversity Selection
  Physicochemical desc.
    MDL Public Keys
         FCFP_4
         ECFP_4




                          Julio E. Peironcely
Metabolite-likeness           HMDB
                               8K
                                                  ZINC
                                                  21M


                                 Standardization


      Atom Counts              Diversity Selection
  Physicochemical desc.
    MDL Public Keys
         FCFP_4           Training Set              Test Set
         ECFP_4            532 + 532              6.4K + 6.4K




                            Julio E. Peironcely
Metabolite-likeness               HMDB
                                   8K
                                                      ZINC
                                                      21M


                                      Standardization


      Atom Counts                  Diversity Selection
  Physicochemical desc.
    MDL Public Keys
         FCFP_4             Training Set                Test Set
         ECFP_4              532 + 532                6.4K + 6.4K

                            5-fold CV

                          SVM    RF      BC




                                Julio E. Peironcely
Metabolite-likeness        HMDB
                            8K
                                               ZINC
                                               21M


                               Standardization


                            Diversity Selection
   3 classifiers
         X
                      Training Set               Test Set
  5 descriptions       532 + 532               6.4K + 6.4K

                      5-fold CV                Metabolite
                                                likeness
                   SVM    RF      BC




                         Julio E. Peironcely
Metabolite-likeness                          HMDB
                                              8K
                                                                 ZINC
                                                                 21M


Best = RF – MDLPublicKeys                        Standardization

Sensitivity   Specificity    AUC
                                              Diversity Selection
 99.84%        87.52%       99.20%

                                       Training Set                Test Set
      Bad BC – P_desc                   532 + 532                6.4K + 6.4K

Sensitivity   Specificity    AUC       5-fold CV                 Metabolite
                                                                  likeness
                                     SVM    RF      BC
 42.51%        86.56%       61.57%




                                           Julio E. Peironcely
Metabolite-likeness, external
validation
              HMDB
            External          DrugBank          ChEMBL
          validation set


                                          Random Selection



                           Standardization


                             Metabolite
                              likeness




                                    Julio E. Peironcely
Metabolite-likeness, external
validation




                     Julio E. Peironcely
Met-likeness + structure generation
(methylhistamine) 260K

                                          71%
     46%




                    Julio E. Peironcely
Met-likeness + structure generation
(malic acid) 8K

                                          100%

57%          77%




                    Julio E. Peironcely
Conclusions


Prediction is good, interpretation not

              Useful in different fields

                Local models needed



                      Julio E. Peironcely
Acknowledgements



   Leiden University      University of Cambridge

   Theo Reijmers          Andreas Bender
   Thomas Hankemeier


   TNO Quality of Life    HMP University of
                          Alberta
   Leon Coulier
                          David Wishart
                          Ying (Edison) Dong




                         Julio E. Peironcely

Contenu connexe

Tendances

Analyzing ligand and small molecule binding activity of solubilized myszka
Analyzing ligand and small molecule binding activity of solubilized myszkaAnalyzing ligand and small molecule binding activity of solubilized myszka
Analyzing ligand and small molecule binding activity of solubilized myszkaJohannesdedooper
 
Bio leap InnoCos Europe, Paris
Bio leap InnoCos Europe, ParisBio leap InnoCos Europe, Paris
Bio leap InnoCos Europe, ParisKGS Global
 
Lab Informatics 09 Se
Lab Informatics 09 SeLab Informatics 09 Se
Lab Informatics 09 SeSamEid
 
Team presentation min
Team presentation minTeam presentation min
Team presentation minChoo Yang
 
Pat O'Mahony, Chief Executive, Irish Medicines Board
Pat O'Mahony, Chief Executive, Irish Medicines BoardPat O'Mahony, Chief Executive, Irish Medicines Board
Pat O'Mahony, Chief Executive, Irish Medicines BoardInvestnet
 
Вычислительный эксперимент в молекулярной биофизике белков и биомембран
Вычислительный эксперимент в молекулярной биофизике белков и биомембранВычислительный эксперимент в молекулярной биофизике белков и биомембран
Вычислительный эксперимент в молекулярной биофизике белков и биомембранIlya Klabukov
 
Anovasia technology presentation nov2012 non-conf
Anovasia technology presentation nov2012 non-confAnovasia technology presentation nov2012 non-conf
Anovasia technology presentation nov2012 non-confJohn Dangerfield
 
Selective Protein Staining On Native Gel
Selective Protein Staining On Native GelSelective Protein Staining On Native Gel
Selective Protein Staining On Native GelMartina Bertsch
 
Website antibodies
Website   antibodiesWebsite   antibodies
Website antibodiesAmunix
 
Asilomar2005 Ecoli Poster
Asilomar2005 Ecoli PosterAsilomar2005 Ecoli Poster
Asilomar2005 Ecoli Posterjcruzsilva
 
Data Quality Issues That Can Impact Drug Discovery
Data Quality Issues That Can Impact Drug DiscoveryData Quality Issues That Can Impact Drug Discovery
Data Quality Issues That Can Impact Drug DiscoverySean Ekins
 
Selkoe webinar slides
Selkoe webinar slidesSelkoe webinar slides
Selkoe webinar slidesnicowef
 

Tendances (16)

Analyzing ligand and small molecule binding activity of solubilized myszka
Analyzing ligand and small molecule binding activity of solubilized myszkaAnalyzing ligand and small molecule binding activity of solubilized myszka
Analyzing ligand and small molecule binding activity of solubilized myszka
 
Poster
PosterPoster
Poster
 
Bio leap InnoCos Europe, Paris
Bio leap InnoCos Europe, ParisBio leap InnoCos Europe, Paris
Bio leap InnoCos Europe, Paris
 
Lab Informatics 09 Se
Lab Informatics 09 SeLab Informatics 09 Se
Lab Informatics 09 Se
 
Team presentation min
Team presentation minTeam presentation min
Team presentation min
 
Pat O'Mahony, Chief Executive, Irish Medicines Board
Pat O'Mahony, Chief Executive, Irish Medicines BoardPat O'Mahony, Chief Executive, Irish Medicines Board
Pat O'Mahony, Chief Executive, Irish Medicines Board
 
Вычислительный эксперимент в молекулярной биофизике белков и биомембран
Вычислительный эксперимент в молекулярной биофизике белков и биомембранВычислительный эксперимент в молекулярной биофизике белков и биомембран
Вычислительный эксперимент в молекулярной биофизике белков и биомембран
 
Chapt 10
Chapt 10Chapt 10
Chapt 10
 
Anovasia technology presentation nov2012 non-conf
Anovasia technology presentation nov2012 non-confAnovasia technology presentation nov2012 non-conf
Anovasia technology presentation nov2012 non-conf
 
Bradshaw - Bioenergy - Spring Review 2012
Bradshaw - Bioenergy - Spring Review 2012Bradshaw - Bioenergy - Spring Review 2012
Bradshaw - Bioenergy - Spring Review 2012
 
Selective Protein Staining On Native Gel
Selective Protein Staining On Native GelSelective Protein Staining On Native Gel
Selective Protein Staining On Native Gel
 
Website antibodies
Website   antibodiesWebsite   antibodies
Website antibodies
 
PhD Defense
PhD DefensePhD Defense
PhD Defense
 
Asilomar2005 Ecoli Poster
Asilomar2005 Ecoli PosterAsilomar2005 Ecoli Poster
Asilomar2005 Ecoli Poster
 
Data Quality Issues That Can Impact Drug Discovery
Data Quality Issues That Can Impact Drug DiscoveryData Quality Issues That Can Impact Drug Discovery
Data Quality Issues That Can Impact Drug Discovery
 
Selkoe webinar slides
Selkoe webinar slidesSelkoe webinar slides
Selkoe webinar slides
 

Similaire à Understanding And Classifying Metabolite Space and Metabolite-Likeness

Structure generation, metabolite space, and metabolite likeness
Structure generation, metabolite space, and metabolite likenessStructure generation, metabolite space, and metabolite likeness
Structure generation, metabolite space, and metabolite likenessVodafoneZiggo
 
Julio Peironcely @ ICCS 2011
Julio Peironcely @ ICCS 2011Julio Peironcely @ ICCS 2011
Julio Peironcely @ ICCS 2011VodafoneZiggo
 
Computational Protein Design. 1. Challenges in Protein Engineering
Computational Protein Design. 1. Challenges in Protein EngineeringComputational Protein Design. 1. Challenges in Protein Engineering
Computational Protein Design. 1. Challenges in Protein EngineeringPablo Carbonell
 
Biotechnology as Career Option 2012
Biotechnology as Career Option 2012Biotechnology as Career Option 2012
Biotechnology as Career Option 2012Reportbioinformatics
 
Year 12 biology early comm presentation intro only
Year 12 biology early comm presentation intro onlyYear 12 biology early comm presentation intro only
Year 12 biology early comm presentation intro onlyRachelCaico
 
1 introduction to_the_ebi_(katrina_pavelin)
1 introduction to_the_ebi_(katrina_pavelin)1 introduction to_the_ebi_(katrina_pavelin)
1 introduction to_the_ebi_(katrina_pavelin)phdcareers
 
Using ontologies to do integrative systems biology
Using ontologies to do integrative systems biologyUsing ontologies to do integrative systems biology
Using ontologies to do integrative systems biologyChris Evelo
 
Stephen Friend Fanconi Anemia Research Fund 2012-01-21
Stephen Friend Fanconi Anemia Research Fund 2012-01-21Stephen Friend Fanconi Anemia Research Fund 2012-01-21
Stephen Friend Fanconi Anemia Research Fund 2012-01-21Sage Base
 
OBC | Synthetic biology announcing the coming technological revolution
OBC | Synthetic biology announcing the coming technological revolutionOBC | Synthetic biology announcing the coming technological revolution
OBC | Synthetic biology announcing the coming technological revolutionOut of The Box Seminar
 
Towards Cell Scale Molecular Dyamics - K. Schulten, July 2012
Towards Cell Scale Molecular Dyamics - K. Schulten, July 2012Towards Cell Scale Molecular Dyamics - K. Schulten, July 2012
Towards Cell Scale Molecular Dyamics - K. Schulten, July 2012TCBG
 
BioDiscovery Solutions for Future
BioDiscovery Solutions for FutureBioDiscovery Solutions for Future
BioDiscovery Solutions for Futurecontactmeasif
 
Tyler functional annotation thurs 1120
Tyler functional annotation thurs 1120Tyler functional annotation thurs 1120
Tyler functional annotation thurs 1120Sucheta Tripathy
 
Proteomics course 1
Proteomics course 1Proteomics course 1
Proteomics course 1utpaltatu
 
Selection of Safer and More Effective Anti-inflammatory Kinase Inhibitors usi...
Selection of Safer and More Effective Anti-inflammatory Kinase Inhibitors usi...Selection of Safer and More Effective Anti-inflammatory Kinase Inhibitors usi...
Selection of Safer and More Effective Anti-inflammatory Kinase Inhibitors usi...BioMAP® Systems
 
Allelic Imbalance for Pre-capture Whole Exome Sequencing
Allelic Imbalance for Pre-capture Whole Exome SequencingAllelic Imbalance for Pre-capture Whole Exome Sequencing
Allelic Imbalance for Pre-capture Whole Exome SequencingDenis C. Bauer
 
Lecture 4 metabolic pathway eng
Lecture 4 metabolic pathway engLecture 4 metabolic pathway eng
Lecture 4 metabolic pathway engDr. Tan Boon Siong
 
ENVO: The Environment Ontology (Presentation at the Genomics Standards Consor...
ENVO: The Environment Ontology (Presentation at the Genomics Standards Consor...ENVO: The Environment Ontology (Presentation at the Genomics Standards Consor...
ENVO: The Environment Ontology (Presentation at the Genomics Standards Consor...Barry Smith
 

Similaire à Understanding And Classifying Metabolite Space and Metabolite-Likeness (20)

Structure generation, metabolite space, and metabolite likeness
Structure generation, metabolite space, and metabolite likenessStructure generation, metabolite space, and metabolite likeness
Structure generation, metabolite space, and metabolite likeness
 
Julio Peironcely @ ICCS 2011
Julio Peironcely @ ICCS 2011Julio Peironcely @ ICCS 2011
Julio Peironcely @ ICCS 2011
 
Computational Protein Design. 1. Challenges in Protein Engineering
Computational Protein Design. 1. Challenges in Protein EngineeringComputational Protein Design. 1. Challenges in Protein Engineering
Computational Protein Design. 1. Challenges in Protein Engineering
 
Biotechnology as Career Option 2012
Biotechnology as Career Option 2012Biotechnology as Career Option 2012
Biotechnology as Career Option 2012
 
Year 12 biology early comm presentation intro only
Year 12 biology early comm presentation intro onlyYear 12 biology early comm presentation intro only
Year 12 biology early comm presentation intro only
 
ALTRABio presents WikiBioPath: new perspectives in biological data analysis
ALTRABio presents WikiBioPath: new perspectives in biological data analysisALTRABio presents WikiBioPath: new perspectives in biological data analysis
ALTRABio presents WikiBioPath: new perspectives in biological data analysis
 
1 introduction to_the_ebi_(katrina_pavelin)
1 introduction to_the_ebi_(katrina_pavelin)1 introduction to_the_ebi_(katrina_pavelin)
1 introduction to_the_ebi_(katrina_pavelin)
 
Using ontologies to do integrative systems biology
Using ontologies to do integrative systems biologyUsing ontologies to do integrative systems biology
Using ontologies to do integrative systems biology
 
Metabolomics Data Analysis
Metabolomics Data AnalysisMetabolomics Data Analysis
Metabolomics Data Analysis
 
Stephen Friend Fanconi Anemia Research Fund 2012-01-21
Stephen Friend Fanconi Anemia Research Fund 2012-01-21Stephen Friend Fanconi Anemia Research Fund 2012-01-21
Stephen Friend Fanconi Anemia Research Fund 2012-01-21
 
OBC | Synthetic biology announcing the coming technological revolution
OBC | Synthetic biology announcing the coming technological revolutionOBC | Synthetic biology announcing the coming technological revolution
OBC | Synthetic biology announcing the coming technological revolution
 
Towards Cell Scale Molecular Dyamics - K. Schulten, July 2012
Towards Cell Scale Molecular Dyamics - K. Schulten, July 2012Towards Cell Scale Molecular Dyamics - K. Schulten, July 2012
Towards Cell Scale Molecular Dyamics - K. Schulten, July 2012
 
Bioinformatica t7-protein structure
Bioinformatica t7-protein structureBioinformatica t7-protein structure
Bioinformatica t7-protein structure
 
BioDiscovery Solutions for Future
BioDiscovery Solutions for FutureBioDiscovery Solutions for Future
BioDiscovery Solutions for Future
 
Tyler functional annotation thurs 1120
Tyler functional annotation thurs 1120Tyler functional annotation thurs 1120
Tyler functional annotation thurs 1120
 
Proteomics course 1
Proteomics course 1Proteomics course 1
Proteomics course 1
 
Selection of Safer and More Effective Anti-inflammatory Kinase Inhibitors usi...
Selection of Safer and More Effective Anti-inflammatory Kinase Inhibitors usi...Selection of Safer and More Effective Anti-inflammatory Kinase Inhibitors usi...
Selection of Safer and More Effective Anti-inflammatory Kinase Inhibitors usi...
 
Allelic Imbalance for Pre-capture Whole Exome Sequencing
Allelic Imbalance for Pre-capture Whole Exome SequencingAllelic Imbalance for Pre-capture Whole Exome Sequencing
Allelic Imbalance for Pre-capture Whole Exome Sequencing
 
Lecture 4 metabolic pathway eng
Lecture 4 metabolic pathway engLecture 4 metabolic pathway eng
Lecture 4 metabolic pathway eng
 
ENVO: The Environment Ontology (Presentation at the Genomics Standards Consor...
ENVO: The Environment Ontology (Presentation at the Genomics Standards Consor...ENVO: The Environment Ontology (Presentation at the Genomics Standards Consor...
ENVO: The Environment Ontology (Presentation at the Genomics Standards Consor...
 

Dernier

The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfMounikaPolabathina
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxBkGupta21
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxLoriGlavin3
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 

Dernier (20)

The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdf
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptx
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 

Understanding And Classifying Metabolite Space and Metabolite-Likeness

  • 1. Understanding And Classifying Metabolite Space and Metabolite- Likeness PLoS One (in press) Julio E. Peironcely  @peyron Juliopeironcely.com PhD student at Leiden University and TNO
  • 2. Metabolomics the quantitative and qualitative analysis of all metabolites in samples of cells, body fluids, tissues, etc. Julio E. Peironcely
  • 3. Metabolomics Experi- Biological Biological Sample Data Data pre- Data mental Sampling inter- question preparation acquisition processing analysis design pretation Metabolites Relevant biomolecules/ List of Samples Raw data connectivities Protocol peaks/ & biomolecules Models Julio E. Peironcely
  • 4. Metabolomics Experi- Biological Biological Sample Data Data pre- Data mental Sampling inter- question preparation acquisition processing analysis design pretation Metabolites Relevant biomolecules/ List of Samples Raw data connectivities Protocol peaks/ & biomolecules Models Julio E. Peironcely
  • 5. How do metabolites look like?
  • 6. HMDB ZINC 8K 21M Julio E. Peironcely
  • 7. metabolites non metabolites Water Solubility MW C Atoms Struc. Complexity PSA Julio E. Peironcely
  • 8. PCA Julio E. Peironcely
  • 9. PCA
  • 11. Decision Tree Julio E. Peironcely
  • 12. Lots of candidates structures
  • 13. Elemental Composition Julio E. Peironcely
  • 14. Elemental Composition Structure Generation Julio E. Peironcely
  • 15. Elemental Composition Structure Generation Molecules Julio E. Peironcely
  • 16. We are looking for metabolites
  • 17. Elemental Composition Structure Metabolite Generation Likeness Molecules Julio E. Peironcely
  • 18. Elemental Composition Metabolites Structure Metabolite Generation Likeness Molecules Julio E. Peironcely
  • 19. Metabolite-likeness Representation + Classification HMDB ZINC 8K 21M Atom Counts Physicochemical desc. Support Vector Machines (SVM) MDL Public Keys Random Forest (RF) FCFP_4 Naïve Bayes (NB) ECFP_4 Julio E. Peironcely
  • 20. Metabolite-likeness HMDB 8K ZINC 21M Standardization Atom Counts Diversity Selection Physicochemical desc. MDL Public Keys FCFP_4 ECFP_4 Julio E. Peironcely
  • 21. Metabolite-likeness HMDB 8K ZINC 21M Standardization Atom Counts Diversity Selection Physicochemical desc. MDL Public Keys FCFP_4 Training Set Test Set ECFP_4 532 + 532 6.4K + 6.4K Julio E. Peironcely
  • 22. Metabolite-likeness HMDB 8K ZINC 21M Standardization Atom Counts Diversity Selection Physicochemical desc. MDL Public Keys FCFP_4 Training Set Test Set ECFP_4 532 + 532 6.4K + 6.4K 5-fold CV SVM RF BC Julio E. Peironcely
  • 23. Metabolite-likeness HMDB 8K ZINC 21M Standardization Diversity Selection 3 classifiers X Training Set Test Set 5 descriptions 532 + 532 6.4K + 6.4K 5-fold CV Metabolite likeness SVM RF BC Julio E. Peironcely
  • 24. Metabolite-likeness HMDB 8K ZINC 21M Best = RF – MDLPublicKeys Standardization Sensitivity Specificity AUC Diversity Selection 99.84% 87.52% 99.20% Training Set Test Set Bad BC – P_desc 532 + 532 6.4K + 6.4K Sensitivity Specificity AUC 5-fold CV Metabolite likeness SVM RF BC 42.51% 86.56% 61.57% Julio E. Peironcely
  • 25. Metabolite-likeness, external validation HMDB External DrugBank ChEMBL validation set Random Selection Standardization Metabolite likeness Julio E. Peironcely
  • 27.
  • 28. Met-likeness + structure generation (methylhistamine) 260K 71% 46% Julio E. Peironcely
  • 29. Met-likeness + structure generation (malic acid) 8K 100% 57% 77% Julio E. Peironcely
  • 30. Conclusions Prediction is good, interpretation not Useful in different fields Local models needed Julio E. Peironcely
  • 31. Acknowledgements Leiden University University of Cambridge Theo Reijmers Andreas Bender Thomas Hankemeier TNO Quality of Life HMP University of Alberta Leon Coulier David Wishart Ying (Edison) Dong Julio E. Peironcely