SlideShare une entreprise Scribd logo
1  sur  31
Télécharger pour lire hors ligne
ASSESSING DRUG SAFETY
USING AI
COMPUTER AIDED DRUG DISCOVERY
AND DEVELOPMENT
ABOUT ME
¡ VishnuVettrivel - vishnu@wisecube.ai
¡ Data Science/AI platform Architect
¡ NOT a Molecular Biologist or a Medicinal Chemist !
¡ Will be talking about things learnt mostly on the job
¡ Have been working with a Molecular biologist in a
Biotech research firm to help accelerate drug
discovery using Machine learning
AGENDA
¡ History
¡ Nature as Source
¡ Recent efforts
¡ Rational drug discovery
¡ Drug targeting
¡ Screening
¡ Drug Discovery Cycle
¡ Economics
¡ Computer-aided Drug
Design
¡ Molecular Representation
¡ Drug safety assessment
¡ Demo
¡ Tools and DBs
¡ Resources
¡ Summary
HISTORY OF
DRUG
DISCOVERY
ANCIENT METHODS: NATURE AS A SOURCE
¡ Search for Drugs not new:
¡ Traditional Chinese medicine and Ayurveda both
several thousand years old
¡ Many compounds now being studied
¡ Aspirin’s chemical forefather known to Hippocrates
¡ Even inoculation at least 2000 years old
¡ But also resulted in many ineffective drugs
source: https://amhistory.si.edu/polio/virusvaccine/history.htm
MORE RECENT EFFORTS
¡ In 1796, Jenner finds first vaccine: cowpox prevents
smallpox
¡ 1 century later, Pasteur makes vaccines against
anthrax and rabies
¡ Sulfonamides developed for antibacterial purposes
in 1930s
¡ Penicillin: the “miracle drug”
¡ 2nd half of 20th century: use of modern chemical
techniques to create explosion of medicines
RATIONAL DRUG DISCOVERY
PROCESS OF FINDING NEW MEDICATIONS
BASED ON THE KNOWLEDGE OF A
BIOLOGICAL TARGET.
MOST COMMONLY AN ORGANIC SMALL
MOLECULE THAT ACTIVATES OR INHIBITS
THE FUNCTION OF A PROTEIN
INVOLVES THE DESIGN OF MOLECULES
THAT ARE COMPLEMENTARY IN SHAPE
AND CHARGE TO THE BIOMOLECULAR
TARGET
DRUG TARGET IDENTIFICATION
¡ Different approaches to look for drug targets
¡ Phenotypic screening
¡ gene association studies
¡ chemo proteomics
¡ Transgenetic organisms
¡ Imaging
¡ Biomarkers
Source: https://www.roche.com/research_and_development/drawn_to_science/target_identification.htm
TARGETTO
DRUG CYCLE
source: https://www.researchgate.net/publication/294679594_DRUG_DISCOVERY_HIT_TO_LEAD
SCREENING
¡ High Throughput Screening
¡ Implemented in 1990s, still going strong
¡ Allows scientists to test 1000’s of potential targets
¡ Library size is around 1 million compounds
¡ Single screen program cost ~$75,000
¡ Estimated that only 4 small molecules with roots in
combinatorial chemistry made it to clinical
development by 2001
¡ Can make library even bigger if you spend more, but
can’t get comprehensive coverage
¡ Similarity paradox
¡ Slight change can mean difference between active and
inactive
HITTO LEAD
OPTIMIZATION
source: http://www.sbw.fi/lead-optimization/
DRUG DISCOVERY CYCLE
Involves the identification of screening hits using
medicinal chemistry and optimization of those hits to
increase:
¡ Affinity
¡ Selectivity (to reduce the potential of side effects),
¡ Efficacy/potency
¡ Druglikeness
PHOTO BY BOGHOG / CC BY-SA 4.0
ECONOMICS
source: https://www.nature.com/articles/nrd3681
Eroom’s Law: Opposite
of Moore’s Law –
Signals worrying trends
in number and cost of
Drugs to Market for
the Pharma industry
DRUG DISCOVERY
TIMELINE
source: https://www.innoplexus.com/blog/five-reasons-to-embrace-data-driven-drug-development/
COMPUTER-
AIDED DRUG
DESIGN
source: http://poster123.info/?u=Pharmacological+Strategies+To+Contend+Against+Myocardial
MOLECULAR
REPRESENTATION
1-D DESCRIPTORS
¡ Molecular properties often used for rough
classifications
¡ molecular weight, solubility, charge, number of
rotatable bonds, atom types, topological polar surface
area etc.
¡ Molecular properties like partition coefficient,
or logP, which measures the ratio of solubilities in
two different substances.
¡ The Lipinski rule of 5 is a simple rule of thumb that
is often used to pre-filter drug candidates
Source: chemical Reactivity, Drug-Likeness and Structure Activity/Property Relationship Studies of 2,1,3-Benzoxadiazole Derivatives as Anti-Cancer Activity
2-D DESCRIPTORS
¡ A common way of mapping variably structures
molecules into a fixed-size descriptor vector is
“fingerprinting”
¡ circular fingerprints are in more widespread use
today.
¡ A typical size of the bit vector is 1024
¡ The similarity between two molecules can be
estimated using the Tanimoto coefficient
¡ One standard implementation are extended circular
fingerprints (termed ECFPx,with a
number x designating the maximum diameter; e,g,
ECFP4 for a radius of 2 bonds)
QSAR
¡ Predictive statistical models correlating one or more
piece of response data about chemicals
¡ Statistical tools, including regression and classification-
based strategies, are used to analyze the response and
chemical data and their relationship
¡ Have been part of scientific study for many years.As
early as 1863, Cros found that the toxicity of alcohols
increased with decreasing aqueous solubility
¡ Machine learning tools are also very effective in
developing predictive models, particularly when
handling high-dimensional and complex chemical data
showing a nonlinear relationship with the responses of
the chemicals
SMILE STRING
¡ SMILES (“Simplified molecular-input line-entry system”)
¡ Represents molecules in the form of ASCII character strings
¡ Several equivalent ways to write the same compound
¡ Workaround is to use the canonical version of SMILE
¡ SMILES are reasonably human-readable
NEURAL FINGERPRINTS
¡ Hash function can be replaced by a neural network
¡ Final fingerprint vector is the sum over a number of
atom-wise softmax operations
¡ Similar to the pooling operation in standard neural
networks
¡ Can be more smooth than predefined circular
fingerprints
¡ Auto-encoders are also used to find compact latent
representations
¡ converts discrete representations of molecules to and
from a multidimensional continuous representation
DRUG SAFETY ASSESSMENT
¡ According to Tufts Center for the Study of Drug Development
(CSDD) the three main causes of failures in Phase III trials:
¡ Efficacy (or rather lack thereof) — i.e., failure to meet the
primary efficacy endpoint
¡ Safety (or lack thereof) — i.e., unexpected adverse or serious
adverse events
¡ Commercial / financial — i.e., failure to demonstrate value
compared to existing therapy
¡ According to another study byYale School of Medicine
¡ 71 of the 222 drugs approved in the first decade of the
millennium were withdrawn
¡ Took a median of 4.2 years after the drugs were approved for
these safety concerns to come to light
¡ Drugs ushered through the FDA's accelerated approval
process were among those that had higher rates of safety
interventions
TOX21 CHALLENGE
¡ Challenge was designed to help scientists
understand the potential of the chemicals and
compounds being tested
¡ The goal was to "crowdsource" data analysis by
independent researchers to reveal how well they
can predict compounds' interference in biochemical
pathways using only chemical structure data.
¡ The computational models produced from the
challenge would become decision-making tools for
government agencies
¡ NCATS provided assay activity data and chemical
structures on the Tox21 collection of ~10,000
compounds (Tox21 10K).
DEEPTOX
¡ Normalizes the chemical representations of the compounds
¡ Computes a large number of chemical descriptors that are used as input
to machine learning methods
¡ Trains models, evaluates them, and combines the best of them to
ensembles
¡ Predicts the toxicity of new compounds
¡ Had the highest performance of all computational methods
¡ Outperformed naive Bayes, SVM, and random forests
MULTI-TASK
LEARNING
¡ They were able to apply multi-
task learning in the Tox21
challenge because most of the
compounds were labeled for
several tasks
¡ Multi-task learning has been
shown to enhance the
performance of DNNs when
predicting biological activities at
the protein level
¡ Since the twelve different tasks
of the Tox21 challenge data were
highly correlated, they
implemented multi-task learning
in the DeepTox pipeline.
¡
ASSOCIATIONSTO
TOXICOPHORES
¡ The histogram (A) shows the
fraction of neurons in a layer that
yield significant correlations to a
toxicophore.With an increasing
level of the layer, the number of
neurons with significant
correlation decreases.
¡ The histogram shows the
number of neurons in a layer
that exceed a correlation
threshold of 0.6 to their best
correlated toxicophore.
Contrary to (A) the number of
neurons increases with the
network layer. Note that each
layer consisted of the same
number of neurons.
FEATURE
CONSTRUCTION
BY DEEP
LEARNING.
¡ Neurons that have learned to detect
the presence of toxicophores.
¡ Each row shows a particular hidden
unit in a learned network that
correlates highly with a particular
known toxicophore feature.
¡ The row shows the three chemical
compounds that had the highest
activation for that neuron.
¡ Indicated in red is the toxicophore
structure from the literature that the
neuron correlates with.The first row
and the second row are from the first
hidden layer, the third row is from a
higher-level layer.
DEMO
TOOLS AND
DATABASES
¡ Rdkit collection of cheminformatics and machine-learning
software written in C++ and Python.
¡ DeepChem is an integrated python library for chemistry and
drug discovery; it comes with a collection of implementations
for many deep learning based algorithms.
¡ Chembl is a public database containing millions of bioactive
molecules and assay results.The data has been manually
transcribed and curated from publications. Chembl is an
invaluable source, but has its share of errors—e.g., sometimes
affinities are off by exactly 3 or 6 orders of magnitude due to
wrongly transcribed units (micromols instead of nanomols).
¡ PDBbind is another frequently used database, which contains
protein-ligand co-crystal structures together with binding
affinity values.Again, while certainly very valuable, PDBbind has
some well-known data problems.
¡ https://www.click2drug.org/ website containing a comprehensive
list of computer-aided drug design (CADD) software, databases
and web services.
RESOURCES
¡ Lima,Angélica Nakagawa, Eric Allison Philot, Gustavo Henrique Goulart Trossini, Luis
Paulo Barbour Scott,Vinícius Gonçalves Maltarollo, and Kathia Maria Honorio. "Use of
Machine Learning Approaches for Novel Drug Discovery." Expert Opinion on Drug
Discovery. 2016.Accessed April 23, 2019.
https://www.ncbi.nlm.nih.gov/pubmed/26814169.
¡ Khamis, Mohamed A.,Walid Gomaa, andWalaa F.Ahmed. "Machine Learning in
Computational Docking." Artificial Intelligence in Medicine. March 2015.Accessed
April 23, 2019. https://www.ncbi.nlm.nih.gov/pubmed/25724101.
¡ Lima,Angélica Nakagawa, Eric Allison Philot, Gustavo Henrique Goulart Trossini, Luis
Paulo Barbour Scott,Vinícius Gonçalves Maltarollo, and Kathia Maria Honorio. "Use of
Machine Learning Approaches for Novel Drug Discovery." Expert Opinion on Drug
Discovery. 2016.Accessed April 23, 2019.
https://www.ncbi.nlm.nih.gov/pubmed/26814169.
¡ Mayr,Andreas, Klambauer, Günter,Thomas, Hochreiter, and Sepp. "DeepTox:Toxicity
Prediction Using Deep Learning." Frontiers. December 04, 2015.Accessed April 21,
2019. https://www.frontiersin.org/articles/10.3389/fenvs.2015.00080/full
SUMMARY
¡ Increasing pressure is forcing Pharma industry to turn to AI based
techniques to reduce time, costs and increase success rates of new
drugs to market
¡ Drug Safety is one of the top reasons for failures in FDA approvals of
new drugs and recalls
¡ AI and Deep learning techniques have show lot of promise compared
to traditional techniques in drug discovery and safety
¡ The race for using AI is on and over 100 new startups are now
pursuing this line of inquiry

Contenu connexe

Tendances

Personalized Medicine: Current and Future Perspectives Personalized Medicin...
Personalized Medicine: Current and Future Perspectives 	 Personalized Medicin...Personalized Medicine: Current and Future Perspectives 	 Personalized Medicin...
Personalized Medicine: Current and Future Perspectives Personalized Medicin...
MedicineAndHealth
 

Tendances (20)

Drug discovery using ai
Drug discovery using aiDrug discovery using ai
Drug discovery using ai
 
Role of AI in Drug Discovery and Development
Role of AI in  Drug Discovery and DevelopmentRole of AI in  Drug Discovery and Development
Role of AI in Drug Discovery and Development
 
The story of personalized medicine
The story of personalized medicineThe story of personalized medicine
The story of personalized medicine
 
Alternative methods to animal toxicity testing
Alternative methods to animal toxicity testingAlternative methods to animal toxicity testing
Alternative methods to animal toxicity testing
 
Artificial Intelligence in Pharmaceutical Science
Artificial Intelligence in Pharmaceutical ScienceArtificial Intelligence in Pharmaceutical Science
Artificial Intelligence in Pharmaceutical Science
 
Target identification
Target identificationTarget identification
Target identification
 
Target Validation
Target ValidationTarget Validation
Target Validation
 
Docking
DockingDocking
Docking
 
drug discovery & development
drug discovery & developmentdrug discovery & development
drug discovery & development
 
Target discovery and validation
Target discovery and validation Target discovery and validation
Target discovery and validation
 
Artificial intelligence and its applications in healthcare and pharmacy
Artificial intelligence and its applications in healthcare and pharmacyArtificial intelligence and its applications in healthcare and pharmacy
Artificial intelligence and its applications in healthcare and pharmacy
 
Artificial intelligence in drug discovery and development
Artificial intelligence in drug discovery and developmentArtificial intelligence in drug discovery and development
Artificial intelligence in drug discovery and development
 
Genomics & Proteomics Based Drug Discovery
Genomics & Proteomics Based Drug DiscoveryGenomics & Proteomics Based Drug Discovery
Genomics & Proteomics Based Drug Discovery
 
Role of Target Identification and Target Validation in Drug Discovery Process
Role of Target Identification and Target Validation in Drug Discovery ProcessRole of Target Identification and Target Validation in Drug Discovery Process
Role of Target Identification and Target Validation in Drug Discovery Process
 
List of studies needed for IND submission
List of studies needed for IND submissionList of studies needed for IND submission
List of studies needed for IND submission
 
Rational drug design
Rational drug designRational drug design
Rational drug design
 
Biosimilars
BiosimilarsBiosimilars
Biosimilars
 
Guidelines on adr reporting
Guidelines on adr reportingGuidelines on adr reporting
Guidelines on adr reporting
 
Role of nuclicacid microarray &protein micro array for drug discovery process
Role of nuclicacid microarray &protein micro array for drug discovery processRole of nuclicacid microarray &protein micro array for drug discovery process
Role of nuclicacid microarray &protein micro array for drug discovery process
 
Personalized Medicine: Current and Future Perspectives Personalized Medicin...
Personalized Medicine: Current and Future Perspectives 	 Personalized Medicin...Personalized Medicine: Current and Future Perspectives 	 Personalized Medicin...
Personalized Medicine: Current and Future Perspectives Personalized Medicin...
 

Similaire à Assessing Drug Safety Using AI

CoMPARA: Collaborative Modeling Project for Androgen Receptor Activity
CoMPARA: Collaborative Modeling Project for Androgen Receptor ActivityCoMPARA: Collaborative Modeling Project for Androgen Receptor Activity
CoMPARA: Collaborative Modeling Project for Androgen Receptor Activity
Kamel Mansouri
 
EDSP Prioritization: Collaborative Estrogen Receptor Activity Prediction Proj...
EDSP Prioritization: Collaborative Estrogen Receptor Activity Prediction Proj...EDSP Prioritization: Collaborative Estrogen Receptor Activity Prediction Proj...
EDSP Prioritization: Collaborative Estrogen Receptor Activity Prediction Proj...
Kamel Mansouri
 
Consensus Models to Predict Endocrine Disruption for All Human-Exposure Chemi...
Consensus Models to Predict Endocrine Disruption for All Human-Exposure Chemi...Consensus Models to Predict Endocrine Disruption for All Human-Exposure Chemi...
Consensus Models to Predict Endocrine Disruption for All Human-Exposure Chemi...
Kamel Mansouri
 
Data Mining and Big Data Analytics in Pharma
Data Mining and Big Data Analytics in Pharma Data Mining and Big Data Analytics in Pharma
Data Mining and Big Data Analytics in Pharma
Ankur Khanna
 
Meta analysis of molecular property patterns and filtering of public datasets...
Meta analysis of molecular property patterns and filtering of public datasets...Meta analysis of molecular property patterns and filtering of public datasets...
Meta analysis of molecular property patterns and filtering of public datasets...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 

Similaire à Assessing Drug Safety Using AI (20)

Bioinformatics t9-t10-biocheminformatics v2014
Bioinformatics t9-t10-biocheminformatics v2014Bioinformatics t9-t10-biocheminformatics v2014
Bioinformatics t9-t10-biocheminformatics v2014
 
Crofton Evolution of Toxicology
Crofton Evolution of ToxicologyCrofton Evolution of Toxicology
Crofton Evolution of Toxicology
 
New Approach Methods - What is That?
New Approach Methods - What is That?New Approach Methods - What is That?
New Approach Methods - What is That?
 
Multiplexing analysis of 1000 approved drugs in PubChem
Multiplexing analysis of 1000 approved drugs in PubChemMultiplexing analysis of 1000 approved drugs in PubChem
Multiplexing analysis of 1000 approved drugs in PubChem
 
CoMPARA: Collaborative Modeling Project for Androgen Receptor Activity
CoMPARA: Collaborative Modeling Project for Androgen Receptor ActivityCoMPARA: Collaborative Modeling Project for Androgen Receptor Activity
CoMPARA: Collaborative Modeling Project for Androgen Receptor Activity
 
Bioinformatica 15-12-2011-t9-t10-bio cheminformatics
Bioinformatica 15-12-2011-t9-t10-bio cheminformaticsBioinformatica 15-12-2011-t9-t10-bio cheminformatics
Bioinformatica 15-12-2011-t9-t10-bio cheminformatics
 
2011-10-11 Open PHACTS at BioIT World Europe
2011-10-11 Open PHACTS at BioIT World Europe2011-10-11 Open PHACTS at BioIT World Europe
2011-10-11 Open PHACTS at BioIT World Europe
 
Back Rapid lead compounds discovery through high-throughput screening
 Back Rapid lead compounds discovery through high-throughput screening Back Rapid lead compounds discovery through high-throughput screening
Back Rapid lead compounds discovery through high-throughput screening
 
Advances in computer aided drug design
Advances in computer aided drug designAdvances in computer aided drug design
Advances in computer aided drug design
 
EDSP Prioritization: Collaborative Estrogen Receptor Activity Prediction Proj...
EDSP Prioritization: Collaborative Estrogen Receptor Activity Prediction Proj...EDSP Prioritization: Collaborative Estrogen Receptor Activity Prediction Proj...
EDSP Prioritization: Collaborative Estrogen Receptor Activity Prediction Proj...
 
Linde Gas whitepaper 'Drug discovery advances inextricably linked to specialt...
Linde Gas whitepaper 'Drug discovery advances inextricably linked to specialt...Linde Gas whitepaper 'Drug discovery advances inextricably linked to specialt...
Linde Gas whitepaper 'Drug discovery advances inextricably linked to specialt...
 
Consensus Models to Predict Endocrine Disruption for All Human-Exposure Chemi...
Consensus Models to Predict Endocrine Disruption for All Human-Exposure Chemi...Consensus Models to Predict Endocrine Disruption for All Human-Exposure Chemi...
Consensus Models to Predict Endocrine Disruption for All Human-Exposure Chemi...
 
Explore new Frontiers in Medicine with AI (2).pdf
Explore new Frontiers in Medicine with AI (2).pdfExplore new Frontiers in Medicine with AI (2).pdf
Explore new Frontiers in Medicine with AI (2).pdf
 
Explore new Frontiers in Medicine with AI.pdf
Explore new Frontiers in Medicine with AI.pdfExplore new Frontiers in Medicine with AI.pdf
Explore new Frontiers in Medicine with AI.pdf
 
Data Mining and Big Data Analytics in Pharma
Data Mining and Big Data Analytics in Pharma Data Mining and Big Data Analytics in Pharma
Data Mining and Big Data Analytics in Pharma
 
Journal
JournalJournal
Journal
 
AXP302
AXP302AXP302
AXP302
 
Medicinal chemistry
Medicinal chemistryMedicinal chemistry
Medicinal chemistry
 
Meta analysis of molecular property patterns and filtering of public datasets...
Meta analysis of molecular property patterns and filtering of public datasets...Meta analysis of molecular property patterns and filtering of public datasets...
Meta analysis of molecular property patterns and filtering of public datasets...
 
Medicinal chemistry
Medicinal chemistryMedicinal chemistry
Medicinal chemistry
 

Plus de Databricks

Democratizing Data Quality Through a Centralized Platform
Democratizing Data Quality Through a Centralized PlatformDemocratizing Data Quality Through a Centralized Platform
Democratizing Data Quality Through a Centralized Platform
Databricks
 
Stage Level Scheduling Improving Big Data and AI Integration
Stage Level Scheduling Improving Big Data and AI IntegrationStage Level Scheduling Improving Big Data and AI Integration
Stage Level Scheduling Improving Big Data and AI Integration
Databricks
 
Simplify Data Conversion from Spark to TensorFlow and PyTorch
Simplify Data Conversion from Spark to TensorFlow and PyTorchSimplify Data Conversion from Spark to TensorFlow and PyTorch
Simplify Data Conversion from Spark to TensorFlow and PyTorch
Databricks
 
Raven: End-to-end Optimization of ML Prediction Queries
Raven: End-to-end Optimization of ML Prediction QueriesRaven: End-to-end Optimization of ML Prediction Queries
Raven: End-to-end Optimization of ML Prediction Queries
Databricks
 
Processing Large Datasets for ADAS Applications using Apache Spark
Processing Large Datasets for ADAS Applications using Apache SparkProcessing Large Datasets for ADAS Applications using Apache Spark
Processing Large Datasets for ADAS Applications using Apache Spark
Databricks
 

Plus de Databricks (20)

DW Migration Webinar-March 2022.pptx
DW Migration Webinar-March 2022.pptxDW Migration Webinar-March 2022.pptx
DW Migration Webinar-March 2022.pptx
 
Data Lakehouse Symposium | Day 1 | Part 1
Data Lakehouse Symposium | Day 1 | Part 1Data Lakehouse Symposium | Day 1 | Part 1
Data Lakehouse Symposium | Day 1 | Part 1
 
Data Lakehouse Symposium | Day 1 | Part 2
Data Lakehouse Symposium | Day 1 | Part 2Data Lakehouse Symposium | Day 1 | Part 2
Data Lakehouse Symposium | Day 1 | Part 2
 
Data Lakehouse Symposium | Day 2
Data Lakehouse Symposium | Day 2Data Lakehouse Symposium | Day 2
Data Lakehouse Symposium | Day 2
 
Data Lakehouse Symposium | Day 4
Data Lakehouse Symposium | Day 4Data Lakehouse Symposium | Day 4
Data Lakehouse Symposium | Day 4
 
5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop
5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop
5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop
 
Democratizing Data Quality Through a Centralized Platform
Democratizing Data Quality Through a Centralized PlatformDemocratizing Data Quality Through a Centralized Platform
Democratizing Data Quality Through a Centralized Platform
 
Learn to Use Databricks for Data Science
Learn to Use Databricks for Data ScienceLearn to Use Databricks for Data Science
Learn to Use Databricks for Data Science
 
Why APM Is Not the Same As ML Monitoring
Why APM Is Not the Same As ML MonitoringWhy APM Is Not the Same As ML Monitoring
Why APM Is Not the Same As ML Monitoring
 
The Function, the Context, and the Data—Enabling ML Ops at Stitch Fix
The Function, the Context, and the Data—Enabling ML Ops at Stitch FixThe Function, the Context, and the Data—Enabling ML Ops at Stitch Fix
The Function, the Context, and the Data—Enabling ML Ops at Stitch Fix
 
Stage Level Scheduling Improving Big Data and AI Integration
Stage Level Scheduling Improving Big Data and AI IntegrationStage Level Scheduling Improving Big Data and AI Integration
Stage Level Scheduling Improving Big Data and AI Integration
 
Simplify Data Conversion from Spark to TensorFlow and PyTorch
Simplify Data Conversion from Spark to TensorFlow and PyTorchSimplify Data Conversion from Spark to TensorFlow and PyTorch
Simplify Data Conversion from Spark to TensorFlow and PyTorch
 
Scaling your Data Pipelines with Apache Spark on Kubernetes
Scaling your Data Pipelines with Apache Spark on KubernetesScaling your Data Pipelines with Apache Spark on Kubernetes
Scaling your Data Pipelines with Apache Spark on Kubernetes
 
Scaling and Unifying SciKit Learn and Apache Spark Pipelines
Scaling and Unifying SciKit Learn and Apache Spark PipelinesScaling and Unifying SciKit Learn and Apache Spark Pipelines
Scaling and Unifying SciKit Learn and Apache Spark Pipelines
 
Sawtooth Windows for Feature Aggregations
Sawtooth Windows for Feature AggregationsSawtooth Windows for Feature Aggregations
Sawtooth Windows for Feature Aggregations
 
Redis + Apache Spark = Swiss Army Knife Meets Kitchen Sink
Redis + Apache Spark = Swiss Army Knife Meets Kitchen SinkRedis + Apache Spark = Swiss Army Knife Meets Kitchen Sink
Redis + Apache Spark = Swiss Army Knife Meets Kitchen Sink
 
Re-imagine Data Monitoring with whylogs and Spark
Re-imagine Data Monitoring with whylogs and SparkRe-imagine Data Monitoring with whylogs and Spark
Re-imagine Data Monitoring with whylogs and Spark
 
Raven: End-to-end Optimization of ML Prediction Queries
Raven: End-to-end Optimization of ML Prediction QueriesRaven: End-to-end Optimization of ML Prediction Queries
Raven: End-to-end Optimization of ML Prediction Queries
 
Processing Large Datasets for ADAS Applications using Apache Spark
Processing Large Datasets for ADAS Applications using Apache SparkProcessing Large Datasets for ADAS Applications using Apache Spark
Processing Large Datasets for ADAS Applications using Apache Spark
 
Massive Data Processing in Adobe Using Delta Lake
Massive Data Processing in Adobe Using Delta LakeMassive Data Processing in Adobe Using Delta Lake
Massive Data Processing in Adobe Using Delta Lake
 

Dernier

Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptx
JohnnyPlasten
 
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Riyadh +966572737505 get cytotec
 
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
amitlee9823
 
Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...
shambhavirathore45
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
amitlee9823
 
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdf
Lars Albertsson
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
AroojKhan71
 
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in  KishangarhDelhi 99530 vip 56974 Genuine Escort Service Call Girls in  Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 

Dernier (20)

100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptx
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Research
 
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFx
 
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
 
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
 
Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...
 
Data-Analysis for Chicago Crime Data 2023
Data-Analysis for Chicago Crime Data  2023Data-Analysis for Chicago Crime Data  2023
Data-Analysis for Chicago Crime Data 2023
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptx
 
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
 
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdf
 
Zuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptxZuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptx
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
 
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
 
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in  KishangarhDelhi 99530 vip 56974 Genuine Escort Service Call Girls in  Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
 
Sampling (random) method and Non random.ppt
Sampling (random) method and Non random.pptSampling (random) method and Non random.ppt
Sampling (random) method and Non random.ppt
 

Assessing Drug Safety Using AI

  • 1. ASSESSING DRUG SAFETY USING AI COMPUTER AIDED DRUG DISCOVERY AND DEVELOPMENT
  • 2. ABOUT ME ¡ VishnuVettrivel - vishnu@wisecube.ai ¡ Data Science/AI platform Architect ¡ NOT a Molecular Biologist or a Medicinal Chemist ! ¡ Will be talking about things learnt mostly on the job ¡ Have been working with a Molecular biologist in a Biotech research firm to help accelerate drug discovery using Machine learning
  • 3. AGENDA ¡ History ¡ Nature as Source ¡ Recent efforts ¡ Rational drug discovery ¡ Drug targeting ¡ Screening ¡ Drug Discovery Cycle ¡ Economics ¡ Computer-aided Drug Design ¡ Molecular Representation ¡ Drug safety assessment ¡ Demo ¡ Tools and DBs ¡ Resources ¡ Summary
  • 5. ANCIENT METHODS: NATURE AS A SOURCE ¡ Search for Drugs not new: ¡ Traditional Chinese medicine and Ayurveda both several thousand years old ¡ Many compounds now being studied ¡ Aspirin’s chemical forefather known to Hippocrates ¡ Even inoculation at least 2000 years old ¡ But also resulted in many ineffective drugs source: https://amhistory.si.edu/polio/virusvaccine/history.htm
  • 6. MORE RECENT EFFORTS ¡ In 1796, Jenner finds first vaccine: cowpox prevents smallpox ¡ 1 century later, Pasteur makes vaccines against anthrax and rabies ¡ Sulfonamides developed for antibacterial purposes in 1930s ¡ Penicillin: the “miracle drug” ¡ 2nd half of 20th century: use of modern chemical techniques to create explosion of medicines
  • 7. RATIONAL DRUG DISCOVERY PROCESS OF FINDING NEW MEDICATIONS BASED ON THE KNOWLEDGE OF A BIOLOGICAL TARGET. MOST COMMONLY AN ORGANIC SMALL MOLECULE THAT ACTIVATES OR INHIBITS THE FUNCTION OF A PROTEIN INVOLVES THE DESIGN OF MOLECULES THAT ARE COMPLEMENTARY IN SHAPE AND CHARGE TO THE BIOMOLECULAR TARGET
  • 8. DRUG TARGET IDENTIFICATION ¡ Different approaches to look for drug targets ¡ Phenotypic screening ¡ gene association studies ¡ chemo proteomics ¡ Transgenetic organisms ¡ Imaging ¡ Biomarkers Source: https://www.roche.com/research_and_development/drawn_to_science/target_identification.htm
  • 10. SCREENING ¡ High Throughput Screening ¡ Implemented in 1990s, still going strong ¡ Allows scientists to test 1000’s of potential targets ¡ Library size is around 1 million compounds ¡ Single screen program cost ~$75,000 ¡ Estimated that only 4 small molecules with roots in combinatorial chemistry made it to clinical development by 2001 ¡ Can make library even bigger if you spend more, but can’t get comprehensive coverage ¡ Similarity paradox ¡ Slight change can mean difference between active and inactive
  • 12. DRUG DISCOVERY CYCLE Involves the identification of screening hits using medicinal chemistry and optimization of those hits to increase: ¡ Affinity ¡ Selectivity (to reduce the potential of side effects), ¡ Efficacy/potency ¡ Druglikeness PHOTO BY BOGHOG / CC BY-SA 4.0
  • 13. ECONOMICS source: https://www.nature.com/articles/nrd3681 Eroom’s Law: Opposite of Moore’s Law – Signals worrying trends in number and cost of Drugs to Market for the Pharma industry
  • 17. 1-D DESCRIPTORS ¡ Molecular properties often used for rough classifications ¡ molecular weight, solubility, charge, number of rotatable bonds, atom types, topological polar surface area etc. ¡ Molecular properties like partition coefficient, or logP, which measures the ratio of solubilities in two different substances. ¡ The Lipinski rule of 5 is a simple rule of thumb that is often used to pre-filter drug candidates Source: chemical Reactivity, Drug-Likeness and Structure Activity/Property Relationship Studies of 2,1,3-Benzoxadiazole Derivatives as Anti-Cancer Activity
  • 18. 2-D DESCRIPTORS ¡ A common way of mapping variably structures molecules into a fixed-size descriptor vector is “fingerprinting” ¡ circular fingerprints are in more widespread use today. ¡ A typical size of the bit vector is 1024 ¡ The similarity between two molecules can be estimated using the Tanimoto coefficient ¡ One standard implementation are extended circular fingerprints (termed ECFPx,with a number x designating the maximum diameter; e,g, ECFP4 for a radius of 2 bonds)
  • 19. QSAR ¡ Predictive statistical models correlating one or more piece of response data about chemicals ¡ Statistical tools, including regression and classification- based strategies, are used to analyze the response and chemical data and their relationship ¡ Have been part of scientific study for many years.As early as 1863, Cros found that the toxicity of alcohols increased with decreasing aqueous solubility ¡ Machine learning tools are also very effective in developing predictive models, particularly when handling high-dimensional and complex chemical data showing a nonlinear relationship with the responses of the chemicals
  • 20. SMILE STRING ¡ SMILES (“Simplified molecular-input line-entry system”) ¡ Represents molecules in the form of ASCII character strings ¡ Several equivalent ways to write the same compound ¡ Workaround is to use the canonical version of SMILE ¡ SMILES are reasonably human-readable
  • 21. NEURAL FINGERPRINTS ¡ Hash function can be replaced by a neural network ¡ Final fingerprint vector is the sum over a number of atom-wise softmax operations ¡ Similar to the pooling operation in standard neural networks ¡ Can be more smooth than predefined circular fingerprints ¡ Auto-encoders are also used to find compact latent representations ¡ converts discrete representations of molecules to and from a multidimensional continuous representation
  • 22. DRUG SAFETY ASSESSMENT ¡ According to Tufts Center for the Study of Drug Development (CSDD) the three main causes of failures in Phase III trials: ¡ Efficacy (or rather lack thereof) — i.e., failure to meet the primary efficacy endpoint ¡ Safety (or lack thereof) — i.e., unexpected adverse or serious adverse events ¡ Commercial / financial — i.e., failure to demonstrate value compared to existing therapy ¡ According to another study byYale School of Medicine ¡ 71 of the 222 drugs approved in the first decade of the millennium were withdrawn ¡ Took a median of 4.2 years after the drugs were approved for these safety concerns to come to light ¡ Drugs ushered through the FDA's accelerated approval process were among those that had higher rates of safety interventions
  • 23. TOX21 CHALLENGE ¡ Challenge was designed to help scientists understand the potential of the chemicals and compounds being tested ¡ The goal was to "crowdsource" data analysis by independent researchers to reveal how well they can predict compounds' interference in biochemical pathways using only chemical structure data. ¡ The computational models produced from the challenge would become decision-making tools for government agencies ¡ NCATS provided assay activity data and chemical structures on the Tox21 collection of ~10,000 compounds (Tox21 10K).
  • 24. DEEPTOX ¡ Normalizes the chemical representations of the compounds ¡ Computes a large number of chemical descriptors that are used as input to machine learning methods ¡ Trains models, evaluates them, and combines the best of them to ensembles ¡ Predicts the toxicity of new compounds ¡ Had the highest performance of all computational methods ¡ Outperformed naive Bayes, SVM, and random forests
  • 25. MULTI-TASK LEARNING ¡ They were able to apply multi- task learning in the Tox21 challenge because most of the compounds were labeled for several tasks ¡ Multi-task learning has been shown to enhance the performance of DNNs when predicting biological activities at the protein level ¡ Since the twelve different tasks of the Tox21 challenge data were highly correlated, they implemented multi-task learning in the DeepTox pipeline. ¡
  • 26. ASSOCIATIONSTO TOXICOPHORES ¡ The histogram (A) shows the fraction of neurons in a layer that yield significant correlations to a toxicophore.With an increasing level of the layer, the number of neurons with significant correlation decreases. ¡ The histogram shows the number of neurons in a layer that exceed a correlation threshold of 0.6 to their best correlated toxicophore. Contrary to (A) the number of neurons increases with the network layer. Note that each layer consisted of the same number of neurons.
  • 27. FEATURE CONSTRUCTION BY DEEP LEARNING. ¡ Neurons that have learned to detect the presence of toxicophores. ¡ Each row shows a particular hidden unit in a learned network that correlates highly with a particular known toxicophore feature. ¡ The row shows the three chemical compounds that had the highest activation for that neuron. ¡ Indicated in red is the toxicophore structure from the literature that the neuron correlates with.The first row and the second row are from the first hidden layer, the third row is from a higher-level layer.
  • 28. DEMO
  • 29. TOOLS AND DATABASES ¡ Rdkit collection of cheminformatics and machine-learning software written in C++ and Python. ¡ DeepChem is an integrated python library for chemistry and drug discovery; it comes with a collection of implementations for many deep learning based algorithms. ¡ Chembl is a public database containing millions of bioactive molecules and assay results.The data has been manually transcribed and curated from publications. Chembl is an invaluable source, but has its share of errors—e.g., sometimes affinities are off by exactly 3 or 6 orders of magnitude due to wrongly transcribed units (micromols instead of nanomols). ¡ PDBbind is another frequently used database, which contains protein-ligand co-crystal structures together with binding affinity values.Again, while certainly very valuable, PDBbind has some well-known data problems. ¡ https://www.click2drug.org/ website containing a comprehensive list of computer-aided drug design (CADD) software, databases and web services.
  • 30. RESOURCES ¡ Lima,Angélica Nakagawa, Eric Allison Philot, Gustavo Henrique Goulart Trossini, Luis Paulo Barbour Scott,Vinícius Gonçalves Maltarollo, and Kathia Maria Honorio. "Use of Machine Learning Approaches for Novel Drug Discovery." Expert Opinion on Drug Discovery. 2016.Accessed April 23, 2019. https://www.ncbi.nlm.nih.gov/pubmed/26814169. ¡ Khamis, Mohamed A.,Walid Gomaa, andWalaa F.Ahmed. "Machine Learning in Computational Docking." Artificial Intelligence in Medicine. March 2015.Accessed April 23, 2019. https://www.ncbi.nlm.nih.gov/pubmed/25724101. ¡ Lima,Angélica Nakagawa, Eric Allison Philot, Gustavo Henrique Goulart Trossini, Luis Paulo Barbour Scott,Vinícius Gonçalves Maltarollo, and Kathia Maria Honorio. "Use of Machine Learning Approaches for Novel Drug Discovery." Expert Opinion on Drug Discovery. 2016.Accessed April 23, 2019. https://www.ncbi.nlm.nih.gov/pubmed/26814169. ¡ Mayr,Andreas, Klambauer, Günter,Thomas, Hochreiter, and Sepp. "DeepTox:Toxicity Prediction Using Deep Learning." Frontiers. December 04, 2015.Accessed April 21, 2019. https://www.frontiersin.org/articles/10.3389/fenvs.2015.00080/full
  • 31. SUMMARY ¡ Increasing pressure is forcing Pharma industry to turn to AI based techniques to reduce time, costs and increase success rates of new drugs to market ¡ Drug Safety is one of the top reasons for failures in FDA approvals of new drugs and recalls ¡ AI and Deep learning techniques have show lot of promise compared to traditional techniques in drug discovery and safety ¡ The race for using AI is on and over 100 new startups are now pursuing this line of inquiry