Jack Tuszynski Accelerating Chemotherapy Drug Discovery with Analytics and High Performance Computing

Jack Tuszynski
Cross Cancer Institute
Department of Physics
University of Alberta
Edmonton, Canada
http://www.phys.ualberta.ca/~jtus
“Accelerating Chemotherapy
Drug Discovery with High
Performance Computing and
Analytics”

Modern Drug DevelopmentModern Drug Development
Success Rate 1:100,000 !Success Rate 1:100,000 !
00 22 44 66 88 1010 1212 1414 1616
DiscoveryDiscovery
Preclinical testingPreclinical testing
Phase IPhase I
Phase IIPhase II
Phase IIIPhase III
ApprovalApproval
Post marketPost market
100,000100,000
100100
55
11
Time in years Cost $1B

Identify disease
Isolate protein
Find drug
Preclinical testing
GENOMICS, PROTEOMICS & BIOPHARM.
HIGH THROUGHPUT SCREENING
MOLECULAR MODELING
VIRTUAL SCREENING
COMBINATORIAL CHEMISTRY
IN VITRO & IN SILICO ADME MODELS
Potentially producing many more targets
and “personalized” targets
Screening up to 100,000 compounds a
day for activity against a target protein
Using a computer to
predict activity
Rapidly producing vast numbers
of compounds
Computer graphics & models help improve activity
Tissue and computer models begin to replace animal testing
VIRTUAL SCREENING
MOLECULAR MODELING
The Evolution in Drug Design and Development

5
Integration of biological dataIntegration of biological data
impacts drug developmentimpacts drug development
information stored in the genetic code (DNA)information stored in the genetic code (DNA)
protein sequencesprotein sequences
3D structures of biomolecules3D structures of biomolecules
experimental results from various sources (kd, IC50,experimental results from various sources (kd, IC50,
expression)expression)
clinical dataclinical data
patient statisticspatient statistics
scientific literaturescientific literature

6
……and leads toand leads to
computational explosioncomputational explosion
An avalanche of data:An avalanche of data:
SequencesSequences
Functional relationsFunctional relations
StructuresStructures
This requiresThis requires
computationalcomputational
approachesapproaches
• 100’s of completed genomes
• 1000’s of known reactions
• 10,000’s of known 3D structures
• 100,000’s of protein-ligand
interactions
• 1,000,000’s of known proteins &
enzymes
• Decades of biological/chemical
know-how
• Computational & Mathematical
resources
The Push to Systems Biology

77
Key areas ofKey areas of
bioinformaticsbioinformatics
organisation of knowledge
(sequences, structures,
functional data)
e.g. homology
searches

Specifically for drug discovery:
PDB : 50,000 proteins + homologs
1500 targets (human proteins)
Approx. 400 (80 in cancer) utilized
Orange Book: 1800 medicinal drugs
Drug Bank: 4900 drugs
Cancer chemotherapy drugs: 103
Protein-drug interactions but also
Protein-protein interactions

Molecular Targets:Cancer Cell NetworkMolecular Targets:Cancer Cell Network
A very complex but algorithmic system
Based on a lock-and-key principle
We will find keys to all these locks by 2061

CANCER CHEMOTHERAPY DRUGS
Approximately 100 standard chemotherapeutic drugs:
1)Alkylating agents: Genotoxic (20-25)
2) Plant alkaloids: Inhibition of mitosis (10-15)
3) Antimetabolites: Inhibition of base synthesis (15-20)
4) Antibiotics: Derived from Streptomyces (10-15)
5) Targeted antibodies: Bind cell surface receptors (5-10)
6) Hormones: Inhibit or stimulate hormone signaling (15-20)
7) Directly targeting small molecules
8)Other indirect effects: Angiogenesis or immune modulators (10-15)
Number of current chemotherapy targets: 101
Number of chemotherapy drugs: 102
Potential Targets (Pharmacogenomics): 103
Paclitaxel
Cisplatin
Methotrexate
Trastuzumab
Imatinib
Tamoxifen
Doxorubicin
Bevacizumab

G2
M
G1
S
G0
tyrosine kinases
DNA synthesis
topoisomerase I
CDK2
tubulin
polymerisation/
depolymerisation
Vinca alkaloids*
taxol/taxotere
halichondrin*
spongistatin*
rhizoxin*
cryptophycin
sarcodictyin
eleutherobin
epothilones
discodermolide
D-24851 ?
dolastatin*
combretastatin*
camptothecin
CDK4
flavopiridol
(R)-roscovitine (CYC202)
paullones, indirubins
gleevec
iressa
OSI774
hydroxyurea
cytarabine
antifolates
5-fluorouracil
6-mercaptopurine
nitrogen mustards
nitrosoureas
mitomycin C
CDK1
Chk1
Chk2
UCN-01, SB-218078
debromohymenialdisine
isogranulatimide
AhR
actin
kinesin Eg5
monastrol
ecteinascidin 743
podophyllotoxin,doxorubicin
etoposide, mitoxantrone
topoisomerase II
ATM/ATR
R115777
SCH66336
ROCK
Y-27632
CDC25
DF203
FK317 HMGA
Plk1
Aurora
wortmanni
n
caffeine
ODC/SAMDC
Pin1
GSK-3
Cdc7
nucleotide excision
repair
Raf cytochalasins
latrunculin A
scytophycins
dolastatin 11
jasplakinolide
(R)-roscovitine (CYC202)
BAY-43-9006
fumagillin,TNP-470
PRIMA-1, pifithrin a
rapamycin mTOR/FRAP
PS-341 proteasome
bryostatin,
PKC412
PKC
histone deacetylasetrichostatin,
FK228
HSP90geldanamycin, 17-
AAGATK, MAFP cytosolic phospholipase A2
hexadecylphosphocholin
e
phospholipase D
CT-2584 choline
kinase
MEK1/Erk-1/2
PD98059, U0126
menadione
(K3)
farnesyl transferase
phosphatasesokadaic acid, fostreicin, calyculin A
Wee1
PD0166285
polyamine analogues
Pin1
p53/MDM2
Source: Cell cycle laboratory, L. Meijer, Roscoff, France
~80 drugs and drug candidates
Cancer chemotherapy is based on cell cycle arrest

CAUSES OF FAILURE IN DRUG
DEVELOPMENT
ADME
ANIMAL TOXICITY
LACK OF EFFICACY
ADVERSE EFFECTS
IN HUMANS
More than 50% of this failure can be predicted computationally in 2011
In 2061: six sigma will be achieved in silico

WET LAB: High-throughput screening (HTS)WET LAB: High-throughput screening (HTS)
Experimental techniqueExperimental technique
384-well microplates, florescence-based detection &384-well microplates, florescence-based detection &
desktop robotsdesktop robots
Up to 1M compounds per targetUp to 1M compounds per target
DRY LAB: Virtual screening (VS)DRY LAB: Virtual screening (VS)
Ligand-based methodsLigand-based methods
2D structures, substructures, fingerprints2D structures, substructures, fingerprints
Volume/surface matchingVolume/surface matching
3D pharmacophores, fingerprints3D pharmacophores, fingerprints
Receptor-based methodsReceptor-based methods
DockingDocking
Even 100B compounds per target triedEven 100B compounds per target tried
Receptor flexibility

OUR 1024-PROCESSOR HPC CLUSTER
WE ALSO USE 500 PROCESSORS FROM
WEST-GRID AND SHARCNET

Target-Protein Structure
MRECISIHVGQAGVQIGNACWELYCLEHGIQPDGQMPSDKTIGGGDDSFNTFFSETGAGKHVPRAVFVDLEPTV
IDEVRTGTYRQLFHPEQLITGKEDAANNYARGHYTIGKEIIDLVLDRIRKLADQCTGLQGFSVFHSFGGGTGSGFT
SLLMERLSVDYGKKSKLEFSIYPAPQVSTAVVEPYNSILTTHTTLEHSDCAFMVDNEAIYDICRRNLDIERPTYTNL
NRLIGQIVSSITASLRFDGALNVDLTEFQTNLVPYPRGHFPLATYAPVISAEKAYHEQLSVAEITNACFEPANQMV
KCDPRHGKYMACCLLYRGDVVPKDVNAAIATIKTKRTIQFVDWCPTGFKVGINYEPPTVVPGGDLAKVQRAVCM
LSNTTAIAEAWARLDHKFDLMYAKRAFVHWYVGEGMEEGEFSEAREDMAALEKDYEEVGVDSVEGEGEEEGE
EY
Primary: amino acid
sequence
Secondary: -helix and -α β
sheet
Tertiary: 3D-folding
Quaternary:
multimeric
arrangement

Molecular Dynamics
• Treats molecules
classically:
– Point charges and
masses
– Spring-like bonds
– Numerical integration of
equations of motion

Drug binding sites in tubulin
 Of the more thanOf the more than 100100 approvedapproved
cancer chemotherapy drugs oncancer chemotherapy drugs on
the market, approximately 15%the market, approximately 15%
target tubulin directly.target tubulin directly.
 None are specific for cancerNone are specific for cancer
cells, hence associated sidecells, hence associated side
effectseffects

Drug / Ligand
Protein
Drug ActionDrug Action: Inhibition of Protein-: Inhibition of Protein-
Protein InteractionsProtein Interactions
Cavity
Cavity
Cavity

The computational toolboxThe computational toolbox
The three-fold way:The three-fold way:
rational design andrational design and in silicoin silico testing of derivatives of knowntesting of derivatives of known
agentsagents
brute-force computational search using existing librariesbrute-force computational search using existing libraries
(pharma-matrix)(pharma-matrix)
De novo design from common pharmacophores for bestDe novo design from common pharmacophores for best
space filling propertiesspace filling properties
a pocketome data banka pocketome data bank
Reverse docking allows to predict side effectsReverse docking allows to predict side effects

ContentsContents
Compound dataCompound data sourcessources (PubChem, Zinc, NCI, SciFinder(PubChem, Zinc, NCI, SciFinder
~65M compounds)~65M compounds)
Drug dataDrug data sourcessources (DrugBank, Orange Book, CMC, WDI,(DrugBank, Orange Book, CMC, WDI,
MDDR ~ 250 k drugs)MDDR ~ 250 k drugs)
Molecular dataMolecular data toolkitstoolkits (OpenEye, Open Babel)(OpenEye, Open Babel)
Computational MethodsComputational Methods (MM, MD, QMMM)(MM, MD, QMMM)
Molecule file formatsMolecule file formats (PDB, Smilies )(PDB, Smilies )
DockingDocking (Autodock, Dock)(Autodock, Dock) ParallelParallel (Dovis)(Dovis)

Pharma-matrix apps:Pharma-matrix apps: eRxeRx
100 million targets (100,000 proteins x 100 pockets x 10 mutants):100 million targets (100,000 proteins x 100 pockets x 10 mutants):
pocketomepocketome
100 billion chemical compounds100 billion chemical compounds
10101919
potential interactions (filtering)potential interactions (filtering)
Hand-in-glove match by brute computational screeningHand-in-glove match by brute computational screening
pharmagooglepharmagoogle

Pocketome generation
(pocket clustering)
104
clusters 104
pockets
in a cluster
Docking
(1012
calculations within blocks)
Docking
(1012
calculations within blocks)

Personalized eDx and eRx
in a few decades a personal genome will cost $10 and
will be our ID at birth included in our eRx app

The Virtual Human:The Virtual Human:
Multi-Scale ModelingMulti-Scale Modeling
lobule
liver
whole body
hepatocyte
Drug molecules Interaction matrix

Jack Tuszynski Accelerating Chemotherapy Drug Discovery with Analytics and High Performance Computing

Recommandé

Recommandé

Contenu connexe

Tendances

Tendances (20)

Similaire à Jack Tuszynski Accelerating Chemotherapy Drug Discovery with Analytics and High Performance Computing

Similaire à Jack Tuszynski Accelerating Chemotherapy Drug Discovery with Analytics and High Performance Computing (20)

Plus de Kim Solez ,

Plus de Kim Solez , (20)

Dernier

Dernier (20)

Jack Tuszynski Accelerating Chemotherapy Drug Discovery with Analytics and High Performance Computing

Notes de l'éditeur