SlideShare une entreprise Scribd logo
1  sur  66
FBW
8-12-2015
Wim Van Criekinge
Examen
<html>
<title>Examen Bioinformatica</title>
<center>
<head>
<script>
rnd.today=new Date();
rnd.seed=rnd.today.getTime();
function rnd() {
rnd.seed = (rnd.seed*9301+49297) % 233280;
return rnd.seed/(233280.0);
};
function rand(number) {
return Math.ceil(rnd()*number);
};
</SCRIPT>
</head>
<body bgcolor="#FFFFFF" text="#00FF00" link="#00FF00">
<script language="JavaScript">
document.write('<table>');
document.write('<tr>');
document.write('<td><a href="index.html" ><img border=0 src="' + rand(713) + '.jpg" width="520"
height="360"></a></td>');
rand(98);
document.write('<td><a href="index.html" ><img border=0 src="' + rand(713) + '.jpg" width="520"
height="360"></a></td>');
rand(98);
document.write('<td><a href="index.html" ><img border=0 src="' + rand(713) + '.jpg" width="520"
height="360"></a></td>');
rand(98);
document.write('<td><a href="index.html" ><img border=0 src="' + rand(713) + '.jpg" width="520"
height="360"></a></td>');
rand(98);
Examen
• The keywords can be
– genome structure
– gene-organisation
– known promoter regions
– known critical amino acid residues.
• Combination of functional
modelorganism knowledge
• Structure-function
• Identify similar areas of biology
• Identify orthologous pathways (might
have different endpoints)
Comparative Genomics: The biological Rosetta
Example: Agro
Known “lethal” genes
from worm, drosphila
Sequence Genome
Filter for drugability”,
tractibility & novelty
Example: Extremophiles
Known lipases
Filter for
“workable”lipases
at 90º C
Look for species
with interesting
phenotypes
Clone and produce in large quantities
Washing Powder additives
Sequence Genome
Functional Foods
Convert Highly Energetic Monosaccharides to Dextrane
Drug Discovery: Design new drugs by computer ?
Problem: pipeline cost rise linear, NCE steady
Money: bypassing difficult, work on attrition
Every step requires specific computational tools
• Drugs are generally defined as molecules which
affect biological processes.
• In order to be effective, the molecule must be
present in the body at an adequate concentration
for it to act at the specific site in the body where
it can exert its effect.
• Additionally, the molecule must be safe -- that
is, metabolized and eliminated from the body
without causing injury.
• Assumption: next 50 years still a big market in
small chemical entities which can be
administered orally in form of a pill (in contrast
to antibodies) or gene therapy …
Drug Discovery: What is a drug ?
• Taxol a drug which is an unmodified natural
compound, is the exception
• Most drugs require “work” -> need for target
driven pipeline
• Humane genome is available so all target are
identified
• How to validate (within a given disease area) ?
• target - a molecule (often a protein) that is instrumental
to a disease process (though not necessarily directly
involved), which may be targeted with a potential
therapeutic.
• target identification - identifying a molecule (often a
protein) that is instrumental to a disease process (though
not necessarily directly involved), with the intention of
finding a way to regulate that molecule's activity for
therapeutic purposes.
• target validation - a crucial step in the drug
development process. Following the identification of a
potential disease target, target validation verifies that a
drug that specifically acts on the target can have a
significant therapeutic benefit in the treatment of a given
disease.
Drug Discovery: What is a target ?
Phenotypic Gap
# genes with
known function
Total # genes
Number of genes
1980 1990 2000 2010
Functional Genomics ?
More than running chip experiments !
Proposal to prioritize
hypothetical protein
without annotation, nice
for bioinformatics and
biologist
“Optimal” drug target
Predict side effect
Where is optimal drug target ?
How to correct disease state
Side effects ?
Genome-wide RNAi
RNAI vector
bacteria producing ds RNA for
each of the 20.000 genes
proprietary nematode
responding to RNAi
20.000 responses
20.000 genes insert
library
Normal insulin signaling
Reduced insulin signaling
fat storage LOW
fat storage HIGH
Type-II Diabetes
20,000 bacteria
each containing
selected
C. elegans gene
select genes with desired phenotypes
proprietary C.elegans strains
• sensitized to silencing
• sensitized to relevant pathway
Industrialized knock-downs
Pharma is conservative
Molecular functions of 26 383 human genes
Structural Genomics
Lipinsky for the target ?
Database of all “drugable” human genes
Drug Discovery: Design new drugs by computer ?
screening - the automated examination and
testing of libraries of synthetic and/or organic
compounds and extracts to identify potential drug
leads, based on the compound's binding affinity
for a target molecule.
screening library - a large collection of
compounds with different chemical properties or
shapes, generated either by combinatorial
chemistry or some other process or by collecting
samples with interesting biological properties.
High Throughput Screening: Quick and Dirty…
from 5000 compounds per day
Drug Discovery: Screening definitions
• At the beginning of the 1990s, when the
term "high-throughput screening" was
coined, a department of 20 would
typically be able to screen around 1.5
million samples in a year, each
researcher handling around 75,000
samples. Today, four researchers using
fully automated robotic technology can
screen 50,000 samples a day, or around
2.5 million samples each year.
Drug Discovery: Screening Throughput
Robotic arm
Read-out
Fluorescence /
luminescence
Distribution
96 / 384 wells
Optical Bank
for stability
Drug Discovery: HTS – The Wet Lab
• Available molecules collections from pharma,
chemical and agro industry, also from
academics (Eastern Europe)
• Natural products from fungi, algae, exotic
plants, Chinese and ethnobotanic medicines
• Combinatorial chemistry: it is the generation
of large numbers of diverse chemical
compounds (a library) for use in screening
assays against disease target molecules.
• Computer drug design (from model
substrates or X-ray structure)
Drug Discovery: Chemistry Sources
Drug Discovery
HIT LEAD
• initial screen established
• Compounds screened
• IC50s established
• Structures verified
• Minimum of three independent
chemical series to evaluate
• Positive in silico PK data
Drug Discovery: HIT
• When the structure of the target is unknown,
the activity data can be used to construct a
pharmacophore model for the positioning of
key features like hydrogen-bonding and
hydrophobic groups.
• Such a model can be used as a template to
select the most promising candidates from the
library.
Drug Discovery: Hit/lead computational approaches
• lead compound - a potential drug candidate emerging from a
screening process of a large library of compounds.
• It basically affects specifically a biological process.
Mechanism of activity (reversible/ irreversible, kinetics)
established
• Its is effective at a low concentration: usually nanomolar
activity
• It is not toxic to live cells
• It has been shown to have some in vivo activity
• It is chemically feasible. Specificity of key compound(s) from
each lead series against selected number of receptors/enzymes
• Preliminary PK in vivo (rodent) to establish benchmark for in
vitro SAR
• In vitro PK data good predictor for in vivo activity
• Its is of course New and Original.
Drug Discovery: Lead ?
Christopher A. Lipinski, Franco Lombardo, Beryl W. Dominy, Paul J. Feeney
"Experimental and computational approaches to estimate solubility and
permeability in drug discovery and development settings":
"In the USAN set we found that the sum of Ns and Os in the molecular formula was
greater than 10 in 12% of the compounds. Eleven percent of compounds had
a MWT of over 500. Ten percent of compounds had a CLogP larger than 5 (or
an MLogP larger than 4.15) and in 8% of compounds the sum of OHs and NHs
in the chemical structure was larger than 5. The "rule of 5" states that: poor
absorption or permeation is more likely when:
A. There are less than 5 H-bond donors (expressed as the sum of OHs and
NHs);
B. The MWT is less than 500;
C. The LogP is less than 5 (or MLogP is < 4.15);
D. There are less than 10 H-bond acceptors (expressed as the sum of Ns and
Os).
Compound classes that are substrates for biological transporters are exceptions to
the rule."
Lipinski: « rule of 5 »
• A quick sketch with ChemDraw, conversion to a
3D structure with Chem3D, and processing by
QuikProp, reveals that the problem appears to be
poor cell permeability for this relatively polar
molecule, with predicted PCaco and PMDCK
values near 10 nm/s.
• Free alternative (Chemsketch / PreADME)
(Celebrex)
Methyl in this position makes it a weaker cox-2 inhibitor,
but site of metabolic oxidation and ensures an acceptable clearance
Drug-like-ness
To assist combinatorial chemistry, buy specific compunds
Structural Descriptors: (15 descriptors)
Molecular Formula, Molecular Weight, Formal Charge, The Number of Rotatable Bonds, The Number of Rigid
Bonds, The Number of Rings, The Number of Aromatic Rings, The Number of H Bond Acceptors, The
Number of H Bond Donors, The Number of (+) Charged Groups, The Number of (-) Charged Groups, No.
single, double, triple, aromatic bonds
Topological Descriptors:(350 descriptors)
• Topological descriptors on the adjustancy and distance matrix
• Count descriptors
• Kier & Hall molecular connectivity Indices
• Kier Shape Indices
• Galvez topological charge Indices
• Narumi topological index
• Autocorrelation descriptor of atomic masses, atomic polarizability, Pauling electronegativity and van der
Waals radius
• Information content descriptors
• Electrotopological state index (E-state)
• Atomic-Level-Based AI topological descriptors
Physicochemical Descriptor:(10 descriptors)
AlogP98 (calculated logP), SKlogP (calculated logP), SKlogS in pure water (calculated water solubility), SKlogS in
buffer system (calculated water solubility),SK vap (calculated vapor pressure), SK bp (calculated boiling
point), SK mp (calculated meling point), AMR (calculated molecular refractivity), APOL(calculated
polarizability), Water Solvation Free Energy
Geometrical Descriptor:(9 descriptors)
Topological Polar Surface Area, 2D van der Waals Volume, 2D van der Waals Surface Area, 2D van der Waals
Hydrophobic Surface Area, 2D van der Waals Polar Surface Area, 2D van der Waals H-bond Acceptor Surface
Area, 2D van der Waals H-bond Donor Surface Area, 2D van der Waals (+) Charged Groups Surface Area, 2D
van der Waals (-) Charged Groups Surface Area
• What can you do with these descriptors ?
• Cluster entire chemical library
– Diversity set
– Focused set
Drug Discovery: Hit/lead computational approaches
• Structure is known, virtual screening -> docking
• Many different approaches
– DOCK
– FlexX
– Glide
– GOLD
• Including conformational sampling of the ligand
• Problem:
– host flexibility
– solvatation
• Example: Bissantz et al.
– Hit rate of 10% for single scoring function
– Up to 70% with triple scoring (bagging)
Drug Discovery: Docking
• Given the target site:
• Docking + structure generator
• Specialized approach: growing
substituent on a core
– LUDI
– SPROUT
– BOMB (biochemical and organic model
builder)
– SYNOPSIS
• Problem is the scoring function
which is different for every protein
class
Drug Discovery: De novo design / rational drug design
Drug Discovery: Novel strategies using bio/cheminformatics
- HTS ? Chemical space is big (1041)
- Biased sets/focussed libraries -> bioinformatics !!!
- How ? Use phylogenetics and known structures to define
accesible (conserved) functional implicated residues to
define small molecule pharmacophores (minimal
requirements)
- Desciptor search (cheminformatics) to construct/select
biased compound set
- ensure serendipity by iterative screening of these
predesigned sets
Drug Discovery
Toxigenomics
Metabogenomics
• Preclinical - An early phase of development
including initial safety assessment
Phase I - Evaluation of clinical pharmacology,
usually conducted in volunteers
Phase II - Determination of dose and initial
evaluation of efficacy, conducted in a small
number of patients
Phase III - Large comparative study
(compound versus placebo and/or established
treatment) in patients to establish clinical
benefit and safety
Phase IV - Post marketing study
Drug Discovery: Clinical studies
Drug Discovery & Development: IND filing
Hapmap
Pharmacogenomics
Predictive/preventive – systems biology
Sneak preview
Bioinformatics (re)loaded
Sneak preview
Bioinformatics (re)loaded
• Relational datamodels
– BioSQL (MySQL)
• Data Visualisation
– Interface
• Apache
• PHP
• Large Scale Statistics
– Using R

Contenu connexe

Tendances

Aug2013 NIST highly confident genotype calls for NA12878
Aug2013 NIST highly confident genotype calls for NA12878Aug2013 NIST highly confident genotype calls for NA12878
Aug2013 NIST highly confident genotype calls for NA12878
GenomeInABottle
 
Aug2013 illumina platinum genomes
Aug2013 illumina platinum genomesAug2013 illumina platinum genomes
Aug2013 illumina platinum genomes
GenomeInABottle
 
Genomics & Epigenomics
Genomics & EpigenomicsGenomics & Epigenomics
Genomics & Epigenomics
gumccomm
 

Tendances (20)

Thesis bio bix_2014
Thesis bio bix_2014Thesis bio bix_2014
Thesis bio bix_2014
 
Kishor Presentation
Kishor PresentationKishor Presentation
Kishor Presentation
 
GoTermsAnalysisWithR
GoTermsAnalysisWithRGoTermsAnalysisWithR
GoTermsAnalysisWithR
 
Aug2013 NIST highly confident genotype calls for NA12878
Aug2013 NIST highly confident genotype calls for NA12878Aug2013 NIST highly confident genotype calls for NA12878
Aug2013 NIST highly confident genotype calls for NA12878
 
David
DavidDavid
David
 
Molecular and data visualization in drug discovery
Molecular and data visualization in drug discoveryMolecular and data visualization in drug discovery
Molecular and data visualization in drug discovery
 
SOT short course on computational toxicology
SOT short course on computational toxicology SOT short course on computational toxicology
SOT short course on computational toxicology
 
Genome in a Bottle
Genome in a BottleGenome in a Bottle
Genome in a Bottle
 
Function and Phenotype Prediction through Data and Knowledge Fusion
Function and Phenotype Prediction through Data and Knowledge FusionFunction and Phenotype Prediction through Data and Knowledge Fusion
Function and Phenotype Prediction through Data and Knowledge Fusion
 
Illuminating the Druggable Genome with Knowledge Engineering and Machine Lear...
Illuminating the Druggable Genome with Knowledge Engineering and Machine Lear...Illuminating the Druggable Genome with Knowledge Engineering and Machine Lear...
Illuminating the Druggable Genome with Knowledge Engineering and Machine Lear...
 
MDC Connects: IKKe/TBK1: Case study of approaches to turning fragment hits in...
MDC Connects: IKKe/TBK1: Case study of approaches to turning fragment hits in...MDC Connects: IKKe/TBK1: Case study of approaches to turning fragment hits in...
MDC Connects: IKKe/TBK1: Case study of approaches to turning fragment hits in...
 
The Language of the Gene Ontology
The Language of the Gene OntologyThe Language of the Gene Ontology
The Language of the Gene Ontology
 
Aug2013 illumina platinum genomes
Aug2013 illumina platinum genomesAug2013 illumina platinum genomes
Aug2013 illumina platinum genomes
 
Open pacbiomodelorgpaper j_landolin_20150121
Open pacbiomodelorgpaper j_landolin_20150121Open pacbiomodelorgpaper j_landolin_20150121
Open pacbiomodelorgpaper j_landolin_20150121
 
The Transforming Genetic Medicine Initiative (TGMI)
The Transforming Genetic Medicine Initiative (TGMI)The Transforming Genetic Medicine Initiative (TGMI)
The Transforming Genetic Medicine Initiative (TGMI)
 
Structural Systems Pharmacology
Structural Systems PharmacologyStructural Systems Pharmacology
Structural Systems Pharmacology
 
Genomics & Epigenomics
Genomics & EpigenomicsGenomics & Epigenomics
Genomics & Epigenomics
 
OpenMS: Quantitative proteomics at large scale
OpenMS: Quantitative proteomics at large scaleOpenMS: Quantitative proteomics at large scale
OpenMS: Quantitative proteomics at large scale
 
Introduction to In silico engineering for biologics
Introduction to In silico engineering for biologicsIntroduction to In silico engineering for biologics
Introduction to In silico engineering for biologics
 
Vanderwall cheminformatics Drexel Part 1
Vanderwall cheminformatics Drexel Part 1Vanderwall cheminformatics Drexel Part 1
Vanderwall cheminformatics Drexel Part 1
 

En vedette

Programming for engineers in python
Programming for engineers in pythonProgramming for engineers in python
Programming for engineers in python
Harry Potter
 
How to Size a Condensate Return Unit
How to Size a Condensate Return UnitHow to Size a Condensate Return Unit
How to Size a Condensate Return Unit
Kevin Doyle
 
Ado.net &amp; data persistence frameworks
Ado.net &amp; data persistence frameworksAdo.net &amp; data persistence frameworks
Ado.net &amp; data persistence frameworks
Luis Goldster
 
Data mining and knowledge discovery
Data mining and knowledge discoveryData mining and knowledge discovery
Data mining and knowledge discovery
Luis Goldster
 
Object oriented analysis
Object oriented analysisObject oriented analysis
Object oriented analysis
Mahesh Bhalerao
 

En vedette (19)

Poo java
Poo javaPoo java
Poo java
 
Programming for engineers in python
Programming for engineers in pythonProgramming for engineers in python
Programming for engineers in python
 
2016 02 23_biological_databases_part1
2016 02 23_biological_databases_part12016 02 23_biological_databases_part1
2016 02 23_biological_databases_part1
 
Abstract data types
Abstract data typesAbstract data types
Abstract data types
 
How to Size a Condensate Return Unit
How to Size a Condensate Return UnitHow to Size a Condensate Return Unit
How to Size a Condensate Return Unit
 
Ado.net &amp; data persistence frameworks
Ado.net &amp; data persistence frameworksAdo.net &amp; data persistence frameworks
Ado.net &amp; data persistence frameworks
 
GRAFICA, DOMINIO Y RANGO DE UNA FUNCIÓN
GRAFICA, DOMINIO Y RANGO DE UNA FUNCIÓNGRAFICA, DOMINIO Y RANGO DE UNA FUNCIÓN
GRAFICA, DOMINIO Y RANGO DE UNA FUNCIÓN
 
Memory system
Memory systemMemory system
Memory system
 
X.500 More Than a Global Directory
X.500 More Than a Global DirectoryX.500 More Than a Global Directory
X.500 More Than a Global Directory
 
المادة العلمية محاضرة 2 كيفية كتابة المسح الأدبي
المادة العلمية محاضرة 2 كيفية كتابة المسح الأدبيالمادة العلمية محاضرة 2 كيفية كتابة المسح الأدبي
المادة العلمية محاضرة 2 كيفية كتابة المسح الأدبي
 
DIFERENCIACIÓN
DIFERENCIACIÓN DIFERENCIACIÓN
DIFERENCIACIÓN
 
Data mining and knowledge discovery
Data mining and knowledge discoveryData mining and knowledge discovery
Data mining and knowledge discovery
 
2015 bioinformatics python_introduction_wim_vancriekinge_vfinal
2015 bioinformatics python_introduction_wim_vancriekinge_vfinal2015 bioinformatics python_introduction_wim_vancriekinge_vfinal
2015 bioinformatics python_introduction_wim_vancriekinge_vfinal
 
Ρευστά σε κίνηση
Ρευστά σε κίνησηΡευστά σε κίνηση
Ρευστά σε κίνηση
 
The Ldap Protocol
The Ldap ProtocolThe Ldap Protocol
The Ldap Protocol
 
memory hierarchy
memory hierarchymemory hierarchy
memory hierarchy
 
Computer organization memory hierarchy
Computer organization memory hierarchyComputer organization memory hierarchy
Computer organization memory hierarchy
 
Object oriented analysis
Object oriented analysisObject oriented analysis
Object oriented analysis
 
Object Oriented Analysis and Design
Object Oriented Analysis and DesignObject Oriented Analysis and Design
Object Oriented Analysis and Design
 

Similaire à 2015 bioinformatics bio_cheminformatics_wim_vancriekinge

Session 1 part 2
Session 1 part 2Session 1 part 2
Session 1 part 2
plmiami
 
A_Pope_RQRM_LeadDisc_June_2016
A_Pope_RQRM_LeadDisc_June_2016A_Pope_RQRM_LeadDisc_June_2016
A_Pope_RQRM_LeadDisc_June_2016
Andrew Pope
 
EDSP Prioritization: Collaborative Estrogen Receptor Activity Prediction Proj...
EDSP Prioritization: Collaborative Estrogen Receptor Activity Prediction Proj...EDSP Prioritization: Collaborative Estrogen Receptor Activity Prediction Proj...
EDSP Prioritization: Collaborative Estrogen Receptor Activity Prediction Proj...
Kamel Mansouri
 
Promiscuous patterns and perils in PubChem and the MLSCN
Promiscuous patterns and perils in PubChem and the MLSCNPromiscuous patterns and perils in PubChem and the MLSCN
Promiscuous patterns and perils in PubChem and the MLSCN
Jeremy Yang
 

Similaire à 2015 bioinformatics bio_cheminformatics_wim_vancriekinge (20)

Bioinformatics t9-t10-bio cheminformatics-wimvancriekinge_v2013
Bioinformatics t9-t10-bio cheminformatics-wimvancriekinge_v2013Bioinformatics t9-t10-bio cheminformatics-wimvancriekinge_v2013
Bioinformatics t9-t10-bio cheminformatics-wimvancriekinge_v2013
 
Bioinformatica 15-12-2011-t9-t10-bio cheminformatics
Bioinformatica 15-12-2011-t9-t10-bio cheminformaticsBioinformatica 15-12-2011-t9-t10-bio cheminformatics
Bioinformatica 15-12-2011-t9-t10-bio cheminformatics
 
Session 1 part 2
Session 1 part 2Session 1 part 2
Session 1 part 2
 
Natural products in drug discovery
Natural products in drug discoveryNatural products in drug discovery
Natural products in drug discovery
 
Drug discovery clinical evaluation of new drugs
Drug discovery clinical evaluation of new drugsDrug discovery clinical evaluation of new drugs
Drug discovery clinical evaluation of new drugs
 
Drug discovery clinical evaluation of new drugs
Drug discovery clinical evaluation of new drugsDrug discovery clinical evaluation of new drugs
Drug discovery clinical evaluation of new drugs
 
A_Pope_RQRM_LeadDisc_June_2016
A_Pope_RQRM_LeadDisc_June_2016A_Pope_RQRM_LeadDisc_June_2016
A_Pope_RQRM_LeadDisc_June_2016
 
EDSP Prioritization: Collaborative Estrogen Receptor Activity Prediction Proj...
EDSP Prioritization: Collaborative Estrogen Receptor Activity Prediction Proj...EDSP Prioritization: Collaborative Estrogen Receptor Activity Prediction Proj...
EDSP Prioritization: Collaborative Estrogen Receptor Activity Prediction Proj...
 
HTS
HTSHTS
HTS
 
Drug Discovery subject (clinical research)
Drug Discovery subject (clinical research)Drug Discovery subject (clinical research)
Drug Discovery subject (clinical research)
 
drug discovery & development
drug discovery & developmentdrug discovery & development
drug discovery & development
 
Computer aided drug design
Computer aided drug designComputer aided drug design
Computer aided drug design
 
Promiscuous patterns and perils in PubChem and the MLSCN
Promiscuous patterns and perils in PubChem and the MLSCNPromiscuous patterns and perils in PubChem and the MLSCN
Promiscuous patterns and perils in PubChem and the MLSCN
 
Computational Drug Discovery: Machine Learning for Making Sense of Big Data i...
Computational Drug Discovery: Machine Learning for Making Sense of Big Data i...Computational Drug Discovery: Machine Learning for Making Sense of Big Data i...
Computational Drug Discovery: Machine Learning for Making Sense of Big Data i...
 
Data drivenapproach to medicinalchemistry
Data drivenapproach to medicinalchemistryData drivenapproach to medicinalchemistry
Data drivenapproach to medicinalchemistry
 
MRCT's Centre for Therapeutics Discovery
MRCT's Centre for Therapeutics DiscoveryMRCT's Centre for Therapeutics Discovery
MRCT's Centre for Therapeutics Discovery
 
HIGH THROUGHPUT SCREENING Technology
HIGH THROUGHPUT SCREENING  TechnologyHIGH THROUGHPUT SCREENING  Technology
HIGH THROUGHPUT SCREENING Technology
 
Sparsh bioinfo.ppt
Sparsh bioinfo.pptSparsh bioinfo.ppt
Sparsh bioinfo.ppt
 
High Throughput Screening Technology
High Throughput Screening TechnologyHigh Throughput Screening Technology
High Throughput Screening Technology
 
Drug development approaches
Drug development approaches Drug development approaches
Drug development approaches
 

Plus de Prof. Wim Van Criekinge

Plus de Prof. Wim Van Criekinge (20)

2020 02 11_biological_databases_part1
2020 02 11_biological_databases_part12020 02 11_biological_databases_part1
2020 02 11_biological_databases_part1
 
2019 03 05_biological_databases_part5_v_upload
2019 03 05_biological_databases_part5_v_upload2019 03 05_biological_databases_part5_v_upload
2019 03 05_biological_databases_part5_v_upload
 
2019 03 05_biological_databases_part4_v_upload
2019 03 05_biological_databases_part4_v_upload2019 03 05_biological_databases_part4_v_upload
2019 03 05_biological_databases_part4_v_upload
 
2019 03 05_biological_databases_part3_v_upload
2019 03 05_biological_databases_part3_v_upload2019 03 05_biological_databases_part3_v_upload
2019 03 05_biological_databases_part3_v_upload
 
2019 02 21_biological_databases_part2_v_upload
2019 02 21_biological_databases_part2_v_upload2019 02 21_biological_databases_part2_v_upload
2019 02 21_biological_databases_part2_v_upload
 
2019 02 12_biological_databases_part1_v_upload
2019 02 12_biological_databases_part1_v_upload2019 02 12_biological_databases_part1_v_upload
2019 02 12_biological_databases_part1_v_upload
 
P7 2018 biopython3
P7 2018 biopython3P7 2018 biopython3
P7 2018 biopython3
 
P6 2018 biopython2b
P6 2018 biopython2bP6 2018 biopython2b
P6 2018 biopython2b
 
P4 2018 io_functions
P4 2018 io_functionsP4 2018 io_functions
P4 2018 io_functions
 
P3 2018 python_regexes
P3 2018 python_regexesP3 2018 python_regexes
P3 2018 python_regexes
 
T1 2018 bioinformatics
T1 2018 bioinformaticsT1 2018 bioinformatics
T1 2018 bioinformatics
 
P1 2018 python
P1 2018 pythonP1 2018 python
P1 2018 python
 
Bio ontologies and semantic technologies[2]
Bio ontologies and semantic technologies[2]Bio ontologies and semantic technologies[2]
Bio ontologies and semantic technologies[2]
 
2018 05 08_biological_databases_no_sql
2018 05 08_biological_databases_no_sql2018 05 08_biological_databases_no_sql
2018 05 08_biological_databases_no_sql
 
2018 03 27_biological_databases_part4_v_upload
2018 03 27_biological_databases_part4_v_upload2018 03 27_biological_databases_part4_v_upload
2018 03 27_biological_databases_part4_v_upload
 
2018 03 20_biological_databases_part3
2018 03 20_biological_databases_part32018 03 20_biological_databases_part3
2018 03 20_biological_databases_part3
 
2018 02 20_biological_databases_part2_v_upload
2018 02 20_biological_databases_part2_v_upload2018 02 20_biological_databases_part2_v_upload
2018 02 20_biological_databases_part2_v_upload
 
2018 02 20_biological_databases_part1_v_upload
2018 02 20_biological_databases_part1_v_upload2018 02 20_biological_databases_part1_v_upload
2018 02 20_biological_databases_part1_v_upload
 
P7 2017 biopython3
P7 2017 biopython3P7 2017 biopython3
P7 2017 biopython3
 
P6 2017 biopython2
P6 2017 biopython2P6 2017 biopython2
P6 2017 biopython2
 

Dernier

Spellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please PractiseSpellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please Practise
AnaAcapella
 
Salient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functionsSalient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functions
KarakKing
 

Dernier (20)

HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptxHMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
 
SOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning PresentationSOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning Presentation
 
How to setup Pycharm environment for Odoo 17.pptx
How to setup Pycharm environment for Odoo 17.pptxHow to setup Pycharm environment for Odoo 17.pptx
How to setup Pycharm environment for Odoo 17.pptx
 
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptxHMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdf
 
Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
Understanding Accommodations and Modifications
Understanding  Accommodations and ModificationsUnderstanding  Accommodations and Modifications
Understanding Accommodations and Modifications
 
Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)
 
Graduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - EnglishGraduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - English
 
Spellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please PractiseSpellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please Practise
 
Towards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxTowards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptx
 
Fostering Friendships - Enhancing Social Bonds in the Classroom
Fostering Friendships - Enhancing Social Bonds  in the ClassroomFostering Friendships - Enhancing Social Bonds  in the Classroom
Fostering Friendships - Enhancing Social Bonds in the Classroom
 
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfUGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
 
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan Fellows
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibit
 
Single or Multiple melodic lines structure
Single or Multiple melodic lines structureSingle or Multiple melodic lines structure
Single or Multiple melodic lines structure
 
Salient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functionsSalient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functions
 
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
 

2015 bioinformatics bio_cheminformatics_wim_vancriekinge

  • 1.
  • 3.
  • 4. Examen <html> <title>Examen Bioinformatica</title> <center> <head> <script> rnd.today=new Date(); rnd.seed=rnd.today.getTime(); function rnd() { rnd.seed = (rnd.seed*9301+49297) % 233280; return rnd.seed/(233280.0); }; function rand(number) { return Math.ceil(rnd()*number); }; </SCRIPT> </head> <body bgcolor="#FFFFFF" text="#00FF00" link="#00FF00"> <script language="JavaScript"> document.write('<table>'); document.write('<tr>'); document.write('<td><a href="index.html" ><img border=0 src="' + rand(713) + '.jpg" width="520" height="360"></a></td>'); rand(98); document.write('<td><a href="index.html" ><img border=0 src="' + rand(713) + '.jpg" width="520" height="360"></a></td>'); rand(98); document.write('<td><a href="index.html" ><img border=0 src="' + rand(713) + '.jpg" width="520" height="360"></a></td>'); rand(98); document.write('<td><a href="index.html" ><img border=0 src="' + rand(713) + '.jpg" width="520" height="360"></a></td>'); rand(98);
  • 6. • The keywords can be – genome structure – gene-organisation – known promoter regions – known critical amino acid residues. • Combination of functional modelorganism knowledge • Structure-function • Identify similar areas of biology • Identify orthologous pathways (might have different endpoints) Comparative Genomics: The biological Rosetta
  • 7.
  • 8. Example: Agro Known “lethal” genes from worm, drosphila Sequence Genome Filter for drugability”, tractibility & novelty
  • 9. Example: Extremophiles Known lipases Filter for “workable”lipases at 90º C Look for species with interesting phenotypes Clone and produce in large quantities Washing Powder additives Sequence Genome Functional Foods Convert Highly Energetic Monosaccharides to Dextrane
  • 10.
  • 11. Drug Discovery: Design new drugs by computer ? Problem: pipeline cost rise linear, NCE steady Money: bypassing difficult, work on attrition Every step requires specific computational tools
  • 12. • Drugs are generally defined as molecules which affect biological processes. • In order to be effective, the molecule must be present in the body at an adequate concentration for it to act at the specific site in the body where it can exert its effect. • Additionally, the molecule must be safe -- that is, metabolized and eliminated from the body without causing injury. • Assumption: next 50 years still a big market in small chemical entities which can be administered orally in form of a pill (in contrast to antibodies) or gene therapy … Drug Discovery: What is a drug ?
  • 13.
  • 14.
  • 15.
  • 16.
  • 17. • Taxol a drug which is an unmodified natural compound, is the exception • Most drugs require “work” -> need for target driven pipeline • Humane genome is available so all target are identified • How to validate (within a given disease area) ?
  • 18. • target - a molecule (often a protein) that is instrumental to a disease process (though not necessarily directly involved), which may be targeted with a potential therapeutic. • target identification - identifying a molecule (often a protein) that is instrumental to a disease process (though not necessarily directly involved), with the intention of finding a way to regulate that molecule's activity for therapeutic purposes. • target validation - a crucial step in the drug development process. Following the identification of a potential disease target, target validation verifies that a drug that specifically acts on the target can have a significant therapeutic benefit in the treatment of a given disease. Drug Discovery: What is a target ?
  • 19. Phenotypic Gap # genes with known function Total # genes Number of genes 1980 1990 2000 2010 Functional Genomics ? More than running chip experiments ! Proposal to prioritize hypothetical protein without annotation, nice for bioinformatics and biologist
  • 20.
  • 21. “Optimal” drug target Predict side effect Where is optimal drug target ? How to correct disease state Side effects ?
  • 22.
  • 23.
  • 24.
  • 25.
  • 26.
  • 27.
  • 28. Genome-wide RNAi RNAI vector bacteria producing ds RNA for each of the 20.000 genes proprietary nematode responding to RNAi 20.000 responses 20.000 genes insert library
  • 29.
  • 30.
  • 31. Normal insulin signaling Reduced insulin signaling fat storage LOW fat storage HIGH Type-II Diabetes
  • 32. 20,000 bacteria each containing selected C. elegans gene select genes with desired phenotypes proprietary C.elegans strains • sensitized to silencing • sensitized to relevant pathway Industrialized knock-downs
  • 34.
  • 35. Molecular functions of 26 383 human genes Structural Genomics
  • 36.
  • 37. Lipinsky for the target ? Database of all “drugable” human genes
  • 38. Drug Discovery: Design new drugs by computer ?
  • 39. screening - the automated examination and testing of libraries of synthetic and/or organic compounds and extracts to identify potential drug leads, based on the compound's binding affinity for a target molecule. screening library - a large collection of compounds with different chemical properties or shapes, generated either by combinatorial chemistry or some other process or by collecting samples with interesting biological properties. High Throughput Screening: Quick and Dirty… from 5000 compounds per day Drug Discovery: Screening definitions
  • 40. • At the beginning of the 1990s, when the term "high-throughput screening" was coined, a department of 20 would typically be able to screen around 1.5 million samples in a year, each researcher handling around 75,000 samples. Today, four researchers using fully automated robotic technology can screen 50,000 samples a day, or around 2.5 million samples each year. Drug Discovery: Screening Throughput
  • 41. Robotic arm Read-out Fluorescence / luminescence Distribution 96 / 384 wells Optical Bank for stability Drug Discovery: HTS – The Wet Lab
  • 42. • Available molecules collections from pharma, chemical and agro industry, also from academics (Eastern Europe) • Natural products from fungi, algae, exotic plants, Chinese and ethnobotanic medicines • Combinatorial chemistry: it is the generation of large numbers of diverse chemical compounds (a library) for use in screening assays against disease target molecules. • Computer drug design (from model substrates or X-ray structure) Drug Discovery: Chemistry Sources
  • 44. • initial screen established • Compounds screened • IC50s established • Structures verified • Minimum of three independent chemical series to evaluate • Positive in silico PK data Drug Discovery: HIT
  • 45. • When the structure of the target is unknown, the activity data can be used to construct a pharmacophore model for the positioning of key features like hydrogen-bonding and hydrophobic groups. • Such a model can be used as a template to select the most promising candidates from the library. Drug Discovery: Hit/lead computational approaches
  • 46. • lead compound - a potential drug candidate emerging from a screening process of a large library of compounds. • It basically affects specifically a biological process. Mechanism of activity (reversible/ irreversible, kinetics) established • Its is effective at a low concentration: usually nanomolar activity • It is not toxic to live cells • It has been shown to have some in vivo activity • It is chemically feasible. Specificity of key compound(s) from each lead series against selected number of receptors/enzymes • Preliminary PK in vivo (rodent) to establish benchmark for in vitro SAR • In vitro PK data good predictor for in vivo activity • Its is of course New and Original. Drug Discovery: Lead ?
  • 47. Christopher A. Lipinski, Franco Lombardo, Beryl W. Dominy, Paul J. Feeney "Experimental and computational approaches to estimate solubility and permeability in drug discovery and development settings": "In the USAN set we found that the sum of Ns and Os in the molecular formula was greater than 10 in 12% of the compounds. Eleven percent of compounds had a MWT of over 500. Ten percent of compounds had a CLogP larger than 5 (or an MLogP larger than 4.15) and in 8% of compounds the sum of OHs and NHs in the chemical structure was larger than 5. The "rule of 5" states that: poor absorption or permeation is more likely when: A. There are less than 5 H-bond donors (expressed as the sum of OHs and NHs); B. The MWT is less than 500; C. The LogP is less than 5 (or MLogP is < 4.15); D. There are less than 10 H-bond acceptors (expressed as the sum of Ns and Os). Compound classes that are substrates for biological transporters are exceptions to the rule." Lipinski: « rule of 5 »
  • 48. • A quick sketch with ChemDraw, conversion to a 3D structure with Chem3D, and processing by QuikProp, reveals that the problem appears to be poor cell permeability for this relatively polar molecule, with predicted PCaco and PMDCK values near 10 nm/s. • Free alternative (Chemsketch / PreADME)
  • 49. (Celebrex) Methyl in this position makes it a weaker cox-2 inhibitor, but site of metabolic oxidation and ensures an acceptable clearance Drug-like-ness
  • 50. To assist combinatorial chemistry, buy specific compunds
  • 51.
  • 52. Structural Descriptors: (15 descriptors) Molecular Formula, Molecular Weight, Formal Charge, The Number of Rotatable Bonds, The Number of Rigid Bonds, The Number of Rings, The Number of Aromatic Rings, The Number of H Bond Acceptors, The Number of H Bond Donors, The Number of (+) Charged Groups, The Number of (-) Charged Groups, No. single, double, triple, aromatic bonds Topological Descriptors:(350 descriptors) • Topological descriptors on the adjustancy and distance matrix • Count descriptors • Kier & Hall molecular connectivity Indices • Kier Shape Indices • Galvez topological charge Indices • Narumi topological index • Autocorrelation descriptor of atomic masses, atomic polarizability, Pauling electronegativity and van der Waals radius • Information content descriptors • Electrotopological state index (E-state) • Atomic-Level-Based AI topological descriptors Physicochemical Descriptor:(10 descriptors) AlogP98 (calculated logP), SKlogP (calculated logP), SKlogS in pure water (calculated water solubility), SKlogS in buffer system (calculated water solubility),SK vap (calculated vapor pressure), SK bp (calculated boiling point), SK mp (calculated meling point), AMR (calculated molecular refractivity), APOL(calculated polarizability), Water Solvation Free Energy Geometrical Descriptor:(9 descriptors) Topological Polar Surface Area, 2D van der Waals Volume, 2D van der Waals Surface Area, 2D van der Waals Hydrophobic Surface Area, 2D van der Waals Polar Surface Area, 2D van der Waals H-bond Acceptor Surface Area, 2D van der Waals H-bond Donor Surface Area, 2D van der Waals (+) Charged Groups Surface Area, 2D van der Waals (-) Charged Groups Surface Area
  • 53. • What can you do with these descriptors ? • Cluster entire chemical library – Diversity set – Focused set Drug Discovery: Hit/lead computational approaches
  • 54. • Structure is known, virtual screening -> docking • Many different approaches – DOCK – FlexX – Glide – GOLD • Including conformational sampling of the ligand • Problem: – host flexibility – solvatation • Example: Bissantz et al. – Hit rate of 10% for single scoring function – Up to 70% with triple scoring (bagging) Drug Discovery: Docking
  • 55. • Given the target site: • Docking + structure generator • Specialized approach: growing substituent on a core – LUDI – SPROUT – BOMB (biochemical and organic model builder) – SYNOPSIS • Problem is the scoring function which is different for every protein class Drug Discovery: De novo design / rational drug design
  • 56. Drug Discovery: Novel strategies using bio/cheminformatics - HTS ? Chemical space is big (1041) - Biased sets/focussed libraries -> bioinformatics !!! - How ? Use phylogenetics and known structures to define accesible (conserved) functional implicated residues to define small molecule pharmacophores (minimal requirements) - Desciptor search (cheminformatics) to construct/select biased compound set - ensure serendipity by iterative screening of these predesigned sets
  • 58.
  • 59. • Preclinical - An early phase of development including initial safety assessment Phase I - Evaluation of clinical pharmacology, usually conducted in volunteers Phase II - Determination of dose and initial evaluation of efficacy, conducted in a small number of patients Phase III - Large comparative study (compound versus placebo and/or established treatment) in patients to establish clinical benefit and safety Phase IV - Post marketing study Drug Discovery: Clinical studies
  • 60.
  • 61.
  • 62. Drug Discovery & Development: IND filing
  • 66. Sneak preview Bioinformatics (re)loaded • Relational datamodels – BioSQL (MySQL) • Data Visualisation – Interface • Apache • PHP • Large Scale Statistics – Using R