SlideShare a Scribd company logo
1 of 30
CAESAR II:
The Combination of Direct Geometry
Method and CAESAR Algorithm for Super
Fast Conformational Search


Jiabo Li, Ph.D.

ACS Meeting, March 21-25, 2010 , San Francisco
Product Roadmap Disclaimer


• This presentation and/or any related documents contains statements
  regarding our plans or expectations for future features, enhancements
  or functionalities of current or future products (collectively
  "Enhancements"). Our plans or expectations are subject to change at
  any time at our discretion. Accordingly, Accelrys is making no
  representation, undertaking no commitment or legal obligation to
  create, develop or license any product or Enhancements. The
  presentation, documents or any related statements are not intended
  to, nor shall, create any legal obligation upon Accelrys, and shall not be
  relied upon in purchasing any product. Any such obligation shall only
  result from a written agreement executed by both parties. In addition,
  information disclosed in this presentation and related documents,
  whether oral or written, is confidential or proprietary information of
  Accelrys. It shall be used only for the purpose of furthering our
  business relationship, and shall not be disclosed to third parties.


© 2009 Accelrys, Inc.                                                      2
Outline


     • Original CAESAR: very efficient torsion search
           –    Recursive partition of a molecule
           –    Recursive build-up of conformations
           –    Fast energy screening for bad clashes
           –    New strategy for eliminating TopSym duplicates


     • CAESAR II: Deal with various geometric constraints
           – New algorithm for 3D structure generation for ring systems
           – Enforce stereo-chemistry (chirality, cis-trans, stereogenic)
           – Ring conformation library and access of ring conformations


     • Validation of CAESAR II: Speed, Quality, Diversity
           – Quality (binding conformations)
           – Diversity: 3D pharmacophore space, distribution of radius of gyration
           – Test on macrocyles



© 2009 Accelrys, Inc.                                                                3
Conformation Sampling is important

    •The Conformationimportant in many
     3D conformation search is Search Problem
     applications
          – How a new compound binds to a protein is usually unknown
          – 3D pharmacophore modeling
          – Docking

    • Efficient search is challenging !
          – Most drug molecule is flexible in their conformations
          – Low energy conformation space is high dimensional and highly
            irregular in shape.
          – Stereo-chemistry and ring closure constraints

    • CAESAR is an efficient search algorithm




© 2009 Accelrys, Inc.                                                      4
CAESAR: Conformation Algorithm base on Energy
Screening And Recursive Buildup




            1. Recursively partition                  2. Recursively assemble




                E       A
                            = E A + E B + E A− B
                            B


           3. Quickly filter out bad clashes       4. New method for eliminating duplicates

© 2009 Accelrys, Inc.                                                                    5
3D Search results

 Database: build with both Catalyst/FAST and CAESAR for a test set of 50,000 molecules




    Table 1. Number of catSearch hits with three 3D pharmacophore queries

                         Catalyst/FAST     CAESAR              Common

       ang-IIHypo        90                100                 73
       Hypo2             247               236                 215
       ang-IIHypoShape   22                21                  13

© 2009 Accelrys, Inc.                                                                    6
Reference of CAESAR I




   J. Li, T. Ehlers, J. Sutter, S. Varma-O’Brien, and J. Kirchmair,
   CAESAR: A New Conformer Generation Algorithm Based on
   Recursive Buildup and Local Rotational Symmetry
   Consideration, J. Chem. Inf. Model. 2007, 47, 1923-1932




© 2009 Accelrys, Inc.                                                 7
New Developments of CAESAR


• CAESAR II
   – Direct Geometry Method for conformation generation of
     constraint structures
   – New method for stereo chemistry control (chirality, cis-
     trans etc).
   – New strategy of ring conformation library




© 2009 Accelrys, Inc.                                           8
Traditional method for ring structure
(Distance Geometry)

• – Bound Smoothing (can be very time consuming)
• – Embedding from a distance matrix
• – Optimization of the generated structures




© 2009 Accelrys, Inc.                              9
Direct Geometry Method


• Direct 3D coordinate modification according to geometric
  constraints
• Type of geometric constraints
   – Bond length
   – Bond angle (co-linear, 180 degree)
   – Torsion (co-planar, 180 degree)
   – Stereo chemistry (chirality, cis-trans, stereogenic)
   – VDW clash
• All types of constraints can be converted into distance
  constraints



© 2009 Accelrys, Inc.                                       10
Bond Length

• Bond Length correction




      Bond length between C16 and N15 are too long.
      Correction: Move the two atoms to each other.




© 2009 Accelrys, Inc.                                 11
Bond Angle


• Bond angle correction




                Bond anlge C12-C13-C14 is too small.
                Correction: Increase the distance between C12 and C14.

© 2009 Accelrys, Inc.                                                    12
Linear bonds


• Linear bond correction




                Carbon C2 is off line.
                Correction: Move C2 to its correct position.

© 2009 Accelrys, Inc.                                          13
VDW Clash


• Remove VDW clash




                Two atoms H3 and H6 are too close..
                Correction: Move H3 and H6 from each other.

© 2009 Accelrys, Inc.                                         14
Other types of geometric constraints
can also converted into distance
constraints
• For instance, simple distance constraints does help for
  chiral centers. We can use stereo templates with correct
  chirality to guide each atom’s move to achieve the
  correct geometries. SOS by Zhu and Agrafiotis also had
  similar idea.
• If the chirality is unknown, no additional constraints are
  needed




© 2009 Accelrys, Inc.                                          15
Put the simple ideas into practice


• Not a single correction can satisfy all the
  constraints.
• Correction needs to be done iteratively
• Control of convergence is important




© 2009 Accelrys, Inc.                           16
Test 1: Diamond structures from random
starting coordinates




© 2009 Accelrys, Inc.                    17
Iterative correction towards correct
     structure




© 2009 Accelrys, Inc.                       18
Test 2: C-60




© 2009 Accelrys, Inc.   19
Timing:Direct Geometry Method VS.
     Distance Geometry Method




          Table 2: CPU time (second) for generating 3D structures

        Molecule          Direct Method    DG Method        Ratio
        C60               0.12             1                8
        Diamond           0.04             43               1000




© 2009 Accelrys, Inc.                                               20
Test 3: All-trans cyclic peptide bonds




© 2009 Accelrys, Inc.                    21
Test 4: Conformation sampling of macrocycles (Pascal
Bonnet data set). Diversity by fingerprints
               Table 3: Number of pharmacophore fingerprints of conformation models of macrocycles
             MOL                              CASERII                                           OMEGA
                                         Number of 3  Number of 4                        Number of 3          Number of 4
                         Num. of                                             Num. of
                                         points       points                             points               points
                         Conf.                                               Conf.
                                         Fingerprints Fingerprints                       Fingerprints         Fingerprints
            P1                      45            10567            251585            0                   0                 0
            P2                      71             8375            236449           76                8948            448917
            P3                       8              714              4468          157                3550             37078
            P4                     206             6104            165997          200                6440            109051
            P5                      11             3735             60097           22                4142             83547
            P6                      19            10481            221494           49                7326            322580
            CD6                     25             1207             44184            6                1062             29948
            G6                       6              250               498            9                 131               342
            G8                       6              648              2476           13                 359              1828
            G10                      6             1374              7503           11                 760              5922
            G12                      6             2354             20640            5                 969              8202
            G14                      6             3735             41996            2                 817              6458
            G16                      6             4784            101998            2                1578             14654
            G18                      6             7345            165467            3                1154             23631
            G20                      6             8626            245656            2                1305             26909
            SUM-1                  433            70299            1570508         557                38541          1119067
            SUM-2                  388            59732            1318923         557                38541          1119067



                 Notes: (1) Bin Size 1.5A. All other setting are default in DS 2.1.
                        (2) D8-D14 molecules failed in fingerprint generations, thus were excluded.
                        (3) SUM-1: summation of all 15 molecules. SUM-2: P1 excluded.
 © 2009 Accelrys, Inc.                                                                                                         22
Test 4: Conformation sampling of macrocycles
 (Pascal Bonnet data set). Radius of gyration




            Figure 1. Distribution of radius gyration of conformations generated
            by OMEGA and CAESAR II

© 2009 Accelrys, Inc.                                                              23
Distribution of sum of atom-atom distances




            Figure 2. Distribution of atom-atom distance summation of
            conformations generated by OMEGA and CAESAR II

© 2009 Accelrys, Inc.                                                   24
Test 5. Find bioactive conformation with
CAESAR I test dataset


            Table 4. RMSD of the best fitting conformation to the bioactive conformations

              PDB Ligands                 CASERII                      OMEGA
           (CAESAR I test data)         (maxconf=400)                (maxconf=400)
                Average                     0.96                         0.93
             RMSD(angstrom)
              CPU time (s)                   226                           4385
              RMSD < 0.5                    26.6%                         22.9%
              RMSD < 1.0                    61.0%                         61.9%
               RMSD <1.5                    82.1%                         87.6%
               RMSD <2.0                    92.4%                         94.4%

            *Machine: Intel(R) Xeon(R) CPU E7420 @ 2.13GHz




© 2009 Accelrys, Inc.                                                                       25
Push efficiency to the new limit: reuse ring
 conformations by creating a library

• Scan 6M compounds=> 100,000 ring/rigid structures
• Generation conformations for all rings in the library using the BEST
  method
• Build index for the library
• ~100MB file size




© 2009 Accelrys, Inc.                                                    26
Retrieve ring conformations from library
efficiently

• Load the index file
• Read in ring conformation from file if it is not cached in the
  memory, else just use the ring conformation from memory
• If the ring is not in the library, generate ring conformations on
  the fly using the direct method, and save them in the memory
  for reuse




© 2009 Accelrys, Inc.                                                 27
Speed Test of on-the-fly conformation
generation using CAESAR II

• Test condition
      – Data set: CAP2008 database, ~6million compounds
      – Max Conformations/compound = 100
      – Ring conformation library is pre-generated using Catalyst/BEST
        method
      – Quad-core CPU, 2.2 GHz, parallel computing
      – Without saving conformations in SD file (I/O bottleneck)

• Speed
      – It takes 1.5 hours for generating conformations for all 6 million
        compounds, or 250 compounds/second/processor




© 2009 Accelrys, Inc.                                                       28
Summary



      • There are two new technologies in CAESAR II which make the new
        algorithm much more robust and efficient than the original CAESAR


            – Direct method for 3D structure generation of ring and rigid structures
            – New ring/rigid structure library and retrieving method


      • The ring conformation generation using Direct Geometry Method is
        highly efficient and robust


      • The conformer model of ring molecules has good coverage of 3D
        phamacophore space.




© 2009 Accelrys, Inc.                                                                  29
Acknowledgment




      • Jon Sutter
      • Honglin Li
      • Fang Bai
      • David Zhang
      • Paul Flook
      • Frank Brown




© 2009 Accelrys, Inc.   30

More Related Content

Viewers also liked

Understanding Stress Analysis Topics
Understanding Stress Analysis TopicsUnderstanding Stress Analysis Topics
Understanding Stress Analysis Topics
Zachary Swartz
 
Pipe hangers
Pipe hangersPipe hangers
Pipe hangers
Wanechan
 
H 154 e-design instruct_piping design
H 154 e-design instruct_piping designH 154 e-design instruct_piping design
H 154 e-design instruct_piping design
Srini Vasan
 
2805_Worland-2805 - The Interoperability Advantage
2805_Worland-2805 - The Interoperability Advantage2805_Worland-2805 - The Interoperability Advantage
2805_Worland-2805 - The Interoperability Advantage
Matt Worland
 
Engineering & piping design
Engineering & piping designEngineering & piping design
Engineering & piping design
jisothomas
 

Viewers also liked (18)

manual stress analysis
manual stress analysismanual stress analysis
manual stress analysis
 
Design, Application & Installation of Pipe Clamps Webinar
Design, Application & Installation of Pipe Clamps WebinarDesign, Application & Installation of Pipe Clamps Webinar
Design, Application & Installation of Pipe Clamps Webinar
 
Tutorial caesars II
Tutorial caesars IITutorial caesars II
Tutorial caesars II
 
Piping design and flexibility analysis
Piping design and flexibility analysisPiping design and flexibility analysis
Piping design and flexibility analysis
 
Understanding Stress Analysis Topics
Understanding Stress Analysis TopicsUnderstanding Stress Analysis Topics
Understanding Stress Analysis Topics
 
Stress analysis method
Stress analysis methodStress analysis method
Stress analysis method
 
Pipe hangers
Pipe hangersPipe hangers
Pipe hangers
 
Caeser II manual
Caeser II manualCaeser II manual
Caeser II manual
 
H 154 e-design instruct_piping design
H 154 e-design instruct_piping designH 154 e-design instruct_piping design
H 154 e-design instruct_piping design
 
2805_Worland-2805 - The Interoperability Advantage
2805_Worland-2805 - The Interoperability Advantage2805_Worland-2805 - The Interoperability Advantage
2805_Worland-2805 - The Interoperability Advantage
 
136894665 rps-frp-piping-stress-analysis
136894665 rps-frp-piping-stress-analysis136894665 rps-frp-piping-stress-analysis
136894665 rps-frp-piping-stress-analysis
 
Piping handbook
Piping handbookPiping handbook
Piping handbook
 
Stress Analysis Using Caesar II
Stress Analysis Using Caesar IIStress Analysis Using Caesar II
Stress Analysis Using Caesar II
 
Engineering & piping design
Engineering & piping designEngineering & piping design
Engineering & piping design
 
Pipeline stress analysis with ceaser ii
Pipeline stress analysis with ceaser iiPipeline stress analysis with ceaser ii
Pipeline stress analysis with ceaser ii
 
Pipe Clamps Webinar
Pipe Clamps WebinarPipe Clamps Webinar
Pipe Clamps Webinar
 
Pipes and bends
Pipes and bendsPipes and bends
Pipes and bends
 
178680732 pipe-hanger-design
178680732 pipe-hanger-design178680732 pipe-hanger-design
178680732 pipe-hanger-design
 

Similar to CAESAR II:The Combination of Direct Geometry Method and CAESAR Algorithm for Super Fast Conformational Search

An Autocorrelation Analysis Approach to Detecting Land Cover Change using Hyp...
An Autocorrelation Analysis Approach to Detecting Land Cover Change using Hyp...An Autocorrelation Analysis Approach to Detecting Land Cover Change using Hyp...
An Autocorrelation Analysis Approach to Detecting Land Cover Change using Hyp...
grssieee
 
CellSeeker-PDRPresentaton-2
CellSeeker-PDRPresentaton-2CellSeeker-PDRPresentaton-2
CellSeeker-PDRPresentaton-2
Edward Chiang
 
Framework for stochastic modeling of dragline energy efficiency
Framework for stochastic modeling of dragline energy efficiencyFramework for stochastic modeling of dragline energy efficiency
Framework for stochastic modeling of dragline energy efficiency
Maryam Abdi Oskouei
 
GENOA and HyperWorks Integration for Advance Composite Product Design and Ana...
GENOA and HyperWorks Integration for Advance Composite Product Design and Ana...GENOA and HyperWorks Integration for Advance Composite Product Design and Ana...
GENOA and HyperWorks Integration for Advance Composite Product Design and Ana...
Altair
 
ARVO ISIE 2012 IMAGING POSTER.DDDK.Final
ARVO ISIE 2012 IMAGING POSTER.DDDK.FinalARVO ISIE 2012 IMAGING POSTER.DDDK.Final
ARVO ISIE 2012 IMAGING POSTER.DDDK.Final
Darrel Krimmel
 
Wavelet bootstrap Multiple linear regression models
Wavelet bootstrap Multiple linear regression modelsWavelet bootstrap Multiple linear regression models
Wavelet bootstrap Multiple linear regression models
Vinit Sehgal
 
M E T R O L O G Y A N D Q U A L I T Y C O N T R O L J N T U M O D E L P A ...
M E T R O L O G Y A N D Q U A L I T Y C O N T R O L  J N T U  M O D E L  P A ...M E T R O L O G Y A N D Q U A L I T Y C O N T R O L  J N T U  M O D E L  P A ...
M E T R O L O G Y A N D Q U A L I T Y C O N T R O L J N T U M O D E L P A ...
guest3f9c6b
 

Similar to CAESAR II:The Combination of Direct Geometry Method and CAESAR Algorithm for Super Fast Conformational Search (20)

ACEMD: High-throughput Molecular Dynamics with NVIDIA Kepler GPUs
ACEMD: High-throughput Molecular Dynamics with NVIDIA Kepler GPUsACEMD: High-throughput Molecular Dynamics with NVIDIA Kepler GPUs
ACEMD: High-throughput Molecular Dynamics with NVIDIA Kepler GPUs
 
RE-FRAC OF SHALE WELLS USING ARTIFICIAL INTELLIGENCE
RE-FRAC OF SHALE WELLS USING ARTIFICIAL INTELLIGENCERE-FRAC OF SHALE WELLS USING ARTIFICIAL INTELLIGENCE
RE-FRAC OF SHALE WELLS USING ARTIFICIAL INTELLIGENCE
 
How can access to experimental X-ray data further our understanding of struct...
How can access to experimental X-ray data further our understanding of struct...How can access to experimental X-ray data further our understanding of struct...
How can access to experimental X-ray data further our understanding of struct...
 
50 years of size exclusion chromatography
50 years of size exclusion chromatography50 years of size exclusion chromatography
50 years of size exclusion chromatography
 
GE4230 Micromirror Project 2
GE4230 Micromirror Project 2GE4230 Micromirror Project 2
GE4230 Micromirror Project 2
 
An Autocorrelation Analysis Approach to Detecting Land Cover Change using Hyp...
An Autocorrelation Analysis Approach to Detecting Land Cover Change using Hyp...An Autocorrelation Analysis Approach to Detecting Land Cover Change using Hyp...
An Autocorrelation Analysis Approach to Detecting Land Cover Change using Hyp...
 
CNVMiner: Pipeline to Mine CNV & Structural Variation in Hierarchical Fashion
CNVMiner: Pipeline to Mine CNV & Structural Variation in Hierarchical FashionCNVMiner: Pipeline to Mine CNV & Structural Variation in Hierarchical Fashion
CNVMiner: Pipeline to Mine CNV & Structural Variation in Hierarchical Fashion
 
CellSeeker-PDRPresentaton-2
CellSeeker-PDRPresentaton-2CellSeeker-PDRPresentaton-2
CellSeeker-PDRPresentaton-2
 
Framework for stochastic modeling of dragline energy efficiency
Framework for stochastic modeling of dragline energy efficiencyFramework for stochastic modeling of dragline energy efficiency
Framework for stochastic modeling of dragline energy efficiency
 
GENOA and HyperWorks Integration for Advance Composite Product Design and Ana...
GENOA and HyperWorks Integration for Advance Composite Product Design and Ana...GENOA and HyperWorks Integration for Advance Composite Product Design and Ana...
GENOA and HyperWorks Integration for Advance Composite Product Design and Ana...
 
New Ideas for Repairing Gearboxes and Generators
New Ideas for Repairing Gearboxes and GeneratorsNew Ideas for Repairing Gearboxes and Generators
New Ideas for Repairing Gearboxes and Generators
 
Mass balancing techniques The Midas approach
Mass balancing techniques The  Midas approachMass balancing techniques The  Midas approach
Mass balancing techniques The Midas approach
 
ARVO ISIE 2012 IMAGING POSTER.DDDK.Final
ARVO ISIE 2012 IMAGING POSTER.DDDK.FinalARVO ISIE 2012 IMAGING POSTER.DDDK.Final
ARVO ISIE 2012 IMAGING POSTER.DDDK.Final
 
Wavelet bootstrap Multiple linear regression models
Wavelet bootstrap Multiple linear regression modelsWavelet bootstrap Multiple linear regression models
Wavelet bootstrap Multiple linear regression models
 
M E T R O L O G Y A N D Q U A L I T Y C O N T R O L J N T U M O D E L P A ...
M E T R O L O G Y A N D Q U A L I T Y C O N T R O L  J N T U  M O D E L  P A ...M E T R O L O G Y A N D Q U A L I T Y C O N T R O L  J N T U  M O D E L  P A ...
M E T R O L O G Y A N D Q U A L I T Y C O N T R O L J N T U M O D E L P A ...
 
Digital Image Correlation
Digital Image Correlation Digital Image Correlation
Digital Image Correlation
 
Composite Lab Report
Composite Lab ReportComposite Lab Report
Composite Lab Report
 
Cad cam in prosthodontics
Cad cam in prosthodonticsCad cam in prosthodontics
Cad cam in prosthodontics
 
Chemoinformatics in Action
Chemoinformatics in ActionChemoinformatics in Action
Chemoinformatics in Action
 
Vessels delineation in retinal 
images using COSFIRE filters
Vessels delineation in retinal 
images using COSFIRE filtersVessels delineation in retinal 
images using COSFIRE filters
Vessels delineation in retinal 
images using COSFIRE filters
 

More from BIOVIA

(ATS6-PLAT03) What's behind Discngine collections
(ATS6-PLAT03) What's behind Discngine collections(ATS6-PLAT03) What's behind Discngine collections
(ATS6-PLAT03) What's behind Discngine collections
BIOVIA
 

More from BIOVIA (20)

ScienceCloud: Collaborative Workflows in Biologics R&D
ScienceCloud: Collaborative Workflows in Biologics R&DScienceCloud: Collaborative Workflows in Biologics R&D
ScienceCloud: Collaborative Workflows in Biologics R&D
 
(ATS6-PLAT03) What's behind Discngine collections
(ATS6-PLAT03) What's behind Discngine collections(ATS6-PLAT03) What's behind Discngine collections
(ATS6-PLAT03) What's behind Discngine collections
 
(ATS6-PLAT09) Deploying Applications on load balanced AEP servers for high av...
(ATS6-PLAT09) Deploying Applications on load balanced AEP servers for high av...(ATS6-PLAT09) Deploying Applications on load balanced AEP servers for high av...
(ATS6-PLAT09) Deploying Applications on load balanced AEP servers for high av...
 
(ATS6-PLAT07) Managing AEP in an enterprise environment
(ATS6-PLAT07) Managing AEP in an enterprise environment(ATS6-PLAT07) Managing AEP in an enterprise environment
(ATS6-PLAT07) Managing AEP in an enterprise environment
 
(ATS6-PLAT06) Maximizing AEP Performance
(ATS6-PLAT06) Maximizing AEP Performance(ATS6-PLAT06) Maximizing AEP Performance
(ATS6-PLAT06) Maximizing AEP Performance
 
(ATS6-PLAT05) Security enhancements in AEP 9
(ATS6-PLAT05) Security enhancements in AEP 9(ATS6-PLAT05) Security enhancements in AEP 9
(ATS6-PLAT05) Security enhancements in AEP 9
 
(ATS6-PLAT04) Query service
(ATS6-PLAT04) Query service (ATS6-PLAT04) Query service
(ATS6-PLAT04) Query service
 
(ATS6-PLAT02) Accelrys Catalog and Protocol Validation
(ATS6-PLAT02) Accelrys Catalog and Protocol Validation(ATS6-PLAT02) Accelrys Catalog and Protocol Validation
(ATS6-PLAT02) Accelrys Catalog and Protocol Validation
 
(ATS6-PLAT01) Chemistry Harmonization: Bringing together the Direct 9 and Pip...
(ATS6-PLAT01) Chemistry Harmonization: Bringing together the Direct 9 and Pip...(ATS6-PLAT01) Chemistry Harmonization: Bringing together the Direct 9 and Pip...
(ATS6-PLAT01) Chemistry Harmonization: Bringing together the Direct 9 and Pip...
 
(ATS6-GS04) Performance Analysis of Accelrys Enterprise Platform 9.0 on IBM’s...
(ATS6-GS04) Performance Analysis of Accelrys Enterprise Platform 9.0 on IBM’s...(ATS6-GS04) Performance Analysis of Accelrys Enterprise Platform 9.0 on IBM’s...
(ATS6-GS04) Performance Analysis of Accelrys Enterprise Platform 9.0 on IBM’s...
 
(ATS6-GS02) Integrating Contur and HEOS
(ATS6-GS02) Integrating Contur and HEOS(ATS6-GS02) Integrating Contur and HEOS
(ATS6-GS02) Integrating Contur and HEOS
 
(ATS6-GS01) Welcome
(ATS6-GS01) Welcome (ATS6-GS01) Welcome
(ATS6-GS01) Welcome
 
(ATS6-DEV09) Deep Dive into REST and SOAP Integration for Protocol Authors
(ATS6-DEV09) Deep Dive into REST and SOAP Integration for Protocol Authors(ATS6-DEV09) Deep Dive into REST and SOAP Integration for Protocol Authors
(ATS6-DEV09) Deep Dive into REST and SOAP Integration for Protocol Authors
 
(ATS6-DEV08) Integrating Contur ELN with other systems using a RESTful API
(ATS6-DEV08) Integrating Contur ELN with other systems using a RESTful API(ATS6-DEV08) Integrating Contur ELN with other systems using a RESTful API
(ATS6-DEV08) Integrating Contur ELN with other systems using a RESTful API
 
(ATS6-DEV07) Building widgets for ELN home page
(ATS6-DEV07) Building widgets for ELN home page(ATS6-DEV07) Building widgets for ELN home page
(ATS6-DEV07) Building widgets for ELN home page
 
(ATS6-DEV06) Using Packages for Protocol, Component, and Application Delivery
(ATS6-DEV06) Using Packages for Protocol, Component, and Application Delivery(ATS6-DEV06) Using Packages for Protocol, Component, and Application Delivery
(ATS6-DEV06) Using Packages for Protocol, Component, and Application Delivery
 
(ATS6-DEV05) Building Interactive Web Applications with the Reporting Collection
(ATS6-DEV05) Building Interactive Web Applications with the Reporting Collection(ATS6-DEV05) Building Interactive Web Applications with the Reporting Collection
(ATS6-DEV05) Building Interactive Web Applications with the Reporting Collection
 
(ATS6-DEV04) Building Web MashUp applications that include Accelrys Applicati...
(ATS6-DEV04) Building Web MashUp applications that include Accelrys Applicati...(ATS6-DEV04) Building Web MashUp applications that include Accelrys Applicati...
(ATS6-DEV04) Building Web MashUp applications that include Accelrys Applicati...
 
(ATS6-DEV03) Building an Enterprise Web Solution with AEP
(ATS6-DEV03) Building an Enterprise Web Solution with AEP(ATS6-DEV03) Building an Enterprise Web Solution with AEP
(ATS6-DEV03) Building an Enterprise Web Solution with AEP
 
(ATS6-DEV02) Web Application Strategies
(ATS6-DEV02) Web Application Strategies(ATS6-DEV02) Web Application Strategies
(ATS6-DEV02) Web Application Strategies
 

Recently uploaded

Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 

Recently uploaded (20)

Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 

CAESAR II:The Combination of Direct Geometry Method and CAESAR Algorithm for Super Fast Conformational Search

  • 1. CAESAR II: The Combination of Direct Geometry Method and CAESAR Algorithm for Super Fast Conformational Search Jiabo Li, Ph.D. ACS Meeting, March 21-25, 2010 , San Francisco
  • 2. Product Roadmap Disclaimer • This presentation and/or any related documents contains statements regarding our plans or expectations for future features, enhancements or functionalities of current or future products (collectively "Enhancements"). Our plans or expectations are subject to change at any time at our discretion. Accordingly, Accelrys is making no representation, undertaking no commitment or legal obligation to create, develop or license any product or Enhancements. The presentation, documents or any related statements are not intended to, nor shall, create any legal obligation upon Accelrys, and shall not be relied upon in purchasing any product. Any such obligation shall only result from a written agreement executed by both parties. In addition, information disclosed in this presentation and related documents, whether oral or written, is confidential or proprietary information of Accelrys. It shall be used only for the purpose of furthering our business relationship, and shall not be disclosed to third parties. © 2009 Accelrys, Inc. 2
  • 3. Outline • Original CAESAR: very efficient torsion search – Recursive partition of a molecule – Recursive build-up of conformations – Fast energy screening for bad clashes – New strategy for eliminating TopSym duplicates • CAESAR II: Deal with various geometric constraints – New algorithm for 3D structure generation for ring systems – Enforce stereo-chemistry (chirality, cis-trans, stereogenic) – Ring conformation library and access of ring conformations • Validation of CAESAR II: Speed, Quality, Diversity – Quality (binding conformations) – Diversity: 3D pharmacophore space, distribution of radius of gyration – Test on macrocyles © 2009 Accelrys, Inc. 3
  • 4. Conformation Sampling is important •The Conformationimportant in many 3D conformation search is Search Problem applications – How a new compound binds to a protein is usually unknown – 3D pharmacophore modeling – Docking • Efficient search is challenging ! – Most drug molecule is flexible in their conformations – Low energy conformation space is high dimensional and highly irregular in shape. – Stereo-chemistry and ring closure constraints • CAESAR is an efficient search algorithm © 2009 Accelrys, Inc. 4
  • 5. CAESAR: Conformation Algorithm base on Energy Screening And Recursive Buildup 1. Recursively partition 2. Recursively assemble E A = E A + E B + E A− B B 3. Quickly filter out bad clashes 4. New method for eliminating duplicates © 2009 Accelrys, Inc. 5
  • 6. 3D Search results Database: build with both Catalyst/FAST and CAESAR for a test set of 50,000 molecules Table 1. Number of catSearch hits with three 3D pharmacophore queries Catalyst/FAST CAESAR Common ang-IIHypo 90 100 73 Hypo2 247 236 215 ang-IIHypoShape 22 21 13 © 2009 Accelrys, Inc. 6
  • 7. Reference of CAESAR I J. Li, T. Ehlers, J. Sutter, S. Varma-O’Brien, and J. Kirchmair, CAESAR: A New Conformer Generation Algorithm Based on Recursive Buildup and Local Rotational Symmetry Consideration, J. Chem. Inf. Model. 2007, 47, 1923-1932 © 2009 Accelrys, Inc. 7
  • 8. New Developments of CAESAR • CAESAR II – Direct Geometry Method for conformation generation of constraint structures – New method for stereo chemistry control (chirality, cis- trans etc). – New strategy of ring conformation library © 2009 Accelrys, Inc. 8
  • 9. Traditional method for ring structure (Distance Geometry) • – Bound Smoothing (can be very time consuming) • – Embedding from a distance matrix • – Optimization of the generated structures © 2009 Accelrys, Inc. 9
  • 10. Direct Geometry Method • Direct 3D coordinate modification according to geometric constraints • Type of geometric constraints – Bond length – Bond angle (co-linear, 180 degree) – Torsion (co-planar, 180 degree) – Stereo chemistry (chirality, cis-trans, stereogenic) – VDW clash • All types of constraints can be converted into distance constraints © 2009 Accelrys, Inc. 10
  • 11. Bond Length • Bond Length correction Bond length between C16 and N15 are too long. Correction: Move the two atoms to each other. © 2009 Accelrys, Inc. 11
  • 12. Bond Angle • Bond angle correction Bond anlge C12-C13-C14 is too small. Correction: Increase the distance between C12 and C14. © 2009 Accelrys, Inc. 12
  • 13. Linear bonds • Linear bond correction Carbon C2 is off line. Correction: Move C2 to its correct position. © 2009 Accelrys, Inc. 13
  • 14. VDW Clash • Remove VDW clash Two atoms H3 and H6 are too close.. Correction: Move H3 and H6 from each other. © 2009 Accelrys, Inc. 14
  • 15. Other types of geometric constraints can also converted into distance constraints • For instance, simple distance constraints does help for chiral centers. We can use stereo templates with correct chirality to guide each atom’s move to achieve the correct geometries. SOS by Zhu and Agrafiotis also had similar idea. • If the chirality is unknown, no additional constraints are needed © 2009 Accelrys, Inc. 15
  • 16. Put the simple ideas into practice • Not a single correction can satisfy all the constraints. • Correction needs to be done iteratively • Control of convergence is important © 2009 Accelrys, Inc. 16
  • 17. Test 1: Diamond structures from random starting coordinates © 2009 Accelrys, Inc. 17
  • 18. Iterative correction towards correct structure © 2009 Accelrys, Inc. 18
  • 19. Test 2: C-60 © 2009 Accelrys, Inc. 19
  • 20. Timing:Direct Geometry Method VS. Distance Geometry Method Table 2: CPU time (second) for generating 3D structures Molecule Direct Method DG Method Ratio C60 0.12 1 8 Diamond 0.04 43 1000 © 2009 Accelrys, Inc. 20
  • 21. Test 3: All-trans cyclic peptide bonds © 2009 Accelrys, Inc. 21
  • 22. Test 4: Conformation sampling of macrocycles (Pascal Bonnet data set). Diversity by fingerprints Table 3: Number of pharmacophore fingerprints of conformation models of macrocycles MOL CASERII OMEGA Number of 3 Number of 4 Number of 3 Number of 4 Num. of Num. of points points points points Conf. Conf. Fingerprints Fingerprints Fingerprints Fingerprints P1 45 10567 251585 0 0 0 P2 71 8375 236449 76 8948 448917 P3 8 714 4468 157 3550 37078 P4 206 6104 165997 200 6440 109051 P5 11 3735 60097 22 4142 83547 P6 19 10481 221494 49 7326 322580 CD6 25 1207 44184 6 1062 29948 G6 6 250 498 9 131 342 G8 6 648 2476 13 359 1828 G10 6 1374 7503 11 760 5922 G12 6 2354 20640 5 969 8202 G14 6 3735 41996 2 817 6458 G16 6 4784 101998 2 1578 14654 G18 6 7345 165467 3 1154 23631 G20 6 8626 245656 2 1305 26909 SUM-1 433 70299 1570508 557 38541 1119067 SUM-2 388 59732 1318923 557 38541 1119067 Notes: (1) Bin Size 1.5A. All other setting are default in DS 2.1. (2) D8-D14 molecules failed in fingerprint generations, thus were excluded. (3) SUM-1: summation of all 15 molecules. SUM-2: P1 excluded. © 2009 Accelrys, Inc. 22
  • 23. Test 4: Conformation sampling of macrocycles (Pascal Bonnet data set). Radius of gyration Figure 1. Distribution of radius gyration of conformations generated by OMEGA and CAESAR II © 2009 Accelrys, Inc. 23
  • 24. Distribution of sum of atom-atom distances Figure 2. Distribution of atom-atom distance summation of conformations generated by OMEGA and CAESAR II © 2009 Accelrys, Inc. 24
  • 25. Test 5. Find bioactive conformation with CAESAR I test dataset Table 4. RMSD of the best fitting conformation to the bioactive conformations PDB Ligands CASERII OMEGA (CAESAR I test data) (maxconf=400) (maxconf=400) Average 0.96 0.93 RMSD(angstrom) CPU time (s) 226 4385 RMSD < 0.5 26.6% 22.9% RMSD < 1.0 61.0% 61.9% RMSD <1.5 82.1% 87.6% RMSD <2.0 92.4% 94.4% *Machine: Intel(R) Xeon(R) CPU E7420 @ 2.13GHz © 2009 Accelrys, Inc. 25
  • 26. Push efficiency to the new limit: reuse ring conformations by creating a library • Scan 6M compounds=> 100,000 ring/rigid structures • Generation conformations for all rings in the library using the BEST method • Build index for the library • ~100MB file size © 2009 Accelrys, Inc. 26
  • 27. Retrieve ring conformations from library efficiently • Load the index file • Read in ring conformation from file if it is not cached in the memory, else just use the ring conformation from memory • If the ring is not in the library, generate ring conformations on the fly using the direct method, and save them in the memory for reuse © 2009 Accelrys, Inc. 27
  • 28. Speed Test of on-the-fly conformation generation using CAESAR II • Test condition – Data set: CAP2008 database, ~6million compounds – Max Conformations/compound = 100 – Ring conformation library is pre-generated using Catalyst/BEST method – Quad-core CPU, 2.2 GHz, parallel computing – Without saving conformations in SD file (I/O bottleneck) • Speed – It takes 1.5 hours for generating conformations for all 6 million compounds, or 250 compounds/second/processor © 2009 Accelrys, Inc. 28
  • 29. Summary • There are two new technologies in CAESAR II which make the new algorithm much more robust and efficient than the original CAESAR – Direct method for 3D structure generation of ring and rigid structures – New ring/rigid structure library and retrieving method • The ring conformation generation using Direct Geometry Method is highly efficient and robust • The conformer model of ring molecules has good coverage of 3D phamacophore space. © 2009 Accelrys, Inc. 29
  • 30. Acknowledgment • Jon Sutter • Honglin Li • Fang Bai • David Zhang • Paul Flook • Frank Brown © 2009 Accelrys, Inc. 30