SlideShare une entreprise Scribd logo
1  sur  13
Télécharger pour lire hors ligne
Combinatorial Markush
structures at ChemAxon:
from drawing to analysis
Szabolcs Csepregi



                       Solutions for Cheminformatics
Outline
• Combinatorial and patent Markush structures
• Drawing Combinatorial Markush structures with
  Marvin
• Markush Enumeration
• Markush registration & searching in a database
• What is coming in Marvin / JChem 5.1
• Future plans – towards patents




                                         2
Markush structures
Generic notation for describing many molecules
  (= Markush library) in a compact form.
  Main usage:
   – Combinatorial chemistry: similar steps of synthesis
   – Chemistry-related patents: to claim part of the chemical
     space for a particular purpose.




                                               3
Markush structures
Combinatorial Markush    Patent Markush
• Smaller libraries      • The goal is as wide
• Usually simpler          coverage as possible
  constructs:            • Uses more
   – R-groups              sophisticated methods
   – Link nodes             – Homology variation
   – Atom lists               (Alkyl, Aryl, etc.)
                            – Position variation
• Suitable to describe
                            – Etc.
  simpler patents
                         • Extra conditions to
                           avoid overlap with
                           existing patents
                                                    4
Drawing with Marvin
• Easy R-group drawing & zoom functions




• Atom list, link node, bond list


• Position variation                     New in 5.1

                                     5
Enumeration
• Full enumeration
• Selected parts only
• Random enumeration
• Calculate library size
  exact size of huge
  Markush libraries
   – arbitrary precision or
   – magnitude




                                  6
Enumeration
• Coloring: scaffold and each R-group parts get different colors
                                                                   New in 5.1
• Alignment: as original scaffold




                                                    7
Markush database tables
• Available in JChem Base and Instant JChem
• Search in the Markush library space of
  combinatorial Markush structures
   – No enumeration involved – can handle very complex
     Markush structures (tested up to 1040, but no explicit limits
     were built in.)
   – Search types:
       •   Substructure
       •   Exact structure (contained in the Markush library)
       •   Exact fragment
       •   Perfect (same Markush structure)
   – Stereochemistry, query atoms, bonds, query properties:
       • Aromatic/aliphatic atom, ring atom and bond, chain bond,
         number of bonds handled
                                                       8
Markush database tables
Markush structure reduction to display substructure hits:
   All matchings of the query to the Markush structure are reduced
      into less generic structures. (Generic parts overlapping the hit
      are expanded.)
   Example
     Markush structure    Reduced Markush structures
      with hit coloring




   + Query




                                                       9
Integration in Instant JChem 2.3
                                                             New in 2.3
• Markush tables available: create, import, insert, search
• Show / hide R-groups for Markush table views
• Markush enumeration / hit reduction dialog




                                                 10
What is coming in 5.1
                                                         New in 5.1
• Drawing
   – Easier sketching of position
     variation (from 5.0.2)


• Markush features
   – Position variation
• Enumeration:
   – R-group coloring
   – Scaffold alignment
• Markush search:
   – Abbreviations (superatom s-groups) can be included in
     Markush structures (from 5.0.2)
   – Position variation in both query and database
                                             11
Longer term plans
Further developments towards patents
• Homology variation (Alkyl, Aryl, Protecting group, etc.)
    – properties (# of atoms, branching points, # of heteroatoms, etc.)
• Multiple graphical attachment points for R-groups


• Larger repeating groups



• Bridged definition of multiple R-atoms
                  R1, R2= H, CH3, NO2 or together form a ring


                                                          12
Thank you for your attention!

For more information please visit
      www.chemaxon.com



                                    13

Contenu connexe

Plus de ChemAxon

Plus de ChemAxon (20)

Cheminfo Stories 2021 | Virtual UGM | Marvin Pro: The first release
Cheminfo Stories 2021 | Virtual UGM | Marvin Pro: The first releaseCheminfo Stories 2021 | Virtual UGM | Marvin Pro: The first release
Cheminfo Stories 2021 | Virtual UGM | Marvin Pro: The first release
 
Enhanced stereochemistry representation
Enhanced stereochemistry representation Enhanced stereochemistry representation
Enhanced stereochemistry representation
 
Intellectual property (IP) intelligence solutions designed for the way resear...
Intellectual property (IP) intelligence solutions designed for the way resear...Intellectual property (IP) intelligence solutions designed for the way resear...
Intellectual property (IP) intelligence solutions designed for the way resear...
 
GPS for Chemical Space - Digital Assistants to Support Molecule Design - Chem...
GPS for Chemical Space - Digital Assistants to Support Molecule Design - Chem...GPS for Chemical Space - Digital Assistants to Support Molecule Design - Chem...
GPS for Chemical Space - Digital Assistants to Support Molecule Design - Chem...
 
Patent Data for Artificial Intelligence based Drug Discovery
Patent Data for Artificial Intelligence based Drug DiscoveryPatent Data for Artificial Intelligence based Drug Discovery
Patent Data for Artificial Intelligence based Drug Discovery
 
Cheminfo Stories APAC 2020 - Chemical Descriptors & Standardizers for Machine...
Cheminfo Stories APAC 2020 - Chemical Descriptors & Standardizers for Machine...Cheminfo Stories APAC 2020 - Chemical Descriptors & Standardizers for Machine...
Cheminfo Stories APAC 2020 - Chemical Descriptors & Standardizers for Machine...
 
Research data management on the cloud
Research data management on the cloudResearch data management on the cloud
Research data management on the cloud
 
Cheminfo Stories APAC 2020 - Introducing Design Hub & Compound Registration
Cheminfo Stories APAC 2020 - Introducing Design Hub & Compound RegistrationCheminfo Stories APAC 2020 - Introducing Design Hub & Compound Registration
Cheminfo Stories APAC 2020 - Introducing Design Hub & Compound Registration
 
Cheminfo Stories APAC 2020 - JChem Engines introduction
Cheminfo Stories APAC 2020 - JChem Engines introduction Cheminfo Stories APAC 2020 - JChem Engines introduction
Cheminfo Stories APAC 2020 - JChem Engines introduction
 
Cheminfo Stories APAC 2020 - Database management on desktop with JChem for Of...
Cheminfo Stories APAC 2020 - Database management on desktop with JChem for Of...Cheminfo Stories APAC 2020 - Database management on desktop with JChem for Of...
Cheminfo Stories APAC 2020 - Database management on desktop with JChem for Of...
 
Cheminfo Stories APAC 2020 -- Markush technology
Cheminfo Stories APAC 2020 -- Markush technology Cheminfo Stories APAC 2020 -- Markush technology
Cheminfo Stories APAC 2020 -- Markush technology
 
JChem Microservices
JChem MicroservicesJChem Microservices
JChem Microservices
 
Migration from joc to jpc or choral
Migration from joc to jpc or choralMigration from joc to jpc or choral
Migration from joc to jpc or choral
 
ChemAxon's Compliance Checker - Cheminfo Stories 2020 Day 5
ChemAxon's Compliance Checker - Cheminfo Stories 2020 Day 5ChemAxon's Compliance Checker - Cheminfo Stories 2020 Day 5
ChemAxon's Compliance Checker - Cheminfo Stories 2020 Day 5
 
Chemicalize Pro - Cheminfo Stories 2020 Day 5
Chemicalize Pro - Cheminfo Stories 2020 Day 5Chemicalize Pro - Cheminfo Stories 2020 Day 5
Chemicalize Pro - Cheminfo Stories 2020 Day 5
 
Pasteur Institute User Story - Cheminfo Stories 2020 Day 5
Pasteur Institute User Story - Cheminfo Stories 2020 Day 5Pasteur Institute User Story - Cheminfo Stories 2020 Day 5
Pasteur Institute User Story - Cheminfo Stories 2020 Day 5
 
ChemAxon ChemLocator - Cheminfo Stories Day 5
ChemAxon ChemLocator - Cheminfo Stories Day 5ChemAxon ChemLocator - Cheminfo Stories Day 5
ChemAxon ChemLocator - Cheminfo Stories Day 5
 
AWS Lambdas are cool - Cheminfo Stories Day 1
AWS Lambdas are cool - Cheminfo Stories Day 1AWS Lambdas are cool - Cheminfo Stories Day 1
AWS Lambdas are cool - Cheminfo Stories Day 1
 
Search Engine Improvements - Cheminfo Stories 2020 Day 1
Search Engine Improvements - Cheminfo Stories 2020 Day 1Search Engine Improvements - Cheminfo Stories 2020 Day 1
Search Engine Improvements - Cheminfo Stories 2020 Day 1
 
An application of ChemAxon's platform for education
An application of ChemAxon's platform for educationAn application of ChemAxon's platform for education
An application of ChemAxon's platform for education
 

Dernier

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 

Dernier (20)

Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUKSpring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Cyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdfCyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdf
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 

Combinatorial Markush structure handling at ChemAxon: US UGM 2008

  • 1. Combinatorial Markush structures at ChemAxon: from drawing to analysis Szabolcs Csepregi Solutions for Cheminformatics
  • 2. Outline • Combinatorial and patent Markush structures • Drawing Combinatorial Markush structures with Marvin • Markush Enumeration • Markush registration & searching in a database • What is coming in Marvin / JChem 5.1 • Future plans – towards patents 2
  • 3. Markush structures Generic notation for describing many molecules (= Markush library) in a compact form. Main usage: – Combinatorial chemistry: similar steps of synthesis – Chemistry-related patents: to claim part of the chemical space for a particular purpose. 3
  • 4. Markush structures Combinatorial Markush Patent Markush • Smaller libraries • The goal is as wide • Usually simpler coverage as possible constructs: • Uses more – R-groups sophisticated methods – Link nodes – Homology variation – Atom lists (Alkyl, Aryl, etc.) – Position variation • Suitable to describe – Etc. simpler patents • Extra conditions to avoid overlap with existing patents 4
  • 5. Drawing with Marvin • Easy R-group drawing & zoom functions • Atom list, link node, bond list • Position variation New in 5.1 5
  • 6. Enumeration • Full enumeration • Selected parts only • Random enumeration • Calculate library size exact size of huge Markush libraries – arbitrary precision or – magnitude 6
  • 7. Enumeration • Coloring: scaffold and each R-group parts get different colors New in 5.1 • Alignment: as original scaffold 7
  • 8. Markush database tables • Available in JChem Base and Instant JChem • Search in the Markush library space of combinatorial Markush structures – No enumeration involved – can handle very complex Markush structures (tested up to 1040, but no explicit limits were built in.) – Search types: • Substructure • Exact structure (contained in the Markush library) • Exact fragment • Perfect (same Markush structure) – Stereochemistry, query atoms, bonds, query properties: • Aromatic/aliphatic atom, ring atom and bond, chain bond, number of bonds handled 8
  • 9. Markush database tables Markush structure reduction to display substructure hits: All matchings of the query to the Markush structure are reduced into less generic structures. (Generic parts overlapping the hit are expanded.) Example Markush structure Reduced Markush structures with hit coloring + Query 9
  • 10. Integration in Instant JChem 2.3 New in 2.3 • Markush tables available: create, import, insert, search • Show / hide R-groups for Markush table views • Markush enumeration / hit reduction dialog 10
  • 11. What is coming in 5.1 New in 5.1 • Drawing – Easier sketching of position variation (from 5.0.2) • Markush features – Position variation • Enumeration: – R-group coloring – Scaffold alignment • Markush search: – Abbreviations (superatom s-groups) can be included in Markush structures (from 5.0.2) – Position variation in both query and database 11
  • 12. Longer term plans Further developments towards patents • Homology variation (Alkyl, Aryl, Protecting group, etc.) – properties (# of atoms, branching points, # of heteroatoms, etc.) • Multiple graphical attachment points for R-groups • Larger repeating groups • Bridged definition of multiple R-atoms R1, R2= H, CH3, NO2 or together form a ring 12
  • 13. Thank you for your attention! For more information please visit www.chemaxon.com 13