SlideShare une entreprise Scribd logo
1  sur  24
Télécharger pour lire hors ligne
INCOT.NET
René Deplanque | International Chemical Ontology Network
@The International Information Conference on Search, Data Mining and Visualization.
y
SUMMERY
• The final goal of this project is to build a system that is
available to all member of the network.
• It will be developed to make Big Data collections from various
fields of chemistry / pharmacology manageable using parent
ontologies.
• Thus the development of an innovative industry 4.0 approach
will be simplified and accelerated.
PAINPOINTS OF TODAY'S DATA COLLECTIONS:
Access Issues => Problems with finding and/or getting access to
data
Audience Issues => who is looking at data, how they perceive it,
perspectives, language of discipline
Chemical Structure Representation Issues
=>
what areas are problems - inorganic,
organometallic, large molecules, mixtures, chiral
centers
Community Issues => policies, procedures, and best practices we need to
adopt to move things forwards
Data Issues => standardization/interoperability, metadata, gaps,
scale, and sharing, dark data
Ontology/Vocabulary Issues => consensus on terms, maintenance, versions,
optimal vocabularies, areas where needed
Tools to Help Data/Metadata Capture
Issues =>
adding metadata, feedback, consistency,
synchronization
InternetofThings
AI
3D-Printing
VirtualReality
CloudComputing
SocialMedia
Mobility
Analytics
Security
Energy / Utilities
Consumer goods
Entertainment / media
Administration
Insurance
IT-technologies
Pharmaceuticals
Productions Industries
Trade
Telecommunication
Banks
Important
Unchanged
Unimportant
Technologies Trends of the coming 3-5 YearsBasis 3700 Manager worldwide
Source: Krallinger, M. et al. (2005) Text-mining approaches in molecular biology and biomedicine. DDT 10(6) 440
Ontology Defined
Google Definitions on the web
• An ontology is a controlled vocabulary that describes objects and the
relations between them in a formal way, and has a grammar for using the
vocabulary terms to express something meaningful within a specified domain
of interest. Source: members.optusnet.com.au/~webindexing/Webbook2Ed/glossary.htm
• Ontology is the newest label attached to some KOSs. Ontologies are being
developed as specific concept models by the Knowledge Management
community. They can represent complex relationships between objects, and
include the rules and axioms missing from semantic networks. Ontologies
that describe knowledge in a specific area are often connected with systems
for data mining and knowledge management.
Source: www.und.nodak.edu/dept/library/Departments/abc/SACSEM-SemInGlossary.htm
CREATING A COMPUTABLE CHEMICAL TAXONOMY
REQUIRES THREE KEY COMPONENTS:
A well-defined hierarchical taxonomic structure;
A dictionary of chemical classes (with full definitions
and category mappings); and
Computable rules or algorithms for assigning chemicals
to taxonomic categories.
Semantic Web
The Semantic Web "layer cake" as presented by Tim Berners-Lee.
Source: Hendler, J. (2001) Agents and the semantic web. http://www.cs.umd.edu/users/hendler/AgentWeb.html
KNOWN CLASSIFICATION SYSTEMS OF
CHEMICAL SUBSTANCES
 Classification as defined by EU regulations
 Regulation (EC) No 1272/2008 on classification, labelling and packaging of
substances and mixtures (the 'CLP Regulation').
 Classification as defined by UBA (Germany’s environmental protection
agency)
 These criteria and limiting values help to determine hazardous physical-chemical
properties as well as health and environmental hazards
 The Anatomical Therapeutic Chemical Classification System (ATC/DDD of
the world health organisation WHO)
 The purpose of the ATC/DDD system is to serve as a tool for drug utilization research
in order to improve quality of drug use.
 GUIDANCE ON THE CLASSIFICATION OF HAZARDOUS CHEMICALS UNDER
THE WHS REGULATIONS
 This Guidance is intended for manufacturers and importers of
substances, mixtures and articles who have a duty under the World
Health and Safety (WHS) Act and Regulations to classify them
 the Globally Harmonised System of Classification and Labelling of
Chemicals (the GHS).
 The WHS Regulations also implement the harmonised hazard
communication elements of the GHS that are to appear on labels and
safety data sheets (SDS)
 The Chemical Fragmentation Coding system
 It was developed in 1963 by the Derwent World Patent Index (DWPI) to
facilitate the manual classification of chemical compounds reported in
patents.
 The system consists of 2200 numerical codes corresponding to a set of
pre-defined, chemically significant structure fragments
Tools for developing Chemical Ontologies
 HOSE (Hierarchical Organisation of Spherical Environments) code.
 This hierarchical substructure system, allows one to automatically
characterize atoms and complete rings in terms of their spherical
environment
 Gene Ontology (GO) system,
 was one of the first open-source, automated functional group ontologies
to be formalized.
 CO functional groups can be automatically assigned to a given structure
by Checkmol a freely available program. CO’s assignment of functional
groups is accurate and consistent, and it has been applied to several
small datasets. However,
 the CO system is limited to just ~200 chemical groups
 SODIAC tool for automatic compound classification.
 It uses a comprehensive chemical ontology and an elegant structure-
based reasoning logic.
 The underlying chemical ontology can be freely downloaded and the
SODIAC software, which is closed-source, is free for academics
WHAT ARE THE MAJOR PROBLEMS
➢ In contrast to biology, geology, and many other scientific
disciplines, the world of chemistry still lacks a standardized
chemical ontology or taxonomy
➢ The chemical classification of a compound could help predict its
metabolic fate in humans, its drug ability or potential hazards
associated with it.
➢ The sheer number (tens of millions of compounds) and complexity
of chemical structures is such that any manual classification effort
would prove to be near impossible
two-ring heterocyclic compounds
isoquinolines
isoquinoline alkaloids
morphinans
morphine
grouped_by_chemistry
FRAGMENT OF CHEMICAL ONTOLOGY
molecules
organic molecules
heterocyclic compounds
bridged-ring heterocyclic compounds
morphinans
morphine
IsA
O
N
OH
OH
CH3
H
NH
H
morphine
morphinan
IsA
Source: Ennis, M. (2004) ChEBI A Dictionary of Chemical Entities with an Associated Ontology.
SOFG-2, Philadelphia, October 23-26 2004
CH3
O
NH2
H
O
OHCH2
OH
NH2
H
O
O
-
CH3
O
NH2
H
O
O
-
CH2
OH
NH2
H
O
OH
CH3
O
H NH2
O
OH
CH2
OH
H NH2
O
O
- CH2
OH
H NH2
O
OH CH3
O
H NH2
O
O
-
L-Amino acid
D-Amino acid
Amino acid
CO2H
OH =O
NH2
CO2
¯
is_a is_part_of
is_enantiomer_of is_conjugate_base_of
is_tautomer_of
Source: Ennis, M. (2004) ChEBI A Dictionary of Chemical
Entities with an Associated Ontology. SOFG-2,
Philadelphia, October 23-26 2004
AUTOMATED CHEMICAL
CLASSIFICATION SYSTEM
However, as in PubChem, the annotation is incomplete. Class assignments to “clavams” and
“azetidines”, among others, are missing
ONTOLOGIE ENGINEERING
HOW TO WORK WITH ONTOLOGIES
Michael Büttner Ontology Learning
THE ONTOLOGY DEVELOPMENT PROCESS
Michael Büttner Ontology Learning
WHAT DO WE HAVE - WHAT DO WE NEED
➢Chemists have a standardized nomenclature (IUPAC, CAS,
REAXIS)
➢Chemists have standardized methods for drawing or
exchanging chemical structures
➢Chemistry still lacks a standardized, comprehensive, and
clearly defined chemical taxonomy or chemical ontology
WHAT WAS DONE
➢ Chemist have developed domain specific ontologies
➢ Medical Chemist classify according to pharmaceutical
activities (antibacterial antihypertensive)
➢ Biochemist classify according biosynthetic origin
(nucleic acids, terpenoids)
➢ They do not fit
➢ In the PubChem database only 0.12% of the >91,000,000 compounds (as
of June 2016) are classified via the MeSH thesaurus
WHO AND WHAT IS INCOT.NET
 The Problem of defining overlapping Ontologies is of such
a magnitude that it can not be solved by a single
Organisation.
 INCOT.NET is an organisation based on an idea, need and
interest of major Chemical Companies.
 It is organized as independent Partnership
 It is attempting to coordinate a large variety of
Organisations to solve major pre-production problems.
 One of the prototype problems will be: The use of
Ontologies in the development of new methodologies for
the development of new Antibiotics.
Thank you for your patience
you will need it for your future

Contenu connexe

En vedette

II-SDV 2017: Will Virtual Reality (VR) be changing the way we deal with infor...
II-SDV 2017: Will Virtual Reality (VR) be changing the way we deal with infor...II-SDV 2017: Will Virtual Reality (VR) be changing the way we deal with infor...
II-SDV 2017: Will Virtual Reality (VR) be changing the way we deal with infor...Dr. Haxel Consult
 
II-SDV 2017: Datafari - Building an Open Source Enterprise Search Solution fr...
II-SDV 2017: Datafari - Building an Open Source Enterprise Search Solution fr...II-SDV 2017: Datafari - Building an Open Source Enterprise Search Solution fr...
II-SDV 2017: Datafari - Building an Open Source Enterprise Search Solution fr...Dr. Haxel Consult
 
II-SDV 2017: What is Innovation and how can we measure it?
II-SDV 2017: What is Innovation and how can we measure it?II-SDV 2017: What is Innovation and how can we measure it?
II-SDV 2017: What is Innovation and how can we measure it?Dr. Haxel Consult
 
II-SDV 2017: Auto Classification: Can/Should AI replace You?
II-SDV 2017: Auto Classification: Can/Should AI replace You? II-SDV 2017: Auto Classification: Can/Should AI replace You?
II-SDV 2017: Auto Classification: Can/Should AI replace You? Dr. Haxel Consult
 
II-SDV 2017: Decoding the Gray Shades of Patent White Space Analysis
II-SDV 2017: Decoding the Gray Shades of Patent White Space AnalysisII-SDV 2017: Decoding the Gray Shades of Patent White Space Analysis
II-SDV 2017: Decoding the Gray Shades of Patent White Space AnalysisDr. Haxel Consult
 
II-SV 2017: How to effectively monitor Technological Developments in IP
II-SV 2017: How to effectively monitor Technological Developments in IPII-SV 2017: How to effectively monitor Technological Developments in IP
II-SV 2017: How to effectively monitor Technological Developments in IPDr. Haxel Consult
 
II-SDV 2017: From KNIME to HighThroughPut Pipelining - from KNIME to HTPP
II-SDV 2017: From KNIME to HighThroughPut Pipelining - from KNIME to HTPPII-SDV 2017: From KNIME to HighThroughPut Pipelining - from KNIME to HTPP
II-SDV 2017: From KNIME to HighThroughPut Pipelining - from KNIME to HTPPDr. Haxel Consult
 
II-SDV 2017: Approaches of Web Information Analysis in a Day to Day Work Envi...
II-SDV 2017: Approaches of Web Information Analysis in a Day to Day Work Envi...II-SDV 2017: Approaches of Web Information Analysis in a Day to Day Work Envi...
II-SDV 2017: Approaches of Web Information Analysis in a Day to Day Work Envi...Dr. Haxel Consult
 

En vedette (8)

II-SDV 2017: Will Virtual Reality (VR) be changing the way we deal with infor...
II-SDV 2017: Will Virtual Reality (VR) be changing the way we deal with infor...II-SDV 2017: Will Virtual Reality (VR) be changing the way we deal with infor...
II-SDV 2017: Will Virtual Reality (VR) be changing the way we deal with infor...
 
II-SDV 2017: Datafari - Building an Open Source Enterprise Search Solution fr...
II-SDV 2017: Datafari - Building an Open Source Enterprise Search Solution fr...II-SDV 2017: Datafari - Building an Open Source Enterprise Search Solution fr...
II-SDV 2017: Datafari - Building an Open Source Enterprise Search Solution fr...
 
II-SDV 2017: What is Innovation and how can we measure it?
II-SDV 2017: What is Innovation and how can we measure it?II-SDV 2017: What is Innovation and how can we measure it?
II-SDV 2017: What is Innovation and how can we measure it?
 
II-SDV 2017: Auto Classification: Can/Should AI replace You?
II-SDV 2017: Auto Classification: Can/Should AI replace You? II-SDV 2017: Auto Classification: Can/Should AI replace You?
II-SDV 2017: Auto Classification: Can/Should AI replace You?
 
II-SDV 2017: Decoding the Gray Shades of Patent White Space Analysis
II-SDV 2017: Decoding the Gray Shades of Patent White Space AnalysisII-SDV 2017: Decoding the Gray Shades of Patent White Space Analysis
II-SDV 2017: Decoding the Gray Shades of Patent White Space Analysis
 
II-SV 2017: How to effectively monitor Technological Developments in IP
II-SV 2017: How to effectively monitor Technological Developments in IPII-SV 2017: How to effectively monitor Technological Developments in IP
II-SV 2017: How to effectively monitor Technological Developments in IP
 
II-SDV 2017: From KNIME to HighThroughPut Pipelining - from KNIME to HTPP
II-SDV 2017: From KNIME to HighThroughPut Pipelining - from KNIME to HTPPII-SDV 2017: From KNIME to HighThroughPut Pipelining - from KNIME to HTPP
II-SDV 2017: From KNIME to HighThroughPut Pipelining - from KNIME to HTPP
 
II-SDV 2017: Approaches of Web Information Analysis in a Day to Day Work Envi...
II-SDV 2017: Approaches of Web Information Analysis in a Day to Day Work Envi...II-SDV 2017: Approaches of Web Information Analysis in a Day to Day Work Envi...
II-SDV 2017: Approaches of Web Information Analysis in a Day to Day Work Envi...
 

Similaire à II-SDV 2017: The "International Chemical Ontology Network"

EnCOrE: Chemistry, Education, Knowledge From the Real to the Virtual Needs, P...
EnCOrE: Chemistry, Education, Knowledge From the Real to the Virtual Needs, P...EnCOrE: Chemistry, Education, Knowledge From the Real to the Virtual Needs, P...
EnCOrE: Chemistry, Education, Knowledge From the Real to the Virtual Needs, P...webscience-montpellier
 
Accessing small molecule data using ChEBI
Accessing small molecule data using ChEBIAccessing small molecule data using ChEBI
Accessing small molecule data using ChEBIDuncan Hull
 
Canonicalized systematic nomenclature in cheminformatics
Canonicalized systematic nomenclature in cheminformaticsCanonicalized systematic nomenclature in cheminformatics
Canonicalized systematic nomenclature in cheminformaticsJeremy Yang
 
An Open Annotation Ontology For Science On Web 3.0
An Open Annotation Ontology For Science On Web 3.0An Open Annotation Ontology For Science On Web 3.0
An Open Annotation Ontology For Science On Web 3.0Natasha Grant
 
SWSN UNIT-3.pptx we can information about swsn professional
SWSN UNIT-3.pptx we can information about swsn professionalSWSN UNIT-3.pptx we can information about swsn professional
SWSN UNIT-3.pptx we can information about swsn professionalgowthamnaidu0986
 
Chemoinformatic File Format.pptx
Chemoinformatic File Format.pptxChemoinformatic File Format.pptx
Chemoinformatic File Format.pptxwadhava gurumeet
 
AI and Machine Learning for Secondary Metabolite Prediction
AI and Machine Learning for Secondary Metabolite PredictionAI and Machine Learning for Secondary Metabolite Prediction
AI and Machine Learning for Secondary Metabolite PredictionYannick Djoumbou
 
Venkatesan bosc2010 onto-toolkit
Venkatesan bosc2010 onto-toolkitVenkatesan bosc2010 onto-toolkit
Venkatesan bosc2010 onto-toolkitBOSC 2010
 
Introduction to Ontologies for Environmental Biology
Introduction to Ontologies for Environmental BiologyIntroduction to Ontologies for Environmental Biology
Introduction to Ontologies for Environmental BiologyBarry Smith
 
A Cell-Cycle Knowledge Integration Framework
A Cell-Cycle Knowledge Integration FrameworkA Cell-Cycle Knowledge Integration Framework
A Cell-Cycle Knowledge Integration FrameworkLisa Muthukumar
 
Ecocyc database
Ecocyc databaseEcocyc database
Ecocyc databaseShiv Kumar
 
Chemoinformatics—an introduction for computer scientists
Chemoinformatics—an introduction for computer scientistsChemoinformatics—an introduction for computer scientists
Chemoinformatics—an introduction for computer scientistsunyil96
 
MADICES Mungall 2022.pptx
MADICES Mungall 2022.pptxMADICES Mungall 2022.pptx
MADICES Mungall 2022.pptxChris Mungall
 

Similaire à II-SDV 2017: The "International Chemical Ontology Network" (20)

Assignment 105B.pptx
Assignment 105B.pptxAssignment 105B.pptx
Assignment 105B.pptx
 
ChemSpider as a Foundation for Crowdsourcing and Collaborations in Open Chemi...
ChemSpider as a Foundation for Crowdsourcing and Collaborations in Open Chemi...ChemSpider as a Foundation for Crowdsourcing and Collaborations in Open Chemi...
ChemSpider as a Foundation for Crowdsourcing and Collaborations in Open Chemi...
 
EnCOrE: Chemistry, Education, Knowledge From the Real to the Virtual Needs, P...
EnCOrE: Chemistry, Education, Knowledge From the Real to the Virtual Needs, P...EnCOrE: Chemistry, Education, Knowledge From the Real to the Virtual Needs, P...
EnCOrE: Chemistry, Education, Knowledge From the Real to the Virtual Needs, P...
 
Automatic vs manual curation of a multisource chemical dictionary
Automatic vs manual curation of a multisource chemical dictionaryAutomatic vs manual curation of a multisource chemical dictionary
Automatic vs manual curation of a multisource chemical dictionary
 
Accessing small molecule data using ChEBI
Accessing small molecule data using ChEBIAccessing small molecule data using ChEBI
Accessing small molecule data using ChEBI
 
Canonicalized systematic nomenclature in cheminformatics
Canonicalized systematic nomenclature in cheminformaticsCanonicalized systematic nomenclature in cheminformatics
Canonicalized systematic nomenclature in cheminformatics
 
An Open Annotation Ontology For Science On Web 3.0
An Open Annotation Ontology For Science On Web 3.0An Open Annotation Ontology For Science On Web 3.0
An Open Annotation Ontology For Science On Web 3.0
 
SWSN UNIT-3.pptx we can information about swsn professional
SWSN UNIT-3.pptx we can information about swsn professionalSWSN UNIT-3.pptx we can information about swsn professional
SWSN UNIT-3.pptx we can information about swsn professional
 
Chemoinformatic File Format.pptx
Chemoinformatic File Format.pptxChemoinformatic File Format.pptx
Chemoinformatic File Format.pptx
 
AI and Machine Learning for Secondary Metabolite Prediction
AI and Machine Learning for Secondary Metabolite PredictionAI and Machine Learning for Secondary Metabolite Prediction
AI and Machine Learning for Secondary Metabolite Prediction
 
Venkatesan bosc2010 onto-toolkit
Venkatesan bosc2010 onto-toolkitVenkatesan bosc2010 onto-toolkit
Venkatesan bosc2010 onto-toolkit
 
Introduction to Ontologies for Environmental Biology
Introduction to Ontologies for Environmental BiologyIntroduction to Ontologies for Environmental Biology
Introduction to Ontologies for Environmental Biology
 
A Cell-Cycle Knowledge Integration Framework
A Cell-Cycle Knowledge Integration FrameworkA Cell-Cycle Knowledge Integration Framework
A Cell-Cycle Knowledge Integration Framework
 
Ecocyc database
Ecocyc databaseEcocyc database
Ecocyc database
 
Checking, Curating And Qualifying Chemistry
Checking, Curating And Qualifying ChemistryChecking, Curating And Qualifying Chemistry
Checking, Curating And Qualifying Chemistry
 
Chemoinformatics—an introduction for computer scientists
Chemoinformatics—an introduction for computer scientistsChemoinformatics—an introduction for computer scientists
Chemoinformatics—an introduction for computer scientists
 
Precompetitive preclinical ADME/tox data and set it free on the web to facili...
Precompetitive preclinical ADME/tox data and set it free on the web to facili...Precompetitive preclinical ADME/tox data and set it free on the web to facili...
Precompetitive preclinical ADME/tox data and set it free on the web to facili...
 
MADICES Mungall 2022.pptx
MADICES Mungall 2022.pptxMADICES Mungall 2022.pptx
MADICES Mungall 2022.pptx
 
mec
mecmec
mec
 
Ontology work at the Royal Society of Chemistry
Ontology work at the Royal Society of ChemistryOntology work at the Royal Society of Chemistry
Ontology work at the Royal Society of Chemistry
 

Plus de Dr. Haxel Consult

AI-SDV 2022: Henry Chang Patent Intelligence and Engineering Management
AI-SDV 2022: Henry Chang Patent Intelligence and Engineering ManagementAI-SDV 2022: Henry Chang Patent Intelligence and Engineering Management
AI-SDV 2022: Henry Chang Patent Intelligence and Engineering ManagementDr. Haxel Consult
 
AI-SDV 2022: Creation and updating of large Knowledge Graphs through NLP Anal...
AI-SDV 2022: Creation and updating of large Knowledge Graphs through NLP Anal...AI-SDV 2022: Creation and updating of large Knowledge Graphs through NLP Anal...
AI-SDV 2022: Creation and updating of large Knowledge Graphs through NLP Anal...Dr. Haxel Consult
 
AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...
AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...
AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...Dr. Haxel Consult
 
AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...
AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...
AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...Dr. Haxel Consult
 
AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...
AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...
AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...Dr. Haxel Consult
 
AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...
AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...
AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...Dr. Haxel Consult
 
AI-SDV 2022: Rolling out web crawling at Boehringer Ingelheim - 10 years of e...
AI-SDV 2022: Rolling out web crawling at Boehringer Ingelheim - 10 years of e...AI-SDV 2022: Rolling out web crawling at Boehringer Ingelheim - 10 years of e...
AI-SDV 2022: Rolling out web crawling at Boehringer Ingelheim - 10 years of e...Dr. Haxel Consult
 
AI-SDV 2022: Machine learning based patent categorization: A success story in...
AI-SDV 2022: Machine learning based patent categorization: A success story in...AI-SDV 2022: Machine learning based patent categorization: A success story in...
AI-SDV 2022: Machine learning based patent categorization: A success story in...Dr. Haxel Consult
 
AI-SDV 2022: Machine learning based patent categorization: A success story in...
AI-SDV 2022: Machine learning based patent categorization: A success story in...AI-SDV 2022: Machine learning based patent categorization: A success story in...
AI-SDV 2022: Machine learning based patent categorization: A success story in...Dr. Haxel Consult
 
AI-SDV 2022: Finding the WHAT – Will AI help? Nils Newman (Search Technology,...
AI-SDV 2022: Finding the WHAT – Will AI help? Nils Newman (Search Technology,...AI-SDV 2022: Finding the WHAT – Will AI help? Nils Newman (Search Technology,...
AI-SDV 2022: Finding the WHAT – Will AI help? Nils Newman (Search Technology,...Dr. Haxel Consult
 
AI-SDV 2022: New Insights from Trademarks with Natural Language Processing Al...
AI-SDV 2022: New Insights from Trademarks with Natural Language Processing Al...AI-SDV 2022: New Insights from Trademarks with Natural Language Processing Al...
AI-SDV 2022: New Insights from Trademarks with Natural Language Processing Al...Dr. Haxel Consult
 
AI-SDV 2022: Extracting information from tables in documents Holger Keibel (K...
AI-SDV 2022: Extracting information from tables in documents Holger Keibel (K...AI-SDV 2022: Extracting information from tables in documents Holger Keibel (K...
AI-SDV 2022: Extracting information from tables in documents Holger Keibel (K...Dr. Haxel Consult
 
AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...
AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...
AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...Dr. Haxel Consult
 
AI-SDV 2022: AI developments and usability Linus Wretblad (IPscreener / Uppdr...
AI-SDV 2022: AI developments and usability Linus Wretblad (IPscreener / Uppdr...AI-SDV 2022: AI developments and usability Linus Wretblad (IPscreener / Uppdr...
AI-SDV 2022: AI developments and usability Linus Wretblad (IPscreener / Uppdr...Dr. Haxel Consult
 
AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CE...
AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CE...AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CE...
AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CE...Dr. Haxel Consult
 
AI-SDV 2022: Copyright Clearance Center
AI-SDV 2022: Copyright Clearance CenterAI-SDV 2022: Copyright Clearance Center
AI-SDV 2022: Copyright Clearance CenterDr. Haxel Consult
 
AI-SDV 2022: New Product Introductions: CENTREDOC
AI-SDV 2022: New Product Introductions: CENTREDOCAI-SDV 2022: New Product Introductions: CENTREDOC
AI-SDV 2022: New Product Introductions: CENTREDOCDr. Haxel Consult
 
AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...
AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...
AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...Dr. Haxel Consult
 
AI-SDV 2022: Big data analytics platform at Bayer – Turning bits into insight...
AI-SDV 2022: Big data analytics platform at Bayer – Turning bits into insight...AI-SDV 2022: Big data analytics platform at Bayer – Turning bits into insight...
AI-SDV 2022: Big data analytics platform at Bayer – Turning bits into insight...Dr. Haxel Consult
 

Plus de Dr. Haxel Consult (20)

AI-SDV 2022: Henry Chang Patent Intelligence and Engineering Management
AI-SDV 2022: Henry Chang Patent Intelligence and Engineering ManagementAI-SDV 2022: Henry Chang Patent Intelligence and Engineering Management
AI-SDV 2022: Henry Chang Patent Intelligence and Engineering Management
 
AI-SDV 2022: Creation and updating of large Knowledge Graphs through NLP Anal...
AI-SDV 2022: Creation and updating of large Knowledge Graphs through NLP Anal...AI-SDV 2022: Creation and updating of large Knowledge Graphs through NLP Anal...
AI-SDV 2022: Creation and updating of large Knowledge Graphs through NLP Anal...
 
AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...
AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...
AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...
 
AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...
AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...
AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...
 
AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...
AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...
AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...
 
AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...
AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...
AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...
 
AI-SDV 2022: Rolling out web crawling at Boehringer Ingelheim - 10 years of e...
AI-SDV 2022: Rolling out web crawling at Boehringer Ingelheim - 10 years of e...AI-SDV 2022: Rolling out web crawling at Boehringer Ingelheim - 10 years of e...
AI-SDV 2022: Rolling out web crawling at Boehringer Ingelheim - 10 years of e...
 
AI-SDV 2022: Machine learning based patent categorization: A success story in...
AI-SDV 2022: Machine learning based patent categorization: A success story in...AI-SDV 2022: Machine learning based patent categorization: A success story in...
AI-SDV 2022: Machine learning based patent categorization: A success story in...
 
AI-SDV 2022: Machine learning based patent categorization: A success story in...
AI-SDV 2022: Machine learning based patent categorization: A success story in...AI-SDV 2022: Machine learning based patent categorization: A success story in...
AI-SDV 2022: Machine learning based patent categorization: A success story in...
 
AI-SDV 2022: Finding the WHAT – Will AI help? Nils Newman (Search Technology,...
AI-SDV 2022: Finding the WHAT – Will AI help? Nils Newman (Search Technology,...AI-SDV 2022: Finding the WHAT – Will AI help? Nils Newman (Search Technology,...
AI-SDV 2022: Finding the WHAT – Will AI help? Nils Newman (Search Technology,...
 
AI-SDV 2022: New Insights from Trademarks with Natural Language Processing Al...
AI-SDV 2022: New Insights from Trademarks with Natural Language Processing Al...AI-SDV 2022: New Insights from Trademarks with Natural Language Processing Al...
AI-SDV 2022: New Insights from Trademarks with Natural Language Processing Al...
 
AI-SDV 2022: Extracting information from tables in documents Holger Keibel (K...
AI-SDV 2022: Extracting information from tables in documents Holger Keibel (K...AI-SDV 2022: Extracting information from tables in documents Holger Keibel (K...
AI-SDV 2022: Extracting information from tables in documents Holger Keibel (K...
 
AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...
AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...
AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...
 
AI-SDV 2022: AI developments and usability Linus Wretblad (IPscreener / Uppdr...
AI-SDV 2022: AI developments and usability Linus Wretblad (IPscreener / Uppdr...AI-SDV 2022: AI developments and usability Linus Wretblad (IPscreener / Uppdr...
AI-SDV 2022: AI developments and usability Linus Wretblad (IPscreener / Uppdr...
 
AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CE...
AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CE...AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CE...
AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CE...
 
AI-SDV 2022: Copyright Clearance Center
AI-SDV 2022: Copyright Clearance CenterAI-SDV 2022: Copyright Clearance Center
AI-SDV 2022: Copyright Clearance Center
 
AI-SDV 2022: Lighthouse IP
AI-SDV 2022: Lighthouse IPAI-SDV 2022: Lighthouse IP
AI-SDV 2022: Lighthouse IP
 
AI-SDV 2022: New Product Introductions: CENTREDOC
AI-SDV 2022: New Product Introductions: CENTREDOCAI-SDV 2022: New Product Introductions: CENTREDOC
AI-SDV 2022: New Product Introductions: CENTREDOC
 
AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...
AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...
AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...
 
AI-SDV 2022: Big data analytics platform at Bayer – Turning bits into insight...
AI-SDV 2022: Big data analytics platform at Bayer – Turning bits into insight...AI-SDV 2022: Big data analytics platform at Bayer – Turning bits into insight...
AI-SDV 2022: Big data analytics platform at Bayer – Turning bits into insight...
 

Dernier

Russian Escort Abu Dhabi 0503464457 Abu DHabi Escorts
Russian Escort Abu Dhabi 0503464457 Abu DHabi EscortsRussian Escort Abu Dhabi 0503464457 Abu DHabi Escorts
Russian Escort Abu Dhabi 0503464457 Abu DHabi EscortsMonica Sydney
 
20240507 QFM013 Machine Intelligence Reading List April 2024.pdf
20240507 QFM013 Machine Intelligence Reading List April 2024.pdf20240507 QFM013 Machine Intelligence Reading List April 2024.pdf
20240507 QFM013 Machine Intelligence Reading List April 2024.pdfMatthew Sinclair
 
best call girls in Hyderabad Finest Escorts Service 📞 9352988975 📞 Available ...
best call girls in Hyderabad Finest Escorts Service 📞 9352988975 📞 Available ...best call girls in Hyderabad Finest Escorts Service 📞 9352988975 📞 Available ...
best call girls in Hyderabad Finest Escorts Service 📞 9352988975 📞 Available ...kajalverma014
 
Real Men Wear Diapers T Shirts sweatshirt
Real Men Wear Diapers T Shirts sweatshirtReal Men Wear Diapers T Shirts sweatshirt
Real Men Wear Diapers T Shirts sweatshirtrahman018755
 
Meaning of On page SEO & its process in detail.
Meaning of On page SEO & its process in detail.Meaning of On page SEO & its process in detail.
Meaning of On page SEO & its process in detail.krishnachandrapal52
 
原版制作美国爱荷华大学毕业证(iowa毕业证书)学位证网上存档可查
原版制作美国爱荷华大学毕业证(iowa毕业证书)学位证网上存档可查原版制作美国爱荷华大学毕业证(iowa毕业证书)学位证网上存档可查
原版制作美国爱荷华大学毕业证(iowa毕业证书)学位证网上存档可查ydyuyu
 
哪里办理美国迈阿密大学毕业证(本硕)umiami在读证明存档可查
哪里办理美国迈阿密大学毕业证(本硕)umiami在读证明存档可查哪里办理美国迈阿密大学毕业证(本硕)umiami在读证明存档可查
哪里办理美国迈阿密大学毕业证(本硕)umiami在读证明存档可查ydyuyu
 
20240510 QFM016 Irresponsible AI Reading List April 2024.pdf
20240510 QFM016 Irresponsible AI Reading List April 2024.pdf20240510 QFM016 Irresponsible AI Reading List April 2024.pdf
20240510 QFM016 Irresponsible AI Reading List April 2024.pdfMatthew Sinclair
 
在线制作约克大学毕业证(yu毕业证)在读证明认证可查
在线制作约克大学毕业证(yu毕业证)在读证明认证可查在线制作约克大学毕业证(yu毕业证)在读证明认证可查
在线制作约克大学毕业证(yu毕业证)在读证明认证可查ydyuyu
 
Russian Call girls in Abu Dhabi 0508644382 Abu Dhabi Call girls
Russian Call girls in Abu Dhabi 0508644382 Abu Dhabi Call girlsRussian Call girls in Abu Dhabi 0508644382 Abu Dhabi Call girls
Russian Call girls in Abu Dhabi 0508644382 Abu Dhabi Call girlsMonica Sydney
 
APNIC Policy Roundup, presented by Sunny Chendi at the 5th ICANN APAC-TWNIC E...
APNIC Policy Roundup, presented by Sunny Chendi at the 5th ICANN APAC-TWNIC E...APNIC Policy Roundup, presented by Sunny Chendi at the 5th ICANN APAC-TWNIC E...
APNIC Policy Roundup, presented by Sunny Chendi at the 5th ICANN APAC-TWNIC E...APNIC
 
APNIC Updates presented by Paul Wilson at ARIN 53
APNIC Updates presented by Paul Wilson at ARIN 53APNIC Updates presented by Paul Wilson at ARIN 53
APNIC Updates presented by Paul Wilson at ARIN 53APNIC
 
Top profile Call Girls In Dindigul [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Dindigul [ 7014168258 ] Call Me For Genuine Models ...Top profile Call Girls In Dindigul [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Dindigul [ 7014168258 ] Call Me For Genuine Models ...gajnagarg
 
Indian Escort in Abu DHabi 0508644382 Abu Dhabi Escorts
Indian Escort in Abu DHabi 0508644382 Abu Dhabi EscortsIndian Escort in Abu DHabi 0508644382 Abu Dhabi Escorts
Indian Escort in Abu DHabi 0508644382 Abu Dhabi EscortsMonica Sydney
 
Power point inglese - educazione civica di Nuria Iuzzolino
Power point inglese - educazione civica di Nuria IuzzolinoPower point inglese - educazione civica di Nuria Iuzzolino
Power point inglese - educazione civica di Nuria Iuzzolinonuriaiuzzolino1
 
"Boost Your Digital Presence: Partner with a Leading SEO Agency"
"Boost Your Digital Presence: Partner with a Leading SEO Agency""Boost Your Digital Presence: Partner with a Leading SEO Agency"
"Boost Your Digital Presence: Partner with a Leading SEO Agency"growthgrids
 
一比一原版(Flinders毕业证书)弗林德斯大学毕业证原件一模一样
一比一原版(Flinders毕业证书)弗林德斯大学毕业证原件一模一样一比一原版(Flinders毕业证书)弗林德斯大学毕业证原件一模一样
一比一原版(Flinders毕业证书)弗林德斯大学毕业证原件一模一样ayvbos
 
PowerDirector Explination Process...pptx
PowerDirector Explination Process...pptxPowerDirector Explination Process...pptx
PowerDirector Explination Process...pptxgalaxypingy
 
Best SEO Services Company in Dallas | Best SEO Agency Dallas
Best SEO Services Company in Dallas | Best SEO Agency DallasBest SEO Services Company in Dallas | Best SEO Agency Dallas
Best SEO Services Company in Dallas | Best SEO Agency DallasDigicorns Technologies
 
20240509 QFM015 Engineering Leadership Reading List April 2024.pdf
20240509 QFM015 Engineering Leadership Reading List April 2024.pdf20240509 QFM015 Engineering Leadership Reading List April 2024.pdf
20240509 QFM015 Engineering Leadership Reading List April 2024.pdfMatthew Sinclair
 

Dernier (20)

Russian Escort Abu Dhabi 0503464457 Abu DHabi Escorts
Russian Escort Abu Dhabi 0503464457 Abu DHabi EscortsRussian Escort Abu Dhabi 0503464457 Abu DHabi Escorts
Russian Escort Abu Dhabi 0503464457 Abu DHabi Escorts
 
20240507 QFM013 Machine Intelligence Reading List April 2024.pdf
20240507 QFM013 Machine Intelligence Reading List April 2024.pdf20240507 QFM013 Machine Intelligence Reading List April 2024.pdf
20240507 QFM013 Machine Intelligence Reading List April 2024.pdf
 
best call girls in Hyderabad Finest Escorts Service 📞 9352988975 📞 Available ...
best call girls in Hyderabad Finest Escorts Service 📞 9352988975 📞 Available ...best call girls in Hyderabad Finest Escorts Service 📞 9352988975 📞 Available ...
best call girls in Hyderabad Finest Escorts Service 📞 9352988975 📞 Available ...
 
Real Men Wear Diapers T Shirts sweatshirt
Real Men Wear Diapers T Shirts sweatshirtReal Men Wear Diapers T Shirts sweatshirt
Real Men Wear Diapers T Shirts sweatshirt
 
Meaning of On page SEO & its process in detail.
Meaning of On page SEO & its process in detail.Meaning of On page SEO & its process in detail.
Meaning of On page SEO & its process in detail.
 
原版制作美国爱荷华大学毕业证(iowa毕业证书)学位证网上存档可查
原版制作美国爱荷华大学毕业证(iowa毕业证书)学位证网上存档可查原版制作美国爱荷华大学毕业证(iowa毕业证书)学位证网上存档可查
原版制作美国爱荷华大学毕业证(iowa毕业证书)学位证网上存档可查
 
哪里办理美国迈阿密大学毕业证(本硕)umiami在读证明存档可查
哪里办理美国迈阿密大学毕业证(本硕)umiami在读证明存档可查哪里办理美国迈阿密大学毕业证(本硕)umiami在读证明存档可查
哪里办理美国迈阿密大学毕业证(本硕)umiami在读证明存档可查
 
20240510 QFM016 Irresponsible AI Reading List April 2024.pdf
20240510 QFM016 Irresponsible AI Reading List April 2024.pdf20240510 QFM016 Irresponsible AI Reading List April 2024.pdf
20240510 QFM016 Irresponsible AI Reading List April 2024.pdf
 
在线制作约克大学毕业证(yu毕业证)在读证明认证可查
在线制作约克大学毕业证(yu毕业证)在读证明认证可查在线制作约克大学毕业证(yu毕业证)在读证明认证可查
在线制作约克大学毕业证(yu毕业证)在读证明认证可查
 
Russian Call girls in Abu Dhabi 0508644382 Abu Dhabi Call girls
Russian Call girls in Abu Dhabi 0508644382 Abu Dhabi Call girlsRussian Call girls in Abu Dhabi 0508644382 Abu Dhabi Call girls
Russian Call girls in Abu Dhabi 0508644382 Abu Dhabi Call girls
 
APNIC Policy Roundup, presented by Sunny Chendi at the 5th ICANN APAC-TWNIC E...
APNIC Policy Roundup, presented by Sunny Chendi at the 5th ICANN APAC-TWNIC E...APNIC Policy Roundup, presented by Sunny Chendi at the 5th ICANN APAC-TWNIC E...
APNIC Policy Roundup, presented by Sunny Chendi at the 5th ICANN APAC-TWNIC E...
 
APNIC Updates presented by Paul Wilson at ARIN 53
APNIC Updates presented by Paul Wilson at ARIN 53APNIC Updates presented by Paul Wilson at ARIN 53
APNIC Updates presented by Paul Wilson at ARIN 53
 
Top profile Call Girls In Dindigul [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Dindigul [ 7014168258 ] Call Me For Genuine Models ...Top profile Call Girls In Dindigul [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Dindigul [ 7014168258 ] Call Me For Genuine Models ...
 
Indian Escort in Abu DHabi 0508644382 Abu Dhabi Escorts
Indian Escort in Abu DHabi 0508644382 Abu Dhabi EscortsIndian Escort in Abu DHabi 0508644382 Abu Dhabi Escorts
Indian Escort in Abu DHabi 0508644382 Abu Dhabi Escorts
 
Power point inglese - educazione civica di Nuria Iuzzolino
Power point inglese - educazione civica di Nuria IuzzolinoPower point inglese - educazione civica di Nuria Iuzzolino
Power point inglese - educazione civica di Nuria Iuzzolino
 
"Boost Your Digital Presence: Partner with a Leading SEO Agency"
"Boost Your Digital Presence: Partner with a Leading SEO Agency""Boost Your Digital Presence: Partner with a Leading SEO Agency"
"Boost Your Digital Presence: Partner with a Leading SEO Agency"
 
一比一原版(Flinders毕业证书)弗林德斯大学毕业证原件一模一样
一比一原版(Flinders毕业证书)弗林德斯大学毕业证原件一模一样一比一原版(Flinders毕业证书)弗林德斯大学毕业证原件一模一样
一比一原版(Flinders毕业证书)弗林德斯大学毕业证原件一模一样
 
PowerDirector Explination Process...pptx
PowerDirector Explination Process...pptxPowerDirector Explination Process...pptx
PowerDirector Explination Process...pptx
 
Best SEO Services Company in Dallas | Best SEO Agency Dallas
Best SEO Services Company in Dallas | Best SEO Agency DallasBest SEO Services Company in Dallas | Best SEO Agency Dallas
Best SEO Services Company in Dallas | Best SEO Agency Dallas
 
20240509 QFM015 Engineering Leadership Reading List April 2024.pdf
20240509 QFM015 Engineering Leadership Reading List April 2024.pdf20240509 QFM015 Engineering Leadership Reading List April 2024.pdf
20240509 QFM015 Engineering Leadership Reading List April 2024.pdf
 

II-SDV 2017: The "International Chemical Ontology Network"

  • 1. INCOT.NET René Deplanque | International Chemical Ontology Network @The International Information Conference on Search, Data Mining and Visualization.
  • 2. y
  • 3. SUMMERY • The final goal of this project is to build a system that is available to all member of the network. • It will be developed to make Big Data collections from various fields of chemistry / pharmacology manageable using parent ontologies. • Thus the development of an innovative industry 4.0 approach will be simplified and accelerated.
  • 4. PAINPOINTS OF TODAY'S DATA COLLECTIONS: Access Issues => Problems with finding and/or getting access to data Audience Issues => who is looking at data, how they perceive it, perspectives, language of discipline Chemical Structure Representation Issues => what areas are problems - inorganic, organometallic, large molecules, mixtures, chiral centers Community Issues => policies, procedures, and best practices we need to adopt to move things forwards Data Issues => standardization/interoperability, metadata, gaps, scale, and sharing, dark data Ontology/Vocabulary Issues => consensus on terms, maintenance, versions, optimal vocabularies, areas where needed Tools to Help Data/Metadata Capture Issues => adding metadata, feedback, consistency, synchronization
  • 5. InternetofThings AI 3D-Printing VirtualReality CloudComputing SocialMedia Mobility Analytics Security Energy / Utilities Consumer goods Entertainment / media Administration Insurance IT-technologies Pharmaceuticals Productions Industries Trade Telecommunication Banks Important Unchanged Unimportant Technologies Trends of the coming 3-5 YearsBasis 3700 Manager worldwide
  • 6. Source: Krallinger, M. et al. (2005) Text-mining approaches in molecular biology and biomedicine. DDT 10(6) 440
  • 7. Ontology Defined Google Definitions on the web • An ontology is a controlled vocabulary that describes objects and the relations between them in a formal way, and has a grammar for using the vocabulary terms to express something meaningful within a specified domain of interest. Source: members.optusnet.com.au/~webindexing/Webbook2Ed/glossary.htm • Ontology is the newest label attached to some KOSs. Ontologies are being developed as specific concept models by the Knowledge Management community. They can represent complex relationships between objects, and include the rules and axioms missing from semantic networks. Ontologies that describe knowledge in a specific area are often connected with systems for data mining and knowledge management. Source: www.und.nodak.edu/dept/library/Departments/abc/SACSEM-SemInGlossary.htm
  • 8. CREATING A COMPUTABLE CHEMICAL TAXONOMY REQUIRES THREE KEY COMPONENTS: A well-defined hierarchical taxonomic structure; A dictionary of chemical classes (with full definitions and category mappings); and Computable rules or algorithms for assigning chemicals to taxonomic categories.
  • 9. Semantic Web The Semantic Web "layer cake" as presented by Tim Berners-Lee. Source: Hendler, J. (2001) Agents and the semantic web. http://www.cs.umd.edu/users/hendler/AgentWeb.html
  • 10. KNOWN CLASSIFICATION SYSTEMS OF CHEMICAL SUBSTANCES  Classification as defined by EU regulations  Regulation (EC) No 1272/2008 on classification, labelling and packaging of substances and mixtures (the 'CLP Regulation').  Classification as defined by UBA (Germany’s environmental protection agency)  These criteria and limiting values help to determine hazardous physical-chemical properties as well as health and environmental hazards  The Anatomical Therapeutic Chemical Classification System (ATC/DDD of the world health organisation WHO)  The purpose of the ATC/DDD system is to serve as a tool for drug utilization research in order to improve quality of drug use.
  • 11.  GUIDANCE ON THE CLASSIFICATION OF HAZARDOUS CHEMICALS UNDER THE WHS REGULATIONS  This Guidance is intended for manufacturers and importers of substances, mixtures and articles who have a duty under the World Health and Safety (WHS) Act and Regulations to classify them  the Globally Harmonised System of Classification and Labelling of Chemicals (the GHS).  The WHS Regulations also implement the harmonised hazard communication elements of the GHS that are to appear on labels and safety data sheets (SDS)  The Chemical Fragmentation Coding system  It was developed in 1963 by the Derwent World Patent Index (DWPI) to facilitate the manual classification of chemical compounds reported in patents.  The system consists of 2200 numerical codes corresponding to a set of pre-defined, chemically significant structure fragments
  • 12. Tools for developing Chemical Ontologies  HOSE (Hierarchical Organisation of Spherical Environments) code.  This hierarchical substructure system, allows one to automatically characterize atoms and complete rings in terms of their spherical environment  Gene Ontology (GO) system,  was one of the first open-source, automated functional group ontologies to be formalized.  CO functional groups can be automatically assigned to a given structure by Checkmol a freely available program. CO’s assignment of functional groups is accurate and consistent, and it has been applied to several small datasets. However,  the CO system is limited to just ~200 chemical groups  SODIAC tool for automatic compound classification.  It uses a comprehensive chemical ontology and an elegant structure- based reasoning logic.  The underlying chemical ontology can be freely downloaded and the SODIAC software, which is closed-source, is free for academics
  • 13. WHAT ARE THE MAJOR PROBLEMS ➢ In contrast to biology, geology, and many other scientific disciplines, the world of chemistry still lacks a standardized chemical ontology or taxonomy ➢ The chemical classification of a compound could help predict its metabolic fate in humans, its drug ability or potential hazards associated with it. ➢ The sheer number (tens of millions of compounds) and complexity of chemical structures is such that any manual classification effort would prove to be near impossible
  • 14. two-ring heterocyclic compounds isoquinolines isoquinoline alkaloids morphinans morphine grouped_by_chemistry FRAGMENT OF CHEMICAL ONTOLOGY molecules organic molecules heterocyclic compounds bridged-ring heterocyclic compounds morphinans morphine IsA O N OH OH CH3 H NH H morphine morphinan IsA Source: Ennis, M. (2004) ChEBI A Dictionary of Chemical Entities with an Associated Ontology. SOFG-2, Philadelphia, October 23-26 2004
  • 15.
  • 16. CH3 O NH2 H O OHCH2 OH NH2 H O O - CH3 O NH2 H O O - CH2 OH NH2 H O OH CH3 O H NH2 O OH CH2 OH H NH2 O O - CH2 OH H NH2 O OH CH3 O H NH2 O O - L-Amino acid D-Amino acid Amino acid CO2H OH =O NH2 CO2 ¯ is_a is_part_of is_enantiomer_of is_conjugate_base_of is_tautomer_of Source: Ennis, M. (2004) ChEBI A Dictionary of Chemical Entities with an Associated Ontology. SOFG-2, Philadelphia, October 23-26 2004
  • 17. AUTOMATED CHEMICAL CLASSIFICATION SYSTEM However, as in PubChem, the annotation is incomplete. Class assignments to “clavams” and “azetidines”, among others, are missing
  • 19. HOW TO WORK WITH ONTOLOGIES Michael Büttner Ontology Learning
  • 20. THE ONTOLOGY DEVELOPMENT PROCESS Michael Büttner Ontology Learning
  • 21. WHAT DO WE HAVE - WHAT DO WE NEED ➢Chemists have a standardized nomenclature (IUPAC, CAS, REAXIS) ➢Chemists have standardized methods for drawing or exchanging chemical structures ➢Chemistry still lacks a standardized, comprehensive, and clearly defined chemical taxonomy or chemical ontology
  • 22. WHAT WAS DONE ➢ Chemist have developed domain specific ontologies ➢ Medical Chemist classify according to pharmaceutical activities (antibacterial antihypertensive) ➢ Biochemist classify according biosynthetic origin (nucleic acids, terpenoids) ➢ They do not fit ➢ In the PubChem database only 0.12% of the >91,000,000 compounds (as of June 2016) are classified via the MeSH thesaurus
  • 23. WHO AND WHAT IS INCOT.NET  The Problem of defining overlapping Ontologies is of such a magnitude that it can not be solved by a single Organisation.  INCOT.NET is an organisation based on an idea, need and interest of major Chemical Companies.  It is organized as independent Partnership  It is attempting to coordinate a large variety of Organisations to solve major pre-production problems.  One of the prototype problems will be: The use of Ontologies in the development of new methodologies for the development of new Antibiotics.
  • 24. Thank you for your patience you will need it for your future