SlideShare une entreprise Scribd logo
1  sur  26
Télécharger pour lire hors ligne
Copyright ©2013 Molecular Connections Pvt. Ltd. All rights Reserved 1
Enriching Content with Semantic Tagging
Molecular Connections, Bangalore, India
www.molecularconnections.com
ICIC 2013, Vienna
Copyright ©2013 Molecular Connections Pvt. Ltd. All rights Reserved 2
Outline
• Introduction to MC
• Content Enrichment – Concept
• Content Enrichment Use Case
• Key Take Aways
About MC OPERATIONS
 Information curation and annotation expertise
 work with leading R & D Institutions , STM publishing &
IP Search & Law Firms
 Right mix of human resources and scale
 LifeScience (Bio – Chem), Engineering, IP, information
and technology background
 Established workflow and processes to ensure quality
and on time delivery
 ISO 27001: 2005 Certified knowledge management
platforms and workflow systems
CORPORATE
 Established in 2001
 Executive team backed by
renowned informaticans & strong
advisory board -~ 1000 strong
 Scalable & state of the art
infrastructure
 Global footprint
 Core Values: Customer focused,
Quality, Ethics, Excellence,
Accountability
Life Sciences
companies
Text mining &
Informatics
IP
Verticals
Publishing,
R & D
Institutions
 MCPaIRS
 MCDESiGN
 Patent Search Services
Highly
Customized
Services
CONTENT
MINING
CONTENT
REPRESENTATION
/ DELIVERY
CONTENT
MANAGEMENT
 App Development
 User Interface Design
 Visualization
 Analytics
• Indexing ( automatic and semi-automatic),
• Abstraction (manual and semi-automatic)
• Open Access Data Mining
• Content Enrichment
• Semantic Tagging & systematic review of
literature
• MC Outlink - Text Mining & Discovery
• Developing customized text mining engines
• Ontology Building
• Custom Dbase Creation
• Content Normalization
End <– to –> End Solutions
Over 3500 Man Years of expertise
MC - Solutions
Semantic Tagging
Text Mining
Ontology
Mapping
Augmented
Reference
Outlinking
Enriching Content
CONTENT
ENRICHMENT
Copyright ©2013 Molecular Connections Pvt. Ltd. All rights Reserved 7
Why CE?
• Enables deeper knowledge discovery from diverse sources like patent,
databases, journal etc.
• Semantic tagging ensures that different names of an entity are mapped
to standard name and hence, searchable by any name.
For Instance: Discoverability is a challenge in pharma patents as entities
of interest may be named differently in different patents by different
authors.
• Publishers are quick to adopt CE, time to adopt it for patents?
Copyright ©2013 Molecular Connections Pvt. Ltd. All rights Reserved 8
Unlocking Small Data to Big Data
Number of articles (diamonds) and patents (open boxes) abstracted
annually by Chemical Abstracts Services
Bachrach Journal of Cheminformatics 2009 1:2 doi:10.1186/1758-2946-1-2
Need Smarter Content
Copyright ©2013 Molecular Connections Pvt. Ltd. All rights Reserved 9
Leveraging Linked Data
Copyright ©2013 Molecular Connections Pvt. Ltd. All rights Reserved 10
Implementation - Content Enrichment Levels
What kind of Content Enrichment can be done?
• Entity
• Document
• Others
- Journal article
- Patent
- Book chapter
- Image
- Table
- Multimedia
- News links
- Author/Assignee, Protein, Gene, Drug, Chemical, Disease,
Reaction, Organism, Technology, Organization
Copyright ©2013 Molecular Connections Pvt. Ltd. All rights Reserved 11
Content Enrichment – Use Case
MCPaIRS TM (Proprietary Indian Patent Database)
•"Expertly , Manually Curated,
Fully Searchable, Value Added
Knowledgebase" of Full Text of
Indian Granted and Applied
Patents
•Caters to a diversified user-base
of bench Scientists, Engineers,
R&D Managers & Business
Professionals.
Molecular Connections Patent Information Retrieval System
Copyright ©2013 Molecular Connections Pvt. Ltd. All rights Reserved 13
MCPaIRS TM – Homepage
Copyright ©2013 Molecular Connections Pvt. Ltd. All rights Reserved 14
MCPaIRS TM – Search
Copyright ©2013 Molecular Connections Pvt. Ltd. All rights Reserved 15
MCPaIRS TM – View Patent
Copyright ©2013 Molecular Connections Pvt. Ltd. All rights Reserved 16
Demo of actual full text document
Benefits of Semantic Search Cartridge Enabled
MCPaIRS TM
 All results in a single query
 Automatic Expansion of the query with all possible synonyms
 Broadening of the search query
 Complex search queries possible
 All the synonyms highlighted
17
Automatic Expansion of the query with all
possible synonyms
18
Automatic Expansion of the query with all
possible synonyms
Multiple key-words highlighted for the
search: VEGF
Complex Queries can be performed by using
operators
Boolean search is performed
Sample queries with Semantic Search Cartridge
No Query
No of results in
iPairs
No of results in
mcpairs
No of results in mcpairs with
semantic search cartridge
1 Salbutamol 27 1560 2548
2 Amethocaine 0 58 954
3 Diazepam 4 1725 2146
4 Valsartan 84 1372 1429
5 Imatinib 65 1703 1999
6 Tamoxifen 16 3950 4190
7 Aspirin 61 5679 6427
8 Paracetamol 74 1161 3696
9 MyoD 2 130 138
10 Pax3 1 49 56
11 Sox9 0 39 58
12 FGF10 0 43 131
13 VEGF 192 4808 6058
14 BMP2 5 137 214
15 Salbutamol AND CD48 0 0 4
Copyright ©2013 Molecular Connections Pvt. Ltd. All rights Reserved 23
Benefit - Identifying Related Patents
A B
Proteins
Chemicals
Indications
…….
Proteins
Chemicals
Indications
…….
Similarity Score
Relatedness
Copyright ©2013 Molecular Connections Pvt. Ltd. All rights Reserved 24
Content Enrichment Approaches
• Manual
 high quality, costly, not scalable, slow
• Automated
 fast, quality below par, cost effective, scalable
• Hybrid
 high quality, cost effective, scalable, reasonable
speed
Molecular Connections is a pioneer in the use of hybrid approach to content enrichment
Copyright ©2013 Molecular Connections Pvt. Ltd. All rights Reserved 25
Key Takeaways
 Content Enrichment can improve search and retrieval
immensely
?? CE can be looked at various levels
- Biology / chemistry / both / authors etc.
 You can bring the Web into the document through CE
- e.g. Augmented reference cards
 Growing Adoption of Content Enrichment
- Publishing (Early adopters)
- Patents
Copyright ©2013 Molecular Connections Pvt. Ltd. All rights Reserved 26
Thank You
Molecular Connections
www.molecularconnections.com

Contenu connexe

Tendances

II-SDV 2014 The Road to Federated Text Mining: Are we there yet? (Guy Singh -...
II-SDV 2014 The Road to Federated Text Mining: Are we there yet? (Guy Singh -...II-SDV 2014 The Road to Federated Text Mining: Are we there yet? (Guy Singh -...
II-SDV 2014 The Road to Federated Text Mining: Are we there yet? (Guy Singh -...
Dr. Haxel Consult
 
II-SDV 2014 Recommender Systems for Analysis Applications (Roger Bradford - A...
II-SDV 2014 Recommender Systems for Analysis Applications (Roger Bradford - A...II-SDV 2014 Recommender Systems for Analysis Applications (Roger Bradford - A...
II-SDV 2014 Recommender Systems for Analysis Applications (Roger Bradford - A...
Dr. Haxel Consult
 
CIKM2020 Keynote: Accelerating discovery science with an Internet of FAIR dat...
CIKM2020 Keynote: Accelerating discovery science with an Internet of FAIR dat...CIKM2020 Keynote: Accelerating discovery science with an Internet of FAIR dat...
CIKM2020 Keynote: Accelerating discovery science with an Internet of FAIR dat...
Michel Dumontier
 

Tendances (20)

AI-SDV 2021 - Klaus Kater - The secret of successful CI: precise targeting + ...
AI-SDV 2021 - Klaus Kater - The secret of successful CI: precise targeting + ...AI-SDV 2021 - Klaus Kater - The secret of successful CI: precise targeting + ...
AI-SDV 2021 - Klaus Kater - The secret of successful CI: precise targeting + ...
 
IC-SDV 2018: Stefan Geißler (Expert System) Navigating to new shores: the Bio...
IC-SDV 2018: Stefan Geißler (Expert System) Navigating to new shores: the Bio...IC-SDV 2018: Stefan Geißler (Expert System) Navigating to new shores: the Bio...
IC-SDV 2018: Stefan Geißler (Expert System) Navigating to new shores: the Bio...
 
Pistoia Alliance USA Conference 2016
Pistoia Alliance USA Conference 2016Pistoia Alliance USA Conference 2016
Pistoia Alliance USA Conference 2016
 
II-SDV 2014 The Road to Federated Text Mining: Are we there yet? (Guy Singh -...
II-SDV 2014 The Road to Federated Text Mining: Are we there yet? (Guy Singh -...II-SDV 2014 The Road to Federated Text Mining: Are we there yet? (Guy Singh -...
II-SDV 2014 The Road to Federated Text Mining: Are we there yet? (Guy Singh -...
 
II-SDV 2014 Recommender Systems for Analysis Applications (Roger Bradford - A...
II-SDV 2014 Recommender Systems for Analysis Applications (Roger Bradford - A...II-SDV 2014 Recommender Systems for Analysis Applications (Roger Bradford - A...
II-SDV 2014 Recommender Systems for Analysis Applications (Roger Bradford - A...
 
SAML protected resources: the theory and practice of granularity and manageme...
SAML protected resources: the theory and practice of granularity and manageme...SAML protected resources: the theory and practice of granularity and manageme...
SAML protected resources: the theory and practice of granularity and manageme...
 
Open PHACTS MIOSS may 2016
Open PHACTS MIOSS may 2016Open PHACTS MIOSS may 2016
Open PHACTS MIOSS may 2016
 
Pistoia Alliance USA Conference 2016
Pistoia Alliance USA Conference 2016Pistoia Alliance USA Conference 2016
Pistoia Alliance USA Conference 2016
 
CIKM2020 Keynote: Accelerating discovery science with an Internet of FAIR dat...
CIKM2020 Keynote: Accelerating discovery science with an Internet of FAIR dat...CIKM2020 Keynote: Accelerating discovery science with an Internet of FAIR dat...
CIKM2020 Keynote: Accelerating discovery science with an Internet of FAIR dat...
 
Channeling insights to the right people
Channeling insights to the right peopleChanneling insights to the right people
Channeling insights to the right people
 
Pistoia Alliance USA Conference 2016
Pistoia Alliance USA Conference 2016Pistoia Alliance USA Conference 2016
Pistoia Alliance USA Conference 2016
 
Digital Representation of Privacy Terms
Digital Representation of Privacy TermsDigital Representation of Privacy Terms
Digital Representation of Privacy Terms
 
Emily Thompson
Emily ThompsonEmily Thompson
Emily Thompson
 
Pistoia Alliance USA Conference 2016
Pistoia Alliance USA Conference 2016Pistoia Alliance USA Conference 2016
Pistoia Alliance USA Conference 2016
 
Privacy preserving computing and secure multi-party computation ISACA Atlanta
Privacy preserving computing and secure multi-party computation ISACA AtlantaPrivacy preserving computing and secure multi-party computation ISACA Atlanta
Privacy preserving computing and secure multi-party computation ISACA Atlanta
 
W3C DPVCG - DPV v0.2
W3C DPVCG - DPV v0.2W3C DPVCG - DPV v0.2
W3C DPVCG - DPV v0.2
 
AI-SDV 2020: Using Transformer technology to build an AI based personal News ...
AI-SDV 2020: Using Transformer technology to build an AI based personal News ...AI-SDV 2020: Using Transformer technology to build an AI based personal News ...
AI-SDV 2020: Using Transformer technology to build an AI based personal News ...
 
Data Analytics and the Legal Landscape: Intellectual Property and Data Protec...
Data Analytics and the Legal Landscape: Intellectual Property and Data Protec...Data Analytics and the Legal Landscape: Intellectual Property and Data Protec...
Data Analytics and the Legal Landscape: Intellectual Property and Data Protec...
 
Pistoia Alliance USA Conference 2016
Pistoia Alliance USA Conference 2016Pistoia Alliance USA Conference 2016
Pistoia Alliance USA Conference 2016
 
Future data security ‘will come from several sources’
Future data security ‘will come from several sources’Future data security ‘will come from several sources’
Future data security ‘will come from several sources’
 

En vedette (6)

Building Blocks of IFRS 9 Impairment Modeling
Building Blocks of IFRS 9 Impairment ModelingBuilding Blocks of IFRS 9 Impairment Modeling
Building Blocks of IFRS 9 Impairment Modeling
 
Credit Impairment under IFRS 9 for Banks
Credit Impairment under IFRS 9 for BanksCredit Impairment under IFRS 9 for Banks
Credit Impairment under IFRS 9 for Banks
 
IFRS 9 conference presentation - Philip Lewis
IFRS 9 conference presentation - Philip LewisIFRS 9 conference presentation - Philip Lewis
IFRS 9 conference presentation - Philip Lewis
 
Ifrs 9
Ifrs 9Ifrs 9
Ifrs 9
 
IFRS 9 Overview (For all Accountants)
IFRS 9 Overview (For all Accountants)IFRS 9 Overview (For all Accountants)
IFRS 9 Overview (For all Accountants)
 
Build Features, Not Apps
Build Features, Not AppsBuild Features, Not Apps
Build Features, Not Apps
 

Similaire à ICIC 2013 Conference Proceedings Krishna Molecular Connections

(ATS6-APP03) Thomson Rueters Content used in Acclrys Pipeline Pilot
(ATS6-APP03) Thomson Rueters Content used in Acclrys Pipeline Pilot(ATS6-APP03) Thomson Rueters Content used in Acclrys Pipeline Pilot
(ATS6-APP03) Thomson Rueters Content used in Acclrys Pipeline Pilot
BIOVIA
 
Maven and google pharma r&d (1)
Maven and google pharma r&d  (1)Maven and google pharma r&d  (1)
Maven and google pharma r&d (1)
Matt Barnes
 

Similaire à ICIC 2013 Conference Proceedings Krishna Molecular Connections (20)

Drug Discovery Innovation in a Precompetitive Cloud Platform (LFS302-S) - AWS...
Drug Discovery Innovation in a Precompetitive Cloud Platform (LFS302-S) - AWS...Drug Discovery Innovation in a Precompetitive Cloud Platform (LFS302-S) - AWS...
Drug Discovery Innovation in a Precompetitive Cloud Platform (LFS302-S) - AWS...
 
(ATS6-APP03) Thomson Rueters Content used in Acclrys Pipeline Pilot
(ATS6-APP03) Thomson Rueters Content used in Acclrys Pipeline Pilot(ATS6-APP03) Thomson Rueters Content used in Acclrys Pipeline Pilot
(ATS6-APP03) Thomson Rueters Content used in Acclrys Pipeline Pilot
 
ELIXIR and Industry presentation given by Jerome Wojcik, CEO, Quartz Bio at E...
ELIXIR and Industry presentation given by Jerome Wojcik, CEO, Quartz Bio at E...ELIXIR and Industry presentation given by Jerome Wojcik, CEO, Quartz Bio at E...
ELIXIR and Industry presentation given by Jerome Wojcik, CEO, Quartz Bio at E...
 
Joint Pistoia Alliance & PRISME AI in pharma webinar 18 Oct 2018
Joint Pistoia Alliance & PRISME AI in pharma webinar 18 Oct 2018Joint Pistoia Alliance & PRISME AI in pharma webinar 18 Oct 2018
Joint Pistoia Alliance & PRISME AI in pharma webinar 18 Oct 2018
 
PharmaLedger Press Release #2 June 2020
PharmaLedger Press Release #2 June 2020 PharmaLedger Press Release #2 June 2020
PharmaLedger Press Release #2 June 2020
 
Connected Health: The Importance of Systems Integration
Connected Health: The Importance of Systems IntegrationConnected Health: The Importance of Systems Integration
Connected Health: The Importance of Systems Integration
 
PharmaLedger: A Digital Trust Ecosystem for Healthcare
PharmaLedger: A Digital Trust Ecosystem for HealthcarePharmaLedger: A Digital Trust Ecosystem for Healthcare
PharmaLedger: A Digital Trust Ecosystem for Healthcare
 
2011-12-02 Open PHACTS at STM Innovation
2011-12-02 Open PHACTS at STM Innovation2011-12-02 Open PHACTS at STM Innovation
2011-12-02 Open PHACTS at STM Innovation
 
Compound passport (BOS)
Compound passport (BOS)Compound passport (BOS)
Compound passport (BOS)
 
About Indegene
About IndegeneAbout Indegene
About Indegene
 
Michael Alvers, Transinsight, DE (Fortissimo)
Michael Alvers, Transinsight, DE (Fortissimo)Michael Alvers, Transinsight, DE (Fortissimo)
Michael Alvers, Transinsight, DE (Fortissimo)
 
e-HealthWhitepaper
e-HealthWhitepapere-HealthWhitepaper
e-HealthWhitepaper
 
New technologies for data protection
New technologies for data protectionNew technologies for data protection
New technologies for data protection
 
Maven and google pharma r&d (1)
Maven and google pharma r&d  (1)Maven and google pharma r&d  (1)
Maven and google pharma r&d (1)
 
Introduction to healthcare and life sciences
Introduction to healthcare and life sciencesIntroduction to healthcare and life sciences
Introduction to healthcare and life sciences
 
Precompetitive Collaborations
Precompetitive CollaborationsPrecompetitive Collaborations
Precompetitive Collaborations
 
SmartChem Presentation
SmartChem PresentationSmartChem Presentation
SmartChem Presentation
 
Artificial intelligence robotics and computational fluid dynamics
Artificial intelligence robotics and computational fluid dynamics Artificial intelligence robotics and computational fluid dynamics
Artificial intelligence robotics and computational fluid dynamics
 
Managing Regulatory Compliance and Food safety With Cloud Data Supervision-Ch...
Managing Regulatory Compliance and Food safety With Cloud Data Supervision-Ch...Managing Regulatory Compliance and Food safety With Cloud Data Supervision-Ch...
Managing Regulatory Compliance and Food safety With Cloud Data Supervision-Ch...
 
Kemxtree Presentation
Kemxtree PresentationKemxtree Presentation
Kemxtree Presentation
 

Plus de Dr. Haxel Consult

AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...
AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...
AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...
Dr. Haxel Consult
 
AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...
AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...
AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...
Dr. Haxel Consult
 
AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CE...
AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CE...AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CE...
AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CE...
Dr. Haxel Consult
 

Plus de Dr. Haxel Consult (20)

AI-SDV 2022: Henry Chang Patent Intelligence and Engineering Management
AI-SDV 2022: Henry Chang Patent Intelligence and Engineering ManagementAI-SDV 2022: Henry Chang Patent Intelligence and Engineering Management
AI-SDV 2022: Henry Chang Patent Intelligence and Engineering Management
 
AI-SDV 2022: Creation and updating of large Knowledge Graphs through NLP Anal...
AI-SDV 2022: Creation and updating of large Knowledge Graphs through NLP Anal...AI-SDV 2022: Creation and updating of large Knowledge Graphs through NLP Anal...
AI-SDV 2022: Creation and updating of large Knowledge Graphs through NLP Anal...
 
AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...
AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...
AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...
 
AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...
AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...
AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...
 
AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...
AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...
AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...
 
AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...
AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...
AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...
 
AI-SDV 2022: Rolling out web crawling at Boehringer Ingelheim - 10 years of e...
AI-SDV 2022: Rolling out web crawling at Boehringer Ingelheim - 10 years of e...AI-SDV 2022: Rolling out web crawling at Boehringer Ingelheim - 10 years of e...
AI-SDV 2022: Rolling out web crawling at Boehringer Ingelheim - 10 years of e...
 
AI-SDV 2022: Machine learning based patent categorization: A success story in...
AI-SDV 2022: Machine learning based patent categorization: A success story in...AI-SDV 2022: Machine learning based patent categorization: A success story in...
AI-SDV 2022: Machine learning based patent categorization: A success story in...
 
AI-SDV 2022: Machine learning based patent categorization: A success story in...
AI-SDV 2022: Machine learning based patent categorization: A success story in...AI-SDV 2022: Machine learning based patent categorization: A success story in...
AI-SDV 2022: Machine learning based patent categorization: A success story in...
 
AI-SDV 2022: Finding the WHAT – Will AI help? Nils Newman (Search Technology,...
AI-SDV 2022: Finding the WHAT – Will AI help? Nils Newman (Search Technology,...AI-SDV 2022: Finding the WHAT – Will AI help? Nils Newman (Search Technology,...
AI-SDV 2022: Finding the WHAT – Will AI help? Nils Newman (Search Technology,...
 
AI-SDV 2022: New Insights from Trademarks with Natural Language Processing Al...
AI-SDV 2022: New Insights from Trademarks with Natural Language Processing Al...AI-SDV 2022: New Insights from Trademarks with Natural Language Processing Al...
AI-SDV 2022: New Insights from Trademarks with Natural Language Processing Al...
 
AI-SDV 2022: Extracting information from tables in documents Holger Keibel (K...
AI-SDV 2022: Extracting information from tables in documents Holger Keibel (K...AI-SDV 2022: Extracting information from tables in documents Holger Keibel (K...
AI-SDV 2022: Extracting information from tables in documents Holger Keibel (K...
 
AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...
AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...
AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...
 
AI-SDV 2022: AI developments and usability Linus Wretblad (IPscreener / Uppdr...
AI-SDV 2022: AI developments and usability Linus Wretblad (IPscreener / Uppdr...AI-SDV 2022: AI developments and usability Linus Wretblad (IPscreener / Uppdr...
AI-SDV 2022: AI developments and usability Linus Wretblad (IPscreener / Uppdr...
 
AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CE...
AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CE...AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CE...
AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CE...
 
AI-SDV 2022: Copyright Clearance Center
AI-SDV 2022: Copyright Clearance CenterAI-SDV 2022: Copyright Clearance Center
AI-SDV 2022: Copyright Clearance Center
 
AI-SDV 2022: Lighthouse IP
AI-SDV 2022: Lighthouse IPAI-SDV 2022: Lighthouse IP
AI-SDV 2022: Lighthouse IP
 
AI-SDV 2022: New Product Introductions: CENTREDOC
AI-SDV 2022: New Product Introductions: CENTREDOCAI-SDV 2022: New Product Introductions: CENTREDOC
AI-SDV 2022: New Product Introductions: CENTREDOC
 
AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...
AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...
AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...
 
AI-SDV 2022: Big data analytics platform at Bayer – Turning bits into insight...
AI-SDV 2022: Big data analytics platform at Bayer – Turning bits into insight...AI-SDV 2022: Big data analytics platform at Bayer – Turning bits into insight...
AI-SDV 2022: Big data analytics platform at Bayer – Turning bits into insight...
 

Dernier

Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 

Dernier (20)

Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 

ICIC 2013 Conference Proceedings Krishna Molecular Connections

  • 1. Copyright ©2013 Molecular Connections Pvt. Ltd. All rights Reserved 1 Enriching Content with Semantic Tagging Molecular Connections, Bangalore, India www.molecularconnections.com ICIC 2013, Vienna
  • 2. Copyright ©2013 Molecular Connections Pvt. Ltd. All rights Reserved 2 Outline • Introduction to MC • Content Enrichment – Concept • Content Enrichment Use Case • Key Take Aways
  • 3. About MC OPERATIONS  Information curation and annotation expertise  work with leading R & D Institutions , STM publishing & IP Search & Law Firms  Right mix of human resources and scale  LifeScience (Bio – Chem), Engineering, IP, information and technology background  Established workflow and processes to ensure quality and on time delivery  ISO 27001: 2005 Certified knowledge management platforms and workflow systems CORPORATE  Established in 2001  Executive team backed by renowned informaticans & strong advisory board -~ 1000 strong  Scalable & state of the art infrastructure  Global footprint  Core Values: Customer focused, Quality, Ethics, Excellence, Accountability
  • 4. Life Sciences companies Text mining & Informatics IP Verticals Publishing, R & D Institutions  MCPaIRS  MCDESiGN  Patent Search Services
  • 5. Highly Customized Services CONTENT MINING CONTENT REPRESENTATION / DELIVERY CONTENT MANAGEMENT  App Development  User Interface Design  Visualization  Analytics • Indexing ( automatic and semi-automatic), • Abstraction (manual and semi-automatic) • Open Access Data Mining • Content Enrichment • Semantic Tagging & systematic review of literature • MC Outlink - Text Mining & Discovery • Developing customized text mining engines • Ontology Building • Custom Dbase Creation • Content Normalization End <– to –> End Solutions Over 3500 Man Years of expertise MC - Solutions
  • 7. Copyright ©2013 Molecular Connections Pvt. Ltd. All rights Reserved 7 Why CE? • Enables deeper knowledge discovery from diverse sources like patent, databases, journal etc. • Semantic tagging ensures that different names of an entity are mapped to standard name and hence, searchable by any name. For Instance: Discoverability is a challenge in pharma patents as entities of interest may be named differently in different patents by different authors. • Publishers are quick to adopt CE, time to adopt it for patents?
  • 8. Copyright ©2013 Molecular Connections Pvt. Ltd. All rights Reserved 8 Unlocking Small Data to Big Data Number of articles (diamonds) and patents (open boxes) abstracted annually by Chemical Abstracts Services Bachrach Journal of Cheminformatics 2009 1:2 doi:10.1186/1758-2946-1-2 Need Smarter Content
  • 9. Copyright ©2013 Molecular Connections Pvt. Ltd. All rights Reserved 9 Leveraging Linked Data
  • 10. Copyright ©2013 Molecular Connections Pvt. Ltd. All rights Reserved 10 Implementation - Content Enrichment Levels What kind of Content Enrichment can be done? • Entity • Document • Others - Journal article - Patent - Book chapter - Image - Table - Multimedia - News links - Author/Assignee, Protein, Gene, Drug, Chemical, Disease, Reaction, Organism, Technology, Organization
  • 11. Copyright ©2013 Molecular Connections Pvt. Ltd. All rights Reserved 11 Content Enrichment – Use Case
  • 12. MCPaIRS TM (Proprietary Indian Patent Database) •"Expertly , Manually Curated, Fully Searchable, Value Added Knowledgebase" of Full Text of Indian Granted and Applied Patents •Caters to a diversified user-base of bench Scientists, Engineers, R&D Managers & Business Professionals. Molecular Connections Patent Information Retrieval System
  • 13. Copyright ©2013 Molecular Connections Pvt. Ltd. All rights Reserved 13 MCPaIRS TM – Homepage
  • 14. Copyright ©2013 Molecular Connections Pvt. Ltd. All rights Reserved 14 MCPaIRS TM – Search
  • 15. Copyright ©2013 Molecular Connections Pvt. Ltd. All rights Reserved 15 MCPaIRS TM – View Patent
  • 16. Copyright ©2013 Molecular Connections Pvt. Ltd. All rights Reserved 16 Demo of actual full text document
  • 17. Benefits of Semantic Search Cartridge Enabled MCPaIRS TM  All results in a single query  Automatic Expansion of the query with all possible synonyms  Broadening of the search query  Complex search queries possible  All the synonyms highlighted 17
  • 18. Automatic Expansion of the query with all possible synonyms 18
  • 19. Automatic Expansion of the query with all possible synonyms
  • 20. Multiple key-words highlighted for the search: VEGF
  • 21. Complex Queries can be performed by using operators Boolean search is performed
  • 22. Sample queries with Semantic Search Cartridge No Query No of results in iPairs No of results in mcpairs No of results in mcpairs with semantic search cartridge 1 Salbutamol 27 1560 2548 2 Amethocaine 0 58 954 3 Diazepam 4 1725 2146 4 Valsartan 84 1372 1429 5 Imatinib 65 1703 1999 6 Tamoxifen 16 3950 4190 7 Aspirin 61 5679 6427 8 Paracetamol 74 1161 3696 9 MyoD 2 130 138 10 Pax3 1 49 56 11 Sox9 0 39 58 12 FGF10 0 43 131 13 VEGF 192 4808 6058 14 BMP2 5 137 214 15 Salbutamol AND CD48 0 0 4
  • 23. Copyright ©2013 Molecular Connections Pvt. Ltd. All rights Reserved 23 Benefit - Identifying Related Patents A B Proteins Chemicals Indications ……. Proteins Chemicals Indications ……. Similarity Score Relatedness
  • 24. Copyright ©2013 Molecular Connections Pvt. Ltd. All rights Reserved 24 Content Enrichment Approaches • Manual  high quality, costly, not scalable, slow • Automated  fast, quality below par, cost effective, scalable • Hybrid  high quality, cost effective, scalable, reasonable speed Molecular Connections is a pioneer in the use of hybrid approach to content enrichment
  • 25. Copyright ©2013 Molecular Connections Pvt. Ltd. All rights Reserved 25 Key Takeaways  Content Enrichment can improve search and retrieval immensely ?? CE can be looked at various levels - Biology / chemistry / both / authors etc.  You can bring the Web into the document through CE - e.g. Augmented reference cards  Growing Adoption of Content Enrichment - Publishing (Early adopters) - Patents
  • 26. Copyright ©2013 Molecular Connections Pvt. Ltd. All rights Reserved 26 Thank You Molecular Connections www.molecularconnections.com