SlideShare a Scribd company logo
1 of 19
Download to read offline
Rapid Delivery Of Business Intelligence
Applications Through R&D Search Experience
Search Solutions 2013
Tuesday October 8th

Nick Brown, Susan Donohoe, Rob Hernandez, Youssef
Belghali, Nasko Radev, Steve Woodward & Akshay Tankhiwale
AstraZeneca
Health Connect Us All
AstraZeneca is a biopharmaceutical company with Research and Development at
its core. Our business is providing innovative, effective medicines that make a real
difference to patients. We focus on six important areas of healthcare.

In R&D, we invest over $4 billion every year and with over 15,000 professionals
in 8 countries, on 3 continents, accessing and leveraging information is key.
Distributed R&D
Leads to Information Silos

Photo Credit: http://cdn-wac.emirates247.com/polopoly_fs/1.509718.1370831315!/image/256556252.jpg
Existing Semantic
Search Architecture
5. Business
Applications

4. Insight & Analytics

Auto-Tagging
Auto-Class

NLP
Rules Match

2. Ontology
Enrich

Text Mining

Normalization

Entity Extraction

3. Search
Index

Cluster

Publications

Trials

Patents

Conferences

Grants

News

RDF

data

CRM

SharePoint

PKT

LDMS

Yammer

Wiki

File shares

1. ETL
Unstructured External

Oracle

Data Marts

structured Internal

Unstructured Internal
Strategic Approach
Technology Stack
1

Connectors to any unstructured and structured sources

2
1

Accurate semantic mark-up with text-mining capabilities

3
1

Intelligent, intuitive search that hides the advanced features

4
1

Generate insight & analytics across information types

5
1

Rapidly deliver mobile business intelligence applications

3 months ago, we licensed Sinequa for our R&D search platform.
Advanced Widgets
Built To Be Put Together Easily In Different Ways

Photo Credit: http://media2.ph.88db.com/DB88UploadFiles_med2/2010/05/08/1909CD39-2608-4333-B7D5-16F79E0FA1D4.JPG
Virtual Team
Connected By Passion

To build our applications rapidly, we supplement our team with external experts,
including running competitions on open innovation platform like TopCoder.
External Data Sources
Easily Connected
25M

80M
Publications

60M

Patents

Clinical Trial Registries

Grants

Conferences

In R&D, we have over 200 million documents in publications, patents and
conference abstracts. Having a historical perspective can help when designing
business intelligence applications like breaking science or target selection
Internal Data Sources
Security & Access Control

Department
Fileshares

R&D Wiki

The richest, most valuable content is our internal data sources. Our systems
adheres to our security controls – you only find what you have access to…
R&D Search
Screenshot

We automatically search other synonyms like
Vandetanib and internal identifiers such as ZD6474
Top hits are now key relevant scientific documents
R&D vocabs are dispayed, from brand, disease,
scientists & mechanisms such as EGFR and VEGFR
R&D Search can handle a number of languages
R&D Vocabularies
Screenshot
Focused on vocabularies that are important to scientists :Drugs
People
Cell types

Diseases
Companies
Technology

Genes
Organisms
Skills

MicroRNA
Cell-lines
Safety Mechanisms

Developed new approaches within Sinequa to allow easy vocabulary curation. -Tagging scores allow us to identify documents with no tags or too many tags
Hiearchical synonym trees help to rapidly identify problem terms like ‘when’
Individual documents display number of synonym occurrences.
R&D Department
Screenshot

Teams can search across this rich internal content and find not just relevant
documents but also other drugs, mechanisms and even people to help.
R&D Journal
Screenshot

Developed to look like an external scientific journal,
R&D Journal provides a mechanism within
AstraZeneca where our scientists can publish articles
and experimental reports that can be shared and
pushed out to other members of the department
Other users can add ratings and comments, as well as
sign up for alerts and search across this content
R&D Labs
Mobile Access To Apps

Currently piloting Amazon web-services with Ping Federate (authentication) and Data
Power (access), to enabled mobile applications to query against our search index:
drug repositioning
conference capture

life cycle management
breaking science

external KOL identification
chemical search
R&D Experts
Find & Connect within AZ & MedImmune

Experts allows R&D to find and connect to the
key experts on any scientific topic.








Minimise duplication
Increase cross R&D collaboration
Automatically updated
Recommend new contacts
Curate & advertise yourself
Social network analysis & visual
connectivity
Next Steps
More R&D Indexing

Photo credit: http://chamorrobible.org/images/photos/gpw-200904-NASA-ISS016-E-37922-The-World-Dubai-United-Arab-Emirates-20080403-large.jpg
Next Steps
More Business Applications

Deliver applications that use analytics across the entire document index such as
drug repositioning and external KOL identification, made mobile.
Next Steps
More Search Widgets
Further collaborate with Sinequa to implement other features around
visualisation, feedback & commenting and new search relevancy algorithms
Thank You
Acknowledgements & Questions
Delivering this in the past 12 weeks wouldn’t have been possible without an
enormous amount of support from many people, not all listed here today.
Sinequa: Christian Sestier, Tim Bell, Xavier Pornain, Ariane Cavet, Frédéric
Lardé, Olivier Gaunet & Alex Bilger
Pebble Code: John Mildinhall, Tak Tran, Mark Durrant & Toby Hunt
AstraZeneca: Youssef Belghali, Tim McCoy, David Rafferty, Nick Barlow, Tania
Hide, Lisa Taylor, Hari Radhakrishnan, Adel Kassim & Pete Dudek.
Finally many thanks to Sebastian Lefebvre, Jason Swift & Paul Fitzpatrick for
sponsoring and helping us to get this project launched.

More Related Content

What's hot

Nick Brown - Camp Digital 2016
Nick Brown - Camp Digital 2016Nick Brown - Camp Digital 2016
Nick Brown - Camp Digital 2016Nexer Digital
 
AI today and its power to transform healthcare
AI today and its power to transform healthcareAI today and its power to transform healthcare
AI today and its power to transform healthcareBonnie Cheuk
 
Artificial Intelligence, Predictive Modelling and Chatbots: Applications in P...
Artificial Intelligence, Predictive Modelling and Chatbots: Applications in P...Artificial Intelligence, Predictive Modelling and Chatbots: Applications in P...
Artificial Intelligence, Predictive Modelling and Chatbots: Applications in P...Nick Brown
 
AstraZeneca - chatbot and applications in pharmaceuticals
AstraZeneca -  chatbot and applications in pharmaceuticalsAstraZeneca -  chatbot and applications in pharmaceuticals
AstraZeneca - chatbot and applications in pharmaceuticalsHari Prasad
 
Tech Incubation. Delivering an enterprise platform on AWS
Tech Incubation. Delivering an enterprise platform on AWSTech Incubation. Delivering an enterprise platform on AWS
Tech Incubation. Delivering an enterprise platform on AWSNick Brown
 
Chris Day VP IT Transformation and Office of the CIO at AstraZeneca
Chris Day VP IT Transformation and Office of the CIO at AstraZenecaChris Day VP IT Transformation and Office of the CIO at AstraZeneca
Chris Day VP IT Transformation and Office of the CIO at AstraZenecaSteve Ashton
 
The journey to world class IT - Astrazeneca, Chris Day
The journey to world class IT - Astrazeneca, Chris DayThe journey to world class IT - Astrazeneca, Chris Day
The journey to world class IT - Astrazeneca, Chris DayGlobal Business Intelligence
 
Extending Enterprise Search at AstraZeneca
Extending Enterprise Search at AstraZenecaExtending Enterprise Search at AstraZeneca
Extending Enterprise Search at AstraZenecaSteve Woodward
 
Genome Simulation & Applications: Use of Managed Distributed Compute Infrastr...
Genome Simulation & Applications: Use of Managed Distributed Compute Infrastr...Genome Simulation & Applications: Use of Managed Distributed Compute Infrastr...
Genome Simulation & Applications: Use of Managed Distributed Compute Infrastr...Nick Brown
 
Data-Driven is Passé: Transform Into An Insights-Driven Enterprise
Data-Driven is Passé: Transform Into An Insights-Driven EnterpriseData-Driven is Passé: Transform Into An Insights-Driven Enterprise
Data-Driven is Passé: Transform Into An Insights-Driven EnterpriseDenodo
 
Embracing Cloud Deployment for Big Data and DevOps
Embracing Cloud Deployment for Big Data and DevOpsEmbracing Cloud Deployment for Big Data and DevOps
Embracing Cloud Deployment for Big Data and DevOpsSteve Woodward
 
IBM_Analytics_eBook_07 15 16
IBM_Analytics_eBook_07 15 16IBM_Analytics_eBook_07 15 16
IBM_Analytics_eBook_07 15 16Volkan Tekeli
 
13 2792 big-data_keynote_presentation_finalpass_05_d_v02
13 2792 big-data_keynote_presentation_finalpass_05_d_v0213 2792 big-data_keynote_presentation_finalpass_05_d_v02
13 2792 big-data_keynote_presentation_finalpass_05_d_v02Erin Kerrigan
 
1° Sessione Oracle CRUI: Analytics Data Lab, the power of Big Data Investiga...
1° Sessione Oracle CRUI: Analytics Data Lab,  the power of Big Data Investiga...1° Sessione Oracle CRUI: Analytics Data Lab,  the power of Big Data Investiga...
1° Sessione Oracle CRUI: Analytics Data Lab, the power of Big Data Investiga...Jürgen Ambrosi
 
The State of Big Data Adoption: A Glance at Top Industries Adopting Big Data ...
The State of Big Data Adoption: A Glance at Top Industries Adopting Big Data ...The State of Big Data Adoption: A Glance at Top Industries Adopting Big Data ...
The State of Big Data Adoption: A Glance at Top Industries Adopting Big Data ...Datameer
 
4° Sessione - Telemetria e internet delle cose nell'ambito della ricerca
4° Sessione - Telemetria e internet delle cose nell'ambito della ricerca4° Sessione - Telemetria e internet delle cose nell'ambito della ricerca
4° Sessione - Telemetria e internet delle cose nell'ambito della ricercaJürgen Ambrosi
 
Streaming Cyber Security into Graph: Accelerating Data into DataStax Graph an...
Streaming Cyber Security into Graph: Accelerating Data into DataStax Graph an...Streaming Cyber Security into Graph: Accelerating Data into DataStax Graph an...
Streaming Cyber Security into Graph: Accelerating Data into DataStax Graph an...Keith Kraus
 
Ai design sprint - Finance - Wealth management
Ai design sprint  - Finance - Wealth managementAi design sprint  - Finance - Wealth management
Ai design sprint - Finance - Wealth managementChinmay Patel
 

What's hot (20)

Nick Brown - Camp Digital 2016
Nick Brown - Camp Digital 2016Nick Brown - Camp Digital 2016
Nick Brown - Camp Digital 2016
 
AI today and its power to transform healthcare
AI today and its power to transform healthcareAI today and its power to transform healthcare
AI today and its power to transform healthcare
 
Artificial Intelligence, Predictive Modelling and Chatbots: Applications in P...
Artificial Intelligence, Predictive Modelling and Chatbots: Applications in P...Artificial Intelligence, Predictive Modelling and Chatbots: Applications in P...
Artificial Intelligence, Predictive Modelling and Chatbots: Applications in P...
 
AstraZeneca - chatbot and applications in pharmaceuticals
AstraZeneca -  chatbot and applications in pharmaceuticalsAstraZeneca -  chatbot and applications in pharmaceuticals
AstraZeneca - chatbot and applications in pharmaceuticals
 
Tech Incubation. Delivering an enterprise platform on AWS
Tech Incubation. Delivering an enterprise platform on AWSTech Incubation. Delivering an enterprise platform on AWS
Tech Incubation. Delivering an enterprise platform on AWS
 
Chris Day VP IT Transformation and Office of the CIO at AstraZeneca
Chris Day VP IT Transformation and Office of the CIO at AstraZenecaChris Day VP IT Transformation and Office of the CIO at AstraZeneca
Chris Day VP IT Transformation and Office of the CIO at AstraZeneca
 
The journey to world class IT - Astrazeneca, Chris Day
The journey to world class IT - Astrazeneca, Chris DayThe journey to world class IT - Astrazeneca, Chris Day
The journey to world class IT - Astrazeneca, Chris Day
 
Extending Enterprise Search at AstraZeneca
Extending Enterprise Search at AstraZenecaExtending Enterprise Search at AstraZeneca
Extending Enterprise Search at AstraZeneca
 
Genome Simulation & Applications: Use of Managed Distributed Compute Infrastr...
Genome Simulation & Applications: Use of Managed Distributed Compute Infrastr...Genome Simulation & Applications: Use of Managed Distributed Compute Infrastr...
Genome Simulation & Applications: Use of Managed Distributed Compute Infrastr...
 
Data-Driven is Passé: Transform Into An Insights-Driven Enterprise
Data-Driven is Passé: Transform Into An Insights-Driven EnterpriseData-Driven is Passé: Transform Into An Insights-Driven Enterprise
Data-Driven is Passé: Transform Into An Insights-Driven Enterprise
 
Embracing Cloud Deployment for Big Data and DevOps
Embracing Cloud Deployment for Big Data and DevOpsEmbracing Cloud Deployment for Big Data and DevOps
Embracing Cloud Deployment for Big Data and DevOps
 
IBM_Analytics_eBook_07 15 16
IBM_Analytics_eBook_07 15 16IBM_Analytics_eBook_07 15 16
IBM_Analytics_eBook_07 15 16
 
13 2792 big-data_keynote_presentation_finalpass_05_d_v02
13 2792 big-data_keynote_presentation_finalpass_05_d_v0213 2792 big-data_keynote_presentation_finalpass_05_d_v02
13 2792 big-data_keynote_presentation_finalpass_05_d_v02
 
1° Sessione Oracle CRUI: Analytics Data Lab, the power of Big Data Investiga...
1° Sessione Oracle CRUI: Analytics Data Lab,  the power of Big Data Investiga...1° Sessione Oracle CRUI: Analytics Data Lab,  the power of Big Data Investiga...
1° Sessione Oracle CRUI: Analytics Data Lab, the power of Big Data Investiga...
 
The State of Big Data Adoption: A Glance at Top Industries Adopting Big Data ...
The State of Big Data Adoption: A Glance at Top Industries Adopting Big Data ...The State of Big Data Adoption: A Glance at Top Industries Adopting Big Data ...
The State of Big Data Adoption: A Glance at Top Industries Adopting Big Data ...
 
4° Sessione - Telemetria e internet delle cose nell'ambito della ricerca
4° Sessione - Telemetria e internet delle cose nell'ambito della ricerca4° Sessione - Telemetria e internet delle cose nell'ambito della ricerca
4° Sessione - Telemetria e internet delle cose nell'ambito della ricerca
 
1645 dyskant using our laptop
1645 dyskant using our laptop1645 dyskant using our laptop
1645 dyskant using our laptop
 
IDOL presentation
IDOL presentationIDOL presentation
IDOL presentation
 
Streaming Cyber Security into Graph: Accelerating Data into DataStax Graph an...
Streaming Cyber Security into Graph: Accelerating Data into DataStax Graph an...Streaming Cyber Security into Graph: Accelerating Data into DataStax Graph an...
Streaming Cyber Security into Graph: Accelerating Data into DataStax Graph an...
 
Ai design sprint - Finance - Wealth management
Ai design sprint  - Finance - Wealth managementAi design sprint  - Finance - Wealth management
Ai design sprint - Finance - Wealth management
 

Similar to R&D Search 081013 Search Solutions Conference

LFS302_Real-World Evidence Platform to Enable Therapeutic Innovation
LFS302_Real-World Evidence Platform to Enable Therapeutic InnovationLFS302_Real-World Evidence Platform to Enable Therapeutic Innovation
LFS302_Real-World Evidence Platform to Enable Therapeutic InnovationAmazon Web Services
 
How Big Data can drive innovative technologies and new approaches in large or...
How Big Data can drive innovative technologies and new approaches in large or...How Big Data can drive innovative technologies and new approaches in large or...
How Big Data can drive innovative technologies and new approaches in large or...Nick Brown
 
Bio IT World 2019 - AI For Healthcare - Simon Taylor, Lucidworks
Bio IT World 2019 - AI For Healthcare - Simon Taylor, LucidworksBio IT World 2019 - AI For Healthcare - Simon Taylor, Lucidworks
Bio IT World 2019 - AI For Healthcare - Simon Taylor, LucidworksLucidworks
 
Questions On The And Football
Questions On The And FootballQuestions On The And Football
Questions On The And FootballAmanda Gray
 
FocalCxm presentation on improving productivity in life sciences research
FocalCxm presentation on improving productivity in life sciences researchFocalCxm presentation on improving productivity in life sciences research
FocalCxm presentation on improving productivity in life sciences researchFOCALCXM
 
Enabling patient-centricity-pfizer
Enabling patient-centricity-pfizerEnabling patient-centricity-pfizer
Enabling patient-centricity-pfizerDavid Teszler
 
Enabling Patient Centricity for Pfizer through AWS Cloud (LFS301-S-i) - AWS r...
Enabling Patient Centricity for Pfizer through AWS Cloud (LFS301-S-i) - AWS r...Enabling Patient Centricity for Pfizer through AWS Cloud (LFS301-S-i) - AWS r...
Enabling Patient Centricity for Pfizer through AWS Cloud (LFS301-S-i) - AWS r...Amazon Web Services
 
2011-12-02 Open PHACTS at STM Innovation
2011-12-02 Open PHACTS at STM Innovation2011-12-02 Open PHACTS at STM Innovation
2011-12-02 Open PHACTS at STM Innovationopen_phacts
 
MPCA-SAS-innovators-flight-plan-ai.pdf
MPCA-SAS-innovators-flight-plan-ai.pdfMPCA-SAS-innovators-flight-plan-ai.pdf
MPCA-SAS-innovators-flight-plan-ai.pdfProsper85
 
Data & Technology in Clinical Trials
Data & Technology in Clinical TrialsData & Technology in Clinical Trials
Data & Technology in Clinical TrialsNassim Azzi, MBA
 
How Semantic Technology Will Enrich Our Lives: Scientific Research, Advertisi...
How Semantic Technology Will Enrich Our Lives: Scientific Research, Advertisi...How Semantic Technology Will Enrich Our Lives: Scientific Research, Advertisi...
How Semantic Technology Will Enrich Our Lives: Scientific Research, Advertisi...Seth Grimes
 
CMIT 321 Executive Proposal ProjectThe purpose of this project is .docx
CMIT 321 Executive Proposal ProjectThe purpose of this project is .docxCMIT 321 Executive Proposal ProjectThe purpose of this project is .docx
CMIT 321 Executive Proposal ProjectThe purpose of this project is .docxfathwaitewalter
 
CMIT 321 Executive Proposal ProjectThe purpose of this project is .docx
CMIT 321 Executive Proposal ProjectThe purpose of this project is .docxCMIT 321 Executive Proposal ProjectThe purpose of this project is .docx
CMIT 321 Executive Proposal ProjectThe purpose of this project is .docxdrennanmicah
 
CMIT 321 EXECUTIVE PROPOSAL PROJECT
CMIT 321 EXECUTIVE PROPOSAL PROJECTCMIT 321 EXECUTIVE PROPOSAL PROJECT
CMIT 321 EXECUTIVE PROPOSAL PROJECTHamesKellor
 
3RDi - Semantic Search Tool Brochure
3RDi - Semantic Search Tool Brochure3RDi - Semantic Search Tool Brochure
3RDi - Semantic Search Tool BrochureThe Digital Group
 
Nick Brown Drug Repositioning Informatics
Nick Brown Drug Repositioning InformaticsNick Brown Drug Repositioning Informatics
Nick Brown Drug Repositioning InformaticsNick Brown
 
London Online 2008
London Online 2008London Online 2008
London Online 2008Joe Buzzanga
 

Similar to R&D Search 081013 Search Solutions Conference (20)

LFS302_Real-World Evidence Platform to Enable Therapeutic Innovation
LFS302_Real-World Evidence Platform to Enable Therapeutic InnovationLFS302_Real-World Evidence Platform to Enable Therapeutic Innovation
LFS302_Real-World Evidence Platform to Enable Therapeutic Innovation
 
How Big Data can drive innovative technologies and new approaches in large or...
How Big Data can drive innovative technologies and new approaches in large or...How Big Data can drive innovative technologies and new approaches in large or...
How Big Data can drive innovative technologies and new approaches in large or...
 
UKSG 2018 Breakout - Organisation Identifier Registry update - Pentz
UKSG 2018 Breakout - Organisation Identifier Registry update - PentzUKSG 2018 Breakout - Organisation Identifier Registry update - Pentz
UKSG 2018 Breakout - Organisation Identifier Registry update - Pentz
 
Bio IT World 2019 - AI For Healthcare - Simon Taylor, Lucidworks
Bio IT World 2019 - AI For Healthcare - Simon Taylor, LucidworksBio IT World 2019 - AI For Healthcare - Simon Taylor, Lucidworks
Bio IT World 2019 - AI For Healthcare - Simon Taylor, Lucidworks
 
Questions On The And Football
Questions On The And FootballQuestions On The And Football
Questions On The And Football
 
FocalCxm presentation on improving productivity in life sciences research
FocalCxm presentation on improving productivity in life sciences researchFocalCxm presentation on improving productivity in life sciences research
FocalCxm presentation on improving productivity in life sciences research
 
Enabling patient-centricity-pfizer
Enabling patient-centricity-pfizerEnabling patient-centricity-pfizer
Enabling patient-centricity-pfizer
 
Enabling Patient Centricity for Pfizer through AWS Cloud (LFS301-S-i) - AWS r...
Enabling Patient Centricity for Pfizer through AWS Cloud (LFS301-S-i) - AWS r...Enabling Patient Centricity for Pfizer through AWS Cloud (LFS301-S-i) - AWS r...
Enabling Patient Centricity for Pfizer through AWS Cloud (LFS301-S-i) - AWS r...
 
2011-12-02 Open PHACTS at STM Innovation
2011-12-02 Open PHACTS at STM Innovation2011-12-02 Open PHACTS at STM Innovation
2011-12-02 Open PHACTS at STM Innovation
 
MPCA-SAS-innovators-flight-plan-ai.pdf
MPCA-SAS-innovators-flight-plan-ai.pdfMPCA-SAS-innovators-flight-plan-ai.pdf
MPCA-SAS-innovators-flight-plan-ai.pdf
 
Data & Technology in Clinical Trials
Data & Technology in Clinical TrialsData & Technology in Clinical Trials
Data & Technology in Clinical Trials
 
How Semantic Technology Will Enrich Our Lives: Scientific Research, Advertisi...
How Semantic Technology Will Enrich Our Lives: Scientific Research, Advertisi...How Semantic Technology Will Enrich Our Lives: Scientific Research, Advertisi...
How Semantic Technology Will Enrich Our Lives: Scientific Research, Advertisi...
 
CMIT 321 Executive Proposal ProjectThe purpose of this project is .docx
CMIT 321 Executive Proposal ProjectThe purpose of this project is .docxCMIT 321 Executive Proposal ProjectThe purpose of this project is .docx
CMIT 321 Executive Proposal ProjectThe purpose of this project is .docx
 
CMIT 321 Executive Proposal ProjectThe purpose of this project is .docx
CMIT 321 Executive Proposal ProjectThe purpose of this project is .docxCMIT 321 Executive Proposal ProjectThe purpose of this project is .docx
CMIT 321 Executive Proposal ProjectThe purpose of this project is .docx
 
CMIT 321 EXECUTIVE PROPOSAL PROJECT
CMIT 321 EXECUTIVE PROPOSAL PROJECTCMIT 321 EXECUTIVE PROPOSAL PROJECT
CMIT 321 EXECUTIVE PROPOSAL PROJECT
 
3RDi - Semantic Search Tool Brochure
3RDi - Semantic Search Tool Brochure3RDi - Semantic Search Tool Brochure
3RDi - Semantic Search Tool Brochure
 
Why Quertle?
Why Quertle?Why Quertle?
Why Quertle?
 
Popsi Cube 2011
Popsi Cube 2011Popsi Cube 2011
Popsi Cube 2011
 
Nick Brown Drug Repositioning Informatics
Nick Brown Drug Repositioning InformaticsNick Brown Drug Repositioning Informatics
Nick Brown Drug Repositioning Informatics
 
London Online 2008
London Online 2008London Online 2008
London Online 2008
 

Recently uploaded

Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embeddingZilliz
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Vector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesVector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesZilliz
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostZilliz
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 

Recently uploaded (20)

E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embedding
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Vector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesVector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector Databases
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 

R&D Search 081013 Search Solutions Conference

  • 1. Rapid Delivery Of Business Intelligence Applications Through R&D Search Experience Search Solutions 2013 Tuesday October 8th Nick Brown, Susan Donohoe, Rob Hernandez, Youssef Belghali, Nasko Radev, Steve Woodward & Akshay Tankhiwale
  • 2. AstraZeneca Health Connect Us All AstraZeneca is a biopharmaceutical company with Research and Development at its core. Our business is providing innovative, effective medicines that make a real difference to patients. We focus on six important areas of healthcare. In R&D, we invest over $4 billion every year and with over 15,000 professionals in 8 countries, on 3 continents, accessing and leveraging information is key.
  • 3. Distributed R&D Leads to Information Silos Photo Credit: http://cdn-wac.emirates247.com/polopoly_fs/1.509718.1370831315!/image/256556252.jpg
  • 4. Existing Semantic Search Architecture 5. Business Applications 4. Insight & Analytics Auto-Tagging Auto-Class NLP Rules Match 2. Ontology Enrich Text Mining Normalization Entity Extraction 3. Search Index Cluster Publications Trials Patents Conferences Grants News RDF data CRM SharePoint PKT LDMS Yammer Wiki File shares 1. ETL Unstructured External Oracle Data Marts structured Internal Unstructured Internal
  • 5. Strategic Approach Technology Stack 1 Connectors to any unstructured and structured sources 2 1 Accurate semantic mark-up with text-mining capabilities 3 1 Intelligent, intuitive search that hides the advanced features 4 1 Generate insight & analytics across information types 5 1 Rapidly deliver mobile business intelligence applications 3 months ago, we licensed Sinequa for our R&D search platform.
  • 6. Advanced Widgets Built To Be Put Together Easily In Different Ways Photo Credit: http://media2.ph.88db.com/DB88UploadFiles_med2/2010/05/08/1909CD39-2608-4333-B7D5-16F79E0FA1D4.JPG
  • 7. Virtual Team Connected By Passion To build our applications rapidly, we supplement our team with external experts, including running competitions on open innovation platform like TopCoder.
  • 8. External Data Sources Easily Connected 25M 80M Publications 60M Patents Clinical Trial Registries Grants Conferences In R&D, we have over 200 million documents in publications, patents and conference abstracts. Having a historical perspective can help when designing business intelligence applications like breaking science or target selection
  • 9. Internal Data Sources Security & Access Control Department Fileshares R&D Wiki The richest, most valuable content is our internal data sources. Our systems adheres to our security controls – you only find what you have access to…
  • 10. R&D Search Screenshot We automatically search other synonyms like Vandetanib and internal identifiers such as ZD6474 Top hits are now key relevant scientific documents R&D vocabs are dispayed, from brand, disease, scientists & mechanisms such as EGFR and VEGFR R&D Search can handle a number of languages
  • 11. R&D Vocabularies Screenshot Focused on vocabularies that are important to scientists :Drugs People Cell types Diseases Companies Technology Genes Organisms Skills MicroRNA Cell-lines Safety Mechanisms Developed new approaches within Sinequa to allow easy vocabulary curation. -Tagging scores allow us to identify documents with no tags or too many tags Hiearchical synonym trees help to rapidly identify problem terms like ‘when’ Individual documents display number of synonym occurrences.
  • 12. R&D Department Screenshot Teams can search across this rich internal content and find not just relevant documents but also other drugs, mechanisms and even people to help.
  • 13. R&D Journal Screenshot Developed to look like an external scientific journal, R&D Journal provides a mechanism within AstraZeneca where our scientists can publish articles and experimental reports that can be shared and pushed out to other members of the department Other users can add ratings and comments, as well as sign up for alerts and search across this content
  • 14. R&D Labs Mobile Access To Apps Currently piloting Amazon web-services with Ping Federate (authentication) and Data Power (access), to enabled mobile applications to query against our search index: drug repositioning conference capture life cycle management breaking science external KOL identification chemical search
  • 15. R&D Experts Find & Connect within AZ & MedImmune Experts allows R&D to find and connect to the key experts on any scientific topic.       Minimise duplication Increase cross R&D collaboration Automatically updated Recommend new contacts Curate & advertise yourself Social network analysis & visual connectivity
  • 16. Next Steps More R&D Indexing Photo credit: http://chamorrobible.org/images/photos/gpw-200904-NASA-ISS016-E-37922-The-World-Dubai-United-Arab-Emirates-20080403-large.jpg
  • 17. Next Steps More Business Applications Deliver applications that use analytics across the entire document index such as drug repositioning and external KOL identification, made mobile.
  • 18. Next Steps More Search Widgets Further collaborate with Sinequa to implement other features around visualisation, feedback & commenting and new search relevancy algorithms
  • 19. Thank You Acknowledgements & Questions Delivering this in the past 12 weeks wouldn’t have been possible without an enormous amount of support from many people, not all listed here today. Sinequa: Christian Sestier, Tim Bell, Xavier Pornain, Ariane Cavet, Frédéric Lardé, Olivier Gaunet & Alex Bilger Pebble Code: John Mildinhall, Tak Tran, Mark Durrant & Toby Hunt AstraZeneca: Youssef Belghali, Tim McCoy, David Rafferty, Nick Barlow, Tania Hide, Lisa Taylor, Hari Radhakrishnan, Adel Kassim & Pete Dudek. Finally many thanks to Sebastian Lefebvre, Jason Swift & Paul Fitzpatrick for sponsoring and helping us to get this project launched.