SlideShare une entreprise Scribd logo
1  sur  23
Télécharger pour lire hors ligne
V.2.2
Eric Little, PhD
Chief Data Officer
OSTHUS
eric.little@osthus.com
Data Lifecycle Management
Across The Enterprise
Slide 2
Pharma invests in R&D and has to
make $ back over subsequent years
 Most R&D will fail, so risk is high
Law of Diminishing Returns
 R&D productivity is declining
 Harder treatments have greater costs,
potentially lower returns
 Drugs with minimal improvements
(not as many blockbusters + generics)
The Pharma Industry Is At A Tipping Point
From: Kelvin Stott - https://endpts.com/pharmas-broken-business-model-
an-industry-on-the-brink-of-terminal-decline/
Slide 3
Reduce R&D costs through better use of data
 Many experiments are re-run because scientists cannot find existing data
 Costs of system integration is much higher than data integration
 Standardization upstream can significantly impact costs downstream
Once data is available – automate as much as possible
Connect your internal data to other external data sources
 Many items exist in open source that can be modified easier than built from scratch
How To Help Remedy the Situation
Use the data you have before you generate more!
Start with reoccurring tasks – workflows, models,
query patterns, analytics, etc., then build out!
Don’t reinvent the wheel! Build data communities!
Slide 4
THE MOVE FROM BIG DATA TO
BIG ANALYSIS
STATISTICAL
SEMANTICS
MACHINE
LEARNING
REASONING
Slide 5
Moving to Smart Data
Smart data can be added to existing systems
 Does not require replacement of existing tech
Smart data provides a separation of:
 Model Layer
 Data Layer
Link to the model layer
 Leave data in place
 Smart data links information from the models to instance-level data
Smart Data uses metadata in order to capture context about data
Slide 6
Semantic Spectrum of Knowledge Organization Systems
• Deborah L. McGuinness. "Ontologies Come of Age". In Dieter Fensel, Jim Hendler, Henry Lieberman, and Wolfgang Wahlster, editors. Spinning the Semantic Web: Bringing the World Wide Web to Its Full Potential. MIT Press, 2003.
• Michael Uschold and Michael Gruninger “Ontologies and semantics for seamless connectivity” SIGMOD Rec. 33, 4 (December 2004), 58-64. DOI=http://dx.doi.org/10.1145/1041410.1041420
• Leo Obrst “The Ontology Spectrum”. Book section in of Roberto Poli, Michael Healy, Achilles Kameas “Theory and Applications of Ontology: Computer Applications”. Springer Netherlands, 17 Sep 2010.
• Leo Obrst and Mills Davis "Semantic Wave 2008 Report: Industry Roadmap to Web 3.0 & Multibillion Dollar Market Opportunities”. 2008.
Sources
Slide 7
Advantages of Using This Tech
Use cases where customers report distinct improvement:
 Better defined terms
• Differentiates between Entities and Labels – more specific data dictionary
 Better taxonomic structure
• Hierarchies can be accurately captured – not buried in incorrect tables
 Query Federation
• Can easily use multiple data sources (integration)
 Query Faceting
• Query results can be easily refined (and shared)
 Better use of metadata
• Provides context for users
• Raw data is more valuable over time
 Makes data actionable across an enterprise
• Moves from local data (on people’s machines, in their heads) to explicit sharable resources
• Adding SMART DATA to BIG DATA provides the means to access and use the data
• Requires combining logical data with statistical data in order to find patterns of
interest inside of large data sets
Slide 8
A Semantic Framework can connect the entire enterprise using a common semantics
The Semantic Hub should only focus on metadata (not instance level data)
Benefits: Common Terms, Models, Queries, Rules and Results (End-to-End)
Integrating Data Across the Enterprise
Lab Instruments Clinical Trials Regulatory AffairsProduction eArchiving
Slide 9
Lab Instrument Use Case –
Allotrope Framework
HPLC – UV
Mobile Phase Selection
Slide 10
Ontology for HPLC Example (Allotrope)
resultdevice
material
process
Slide 11
Clinical Trials Use Case –
Astra Zeneca & MedImmune
Slide 12
Connecting The Dots Across AstraZeneca & MedImmune
For Clinical Trials
Slide 13
FAIR Principles Bring Together Clinical Trials Data Across Phases
Slide 14
Domain Knowledge Is Captured In Models
Slide 15
Production Use Case –
Manufacturing Data Integration
Slide 16
Often times R&D and manufacturing cannot easily share data
Competing systems can evolve which cause incompatibilities
Manufacturing data is often lower less complex than R&D data, but significantly
higher in throughput
 QA/QC plays a major role
 Far more interpretation in R&D
 Manufacturing needs results fast
• Alarms
• Trends
 Manufacturing data is less retrospective
Manufacturing Data Vs. R&D Data
Slide 17
Regulatory Use Case –
Unstructured Data Integration
Slide 18
Regulatory compliance requires accessing and mining unstructured data
Linking unstructured data to other data provides significant advantages
 Text to DB links unstructured and structured data
 Text to Public Data Sources leverages open source research
Regulatory Compliance
Regulatory Documentation
Slide 19
E-Archiving: Managing Data
Over Long Lifecycles
Slide 20
Data is made available for easier search and indexing (even after long periods of time)
Archiving is no longer a “vault” concept but is integrated within the Data Mgt. Lifecycle
E-Archiving Using the Allotrope Data Framework
Slide 21
Big Analysis Requires Hybrid Architectures
Semantic DBs
Unstructured Docs
Structured Data
Cloud DBs (NoSQL)Analytics
Dashboards & Reports
Integration Layer
Slide 22
Data Science (machine learning, text analytics, clustering etc.)
FAIR Data Is Now Accessible For Advanced Analytics
Linked Open Data
& Open APIs
Semantic
Graph DB
(Knowledge Graph)
Operational DBs
…
Unstructured
Documents
Analytics Tools
simulations
statistics
reasoning
Visualization
dashboards
exploration
search
…
Semi-structured
Data
Instrument
Data
Lightweight Semantic Integration Layer
(semantic RMDM, APIs, semantic indexing, data annotation, catalogues, meta data and linking)
Reporting
regulatory
internal
external
Slide 23
CONNECTING DATA, PEOPLE AND ORGANIZATIONS
Contact Information:
Email: eric.little@osthus.com
Web: www.osthus.com
www.biganalysis.com
Twitter: OntoEric

Contenu connexe

Tendances

Acceliant white paper_edc_and_epro
Acceliant white paper_edc_and_eproAcceliant white paper_edc_and_epro
Acceliant white paper_edc_and_epro
Trianz
 
Finding common ground: integrating the eagle-i and VIVO ontologies
Finding common ground: integrating the eagle-i and VIVO ontologiesFinding common ground: integrating the eagle-i and VIVO ontologies
Finding common ground: integrating the eagle-i and VIVO ontologies
mhaendel
 
Sowmya Raghavan Strand Life
Sowmya Raghavan Strand LifeSowmya Raghavan Strand Life
Sowmya Raghavan Strand Life
EmTech
 

Tendances (20)

Faster R & D Analysis Tool - TRG
Faster R & D Analysis Tool - TRG Faster R & D Analysis Tool - TRG
Faster R & D Analysis Tool - TRG
 
Big Data & ML for Clinical Data
Big Data & ML for Clinical DataBig Data & ML for Clinical Data
Big Data & ML for Clinical Data
 
5th Forum on Laboratory Informatics
5th Forum on Laboratory Informatics5th Forum on Laboratory Informatics
5th Forum on Laboratory Informatics
 
Data Science: An Emerging Field for Future Jobs
Data Science: An Emerging Field for Future JobsData Science: An Emerging Field for Future Jobs
Data Science: An Emerging Field for Future Jobs
 
To Be Digital, Pharma Labs Must Bridge the Gap Between Legacy Systems & Conne...
To Be Digital, Pharma Labs Must Bridge the Gap Between Legacy Systems & Conne...To Be Digital, Pharma Labs Must Bridge the Gap Between Legacy Systems & Conne...
To Be Digital, Pharma Labs Must Bridge the Gap Between Legacy Systems & Conne...
 
Heartificial intelligence - claudio-mirti
Heartificial intelligence - claudio-mirtiHeartificial intelligence - claudio-mirti
Heartificial intelligence - claudio-mirti
 
Acceliant white paper_edc_and_epro
Acceliant white paper_edc_and_eproAcceliant white paper_edc_and_epro
Acceliant white paper_edc_and_epro
 
Datascienceindia article
Datascienceindia articleDatascienceindia article
Datascienceindia article
 
Linked data in pharma
Linked data in pharmaLinked data in pharma
Linked data in pharma
 
Understand the Demand of Analyst Opportunity in U.S
Understand the Demand of Analyst Opportunity in U.SUnderstand the Demand of Analyst Opportunity in U.S
Understand the Demand of Analyst Opportunity in U.S
 
Data Science
Data ScienceData Science
Data Science
 
Data science lecture1_doaa_mohey
Data science lecture1_doaa_moheyData science lecture1_doaa_mohey
Data science lecture1_doaa_mohey
 
SciBite
SciBiteSciBite
SciBite
 
Removing the information bottleneck in R&D
Removing the information bottleneck in R&DRemoving the information bottleneck in R&D
Removing the information bottleneck in R&D
 
Pistoia Alliance Debates: PhUSE Framework for the Adoption of Cloud Technolog...
Pistoia Alliance Debates: PhUSE Framework for the Adoption of Cloud Technolog...Pistoia Alliance Debates: PhUSE Framework for the Adoption of Cloud Technolog...
Pistoia Alliance Debates: PhUSE Framework for the Adoption of Cloud Technolog...
 
Hybrid Fuzzy Approches for Networks
Hybrid Fuzzy Approches for NetworksHybrid Fuzzy Approches for Networks
Hybrid Fuzzy Approches for Networks
 
Finding common ground: integrating the eagle-i and VIVO ontologies
Finding common ground: integrating the eagle-i and VIVO ontologiesFinding common ground: integrating the eagle-i and VIVO ontologies
Finding common ground: integrating the eagle-i and VIVO ontologies
 
Sowmya Raghavan Strand Life
Sowmya Raghavan Strand LifeSowmya Raghavan Strand Life
Sowmya Raghavan Strand Life
 
Nvidia why every industry should be thinking about AI today
Nvidia why every industry should be thinking about AI todayNvidia why every industry should be thinking about AI today
Nvidia why every industry should be thinking about AI today
 
Data Science Lecture: Overview and Information Collateral
Data Science Lecture: Overview and Information CollateralData Science Lecture: Overview and Information Collateral
Data Science Lecture: Overview and Information Collateral
 

Similaire à Data lifecycle mgt across the enterprise

CLOUD COMPUTING AND BYOD: BENEFITS AND CHALLENGES IN MODERN HEALTHCARE
CLOUD COMPUTING AND BYOD: BENEFITS AND CHALLENGES IN MODERN HEALTHCARECLOUD COMPUTING AND BYOD: BENEFITS AND CHALLENGES IN MODERN HEALTHCARE
CLOUD COMPUTING AND BYOD: BENEFITS AND CHALLENGES IN MODERN HEALTHCARE
UsmanYakubuMaaruf
 
The FAIR data movement and 22 Feb 2023.pdf
The FAIR data movement and 22 Feb 2023.pdfThe FAIR data movement and 22 Feb 2023.pdf
The FAIR data movement and 22 Feb 2023.pdf
Alan Morrison
 
Ontology Tutorial: Semantic Technology for Intelligence, Defense and Security
Ontology Tutorial: Semantic Technology for Intelligence, Defense and SecurityOntology Tutorial: Semantic Technology for Intelligence, Defense and Security
Ontology Tutorial: Semantic Technology for Intelligence, Defense and Security
Barry Smith
 
Data accessibility and the role of informatics in predicting the biosphere
Data accessibility and the role of informatics in predicting the biosphereData accessibility and the role of informatics in predicting the biosphere
Data accessibility and the role of informatics in predicting the biosphere
Alex Hardisty
 
Square Pegs In Round Holes: Rethinking Data Availability in the Age of Automa...
Square Pegs In Round Holes: Rethinking Data Availability in the Age of Automa...Square Pegs In Round Holes: Rethinking Data Availability in the Age of Automa...
Square Pegs In Round Holes: Rethinking Data Availability in the Age of Automa...
Denodo
 
BigDataAnalytics_Talk_KOCH_FINAL
BigDataAnalytics_Talk_KOCH_FINALBigDataAnalytics_Talk_KOCH_FINAL
BigDataAnalytics_Talk_KOCH_FINAL
John Koch
 

Similaire à Data lifecycle mgt across the enterprise (20)

Reinventing Laboratory Data To Be Bigger, Smarter & Faster
Reinventing Laboratory Data To Be Bigger, Smarter & FasterReinventing Laboratory Data To Be Bigger, Smarter & Faster
Reinventing Laboratory Data To Be Bigger, Smarter & Faster
 
Licensing Linked Data
Licensing Linked DataLicensing Linked Data
Licensing Linked Data
 
Thesis Defense MBI
Thesis Defense MBIThesis Defense MBI
Thesis Defense MBI
 
CLOUD COMPUTING AND BYOD: BENEFITS AND CHALLENGES IN MODERN HEALTHCARE
CLOUD COMPUTING AND BYOD: BENEFITS AND CHALLENGES IN MODERN HEALTHCARECLOUD COMPUTING AND BYOD: BENEFITS AND CHALLENGES IN MODERN HEALTHCARE
CLOUD COMPUTING AND BYOD: BENEFITS AND CHALLENGES IN MODERN HEALTHCARE
 
The FAIR data movement and 22 Feb 2023.pdf
The FAIR data movement and 22 Feb 2023.pdfThe FAIR data movement and 22 Feb 2023.pdf
The FAIR data movement and 22 Feb 2023.pdf
 
Laboratory Integration John Trigg
Laboratory Integration  John TriggLaboratory Integration  John Trigg
Laboratory Integration John Trigg
 
Ontology Tutorial: Semantic Technology for Intelligence, Defense and Security
Ontology Tutorial: Semantic Technology for Intelligence, Defense and SecurityOntology Tutorial: Semantic Technology for Intelligence, Defense and Security
Ontology Tutorial: Semantic Technology for Intelligence, Defense and Security
 
Open Insights Harvard DBMI - Personal Health Train - Kees van Bochove - The Hyve
Open Insights Harvard DBMI - Personal Health Train - Kees van Bochove - The HyveOpen Insights Harvard DBMI - Personal Health Train - Kees van Bochove - The Hyve
Open Insights Harvard DBMI - Personal Health Train - Kees van Bochove - The Hyve
 
Linked Data: Opportunities for Entrepreneurs
Linked Data: Opportunities for EntrepreneursLinked Data: Opportunities for Entrepreneurs
Linked Data: Opportunities for Entrepreneurs
 
Tragedy of the Data Commons (ODSC-East, 2021)
Tragedy of the Data Commons (ODSC-East, 2021)Tragedy of the Data Commons (ODSC-East, 2021)
Tragedy of the Data Commons (ODSC-East, 2021)
 
Activate Your Data Lakehouse with an Enterprise Knowledge Graph
Activate Your Data Lakehouse with an Enterprise Knowledge GraphActivate Your Data Lakehouse with an Enterprise Knowledge Graph
Activate Your Data Lakehouse with an Enterprise Knowledge Graph
 
Collaboration - theory & Practice
Collaboration - theory & PracticeCollaboration - theory & Practice
Collaboration - theory & Practice
 
Data accessibility and the role of informatics in predicting the biosphere
Data accessibility and the role of informatics in predicting the biosphereData accessibility and the role of informatics in predicting the biosphere
Data accessibility and the role of informatics in predicting the biosphere
 
Considerations and challenges in building an end to-end microbiome workflow
Considerations and challenges in building an end to-end microbiome workflowConsiderations and challenges in building an end to-end microbiome workflow
Considerations and challenges in building an end to-end microbiome workflow
 
The Science of Data Science
The Science of Data Science The Science of Data Science
The Science of Data Science
 
Challenges and outlook with Big Data
Challenges and outlook with Big Data Challenges and outlook with Big Data
Challenges and outlook with Big Data
 
IoT 2014 Value Creation Workshop: SDIL
IoT 2014 Value Creation Workshop: SDILIoT 2014 Value Creation Workshop: SDIL
IoT 2014 Value Creation Workshop: SDIL
 
A Framework for Geospatial Web Services for Public Health by Dr. Leslie Lenert
A Framework for Geospatial Web Services for Public Health by Dr. Leslie LenertA Framework for Geospatial Web Services for Public Health by Dr. Leslie Lenert
A Framework for Geospatial Web Services for Public Health by Dr. Leslie Lenert
 
Square Pegs In Round Holes: Rethinking Data Availability in the Age of Automa...
Square Pegs In Round Holes: Rethinking Data Availability in the Age of Automa...Square Pegs In Round Holes: Rethinking Data Availability in the Age of Automa...
Square Pegs In Round Holes: Rethinking Data Availability in the Age of Automa...
 
BigDataAnalytics_Talk_KOCH_FINAL
BigDataAnalytics_Talk_KOCH_FINALBigDataAnalytics_Talk_KOCH_FINAL
BigDataAnalytics_Talk_KOCH_FINAL
 

Plus de OSTHUS

Revolutionizing Laboratory Instrument Data for the Pharmaceutical Industry:...
Revolutionizing Laboratory  Instrument Data for the  Pharmaceutical Industry:...Revolutionizing Laboratory  Instrument Data for the  Pharmaceutical Industry:...
Revolutionizing Laboratory Instrument Data for the Pharmaceutical Industry:...
OSTHUS
 
Allotrope foundation vanderwall_and_little_bio_it_world_2016
Allotrope foundation vanderwall_and_little_bio_it_world_2016Allotrope foundation vanderwall_and_little_bio_it_world_2016
Allotrope foundation vanderwall_and_little_bio_it_world_2016
OSTHUS
 

Plus de OSTHUS (12)

The Fast Track to Fair Lab Data
The Fast Track to Fair Lab Data The Fast Track to Fair Lab Data
The Fast Track to Fair Lab Data
 
Early AI Adoption Via Advanced Analytics
Early AI Adoption Via  Advanced AnalyticsEarly AI Adoption Via  Advanced Analytics
Early AI Adoption Via Advanced Analytics
 
Revolutionizing Laboratory Instrument Data for the Pharmaceutical Industry:...
Revolutionizing Laboratory  Instrument Data for the  Pharmaceutical Industry:...Revolutionizing Laboratory  Instrument Data for the  Pharmaceutical Industry:...
Revolutionizing Laboratory Instrument Data for the Pharmaceutical Industry:...
 
Why paperless lab is just the first step towards a smart lab
Why paperless lab is just the first step towards a smart labWhy paperless lab is just the first step towards a smart lab
Why paperless lab is just the first step towards a smart lab
 
Allotrope foundation vanderwall_and_little_bio_it_world_2016
Allotrope foundation vanderwall_and_little_bio_it_world_2016Allotrope foundation vanderwall_and_little_bio_it_world_2016
Allotrope foundation vanderwall_and_little_bio_it_world_2016
 
Semantics for Integrated Analytical Laboratory Processes – the Allotrope Pers...
Semantics for Integrated Analytical Laboratory Processes – the Allotrope Pers...Semantics for Integrated Analytical Laboratory Processes – the Allotrope Pers...
Semantics for Integrated Analytical Laboratory Processes – the Allotrope Pers...
 
Semantics for integrated laboratory analytical processes - The Allotrope Pers...
Semantics for integrated laboratory analytical processes - The Allotrope Pers...Semantics for integrated laboratory analytical processes - The Allotrope Pers...
Semantics for integrated laboratory analytical processes - The Allotrope Pers...
 
Best Practice Reference Architecture for Data Curation
Best Practice Reference Architecture for Data CurationBest Practice Reference Architecture for Data Curation
Best Practice Reference Architecture for Data Curation
 
Allotrope Foundation & OSTHUS at SmartLab Exchange 2015: Update on the Allotr...
Allotrope Foundation & OSTHUS at SmartLab Exchange 2015: Update on the Allotr...Allotrope Foundation & OSTHUS at SmartLab Exchange 2015: Update on the Allotr...
Allotrope Foundation & OSTHUS at SmartLab Exchange 2015: Update on the Allotr...
 
OSTHUS-Allotrope presents "Laboratory Informatics Strategy" at SmartLab 2015
OSTHUS-Allotrope presents "Laboratory Informatics Strategy" at SmartLab 2015OSTHUS-Allotrope presents "Laboratory Informatics Strategy" at SmartLab 2015
OSTHUS-Allotrope presents "Laboratory Informatics Strategy" at SmartLab 2015
 
Data Quality- How to clean up your legacy data
Data Quality- How to clean up your legacy dataData Quality- How to clean up your legacy data
Data Quality- How to clean up your legacy data
 
Data Quality- How to clean up your legacy data?
Data Quality- How to clean up your legacy data?Data Quality- How to clean up your legacy data?
Data Quality- How to clean up your legacy data?
 

Dernier

introduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdf
introduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdfintroduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdf
introduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdf
VishalKumarJha10
 
AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM TechniquesAI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
VictorSzoltysek
 
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 

Dernier (20)

%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein
%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein
%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein
 
introduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdf
introduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdfintroduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdf
introduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdf
 
AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM TechniquesAI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
 
LEVEL 5 - SESSION 1 2023 (1).pptx - PDF 123456
LEVEL 5   - SESSION 1 2023 (1).pptx - PDF 123456LEVEL 5   - SESSION 1 2023 (1).pptx - PDF 123456
LEVEL 5 - SESSION 1 2023 (1).pptx - PDF 123456
 
MarTech Trend 2024 Book : Marketing Technology Trends (2024 Edition) How Data...
MarTech Trend 2024 Book : Marketing Technology Trends (2024 Edition) How Data...MarTech Trend 2024 Book : Marketing Technology Trends (2024 Edition) How Data...
MarTech Trend 2024 Book : Marketing Technology Trends (2024 Edition) How Data...
 
Payment Gateway Testing Simplified_ A Step-by-Step Guide for Beginners.pdf
Payment Gateway Testing Simplified_ A Step-by-Step Guide for Beginners.pdfPayment Gateway Testing Simplified_ A Step-by-Step Guide for Beginners.pdf
Payment Gateway Testing Simplified_ A Step-by-Step Guide for Beginners.pdf
 
Direct Style Effect Systems - The Print[A] Example - A Comprehension Aid
Direct Style Effect Systems -The Print[A] Example- A Comprehension AidDirect Style Effect Systems -The Print[A] Example- A Comprehension Aid
Direct Style Effect Systems - The Print[A] Example - A Comprehension Aid
 
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdfLearn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
 
ManageIQ - Sprint 236 Review - Slide Deck
ManageIQ - Sprint 236 Review - Slide DeckManageIQ - Sprint 236 Review - Slide Deck
ManageIQ - Sprint 236 Review - Slide Deck
 
BUS PASS MANGEMENT SYSTEM USING PHP.pptx
BUS PASS MANGEMENT SYSTEM USING PHP.pptxBUS PASS MANGEMENT SYSTEM USING PHP.pptx
BUS PASS MANGEMENT SYSTEM USING PHP.pptx
 
AI & Machine Learning Presentation Template
AI & Machine Learning Presentation TemplateAI & Machine Learning Presentation Template
AI & Machine Learning Presentation Template
 
Exploring the Best Video Editing App.pdf
Exploring the Best Video Editing App.pdfExploring the Best Video Editing App.pdf
Exploring the Best Video Editing App.pdf
 
Pharm-D Biostatistics and Research methodology
Pharm-D Biostatistics and Research methodologyPharm-D Biostatistics and Research methodology
Pharm-D Biostatistics and Research methodology
 
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdfThe Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
 
Introducing Microsoft’s new Enterprise Work Management (EWM) Solution
Introducing Microsoft’s new Enterprise Work Management (EWM) SolutionIntroducing Microsoft’s new Enterprise Work Management (EWM) Solution
Introducing Microsoft’s new Enterprise Work Management (EWM) Solution
 
%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisa%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisa
 
How To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected WorkerHow To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected Worker
 
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
 
OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...
OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...
OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...
 
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 

Data lifecycle mgt across the enterprise

  • 1. V.2.2 Eric Little, PhD Chief Data Officer OSTHUS eric.little@osthus.com Data Lifecycle Management Across The Enterprise
  • 2. Slide 2 Pharma invests in R&D and has to make $ back over subsequent years  Most R&D will fail, so risk is high Law of Diminishing Returns  R&D productivity is declining  Harder treatments have greater costs, potentially lower returns  Drugs with minimal improvements (not as many blockbusters + generics) The Pharma Industry Is At A Tipping Point From: Kelvin Stott - https://endpts.com/pharmas-broken-business-model- an-industry-on-the-brink-of-terminal-decline/
  • 3. Slide 3 Reduce R&D costs through better use of data  Many experiments are re-run because scientists cannot find existing data  Costs of system integration is much higher than data integration  Standardization upstream can significantly impact costs downstream Once data is available – automate as much as possible Connect your internal data to other external data sources  Many items exist in open source that can be modified easier than built from scratch How To Help Remedy the Situation Use the data you have before you generate more! Start with reoccurring tasks – workflows, models, query patterns, analytics, etc., then build out! Don’t reinvent the wheel! Build data communities!
  • 4. Slide 4 THE MOVE FROM BIG DATA TO BIG ANALYSIS STATISTICAL SEMANTICS MACHINE LEARNING REASONING
  • 5. Slide 5 Moving to Smart Data Smart data can be added to existing systems  Does not require replacement of existing tech Smart data provides a separation of:  Model Layer  Data Layer Link to the model layer  Leave data in place  Smart data links information from the models to instance-level data Smart Data uses metadata in order to capture context about data
  • 6. Slide 6 Semantic Spectrum of Knowledge Organization Systems • Deborah L. McGuinness. "Ontologies Come of Age". In Dieter Fensel, Jim Hendler, Henry Lieberman, and Wolfgang Wahlster, editors. Spinning the Semantic Web: Bringing the World Wide Web to Its Full Potential. MIT Press, 2003. • Michael Uschold and Michael Gruninger “Ontologies and semantics for seamless connectivity” SIGMOD Rec. 33, 4 (December 2004), 58-64. DOI=http://dx.doi.org/10.1145/1041410.1041420 • Leo Obrst “The Ontology Spectrum”. Book section in of Roberto Poli, Michael Healy, Achilles Kameas “Theory and Applications of Ontology: Computer Applications”. Springer Netherlands, 17 Sep 2010. • Leo Obrst and Mills Davis "Semantic Wave 2008 Report: Industry Roadmap to Web 3.0 & Multibillion Dollar Market Opportunities”. 2008. Sources
  • 7. Slide 7 Advantages of Using This Tech Use cases where customers report distinct improvement:  Better defined terms • Differentiates between Entities and Labels – more specific data dictionary  Better taxonomic structure • Hierarchies can be accurately captured – not buried in incorrect tables  Query Federation • Can easily use multiple data sources (integration)  Query Faceting • Query results can be easily refined (and shared)  Better use of metadata • Provides context for users • Raw data is more valuable over time  Makes data actionable across an enterprise • Moves from local data (on people’s machines, in their heads) to explicit sharable resources • Adding SMART DATA to BIG DATA provides the means to access and use the data • Requires combining logical data with statistical data in order to find patterns of interest inside of large data sets
  • 8. Slide 8 A Semantic Framework can connect the entire enterprise using a common semantics The Semantic Hub should only focus on metadata (not instance level data) Benefits: Common Terms, Models, Queries, Rules and Results (End-to-End) Integrating Data Across the Enterprise Lab Instruments Clinical Trials Regulatory AffairsProduction eArchiving
  • 9. Slide 9 Lab Instrument Use Case – Allotrope Framework HPLC – UV Mobile Phase Selection
  • 10. Slide 10 Ontology for HPLC Example (Allotrope) resultdevice material process
  • 11. Slide 11 Clinical Trials Use Case – Astra Zeneca & MedImmune
  • 12. Slide 12 Connecting The Dots Across AstraZeneca & MedImmune For Clinical Trials
  • 13. Slide 13 FAIR Principles Bring Together Clinical Trials Data Across Phases
  • 14. Slide 14 Domain Knowledge Is Captured In Models
  • 15. Slide 15 Production Use Case – Manufacturing Data Integration
  • 16. Slide 16 Often times R&D and manufacturing cannot easily share data Competing systems can evolve which cause incompatibilities Manufacturing data is often lower less complex than R&D data, but significantly higher in throughput  QA/QC plays a major role  Far more interpretation in R&D  Manufacturing needs results fast • Alarms • Trends  Manufacturing data is less retrospective Manufacturing Data Vs. R&D Data
  • 17. Slide 17 Regulatory Use Case – Unstructured Data Integration
  • 18. Slide 18 Regulatory compliance requires accessing and mining unstructured data Linking unstructured data to other data provides significant advantages  Text to DB links unstructured and structured data  Text to Public Data Sources leverages open source research Regulatory Compliance Regulatory Documentation
  • 19. Slide 19 E-Archiving: Managing Data Over Long Lifecycles
  • 20. Slide 20 Data is made available for easier search and indexing (even after long periods of time) Archiving is no longer a “vault” concept but is integrated within the Data Mgt. Lifecycle E-Archiving Using the Allotrope Data Framework
  • 21. Slide 21 Big Analysis Requires Hybrid Architectures Semantic DBs Unstructured Docs Structured Data Cloud DBs (NoSQL)Analytics Dashboards & Reports Integration Layer
  • 22. Slide 22 Data Science (machine learning, text analytics, clustering etc.) FAIR Data Is Now Accessible For Advanced Analytics Linked Open Data & Open APIs Semantic Graph DB (Knowledge Graph) Operational DBs … Unstructured Documents Analytics Tools simulations statistics reasoning Visualization dashboards exploration search … Semi-structured Data Instrument Data Lightweight Semantic Integration Layer (semantic RMDM, APIs, semantic indexing, data annotation, catalogues, meta data and linking) Reporting regulatory internal external
  • 23. Slide 23 CONNECTING DATA, PEOPLE AND ORGANIZATIONS Contact Information: Email: eric.little@osthus.com Web: www.osthus.com www.biganalysis.com Twitter: OntoEric