SlideShare une entreprise Scribd logo
1  sur  19
Télécharger pour lire hors ligne
A journey in developing an
open-source application on
Neo4j
GraphSummit Copenhagen, 7th March 2024
Mikkel Traun, Principal System Developer, Novo Nordisk A/S
Henrik Enquist, PhD, Lead Software Developer, Novo Nordisk A/S
Novo Nordisk®
What is the OpenStudyBuilder?…
A NEW APPROACH TO STUDY
SPECIFICATION
• Compliance with external and internal standards
• Facilitates automation and content reuse
• Ensures a higher degree of end-to-end consistency
2
3 ELEMENTS OF OpenStudyBuilder
• Clinical Metadata Repository (clinical MDR)
(central repository for all study specification data)
• OpenStudyBuilder application / Web UI
• API layer
(allowing interoperability with other applications)
(DDF API Adaptor – enabling DDF SDR Compatibility)
clinical MDR
Novo Nordisk®
OpenStudyBuilder Components
3
STUDIES
TITLE CRITERIA
REGISTRY IDENTIFERS INTERVENTIONS
STRUCTURE PURPOSE
POPULATION ACTVITIES
LIBRARY
CONTROLLED
TERMINOLOGY
MEDICAL DICTIONARIES
(e.g., MedDRA)
CONCEPTS (ACTIVITIES,
UNITS, CRFs, COMPOUNDS)
SYNTAX TEMPLATES
DATA EXCHANGE STANDARDS
Novo Nordisk®
Applying Domain Driven Design principles
• Domain Driven Design (DDD) is a common
design pattern in system development
• Works well with the linked graph database
design
• Ending up with a very big data model – as
the domain is complex – maybe OK
• Can be complicated in the Python based
API component
• Challenges in using Neo4j OGM NeoModel
with DDD principles in Python
4
Novo Nordisk®
Benefits working with linked graph model
• Big and complex data domain is captured well in
the label property graph model
• Given you understand the
clinical data domain!
• Support fine granular versioning
• Support domain driven queries
• Can deliver fast performance
• Other MDR solutions have applied generic
relational data models
• Having difficulties in managing versioning at a
granular level
• Very complex and long running queries
5
Novo Nordisk®
Technical details| MDR and StudyBuilder
Vue.js: https://vuejs.org/ Vuetify: https://vuetifyjs.com/en/ VuePress: https://vuepress.vuejs.org/ FastAPI: https://fastapi.tiangolo.com/ Neo4j: https://neo4j.com/neo4j-graph-database/
Front-end
• Vue.js
• Modern JavaScript web framework
• Vuetify user interface styling library
• Websites can be displayed on all operating
systems
• Views automatically adjust to underlying
data
• Pages are created with re-usable
components, e.g. a table or a visualisation.
Code once, reuse many times.
• Wide usage:
• Popular among developers
• Google, Apple, Netflix
• VuePress Documentation portal
Service layer
• Python driven API
• FastAPI Framework
• Highly readable code
• More than 40% of developers use Python,
according to Stack Overflow
• Restful API
• CDISC use python for their API as well
• Automatically generate API
documentation with inline code comments
• Lightweight
• Cloud hosting (Azure) provides elastic
scaling – upscale and downscale
according to immediate usage
Backend
• Neo4j database
• Modern label-property graph database
• Data model is close to the domain
• Hosted on any cloud or locally
Novo Nordisk®
Challenges using neomodel versus plain Cypher
- neomodel can be fast
https://github.com/neo4j-contrib/neomodel
Novo Nordisk®
Challenges using neomodel versus plain Cypher
- neomodel can be slow
• Fetching many items with several properties 🡪 many queries, latencies add up
• A single “big” cypher query is much faster but is more work to write and maintain
• neomodel can do better
items = root_class.nodes.order_by(”-name")
for item in items:
obj_a = item.has_obj_a.get_or_none()
obj_b = item.has_obj_b.get_or_none()
obj_c = item.has_obj_c.get_or_none()
obj_d = item.has_obj_d.get_or_none()
items = root_class.nodes.all().fetch_relationships("obj_a", "obj_b", ...)
Novo Nordisk®
Challenges using neomodel versus plain Cypher
9
Make simple implementation with neomodel
If performance is satisfactory, done
If not, refactor to utilize neomodel better
• Or, if not possible, reimplement with Cypher
Keep track of performance as data amount increases
Novo Nordisk®
Challenges using neomodel versus plain Cypher
Dummy data:
• Simple
• Small
• Uses a subset of the data model
Production data:
• Complicated
• Big
• Uses (nearly) the full data model
• Need better dummy data!
• Generative AI can help to create richer dummy data
10
Novo Nordisk®
Challenges showing graph data in tables
• Vuetify Data table
11
Novo Nordisk®
Challenges showing graph data in tables
• StudyBuilder
list of Activities
12
Novo Nordisk®
Changing data models and data migrations
• Model continuously evolves
• Example: Adding an intermediate node between two existing
• Migration needed!
13
Novo Nordisk®
Changing data models and data migrations
We need:
- Migration script (usually Cypher)
- Verification script (usually Cypher)
- Test data, preferably a sample extracted from production.
Test:
- Inject test data in a temporary DB
- Run migration
- Run verification
- Run migration again, assert that nothing changed
Challenge:
- Documenting the changes
14
Novo Nordisk®
Changing data models and data migrations
- Change Data Capture
15
@capture_changes() decorator
1) Extract docstring from wrapped function
2) Enable log enrichment
3) Query for the current change id
4) Run the wrapped function
5) Query for the changes
6) Dump the changes to a json file
7) Generate a summary and append to a
markdown file
Novo Nordisk®
Changing data models and data migrations
- Change Data Capture
16
Novo Nordisk®
Neo4j Enterprise and open-source sharing
17
- Enterprise features are very useful.
- Who are the users of your open-source application?
- Mainly other pharma's – they need full support so not an issue
- Non-profit organizations – they can get a free license so not an issue
- Smaller biotech's and CRO’s – they can have an issue with costs!
- Can the functionality using Enterprise features be optional?
- Will require a dedicated deployment, data level access control, will
impact system validation and potential performance
Novo Nordisk®
Summary
• Neo4j database as store for enterprise application
• Big and complex data domain is captured well in the label property graph model
• Graph data model allows data to grow without getting more complicated
• Some challenges in how to present data to users
18
Thanks!
Questions?

Contenu connexe

Similaire à Novo Nordisk's journey in developing an open-source application on Neo4j

Open Chemistry: Input Preparation, Data Visualization & Analysis
Open Chemistry: Input Preparation, Data Visualization & AnalysisOpen Chemistry: Input Preparation, Data Visualization & Analysis
Open Chemistry: Input Preparation, Data Visualization & Analysis
Marcus Hanwell
 
Automating the process of continuously prioritising data, updating and deploy...
Automating the process of continuously prioritising data, updating and deploy...Automating the process of continuously prioritising data, updating and deploy...
Automating the process of continuously prioritising data, updating and deploy...
Ola Spjuth
 

Similaire à Novo Nordisk's journey in developing an open-source application on Neo4j (20)

Make your Microservices sing!
Make your Microservices sing!Make your Microservices sing!
Make your Microservices sing!
 
Utilising Cloud Computing for Research through Infrastructure, Software and D...
Utilising Cloud Computing for Research through Infrastructure, Software and D...Utilising Cloud Computing for Research through Infrastructure, Software and D...
Utilising Cloud Computing for Research through Infrastructure, Software and D...
 
Federated Cloud Computing
Federated Cloud ComputingFederated Cloud Computing
Federated Cloud Computing
 
Open Chemistry: Input Preparation, Data Visualization & Analysis
Open Chemistry: Input Preparation, Data Visualization & AnalysisOpen Chemistry: Input Preparation, Data Visualization & Analysis
Open Chemistry: Input Preparation, Data Visualization & Analysis
 
Introduction to Apache Mesos and DC/OS
Introduction to Apache Mesos and DC/OSIntroduction to Apache Mesos and DC/OS
Introduction to Apache Mesos and DC/OS
 
Gluent Extending Enterprise Applications with Hadoop
Gluent Extending Enterprise Applications with HadoopGluent Extending Enterprise Applications with Hadoop
Gluent Extending Enterprise Applications with Hadoop
 
Current & Future Use-Cases of OpenDaylight
Current & Future Use-Cases of OpenDaylightCurrent & Future Use-Cases of OpenDaylight
Current & Future Use-Cases of OpenDaylight
 
Elasticsearch + Cascading for Scalable Log Processing
Elasticsearch + Cascading for Scalable Log ProcessingElasticsearch + Cascading for Scalable Log Processing
Elasticsearch + Cascading for Scalable Log Processing
 
Oracle OpenWo2014 review part 03 three_paa_s_database
Oracle OpenWo2014 review part 03 three_paa_s_databaseOracle OpenWo2014 review part 03 three_paa_s_database
Oracle OpenWo2014 review part 03 three_paa_s_database
 
Introducing NoSQL and MongoDB to complement Relational Databases (AMIS SIG 14...
Introducing NoSQL and MongoDB to complement Relational Databases (AMIS SIG 14...Introducing NoSQL and MongoDB to complement Relational Databases (AMIS SIG 14...
Introducing NoSQL and MongoDB to complement Relational Databases (AMIS SIG 14...
 
Getting started with postgresql
Getting started with postgresqlGetting started with postgresql
Getting started with postgresql
 
Student Industrial Training Presentation Slide
Student Industrial Training Presentation SlideStudent Industrial Training Presentation Slide
Student Industrial Training Presentation Slide
 
SCAPE - Scalable Preservation Environments
SCAPE - Scalable Preservation EnvironmentsSCAPE - Scalable Preservation Environments
SCAPE - Scalable Preservation Environments
 
Automating the process of continuously prioritising data, updating and deploy...
Automating the process of continuously prioritising data, updating and deploy...Automating the process of continuously prioritising data, updating and deploy...
Automating the process of continuously prioritising data, updating and deploy...
 
MIGRATION - PAIN OR GAIN?
MIGRATION - PAIN OR GAIN?MIGRATION - PAIN OR GAIN?
MIGRATION - PAIN OR GAIN?
 
Webinar: “ditch Oracle NOW”: Best Practices for Migrating to MongoDB
 Webinar: “ditch Oracle NOW”: Best Practices for Migrating to MongoDB Webinar: “ditch Oracle NOW”: Best Practices for Migrating to MongoDB
Webinar: “ditch Oracle NOW”: Best Practices for Migrating to MongoDB
 
Postgres NoSQL - Delivering Apps Faster
Postgres NoSQL - Delivering Apps FasterPostgres NoSQL - Delivering Apps Faster
Postgres NoSQL - Delivering Apps Faster
 
GraphTour 2020 - Neo4j: What's New?
GraphTour 2020 - Neo4j: What's New?GraphTour 2020 - Neo4j: What's New?
GraphTour 2020 - Neo4j: What's New?
 
Optymalizacja środowiska Open Source w celu zwiększenia oszczędności i kontroli
Optymalizacja środowiska Open Source w celu zwiększenia oszczędności i kontroliOptymalizacja środowiska Open Source w celu zwiększenia oszczędności i kontroli
Optymalizacja środowiska Open Source w celu zwiększenia oszczędności i kontroli
 
OGCE SC10
OGCE SC10OGCE SC10
OGCE SC10
 

Plus de Neo4j

Plus de Neo4j (20)

Your enemies use GenAI too - staying ahead of fraud with Neo4j
Your enemies use GenAI too - staying ahead of fraud with Neo4jYour enemies use GenAI too - staying ahead of fraud with Neo4j
Your enemies use GenAI too - staying ahead of fraud with Neo4j
 
BT & Neo4j _ How Knowledge Graphs help BT deliver Digital Transformation.pptx
BT & Neo4j _ How Knowledge Graphs help BT deliver Digital Transformation.pptxBT & Neo4j _ How Knowledge Graphs help BT deliver Digital Transformation.pptx
BT & Neo4j _ How Knowledge Graphs help BT deliver Digital Transformation.pptx
 
Workshop: Enabling GenAI Breakthroughs with Knowledge Graphs - GraphSummit Milan
Workshop: Enabling GenAI Breakthroughs with Knowledge Graphs - GraphSummit MilanWorkshop: Enabling GenAI Breakthroughs with Knowledge Graphs - GraphSummit Milan
Workshop: Enabling GenAI Breakthroughs with Knowledge Graphs - GraphSummit Milan
 
Workshop - Architecting Innovative Graph Applications- GraphSummit Milan
Workshop -  Architecting Innovative Graph Applications- GraphSummit MilanWorkshop -  Architecting Innovative Graph Applications- GraphSummit Milan
Workshop - Architecting Innovative Graph Applications- GraphSummit Milan
 
LARUS - Galileo.XAI e Gen-AI: la nuova prospettiva di LARUS per il futuro del...
LARUS - Galileo.XAI e Gen-AI: la nuova prospettiva di LARUS per il futuro del...LARUS - Galileo.XAI e Gen-AI: la nuova prospettiva di LARUS per il futuro del...
LARUS - Galileo.XAI e Gen-AI: la nuova prospettiva di LARUS per il futuro del...
 
GraphSummit Milan - Visione e roadmap del prodotto Neo4j
GraphSummit Milan - Visione e roadmap del prodotto Neo4jGraphSummit Milan - Visione e roadmap del prodotto Neo4j
GraphSummit Milan - Visione e roadmap del prodotto Neo4j
 
GraphSummit Milan - Neo4j: The Art of the Possible with Graph
GraphSummit Milan - Neo4j: The Art of the Possible with GraphGraphSummit Milan - Neo4j: The Art of the Possible with Graph
GraphSummit Milan - Neo4j: The Art of the Possible with Graph
 
LARUS - Galileo.XAI e Gen-AI: la nuova prospettiva di LARUS per il futuro del...
LARUS - Galileo.XAI e Gen-AI: la nuova prospettiva di LARUS per il futuro del...LARUS - Galileo.XAI e Gen-AI: la nuova prospettiva di LARUS per il futuro del...
LARUS - Galileo.XAI e Gen-AI: la nuova prospettiva di LARUS per il futuro del...
 
UNI DI NAPOLI FEDERICO II - Il ruolo dei grafi nell'AI Conversazionale Ibrida
UNI DI NAPOLI FEDERICO II - Il ruolo dei grafi nell'AI Conversazionale IbridaUNI DI NAPOLI FEDERICO II - Il ruolo dei grafi nell'AI Conversazionale Ibrida
UNI DI NAPOLI FEDERICO II - Il ruolo dei grafi nell'AI Conversazionale Ibrida
 
CERVED e Neo4j su una nuvola, migrazione ed evoluzione di un grafo mission cr...
CERVED e Neo4j su una nuvola, migrazione ed evoluzione di un grafo mission cr...CERVED e Neo4j su una nuvola, migrazione ed evoluzione di un grafo mission cr...
CERVED e Neo4j su una nuvola, migrazione ed evoluzione di un grafo mission cr...
 
From Knowledge Graphs via Lego Bricks to scientific conversations.pptx
From Knowledge Graphs via Lego Bricks to scientific conversations.pptxFrom Knowledge Graphs via Lego Bricks to scientific conversations.pptx
From Knowledge Graphs via Lego Bricks to scientific conversations.pptx
 
Novo Nordisk: When Knowledge Graphs meet LLMs
Novo Nordisk: When Knowledge Graphs meet LLMsNovo Nordisk: When Knowledge Graphs meet LLMs
Novo Nordisk: When Knowledge Graphs meet LLMs
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
QIAGEN: Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
QIAGEN: Biomedical Knowledge Graphs for Data Scientists and BioinformaticiansQIAGEN: Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
QIAGEN: Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
 
EY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityEY_Graph Database Powered Sustainability
EY_Graph Database Powered Sustainability
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdf
 
ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...
ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...
ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...
 

Dernier

Tales from a Passkey Provider Progress from Awareness to Implementation.pptx
Tales from a Passkey Provider  Progress from Awareness to Implementation.pptxTales from a Passkey Provider  Progress from Awareness to Implementation.pptx
Tales from a Passkey Provider Progress from Awareness to Implementation.pptx
FIDO Alliance
 

Dernier (20)

Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Simplifying Mobile A11y Presentation.pptx
Simplifying Mobile A11y Presentation.pptxSimplifying Mobile A11y Presentation.pptx
Simplifying Mobile A11y Presentation.pptx
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
ChatGPT and Beyond - Elevating DevOps Productivity
ChatGPT and Beyond - Elevating DevOps ProductivityChatGPT and Beyond - Elevating DevOps Productivity
ChatGPT and Beyond - Elevating DevOps Productivity
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
Stronger Together: Developing an Organizational Strategy for Accessible Desig...
Stronger Together: Developing an Organizational Strategy for Accessible Desig...Stronger Together: Developing an Organizational Strategy for Accessible Desig...
Stronger Together: Developing an Organizational Strategy for Accessible Desig...
 
Introduction to use of FHIR Documents in ABDM
Introduction to use of FHIR Documents in ABDMIntroduction to use of FHIR Documents in ABDM
Introduction to use of FHIR Documents in ABDM
 
TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....
TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....
TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....
 
الأمن السيبراني - ما لا يسع للمستخدم جهله
الأمن السيبراني - ما لا يسع للمستخدم جهلهالأمن السيبراني - ما لا يسع للمستخدم جهله
الأمن السيبراني - ما لا يسع للمستخدم جهله
 
JohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptxJohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptx
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
 
Less Is More: Utilizing Ballerina to Architect a Cloud Data Platform
Less Is More: Utilizing Ballerina to Architect a Cloud Data PlatformLess Is More: Utilizing Ballerina to Architect a Cloud Data Platform
Less Is More: Utilizing Ballerina to Architect a Cloud Data Platform
 
Design and Development of a Provenance Capture Platform for Data Science
Design and Development of a Provenance Capture Platform for Data ScienceDesign and Development of a Provenance Capture Platform for Data Science
Design and Development of a Provenance Capture Platform for Data Science
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
 
Tales from a Passkey Provider Progress from Awareness to Implementation.pptx
Tales from a Passkey Provider  Progress from Awareness to Implementation.pptxTales from a Passkey Provider  Progress from Awareness to Implementation.pptx
Tales from a Passkey Provider Progress from Awareness to Implementation.pptx
 

Novo Nordisk's journey in developing an open-source application on Neo4j

  • 1. A journey in developing an open-source application on Neo4j GraphSummit Copenhagen, 7th March 2024 Mikkel Traun, Principal System Developer, Novo Nordisk A/S Henrik Enquist, PhD, Lead Software Developer, Novo Nordisk A/S
  • 2. Novo Nordisk® What is the OpenStudyBuilder?… A NEW APPROACH TO STUDY SPECIFICATION • Compliance with external and internal standards • Facilitates automation and content reuse • Ensures a higher degree of end-to-end consistency 2 3 ELEMENTS OF OpenStudyBuilder • Clinical Metadata Repository (clinical MDR) (central repository for all study specification data) • OpenStudyBuilder application / Web UI • API layer (allowing interoperability with other applications) (DDF API Adaptor – enabling DDF SDR Compatibility) clinical MDR
  • 3. Novo Nordisk® OpenStudyBuilder Components 3 STUDIES TITLE CRITERIA REGISTRY IDENTIFERS INTERVENTIONS STRUCTURE PURPOSE POPULATION ACTVITIES LIBRARY CONTROLLED TERMINOLOGY MEDICAL DICTIONARIES (e.g., MedDRA) CONCEPTS (ACTIVITIES, UNITS, CRFs, COMPOUNDS) SYNTAX TEMPLATES DATA EXCHANGE STANDARDS
  • 4. Novo Nordisk® Applying Domain Driven Design principles • Domain Driven Design (DDD) is a common design pattern in system development • Works well with the linked graph database design • Ending up with a very big data model – as the domain is complex – maybe OK • Can be complicated in the Python based API component • Challenges in using Neo4j OGM NeoModel with DDD principles in Python 4
  • 5. Novo Nordisk® Benefits working with linked graph model • Big and complex data domain is captured well in the label property graph model • Given you understand the clinical data domain! • Support fine granular versioning • Support domain driven queries • Can deliver fast performance • Other MDR solutions have applied generic relational data models • Having difficulties in managing versioning at a granular level • Very complex and long running queries 5
  • 6. Novo Nordisk® Technical details| MDR and StudyBuilder Vue.js: https://vuejs.org/ Vuetify: https://vuetifyjs.com/en/ VuePress: https://vuepress.vuejs.org/ FastAPI: https://fastapi.tiangolo.com/ Neo4j: https://neo4j.com/neo4j-graph-database/ Front-end • Vue.js • Modern JavaScript web framework • Vuetify user interface styling library • Websites can be displayed on all operating systems • Views automatically adjust to underlying data • Pages are created with re-usable components, e.g. a table or a visualisation. Code once, reuse many times. • Wide usage: • Popular among developers • Google, Apple, Netflix • VuePress Documentation portal Service layer • Python driven API • FastAPI Framework • Highly readable code • More than 40% of developers use Python, according to Stack Overflow • Restful API • CDISC use python for their API as well • Automatically generate API documentation with inline code comments • Lightweight • Cloud hosting (Azure) provides elastic scaling – upscale and downscale according to immediate usage Backend • Neo4j database • Modern label-property graph database • Data model is close to the domain • Hosted on any cloud or locally
  • 7. Novo Nordisk® Challenges using neomodel versus plain Cypher - neomodel can be fast https://github.com/neo4j-contrib/neomodel
  • 8. Novo Nordisk® Challenges using neomodel versus plain Cypher - neomodel can be slow • Fetching many items with several properties 🡪 many queries, latencies add up • A single “big” cypher query is much faster but is more work to write and maintain • neomodel can do better items = root_class.nodes.order_by(”-name") for item in items: obj_a = item.has_obj_a.get_or_none() obj_b = item.has_obj_b.get_or_none() obj_c = item.has_obj_c.get_or_none() obj_d = item.has_obj_d.get_or_none() items = root_class.nodes.all().fetch_relationships("obj_a", "obj_b", ...)
  • 9. Novo Nordisk® Challenges using neomodel versus plain Cypher 9 Make simple implementation with neomodel If performance is satisfactory, done If not, refactor to utilize neomodel better • Or, if not possible, reimplement with Cypher Keep track of performance as data amount increases
  • 10. Novo Nordisk® Challenges using neomodel versus plain Cypher Dummy data: • Simple • Small • Uses a subset of the data model Production data: • Complicated • Big • Uses (nearly) the full data model • Need better dummy data! • Generative AI can help to create richer dummy data 10
  • 11. Novo Nordisk® Challenges showing graph data in tables • Vuetify Data table 11
  • 12. Novo Nordisk® Challenges showing graph data in tables • StudyBuilder list of Activities 12
  • 13. Novo Nordisk® Changing data models and data migrations • Model continuously evolves • Example: Adding an intermediate node between two existing • Migration needed! 13
  • 14. Novo Nordisk® Changing data models and data migrations We need: - Migration script (usually Cypher) - Verification script (usually Cypher) - Test data, preferably a sample extracted from production. Test: - Inject test data in a temporary DB - Run migration - Run verification - Run migration again, assert that nothing changed Challenge: - Documenting the changes 14
  • 15. Novo Nordisk® Changing data models and data migrations - Change Data Capture 15 @capture_changes() decorator 1) Extract docstring from wrapped function 2) Enable log enrichment 3) Query for the current change id 4) Run the wrapped function 5) Query for the changes 6) Dump the changes to a json file 7) Generate a summary and append to a markdown file
  • 16. Novo Nordisk® Changing data models and data migrations - Change Data Capture 16
  • 17. Novo Nordisk® Neo4j Enterprise and open-source sharing 17 - Enterprise features are very useful. - Who are the users of your open-source application? - Mainly other pharma's – they need full support so not an issue - Non-profit organizations – they can get a free license so not an issue - Smaller biotech's and CRO’s – they can have an issue with costs! - Can the functionality using Enterprise features be optional? - Will require a dedicated deployment, data level access control, will impact system validation and potential performance
  • 18. Novo Nordisk® Summary • Neo4j database as store for enterprise application • Big and complex data domain is captured well in the label property graph model • Graph data model allows data to grow without getting more complicated • Some challenges in how to present data to users 18