SlideShare une entreprise Scribd logo
1  sur  31
Télécharger pour lire hors ligne
Presenter:
Date:
Note:
Company:
eMail:
Marc Bouma
June 5, 2014
UMC Utrecht
m.c.bouma@umcutrecht.nl
Our Dot on the Horizon
- Central point for delivering healthcare processes data
for medical research
- Integrate various sources
- Historize, trace and pseudonymize all data used
Our Journey
- Learning and adapting to Data Vault
 not everybody is a modeler (Shu Ha Ri)
- Script, code, build, try, test, throw away and start again
- Testing overrated?
- Architectureimprovements
 Performance issues SAS/Microsoft
 Performance issues loading scripts
 Automate DV load
- From Chaos to SCRUM
Our Obstacles
- Registration for healthcare process vs. usability for
research
- Questionnaires: sources or generic models?
- Performance:
 Do we really need all complete texts?
 Do we really need 20 years of lab results?
- The usual: conflicting interests,politics etc.
Our preliminary results
- 2013: selection of 5 major
Studies as starting
showcasesproved difficult
- 2014: had to choose 5
new showcasesfrom 25
applicants
- Started as Research Data
Platform, now growth
towards Enterprise Data
Platform (including
Education and BI)
- Architecturenow stable
Lessons learned
• Automate when possible
• Invest in a team of skilled pioneers
• Models rule everything
• Adapt agility, teach agility
Presenter:
Date:
Note:
Company:
eMail:
Twitter:
Sander Robijns
June 5, 2014
Estrenuo BVBA
sander.robijns@gmail.com
@srobijns
The Issue
No enterprise-wide business keys
The Current Approach
Using recursive links on hubs to identify the
same-as relationship
The Struggle
Getting the facts reported under a single
business key
The Future Approach
Master Data Management will take away some
of the struggles
The Lesson Learned
Get the enterprise-wide business keys in place
first using data governance
Presenter:
Date:
Note:
Company:
eMail:
Twitter:
Kasper de Graaf
June 5 2014
Occurro
kasper@occurro.nl
kdgraaf
Groups of Links: context at hospital
Imagine the following:
• An operation (surgery) is executed by a
group of people (first surgeon, second
surgeon, assistant,anesthiologist, etc.)
• An operation is planned a couple of weeks
in advance
• Whenever the planning changes in the
source the complete group is sent to the
EDW
Group of Links: the Data
{Time} operation_no employee_no role
T=1 19354 John OP1
19354 Jane OP2
19354 Chris ANA
T=2 19354 John OP1
19354 Mary ANA
T=3 19354 Jane OP1
19354 Chris ANA
Please note: the actual operation with operation_no
19354 is executed by Jane (OP1) and Chris (ANA)
Groups of Links: the Problem
Standard Data Vault loading routines cannot
handle this situation:
operation_no employee_no role load_dts
19354 John OP1 T=1
19354 Jane OP2 T=1
19354 Chris ANA T=1
19354 Mary ANA T=2
19354 Jane OP1 T=3
Groups of Links: the Problem
Using end-dating of a link (preferable a validity
satellite) cannot handle this problem either:
operation_no employee_no role load_dts Active?
19354 John OP1 T=1 No (T=3)
19354 Jane OP2 T=1 Yes
19354 Chris ANA T=1 No (T=3)
19354 Mary ANA T=2 Yes
19354 Jane OP1 T=3 Yes
BK of link used: operation_no + role
Groups of Links: our solution
1. Add a validity satellite to the link (for end-dating)
2. Tell the meta data of the automatin tool this is a
group validity satellite with BK=operation_no
3. Whenever an existing operation_no is present in
the staging layer set all current links to
Active=No
4. Process as usual
• Remark: because the same row can come back
(i.e. John/OP1) it will be set to Active=No and
Active=Yes at the same time there can be no
unique index on BK of Validity satellite and some
cleaning up is required after loading
Groups of Links: special thanks to …
St. Antonius Hospital (for having the problem)
Edwin Weber (for coding the solution)
Get your copy of the solution:
http://sourceforge.net/projects/pdidatavaultf
w/
Presenter:
Date:
Note:
Company:
eMail:
Twitter:
Juan-Josévan der Linden
June 5, 2014
DV, MPP
QOSQO
juanjose.vanderlinden@qosqo.nl
@delostilos
SMP => MPP => AMPP
SMP
Symmetric
Processing
MPP
Massively
Parallel
Processing
AMPP
Asymmetric MPP
( SMP + MPP)
Primary key => distribution key 
hub -< satellite join
- data redistribution
- join local in parallel
BK SID
Ensemble 1
Dimensional 2
SID LDTS INFO
1 2001-01-01 My first DV
1 2014-06-05 DV Masters
2 1997-08-02 DM manifesto
Node 1
Node 2
Hub SID => distribution key 
hub -< satellite join
- join local in parallel
BK SID
Ensemble 1
Dimensional 2
SID LDTS INFO
1 2001-01-01 First DV
1 2014-06-05 DV Masters
2 1997-08-02 DM manifesto
Node 1
Node 2
Link SID => distribution key 
Default L_SID, 1:N & N:M
- data redistribution
- join local in parallel
H_MID H_SID L_SID
1 A 1
1 B 2
L_SID LDTS LDTS_END CURRENT
1 2001-01-01 2006-01-01 N
1 2014-06-05 9999-12-31 Y
2 2006-01-01 2014-06-05 N
H_MID H_SID L_SID
1 A 1
1 B 2
L_SID H_MID H_SID LDTS LDTS_END
1 1 A 2001-01-01 2006-01-01
1 1 B 2014-06-05 9999-12-31
2 1 A 2006-01-01 2014-06-05
1:N => H_MID on link satellite
- join local in parallel
H_MID is the ensemble identifier !
Node 1
Node 2
Use the ensemble identifier if possible!
H_SID H_SID LDTS INFO
L_SID? H_SID H_MID H_SID ? L_SID ? LDTS INFO
Distributing data efficiently to ensure good
performance in a MPP database.
- If uneven distribution, one node may become a
bottleneck for the whole execution
Try to minimize data movement between nodes
- Data redistribution may occur when joining tables
Ensemble
Presenter:
Date:
Note:
Company:
eMail:
Twitter:
Remco Broekmans
June 5, 2014
Example for ReConnect
Coarem
Remco@Coarem.nl
RemcoBroekmans
SAP #Hana is a column store #database which
brings #efficiency in storage and access - #in-
memory.
SAP #Hana seems to benefit on their technical
#architecture in using 1 broad Satellite per
#Hub - #benefit no need for #PIT, less tables
Splitting #Sat’s in #rate-of-change as efficient
in storage as column store
#multiple Sat’s to prefer if data coming from
multiple sources (#write efficiency)
#referential join will only perform the join if
data from the joined tables is used create 1
#PIT per #Hub (not as #SQL view)
#Lesson: DV is #efficient way of storing data
#Lesson: #SQL views can’t be read by Hana
Studio
#Lesson: #Hana is still evolving

Contenu connexe

En vedette

Shorter time to insight more adaptable less costly bi with end to end modelst...
Shorter time to insight more adaptable less costly bi with end to end modelst...Shorter time to insight more adaptable less costly bi with end to end modelst...
Shorter time to insight more adaptable less costly bi with end to end modelst...Daniel Upton
 
Data vault seminar May 5-6 Dommel - The factory and the workshop
Data vault seminar May 5-6 Dommel - The factory and the workshopData vault seminar May 5-6 Dommel - The factory and the workshop
Data vault seminar May 5-6 Dommel - The factory and the workshopjohannesvdb
 
Lean Data Warehouse via Data Vault
Lean Data Warehouse via Data VaultLean Data Warehouse via Data Vault
Lean Data Warehouse via Data VaultDaniel Upton
 
Introduction To Data Vault - DAMA Oregon 2012
Introduction To Data Vault - DAMA Oregon 2012Introduction To Data Vault - DAMA Oregon 2012
Introduction To Data Vault - DAMA Oregon 2012Empowered Holdings, LLC
 
Agile BI via Data Vault and Modelstorming
Agile BI via Data Vault and ModelstormingAgile BI via Data Vault and Modelstorming
Agile BI via Data Vault and ModelstormingDaniel Upton
 
Metadaten und Data Vault (Meta Vault)
Metadaten und Data Vault (Meta Vault)Metadaten und Data Vault (Meta Vault)
Metadaten und Data Vault (Meta Vault)Andreas Buckenhofer
 
CDC und Data Vault für den Aufbau eines DWH in der Automobilindustrie
CDC und Data Vault für den Aufbau eines DWH in der AutomobilindustrieCDC und Data Vault für den Aufbau eines DWH in der Automobilindustrie
CDC und Data Vault für den Aufbau eines DWH in der AutomobilindustrieAndreas Buckenhofer
 
Data Vault: Data Warehouse Design Goes Agile
Data Vault: Data Warehouse Design Goes AgileData Vault: Data Warehouse Design Goes Agile
Data Vault: Data Warehouse Design Goes AgileDaniel Upton
 
Agile Data Warehouse Design for Big Data Presentation
Agile Data Warehouse Design for Big Data PresentationAgile Data Warehouse Design for Big Data Presentation
Agile Data Warehouse Design for Big Data PresentationVishal Kumar
 
Lessons in Data Modeling: Why a Data Model is an Important Part of Your Data ...
Lessons in Data Modeling: Why a Data Model is an Important Part of Your Data ...Lessons in Data Modeling: Why a Data Model is an Important Part of Your Data ...
Lessons in Data Modeling: Why a Data Model is an Important Part of Your Data ...DATAVERSITY
 
Big Data Analytics Strategy and Roadmap
Big Data Analytics Strategy and RoadmapBig Data Analytics Strategy and Roadmap
Big Data Analytics Strategy and RoadmapSrinath Perera
 
Wide-column Stores für Architekten (HBase, Cassandra)
Wide-column Stores für Architekten (HBase, Cassandra)Wide-column Stores für Architekten (HBase, Cassandra)
Wide-column Stores für Architekten (HBase, Cassandra)Andreas Buckenhofer
 

En vedette (15)

Shorter time to insight more adaptable less costly bi with end to end modelst...
Shorter time to insight more adaptable less costly bi with end to end modelst...Shorter time to insight more adaptable less costly bi with end to end modelst...
Shorter time to insight more adaptable less costly bi with end to end modelst...
 
Data vault seminar May 5-6 Dommel - The factory and the workshop
Data vault seminar May 5-6 Dommel - The factory and the workshopData vault seminar May 5-6 Dommel - The factory and the workshop
Data vault seminar May 5-6 Dommel - The factory and the workshop
 
Data Vault and DW2.0
Data Vault and DW2.0Data Vault and DW2.0
Data Vault and DW2.0
 
Data Vault Overview
Data Vault OverviewData Vault Overview
Data Vault Overview
 
Lean Data Warehouse via Data Vault
Lean Data Warehouse via Data VaultLean Data Warehouse via Data Vault
Lean Data Warehouse via Data Vault
 
Introduction To Data Vault - DAMA Oregon 2012
Introduction To Data Vault - DAMA Oregon 2012Introduction To Data Vault - DAMA Oregon 2012
Introduction To Data Vault - DAMA Oregon 2012
 
Agile BI via Data Vault and Modelstorming
Agile BI via Data Vault and ModelstormingAgile BI via Data Vault and Modelstorming
Agile BI via Data Vault and Modelstorming
 
Metadaten und Data Vault (Meta Vault)
Metadaten und Data Vault (Meta Vault)Metadaten und Data Vault (Meta Vault)
Metadaten und Data Vault (Meta Vault)
 
CDC und Data Vault für den Aufbau eines DWH in der Automobilindustrie
CDC und Data Vault für den Aufbau eines DWH in der AutomobilindustrieCDC und Data Vault für den Aufbau eines DWH in der Automobilindustrie
CDC und Data Vault für den Aufbau eines DWH in der Automobilindustrie
 
Data Vault: Data Warehouse Design Goes Agile
Data Vault: Data Warehouse Design Goes AgileData Vault: Data Warehouse Design Goes Agile
Data Vault: Data Warehouse Design Goes Agile
 
Agile Data Warehouse Design for Big Data Presentation
Agile Data Warehouse Design for Big Data PresentationAgile Data Warehouse Design for Big Data Presentation
Agile Data Warehouse Design for Big Data Presentation
 
8 Steps to Creating a Data Strategy
8 Steps to Creating a Data Strategy8 Steps to Creating a Data Strategy
8 Steps to Creating a Data Strategy
 
Lessons in Data Modeling: Why a Data Model is an Important Part of Your Data ...
Lessons in Data Modeling: Why a Data Model is an Important Part of Your Data ...Lessons in Data Modeling: Why a Data Model is an Important Part of Your Data ...
Lessons in Data Modeling: Why a Data Model is an Important Part of Your Data ...
 
Big Data Analytics Strategy and Roadmap
Big Data Analytics Strategy and RoadmapBig Data Analytics Strategy and Roadmap
Big Data Analytics Strategy and Roadmap
 
Wide-column Stores für Architekten (HBase, Cassandra)
Wide-column Stores für Architekten (HBase, Cassandra)Wide-column Stores für Architekten (HBase, Cassandra)
Wide-column Stores für Architekten (HBase, Cassandra)
 

Similaire à Data Vault ReConnect Speed Presenting PM Part Three

Ensemble model and mpp
Ensemble model and mppEnsemble model and mpp
Ensemble model and mppdelostilos
 
18.02.05_IAAI2018_Mobille Network Failure Event Detection and Forecasting wit...
18.02.05_IAAI2018_Mobille Network Failure Event Detection and Forecasting wit...18.02.05_IAAI2018_Mobille Network Failure Event Detection and Forecasting wit...
18.02.05_IAAI2018_Mobille Network Failure Event Detection and Forecasting wit...LINE Corp.
 
Initial acquisition in digital communication systems
Initial acquisition in digital communication systemsInitial acquisition in digital communication systems
Initial acquisition in digital communication systemsFuyun Ling
 
1st Technical Meeting - WP6
1st Technical Meeting - WP61st Technical Meeting - WP6
1st Technical Meeting - WP6SLOPE Project
 
Big Data LDN 2016: Data Warehouse Automation: Solve integration challenges, s...
Big Data LDN 2016: Data Warehouse Automation: Solve integration challenges, s...Big Data LDN 2016: Data Warehouse Automation: Solve integration challenges, s...
Big Data LDN 2016: Data Warehouse Automation: Solve integration challenges, s...Matt Stubbs
 
pi-Lisco: Parallel and Incremental Stream-Based Point-Cloud Clustering
pi-Lisco: Parallel and Incremental Stream-Based Point-Cloud Clusteringpi-Lisco: Parallel and Incremental Stream-Based Point-Cloud Clustering
pi-Lisco: Parallel and Incremental Stream-Based Point-Cloud ClusteringHannaneh Najdataei
 
Entity Summarization with User Feedback (ESWC 2020)
Entity Summarization with User Feedback (ESWC 2020)Entity Summarization with User Feedback (ESWC 2020)
Entity Summarization with User Feedback (ESWC 2020)Qingxia Liu
 
Nix for etl using scripting to automate data cleaning & transformation
Nix for etl using scripting to automate data cleaning & transformationNix for etl using scripting to automate data cleaning & transformation
Nix for etl using scripting to automate data cleaning & transformationLynchpin Analytics Consultancy
 
Forecasting QuestionsStudent NameUniversity Affiliate.docx
Forecasting QuestionsStudent NameUniversity Affiliate.docxForecasting QuestionsStudent NameUniversity Affiliate.docx
Forecasting QuestionsStudent NameUniversity Affiliate.docxalisoncarleen
 
The Search and Hyperlinking Task at MediaEval 2014
The Search and Hyperlinking Task at MediaEval 2014The Search and Hyperlinking Task at MediaEval 2014
The Search and Hyperlinking Task at MediaEval 2014multimediaeval
 
Kick-Off Meeting - WP6
Kick-Off Meeting - WP6Kick-Off Meeting - WP6
Kick-Off Meeting - WP6SLOPE Project
 
Scalable Machine Learning: The Role of Stratified Data Sharding
Scalable Machine Learning: The Role of Stratified Data ShardingScalable Machine Learning: The Role of Stratified Data Sharding
Scalable Machine Learning: The Role of Stratified Data Shardinginside-BigData.com
 
An Exploration of Ranking-based Strategy for Contextual Suggestions
An Exploration of Ranking-based Strategy for Contextual SuggestionsAn Exploration of Ranking-based Strategy for Contextual Suggestions
An Exploration of Ranking-based Strategy for Contextual SuggestionsTwitter Inc.
 
Slides: Safeguarding Abila through Multiple Data Perspectives
Slides: Safeguarding Abila through Multiple Data PerspectivesSlides: Safeguarding Abila through Multiple Data Perspectives
Slides: Safeguarding Abila through Multiple Data PerspectivesParang Saraf
 
Alex Tellez, Deep Learning Applications
Alex Tellez, Deep Learning ApplicationsAlex Tellez, Deep Learning Applications
Alex Tellez, Deep Learning ApplicationsSri Ambati
 
Context-aware Recommendation: A Quick View
Context-aware Recommendation: A Quick ViewContext-aware Recommendation: A Quick View
Context-aware Recommendation: A Quick ViewYONG ZHENG
 
MLSD18. Unsupervised Learning
MLSD18. Unsupervised LearningMLSD18. Unsupervised Learning
MLSD18. Unsupervised LearningBigML, Inc
 
Building and deploying analytics
Building and deploying analyticsBuilding and deploying analytics
Building and deploying analyticsCollin Bennett
 

Similaire à Data Vault ReConnect Speed Presenting PM Part Three (20)

Ensemble model and mpp
Ensemble model and mppEnsemble model and mpp
Ensemble model and mpp
 
18.02.05_IAAI2018_Mobille Network Failure Event Detection and Forecasting wit...
18.02.05_IAAI2018_Mobille Network Failure Event Detection and Forecasting wit...18.02.05_IAAI2018_Mobille Network Failure Event Detection and Forecasting wit...
18.02.05_IAAI2018_Mobille Network Failure Event Detection and Forecasting wit...
 
Initial acquisition in digital communication systems
Initial acquisition in digital communication systemsInitial acquisition in digital communication systems
Initial acquisition in digital communication systems
 
1st Technical Meeting - WP6
1st Technical Meeting - WP61st Technical Meeting - WP6
1st Technical Meeting - WP6
 
Ten-T data report
Ten-T data reportTen-T data report
Ten-T data report
 
Big Data LDN 2016: Data Warehouse Automation: Solve integration challenges, s...
Big Data LDN 2016: Data Warehouse Automation: Solve integration challenges, s...Big Data LDN 2016: Data Warehouse Automation: Solve integration challenges, s...
Big Data LDN 2016: Data Warehouse Automation: Solve integration challenges, s...
 
pi-Lisco: Parallel and Incremental Stream-Based Point-Cloud Clustering
pi-Lisco: Parallel and Incremental Stream-Based Point-Cloud Clusteringpi-Lisco: Parallel and Incremental Stream-Based Point-Cloud Clustering
pi-Lisco: Parallel and Incremental Stream-Based Point-Cloud Clustering
 
Entity Summarization with User Feedback (ESWC 2020)
Entity Summarization with User Feedback (ESWC 2020)Entity Summarization with User Feedback (ESWC 2020)
Entity Summarization with User Feedback (ESWC 2020)
 
Nix for etl using scripting to automate data cleaning & transformation
Nix for etl using scripting to automate data cleaning & transformationNix for etl using scripting to automate data cleaning & transformation
Nix for etl using scripting to automate data cleaning & transformation
 
Forecasting QuestionsStudent NameUniversity Affiliate.docx
Forecasting QuestionsStudent NameUniversity Affiliate.docxForecasting QuestionsStudent NameUniversity Affiliate.docx
Forecasting QuestionsStudent NameUniversity Affiliate.docx
 
The Search and Hyperlinking Task at MediaEval 2014
The Search and Hyperlinking Task at MediaEval 2014The Search and Hyperlinking Task at MediaEval 2014
The Search and Hyperlinking Task at MediaEval 2014
 
Kick-Off Meeting - WP6
Kick-Off Meeting - WP6Kick-Off Meeting - WP6
Kick-Off Meeting - WP6
 
Scalable Machine Learning: The Role of Stratified Data Sharding
Scalable Machine Learning: The Role of Stratified Data ShardingScalable Machine Learning: The Role of Stratified Data Sharding
Scalable Machine Learning: The Role of Stratified Data Sharding
 
An Exploration of Ranking-based Strategy for Contextual Suggestions
An Exploration of Ranking-based Strategy for Contextual SuggestionsAn Exploration of Ranking-based Strategy for Contextual Suggestions
An Exploration of Ranking-based Strategy for Contextual Suggestions
 
Slides: Safeguarding Abila through Multiple Data Perspectives
Slides: Safeguarding Abila through Multiple Data PerspectivesSlides: Safeguarding Abila through Multiple Data Perspectives
Slides: Safeguarding Abila through Multiple Data Perspectives
 
Practical data analysis with wine
Practical data analysis with winePractical data analysis with wine
Practical data analysis with wine
 
Alex Tellez, Deep Learning Applications
Alex Tellez, Deep Learning ApplicationsAlex Tellez, Deep Learning Applications
Alex Tellez, Deep Learning Applications
 
Context-aware Recommendation: A Quick View
Context-aware Recommendation: A Quick ViewContext-aware Recommendation: A Quick View
Context-aware Recommendation: A Quick View
 
MLSD18. Unsupervised Learning
MLSD18. Unsupervised LearningMLSD18. Unsupervised Learning
MLSD18. Unsupervised Learning
 
Building and deploying analytics
Building and deploying analyticsBuilding and deploying analytics
Building and deploying analytics
 

Dernier

Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...amitlee9823
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...amitlee9823
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...amitlee9823
 
Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramMoniSankarHazra
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Delhi Call girls
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysismanisha194592
 
Call Girls In Nandini Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Nandini Layout ☎ 7737669865 🥵 Book Your One night StandCall Girls In Nandini Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Nandini Layout ☎ 7737669865 🥵 Book Your One night Standamitlee9823
 
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangaloreamitlee9823
 
Probability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter LessonsProbability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter LessonsJoseMangaJr1
 
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night StandCall Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Standamitlee9823
 
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...karishmasinghjnh
 
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...amitlee9823
 
➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men 🔝Mathura🔝 Escorts...
➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men  🔝Mathura🔝   Escorts...➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men  🔝Mathura🔝   Escorts...
➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men 🔝Mathura🔝 Escorts...amitlee9823
 
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...amitlee9823
 
hybrid Seed Production In Chilli & Capsicum.pptx
hybrid Seed Production In Chilli & Capsicum.pptxhybrid Seed Production In Chilli & Capsicum.pptx
hybrid Seed Production In Chilli & Capsicum.pptx9to5mart
 
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night StandCall Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Standamitlee9823
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxolyaivanovalion
 

Dernier (20)

Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts ServiceCall Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
 
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
 
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
 
Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics Program
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 
Call Girls In Nandini Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Nandini Layout ☎ 7737669865 🥵 Book Your One night StandCall Girls In Nandini Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Nandini Layout ☎ 7737669865 🥵 Book Your One night Stand
 
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
 
Probability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter LessonsProbability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter Lessons
 
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night StandCall Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
 
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...
 
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
 
➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men 🔝Mathura🔝 Escorts...
➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men  🔝Mathura🔝   Escorts...➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men  🔝Mathura🔝   Escorts...
➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men 🔝Mathura🔝 Escorts...
 
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
 
hybrid Seed Production In Chilli & Capsicum.pptx
hybrid Seed Production In Chilli & Capsicum.pptxhybrid Seed Production In Chilli & Capsicum.pptx
hybrid Seed Production In Chilli & Capsicum.pptx
 
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night StandCall Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
 
Abortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Jeddah | +966572737505 | Get CytotecAbortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Jeddah | +966572737505 | Get Cytotec
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFx
 

Data Vault ReConnect Speed Presenting PM Part Three

  • 1. Presenter: Date: Note: Company: eMail: Marc Bouma June 5, 2014 UMC Utrecht m.c.bouma@umcutrecht.nl
  • 2. Our Dot on the Horizon - Central point for delivering healthcare processes data for medical research - Integrate various sources - Historize, trace and pseudonymize all data used
  • 3. Our Journey - Learning and adapting to Data Vault  not everybody is a modeler (Shu Ha Ri) - Script, code, build, try, test, throw away and start again - Testing overrated? - Architectureimprovements  Performance issues SAS/Microsoft  Performance issues loading scripts  Automate DV load - From Chaos to SCRUM
  • 4. Our Obstacles - Registration for healthcare process vs. usability for research - Questionnaires: sources or generic models? - Performance:  Do we really need all complete texts?  Do we really need 20 years of lab results? - The usual: conflicting interests,politics etc.
  • 5. Our preliminary results - 2013: selection of 5 major Studies as starting showcasesproved difficult - 2014: had to choose 5 new showcasesfrom 25 applicants - Started as Research Data Platform, now growth towards Enterprise Data Platform (including Education and BI) - Architecturenow stable
  • 6. Lessons learned • Automate when possible • Invest in a team of skilled pioneers • Models rule everything • Adapt agility, teach agility
  • 7. Presenter: Date: Note: Company: eMail: Twitter: Sander Robijns June 5, 2014 Estrenuo BVBA sander.robijns@gmail.com @srobijns
  • 9. The Current Approach Using recursive links on hubs to identify the same-as relationship
  • 10. The Struggle Getting the facts reported under a single business key
  • 11. The Future Approach Master Data Management will take away some of the struggles
  • 12. The Lesson Learned Get the enterprise-wide business keys in place first using data governance
  • 14. Groups of Links: context at hospital Imagine the following: • An operation (surgery) is executed by a group of people (first surgeon, second surgeon, assistant,anesthiologist, etc.) • An operation is planned a couple of weeks in advance • Whenever the planning changes in the source the complete group is sent to the EDW
  • 15. Group of Links: the Data {Time} operation_no employee_no role T=1 19354 John OP1 19354 Jane OP2 19354 Chris ANA T=2 19354 John OP1 19354 Mary ANA T=3 19354 Jane OP1 19354 Chris ANA Please note: the actual operation with operation_no 19354 is executed by Jane (OP1) and Chris (ANA)
  • 16. Groups of Links: the Problem Standard Data Vault loading routines cannot handle this situation: operation_no employee_no role load_dts 19354 John OP1 T=1 19354 Jane OP2 T=1 19354 Chris ANA T=1 19354 Mary ANA T=2 19354 Jane OP1 T=3
  • 17. Groups of Links: the Problem Using end-dating of a link (preferable a validity satellite) cannot handle this problem either: operation_no employee_no role load_dts Active? 19354 John OP1 T=1 No (T=3) 19354 Jane OP2 T=1 Yes 19354 Chris ANA T=1 No (T=3) 19354 Mary ANA T=2 Yes 19354 Jane OP1 T=3 Yes BK of link used: operation_no + role
  • 18. Groups of Links: our solution 1. Add a validity satellite to the link (for end-dating) 2. Tell the meta data of the automatin tool this is a group validity satellite with BK=operation_no 3. Whenever an existing operation_no is present in the staging layer set all current links to Active=No 4. Process as usual • Remark: because the same row can come back (i.e. John/OP1) it will be set to Active=No and Active=Yes at the same time there can be no unique index on BK of Validity satellite and some cleaning up is required after loading
  • 19. Groups of Links: special thanks to … St. Antonius Hospital (for having the problem) Edwin Weber (for coding the solution) Get your copy of the solution: http://sourceforge.net/projects/pdidatavaultf w/
  • 20. Presenter: Date: Note: Company: eMail: Twitter: Juan-Josévan der Linden June 5, 2014 DV, MPP QOSQO juanjose.vanderlinden@qosqo.nl @delostilos
  • 21. SMP => MPP => AMPP SMP Symmetric Processing MPP Massively Parallel Processing AMPP Asymmetric MPP ( SMP + MPP)
  • 22. Primary key => distribution key  hub -< satellite join - data redistribution - join local in parallel BK SID Ensemble 1 Dimensional 2 SID LDTS INFO 1 2001-01-01 My first DV 1 2014-06-05 DV Masters 2 1997-08-02 DM manifesto Node 1 Node 2
  • 23. Hub SID => distribution key  hub -< satellite join - join local in parallel BK SID Ensemble 1 Dimensional 2 SID LDTS INFO 1 2001-01-01 First DV 1 2014-06-05 DV Masters 2 1997-08-02 DM manifesto Node 1 Node 2
  • 24. Link SID => distribution key  Default L_SID, 1:N & N:M - data redistribution - join local in parallel H_MID H_SID L_SID 1 A 1 1 B 2 L_SID LDTS LDTS_END CURRENT 1 2001-01-01 2006-01-01 N 1 2014-06-05 9999-12-31 Y 2 2006-01-01 2014-06-05 N H_MID H_SID L_SID 1 A 1 1 B 2 L_SID H_MID H_SID LDTS LDTS_END 1 1 A 2001-01-01 2006-01-01 1 1 B 2014-06-05 9999-12-31 2 1 A 2006-01-01 2014-06-05 1:N => H_MID on link satellite - join local in parallel H_MID is the ensemble identifier ! Node 1 Node 2
  • 25. Use the ensemble identifier if possible! H_SID H_SID LDTS INFO L_SID? H_SID H_MID H_SID ? L_SID ? LDTS INFO Distributing data efficiently to ensure good performance in a MPP database. - If uneven distribution, one node may become a bottleneck for the whole execution Try to minimize data movement between nodes - Data redistribution may occur when joining tables Ensemble
  • 26. Presenter: Date: Note: Company: eMail: Twitter: Remco Broekmans June 5, 2014 Example for ReConnect Coarem Remco@Coarem.nl RemcoBroekmans
  • 27. SAP #Hana is a column store #database which brings #efficiency in storage and access - #in- memory.
  • 28. SAP #Hana seems to benefit on their technical #architecture in using 1 broad Satellite per #Hub - #benefit no need for #PIT, less tables
  • 29. Splitting #Sat’s in #rate-of-change as efficient in storage as column store #multiple Sat’s to prefer if data coming from multiple sources (#write efficiency)
  • 30. #referential join will only perform the join if data from the joined tables is used create 1 #PIT per #Hub (not as #SQL view)
  • 31. #Lesson: DV is #efficient way of storing data #Lesson: #SQL views can’t be read by Hana Studio #Lesson: #Hana is still evolving