SlideShare une entreprise Scribd logo
1  sur  20
Developing a Strategy for
Data Lake Governance
Tony Baer, Principal Analyst, Information Management
tony.baer@ovum.com
@TonyBaer
Ovum | TMT intelligence | informa2 Copyright © Informa PLC
Agenda
 Why are we having this conversation?
 Why is governance critical?
 How to govern the Data Lake?
Ovum | TMT intelligence | informa3 Copyright © Informa PLC
Let’s go to the polls
Where is your organization on the Data Lake journey?
Check one Already
implementing
Starting to
implement
Considering
implementation
No current plans
to implement
Ovum | TMT intelligence | informa4 Copyright © Informa PLC
Getting
the data
Profiting
from data
Seeking value from Big Data:
The Journey
Ovum | TMT intelligence | informa5 Copyright © Informa PLC
Getting
the data
Profiting
from data
Seeking value from Big Data:
The Journey
Core assumption: The Data Lake is a shared
enterprise resource
Ovum | TMT intelligence | informa6 Copyright © Informa PLC
Group
Log analytics
Sentiment Analysis
DW offload
The journey to Data Lake starts small
Ovum | TMT intelligence | informa7 Copyright © Informa PLC
Group Multi-department
Log analytics
Sentiment Analysis
DW offload
Exploratory Analytics
LOB analytic applications
Operational analytics
Success spreads…
Ovum | TMT intelligence | informa8 Copyright © Informa PLC
Group Multi-department Enterprise
Log analytics
Sentiment Analysis
DW offload
Data Lake
Exploratory Analytics
Line of business analytic applications
Operational analytics
The Data Lake is the culmination of the journey, not the start
Ovum | TMT intelligence | informa9 Copyright © Informa PLC
Why is governance critical?
Costs out of control
Ovum | TMT intelligence | informa10 Copyright © Informa PLC
Why is governance critical?
Costs out of control
Privacy, legal &
regulatory
compliance issues
Ovum | TMT intelligence | informa11 Copyright © Informa PLC
Why is governance critical?
Costs out of control
Privacy, legal &
regulatory
compliance issues
Untrustworthy
data
Ovum | TMT intelligence | informa12 Copyright © Informa PLC
How to govern the Data Lake
How to make the content
of your data lake
transparent?
Ovum | TMT intelligence | informa13 Copyright © Informa PLC
Availability/Reliability
(FT,HA,BackupDR)
Monitoring&troubleshooting
Perimeter
Security
END USER TIER
Data Lake building block
Hadoop platform management
End user tool
Data Lake governance reference architecture
DATA INVENTORY TIER
DATA SECURITY TIER
OPTIMIZATION TIER
DATA PLATFORM TIER
Ovum | TMT intelligence | informa14 Copyright © Informa PLC
Availability/Reliability
(FT,HA,BackupDR)
Monitoring&troubleshooting
Perimeter
Security
Data platform (Hadoop)
Query/Analytics tools, programs
Cost Optimization & Integration
Physical Inventory
Curation
Data-level security
Self-
service
tier
Data Lake building block
Hadoop platform management
End user tool
Data Lake governance functions
Ovum | TMT intelligence | informa15 Copyright © Informa PLC
Curation
Build your library of
information
Physical Inventory
Know/manage what data is in the
data lake
Data profiling, data preparation,
collaborative data enrichment,
catalog, match data, derive master
data, record data lineage
Business & Analytics teams Technology team
Manage data access, track data
lineage, tag for security, data
retention
Manage data access, tag for
security, data retention, lifecycle &
workflow, track data lineage
Data Inventory tier
Ovum | TMT intelligence | informa16 Copyright © Informa PLC
Data Security & Data Lake Optimization tiers
 Security
 Data Protection – policy-based masking,
encryption
 Authorization, accounting & access control
(AAA)
 Perimeter security & remote authentication are
functions of the core data platform
 Optimization
 Integration with other data platforms
 Import/Export
 Remote/federated/pushdown query processing
 Lifecycle/workflow
 Data retention policy?
 Storage tiering?
Ovum | TMT intelligence | informa17 Copyright © Informa PLC
Governance: How Data Lakes compare to EDWs
90%
50%
90%
50%
30% confidence level
 EDW provides good starting point
 Core building blocks of governance are similar, but
approaches differ
 Data Inventory
 Flexible, evolving schema
 Quality critical, but adjust to need
 Business users exert key roles
 IT still provides adult supervision
 Security
 Greater varieties of data, use of external data sources, and
(arguably) broader user constituencies demand more granular
approaches to data protection
 Optimization
 Just as important as any EDW. Workloads must be prioritized
 Lifecycle
 Sleeper issue for Data Lakes
90%
Ovum | TMT intelligence | informa18 Copyright © Informa PLC
Takeaways
 The Data Lake is a shared enterprise resource
 It is a later, mature stage of Hadoop adoption
 Exploratory analytics is a great way to sell business users on the value proposition of
the Data Lake
 Why governance? Because the data lake is an enterprise data resource
 Governance will adapt & extend practices from EDW
 Greater variety of data sources demands greater scrutiny for security, data retention &
lifecycle management practices
 Data lineage is critical!!!
 Like any enterprise data platform, workloads must be prioritized
Ovum | TMT intelligence | informa19 Copyright © Informa PLC
There is no silver bullet recipe for Data
Lake Governance
Ovum | TMT intelligence | informa20 Copyright © Informa PLC
Thank you
Tony Baer
Ovum
(646) 546-5330
tony.baer@ovum.com
Twitter: @TonyBaer

Contenu connexe

Tendances

Verizon: Finance Data Lake implementation as a Self Service Discovery Big Dat...
Verizon: Finance Data Lake implementation as a Self Service Discovery Big Dat...Verizon: Finance Data Lake implementation as a Self Service Discovery Big Dat...
Verizon: Finance Data Lake implementation as a Self Service Discovery Big Dat...
DataWorks Summit
 

Tendances (20)

Making Big Data Easy for Everyone
Making Big Data Easy for EveryoneMaking Big Data Easy for Everyone
Making Big Data Easy for Everyone
 
Designing the Next Generation Data Lake
Designing the Next Generation Data LakeDesigning the Next Generation Data Lake
Designing the Next Generation Data Lake
 
The Emerging Data Lake IT Strategy
The Emerging Data Lake IT StrategyThe Emerging Data Lake IT Strategy
The Emerging Data Lake IT Strategy
 
Flash session -streaming--ses1243-lon
Flash session -streaming--ses1243-lonFlash session -streaming--ses1243-lon
Flash session -streaming--ses1243-lon
 
Verizon: Finance Data Lake implementation as a Self Service Discovery Big Dat...
Verizon: Finance Data Lake implementation as a Self Service Discovery Big Dat...Verizon: Finance Data Lake implementation as a Self Service Discovery Big Dat...
Verizon: Finance Data Lake implementation as a Self Service Discovery Big Dat...
 
Unlocking Big Data Silos in the Enterprise or the Cloud (Con7877)
Unlocking Big Data Silos in the Enterprise or the Cloud (Con7877)Unlocking Big Data Silos in the Enterprise or the Cloud (Con7877)
Unlocking Big Data Silos in the Enterprise or the Cloud (Con7877)
 
Citizens Bank: Data Lake Implementation – Selecting BigInsights ViON Spark/Ha...
Citizens Bank: Data Lake Implementation – Selecting BigInsights ViON Spark/Ha...Citizens Bank: Data Lake Implementation – Selecting BigInsights ViON Spark/Ha...
Citizens Bank: Data Lake Implementation – Selecting BigInsights ViON Spark/Ha...
 
Stream based Data Integration
Stream based Data IntegrationStream based Data Integration
Stream based Data Integration
 
Fast Data:The Rebirth of Streaming Analytics
Fast Data:The Rebirth of Streaming AnalyticsFast Data:The Rebirth of Streaming Analytics
Fast Data:The Rebirth of Streaming Analytics
 
Incorporating the Data Lake into Your Analytic Architecture
Incorporating the Data Lake into Your Analytic ArchitectureIncorporating the Data Lake into Your Analytic Architecture
Incorporating the Data Lake into Your Analytic Architecture
 
Deploying a Governed Data Lake
Deploying a Governed Data LakeDeploying a Governed Data Lake
Deploying a Governed Data Lake
 
How to build a successful Data Lake
How to build a successful Data LakeHow to build a successful Data Lake
How to build a successful Data Lake
 
Ovum Fireside Chat: Governing the data lake - Understanding what's in there
Ovum Fireside Chat: Governing the data lake - Understanding what's in thereOvum Fireside Chat: Governing the data lake - Understanding what's in there
Ovum Fireside Chat: Governing the data lake - Understanding what's in there
 
One Slide Overview: ORCL Big Data Integration and Governance
One Slide Overview: ORCL Big Data Integration and GovernanceOne Slide Overview: ORCL Big Data Integration and Governance
One Slide Overview: ORCL Big Data Integration and Governance
 
Webinar - Data Lake Management: Extending Storage and Lifecycle of Data
Webinar - Data Lake Management: Extending Storage and Lifecycle of DataWebinar - Data Lake Management: Extending Storage and Lifecycle of Data
Webinar - Data Lake Management: Extending Storage and Lifecycle of Data
 
Alexandre Vasseur - Evolution of Data Architectures: From Hadoop to Data Lake...
Alexandre Vasseur - Evolution of Data Architectures: From Hadoop to Data Lake...Alexandre Vasseur - Evolution of Data Architectures: From Hadoop to Data Lake...
Alexandre Vasseur - Evolution of Data Architectures: From Hadoop to Data Lake...
 
Traditional data warehouse vs data lake
Traditional data warehouse vs data lakeTraditional data warehouse vs data lake
Traditional data warehouse vs data lake
 
Using Machine Learning to Capture Data Meaning and Wrangle it to Liberate its...
Using Machine Learning to Capture Data Meaning and Wrangle it to Liberate its...Using Machine Learning to Capture Data Meaning and Wrangle it to Liberate its...
Using Machine Learning to Capture Data Meaning and Wrangle it to Liberate its...
 
Hadoop Big Data Lakes Keynote
Hadoop Big Data Lakes KeynoteHadoop Big Data Lakes Keynote
Hadoop Big Data Lakes Keynote
 
Fast and Furious: From POC to an Enterprise Big Data Stack in 2014
Fast and Furious: From POC to an Enterprise Big Data Stack in 2014Fast and Furious: From POC to an Enterprise Big Data Stack in 2014
Fast and Furious: From POC to an Enterprise Big Data Stack in 2014
 

Similaire à Developing a Strategy for Data Lake Governance

CWIN17 India / Bigdata architecture yashowardhan sowale
CWIN17 India / Bigdata architecture  yashowardhan sowaleCWIN17 India / Bigdata architecture  yashowardhan sowale
CWIN17 India / Bigdata architecture yashowardhan sowale
Capgemini
 
IT for Management On-Demand Strategies for Performance, Growth,.docx
IT for Management On-Demand Strategies for Performance, Growth,.docxIT for Management On-Demand Strategies for Performance, Growth,.docx
IT for Management On-Demand Strategies for Performance, Growth,.docx
vrickens
 

Similaire à Developing a Strategy for Data Lake Governance (20)

Who changed my data? Need for data governance and provenance in a streaming w...
Who changed my data? Need for data governance and provenance in a streaming w...Who changed my data? Need for data governance and provenance in a streaming w...
Who changed my data? Need for data governance and provenance in a streaming w...
 
Innovation Without Compromise: The Challenges of Securing Big Data
Innovation Without Compromise: The Challenges of Securing Big DataInnovation Without Compromise: The Challenges of Securing Big Data
Innovation Without Compromise: The Challenges of Securing Big Data
 
Active Governance Across the Delta Lake with Alation
Active Governance Across the Delta Lake with AlationActive Governance Across the Delta Lake with Alation
Active Governance Across the Delta Lake with Alation
 
Composable data for the composable enterprise
Composable data for the composable enterpriseComposable data for the composable enterprise
Composable data for the composable enterprise
 
Making Big Data a First Class citizen in the enterprise
Making Big Data a First Class citizen in the enterpriseMaking Big Data a First Class citizen in the enterprise
Making Big Data a First Class citizen in the enterprise
 
Modernizing Architecture for a Complete Data Strategy
Modernizing Architecture for a Complete Data StrategyModernizing Architecture for a Complete Data Strategy
Modernizing Architecture for a Complete Data Strategy
 
Big Data and BI Tools - BI Reporting for Bay Area Startups User Group
Big Data and BI Tools - BI Reporting for Bay Area Startups User GroupBig Data and BI Tools - BI Reporting for Bay Area Startups User Group
Big Data and BI Tools - BI Reporting for Bay Area Startups User Group
 
Benefits of a data lake
Benefits of a data lake Benefits of a data lake
Benefits of a data lake
 
CWIN17 India / Bigdata architecture yashowardhan sowale
CWIN17 India / Bigdata architecture  yashowardhan sowaleCWIN17 India / Bigdata architecture  yashowardhan sowale
CWIN17 India / Bigdata architecture yashowardhan sowale
 
Ibm big data-platform
Ibm big data-platformIbm big data-platform
Ibm big data-platform
 
Oracle Big Data Governance Webcast Charts
Oracle Big Data Governance Webcast ChartsOracle Big Data Governance Webcast Charts
Oracle Big Data Governance Webcast Charts
 
Enterprise Archiving with Apache Hadoop Featuring the 2015 Gartner Magic Quad...
Enterprise Archiving with Apache Hadoop Featuring the 2015 Gartner Magic Quad...Enterprise Archiving with Apache Hadoop Featuring the 2015 Gartner Magic Quad...
Enterprise Archiving with Apache Hadoop Featuring the 2015 Gartner Magic Quad...
 
IT for Management On-Demand Strategies for Performance, Growth,.docx
IT for Management On-Demand Strategies for Performance, Growth,.docxIT for Management On-Demand Strategies for Performance, Growth,.docx
IT for Management On-Demand Strategies for Performance, Growth,.docx
 
Operationalizing Data Analytics
Operationalizing Data AnalyticsOperationalizing Data Analytics
Operationalizing Data Analytics
 
Hortonworks DataFlow & Apache Nifi @Oslo Hadoop Big Data
Hortonworks DataFlow & Apache Nifi @Oslo Hadoop Big DataHortonworks DataFlow & Apache Nifi @Oslo Hadoop Big Data
Hortonworks DataFlow & Apache Nifi @Oslo Hadoop Big Data
 
Fbdl enabling comprehensive_data_services
Fbdl enabling comprehensive_data_servicesFbdl enabling comprehensive_data_services
Fbdl enabling comprehensive_data_services
 
Big Data analytics per le IT Operations
Big Data analytics per le IT OperationsBig Data analytics per le IT Operations
Big Data analytics per le IT Operations
 
2015 02 12 talend hortonworks webinar challenges to hadoop adoption
2015 02 12 talend hortonworks webinar challenges to hadoop adoption2015 02 12 talend hortonworks webinar challenges to hadoop adoption
2015 02 12 talend hortonworks webinar challenges to hadoop adoption
 
Future of Data Strategy (ASEAN)
Future of Data Strategy (ASEAN)Future of Data Strategy (ASEAN)
Future of Data Strategy (ASEAN)
 
Perspectives on Ethical Big Data Governance
Perspectives on Ethical Big Data GovernancePerspectives on Ethical Big Data Governance
Perspectives on Ethical Big Data Governance
 

Dernier

➥🔝 7737669865 🔝▻ Thrissur Call-girls in Women Seeking Men 🔝Thrissur🔝 Escor...
➥🔝 7737669865 🔝▻ Thrissur Call-girls in Women Seeking Men  🔝Thrissur🔝   Escor...➥🔝 7737669865 🔝▻ Thrissur Call-girls in Women Seeking Men  🔝Thrissur🔝   Escor...
➥🔝 7737669865 🔝▻ Thrissur Call-girls in Women Seeking Men 🔝Thrissur🔝 Escor...
amitlee9823
 
➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men 🔝Mathura🔝 Escorts...
➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men  🔝Mathura🔝   Escorts...➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men  🔝Mathura🔝   Escorts...
➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men 🔝Mathura🔝 Escorts...
amitlee9823
 
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night StandCall Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
amitlee9823
 
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night StandCall Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
amitlee9823
 
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Riyadh +966572737505 get cytotec
 
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
amitlee9823
 
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
amitlee9823
 
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night StandCall Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
amitlee9823
 
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
amitlee9823
 
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
amitlee9823
 
Abortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Jeddah | +966572737505 | Get CytotecAbortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Riyadh +966572737505 get cytotec
 
Probability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter LessonsProbability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter Lessons
JoseMangaJr1
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
amitlee9823
 
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
amitlee9823
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
amitlee9823
 

Dernier (20)

➥🔝 7737669865 🔝▻ Thrissur Call-girls in Women Seeking Men 🔝Thrissur🔝 Escor...
➥🔝 7737669865 🔝▻ Thrissur Call-girls in Women Seeking Men  🔝Thrissur🔝   Escor...➥🔝 7737669865 🔝▻ Thrissur Call-girls in Women Seeking Men  🔝Thrissur🔝   Escor...
➥🔝 7737669865 🔝▻ Thrissur Call-girls in Women Seeking Men 🔝Thrissur🔝 Escor...
 
➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men 🔝Mathura🔝 Escorts...
➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men  🔝Mathura🔝   Escorts...➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men  🔝Mathura🔝   Escorts...
➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men 🔝Mathura🔝 Escorts...
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
 
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night StandCall Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
 
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night StandCall Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
 
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
 
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
 
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
 
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night StandCall Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
 
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
 
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
 
Abortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Jeddah | +966572737505 | Get CytotecAbortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Jeddah | +966572737505 | Get Cytotec
 
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
 
Probability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter LessonsProbability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter Lessons
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFx
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
 
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
 
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
 

Developing a Strategy for Data Lake Governance

  • 1. Developing a Strategy for Data Lake Governance Tony Baer, Principal Analyst, Information Management tony.baer@ovum.com @TonyBaer
  • 2. Ovum | TMT intelligence | informa2 Copyright © Informa PLC Agenda  Why are we having this conversation?  Why is governance critical?  How to govern the Data Lake?
  • 3. Ovum | TMT intelligence | informa3 Copyright © Informa PLC Let’s go to the polls Where is your organization on the Data Lake journey? Check one Already implementing Starting to implement Considering implementation No current plans to implement
  • 4. Ovum | TMT intelligence | informa4 Copyright © Informa PLC Getting the data Profiting from data Seeking value from Big Data: The Journey
  • 5. Ovum | TMT intelligence | informa5 Copyright © Informa PLC Getting the data Profiting from data Seeking value from Big Data: The Journey Core assumption: The Data Lake is a shared enterprise resource
  • 6. Ovum | TMT intelligence | informa6 Copyright © Informa PLC Group Log analytics Sentiment Analysis DW offload The journey to Data Lake starts small
  • 7. Ovum | TMT intelligence | informa7 Copyright © Informa PLC Group Multi-department Log analytics Sentiment Analysis DW offload Exploratory Analytics LOB analytic applications Operational analytics Success spreads…
  • 8. Ovum | TMT intelligence | informa8 Copyright © Informa PLC Group Multi-department Enterprise Log analytics Sentiment Analysis DW offload Data Lake Exploratory Analytics Line of business analytic applications Operational analytics The Data Lake is the culmination of the journey, not the start
  • 9. Ovum | TMT intelligence | informa9 Copyright © Informa PLC Why is governance critical? Costs out of control
  • 10. Ovum | TMT intelligence | informa10 Copyright © Informa PLC Why is governance critical? Costs out of control Privacy, legal & regulatory compliance issues
  • 11. Ovum | TMT intelligence | informa11 Copyright © Informa PLC Why is governance critical? Costs out of control Privacy, legal & regulatory compliance issues Untrustworthy data
  • 12. Ovum | TMT intelligence | informa12 Copyright © Informa PLC How to govern the Data Lake How to make the content of your data lake transparent?
  • 13. Ovum | TMT intelligence | informa13 Copyright © Informa PLC Availability/Reliability (FT,HA,BackupDR) Monitoring&troubleshooting Perimeter Security END USER TIER Data Lake building block Hadoop platform management End user tool Data Lake governance reference architecture DATA INVENTORY TIER DATA SECURITY TIER OPTIMIZATION TIER DATA PLATFORM TIER
  • 14. Ovum | TMT intelligence | informa14 Copyright © Informa PLC Availability/Reliability (FT,HA,BackupDR) Monitoring&troubleshooting Perimeter Security Data platform (Hadoop) Query/Analytics tools, programs Cost Optimization & Integration Physical Inventory Curation Data-level security Self- service tier Data Lake building block Hadoop platform management End user tool Data Lake governance functions
  • 15. Ovum | TMT intelligence | informa15 Copyright © Informa PLC Curation Build your library of information Physical Inventory Know/manage what data is in the data lake Data profiling, data preparation, collaborative data enrichment, catalog, match data, derive master data, record data lineage Business & Analytics teams Technology team Manage data access, track data lineage, tag for security, data retention Manage data access, tag for security, data retention, lifecycle & workflow, track data lineage Data Inventory tier
  • 16. Ovum | TMT intelligence | informa16 Copyright © Informa PLC Data Security & Data Lake Optimization tiers  Security  Data Protection – policy-based masking, encryption  Authorization, accounting & access control (AAA)  Perimeter security & remote authentication are functions of the core data platform  Optimization  Integration with other data platforms  Import/Export  Remote/federated/pushdown query processing  Lifecycle/workflow  Data retention policy?  Storage tiering?
  • 17. Ovum | TMT intelligence | informa17 Copyright © Informa PLC Governance: How Data Lakes compare to EDWs 90% 50% 90% 50% 30% confidence level  EDW provides good starting point  Core building blocks of governance are similar, but approaches differ  Data Inventory  Flexible, evolving schema  Quality critical, but adjust to need  Business users exert key roles  IT still provides adult supervision  Security  Greater varieties of data, use of external data sources, and (arguably) broader user constituencies demand more granular approaches to data protection  Optimization  Just as important as any EDW. Workloads must be prioritized  Lifecycle  Sleeper issue for Data Lakes 90%
  • 18. Ovum | TMT intelligence | informa18 Copyright © Informa PLC Takeaways  The Data Lake is a shared enterprise resource  It is a later, mature stage of Hadoop adoption  Exploratory analytics is a great way to sell business users on the value proposition of the Data Lake  Why governance? Because the data lake is an enterprise data resource  Governance will adapt & extend practices from EDW  Greater variety of data sources demands greater scrutiny for security, data retention & lifecycle management practices  Data lineage is critical!!!  Like any enterprise data platform, workloads must be prioritized
  • 19. Ovum | TMT intelligence | informa19 Copyright © Informa PLC There is no silver bullet recipe for Data Lake Governance
  • 20. Ovum | TMT intelligence | informa20 Copyright © Informa PLC Thank you Tony Baer Ovum (646) 546-5330 tony.baer@ovum.com Twitter: @TonyBaer