Ce diaporama a bien été signalé.
Nous utilisons votre profil LinkedIn et vos données d’activité pour vous proposer des publicités personnalisées et pertinentes. Vous pouvez changer vos préférences de publicités à tout moment.

The Missing Link in Enterprise Data Governance - Automated Metadata Management

835 vues

Publié le

So many companies and organizations are in the same boat. They’re drowning in their data — so much data, from so many different sources. They understand that data governance is hugely important for them to be able to know their data inside and out and comply with regulations. What many companies have not yet come to terms with when implementing their data governance strategy and supporting tools, is the criticality of metadata in the process. As the ‘data about data,’ metadata provides the value and purpose of the data content, thereby becoming an extremely effective tool for quickly locating information – a must for BI groups dealing with analytics and business user reporting.

Octopai's CEO, Amnon Drori will discuss this critical missing link in enterprise data governance and the impact of automating metadata management for data discovery and data lineage for BI. He'll demonstrate how BI groups use Octopai to not only locate their data instantly, but to quickly and accurately visualize and understand the entire data journey to enable the business to move forward.

Publié dans : Technologie
  • Soyez le premier à commenter

The Missing Link in Enterprise Data Governance - Automated Metadata Management

  1. 1. Donna Burbank, Managing Director, Global Data Strategy, Ltd. Amnon Drori, CEO, Octopai July 24th , 2018
  2. 2. Donna Burbank Donna is a recognised industry expert in information management with over 20 years of experience in data strategy, information management, data modeling, metadata management, and enterprise architecture. Her background is multi-faceted across consulting, product development, product management, brand strategy, marketing, and business leadership. She is currently the Managing Director at Global Data Strategy, Ltd., an international information management consulting company that specializes in the alignment of business drivers with data-centric technology. In past roles, she has served in key brand strategy and product management roles at CA Technologies and Embarcadero Technologies for several of the leading data management products in the market. As an active contributor to the data management community, she is a long time DAMA International member, Past President and Advisor to the DAMA Rocky Mountain chapter, and was recently awarded the Excellence in Data Management Award from DAMA International in 2016. Donna is also an analyst at the Boulder BI Train Trust (BBBT) where she provides advice and gains insight on the latest BI and Analytics software in the market. She was on several review committees for the Object Management Group’s for key information management and process modeling notations. She has worked with dozens of Fortune 500 companies worldwide in the Americas, Europe, Asia, and Africa and speaks regularly at industry conferences. She has co- authored two books: Data Modeling for the Business and Data Modeling Made Simple with ERwin Data Modeler was a contributor to the book Metadata Solutions, and is a regular contributor to industry publications. She can be reached at donna.burbank@globaldatastrategy.com Donna is based in Boulder, Colorado, USA. 2Follow on Twitter @donnaburbank
  3. 3. AMNON DRORI CO-FOUNDER & CEO, OCTOPAI 3 Amnon has over 20 years of leadership experience in technology companies. Before co-founding Octopai he led sales efforts at companies like Panaya, Zend Technologies, ModusNovo and Alvarion, and also served as the Chief Revenue Officer at CoolaData, a big data behavioral analytics platform. Amnon studied Management and Computer Science at the Open University of Tel Aviv. Octopai was founded in 2015 by business intelligence professionals that saw a real pain point in the sector, Octopai’s SaaS solution fully automates metadata management and analysis, enabling enterprise BI groups to quickly, easily and accurately find and understand their data for improved reporting accuracy, regulation compliance, data modeling, data quality and data governance.
  4. 4. Today’s Topic 4 • Companies are drowning in their data — so much data, from so many different sources. • They understand that data governance is hugely important, but many have not grasped the criticality of metadata in the process. • Metadata helps with locating data - a must for BI groups dealing with analytics and business user reporting. • Automating metadata management for data discovery and data lineage for BI is critical for enterprise data governance. • BI groups use Octopai to locate their data instantly, and to quickly and accurately visualize and understand the entire data journey. The Missing Link in Enterprise Data Governance: Automated Metadata Management
  5. 5. Agenda • Data Governance and Metadata: The Critical Link • The Business Need: Why Metadata is hotter than ever • New Strategies & Approaches to support the ever-evolving data landscape • How Octopai can Help 5
  6. 6. DataGovernance & Metadata– the Interdependency Metadata Management is critical to enforcing Data Governance Retail What is Data Governance? 1 Data Governance is the exercise of authority and control (planning, monitoring, and enforcement) over the management of data assets. Metadata Provides the means to deliver & enforce Data Governance Drives the need for What is Metadata? 1 Metadata “includes information about technical and business processes, data rules and constraints, and logical and physical data structures.” 1 From DAMA DMBOK
  7. 7. Metadata is the “Who, What, Where, Why, When & How” of Data 7 Who What Where Why When How Who created this data? What is the business definition of this data element? Where is this data stored? Why are we storing this data? When was this data created? How is this data formatted? (character, numeric, etc.) Who is the Steward of this data? What are the business rules for this data? Where did this data come from? What is its usage & purpose? When was this data last updated? How many databases or data sources store this data? Who is using this data? What is the security level or privacy level of this data? Where is this data used & shared? What are the business drivers for using this data? How long should it be stored? Who “owns” this data? What is the abbreviation or acronym for this data element? Where is the backup for this data? When does it need to be purged/deleted? Who is regulating or auditing this data? What are the technical naming standards for database implementation? Are there regional privacy or security policies that regulate this data? Metadata is Data In Context
  8. 8. Metadata is Part of a Larger Enterprise Landscape 8 A Successful Strategy Requires Many Inter-related Disciplines “Top-Down” alignment with business priorities “Bottom-Up” management & inventory of data sources Managing the people, process, policies & culture around data Coordinating & integrating disparate data sources Leveraging & managing data for strategic advantage
  9. 9. Metadata is Hotter than ever 9 A Growing Trend In a recent DATAVERSITY survey, over 80% of respondents stated that: Metadata is as important, if not more important, than in the past.
  10. 10. Metadata Management Use Cases 10 • Leading Use Cases were similar in 2016 & 2017, according to two recent DATAVERSITY surveys1: • Data Governance • Data Quality • Data Warehousing (DW) & Business Intelligence (BI) • Master Data Management (MDM) • 2017 saw growth in: • Regulation & Audit (e.g. GDPR) • Master Data Management 1 Emerging Trends in Metadata Management, 2016, DATAVERSITY, by Donna Burbank and Charles Roe Trends in Data Architecture, 2017, DATAVERSITY, by Donna Burbank and Charles Roe
  11. 11. BI Reporting, Data Governance, and Metadata 11 Total Sales Figures seem wrong in this report. How were they calculated? I need the answer for this afternoon’s meeting. Thanks! Sure! • With the rise of the data-driven organization, data in business intelligence reports has more visibility than ever. • This visibility highlights data quality issues, and drives the need for data lineage and governance
  12. 12. Data Source 1 Reality Can Be Complicated 12 Data Source 2 Data Source 1 Data Source X The complexity of most data warehouse and BI systems makes manual documentation feel like a Rube Goldberg diagram!
  13. 13. Data Lineage Automation • Automated metadata lineage can help show the path from source system to BI report. • The good news is that many systems have embedded metadata that can be inferred & captured by automated tools. 13 Audit and Traceability Sales Report CUSTOMER Database Table CUST Database Table CUSTOMER Database Table CUSTOMER Database Table TBL_C1 Database Table ETL Tool ETL Tool Physical Data Model Physical Data Model Logical Data Model Dimensional Data Model BI Tool Total Sales for Customer X this Quarter are $1.5M
  14. 14. Unlocking the Details of ETL Mappings • In addition to high-level data flow mapping, detailed source to target mapping and transformation is important to business intelligence lineage and governance. 14 The Devil is Often in the Details Field to Field Mapping is Complex Cust_No Cust_Num Associate ID Customer_Number
  15. 15. Managing & Governing Change 15 Hey man, for that big marketing launch next week, we’re changing the product name – it’s really cool. Could you make sure all the reports and systems show the new name? Thanks! Sure! • Business today moves quickly, and technical systems need to keep pace. • A change in one simple field can wreak havoc on downstream systems if not managed carefully -> metadata management can help govern change and impact analysis.
  16. 16. Impact Analysis & Where Used • Impact Analysis shows the relationship between data sources to assess the impact of a potential change. • Driving Agility & Responsiveness • Reducing Risk • For example, if I change the length & name of a field, what other systems that are referencing that field will be affected? • With this roadmap in place, it is easier to assess the impact of a proposed change, significantly reducing development and maintenance time, and improving overall governance. 16 Proactively Showing the Impact of Change What happens if I change the name & length of the “Brand” field? Brand CHAR(10) MyBrand VARCHAR(30) Customer Database Oracle Sales Application Sales Database DB2 Staging Area ETL Data Warehouse Sales Report
  17. 17. Data Governance – Overarching Framework Organization & People Process & Workflows Data Management & Measures Culture & Communication Vision & Strategy Tools & Technology - Automation is critical Business Goals & Objectives Data Issues & Challenges Managing the Complex Interactions between Technology, Process and People Automated metadata management is a critical foundation for data governance.
  18. 18. Technical Metadata Makes Data Governance Actionable • Metadata helps align data governance policies and make them actionable in physical systems, maintaining a lineage & audit trail. • How was a given field calculated on a report? • Where is personal information (e.g. PII) used across the organization? • Etc. 18 Policies & Procedures Business Rules & Definitions Technical Implementation Audit & Lineage Technical Metadata Lineage makes Data Governance Policies Actionable.
  19. 19. Metadata Matters Even with today’s advanced hardware & storage options, self-service BI tools, and data science skills & tools, attention needs to be paid to the quality, context, & structure of data (aka Metadata) The absence of commonly understood and shared metadata and data definitions and the lack of data governance are cited as the main impediments to the success of Data Lakes. Source: Radiant Advisors 71% of interviewees surveyed in larger global organizations expect data-driven digitization to help their business grow. But… • 70% say the biggest barrier is finding the right data • 62% cite inconsistent data. Source: Stibo Systems If I have to manually map data lineage in one more spreadsheet, I’m going to shoot myself.
  20. 20. Types of Metadata Managed in Today’s Organization1 20 Now Future • Strong focus on: • Data Warehousing • Relational Databases • Data Models • Business Glossaries • Business Intelligence • ETL Tools • Focus on Data Warehouse & Relational systems continues. • With more diverse sources added: • Big Data Platforms • Machine Learning/AI • Semantic Technologies • NoSQL Platforms • Legacy Platforms (?!) – retirees? • Social Media • Media Files • Etc. 1 Emerging Trends in Metadata Management, 2016, DATAVERSITY, by Donna Burbank and Charles Roe
  21. 21. Diversity of Sources Makes Metadata Management More Challenging In the 2017 DATAVERSITY Report on Trends in Data Architecture, two of the top concerns from respondents around metadata management were: • Tracking metadata and lineage across heterogeneous environments. • More automation for metadata management. 21 Trends in Data Architecture, 2017, DATAVERSITY, by Donna Burbank and Charles Roe
  22. 22. Technical Innovation in Metadata Management Technical Innovation not only has created new sources of metadata. …but it has created new ways to manage metadata as well. 22
  23. 23. Machine Learning & Metadata Discovery • Machine Learning offers ways to automate tedious tasks that may have been done manually before: • e.g. Data Mapping • SSN -> Field1_SSN • SSN -> Soc_Num • Etc. • Machine Learning Pattern Matching • NNN-NN-NNNN -> Field_X follows this pattern, it must be a SSN 23 Source kdnuggets.com • There is a place for both methods: • Sometimes you want to define specific mapping rules • Sometimes you want a pattern-matching, discovery- style approach.
  24. 24. Metadata Discovery Tools 24 • With the ever-growing sources of data… • …and increased visibility on data due to regulation and business needs… • Automation is critical for managing the complexity of today’s metadata environment. • Automated population from common sources (reports, databases, ETL tools, etc.) • Machine Learning capabilities facilitate mapping & lineage • Visual data lineage to understand traceability, audit, and impact of change Metadata Discovery Tools Automated Lineage, Governance & Traceability ERP OLAP Oracle Teradata SQL ServerCognos Informatica ETL SAP BO Tableau
  25. 25. Disrupting Metadata Management with Metadata Automation 25
  26. 26. Build New Processes M&A Fix Broken Processes Migration/Upgrades Fix Reports Changes & Impact Analysis JUST SOME OF MANY USE CASES
  27. 27. METADATA IS SCATTERED ALL OVER THE PLACE DB Sources Marketing CRM ERP Finance HR Other Sources ETL Tools Informatica DataStage (IBM) SSIS (MS) Others (Talend, etc’) BI groups invest >50% of time and effort to manually find and understand metadata Octopai – Cross Platform Metadata Management for BI (in IT org) Data Warehouse Oracle Others (Hadoop, etc’) SQL Server(MS) Teradata Vertica(Big Data) Reporting & Analysis Tools Cognos (IBM) BO (SAP) QlikView Tableau OBIEE (Oracle) Others (Sisense, etc’) SSAS (OLAP)
  28. 28. Demo 28
  29. 29. Questions? 29 Thoughts? Ideas?