SlideShare une entreprise Scribd logo
1  sur  47
Télécharger pour lire hors ligne
Data Modeling for Big Data
Donna Burbank
Global Data Strategy Ltd.
Lessons in Data Modeling DATAVERSITY Series
August 25th, 2016
Global Data Strategy, Ltd. 2016
Donna Burbank
Donna is a recognised industry expert in
information management with over 20
years of experience in data strategy,
information management, data modeling,
metadata management, and enterprise
architecture. Her background is multi-
faceted across consulting, product
development, product management,
brand strategy, marketing, and business
leadership.
She is currently the Managing Director at
Global Data Strategy, Ltd., an international
information management consulting
company that specialises in the alignment
of business drivers with data-centric
technology. In past roles, she has served in
key brand strategy and product
management roles at CA Technologies and
Embarcadero Technologies for several of
the leading data management products in
the market.
As an active contributor to the data
management community, she is a long
time DAMA International member and is
the President of the DAMA Rocky
Mountain chapter. She was also on the
review committee for the Object
Management Group’s Information
Management Metamodel (IMM) and a
member of the OMG’s Finalization
Taskforce for the Business Process
Modeling Notation (BPMN).
She has worked with dozens of Fortune
500 companies worldwide in the
Americas, Europe, Asia, and Africa and
speaks regularly at industry
conferences. She has co-authored two
books: Data Modeling for the
Business and Data Modeling Made Simple
with ERwin Data Modeler and is a regular
contributor to industry publications such
as DATAVERSITY, EM360, & TDAN. She can
be reached at
donna.burbank@globaldatastrategy.com
Donna is based in Boulder, Colorado, USA.
2
Follow on Twitter @donnaburbank
Global Data Strategy, Ltd. 2016
Lessons in Data Modeling Series
• July 28th Why a Data Model is an Important Part of your Data Strategy
• August 25th Data Modeling for Big Data
• September 22nd UML for Data Modeling – When Does it Make Sense?
• October 27th Data Modeling & Metadata Management
• December 6th Data Modeling for XML and JSON
3
This Year’s Line Up
Global Data Strategy, Ltd. 2016
Agenda
• Big Data –A Technical & Cultural Paradigm Shift
• Big Data in the Larger Information Management Landscape
• Modeling & Technology Considerations
• Organizational Considerations: The Role of the Data Architect in the World of Big Data
• Summary & Questions
4
What we’ll cover today
Global Data Strategy, Ltd. 2016
What is a Model?
5
The Importance of Context & Definitions
X X
6
Big Data –A Technical &
Cultural Paradigm Shift
Global Data Strategy, Ltd. 2016
Technological and Culture Shift in Society
1960 1970 1980 1990 2000 2010
Global Data Strategy, Ltd. 2016
Technological and Culture Shift in Data Management
1960 1970 1980 1990 2000 2010
• Mainframes
• Flat Files
• In-House Development
• Waterfall Methodology
• Individual PC’s
• Relational Databases
• Democratization of Computing
• Client/Server Computing
• Relational Databases
• Data Warehousing
• Packaged Applications
• RAD Development
• Dot COM Revolution
• Changing Business Models
• Dot COM Bubble Bursts
• Data Integration
• “Do More with Less”
• Cloud Computing
• “Big Data”, NoSQL
• Data Lakes
• Agile Development
Global Data Strategy, Ltd. 2016
Traditional Relational Technologies and “Big Data”:
a Paradigm Shift
Traditional
• Top-Down, Hierarchical
• Design, then Implement
• “Passive”, Push technology
• “Manageable” volumes of information
• “Stable” rate of change
• Business Intelligence
Big Data
• Distributed, Democratic
• Discover and Analyze
• Collaborative, Interactive
• Massive volumes of information
• Rapid and Exponential rate of growth
• Statistical Analysis
Design Implement Discover Analyze
Global Data Strategy, Ltd. 2016
“Traditional” way of Looking at the World: Hierarchies
• Carolus Linnaeus in 1735 established a hierarchy/taxonomy for organizing and identifying
biological systems.
Kingdom
Phylum
Class
Order
Family
Genus
Species
Global Data Strategy, Ltd. 2016
“New” Way of Looking at the World - Emergence
In philosophy, systems theory, science, and art, emergence is
the way complex systems and patterns arise out of a
multiplicity of relatively simple interactions.
- Wikipedia
I love my new
Levis jeans.
Is Levi coming
to my party?
Sale #LEVIS
20% at Macys.
LOL. TTYL.
Leving soon.
12
Big Data within the Larger
Information Management Landscape
Global Data Strategy, Ltd. 2016
Big Data is Part of a Larger Enterprise Landscape
13
A Successful Data Strategy Requires Many Inter-related Disciplines
“Top-Down” alignment with
business priorities
“Bottom-Up” management &
inventory of data sources
Managing the people, process,
policies & culture around data
Coordinating & integrating
disparate data sources
Leveraging & managing data for
strategic advantage
Global Data Strategy, Ltd. 2016
What is Big Data?
• Big Data is often characterised by the “3 Vs”:
• Volume: Is there a high volume of data? (e.g. terabytes per day)
• Velocity: Is data generated or changed at a rapid pace? (e.g. per second, sub-second)
• Variety: Is data stored across multiple formats? (e.g. machine data, OSS data, log files)
• The ability to understand and manage these sources and integrate them into the larger Business
Intelligence ecosystem can provide the ability to gain valuable insights from data.
• This ability leads to the “4th V” of Big Data – Value.
• Value: Valuable insights gained from the ability to analyze and discover new patterns and trends from
high-volume and/or cross-platform systems.
• Volume
• Velocity
• Variety
Value
Global Data Strategy, Ltd. 2016
The 5th “V” - Veracity
• Only through proper Governance, Data Quality Management, Metadata Management, etc., can
organizations achieve the 5th “V” – Veracity.
• Veracity: Trust in the accuracy, quality and content of the organizations’ information assets.
• i.e. The hard work doesn’t go away with Big Data
Raw data used in Self-Service Analytics and BI environments is
often so poor that many data scientists and BI professionals
spend an estimated 50 – 90% of their time cleaning and
reformatting data to make it fit for purpose.(4
Source: DataCenterJournal.com
The absence of commonly understood and shared metadata
and data definitions is cited as one of the main impediments
to the success of Data Lakes.
Source: Radiant Advisors
Correcting poor data quality is a Data Scientist’s least favorite
task, consuming on average 80% of their working day
Source: Forbes 2016
71% of interviewees expect digitization to grow their
business. But 70% say the biggest barrier is finding the right
data; 62% cite inconsistent data
Source: Stibo Systems
Data Science Data Lakes
Data Science Digitization & Data Quality
Global Data Strategy, Ltd. 2016
Metadata & Big Data Analytics
• Modern advances in data analytics & big data storage provide a wealth of opportunities
• But the analytics are only as good as the quality of the underlying data
• Metadata is critical – where did the data come from? What was its intended purpose? What are the units of
measure? What is the definition of key terms?
• Good data analysis is based on good data. Good data requires good metadata.
16
Global Data Strategy, Ltd. 2016
Are Data Models Still Relevant?
From Data Modeling for the Business by
Hoberman, Burbank, Bradley, Technics
Publications, 2009
Global Data Strategy, Ltd. 2016
Metadata & Big Data Analytics
18
“Our analysis shows that energy usage with Smart Meters increases by 5% for each percentage point decrease in
temperature compared to a 20% increase for traditional thermostat customers.”
Global Data Strategy, Ltd. 2016
Metadata & Big Data Analytics
19
• What was the source for the weather data?
• Were readings taking daily, monthly, weekly? Averages or actuals?
• What was the original purpose & format for the readings?
• Were temperatures in Celsius or Fahrenheit?
• Etc.
• Were readings taken by meter readings or billing amounts?
• Were readings taking daily, monthly, weekly? Averages or actuals?
• Were temperatures in Celsius or Fahrenheit?
• Meter readings for were in completely different formats.
It took us weeks to standardize them.
• Etc.
• Is Usage by Address, by Individual,
or by Household?
• Are households determined by
residence or relationships?
• Etc.
“Our analysis shows that energy usage with Smart Meters increases by 5% for each percentage point decrease in
temperature compared to a 20% increase for traditional thermostat customers.”
Global Data Strategy, Ltd. 2016
The Business Case Remains the Same
Single, Consistent View of Information – from all Sources
Tell me what
customers are
saying about our
product.
Sybase
SAP
DB2
Oracle
SQL
Server
SQL
Azure
Informix
Teradata
DBA
Which customer
database do you
want me to pull this
from? We have 25.
Data
Architect
And, by the way, the databases
all store customer information
in a different format.
“CUST_NM” on DB2,
“cust_last_nm” on Oracle, etc.
It’s a mess.
I love my new
Levis jeans.
Is Levi coming
to my party?
Sale #LEVIS
20% at Macys.
LOL. TTYL.
Leving soon.
Big Data
Traditional/Relational Data
Data
Scientist
I’ll need to input the raw data
from thousands of sources, and
write a program to parse and
analyze the relevant
information.
20
Global Data Strategy, Ltd. 2016
Combining DW & Big Data Can Provide Valuable Information
• There are numerous ways to gain value from data
• Relational Database and Data Warehouse systems are one key source of value
• Customer information
• Product information
• Big Data can offer new insights from data
• From new data sources (e.g. social media, IoT)
• By correlating multiple new and existing data sources (e.g. network patterns & customer data)
• Integrating DW and Big Data can provide valuable new insights.
• Examples include:
• Customer Experience Optimization
• Churn Management
• Products & Services Innovation
New
InsightsData
Warehouse
21
Global Data Strategy, Ltd. 2016
Case Study: Facebook’s Data Warehouse
• Started with Big Data using Hadoop, then saw the need for a traditional Data Warehouse
• Ken Rudin, director of analytics at Facebook presented keynote at TDWI Chicago in May 2013—video replay
available.
• Needed a single source of reference for core business data
• All data in one place- “managed chaos”
• Define the core elements of the business, leave the rest alone
• 100,000s of tables in Hadoop down to a few dozen in Data Warehouse
• What data warehouse was good at:
• Operational analysis (e.g. how many users logged in by region?)
• Faster query time (1 minute on DW, over 1 hour on Hadoop)
• What Big Data was good at:
• Exploratory analysis (e.g. Where are users posting from—how can we infer a location if one is not listed?)
“The genius of AND and the tyranny of OR” – Jim Collins, author of
“Good to Great” -> i.e. Both solutions have their place
22
Global Data Strategy, Ltd. 2016
Case Study: Facebook’s Data Warehouse
• Challenge in both Big Data and Data Warehouse solutions — Business Definitions & Metadata
• e.g. How many users logged in yesterday?
• What do you mean by user?
• Does user include mobile devices?
• If a user posted from Spotify, is that a user?
• Sound familiar?
23
24
Modeling & Technology
Considerations
Global Data Strategy, Ltd. 2016
Levels of Data Modeling
25
Conceptual
Logical
Physical
Purpose
Communication & Definition of
Business Terms & Rules
Clarification & Detail
of Business Rules &
Data Structures
Technical
Implementation on
a Physical Database
Audience
Business Stakeholders
Data Architecture
Business Analysts
DBAs
Developers
Business Concepts
Data Entities
Physical Tables
Global Data Strategy, Ltd. 2016
Conceptual Data Model
• Creation & Communication of Business Rules and Definitions
Global Data Strategy, Ltd. 2016
The Importance of Definitions
• Definitions are as important as the data elements themselves.
• Many data-related business issues are caused by unclear or ill-defined terms
27
What is in a name?
What do you mean by
“customer”?
We’re calculating “total sales”
differently in each region!
Sales is using a different
“monthly calendar” than
Finance.
How are we defining a
“household”?
What’s an “equity
derivative”?
What’s a “PEG ratio”?
“API” as in “Application Programming Interface?”
or “American Petroleum Institute”? Or a bee?
What’s the difference between an
“ingredient” and a “raw material”?
Global Data Strategy, Ltd. 2016
The 5th “V” - Veracity
• Remember the 5th “V” – Veracity!
• Definitions & Metadata are just as important with Big Data as with
traditional systems.
The absence of commonly understood and shared
metadata and data definitions is cited as one of the
main impediments to the success of Data Lakes.
Source: Radiant Advisors
Global Data Strategy, Ltd. 2016
Logical Data Model
• More Detailed, Potential Pre-cursor
To Physical Design
• Definition of Key Business Rules,
Relationships, and Attributes
Global Data Strategy, Ltd. 2016
Physical Data Model
• Optimization and design of a physical database for storage and performance.
Global Data Strategy, Ltd. 2016
Traditional Model – “Schema on Write”
Sybase
MySQL
Oracle
Data Models
Teradata
Sybase
SQL
Server
DB2
Teradata
SQL
Server DB2
MySQLSQL
Azure
SQL
Azure
Oracle
• With the traditional relational database paradigm, forward & reverse engineering can both create
and read database structure using a graphical data model.
Global Data Strategy, Ltd. 2016
Big Data Model - “Schema on Read”
• With the Big Data and NoSQL paradigm, “Schema-on-Read” means you do not need to know how you will
use your data when you are storing it.
32
Hive
HDFS File system
hdfs dfs -put /local/path/userdump /hdfs/path/data/users
Table Structures
Create table …
Exploration
Analysis
Analyze & understand the data. Build a data structure to suite
your needs.
• You do need to know how you will use your data when you are using
it and model accordingly.
• i.e. it’s not magic.
• For example, you may first place the data on HDFS in files, then apply a
table structure in Hive.
• Apache Hive provides a mechanism to project structure onto the
data in Hadoop and to query that data using a SQL-like language
called HiveQL (HQL).
Global Data Strategy, Ltd. 2016
Data Modeling in the Big Data Ecosystem
Hive HBase
Structured Data Unstructured Data
MapReduce / AnalyticsHadoop Framework
HDFS File System
JSON / XML
HQL
Semi-structured Data
JSON
XML JSON
Data Sources
Global Data Strategy, Ltd. 2016
NoSQL – Key Value Databases
• NoSQL Databases are often optimal solutions for flexibility & performance in certain scenarios.
• One common NoSQL database is a key-value pair database (e.g. Redis, Oracle NoSQL, etc.)
• They can support extremely high volumes of records & state changes per second through distributed
processing and distributed storage.
• Use cases include: Managing user sessions in web applications, online gaming, online shopping carts,
etc.
• The structure is often created by the application code, not within a database or metadata
structure.
• Metadata for NoSQL databases is typically minimal or non-existent.
• The structure & metadata is generally determined by the application code
34
Key Value
1839047 John Doe, Prepaid, 40.00
9287320 01/01/2008, 50.00, Green
Global Data Strategy, Ltd. 2016
*
NoSQL Metadata – Document Databases
• Document databases are popular ways to store unstructured information in a flexible way (e.g.
multimedia, social media posts, etc. )
• Each Collection can contain numerous Documents which could all contain different fields.
35
• Some data modeling can be done, and some data modeling tools support this (e.g. MongoDB).
* Example from docs.mongodb.com
{type: “Artifact”,
medium: “Ceramic”
country: “China”,
}
{type: “Book”,
title: “Ancient China”
country: “China”,
}
Global Data Strategy, Ltd. 2016
Graph Relationships
• Graph databases are ideal for analyzing metadata relationships between objects and finding
patterns in those relationships.
• Common use cases for graph relationship metadata analysis include:
• Fraud detection - e.g. financial transactions
• Threat detection - e.g. email and phone patterns
• Marketing – e.g. social media connections, product recommendation engines
• Network optimization - e.g. IoT, Telecommunications
36
The data model
is the database.
37
Organizational Considerations:
The Role of the Data Architect in the
World of Big Data
Global Data Strategy, Ltd. 2016
Roles & Culture
DBAs
• Analytical
• Structured
• Project & Task focused
• Cautious – identifies risks
• “Just let me code!”
Business Executive
• Results-Oriented
• Optimistic – Identifies opportunities
• “Big Picture” focused
• “I’m busy.”
• “What’s the business opportunity?”
Data Modelers
• Analytical
• Structured
• “Big Picture” focused
• Passionate
• Likes to Talk
• Can be considered “old school”
• “Let me tell you about my data model!”
Big Data Vendors
• It’s magic!
• It’s easy!
• No modeling
needed!
Data Scientist
• Looks for opportunities
• Likes to explore
• Seen as “modern”
• Seen as “hip” & “sexy”
Global Data Strategy, Ltd. 2016
New Operating Model:
Interactions Between New & Existing Roles
Data Scientist
Hadoop
Administrator
Data Architect
Privacy
Analyst
ETL Developer
Network
Administrator
Existing Roles New Roles
Alignment
Global Data Strategy, Ltd. 2016
Data-Driven Business
• In the current environment of data-driven business, Data Professionals have an opportunity to
have a “seat at the table”
• Business Optimization: Becoming a Data Driven Company - Making the Business More Efficient
• Better Marketing Campaigns
• Higher quality customer data, 360 view of customer, competitive info, etc.
• Better Products
• Data-Driven product development, Customer usage monitoring, etc.
• Better Customer Support
• Linking customer data with support logs, network outages, etc.
• Transformative: Becoming a Data Company - Changing the Business Model via Data – data
becomes the product
• Monetization of Information: examples across multiple industries including:
• Telco: location information, usage & search data, etc.
• Retail: Click-stream data, purchasing patterns
• Social Media: social & family connections, purchasing trends &
recommendations, etc.
• Energy: Sensor data, consumer usage patterns, smart metering, etc.
40
Global Data Strategy, Ltd. 2016
Case Study: Consumer Energy Company
• For the consumer energy sector Big Data and Smart Meters are transforming the ways of
doing business and interacting with customers.
• Moving away from traditional data use cases of metering & billing.
• Smart meters allow customers to be in control of their energy usage.
• Control over energy usage with connected systems
• Custom Energy Reports & Usage
• Smart Billing based on usage times
• As energy usage declines, data is becoming the true business asset for this energy company.
• Monetization of non-personal data is a future consideration.
• While the Big Data Opportunity is crucial, equally important are the traditional data sources
• New Data Quality Tools in place for operational and DW data
• Data Governance Program analyzing data in relation to business processes & roles
• Business-critical data elements identified and definitions created
Business Transformation via Data
Global Data Strategy, Ltd. 2016
Data-Driven Business Evolution
42
Data is a key component for new business opportunities
New Business Model
• Consumer-Driven Smart
Metering
• Connected Devices, IoT
• Proactive service monitoring
• Monetization of usage data
Traditional Business Model
• Usage-based billing
• Issue-driven customer service
More Efficient Business Model
• More efficient billing
• Faster customer service
response
• More consumer information
re: energy efficiency, etc.
Databases Big DataData
Quality
Data
Governance
Metadata Management
Global Data Strategy, Ltd. 2016
Summary
• We are in a period of “disruptive” technology with new opportunities
• Rapid rate of change, Massive volumes of data
• Social changes: more participatory, engaged
• New business models based on data
• Create a fit-for-purpose solution
• Relational databases are still great for operational systems & data warehouses
• Big Data offers new opportunities for analysis across large volumes of diverse data
• As with any Age of Change, the basics still apply
• The “hard stuff” still needs to be done: analysis, metadata definition, data models, etc.
• Governance and Operating Models are critical
• Data models are valuable to document business requirements and technical implementation
• Have fun! This is an exciting time to be in Information Management
Global Data Strategy, Ltd. 2016
About Global Data Strategy, Ltd
• Global Data Strategy is an international information management consulting company that specializes
in the alignment of business drivers with data-centric technology.
• Our passion is data, and helping organizations enrich their business opportunities through data and
information.
• Our core values center around providing solutions that are:
• Business-Driven: We put the needs of your business first, before we look at any technology solution.
• Clear & Relevant: We provide clear explanations using real-world examples.
• Customized & Right-Sized: Our implementations are based on the unique needs of your organization’s
size, corporate culture, and geography.
• High Quality & Technically Precise: We pride ourselves in excellence of execution, with years of
technical expertise in the industry.
44
Data-Driven Business Transformation
Business Strategy
Aligned With
Data Strategy
Visit www.globaldatastrategy.com for more information
Global Data Strategy, Ltd. 2016
Contact Info
• Email: donna.burbank@globaldatastrategy.com
• Twitter: @donnaburbank
@GlobalDataStrat
• Website: www.globaldatastrategy.com
• Company Linkedin: https://www.linkedin.com/company/global-data-strategy-ltd
• Personal Linkedin: https://www.linkedin.com/in/donnaburbank
45
Global Data Strategy, Ltd. 2016
Lessons in Data Modeling Series
• July 28th Why a Data Model is an Important Part of your Data Strategy
• August 25th Data Modeling for Big Data
• September 22nd UML for Data Modeling – When Does it Make Sense?
• October 27th Data Modeling & Metadata Management
• December 6th Data Modeling for XML and JSON
46
Join us next month
Global Data Strategy, Ltd. 2016
Questions?
47
Thoughts? Ideas?

Contenu connexe

Tendances

Data Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to MeshData Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to MeshJeffrey T. Pollock
 
DAS Slides: Master Data Management – Aligning Data, Process, and Governance
DAS Slides: Master Data Management – Aligning Data, Process, and GovernanceDAS Slides: Master Data Management – Aligning Data, Process, and Governance
DAS Slides: Master Data Management – Aligning Data, Process, and GovernanceDATAVERSITY
 
Melbourne: Certus Data 2.0 Vault Meetup with Snowflake - Data Vault In The Cl...
Melbourne: Certus Data 2.0 Vault Meetup with Snowflake - Data Vault In The Cl...Melbourne: Certus Data 2.0 Vault Meetup with Snowflake - Data Vault In The Cl...
Melbourne: Certus Data 2.0 Vault Meetup with Snowflake - Data Vault In The Cl...Certus Solutions
 
Architect’s Open-Source Guide for a Data Mesh Architecture
Architect’s Open-Source Guide for a Data Mesh ArchitectureArchitect’s Open-Source Guide for a Data Mesh Architecture
Architect’s Open-Source Guide for a Data Mesh ArchitectureDatabricks
 
Data Architecture for Data Governance
Data Architecture for Data GovernanceData Architecture for Data Governance
Data Architecture for Data GovernanceDATAVERSITY
 
Data Quality Best Practices
Data Quality Best PracticesData Quality Best Practices
Data Quality Best PracticesDATAVERSITY
 
Snowflake Data Governance
Snowflake Data GovernanceSnowflake Data Governance
Snowflake Data Governancessuser538b022
 
Improving Data Literacy Around Data Architecture
Improving Data Literacy Around Data ArchitectureImproving Data Literacy Around Data Architecture
Improving Data Literacy Around Data ArchitectureDATAVERSITY
 
How to Strengthen Enterprise Data Governance with Data Quality
How to Strengthen Enterprise Data Governance with Data QualityHow to Strengthen Enterprise Data Governance with Data Quality
How to Strengthen Enterprise Data Governance with Data QualityDATAVERSITY
 
DataOps - The Foundation for Your Agile Data Architecture
DataOps - The Foundation for Your Agile Data ArchitectureDataOps - The Foundation for Your Agile Data Architecture
DataOps - The Foundation for Your Agile Data ArchitectureDATAVERSITY
 
Building a modern data warehouse
Building a modern data warehouseBuilding a modern data warehouse
Building a modern data warehouseJames Serra
 
Snowflake SnowPro Certification Exam Cheat Sheet
Snowflake SnowPro Certification Exam Cheat SheetSnowflake SnowPro Certification Exam Cheat Sheet
Snowflake SnowPro Certification Exam Cheat SheetJeno Yamma
 
Putting the Ops in DataOps: Orchestrate the Flow of Data Across Data Pipelines
Putting the Ops in DataOps: Orchestrate the Flow of Data Across Data PipelinesPutting the Ops in DataOps: Orchestrate the Flow of Data Across Data Pipelines
Putting the Ops in DataOps: Orchestrate the Flow of Data Across Data PipelinesDATAVERSITY
 
Understanding DataOps and Its Impact on Application Quality
Understanding DataOps and Its Impact on Application QualityUnderstanding DataOps and Its Impact on Application Quality
Understanding DataOps and Its Impact on Application QualityDevOps.com
 
Data Lakehouse, Data Mesh, and Data Fabric (r2)
Data Lakehouse, Data Mesh, and Data Fabric (r2)Data Lakehouse, Data Mesh, and Data Fabric (r2)
Data Lakehouse, Data Mesh, and Data Fabric (r2)James Serra
 
Data Lake Architecture
Data Lake ArchitectureData Lake Architecture
Data Lake ArchitectureDATAVERSITY
 
Activate Data Governance Using the Data Catalog
Activate Data Governance Using the Data CatalogActivate Data Governance Using the Data Catalog
Activate Data Governance Using the Data CatalogDATAVERSITY
 
Data Architecture - The Foundation for Enterprise Architecture and Governance
Data Architecture - The Foundation for Enterprise Architecture and GovernanceData Architecture - The Foundation for Enterprise Architecture and Governance
Data Architecture - The Foundation for Enterprise Architecture and GovernanceDATAVERSITY
 
Geek Sync | Data Architecture and Data Governance: A Powerful Data Management...
Geek Sync | Data Architecture and Data Governance: A Powerful Data Management...Geek Sync | Data Architecture and Data Governance: A Powerful Data Management...
Geek Sync | Data Architecture and Data Governance: A Powerful Data Management...IDERA Software
 

Tendances (20)

Data Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to MeshData Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to Mesh
 
DAS Slides: Master Data Management – Aligning Data, Process, and Governance
DAS Slides: Master Data Management – Aligning Data, Process, and GovernanceDAS Slides: Master Data Management – Aligning Data, Process, and Governance
DAS Slides: Master Data Management – Aligning Data, Process, and Governance
 
Melbourne: Certus Data 2.0 Vault Meetup with Snowflake - Data Vault In The Cl...
Melbourne: Certus Data 2.0 Vault Meetup with Snowflake - Data Vault In The Cl...Melbourne: Certus Data 2.0 Vault Meetup with Snowflake - Data Vault In The Cl...
Melbourne: Certus Data 2.0 Vault Meetup with Snowflake - Data Vault In The Cl...
 
Architect’s Open-Source Guide for a Data Mesh Architecture
Architect’s Open-Source Guide for a Data Mesh ArchitectureArchitect’s Open-Source Guide for a Data Mesh Architecture
Architect’s Open-Source Guide for a Data Mesh Architecture
 
Data Architecture for Data Governance
Data Architecture for Data GovernanceData Architecture for Data Governance
Data Architecture for Data Governance
 
Data Quality Best Practices
Data Quality Best PracticesData Quality Best Practices
Data Quality Best Practices
 
Snowflake Data Governance
Snowflake Data GovernanceSnowflake Data Governance
Snowflake Data Governance
 
Improving Data Literacy Around Data Architecture
Improving Data Literacy Around Data ArchitectureImproving Data Literacy Around Data Architecture
Improving Data Literacy Around Data Architecture
 
How to Strengthen Enterprise Data Governance with Data Quality
How to Strengthen Enterprise Data Governance with Data QualityHow to Strengthen Enterprise Data Governance with Data Quality
How to Strengthen Enterprise Data Governance with Data Quality
 
DataOps - The Foundation for Your Agile Data Architecture
DataOps - The Foundation for Your Agile Data ArchitectureDataOps - The Foundation for Your Agile Data Architecture
DataOps - The Foundation for Your Agile Data Architecture
 
Data Mesh
Data MeshData Mesh
Data Mesh
 
Building a modern data warehouse
Building a modern data warehouseBuilding a modern data warehouse
Building a modern data warehouse
 
Snowflake SnowPro Certification Exam Cheat Sheet
Snowflake SnowPro Certification Exam Cheat SheetSnowflake SnowPro Certification Exam Cheat Sheet
Snowflake SnowPro Certification Exam Cheat Sheet
 
Putting the Ops in DataOps: Orchestrate the Flow of Data Across Data Pipelines
Putting the Ops in DataOps: Orchestrate the Flow of Data Across Data PipelinesPutting the Ops in DataOps: Orchestrate the Flow of Data Across Data Pipelines
Putting the Ops in DataOps: Orchestrate the Flow of Data Across Data Pipelines
 
Understanding DataOps and Its Impact on Application Quality
Understanding DataOps and Its Impact on Application QualityUnderstanding DataOps and Its Impact on Application Quality
Understanding DataOps and Its Impact on Application Quality
 
Data Lakehouse, Data Mesh, and Data Fabric (r2)
Data Lakehouse, Data Mesh, and Data Fabric (r2)Data Lakehouse, Data Mesh, and Data Fabric (r2)
Data Lakehouse, Data Mesh, and Data Fabric (r2)
 
Data Lake Architecture
Data Lake ArchitectureData Lake Architecture
Data Lake Architecture
 
Activate Data Governance Using the Data Catalog
Activate Data Governance Using the Data CatalogActivate Data Governance Using the Data Catalog
Activate Data Governance Using the Data Catalog
 
Data Architecture - The Foundation for Enterprise Architecture and Governance
Data Architecture - The Foundation for Enterprise Architecture and GovernanceData Architecture - The Foundation for Enterprise Architecture and Governance
Data Architecture - The Foundation for Enterprise Architecture and Governance
 
Geek Sync | Data Architecture and Data Governance: A Powerful Data Management...
Geek Sync | Data Architecture and Data Governance: A Powerful Data Management...Geek Sync | Data Architecture and Data Governance: A Powerful Data Management...
Geek Sync | Data Architecture and Data Governance: A Powerful Data Management...
 

Similaire à Data Modeling Still Important for Big Data Insights- Big data represents a technological and cultural paradigm shift from traditional data management approaches due to the volume, velocity and variety of data. - Data modeling remains relevant for big data as it provides a single consistent view of information from all sources and helps address issues like data quality, governance and metadata management.- Metadata is critical for big data analytics to deliver value and insights. Without proper metadata, significant time is wasted cleaning and preparing poor quality data.- Integrating traditional data warehouses with big data sources can provide valuable new insights by analyzing existing and new data together. This helps with initiatives like customer

Data Lake Architecture – Modern Strategies & Approaches
Data Lake Architecture – Modern Strategies & ApproachesData Lake Architecture – Modern Strategies & Approaches
Data Lake Architecture – Modern Strategies & ApproachesDATAVERSITY
 
Self-Service Data Analysis, Data Wrangling, Data Munging, and Data Modeling –...
Self-Service Data Analysis, Data Wrangling, Data Munging, and Data Modeling –...Self-Service Data Analysis, Data Wrangling, Data Munging, and Data Modeling –...
Self-Service Data Analysis, Data Wrangling, Data Munging, and Data Modeling –...DATAVERSITY
 
Modern Metadata Strategies
Modern Metadata StrategiesModern Metadata Strategies
Modern Metadata StrategiesDATAVERSITY
 
Data Architecture Best Practices for Today’s Rapidly Changing Data Landscape
Data Architecture Best Practices for Today’s Rapidly Changing Data LandscapeData Architecture Best Practices for Today’s Rapidly Changing Data Landscape
Data Architecture Best Practices for Today’s Rapidly Changing Data LandscapeDATAVERSITY
 
LDM Webinar: Data Modeling & Metadata Management
LDM Webinar: Data Modeling & Metadata ManagementLDM Webinar: Data Modeling & Metadata Management
LDM Webinar: Data Modeling & Metadata ManagementDATAVERSITY
 
DAS Slides: Self-Service Reporting and Data Prep – Benefits & Risks
DAS Slides: Self-Service Reporting and Data Prep – Benefits & RisksDAS Slides: Self-Service Reporting and Data Prep – Benefits & Risks
DAS Slides: Self-Service Reporting and Data Prep – Benefits & RisksDATAVERSITY
 
DAS Slides: Cloud-Based Data Warehousing – What’s New and What Stays the Same
DAS Slides: Cloud-Based Data Warehousing – What’s New and What Stays the SameDAS Slides: Cloud-Based Data Warehousing – What’s New and What Stays the Same
DAS Slides: Cloud-Based Data Warehousing – What’s New and What Stays the SameDATAVERSITY
 
Data Modeling & Metadata Management
Data Modeling & Metadata ManagementData Modeling & Metadata Management
Data Modeling & Metadata ManagementDATAVERSITY
 
Data Modeling & Data Integration
Data Modeling & Data IntegrationData Modeling & Data Integration
Data Modeling & Data IntegrationDATAVERSITY
 
LDM Slides: Conceptual Data Models - How to Get the Attention of Business Use...
LDM Slides: Conceptual Data Models - How to Get the Attention of Business Use...LDM Slides: Conceptual Data Models - How to Get the Attention of Business Use...
LDM Slides: Conceptual Data Models - How to Get the Attention of Business Use...DATAVERSITY
 
The Evolving Role of the Data Architect – What Does It Mean for Your Career?
The Evolving Role of the Data Architect – What Does It Mean for Your Career?The Evolving Role of the Data Architect – What Does It Mean for Your Career?
The Evolving Role of the Data Architect – What Does It Mean for Your Career?DATAVERSITY
 
Data Architecture Strategies Webinar: Emerging Trends in Data Architecture – ...
Data Architecture Strategies Webinar: Emerging Trends in Data Architecture – ...Data Architecture Strategies Webinar: Emerging Trends in Data Architecture – ...
Data Architecture Strategies Webinar: Emerging Trends in Data Architecture – ...DATAVERSITY
 
Data Modeling Techniques
Data Modeling TechniquesData Modeling Techniques
Data Modeling TechniquesDATAVERSITY
 
DAS Slides: Best Practices in Metadata Management
DAS Slides: Best Practices in Metadata ManagementDAS Slides: Best Practices in Metadata Management
DAS Slides: Best Practices in Metadata ManagementDATAVERSITY
 
LDM Webinar: Data Modeling & Business Intelligence
LDM Webinar: Data Modeling & Business IntelligenceLDM Webinar: Data Modeling & Business Intelligence
LDM Webinar: Data Modeling & Business IntelligenceDATAVERSITY
 
Data Modeling Best Practices - Business & Technical Approaches
Data Modeling Best Practices - Business & Technical ApproachesData Modeling Best Practices - Business & Technical Approaches
Data Modeling Best Practices - Business & Technical ApproachesDATAVERSITY
 
DAS Slides: Building a Future-State Data Architecture Plan - Where to Begin?
DAS Slides: Building a Future-State Data Architecture Plan - Where to Begin?DAS Slides: Building a Future-State Data Architecture Plan - Where to Begin?
DAS Slides: Building a Future-State Data Architecture Plan - Where to Begin?DATAVERSITY
 
Data Architecture Strategies: The Rise of the Graph Database
Data Architecture Strategies: The Rise of the Graph DatabaseData Architecture Strategies: The Rise of the Graph Database
Data Architecture Strategies: The Rise of the Graph DatabaseDATAVERSITY
 
Big Data why Now and where to?
Big Data why Now and where to?Big Data why Now and where to?
Big Data why Now and where to?Fady Sayah
 
The Missing Link in Enterprise Data Governance - Automated Metadata Management
The Missing Link in Enterprise Data Governance - Automated Metadata ManagementThe Missing Link in Enterprise Data Governance - Automated Metadata Management
The Missing Link in Enterprise Data Governance - Automated Metadata ManagementDATAVERSITY
 

Similaire à Data Modeling Still Important for Big Data Insights- Big data represents a technological and cultural paradigm shift from traditional data management approaches due to the volume, velocity and variety of data. - Data modeling remains relevant for big data as it provides a single consistent view of information from all sources and helps address issues like data quality, governance and metadata management.- Metadata is critical for big data analytics to deliver value and insights. Without proper metadata, significant time is wasted cleaning and preparing poor quality data.- Integrating traditional data warehouses with big data sources can provide valuable new insights by analyzing existing and new data together. This helps with initiatives like customer (20)

Data Lake Architecture – Modern Strategies & Approaches
Data Lake Architecture – Modern Strategies & ApproachesData Lake Architecture – Modern Strategies & Approaches
Data Lake Architecture – Modern Strategies & Approaches
 
Self-Service Data Analysis, Data Wrangling, Data Munging, and Data Modeling –...
Self-Service Data Analysis, Data Wrangling, Data Munging, and Data Modeling –...Self-Service Data Analysis, Data Wrangling, Data Munging, and Data Modeling –...
Self-Service Data Analysis, Data Wrangling, Data Munging, and Data Modeling –...
 
Modern Metadata Strategies
Modern Metadata StrategiesModern Metadata Strategies
Modern Metadata Strategies
 
Data Architecture Best Practices for Today’s Rapidly Changing Data Landscape
Data Architecture Best Practices for Today’s Rapidly Changing Data LandscapeData Architecture Best Practices for Today’s Rapidly Changing Data Landscape
Data Architecture Best Practices for Today’s Rapidly Changing Data Landscape
 
LDM Webinar: Data Modeling & Metadata Management
LDM Webinar: Data Modeling & Metadata ManagementLDM Webinar: Data Modeling & Metadata Management
LDM Webinar: Data Modeling & Metadata Management
 
DAS Slides: Self-Service Reporting and Data Prep – Benefits & Risks
DAS Slides: Self-Service Reporting and Data Prep – Benefits & RisksDAS Slides: Self-Service Reporting and Data Prep – Benefits & Risks
DAS Slides: Self-Service Reporting and Data Prep – Benefits & Risks
 
DAS Slides: Cloud-Based Data Warehousing – What’s New and What Stays the Same
DAS Slides: Cloud-Based Data Warehousing – What’s New and What Stays the SameDAS Slides: Cloud-Based Data Warehousing – What’s New and What Stays the Same
DAS Slides: Cloud-Based Data Warehousing – What’s New and What Stays the Same
 
Data Modeling & Metadata Management
Data Modeling & Metadata ManagementData Modeling & Metadata Management
Data Modeling & Metadata Management
 
Data Modeling & Data Integration
Data Modeling & Data IntegrationData Modeling & Data Integration
Data Modeling & Data Integration
 
LDM Slides: Conceptual Data Models - How to Get the Attention of Business Use...
LDM Slides: Conceptual Data Models - How to Get the Attention of Business Use...LDM Slides: Conceptual Data Models - How to Get the Attention of Business Use...
LDM Slides: Conceptual Data Models - How to Get the Attention of Business Use...
 
The Evolving Role of the Data Architect – What Does It Mean for Your Career?
The Evolving Role of the Data Architect – What Does It Mean for Your Career?The Evolving Role of the Data Architect – What Does It Mean for Your Career?
The Evolving Role of the Data Architect – What Does It Mean for Your Career?
 
Data Architecture Strategies Webinar: Emerging Trends in Data Architecture – ...
Data Architecture Strategies Webinar: Emerging Trends in Data Architecture – ...Data Architecture Strategies Webinar: Emerging Trends in Data Architecture – ...
Data Architecture Strategies Webinar: Emerging Trends in Data Architecture – ...
 
Data Modeling Techniques
Data Modeling TechniquesData Modeling Techniques
Data Modeling Techniques
 
DAS Slides: Best Practices in Metadata Management
DAS Slides: Best Practices in Metadata ManagementDAS Slides: Best Practices in Metadata Management
DAS Slides: Best Practices in Metadata Management
 
LDM Webinar: Data Modeling & Business Intelligence
LDM Webinar: Data Modeling & Business IntelligenceLDM Webinar: Data Modeling & Business Intelligence
LDM Webinar: Data Modeling & Business Intelligence
 
Data Modeling Best Practices - Business & Technical Approaches
Data Modeling Best Practices - Business & Technical ApproachesData Modeling Best Practices - Business & Technical Approaches
Data Modeling Best Practices - Business & Technical Approaches
 
DAS Slides: Building a Future-State Data Architecture Plan - Where to Begin?
DAS Slides: Building a Future-State Data Architecture Plan - Where to Begin?DAS Slides: Building a Future-State Data Architecture Plan - Where to Begin?
DAS Slides: Building a Future-State Data Architecture Plan - Where to Begin?
 
Data Architecture Strategies: The Rise of the Graph Database
Data Architecture Strategies: The Rise of the Graph DatabaseData Architecture Strategies: The Rise of the Graph Database
Data Architecture Strategies: The Rise of the Graph Database
 
Big Data why Now and where to?
Big Data why Now and where to?Big Data why Now and where to?
Big Data why Now and where to?
 
The Missing Link in Enterprise Data Governance - Automated Metadata Management
The Missing Link in Enterprise Data Governance - Automated Metadata ManagementThe Missing Link in Enterprise Data Governance - Automated Metadata Management
The Missing Link in Enterprise Data Governance - Automated Metadata Management
 

Plus de DATAVERSITY

Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...DATAVERSITY
 
Data at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and GovernanceData at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and GovernanceDATAVERSITY
 
Exploring Levels of Data Literacy
Exploring Levels of Data LiteracyExploring Levels of Data Literacy
Exploring Levels of Data LiteracyDATAVERSITY
 
Building a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsBuilding a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsDATAVERSITY
 
Make Data Work for You
Make Data Work for YouMake Data Work for You
Make Data Work for YouDATAVERSITY
 
Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?DATAVERSITY
 
Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?DATAVERSITY
 
Data Modeling Fundamentals
Data Modeling FundamentalsData Modeling Fundamentals
Data Modeling FundamentalsDATAVERSITY
 
Showing ROI for Your Analytic Project
Showing ROI for Your Analytic ProjectShowing ROI for Your Analytic Project
Showing ROI for Your Analytic ProjectDATAVERSITY
 
How a Semantic Layer Makes Data Mesh Work at Scale
How a Semantic Layer Makes  Data Mesh Work at ScaleHow a Semantic Layer Makes  Data Mesh Work at Scale
How a Semantic Layer Makes Data Mesh Work at ScaleDATAVERSITY
 
Is Enterprise Data Literacy Possible?
Is Enterprise Data Literacy Possible?Is Enterprise Data Literacy Possible?
Is Enterprise Data Literacy Possible?DATAVERSITY
 
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...DATAVERSITY
 
Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?DATAVERSITY
 
Data Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and ForwardsData Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and ForwardsDATAVERSITY
 
Data Governance Trends and Best Practices To Implement Today
Data Governance Trends and Best Practices To Implement TodayData Governance Trends and Best Practices To Implement Today
Data Governance Trends and Best Practices To Implement TodayDATAVERSITY
 
2023 Trends in Enterprise Analytics
2023 Trends in Enterprise Analytics2023 Trends in Enterprise Analytics
2023 Trends in Enterprise AnalyticsDATAVERSITY
 
Data Strategy Best Practices
Data Strategy Best PracticesData Strategy Best Practices
Data Strategy Best PracticesDATAVERSITY
 
Who Should Own Data Governance – IT or Business?
Who Should Own Data Governance – IT or Business?Who Should Own Data Governance – IT or Business?
Who Should Own Data Governance – IT or Business?DATAVERSITY
 
Data Management Best Practices
Data Management Best PracticesData Management Best Practices
Data Management Best PracticesDATAVERSITY
 
MLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive AdvantageMLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive AdvantageDATAVERSITY
 

Plus de DATAVERSITY (20)

Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
 
Data at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and GovernanceData at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and Governance
 
Exploring Levels of Data Literacy
Exploring Levels of Data LiteracyExploring Levels of Data Literacy
Exploring Levels of Data Literacy
 
Building a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsBuilding a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business Goals
 
Make Data Work for You
Make Data Work for YouMake Data Work for You
Make Data Work for You
 
Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?
 
Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?
 
Data Modeling Fundamentals
Data Modeling FundamentalsData Modeling Fundamentals
Data Modeling Fundamentals
 
Showing ROI for Your Analytic Project
Showing ROI for Your Analytic ProjectShowing ROI for Your Analytic Project
Showing ROI for Your Analytic Project
 
How a Semantic Layer Makes Data Mesh Work at Scale
How a Semantic Layer Makes  Data Mesh Work at ScaleHow a Semantic Layer Makes  Data Mesh Work at Scale
How a Semantic Layer Makes Data Mesh Work at Scale
 
Is Enterprise Data Literacy Possible?
Is Enterprise Data Literacy Possible?Is Enterprise Data Literacy Possible?
Is Enterprise Data Literacy Possible?
 
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
 
Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?
 
Data Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and ForwardsData Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and Forwards
 
Data Governance Trends and Best Practices To Implement Today
Data Governance Trends and Best Practices To Implement TodayData Governance Trends and Best Practices To Implement Today
Data Governance Trends and Best Practices To Implement Today
 
2023 Trends in Enterprise Analytics
2023 Trends in Enterprise Analytics2023 Trends in Enterprise Analytics
2023 Trends in Enterprise Analytics
 
Data Strategy Best Practices
Data Strategy Best PracticesData Strategy Best Practices
Data Strategy Best Practices
 
Who Should Own Data Governance – IT or Business?
Who Should Own Data Governance – IT or Business?Who Should Own Data Governance – IT or Business?
Who Should Own Data Governance – IT or Business?
 
Data Management Best Practices
Data Management Best PracticesData Management Best Practices
Data Management Best Practices
 
MLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive AdvantageMLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive Advantage
 

Dernier

Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DayH2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DaySri Ambati
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostZilliz
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfRankYa
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 

Dernier (20)

Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DayH2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdf
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 

Data Modeling Still Important for Big Data Insights- Big data represents a technological and cultural paradigm shift from traditional data management approaches due to the volume, velocity and variety of data. - Data modeling remains relevant for big data as it provides a single consistent view of information from all sources and helps address issues like data quality, governance and metadata management.- Metadata is critical for big data analytics to deliver value and insights. Without proper metadata, significant time is wasted cleaning and preparing poor quality data.- Integrating traditional data warehouses with big data sources can provide valuable new insights by analyzing existing and new data together. This helps with initiatives like customer

  • 1. Data Modeling for Big Data Donna Burbank Global Data Strategy Ltd. Lessons in Data Modeling DATAVERSITY Series August 25th, 2016
  • 2. Global Data Strategy, Ltd. 2016 Donna Burbank Donna is a recognised industry expert in information management with over 20 years of experience in data strategy, information management, data modeling, metadata management, and enterprise architecture. Her background is multi- faceted across consulting, product development, product management, brand strategy, marketing, and business leadership. She is currently the Managing Director at Global Data Strategy, Ltd., an international information management consulting company that specialises in the alignment of business drivers with data-centric technology. In past roles, she has served in key brand strategy and product management roles at CA Technologies and Embarcadero Technologies for several of the leading data management products in the market. As an active contributor to the data management community, she is a long time DAMA International member and is the President of the DAMA Rocky Mountain chapter. She was also on the review committee for the Object Management Group’s Information Management Metamodel (IMM) and a member of the OMG’s Finalization Taskforce for the Business Process Modeling Notation (BPMN). She has worked with dozens of Fortune 500 companies worldwide in the Americas, Europe, Asia, and Africa and speaks regularly at industry conferences. She has co-authored two books: Data Modeling for the Business and Data Modeling Made Simple with ERwin Data Modeler and is a regular contributor to industry publications such as DATAVERSITY, EM360, & TDAN. She can be reached at donna.burbank@globaldatastrategy.com Donna is based in Boulder, Colorado, USA. 2 Follow on Twitter @donnaburbank
  • 3. Global Data Strategy, Ltd. 2016 Lessons in Data Modeling Series • July 28th Why a Data Model is an Important Part of your Data Strategy • August 25th Data Modeling for Big Data • September 22nd UML for Data Modeling – When Does it Make Sense? • October 27th Data Modeling & Metadata Management • December 6th Data Modeling for XML and JSON 3 This Year’s Line Up
  • 4. Global Data Strategy, Ltd. 2016 Agenda • Big Data –A Technical & Cultural Paradigm Shift • Big Data in the Larger Information Management Landscape • Modeling & Technology Considerations • Organizational Considerations: The Role of the Data Architect in the World of Big Data • Summary & Questions 4 What we’ll cover today
  • 5. Global Data Strategy, Ltd. 2016 What is a Model? 5 The Importance of Context & Definitions X X
  • 6. 6 Big Data –A Technical & Cultural Paradigm Shift
  • 7. Global Data Strategy, Ltd. 2016 Technological and Culture Shift in Society 1960 1970 1980 1990 2000 2010
  • 8. Global Data Strategy, Ltd. 2016 Technological and Culture Shift in Data Management 1960 1970 1980 1990 2000 2010 • Mainframes • Flat Files • In-House Development • Waterfall Methodology • Individual PC’s • Relational Databases • Democratization of Computing • Client/Server Computing • Relational Databases • Data Warehousing • Packaged Applications • RAD Development • Dot COM Revolution • Changing Business Models • Dot COM Bubble Bursts • Data Integration • “Do More with Less” • Cloud Computing • “Big Data”, NoSQL • Data Lakes • Agile Development
  • 9. Global Data Strategy, Ltd. 2016 Traditional Relational Technologies and “Big Data”: a Paradigm Shift Traditional • Top-Down, Hierarchical • Design, then Implement • “Passive”, Push technology • “Manageable” volumes of information • “Stable” rate of change • Business Intelligence Big Data • Distributed, Democratic • Discover and Analyze • Collaborative, Interactive • Massive volumes of information • Rapid and Exponential rate of growth • Statistical Analysis Design Implement Discover Analyze
  • 10. Global Data Strategy, Ltd. 2016 “Traditional” way of Looking at the World: Hierarchies • Carolus Linnaeus in 1735 established a hierarchy/taxonomy for organizing and identifying biological systems. Kingdom Phylum Class Order Family Genus Species
  • 11. Global Data Strategy, Ltd. 2016 “New” Way of Looking at the World - Emergence In philosophy, systems theory, science, and art, emergence is the way complex systems and patterns arise out of a multiplicity of relatively simple interactions. - Wikipedia I love my new Levis jeans. Is Levi coming to my party? Sale #LEVIS 20% at Macys. LOL. TTYL. Leving soon.
  • 12. 12 Big Data within the Larger Information Management Landscape
  • 13. Global Data Strategy, Ltd. 2016 Big Data is Part of a Larger Enterprise Landscape 13 A Successful Data Strategy Requires Many Inter-related Disciplines “Top-Down” alignment with business priorities “Bottom-Up” management & inventory of data sources Managing the people, process, policies & culture around data Coordinating & integrating disparate data sources Leveraging & managing data for strategic advantage
  • 14. Global Data Strategy, Ltd. 2016 What is Big Data? • Big Data is often characterised by the “3 Vs”: • Volume: Is there a high volume of data? (e.g. terabytes per day) • Velocity: Is data generated or changed at a rapid pace? (e.g. per second, sub-second) • Variety: Is data stored across multiple formats? (e.g. machine data, OSS data, log files) • The ability to understand and manage these sources and integrate them into the larger Business Intelligence ecosystem can provide the ability to gain valuable insights from data. • This ability leads to the “4th V” of Big Data – Value. • Value: Valuable insights gained from the ability to analyze and discover new patterns and trends from high-volume and/or cross-platform systems. • Volume • Velocity • Variety Value
  • 15. Global Data Strategy, Ltd. 2016 The 5th “V” - Veracity • Only through proper Governance, Data Quality Management, Metadata Management, etc., can organizations achieve the 5th “V” – Veracity. • Veracity: Trust in the accuracy, quality and content of the organizations’ information assets. • i.e. The hard work doesn’t go away with Big Data Raw data used in Self-Service Analytics and BI environments is often so poor that many data scientists and BI professionals spend an estimated 50 – 90% of their time cleaning and reformatting data to make it fit for purpose.(4 Source: DataCenterJournal.com The absence of commonly understood and shared metadata and data definitions is cited as one of the main impediments to the success of Data Lakes. Source: Radiant Advisors Correcting poor data quality is a Data Scientist’s least favorite task, consuming on average 80% of their working day Source: Forbes 2016 71% of interviewees expect digitization to grow their business. But 70% say the biggest barrier is finding the right data; 62% cite inconsistent data Source: Stibo Systems Data Science Data Lakes Data Science Digitization & Data Quality
  • 16. Global Data Strategy, Ltd. 2016 Metadata & Big Data Analytics • Modern advances in data analytics & big data storage provide a wealth of opportunities • But the analytics are only as good as the quality of the underlying data • Metadata is critical – where did the data come from? What was its intended purpose? What are the units of measure? What is the definition of key terms? • Good data analysis is based on good data. Good data requires good metadata. 16
  • 17. Global Data Strategy, Ltd. 2016 Are Data Models Still Relevant? From Data Modeling for the Business by Hoberman, Burbank, Bradley, Technics Publications, 2009
  • 18. Global Data Strategy, Ltd. 2016 Metadata & Big Data Analytics 18 “Our analysis shows that energy usage with Smart Meters increases by 5% for each percentage point decrease in temperature compared to a 20% increase for traditional thermostat customers.”
  • 19. Global Data Strategy, Ltd. 2016 Metadata & Big Data Analytics 19 • What was the source for the weather data? • Were readings taking daily, monthly, weekly? Averages or actuals? • What was the original purpose & format for the readings? • Were temperatures in Celsius or Fahrenheit? • Etc. • Were readings taken by meter readings or billing amounts? • Were readings taking daily, monthly, weekly? Averages or actuals? • Were temperatures in Celsius or Fahrenheit? • Meter readings for were in completely different formats. It took us weeks to standardize them. • Etc. • Is Usage by Address, by Individual, or by Household? • Are households determined by residence or relationships? • Etc. “Our analysis shows that energy usage with Smart Meters increases by 5% for each percentage point decrease in temperature compared to a 20% increase for traditional thermostat customers.”
  • 20. Global Data Strategy, Ltd. 2016 The Business Case Remains the Same Single, Consistent View of Information – from all Sources Tell me what customers are saying about our product. Sybase SAP DB2 Oracle SQL Server SQL Azure Informix Teradata DBA Which customer database do you want me to pull this from? We have 25. Data Architect And, by the way, the databases all store customer information in a different format. “CUST_NM” on DB2, “cust_last_nm” on Oracle, etc. It’s a mess. I love my new Levis jeans. Is Levi coming to my party? Sale #LEVIS 20% at Macys. LOL. TTYL. Leving soon. Big Data Traditional/Relational Data Data Scientist I’ll need to input the raw data from thousands of sources, and write a program to parse and analyze the relevant information. 20
  • 21. Global Data Strategy, Ltd. 2016 Combining DW & Big Data Can Provide Valuable Information • There are numerous ways to gain value from data • Relational Database and Data Warehouse systems are one key source of value • Customer information • Product information • Big Data can offer new insights from data • From new data sources (e.g. social media, IoT) • By correlating multiple new and existing data sources (e.g. network patterns & customer data) • Integrating DW and Big Data can provide valuable new insights. • Examples include: • Customer Experience Optimization • Churn Management • Products & Services Innovation New InsightsData Warehouse 21
  • 22. Global Data Strategy, Ltd. 2016 Case Study: Facebook’s Data Warehouse • Started with Big Data using Hadoop, then saw the need for a traditional Data Warehouse • Ken Rudin, director of analytics at Facebook presented keynote at TDWI Chicago in May 2013—video replay available. • Needed a single source of reference for core business data • All data in one place- “managed chaos” • Define the core elements of the business, leave the rest alone • 100,000s of tables in Hadoop down to a few dozen in Data Warehouse • What data warehouse was good at: • Operational analysis (e.g. how many users logged in by region?) • Faster query time (1 minute on DW, over 1 hour on Hadoop) • What Big Data was good at: • Exploratory analysis (e.g. Where are users posting from—how can we infer a location if one is not listed?) “The genius of AND and the tyranny of OR” – Jim Collins, author of “Good to Great” -> i.e. Both solutions have their place 22
  • 23. Global Data Strategy, Ltd. 2016 Case Study: Facebook’s Data Warehouse • Challenge in both Big Data and Data Warehouse solutions — Business Definitions & Metadata • e.g. How many users logged in yesterday? • What do you mean by user? • Does user include mobile devices? • If a user posted from Spotify, is that a user? • Sound familiar? 23
  • 25. Global Data Strategy, Ltd. 2016 Levels of Data Modeling 25 Conceptual Logical Physical Purpose Communication & Definition of Business Terms & Rules Clarification & Detail of Business Rules & Data Structures Technical Implementation on a Physical Database Audience Business Stakeholders Data Architecture Business Analysts DBAs Developers Business Concepts Data Entities Physical Tables
  • 26. Global Data Strategy, Ltd. 2016 Conceptual Data Model • Creation & Communication of Business Rules and Definitions
  • 27. Global Data Strategy, Ltd. 2016 The Importance of Definitions • Definitions are as important as the data elements themselves. • Many data-related business issues are caused by unclear or ill-defined terms 27 What is in a name? What do you mean by “customer”? We’re calculating “total sales” differently in each region! Sales is using a different “monthly calendar” than Finance. How are we defining a “household”? What’s an “equity derivative”? What’s a “PEG ratio”? “API” as in “Application Programming Interface?” or “American Petroleum Institute”? Or a bee? What’s the difference between an “ingredient” and a “raw material”?
  • 28. Global Data Strategy, Ltd. 2016 The 5th “V” - Veracity • Remember the 5th “V” – Veracity! • Definitions & Metadata are just as important with Big Data as with traditional systems. The absence of commonly understood and shared metadata and data definitions is cited as one of the main impediments to the success of Data Lakes. Source: Radiant Advisors
  • 29. Global Data Strategy, Ltd. 2016 Logical Data Model • More Detailed, Potential Pre-cursor To Physical Design • Definition of Key Business Rules, Relationships, and Attributes
  • 30. Global Data Strategy, Ltd. 2016 Physical Data Model • Optimization and design of a physical database for storage and performance.
  • 31. Global Data Strategy, Ltd. 2016 Traditional Model – “Schema on Write” Sybase MySQL Oracle Data Models Teradata Sybase SQL Server DB2 Teradata SQL Server DB2 MySQLSQL Azure SQL Azure Oracle • With the traditional relational database paradigm, forward & reverse engineering can both create and read database structure using a graphical data model.
  • 32. Global Data Strategy, Ltd. 2016 Big Data Model - “Schema on Read” • With the Big Data and NoSQL paradigm, “Schema-on-Read” means you do not need to know how you will use your data when you are storing it. 32 Hive HDFS File system hdfs dfs -put /local/path/userdump /hdfs/path/data/users Table Structures Create table … Exploration Analysis Analyze & understand the data. Build a data structure to suite your needs. • You do need to know how you will use your data when you are using it and model accordingly. • i.e. it’s not magic. • For example, you may first place the data on HDFS in files, then apply a table structure in Hive. • Apache Hive provides a mechanism to project structure onto the data in Hadoop and to query that data using a SQL-like language called HiveQL (HQL).
  • 33. Global Data Strategy, Ltd. 2016 Data Modeling in the Big Data Ecosystem Hive HBase Structured Data Unstructured Data MapReduce / AnalyticsHadoop Framework HDFS File System JSON / XML HQL Semi-structured Data JSON XML JSON Data Sources
  • 34. Global Data Strategy, Ltd. 2016 NoSQL – Key Value Databases • NoSQL Databases are often optimal solutions for flexibility & performance in certain scenarios. • One common NoSQL database is a key-value pair database (e.g. Redis, Oracle NoSQL, etc.) • They can support extremely high volumes of records & state changes per second through distributed processing and distributed storage. • Use cases include: Managing user sessions in web applications, online gaming, online shopping carts, etc. • The structure is often created by the application code, not within a database or metadata structure. • Metadata for NoSQL databases is typically minimal or non-existent. • The structure & metadata is generally determined by the application code 34 Key Value 1839047 John Doe, Prepaid, 40.00 9287320 01/01/2008, 50.00, Green
  • 35. Global Data Strategy, Ltd. 2016 * NoSQL Metadata – Document Databases • Document databases are popular ways to store unstructured information in a flexible way (e.g. multimedia, social media posts, etc. ) • Each Collection can contain numerous Documents which could all contain different fields. 35 • Some data modeling can be done, and some data modeling tools support this (e.g. MongoDB). * Example from docs.mongodb.com {type: “Artifact”, medium: “Ceramic” country: “China”, } {type: “Book”, title: “Ancient China” country: “China”, }
  • 36. Global Data Strategy, Ltd. 2016 Graph Relationships • Graph databases are ideal for analyzing metadata relationships between objects and finding patterns in those relationships. • Common use cases for graph relationship metadata analysis include: • Fraud detection - e.g. financial transactions • Threat detection - e.g. email and phone patterns • Marketing – e.g. social media connections, product recommendation engines • Network optimization - e.g. IoT, Telecommunications 36 The data model is the database.
  • 37. 37 Organizational Considerations: The Role of the Data Architect in the World of Big Data
  • 38. Global Data Strategy, Ltd. 2016 Roles & Culture DBAs • Analytical • Structured • Project & Task focused • Cautious – identifies risks • “Just let me code!” Business Executive • Results-Oriented • Optimistic – Identifies opportunities • “Big Picture” focused • “I’m busy.” • “What’s the business opportunity?” Data Modelers • Analytical • Structured • “Big Picture” focused • Passionate • Likes to Talk • Can be considered “old school” • “Let me tell you about my data model!” Big Data Vendors • It’s magic! • It’s easy! • No modeling needed! Data Scientist • Looks for opportunities • Likes to explore • Seen as “modern” • Seen as “hip” & “sexy”
  • 39. Global Data Strategy, Ltd. 2016 New Operating Model: Interactions Between New & Existing Roles Data Scientist Hadoop Administrator Data Architect Privacy Analyst ETL Developer Network Administrator Existing Roles New Roles Alignment
  • 40. Global Data Strategy, Ltd. 2016 Data-Driven Business • In the current environment of data-driven business, Data Professionals have an opportunity to have a “seat at the table” • Business Optimization: Becoming a Data Driven Company - Making the Business More Efficient • Better Marketing Campaigns • Higher quality customer data, 360 view of customer, competitive info, etc. • Better Products • Data-Driven product development, Customer usage monitoring, etc. • Better Customer Support • Linking customer data with support logs, network outages, etc. • Transformative: Becoming a Data Company - Changing the Business Model via Data – data becomes the product • Monetization of Information: examples across multiple industries including: • Telco: location information, usage & search data, etc. • Retail: Click-stream data, purchasing patterns • Social Media: social & family connections, purchasing trends & recommendations, etc. • Energy: Sensor data, consumer usage patterns, smart metering, etc. 40
  • 41. Global Data Strategy, Ltd. 2016 Case Study: Consumer Energy Company • For the consumer energy sector Big Data and Smart Meters are transforming the ways of doing business and interacting with customers. • Moving away from traditional data use cases of metering & billing. • Smart meters allow customers to be in control of their energy usage. • Control over energy usage with connected systems • Custom Energy Reports & Usage • Smart Billing based on usage times • As energy usage declines, data is becoming the true business asset for this energy company. • Monetization of non-personal data is a future consideration. • While the Big Data Opportunity is crucial, equally important are the traditional data sources • New Data Quality Tools in place for operational and DW data • Data Governance Program analyzing data in relation to business processes & roles • Business-critical data elements identified and definitions created Business Transformation via Data
  • 42. Global Data Strategy, Ltd. 2016 Data-Driven Business Evolution 42 Data is a key component for new business opportunities New Business Model • Consumer-Driven Smart Metering • Connected Devices, IoT • Proactive service monitoring • Monetization of usage data Traditional Business Model • Usage-based billing • Issue-driven customer service More Efficient Business Model • More efficient billing • Faster customer service response • More consumer information re: energy efficiency, etc. Databases Big DataData Quality Data Governance Metadata Management
  • 43. Global Data Strategy, Ltd. 2016 Summary • We are in a period of “disruptive” technology with new opportunities • Rapid rate of change, Massive volumes of data • Social changes: more participatory, engaged • New business models based on data • Create a fit-for-purpose solution • Relational databases are still great for operational systems & data warehouses • Big Data offers new opportunities for analysis across large volumes of diverse data • As with any Age of Change, the basics still apply • The “hard stuff” still needs to be done: analysis, metadata definition, data models, etc. • Governance and Operating Models are critical • Data models are valuable to document business requirements and technical implementation • Have fun! This is an exciting time to be in Information Management
  • 44. Global Data Strategy, Ltd. 2016 About Global Data Strategy, Ltd • Global Data Strategy is an international information management consulting company that specializes in the alignment of business drivers with data-centric technology. • Our passion is data, and helping organizations enrich their business opportunities through data and information. • Our core values center around providing solutions that are: • Business-Driven: We put the needs of your business first, before we look at any technology solution. • Clear & Relevant: We provide clear explanations using real-world examples. • Customized & Right-Sized: Our implementations are based on the unique needs of your organization’s size, corporate culture, and geography. • High Quality & Technically Precise: We pride ourselves in excellence of execution, with years of technical expertise in the industry. 44 Data-Driven Business Transformation Business Strategy Aligned With Data Strategy Visit www.globaldatastrategy.com for more information
  • 45. Global Data Strategy, Ltd. 2016 Contact Info • Email: donna.burbank@globaldatastrategy.com • Twitter: @donnaburbank @GlobalDataStrat • Website: www.globaldatastrategy.com • Company Linkedin: https://www.linkedin.com/company/global-data-strategy-ltd • Personal Linkedin: https://www.linkedin.com/in/donnaburbank 45
  • 46. Global Data Strategy, Ltd. 2016 Lessons in Data Modeling Series • July 28th Why a Data Model is an Important Part of your Data Strategy • August 25th Data Modeling for Big Data • September 22nd UML for Data Modeling – When Does it Make Sense? • October 27th Data Modeling & Metadata Management • December 6th Data Modeling for XML and JSON 46 Join us next month
  • 47. Global Data Strategy, Ltd. 2016 Questions? 47 Thoughts? Ideas?