SlideShare une entreprise Scribd logo
1  sur  36
Télécharger pour lire hors ligne
The World is changing Rapidly and Here
is how BIG DATA is contributing
Prepared by: Madhu Reddiboina
What are you going to get out of this session today?
Recognize how the
world is changing…
Rapidly!
Understand the role
of Big Data and
Machine Learning in
this Transformation
One approach to
Harvest Big Data
within an Enterprise
How is the World Changing?
Accelerated Disruption
Lending and Borrowing
Used to be
Borrower Apply for Short
term Credit
Banks/Services Package
Loans
Securitization InvestorsMutual
Funds
Borrower Investors
Lending Platforms
What it is now
Higher Education and Learning
Apply for
Admission
Colleges &
Universities
Education Platforms
Massive Open Online Courses (MOOCs)
Instructors
Teachers
Professors
Students In Person
Courses
Banks Loans
GraduationEntrance
Exam
Students Instructors
Teachers
Professors
Used to be
What it is now
Insurance:
Human Capital
Sourcing:
Farming: Retail:
Life Sciences: Energy and Utilities: Media and Publishing:Financial Services
Many Other Disruptors
Patterns of Disruption
New Business Models
Completely New Business
Models that are disrupting
and disposing existing
businesses and creating
white space for new
businesses
Breaking the Barriers
Technology, Big Data
and Analytics breaking
the barriers that were
never possible before
Emerging Asian Economies
Almost half a billion people
are connecting to the internet
for the first time from Asia
creating many new
opportunities for every
business to market and
service them.
Venture Backed Big Data Start-ups
Venture Funded Big-data STARTUPS – Globally
Data from
Q1 2015
Source: Crunchbase
Big Data Landscape 2016
Source: Matt Turck, Jim Hao – FirstMark Capital
Unicorns
There are 143 companies representing more than half Trillion dollars in
valuation
What is Catalyzing this Disruption?
Moore’s Law Internet Platforms
(Mobile, Social, Cloud, IOT/IOE)
Open Source
Adoption & Big
Data
Convergence
Internet Platforms
IOT Revolution
Internet
Internet of Everything
Social Revolution
Connected Customer
Mobile Revolution
APP Economy
Service Oriented Architecture
+ Cloud Revolution
API Economy
Service Platforms
Big Data
& Machine Learning
How are the Start-ups Exploiting Big Data and ML?
Connected
Consumers Providers
Demand Supply
Big Data and Machine Learning
Crash Course
“Big Data is like teenage sex: everyone talks about it, nobody really
knows how to do it, so everyone claims they are doing it..”
- Dan Ariely, Professor at Duke University
Big data is a broad term for data sets so large or complex that
traditional data processing applications are inadequate. Challenges
include analysis, capture, data curation, search, sharing, storage,
transfer, visualization, querying and information privacy.
-- Wikipedia
Big data is high-volume, high-velocity and/or high-variety information
assets that demand cost-effective, innovative forms of information
processing that enable enhanced insight, decision making, and
process automation.
- Gartner (IT Glossary)
What is Big Data?
What are some examples of Big Data sets
Click-Stream Data from Websites
Data from Sensors
• Sensors on a Car
• Sensors on Industrial Machinery
• Sensors on Wifi Networks
• Sensors on Airplanes
• Sensors on IOT/IOE Devices
System Log Data
What are some examples of Big Data sets
Geolocation Data
Social Media Comments and Likes
Human Gnome
What is Machine Learning?
"Field of study that gives computers the ability to learn without
being explicitly programmed.”
- Arthur Samuel (1959)
What is Machine Learning?
“In the past decade, machine learning has given us self-driving cars,
practical machine translation, effective web search, and a vastly
improved understanding of the human genome.”
-Andrew NG. - Assoc. Professor, Stanford
At a high level there are two forms of Machine Learning
– Supervised Learning
– Unsupervised Learning
Recommendation Engine
Machine Learning Applications – Examples
Recommendation Engine
Machine Learning Applications – Examples
Continuous
Learning
System
How Does a Recommendation System Work?
On-line Web
Application
Recommendation
Engine
Closed
Loop
Process
Real-time Click-
Stream Data
New Business
Rules
Data
Engineering
Data
Analysis
Feature
Engineering
Model
Creation
Model
Evaluation
and Testing
Transaction History
Metadata
Upfront Data Science Work
Image Recognition
Machine Learning Application Examples
Autonomous Driving
Machine Learning Application Examples
An Approach To Harvest Big Data
Solution Architecture of a Data Platform To Enable…..
Descriptive
Analytics
Advanced
Analytics
Streaming
Analytics
People
Process
Technology
One flavor of information management framework and process infrastructure
to operationalize the data eco system
Infrastructure Management
Data Governance
Data Quality Management
Identity Management & Security
Metadata Management & Data Discovery
DataAcquisition
DataIngestion
DataPreparation
BusinessAccess
Services
Information
Consumers
Information Management Framework
Infrastructure Management
Data Governance
Data Quality Management
Identity Management & Security
Metadata Management & Data Discovery
Data Acquisition Data Ingestion
Business
Access Services
Information
ConsumersData Preparation
Unstructured&
SemiStructured
DataSources
Workflow Orchestration
Data
Engineering
Data Quality
Profiling
Search & Indexing
Data
Enrichment
Data
Aggregation
HadoopFileSystem(HDFS)
Dashboards & Reports
Customers
External Users
Internal Users
Summarized Data Marts
Integrated Data Extracts
Reference/Master
DataSources
Operational
Transactional
Sources
LandingZone
Custom
Interfaces
Enterprise Data Hub
Historical
ParquetIngest
AnalyticsSandbox
Archive
To Enable Descriptive Analytics
Infrastructure Management
Data Governance
Data Quality Management
Identity Management & Security
Metadata Management & Data Discovery
Data Acquisition Data Ingestion
Business
Access Services
Information
ConsumersData Preparation
Advanced Analytics
Ad Hoc Data AnalysisDetailed Operational Marts
Data Analysts
Workflow Orchestration
Data
Engineering
Data Quality
Profiling
Search & Indexing
Data
Enrichment
Data
Aggregation
HadoopFileSystem(HDFS)
Summarized Data Marts
Integrated Data Extracts
LandingZone
Custom
Interfaces
Enterprise Data Hub
Historical
ParquetIngest
AnalyticsSandbox
Archive
Dashboards & Reports
Customers
External Users
Internal Users
To Enable Advanced Analytics
Unstructured&
SemiStructured
DataSources
Reference/Master
DataSources
Operational
Transactional
Sources
Infrastructure Management
Data Governance
Data Quality Management
Identity Management & Security
Metadata Management & Data Discovery
Data Acquisition Data Ingestion
Business
Access Services
Information
ConsumersData Preparation
Custom Web Service Interfaces
End Users
HadoopFileSystem(HDFS)
LandingZone
Real-time Streaming
Data
Custom
Interfaces
To Enable Streaming Analytics
Unstructured&
SemiStructured
DataSources
Reference/Master
DataSources
Operational
Transactional
Sources
Infrastructure Management
Data Governance
Data Quality Management
Identity Management & Security
Metadata Management & Data Discovery
JAMS Scheduler
Data Integration
Machine Learning &
Stream Processing
Data flow Orchestration
Advanced Analytics
Search & Indexing
SQL Querying & Analytic
Access
Data Storage
Custom Web Service
Integrations
SFTP
NFS
Custom Built Meta Data Tools
Custom Built Data QC Tools
Kerberos for Authentication Sentry for Authorization
Custom Built Processes
Data Delivery
Dashboards, Reports &
Data Analysis
Tableau Server
Tableau Desktop
Applications
Custom Web
Applications
Hive Data marts
Tableau Data Extracts
Data Analysts
End Users
Data Acquisition Data Ingestion Data Preparation
Business
Access Services
Information
Consumers
Salesforce Applications
Custom Web Services
Customers
External Users
Internal Users
Platform Technologies
Database Technology Trends
THANK YOU
Questions?
mreddiboina@gmail.com

Contenu connexe

Tendances

Approximate Semantic Matching of Heterogeneous Events
Approximate Semantic Matching of Heterogeneous EventsApproximate Semantic Matching of Heterogeneous Events
Approximate Semantic Matching of Heterogeneous Events
Edward Curry
 
The Sharala Axryd Story
The Sharala Axryd StoryThe Sharala Axryd Story
The Sharala Axryd Story
Sharala Axryd
 
Challenges Ahead for Converging Financial Data
Challenges Ahead for Converging Financial DataChallenges Ahead for Converging Financial Data
Challenges Ahead for Converging Financial Data
Edward Curry
 
PatternLanguageOfData
PatternLanguageOfDataPatternLanguageOfData
PatternLanguageOfData
kimErwin
 

Tendances (20)

SMART Seminar Series: "From Big Data to Smart data"
SMART Seminar Series: "From Big Data to Smart data"SMART Seminar Series: "From Big Data to Smart data"
SMART Seminar Series: "From Big Data to Smart data"
 
Understanding Cognitive Applications: A Framework - Sue Feldman
Understanding Cognitive Applications:  A Framework - Sue FeldmanUnderstanding Cognitive Applications:  A Framework - Sue Feldman
Understanding Cognitive Applications: A Framework - Sue Feldman
 
Approximate Semantic Matching of Heterogeneous Events
Approximate Semantic Matching of Heterogeneous EventsApproximate Semantic Matching of Heterogeneous Events
Approximate Semantic Matching of Heterogeneous Events
 
New professional careers in data
New professional careers in dataNew professional careers in data
New professional careers in data
 
Metadata Standards and Organizational Resource Allocation: A Case for the Eff...
Metadata Standards and Organizational Resource Allocation: A Case for the Eff...Metadata Standards and Organizational Resource Allocation: A Case for the Eff...
Metadata Standards and Organizational Resource Allocation: A Case for the Eff...
 
Matt McIlwain opening keynote
Matt McIlwain opening keynoteMatt McIlwain opening keynote
Matt McIlwain opening keynote
 
Earley Executive Roundtable on Data Analytics - Session 1 - The Business Pote...
Earley Executive Roundtable on Data Analytics - Session 1 - The Business Pote...Earley Executive Roundtable on Data Analytics - Session 1 - The Business Pote...
Earley Executive Roundtable on Data Analytics - Session 1 - The Business Pote...
 
Getstarteddssd12717sd
Getstarteddssd12717sdGetstarteddssd12717sd
Getstarteddssd12717sd
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
The Sharala Axryd Story
The Sharala Axryd StoryThe Sharala Axryd Story
The Sharala Axryd Story
 
Challenges Ahead for Converging Financial Data
Challenges Ahead for Converging Financial DataChallenges Ahead for Converging Financial Data
Challenges Ahead for Converging Financial Data
 
PatternLanguageOfData
PatternLanguageOfDataPatternLanguageOfData
PatternLanguageOfData
 
Ai presentatie
Ai presentatieAi presentatie
Ai presentatie
 
Business analytics
Business analyticsBusiness analytics
Business analytics
 
Data Lake: A simple introduction
Data Lake: A simple introductionData Lake: A simple introduction
Data Lake: A simple introduction
 
Deck 92-146 (3)
Deck 92-146 (3)Deck 92-146 (3)
Deck 92-146 (3)
 
Dark data by Worapol Alex Pongpech
Dark data by Worapol Alex PongpechDark data by Worapol Alex Pongpech
Dark data by Worapol Alex Pongpech
 
Fraud Detection with Graphs at the Danish Business Authority
Fraud Detection with Graphs at the Danish Business AuthorityFraud Detection with Graphs at the Danish Business Authority
Fraud Detection with Graphs at the Danish Business Authority
 
Identity Matters
Identity MattersIdentity Matters
Identity Matters
 
Smart Data Webinar: A Roadmap for Deploying Modern AI in Business
Smart Data Webinar: A Roadmap for Deploying Modern AI in BusinessSmart Data Webinar: A Roadmap for Deploying Modern AI in Business
Smart Data Webinar: A Roadmap for Deploying Modern AI in Business
 

En vedette

How Much Is Your Patent Portfolio Worth
How Much Is Your Patent Portfolio WorthHow Much Is Your Patent Portfolio Worth
How Much Is Your Patent Portfolio Worth
jrstorella
 
C.V. of Mohamed Sherif Hegazi
C.V. of Mohamed Sherif HegaziC.V. of Mohamed Sherif Hegazi
C.V. of Mohamed Sherif Hegazi
Mohamed S. Hegazi
 
2.0 rancangan pengajaran harian yang menggunakan gabungan pendidikan seni vis...
2.0 rancangan pengajaran harian yang menggunakan gabungan pendidikan seni vis...2.0 rancangan pengajaran harian yang menggunakan gabungan pendidikan seni vis...
2.0 rancangan pengajaran harian yang menggunakan gabungan pendidikan seni vis...
Selvasangeetha Selvarajan
 

En vedette (18)

Hieu nguoi
Hieu nguoiHieu nguoi
Hieu nguoi
 
Gieo niem tin
Gieo niem tinGieo niem tin
Gieo niem tin
 
Dan tron cuoc doi
Dan tron cuoc doiDan tron cuoc doi
Dan tron cuoc doi
 
How Much Is Your Patent Portfolio Worth
How Much Is Your Patent Portfolio WorthHow Much Is Your Patent Portfolio Worth
How Much Is Your Patent Portfolio Worth
 
NORTH ELEVATION
NORTH ELEVATIONNORTH ELEVATION
NORTH ELEVATION
 
Segunda ingles
Segunda inglesSegunda ingles
Segunda ingles
 
Bases del Concurso
Bases del Concurso Bases del Concurso
Bases del Concurso
 
C.V. of Mohamed Sherif Hegazi
C.V. of Mohamed Sherif HegaziC.V. of Mohamed Sherif Hegazi
C.V. of Mohamed Sherif Hegazi
 
Quan ly tien bac
Quan ly tien bacQuan ly tien bac
Quan ly tien bac
 
Niem vui that
Niem vui thatNiem vui that
Niem vui that
 
Application of Decision Analysis & Portfolio Management to the Generic Drug I...
Application of Decision Analysis & Portfolio Management to the Generic Drug I...Application of Decision Analysis & Portfolio Management to the Generic Drug I...
Application of Decision Analysis & Portfolio Management to the Generic Drug I...
 
Phai chang dct cua basn qua nho be
Phai chang dct cua basn qua nho bePhai chang dct cua basn qua nho be
Phai chang dct cua basn qua nho be
 
Sau dieu doi tra
Sau dieu doi traSau dieu doi tra
Sau dieu doi tra
 
BK-Rajajayoga - An Evidence-Based Mental Silence Type of Meditation
BK-Rajajayoga - An Evidence-Based Mental Silence Type of MeditationBK-Rajajayoga - An Evidence-Based Mental Silence Type of Meditation
BK-Rajajayoga - An Evidence-Based Mental Silence Type of Meditation
 
Ciclos biogeoquimicos
Ciclos biogeoquimicosCiclos biogeoquimicos
Ciclos biogeoquimicos
 
2.0 rancangan pengajaran harian yang menggunakan gabungan pendidikan seni vis...
2.0 rancangan pengajaran harian yang menggunakan gabungan pendidikan seni vis...2.0 rancangan pengajaran harian yang menggunakan gabungan pendidikan seni vis...
2.0 rancangan pengajaran harian yang menggunakan gabungan pendidikan seni vis...
 
The Tokamak Project
The Tokamak ProjectThe Tokamak Project
The Tokamak Project
 
Cloud Security Mechanisms
Cloud Security MechanismsCloud Security Mechanisms
Cloud Security Mechanisms
 

Similaire à Big_Data_ML_Madhu_Reddiboina

02 a holistic approach to big data
02 a holistic approach to big data02 a holistic approach to big data
02 a holistic approach to big data
Raul Chong
 

Similaire à Big_Data_ML_Madhu_Reddiboina (20)

Web mining
Web miningWeb mining
Web mining
 
02 a holistic approach to big data
02 a holistic approach to big data02 a holistic approach to big data
02 a holistic approach to big data
 
Introduction to Data Science 1118.pptx
Introduction to Data Science 1118.pptxIntroduction to Data Science 1118.pptx
Introduction to Data Science 1118.pptx
 
Big Data Analytics_Unit1.pptx
Big Data Analytics_Unit1.pptxBig Data Analytics_Unit1.pptx
Big Data Analytics_Unit1.pptx
 
MAKING SENSE OF IOT DATA W/ BIG DATA + DATA SCIENCE - CHARLES CAI
MAKING SENSE OF IOT DATA W/ BIG DATA + DATA SCIENCE - CHARLES CAIMAKING SENSE OF IOT DATA W/ BIG DATA + DATA SCIENCE - CHARLES CAI
MAKING SENSE OF IOT DATA W/ BIG DATA + DATA SCIENCE - CHARLES CAI
 
Data minig with Big data analysis
Data minig with Big data analysisData minig with Big data analysis
Data minig with Big data analysis
 
Data Mining With Big Data
Data Mining With Big DataData Mining With Big Data
Data Mining With Big Data
 
Introduction to Data Science 1114.pptx
Introduction to Data Science 1114.pptxIntroduction to Data Science 1114.pptx
Introduction to Data Science 1114.pptx
 
Big Data why Now and where to?
Big Data why Now and where to?Big Data why Now and where to?
Big Data why Now and where to?
 
Data sci sd-11.6.17
Data sci sd-11.6.17Data sci sd-11.6.17
Data sci sd-11.6.17
 
Big Data, Big Deal: For Future Big Data Scientists
Big Data, Big Deal: For Future Big Data ScientistsBig Data, Big Deal: For Future Big Data Scientists
Big Data, Big Deal: For Future Big Data Scientists
 
Bigger and Better: Employing a Holistic Strategy for Big Data toward a Strong...
Bigger and Better: Employing a Holistic Strategy for Big Data toward a Strong...Bigger and Better: Employing a Holistic Strategy for Big Data toward a Strong...
Bigger and Better: Employing a Holistic Strategy for Big Data toward a Strong...
 
Big Data: Issues and Challenges
Big Data: Issues and ChallengesBig Data: Issues and Challenges
Big Data: Issues and Challenges
 
Main Street, Meet Mr Watson - Matt Coatney
Main Street, Meet Mr Watson - Matt CoatneyMain Street, Meet Mr Watson - Matt Coatney
Main Street, Meet Mr Watson - Matt Coatney
 
MBA-TU-Thailand:BigData for business startup.
MBA-TU-Thailand:BigData for business startup.MBA-TU-Thailand:BigData for business startup.
MBA-TU-Thailand:BigData for business startup.
 
Embracing data science
Embracing data scienceEmbracing data science
Embracing data science
 
Nikita rajbhoj(a 50)
Nikita rajbhoj(a 50)Nikita rajbhoj(a 50)
Nikita rajbhoj(a 50)
 
Just ask Watson Seminar
Just ask Watson SeminarJust ask Watson Seminar
Just ask Watson Seminar
 
Gse uk-cedrinemadera-2018-shared
Gse uk-cedrinemadera-2018-sharedGse uk-cedrinemadera-2018-shared
Gse uk-cedrinemadera-2018-shared
 
Data mining and its applications!
Data mining and its applications!Data mining and its applications!
Data mining and its applications!
 

Big_Data_ML_Madhu_Reddiboina

  • 1. The World is changing Rapidly and Here is how BIG DATA is contributing Prepared by: Madhu Reddiboina
  • 2. What are you going to get out of this session today? Recognize how the world is changing… Rapidly! Understand the role of Big Data and Machine Learning in this Transformation One approach to Harvest Big Data within an Enterprise
  • 3. How is the World Changing? Accelerated Disruption
  • 4.
  • 5. Lending and Borrowing Used to be Borrower Apply for Short term Credit Banks/Services Package Loans Securitization InvestorsMutual Funds Borrower Investors Lending Platforms What it is now
  • 6. Higher Education and Learning Apply for Admission Colleges & Universities Education Platforms Massive Open Online Courses (MOOCs) Instructors Teachers Professors Students In Person Courses Banks Loans GraduationEntrance Exam Students Instructors Teachers Professors Used to be What it is now
  • 7. Insurance: Human Capital Sourcing: Farming: Retail: Life Sciences: Energy and Utilities: Media and Publishing:Financial Services Many Other Disruptors
  • 8. Patterns of Disruption New Business Models Completely New Business Models that are disrupting and disposing existing businesses and creating white space for new businesses Breaking the Barriers Technology, Big Data and Analytics breaking the barriers that were never possible before Emerging Asian Economies Almost half a billion people are connecting to the internet for the first time from Asia creating many new opportunities for every business to market and service them.
  • 9. Venture Backed Big Data Start-ups
  • 10. Venture Funded Big-data STARTUPS – Globally Data from Q1 2015 Source: Crunchbase
  • 11. Big Data Landscape 2016 Source: Matt Turck, Jim Hao – FirstMark Capital
  • 12. Unicorns There are 143 companies representing more than half Trillion dollars in valuation
  • 13. What is Catalyzing this Disruption? Moore’s Law Internet Platforms (Mobile, Social, Cloud, IOT/IOE) Open Source Adoption & Big Data Convergence
  • 14. Internet Platforms IOT Revolution Internet Internet of Everything Social Revolution Connected Customer Mobile Revolution APP Economy Service Oriented Architecture + Cloud Revolution API Economy
  • 15. Service Platforms Big Data & Machine Learning How are the Start-ups Exploiting Big Data and ML? Connected Consumers Providers Demand Supply
  • 16.
  • 17. Big Data and Machine Learning Crash Course
  • 18. “Big Data is like teenage sex: everyone talks about it, nobody really knows how to do it, so everyone claims they are doing it..” - Dan Ariely, Professor at Duke University Big data is a broad term for data sets so large or complex that traditional data processing applications are inadequate. Challenges include analysis, capture, data curation, search, sharing, storage, transfer, visualization, querying and information privacy. -- Wikipedia Big data is high-volume, high-velocity and/or high-variety information assets that demand cost-effective, innovative forms of information processing that enable enhanced insight, decision making, and process automation. - Gartner (IT Glossary) What is Big Data?
  • 19. What are some examples of Big Data sets Click-Stream Data from Websites Data from Sensors • Sensors on a Car • Sensors on Industrial Machinery • Sensors on Wifi Networks • Sensors on Airplanes • Sensors on IOT/IOE Devices System Log Data
  • 20. What are some examples of Big Data sets Geolocation Data Social Media Comments and Likes Human Gnome
  • 21. What is Machine Learning?
  • 22. "Field of study that gives computers the ability to learn without being explicitly programmed.” - Arthur Samuel (1959) What is Machine Learning? “In the past decade, machine learning has given us self-driving cars, practical machine translation, effective web search, and a vastly improved understanding of the human genome.” -Andrew NG. - Assoc. Professor, Stanford At a high level there are two forms of Machine Learning – Supervised Learning – Unsupervised Learning
  • 23. Recommendation Engine Machine Learning Applications – Examples
  • 24. Recommendation Engine Machine Learning Applications – Examples
  • 25. Continuous Learning System How Does a Recommendation System Work? On-line Web Application Recommendation Engine Closed Loop Process Real-time Click- Stream Data New Business Rules Data Engineering Data Analysis Feature Engineering Model Creation Model Evaluation and Testing Transaction History Metadata Upfront Data Science Work
  • 26. Image Recognition Machine Learning Application Examples
  • 27. Autonomous Driving Machine Learning Application Examples
  • 28. An Approach To Harvest Big Data
  • 29. Solution Architecture of a Data Platform To Enable….. Descriptive Analytics Advanced Analytics Streaming Analytics
  • 30. People Process Technology One flavor of information management framework and process infrastructure to operationalize the data eco system Infrastructure Management Data Governance Data Quality Management Identity Management & Security Metadata Management & Data Discovery DataAcquisition DataIngestion DataPreparation BusinessAccess Services Information Consumers Information Management Framework
  • 31. Infrastructure Management Data Governance Data Quality Management Identity Management & Security Metadata Management & Data Discovery Data Acquisition Data Ingestion Business Access Services Information ConsumersData Preparation Unstructured& SemiStructured DataSources Workflow Orchestration Data Engineering Data Quality Profiling Search & Indexing Data Enrichment Data Aggregation HadoopFileSystem(HDFS) Dashboards & Reports Customers External Users Internal Users Summarized Data Marts Integrated Data Extracts Reference/Master DataSources Operational Transactional Sources LandingZone Custom Interfaces Enterprise Data Hub Historical ParquetIngest AnalyticsSandbox Archive To Enable Descriptive Analytics
  • 32. Infrastructure Management Data Governance Data Quality Management Identity Management & Security Metadata Management & Data Discovery Data Acquisition Data Ingestion Business Access Services Information ConsumersData Preparation Advanced Analytics Ad Hoc Data AnalysisDetailed Operational Marts Data Analysts Workflow Orchestration Data Engineering Data Quality Profiling Search & Indexing Data Enrichment Data Aggregation HadoopFileSystem(HDFS) Summarized Data Marts Integrated Data Extracts LandingZone Custom Interfaces Enterprise Data Hub Historical ParquetIngest AnalyticsSandbox Archive Dashboards & Reports Customers External Users Internal Users To Enable Advanced Analytics Unstructured& SemiStructured DataSources Reference/Master DataSources Operational Transactional Sources
  • 33. Infrastructure Management Data Governance Data Quality Management Identity Management & Security Metadata Management & Data Discovery Data Acquisition Data Ingestion Business Access Services Information ConsumersData Preparation Custom Web Service Interfaces End Users HadoopFileSystem(HDFS) LandingZone Real-time Streaming Data Custom Interfaces To Enable Streaming Analytics Unstructured& SemiStructured DataSources Reference/Master DataSources Operational Transactional Sources
  • 34. Infrastructure Management Data Governance Data Quality Management Identity Management & Security Metadata Management & Data Discovery JAMS Scheduler Data Integration Machine Learning & Stream Processing Data flow Orchestration Advanced Analytics Search & Indexing SQL Querying & Analytic Access Data Storage Custom Web Service Integrations SFTP NFS Custom Built Meta Data Tools Custom Built Data QC Tools Kerberos for Authentication Sentry for Authorization Custom Built Processes Data Delivery Dashboards, Reports & Data Analysis Tableau Server Tableau Desktop Applications Custom Web Applications Hive Data marts Tableau Data Extracts Data Analysts End Users Data Acquisition Data Ingestion Data Preparation Business Access Services Information Consumers Salesforce Applications Custom Web Services Customers External Users Internal Users Platform Technologies