SlideShare a Scribd company logo
1 of 13
Download to read offline
Future of Data : Big Data
   Shankar Radhakrishnan
        Cognizant
Topics
 How did we get here ?
 Data Explosion
 Big Data
 Big Data in an Enterprise
 Big Data Platform - Hadoop
 Big Data Adoption
Q&A
How did we get here?
Familiar World
                                           Data Integration Problems
   EDW
   Datamarts                              Data Processing Problems



   Familiar Problems           Data
                              warehouse
                                              Storage Management



                                             Performance Problems



                                          Limitations out of Complexity




New World
   Newer type of data to integrate
   Increase in volume
   Newer analytical requirements
Data Explosion
Newer Interests
 Social Intelligence
   DBIM, Sentiment Analysis, Social Customer Care
 Predictive Analytics
   Propensity, Price Elasticity, Anti-Fraud Analytics
 Segmentation Insights
   Funnel Analysis, Behavioral Patterns, Cohort Analysis
 Mobile Analytics
   Ad-Targeting, Geo-spatial Analytics
Categories
 Structured Data
  Enterprise Data (CRM, ERP, Data Stores, Reference Data)
 Semi-structured Data
  Machine Generated Data (Sensor Data, RFIDs)
 Unstructured Data
  Social Data (Comments, Tweets), Blog posts
Big Data
                                         Volume




                      Complexity
                                        Big               Velocity
                                        Data


                                         Variety



“Big Data” refers to high volume, velocity, variety and complex information assets that
demand cost-effective, innovative forms of information processing for enhanced insight
and decision making
Big Data Platforms
• Data Integration
   o Informatica, Infosphere
   o talenD, Pentaho, Karmasphere, Apache Sqoop, Apache Flume

• Database Framework
   o Hadoop (Distributions: Cloudera, Hortonworks, MapR)
   o Hbase
   o Hive

• NoSQL Databases
   o MongoDB, CouchDB

• Machine Data Processing
   o Splunk, Mahout

• Text Analytics
   o Clarabridge, Lexanalytics
Big Data in an Enterprise

 Big Data            Big Data
            ETL
 Sources             Platform




                                   Datamarts
                       ETL                                  Analytical
                                               Datamarts   Applications
                                   Datamarts




   Data
            ETL   Data warehouse
  Sources
Hadoop - Ecosystem
Big Data : Adoption Drivers
                   Cluster         Distributed



    Platform          Storage      Scalable       Process


                   Availability    Performance




                   Data
                                    Augmented
                   Integration


                      Data
   Possibilities      Processing
                                      TCO        Ecosystem


                   Actionable
                                          ROI
                   Insights
Big Data – Adoption Scenarios

 Replatforming to Big Data (Hadoop, MapR)
 Archival Solution (Hadoop)
 Offloading Data warehouse, EDW (Hadoop, Hive)
 Social Media Integration
 Machine Data Analysis (Splunk, Mahout)
 Complex Analytical Requirements (Hbase)
Q&A

More Related Content

What's hot

Big data ecosystem
Big data ecosystemBig data ecosystem
Big data ecosystem
magda3695
 
Bigdata " new level"
Bigdata " new level"Bigdata " new level"
Bigdata " new level"
Vamshikrishna Goud
 

What's hot (20)

Big data ppt
Big data pptBig data ppt
Big data ppt
 
Big Data Ecosystem
Big Data EcosystemBig Data Ecosystem
Big Data Ecosystem
 
Introducing the Big Data Ecosystem with Caserta Concepts & Talend
Introducing the Big Data Ecosystem with Caserta Concepts & TalendIntroducing the Big Data Ecosystem with Caserta Concepts & Talend
Introducing the Big Data Ecosystem with Caserta Concepts & Talend
 
Big Data Tech Stack
Big Data Tech StackBig Data Tech Stack
Big Data Tech Stack
 
Big data abstract
Big data abstractBig data abstract
Big data abstract
 
Big Data- Automotive Industry Use Case
Big Data- Automotive Industry Use CaseBig Data- Automotive Industry Use Case
Big Data- Automotive Industry Use Case
 
Big Data Analysis Patterns - TriHUG 6/27/2013
Big Data Analysis Patterns - TriHUG 6/27/2013Big Data Analysis Patterns - TriHUG 6/27/2013
Big Data Analysis Patterns - TriHUG 6/27/2013
 
big data overview ppt
big data overview pptbig data overview ppt
big data overview ppt
 
Big data 101
Big data 101Big data 101
Big data 101
 
Big Data Use Cases
Big Data Use CasesBig Data Use Cases
Big Data Use Cases
 
AI meets Big Data
AI meets Big DataAI meets Big Data
AI meets Big Data
 
Exploring Big Data Analytics Tools
Exploring Big Data Analytics ToolsExploring Big Data Analytics Tools
Exploring Big Data Analytics Tools
 
Big Data Analysis Patterns with Hadoop, Mahout and Solr
Big Data Analysis Patterns with Hadoop, Mahout and SolrBig Data Analysis Patterns with Hadoop, Mahout and Solr
Big Data Analysis Patterns with Hadoop, Mahout and Solr
 
Great Expectations Presentation
Great Expectations PresentationGreat Expectations Presentation
Great Expectations Presentation
 
Big data Analytics Hadoop
Big data Analytics HadoopBig data Analytics Hadoop
Big data Analytics Hadoop
 
Big Data Final Presentation
Big Data Final PresentationBig Data Final Presentation
Big Data Final Presentation
 
Big data unit 2
Big data unit 2Big data unit 2
Big data unit 2
 
Big Data Analytics 2014
Big Data Analytics 2014Big Data Analytics 2014
Big Data Analytics 2014
 
Big data ecosystem
Big data ecosystemBig data ecosystem
Big data ecosystem
 
Bigdata " new level"
Bigdata " new level"Bigdata " new level"
Bigdata " new level"
 

Similar to Future of Data - Big Data

Introduction to Big Data An analogy between Sugar Cane & Big Data
Introduction to Big Data An analogy  between Sugar Cane & Big DataIntroduction to Big Data An analogy  between Sugar Cane & Big Data
Introduction to Big Data An analogy between Sugar Cane & Big Data
Jean-Marc Desvaux
 
Apache hadoop bigdata-in-banking
Apache hadoop bigdata-in-bankingApache hadoop bigdata-in-banking
Apache hadoop bigdata-in-banking
m_hepburn
 
"Demystifying Big Data by AIBDP.org
"Demystifying Big Data by AIBDP.org"Demystifying Big Data by AIBDP.org
"Demystifying Big Data by AIBDP.org
AIBDP
 

Similar to Future of Data - Big Data (20)

Future of Data - Big Data
Future of Data - Big DataFuture of Data - Big Data
Future of Data - Big Data
 
Introduction to Big Data An analogy between Sugar Cane & Big Data
Introduction to Big Data An analogy  between Sugar Cane & Big DataIntroduction to Big Data An analogy  between Sugar Cane & Big Data
Introduction to Big Data An analogy between Sugar Cane & Big Data
 
Big data combat
Big data combatBig data combat
Big data combat
 
Apache hadoop bigdata-in-banking
Apache hadoop bigdata-in-bankingApache hadoop bigdata-in-banking
Apache hadoop bigdata-in-banking
 
Hadoop India Summit, Feb 2011 - Informatica
Hadoop India Summit, Feb 2011 - InformaticaHadoop India Summit, Feb 2011 - Informatica
Hadoop India Summit, Feb 2011 - Informatica
 
Anexinet Big Data Solutions
Anexinet Big Data SolutionsAnexinet Big Data Solutions
Anexinet Big Data Solutions
 
How Big Data ,Cloud Computing ,Data Science can help business
How Big Data ,Cloud Computing ,Data Science can help businessHow Big Data ,Cloud Computing ,Data Science can help business
How Big Data ,Cloud Computing ,Data Science can help business
 
An introduction to Big Data
An introduction to Big DataAn introduction to Big Data
An introduction to Big Data
 
Big data data lake and beyond
Big data data lake and beyond Big data data lake and beyond
Big data data lake and beyond
 
Big Data Analytics with Hadoop
Big Data Analytics with HadoopBig Data Analytics with Hadoop
Big Data Analytics with Hadoop
 
The Future of Data Science
The Future of Data ScienceThe Future of Data Science
The Future of Data Science
 
Exploring the Wider World of Big Data
Exploring the Wider World of Big DataExploring the Wider World of Big Data
Exploring the Wider World of Big Data
 
Big Data = Big Decisions
Big Data = Big DecisionsBig Data = Big Decisions
Big Data = Big Decisions
 
"Demystifying Big Data by AIBDP.org
"Demystifying Big Data by AIBDP.org"Demystifying Big Data by AIBDP.org
"Demystifying Big Data by AIBDP.org
 
Seminar presentation
Seminar presentationSeminar presentation
Seminar presentation
 
De-Mystifying Big Data
De-Mystifying Big DataDe-Mystifying Big Data
De-Mystifying Big Data
 
Lecture 5 - Big Data and Hadoop Intro.ppt
Lecture 5 - Big Data and Hadoop Intro.pptLecture 5 - Big Data and Hadoop Intro.ppt
Lecture 5 - Big Data and Hadoop Intro.ppt
 
Big data peresintaion
Big data peresintaion Big data peresintaion
Big data peresintaion
 
Kave Salamatian, Universite de Savoie and Eiko Yoneki, University of Cambridg...
Kave Salamatian, Universite de Savoie and Eiko Yoneki, University of Cambridg...Kave Salamatian, Universite de Savoie and Eiko Yoneki, University of Cambridg...
Kave Salamatian, Universite de Savoie and Eiko Yoneki, University of Cambridg...
 
Eclipse day Sydney 2014 BIG data presentation
Eclipse day Sydney 2014 BIG data presentationEclipse day Sydney 2014 BIG data presentation
Eclipse day Sydney 2014 BIG data presentation
 

Recently uploaded

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 

Recently uploaded (20)

Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 

Future of Data - Big Data

  • 1. Future of Data : Big Data Shankar Radhakrishnan Cognizant
  • 2. Topics  How did we get here ?  Data Explosion  Big Data  Big Data in an Enterprise  Big Data Platform - Hadoop  Big Data Adoption Q&A
  • 3. How did we get here? Familiar World Data Integration Problems  EDW  Datamarts Data Processing Problems  Familiar Problems Data warehouse Storage Management Performance Problems Limitations out of Complexity New World  Newer type of data to integrate  Increase in volume  Newer analytical requirements
  • 5. Newer Interests  Social Intelligence  DBIM, Sentiment Analysis, Social Customer Care  Predictive Analytics  Propensity, Price Elasticity, Anti-Fraud Analytics  Segmentation Insights  Funnel Analysis, Behavioral Patterns, Cohort Analysis  Mobile Analytics  Ad-Targeting, Geo-spatial Analytics
  • 6. Categories  Structured Data  Enterprise Data (CRM, ERP, Data Stores, Reference Data)  Semi-structured Data  Machine Generated Data (Sensor Data, RFIDs)  Unstructured Data  Social Data (Comments, Tweets), Blog posts
  • 7. Big Data Volume Complexity Big Velocity Data Variety “Big Data” refers to high volume, velocity, variety and complex information assets that demand cost-effective, innovative forms of information processing for enhanced insight and decision making
  • 8. Big Data Platforms • Data Integration o Informatica, Infosphere o talenD, Pentaho, Karmasphere, Apache Sqoop, Apache Flume • Database Framework o Hadoop (Distributions: Cloudera, Hortonworks, MapR) o Hbase o Hive • NoSQL Databases o MongoDB, CouchDB • Machine Data Processing o Splunk, Mahout • Text Analytics o Clarabridge, Lexanalytics
  • 9. Big Data in an Enterprise Big Data Big Data ETL Sources Platform Datamarts ETL Analytical Datamarts Applications Datamarts Data ETL Data warehouse Sources
  • 11. Big Data : Adoption Drivers Cluster Distributed Platform Storage Scalable Process Availability Performance Data Augmented Integration Data Possibilities Processing TCO Ecosystem Actionable ROI Insights
  • 12. Big Data – Adoption Scenarios  Replatforming to Big Data (Hadoop, MapR)  Archival Solution (Hadoop)  Offloading Data warehouse, EDW (Hadoop, Hive)  Social Media Integration  Machine Data Analysis (Splunk, Mahout)  Complex Analytical Requirements (Hbase)
  • 13. Q&A