SlideShare une entreprise Scribd logo
1  sur  12
Big Data
Shankar Radhakrishnan
Topics

• Data Management Today


• New Interests, Expectations, Problems


• Big Data


• New Approach


• Big Data Ecosystem


• Q&A
Data Management Today

• Relational Databases


  • Oracle, MySQL, MS-SQL Server


• Data warehouse Appliances


  • Teradata, IBM-Netezza


• Legacy Systems


  • Mainframes
New Interests, Expectations

• Collect More, Data-Mine More   • Actionable Insights


• Complex Data Integration       • Extension of Investments


• Advanced Analytics             • Talent Management


• Social Data Analysis           • ROI


• Machine Data Analysis          • TCO


• Realtime Data Analysis         • Business Continuity
How Big is Data?




? BIG                            90
                                                                                          is the average


                                                              $214
                                                                                       amount companies
                                                                                        have to spend per
                                 of the world’s data                                      compromised

             Facts               was created in the
                                    last two years
                                                                                        customer when a
                                                                                       data breach occurs
(as of Oct 2012)




                                2.7bn
                                                  Average number
                                                    of “likes” and
                                                    “comments”
                                                      posted on
                                                                             247bn
                                                                                e-mail messages are sent each
                                                   Facebook daily                day… about 80% of them are
                                                                                           spam
    It would take 2,000 hours
     to watch all the YouTube

                                500,000+
      videos uploaded while                                     data centers across the world are large
       we’re talking on this                                      enough to fill 5,955 football fields
              panel*



                                 *this is 3x more than just 2 short years ago

5
New Problems

• Unpredictable Volume          • Computing Limitations


• Data Processing Issues        • Information vs. Insights


• Data Integration Issues       • Business Requirements


• Identifying Source-of-Truth   • Regulatory Requirements


• Store vs. Analyze             • True Value-of-Data


• Data Retrieval Requirements   • Price to Performance Dilemma
What is Big Data?


                •     Very large data sets                                •   Real-time data streaming
                •     Sizes from 100 TB to 50 PB                              data
                •     Larger than “one machine”                           •   High volume / Low latency
                •     Whole data set analysis                                   • Write heavy
                      replaces “sampling”                                       • Read heavy
                                                                                • Both is common
                                                   Volume     Velocity




                •     Structured data
                        • OLTP                     Variety   Complexity

                        • DW
                        • ODS
                        • Data marts
                •     Unstructured data                                   •   Complexity
                        • Text                                                 • Data acquisition
                        • Audio                                                • Analysis
                        • Video                                                • Deriving insights
                        • Click streams
                        • Log files



Source: Ventana Research
New Approach

• Commodity Hardware


  • Open Computing Project


• Open Source Solutions, Frameworks


  • Value Added Products – Cloudera, Datastax, 10gen


• Research Oriented Product Development


• Augmented Ecosystem
Big Data : Ecosystem


                                                    Advanced
                                                     Analytics
                                            Predictive & Optimization
                                               Modeling, Business
Data Analytics                                 Processes Analysis,
                                                                                                 R            Splunk
                                               Functional Analysis                                                                SAS Big Data
                                                                                                Madlib           Mahout
                                                                                                                                 Visual Analytics

                                                                                                           Tableau
                                            Advanced Visualizations
Data Delivery                        Data Delivery - Dashboards , Scorecard                                               SpotFire
                                      (Strategy Maps), Spatial & Temporal                            Datameer
Data Visualization                                   Analysis

                                                                                                  Pig            Hive       Other BI Tools with
Data Engineering                                  BI / Reporting                                                            Hadoop connectors
                             Data Engineering - Performance Reporting, Enterprise                       Lucene                     Karmasphere
Data Agility                 Metrics, Data Agility - Data Mining, OLAP Modeling etc
                                                                                                         Cassandra         Crunch          Pangool
Data Consolidation                        Data Storage and Processing
                                                                                                           HDFS           HBase        Mapreduce
                                          Data Storage, Data processing
Data Economics
                                                                                                                 Flume        Scribe          Avro
                                                                                                                         Sqoop              Chukwa
                                          Data Integration & Management                                                 Zookeeper            Oozie
                     Data Filtering, Data Consolidation & Warehousing, Data Quality, Metadata
Integration                         Management, Job Scheduling, Data Economics                                             Native Hadoop ETL

                                                                                                                           Traditional ETL with
                                                                                                                           Hadoop connectors
                                           Distributed Infrastructure

                                  Hadoop components                        Open source Hadoop platforms
                                   3rd party Hadoop supporting platforms
What Big Data can do that traditional data warehousing and analytics cannot?

               Traditional DW                                                 Big Data

Complete records from known transactional            Data from many different internal & external sources
systems.                                             with unknown quality and/or utility.
                                                 u

Data is structured, and data fields have known       Loosely structured data. Flat schemas with few
(and often complex) interrelationships.              complex interrelationships, connections between data
                                                 u   elements have to be probabilistically inferred.



Multi Terabytes of Data                              Multi Peta Bytes of Data
                                                 u

Mostly Scale Up Architecture                         Scale Out Architecture
                                                 u

                                                     The analytic models are larger and require very large
Analytics run on a stable data model.            u   amounts of hardware resources to process them in a
                                                     timely manner


Low Performance/Cost ratio as most of the            High Performance/Cost ratio as most of the software/
software/hardware platforms are proprietary      u   hardware platforms are commodity, free, open source
and license based


10
What Big Data can do that traditional data warehousing and analytics cannot?

                 Traditional DW                                           Big Data


 Aggregate data (structured)                    u    Raw Data (structured and unstructured)




                                                     Individual level analytics, Micro segmentation,
 Aggregate / Segment analytics                  u    individualized offers to customers


 Mainstream analytics
                                                     Outlier analytics, Pattern discovery, Simulation and
 – Structured analysis                          u    modeling, Machine learning
 - OLAP cubes


                                                     Entire population of granular data can be
 Sample data is used for identifying patterns   u    leveraged



Reports & Dashboards are done on a production       Real-time operational analytics and reporting. Intra-
basis                                           u   day decision making.


 Traditional models good for small amount of         Big Models: Computationally intensive analyses,
 data due to time constraints                   u    simulations, models with many parameters




11
Q&A




      Thank You !

Contenu connexe

Tendances (20)

Maximize the Value of Your Data: Neo4j Graph Data Platform
Maximize the Value of Your Data: Neo4j Graph Data PlatformMaximize the Value of Your Data: Neo4j Graph Data Platform
Maximize the Value of Your Data: Neo4j Graph Data Platform
 
Exploring Big Data Analytics Tools
Exploring Big Data Analytics ToolsExploring Big Data Analytics Tools
Exploring Big Data Analytics Tools
 
Making Big Data Easy for Everyone
Making Big Data Easy for EveryoneMaking Big Data Easy for Everyone
Making Big Data Easy for Everyone
 
Big Data Evolution
Big Data EvolutionBig Data Evolution
Big Data Evolution
 
Big Data’s Big Impact on Businesses
Big Data’s Big Impact on BusinessesBig Data’s Big Impact on Businesses
Big Data’s Big Impact on Businesses
 
BigData Analysis
BigData AnalysisBigData Analysis
BigData Analysis
 
Big Data Analytics
Big Data AnalyticsBig Data Analytics
Big Data Analytics
 
What is big data - Architectures and Practical Use Cases
What is big data - Architectures and Practical Use CasesWhat is big data - Architectures and Practical Use Cases
What is big data - Architectures and Practical Use Cases
 
Big data 101
Big data 101Big data 101
Big data 101
 
BIG DATA and USE CASES
BIG DATA and USE CASESBIG DATA and USE CASES
BIG DATA and USE CASES
 
Big data
Big dataBig data
Big data
 
Big data-ppt
Big data-pptBig data-ppt
Big data-ppt
 
How Can Analytics Improve Business?
How Can Analytics Improve Business?How Can Analytics Improve Business?
How Can Analytics Improve Business?
 
Top Big data Analytics tools: Emerging trends and Best practices
Top Big data Analytics tools: Emerging trends and Best practicesTop Big data Analytics tools: Emerging trends and Best practices
Top Big data Analytics tools: Emerging trends and Best practices
 
Big data analysis
Big data analysisBig data analysis
Big data analysis
 
Big Data Analytics MIS presentation
Big Data Analytics MIS presentationBig Data Analytics MIS presentation
Big Data Analytics MIS presentation
 
Big Data Use Cases
Big Data Use CasesBig Data Use Cases
Big Data Use Cases
 
Unit i big data introduction
Unit  i big data introductionUnit  i big data introduction
Unit i big data introduction
 
Big data 101
Big data 101Big data 101
Big data 101
 
Big data ppt
Big  data pptBig  data ppt
Big data ppt
 

En vedette

Graphic Design-LISAA Paris
Graphic Design-LISAA ParisGraphic Design-LISAA Paris
Graphic Design-LISAA ParisREM Limited
 
FashionDesign-LISAA Paris
FashionDesign-LISAA ParisFashionDesign-LISAA Paris
FashionDesign-LISAA ParisREM Limited
 
LISAA-programmes
LISAA-programmesLISAA-programmes
LISAA-programmesREM Limited
 
Mahabharata And Gita Synopsis
Mahabharata And Gita SynopsisMahabharata And Gita Synopsis
Mahabharata And Gita SynopsisAvinash Patwadhan
 
Metro and Windows Phone 7
Metro and Windows Phone 7Metro and Windows Phone 7
Metro and Windows Phone 7Johan Lindfors
 
Lesser known monuments of delhi
Lesser known monuments of delhiLesser known monuments of delhi
Lesser known monuments of delhiArun Kumar
 
Values, ethics & the mahabharat
Values, ethics & the mahabharatValues, ethics & the mahabharat
Values, ethics & the mahabharatChandresh Suprit
 
ArchiSurance Case Study
ArchiSurance Case StudyArchiSurance Case Study
ArchiSurance Case StudyIver Band
 
Concept study of mahindra united world college,pune and pearl academy of fash...
Concept study of mahindra united world college,pune and pearl academy of fash...Concept study of mahindra united world college,pune and pearl academy of fash...
Concept study of mahindra united world college,pune and pearl academy of fash...harshita batra
 
CASE STUDY NEHRU PLACE,DELHI
CASE STUDY NEHRU PLACE,DELHICASE STUDY NEHRU PLACE,DELHI
CASE STUDY NEHRU PLACE,DELHIAashna Arora
 
case study on National institute of design, Ahmedabad.
case study on National institute of design, Ahmedabad.case study on National institute of design, Ahmedabad.
case study on National institute of design, Ahmedabad.Milan Jain
 

En vedette (20)

LISAAanim
LISAAanimLISAAanim
LISAAanim
 
Graphic Design-LISAA Paris
Graphic Design-LISAA ParisGraphic Design-LISAA Paris
Graphic Design-LISAA Paris
 
FashionDesign-LISAA Paris
FashionDesign-LISAA ParisFashionDesign-LISAA Paris
FashionDesign-LISAA Paris
 
LISAA-programmes
LISAA-programmesLISAA-programmes
LISAA-programmes
 
LISAAfashion
LISAAfashionLISAAfashion
LISAAfashion
 
Mahabharata And Gita Synopsis
Mahabharata And Gita SynopsisMahabharata And Gita Synopsis
Mahabharata And Gita Synopsis
 
Metro and Windows Phone 7
Metro and Windows Phone 7Metro and Windows Phone 7
Metro and Windows Phone 7
 
Kurukshetra
KurukshetraKurukshetra
Kurukshetra
 
Ajay mishra mahabharat maps with diagrams events peoples domain range chains ...
Ajay mishra mahabharat maps with diagrams events peoples domain range chains ...Ajay mishra mahabharat maps with diagrams events peoples domain range chains ...
Ajay mishra mahabharat maps with diagrams events peoples domain range chains ...
 
Indian history
Indian historyIndian history
Indian history
 
Lesser known monuments of delhi
Lesser known monuments of delhiLesser known monuments of delhi
Lesser known monuments of delhi
 
Pearl academy prospectus 2015
Pearl academy prospectus 2015Pearl academy prospectus 2015
Pearl academy prospectus 2015
 
NID UPGRADED
NID UPGRADEDNID UPGRADED
NID UPGRADED
 
Nid
NidNid
Nid
 
Values, ethics & the mahabharat
Values, ethics & the mahabharatValues, ethics & the mahabharat
Values, ethics & the mahabharat
 
ArchiSurance Case Study
ArchiSurance Case StudyArchiSurance Case Study
ArchiSurance Case Study
 
National Institute Of Fashion Technology(NIFT)-Delhi,Mumbai,Chennai
National Institute Of Fashion Technology(NIFT)-Delhi,Mumbai,ChennaiNational Institute Of Fashion Technology(NIFT)-Delhi,Mumbai,Chennai
National Institute Of Fashion Technology(NIFT)-Delhi,Mumbai,Chennai
 
Concept study of mahindra united world college,pune and pearl academy of fash...
Concept study of mahindra united world college,pune and pearl academy of fash...Concept study of mahindra united world college,pune and pearl academy of fash...
Concept study of mahindra united world college,pune and pearl academy of fash...
 
CASE STUDY NEHRU PLACE,DELHI
CASE STUDY NEHRU PLACE,DELHICASE STUDY NEHRU PLACE,DELHI
CASE STUDY NEHRU PLACE,DELHI
 
case study on National institute of design, Ahmedabad.
case study on National institute of design, Ahmedabad.case study on National institute of design, Ahmedabad.
case study on National institute of design, Ahmedabad.
 

Similaire à Kurukshetra - Big Data

Big Data 視覺化分析解決方案
Big Data 視覺化分析解決方案Big Data 視覺化分析解決方案
Big Data 視覺化分析解決方案Etu Solution
 
Anexinet Big Data Solutions
Anexinet Big Data SolutionsAnexinet Big Data Solutions
Anexinet Big Data SolutionsMark Kromer
 
Software architecture & design patterns for MS CRM Developers
Software architecture & design patterns for MS CRM  Developers Software architecture & design patterns for MS CRM  Developers
Software architecture & design patterns for MS CRM Developers sebedatalabs
 
Big Data and Implications on Platform Architecture
Big Data and Implications on Platform ArchitectureBig Data and Implications on Platform Architecture
Big Data and Implications on Platform ArchitectureOdinot Stanislas
 
Streaming Hadoop for Enterprise Adoption
Streaming Hadoop for Enterprise AdoptionStreaming Hadoop for Enterprise Adoption
Streaming Hadoop for Enterprise AdoptionDATAVERSITY
 
Introducing the Big Data Ecosystem with Caserta Concepts & Talend
Introducing the Big Data Ecosystem with Caserta Concepts & TalendIntroducing the Big Data Ecosystem with Caserta Concepts & Talend
Introducing the Big Data Ecosystem with Caserta Concepts & TalendCaserta
 
Building Big Data Applications
Building Big Data ApplicationsBuilding Big Data Applications
Building Big Data ApplicationsRichard McDougall
 
Webinar | Using Hadoop Analytics to Gain a Big Data Advantage
Webinar | Using Hadoop Analytics to Gain a Big Data AdvantageWebinar | Using Hadoop Analytics to Gain a Big Data Advantage
Webinar | Using Hadoop Analytics to Gain a Big Data AdvantageCloudera, Inc.
 
Ibm big data ibm marriage of hadoop and data warehousing
Ibm big dataibm marriage of hadoop and data warehousingIbm big dataibm marriage of hadoop and data warehousing
Ibm big data ibm marriage of hadoop and data warehousing DataWorks Summit
 
Big data? No. Big Decisions are What You Want
Big data? No. Big Decisions are What You WantBig data? No. Big Decisions are What You Want
Big data? No. Big Decisions are What You WantStuart Miniman
 
Ibm big data hadoop summit 2012 james kobielus final 6-13-12(1)
Ibm big data    hadoop summit 2012 james kobielus final 6-13-12(1)Ibm big data    hadoop summit 2012 james kobielus final 6-13-12(1)
Ibm big data hadoop summit 2012 james kobielus final 6-13-12(1)Ajay Ohri
 
Cutting Big Data Down to Size with AMD and Dell
Cutting Big Data Down to Size with AMD and DellCutting Big Data Down to Size with AMD and Dell
Cutting Big Data Down to Size with AMD and DellAMD
 
Big Data, Big Content, and Aligning Your Storage Strategy
Big Data, Big Content, and Aligning Your Storage StrategyBig Data, Big Content, and Aligning Your Storage Strategy
Big Data, Big Content, and Aligning Your Storage StrategyHitachi Vantara
 
Mind Blowing Business Intelligence Dashboards
Mind Blowing Business Intelligence DashboardsMind Blowing Business Intelligence Dashboards
Mind Blowing Business Intelligence DashboardsUnilytics
 
Data mining - GDi Techno Solutions
Data mining - GDi Techno SolutionsData mining - GDi Techno Solutions
Data mining - GDi Techno SolutionsGDi Techno Solutions
 
THE 3V's OF BIG DATA: VARIETY, VELOCITY, AND VOLUME from Structure:Data 2012
THE 3V's OF BIG DATA: VARIETY, VELOCITY, AND VOLUME from Structure:Data 2012THE 3V's OF BIG DATA: VARIETY, VELOCITY, AND VOLUME from Structure:Data 2012
THE 3V's OF BIG DATA: VARIETY, VELOCITY, AND VOLUME from Structure:Data 2012Gigaom
 
DataPortal Presentation
DataPortal Presentation DataPortal Presentation
DataPortal Presentation DataPortal
 
Scaling MySQL: Catch 22 of Read Write Splitting
Scaling MySQL: Catch 22 of Read Write SplittingScaling MySQL: Catch 22 of Read Write Splitting
Scaling MySQL: Catch 22 of Read Write SplittingScaleBase
 

Similaire à Kurukshetra - Big Data (20)

Big Data 視覺化分析解決方案
Big Data 視覺化分析解決方案Big Data 視覺化分析解決方案
Big Data 視覺化分析解決方案
 
Anexinet Big Data Solutions
Anexinet Big Data SolutionsAnexinet Big Data Solutions
Anexinet Big Data Solutions
 
Software architecture & design patterns for MS CRM Developers
Software architecture & design patterns for MS CRM  Developers Software architecture & design patterns for MS CRM  Developers
Software architecture & design patterns for MS CRM Developers
 
Big Data and Implications on Platform Architecture
Big Data and Implications on Platform ArchitectureBig Data and Implications on Platform Architecture
Big Data and Implications on Platform Architecture
 
Streaming Hadoop for Enterprise Adoption
Streaming Hadoop for Enterprise AdoptionStreaming Hadoop for Enterprise Adoption
Streaming Hadoop for Enterprise Adoption
 
Introducing the Big Data Ecosystem with Caserta Concepts & Talend
Introducing the Big Data Ecosystem with Caserta Concepts & TalendIntroducing the Big Data Ecosystem with Caserta Concepts & Talend
Introducing the Big Data Ecosystem with Caserta Concepts & Talend
 
Secure Big Data Analytics - Hadoop & Intel
Secure Big Data Analytics - Hadoop & IntelSecure Big Data Analytics - Hadoop & Intel
Secure Big Data Analytics - Hadoop & Intel
 
Introduction to Hadoop
Introduction to HadoopIntroduction to Hadoop
Introduction to Hadoop
 
Building Big Data Applications
Building Big Data ApplicationsBuilding Big Data Applications
Building Big Data Applications
 
Webinar | Using Hadoop Analytics to Gain a Big Data Advantage
Webinar | Using Hadoop Analytics to Gain a Big Data AdvantageWebinar | Using Hadoop Analytics to Gain a Big Data Advantage
Webinar | Using Hadoop Analytics to Gain a Big Data Advantage
 
Ibm big data ibm marriage of hadoop and data warehousing
Ibm big dataibm marriage of hadoop and data warehousingIbm big dataibm marriage of hadoop and data warehousing
Ibm big data ibm marriage of hadoop and data warehousing
 
Big data? No. Big Decisions are What You Want
Big data? No. Big Decisions are What You WantBig data? No. Big Decisions are What You Want
Big data? No. Big Decisions are What You Want
 
Ibm big data hadoop summit 2012 james kobielus final 6-13-12(1)
Ibm big data    hadoop summit 2012 james kobielus final 6-13-12(1)Ibm big data    hadoop summit 2012 james kobielus final 6-13-12(1)
Ibm big data hadoop summit 2012 james kobielus final 6-13-12(1)
 
Cutting Big Data Down to Size with AMD and Dell
Cutting Big Data Down to Size with AMD and DellCutting Big Data Down to Size with AMD and Dell
Cutting Big Data Down to Size with AMD and Dell
 
Big Data, Big Content, and Aligning Your Storage Strategy
Big Data, Big Content, and Aligning Your Storage StrategyBig Data, Big Content, and Aligning Your Storage Strategy
Big Data, Big Content, and Aligning Your Storage Strategy
 
Mind Blowing Business Intelligence Dashboards
Mind Blowing Business Intelligence DashboardsMind Blowing Business Intelligence Dashboards
Mind Blowing Business Intelligence Dashboards
 
Data mining - GDi Techno Solutions
Data mining - GDi Techno SolutionsData mining - GDi Techno Solutions
Data mining - GDi Techno Solutions
 
THE 3V's OF BIG DATA: VARIETY, VELOCITY, AND VOLUME from Structure:Data 2012
THE 3V's OF BIG DATA: VARIETY, VELOCITY, AND VOLUME from Structure:Data 2012THE 3V's OF BIG DATA: VARIETY, VELOCITY, AND VOLUME from Structure:Data 2012
THE 3V's OF BIG DATA: VARIETY, VELOCITY, AND VOLUME from Structure:Data 2012
 
DataPortal Presentation
DataPortal Presentation DataPortal Presentation
DataPortal Presentation
 
Scaling MySQL: Catch 22 of Read Write Splitting
Scaling MySQL: Catch 22 of Read Write SplittingScaling MySQL: Catch 22 of Read Write Splitting
Scaling MySQL: Catch 22 of Read Write Splitting
 

Dernier

Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...pradhanghanshyam7136
 
How to Add New Custom Addons Path in Odoo 17
How to Add New Custom Addons Path in Odoo 17How to Add New Custom Addons Path in Odoo 17
How to Add New Custom Addons Path in Odoo 17Celine George
 
Single or Multiple melodic lines structure
Single or Multiple melodic lines structureSingle or Multiple melodic lines structure
Single or Multiple melodic lines structuredhanjurrannsibayan2
 
Plant propagation: Sexual and Asexual propapagation.pptx
Plant propagation: Sexual and Asexual propapagation.pptxPlant propagation: Sexual and Asexual propapagation.pptx
Plant propagation: Sexual and Asexual propapagation.pptxUmeshTimilsina1
 
Food safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfFood safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfSherif Taha
 
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptxOn_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptxPooja Bhuva
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...ZurliaSoop
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxheathfieldcps1
 
How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17Celine George
 
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptxHMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptxmarlenawright1
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...Nguyen Thanh Tu Collection
 
Graduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - EnglishGraduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - Englishneillewis46
 
SOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning PresentationSOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning Presentationcamerronhm
 
Interdisciplinary_Insights_Data_Collection_Methods.pptx
Interdisciplinary_Insights_Data_Collection_Methods.pptxInterdisciplinary_Insights_Data_Collection_Methods.pptx
Interdisciplinary_Insights_Data_Collection_Methods.pptxPooja Bhuva
 
How to setup Pycharm environment for Odoo 17.pptx
How to setup Pycharm environment for Odoo 17.pptxHow to setup Pycharm environment for Odoo 17.pptx
How to setup Pycharm environment for Odoo 17.pptxCeline George
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfAdmir Softic
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17Celine George
 
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptxHMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptxEsquimalt MFRC
 
Salient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functionsSalient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functionsKarakKing
 
Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Jisc
 

Dernier (20)

Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
 
How to Add New Custom Addons Path in Odoo 17
How to Add New Custom Addons Path in Odoo 17How to Add New Custom Addons Path in Odoo 17
How to Add New Custom Addons Path in Odoo 17
 
Single or Multiple melodic lines structure
Single or Multiple melodic lines structureSingle or Multiple melodic lines structure
Single or Multiple melodic lines structure
 
Plant propagation: Sexual and Asexual propapagation.pptx
Plant propagation: Sexual and Asexual propapagation.pptxPlant propagation: Sexual and Asexual propapagation.pptx
Plant propagation: Sexual and Asexual propapagation.pptx
 
Food safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfFood safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdf
 
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptxOn_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
 
How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17
 
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptxHMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
 
Graduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - EnglishGraduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - English
 
SOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning PresentationSOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning Presentation
 
Interdisciplinary_Insights_Data_Collection_Methods.pptx
Interdisciplinary_Insights_Data_Collection_Methods.pptxInterdisciplinary_Insights_Data_Collection_Methods.pptx
Interdisciplinary_Insights_Data_Collection_Methods.pptx
 
How to setup Pycharm environment for Odoo 17.pptx
How to setup Pycharm environment for Odoo 17.pptxHow to setup Pycharm environment for Odoo 17.pptx
How to setup Pycharm environment for Odoo 17.pptx
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptxHMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
 
Salient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functionsSalient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functions
 
Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)
 

Kurukshetra - Big Data

  • 2. Topics • Data Management Today • New Interests, Expectations, Problems • Big Data • New Approach • Big Data Ecosystem • Q&A
  • 3. Data Management Today • Relational Databases • Oracle, MySQL, MS-SQL Server • Data warehouse Appliances • Teradata, IBM-Netezza • Legacy Systems • Mainframes
  • 4. New Interests, Expectations • Collect More, Data-Mine More • Actionable Insights • Complex Data Integration • Extension of Investments • Advanced Analytics • Talent Management • Social Data Analysis • ROI • Machine Data Analysis • TCO • Realtime Data Analysis • Business Continuity
  • 5. How Big is Data? ? BIG 90 is the average $214 amount companies have to spend per of the world’s data compromised Facts was created in the last two years customer when a data breach occurs (as of Oct 2012) 2.7bn Average number of “likes” and “comments” posted on 247bn e-mail messages are sent each Facebook daily day… about 80% of them are spam It would take 2,000 hours to watch all the YouTube 500,000+ videos uploaded while data centers across the world are large we’re talking on this enough to fill 5,955 football fields panel* *this is 3x more than just 2 short years ago 5
  • 6. New Problems • Unpredictable Volume • Computing Limitations • Data Processing Issues • Information vs. Insights • Data Integration Issues • Business Requirements • Identifying Source-of-Truth • Regulatory Requirements • Store vs. Analyze • True Value-of-Data • Data Retrieval Requirements • Price to Performance Dilemma
  • 7. What is Big Data? • Very large data sets • Real-time data streaming • Sizes from 100 TB to 50 PB data • Larger than “one machine” • High volume / Low latency • Whole data set analysis • Write heavy replaces “sampling” • Read heavy • Both is common Volume Velocity • Structured data • OLTP Variety Complexity • DW • ODS • Data marts • Unstructured data • Complexity • Text • Data acquisition • Audio • Analysis • Video • Deriving insights • Click streams • Log files Source: Ventana Research
  • 8. New Approach • Commodity Hardware • Open Computing Project • Open Source Solutions, Frameworks • Value Added Products – Cloudera, Datastax, 10gen • Research Oriented Product Development • Augmented Ecosystem
  • 9. Big Data : Ecosystem Advanced Analytics Predictive & Optimization Modeling, Business Data Analytics Processes Analysis, R Splunk Functional Analysis SAS Big Data Madlib Mahout Visual Analytics Tableau Advanced Visualizations Data Delivery Data Delivery - Dashboards , Scorecard SpotFire (Strategy Maps), Spatial & Temporal Datameer Data Visualization Analysis Pig Hive Other BI Tools with Data Engineering BI / Reporting Hadoop connectors Data Engineering - Performance Reporting, Enterprise Lucene Karmasphere Data Agility Metrics, Data Agility - Data Mining, OLAP Modeling etc Cassandra Crunch Pangool Data Consolidation Data Storage and Processing HDFS HBase Mapreduce Data Storage, Data processing Data Economics Flume Scribe Avro Sqoop Chukwa Data Integration & Management Zookeeper Oozie Data Filtering, Data Consolidation & Warehousing, Data Quality, Metadata Integration Management, Job Scheduling, Data Economics Native Hadoop ETL Traditional ETL with Hadoop connectors Distributed Infrastructure Hadoop components Open source Hadoop platforms 3rd party Hadoop supporting platforms
  • 10. What Big Data can do that traditional data warehousing and analytics cannot? Traditional DW Big Data Complete records from known transactional Data from many different internal & external sources systems. with unknown quality and/or utility. u Data is structured, and data fields have known Loosely structured data. Flat schemas with few (and often complex) interrelationships. complex interrelationships, connections between data u elements have to be probabilistically inferred. Multi Terabytes of Data Multi Peta Bytes of Data u Mostly Scale Up Architecture Scale Out Architecture u The analytic models are larger and require very large Analytics run on a stable data model. u amounts of hardware resources to process them in a timely manner Low Performance/Cost ratio as most of the High Performance/Cost ratio as most of the software/ software/hardware platforms are proprietary u hardware platforms are commodity, free, open source and license based 10
  • 11. What Big Data can do that traditional data warehousing and analytics cannot? Traditional DW Big Data Aggregate data (structured) u Raw Data (structured and unstructured) Individual level analytics, Micro segmentation, Aggregate / Segment analytics u individualized offers to customers Mainstream analytics Outlier analytics, Pattern discovery, Simulation and – Structured analysis u modeling, Machine learning - OLAP cubes Entire population of granular data can be Sample data is used for identifying patterns u leveraged Reports & Dashboards are done on a production Real-time operational analytics and reporting. Intra- basis u day decision making. Traditional models good for small amount of Big Models: Computationally intensive analyses, data due to time constraints u simulations, models with many parameters 11
  • 12. Q&A Thank You !