BIG Data
Desai Karan A
https://in.linkedin.com/in/karan28
SYNOPSIS:
1. Handy Hands-on
2. Introduction to big data
3. Big Data Niceties
4. Specifics of Big Data
5. Big Data Management Tools
6. Practical use-cases
7. Conclusions
8. References
1. Handy Hands-On
2. Introduction to big data
-2.1 What is big data?
-2.2 Etymology.
-2.3 Hype and Facts.
2.1 What is big data?
• “Big data” refers to datasets whose size is
beyond the ability of typical database software
tools to capture, store, manage, and analyze.
• Big data means extremely large data sets that
may be analyzed computationally to reveal
patterns, trends, and associations, especially
relating to human behavior and interactions.
• There is no fixed size threshold: the term is
typically applied to data sets ranging from
terabytes (1000 gigabytes) to petabytes and beyond.
2.2 Etymology: Word Origination
Big data is the simplest,
shortest phrase to convey that
the boundaries of computing
keep advancing, growing,
diversifying and intensifying
rapidly.
John R. Mashey, chief
scientist at Silicon Graphics, is often
credited with coining the term “Big Data”.
2.3 Hype and Facts
GLOBALLY, EVERY 60 SECONDS…
• 204 Million emails are
sent.
• 300k logins to .
• 1.3 Million views on
YouTube.
• 2 Million Google searches.
• 100k tweets.
• 62,000 hours of Music
Downloads
2.3 Hype and Facts (contd.)
• We generate 2.5 quintillion bytes
(2.5 × 10^18 bytes) every day.
• In 2012, the world’s information
crossed 2 zettabytes = 2
trillion gigabytes!
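The unit arithmetic behind these figures can be checked with a short sketch. It assumes decimal (SI) prefixes and the short-scale meaning of "quintillion" (10^18); the `UNITS` table and the `convert` helper are names made up for this illustration, not part of the original slides.

```python
# Decimal (SI) byte units as powers of ten (assumption: the slides use SI, not binary, units).
UNITS = {
    "B": 10**0,
    "GB": 10**9,
    "TB": 10**12,
    "PB": 10**15,
    "EB": 10**18,
    "ZB": 10**21,
}


def convert(amount, from_unit, to_unit):
    """Convert an amount of data between two units from the UNITS table."""
    return amount * UNITS[from_unit] / UNITS[to_unit]


# 2 zettabytes expressed in gigabytes: 2 * 10^21 / 10^9 = 2 * 10^12 (2 trillion).
world_2012_gb = convert(2, "ZB", "GB")

# 2.5 quintillion bytes (2.5 * 10^18) expressed in exabytes.
daily_eb = convert(2.5 * 10**18, "B", "EB")

print(f"2 ZB = {world_2012_gb:.3e} GB")
print(f"Daily volume = {daily_eb} EB")
```

Under these assumptions the slide's equality holds: 2 zettabytes is 2 × 10^12 gigabytes, and 2.5 quintillion bytes is 2.5 exabytes.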
3. Big Data Niceties.
-3.1 Evolution of Big Data
-3.2 Why traditional tools fail?
-3.3 Utilities of Big Data
3.1 Evolution Story:
• E-TSUNAMI and Heavy RAINS of DATA…
3.2 Why traditional tools fail?
• Present-day data is too big for
traditional data management tools.
-They can work only with small samples of
the data.
-This is like looking through a keyhole
and guessing the size of the room.
• High turnaround time for meaningful
results.
-This is like deciding to cross a road based on
a picture taken 5 minutes earlier!
3.3 Big data utilities:
• Dealing with real time data.
• A new level of insight and
opportunity.
• More effective, fact based
decision making.
• A new source of business
value.
• A competitive advantage.
4. Specifics of Big Data
-4.1 Characteristics
-4.2 Life cycle
4.1 Characteristics
Big data is characterized by four Vs:
• Volume
• Variety
• Velocity
• Veracity
4.2 Big Data Life Cycle
Three stages: Manage, Enrich, Insight.
• Manage: manage and secure data of any size.
• Enrich: enrich data by connecting it to the world’s data.
• Insight: gain insights on any data, irrespective of
location.
5. Big Data Management tools.
-5.1 Cow story
-5.2 Introduction to Hadoop
-5.3 Basic Working of Hadoop.
5.1 Cow story
Case 1 (storage device: MB/GB)
“It is easy for me to handle my resources (data).”
Case 2 (more data, storage device: TB)
“I am strong… I can handle my resources.”
Case 3 (storage device: PB)
“Oof… There are so many resources! I am not strong!”
Case 4
“I call my friends for help.”
5.2 Introduction to Hadoop
Apache Hadoop is an open-source software
framework for storage and large-scale
processing of data-sets on clusters of
commodity hardware.
5.2 Introduction to Hadoop (contd.)
• Doug Cutting and Mike Cafarella created Hadoop; it is now an Apache project.
• The Hadoop logo is a small yellow elephant.
5.3 Basic working of Hadoop
Read 1 TB of data
1 machine:
• 4 I/O channels
• Each channel: 100 MB/s
• ~45 minutes
10 machines:
• 4 I/O channels per machine
• Each channel: 100 MB/s
• ~4.5 minutes
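The comparison above follows from simple throughput arithmetic, sketched below. The sketch assumes decimal units (1 TB = 10^12 bytes, 1 MB = 10^6 bytes), perfectly parallel reads, and no network or coordination overhead; the function name `read_time_minutes` is hypothetical.

```python
def read_time_minutes(data_bytes, machines, channels_per_machine=4, channel_mb_per_s=100):
    """Estimate the time to read data when every I/O channel reads in parallel.

    Assumes ideal scaling: the data is split evenly across all channels
    and there is no overhead for coordination or data transfer.
    """
    total_bytes_per_s = machines * channels_per_machine * channel_mb_per_s * 10**6
    return data_bytes / total_bytes_per_s / 60


ONE_TB = 10**12  # decimal terabyte (assumption)

single = read_time_minutes(ONE_TB, machines=1)
cluster = read_time_minutes(ONE_TB, machines=10)

print(f"1 machine:   {single:.1f} minutes")
print(f"10 machines: {cluster:.1f} minutes")
```

With decimal units this gives roughly 42 minutes on one machine and a tenth of that on ten, in line with the rounded figures on the slide. The point is the ratio: under ideal scaling, ten machines cut the read time by a factor of ten.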
Present Hadoop basic
architecture.
Schematic Working.
Hadoop for data analytics:
• Framework written in Java for big data processing
• Uses the “MapReduce” processing paradigm
• Optimized for distributed storage and computing
of data
• Open source
• Very low cost for acquisition and storage
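The Map-Reduce paradigm mentioned above can be illustrated with a word count. This is a minimal single-process sketch in plain Python, not Hadoop's Java API: the function names (`map_phase`, `shuffle`, `reduce_phase`, `word_count`) and the sample input are made up for the illustration, and a real cluster would run the map and reduce steps on many machines in parallel.

```python
from collections import defaultdict


def map_phase(document):
    """Map: emit a (word, 1) pair for every word in one input split."""
    return [(word.lower(), 1) for word in document.split()]


def shuffle(pairs):
    """Shuffle: group all emitted values by their key."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Reduce: combine all values for one key into a single result."""
    return key, sum(values)


def word_count(documents):
    """Run map, shuffle, and reduce in sequence over all input splits."""
    mapped = [pair for doc in documents for pair in map_phase(doc)]
    grouped = shuffle(mapped)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


# Two hypothetical input splits, as if stored on two different machines.
splits = ["big data is big", "data about data"]
counts = word_count(splits)
print(counts)
```

Each map call depends only on its own split, and each reduce call depends only on one key's values, so both steps can be distributed across machines without sharing state.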
Other big data management
tools: Overview…
6. Practical Use-Cases
-6.1 Big apps of Big Data tools
-6.2 How big data affects small business
-6.3 Relevance of big data in market
6.1 Big apps of big data tools.
Who is using big data?
6.2 How does big data affect small businesses?
• Every organization has a tipping point, and
most organizations – regardless of size –
will eventually reach a point where the
volume, variety and velocity of their data
will be something that they have to
address.
• This new big data world is not only about
solving problems faster, but also about solving
problems that were not solvable before.
6.3 Relevance of big data in the market.
7. Conclusions
Conclusions: through pictures.
8. References:
• www.microsoft.com
• http://en.wikipedia.org/wiki/Hadoop
• http://en.wikipedia.org/wiki/Big_data
• www.google.com
• www.slideshare.net
• PDF: McKinsey Global Institute
• PDF: 101 Big data by Pradeep Vardan
• Workshop in college by ‘Ecsttasys’ on big data
Introduction to Big Data

Introduction to Big Data

  • 1. BIG Data Desai Karan A https://in.linkedin.com/in/karan28
  • 2. SYNOPSIS: 1. Handy Hands-on 2. Introduction to big data 3. Big Data Niceties 4. Specifics of Big Data 5. Big Data Management Tools 6. Practical use-cases 7. Conclusions 8. References
  • 4.
  • 5.
  • 6.
  • 7. 2. Introduction to big data -2.1 What is big data? -2.2 Etymology. -2.3 Hype and Facts.
  • 8. 2.1 What is big data? • “Big data” refers to datasets whose size is beyond the ability of typical database software tools to capture, store, manage, and analyze. • Big Data is the extremely large data sets that may be analyzed computationally to reveal patterns, trends, and associations, especially relating to human behavior and interactions. • Big data is the data of range more than 1000 gigabytes or 100 zettabytes.
  • 9. 2.2 Etymology: Word Origination Big data is the simplest, shortest phrase to convey that the boundaries of computing keep advancing, growing, diversifying and intensifying rapidly.. John R Mashey, chief scientist at Silicon Graphics coined the term “Big Data”.
  • 10. 2.3 Hype and Facts
  • 11. 2.3 Hype and Facts
  • 12.
  • 13. GLOBALLY, EVERY 60 SECONDS… • 204 Million emails are sent. • 300k logins to . • 1.3 Million views on YouTube. • 2 Million Google searches. • 100k tweets. • 62,000 hours of Music Downloads
  • 14. • WE GENERATE 2.5 QUINTILION BYTES EVERYDAY • IN 2012, WORLD’S INFORMATION CROSSED 2 ZETTA BYTES =2 TRILLION GIGABYTES!! 2.3 Hype and Facts (contd.)
  • 15. 3. Big Data Niceties. -3.1 Evolution of Big Data -3.2 Why traditional tools fail? -3.3 Utilities of Big Data
  • 17.
  • 18. • E-TSUNAMI and Heavy RAINS of DATA… 3.2 Why traditional tools fail? (contd.)
  • 19. 3.2 Why traditional tools fail? • The present data is highly BIG for the traditional data managers. -Can work only with small samples of data -It is same as looking through keyhole and finding size of room…
  • 20. • High Turnaround time for meaningful results – Means Deciding to cross road based on picture taken 5 minutes earlier!! 3.2 Why traditional tools fail? (contd.)
  • 21. 3.3 Big data utilities: • Dealing with real time data. • A new level of insight and opportunity. • More effective, fact based decision making. • A new source of business values. • A competitive advantage.
  • 22. 4. Specifics of Big Data -4.1 Characteristics -4.2 Life cycle
  • 24.
  • 25.
  • 26.
  • 27.
  • 28.
  • 29. 4.2 Big Data Life Cycle Insight Enrich Manage
  • 30. • Manage and secure data of any size. • Enrich by connecting world’s data. • Insights on any data irrespective of location 3.2 Big Data Life Cycle
  • 31.
  • 32. 5. Big Data Management tools. -5.1 Cow story -5.2 Introduction to Hadoop -5.3 Basic Working of Hadoop.
  • 33. 5.1 Cow story: Case 1 • “It is easy for me to handle my resources (data).” • Storage device: MB/GB
  • 34. Case 2 • “I am strong… I can handle my resources.” • Storage device: TB
  • 35. Case 3 • “Oof… There are so many resources! I am not strong enough!” • Storage device: PB
  • 36. Case 4 • “I call my friends for help.” • Big data management tools
  • 37. 5.2 Introduction to Hadoop Apache Hadoop is an open-source software framework for storage and large-scale processing of data-sets on clusters of commodity hardware.
  • 38. Introduction to Hadoop • Doug Cutting created Apache Hadoop. • The Hadoop logo is a small yellow elephant.
  • 39. 5.3 Basic working of Hadoop
  • 40. Read 1 TB of data • 1 machine: 4 I/O channels, each channel 100 MB/s, ~45 minutes • 10 machines: 4 I/O channels each, each channel 100 MB/s, ~4.5 minutes
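The read-time figures on this slide follow from simple throughput arithmetic. Here is a rough sketch, assuming decimal units (1 TB = 10^12 bytes, 1 MB = 10^6 bytes), data split evenly across machines, and no coordination overhead; `read_time_minutes` is a hypothetical helper, not part of Hadoop.

```python
def read_time_minutes(data_bytes, machines, channels_per_machine=4, channel_mb_per_s=100):
    """Idealised time to read data_bytes when the data is split evenly across machines."""
    # Aggregate throughput: every channel on every machine reads in parallel.
    bytes_per_second = machines * channels_per_machine * channel_mb_per_s * 10**6
    return data_bytes / bytes_per_second / 60


ONE_TB = 10**12  # decimal terabyte (an assumption)

single = read_time_minutes(ONE_TB, machines=1)
cluster = read_time_minutes(ONE_TB, machines=10)

print(f"1 machine:   {single:.1f} min")
print(f"10 machines: {cluster:.1f} min")
```

With these assumptions the result is about 42 minutes on one machine and about 4.2 minutes on ten, in line with the slide's rounded ~45 and ~4.5 minutes. The key point is the tenfold speed-up from reading in parallel.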
  • 46. Hadoop (data analytics) • Framework written in Java for big data processing • Uses the “MapReduce” processing paradigm • Optimized for distributed storage and computing of data • Open source • Very low cost for acquisition and storage
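To make the MapReduce paradigm concrete, here is a minimal single-process sketch of the classic word-count example in plain Python. It does not use Hadoop or any Hadoop API; the function names (`map_phase`, `shuffle`, `reduce_phase`, `word_count`) and the sample input are assumptions chosen for illustration. On a real cluster, the map and reduce steps would run in parallel on different machines.

```python
from collections import defaultdict


def map_phase(document):
    """Map: emit a (word, 1) pair for every word in one input split."""
    return [(word.lower(), 1) for word in document.split()]


def shuffle(pairs):
    """Shuffle: group all emitted values by their key."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Reduce: combine all values for one key into a single result."""
    return key, sum(values)


def word_count(documents):
    """Run map, shuffle and reduce in sequence over a list of input splits."""
    pairs = [pair for doc in documents for pair in map_phase(doc)]
    return dict(reduce_phase(key, values) for key, values in shuffle(pairs).items())


# Two small input splits standing in for blocks of a large file.
splits = ["big data is big", "data is everywhere"]
counts = word_count(splits)
print(counts)
```

Each split can be mapped independently, and each key can be reduced independently, which is what lets the same program scale from one machine to many.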
  • 47. Other big data management tools: Overview…
  • 49. 6. Practical Use-Cases -6.1 Big apps of Big Data tools -6.2 How big data affects small business -6.3 Relevance of big data in market
  • 50. 6.1 Big apps of big data tools.
  • 52. Who is using big data?
  • 54. 6.2 How does big data affect small businesses? • Every organization has a tipping point, and most organizations, regardless of size, will eventually reach a point where the volume, variety and velocity of their data become something they have to address. • This new big data world is not only about running problems faster, but about solving problems that were not solvable before.
  • 55. 6.3 Relevance of big data in market.
  • 62. 8. References: • www.microsoft.com • http://en.wikipedia.org/wiki/Hadoop • http://en.wikipedia.org/wiki/Big_data • www.google.com • www.slideshare.net • PDF: McKinsey Global Institute • PDF: 101 Big data by Pradeep Vardan • Workshop in college by ‘Ecsttasys’ on big data

Editor's notes

  1. © Karan Desai (follow me on Twitter @karlmit or https://in.linkedin.com/in/karan28). DISCLAIMER: The images, diagrams and content in this presentation are meant for educational purposes only. The author does not guarantee the originality of any media in the presentation. The author has only combined and summarized details on the topic from varied sources. No copyright violation is intended.
  2. • SSAS: SQL Server Analysis Services (SSAS) is an online analytical processing (OLAP), data mining and reporting tool in Microsoft SQL Server.
     • Essbase is a multidimensional database management system (MDBMS) that provides a multidimensional database platform upon which to build analytic applications.
     • IBM Cognos TM1 (formerly Applix TM1) is enterprise planning software used to implement collaborative planning, budgeting and forecasting solutions, as well as analytical and reporting applications.
     • Power Pivot is a free add-in to the 2010 version of the spreadsheet application Microsoft Excel. PowerPivot workbooks are self-contained web applications, merely requiring a 'Save as' to make them accessible in the browser as interactive solutions.
     • K is a proprietary array processing language developed by Arthur Whitney and commercialized by Kx Systems. Since then, an open-source implementation known as Kona has also been developed. ... kdb is both a database (kdb) and a vector language (q). It is widely used by major financial institutions.
     • Vertica Systems is an analytic database management software company.
     • QlikView is a business intelligence platform for turning data into knowledge.
     • TIBCO Spotfire® designs, develops and distributes in-memory analytics software for next-generation business intelligence.
     • Tableau Software is an American computer software company headquartered in Seattle, Washington. It produces a family of interactive data visualization products focused on business intelligence.
     • Omniscope is a single, in-memory, file-based application that enables agile, 'best practice' data sharing solutions.
     • An in-memory database (IMDB; also main memory database system, MMDB, or memory resident database) is a database management system that primarily relies on main memory for computer data storage. It is contrasted with database management systems that employ a disk storage mechanism.
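Traditional relational databases are typically row-oriented: the data in each row of a table is stored together. In a columnar, or column-oriented, database, the values of each column are stored together instead, which suits analytic queries that scan a few columns across many rows.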
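The difference between the two layouts can be sketched with plain Python data structures. This is an illustration only, not how any particular database stores data on disk; the table contents and the `to_columns` helper are made up for the example.

```python
# Row-oriented layout: each record keeps all of its fields together.
rows = [
    {"id": 1, "city": "Pune", "sales": 120},
    {"id": 2, "city": "Delhi", "sales": 80},
    {"id": 3, "city": "Mumbai", "sales": 200},
]


def to_columns(records):
    """Convert a list of row dicts into a column-oriented dict of lists."""
    columns = {}
    for record in records:
        for name, value in record.items():
            columns.setdefault(name, []).append(value)
    return columns


# Column-oriented layout: all values of one column are stored together.
columns = to_columns(rows)

# An aggregate over one column touches only that column's list.
total_sales = sum(columns["sales"])

print(columns["sales"])
print(total_sales)
```

Fetching one complete record is a single lookup in the row layout, while summing one field only needs a single list in the column layout. That trade-off is why column stores are popular for analytics.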