BIGDATA
The Technological Renaissance
Dr. Ritu Bhargava
Department Of Computer Science
Sophia Girls’ College,
Ajmer(Autonomous)
What is Big Data?
According to Gartner,
information becomes big data
when the volume can no longer
be managed with normal
database tools.
DEFINITION
Big data is high-volume, high-velocity, and high-variety
information assets that demand cost-effective, innovative forms of
information processing for enhanced insight and decision-making.
5 V’s Of Big Data
» Volume: Raw Data
» Velocity: Change over time
» Variety: Data types
» Veracity: Data Quality
» Value: Information for Decision
Making.
DATA IS EVERYWHERE
» The digital universe will grow from 3.2 zettabytes to 40 zettabytes in only six
years.
» Every day, we create 2.5 quintillion bytes of data — so much that 90% of the
data in the world today has been created in the last two years alone.
» This data comes from everywhere: sensors used to gather climate
information, posts to social media sites, digital pictures and videos, purchase
transaction records, and cell phone GPS signals.
Origins of Big Data Infrastructure
● The value generated by a social network is proportional to the number of
contacts between its users, rather than to the number of users. Metcalfe’s
Law[3] puts the number of possible contacts among N users at roughly N^2, and
later variants put it at N*logN. Either way, the growth of contacts, and
therefore of the interactions that generate data within a social network, is
nonlinear with respect to the number of users. As the world gets more
connected, one can expect the number of interactions to grow, resulting in
even more accelerated data growth.
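The nonlinear growth described above can be checked numerically. The following is a minimal sketch, assuming the N*logN variant with a natural logarithm and a proportionality constant of 1; the function names are illustrative and not taken from the slides.

```python
import math


def contacts_nlogn(users: int) -> float:
    """Contacts under the N*logN variant (constant of proportionality assumed to be 1)."""
    return users * math.log(users)


def growth_factor(users: int, scale: int = 10) -> float:
    """Factor by which contacts grow when the user count is multiplied by `scale`."""
    return contacts_nlogn(users * scale) / contacts_nlogn(users)


if __name__ == "__main__":
    for users in (1_000, 1_000_000):
        # A linear relationship would give exactly 10.0 here.
        print(users, round(growth_factor(users), 2))
```

Multiplying the user count by 10 multiplies the contact count by more than 10 in both cases, which is the sense in which data generation outpaces user growth.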
RECENT STUDY
Google’s search index exploded from 26 million pages in 1998 to more than 1
trillion in less than a decade. This content was “multi-structured”, consisting of
natural language text, images, video, geo-spatial data, and even renderings of
structured data.
To handle it, Google had to develop the Google File System (GFS) and the MapReduce
programming framework.
The two publications describing them became the blueprint for Apache Hadoop, an open
source framework that has become a de facto standard for big data platforms
deployed today.
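The MapReduce programming model can be illustrated with a word count. This is a single-process sketch of the map, shuffle, and reduce steps under simplifying assumptions. It is not Google's or Hadoop's implementation, and every function name is hypothetical.

```python
from collections import defaultdict
from itertools import chain


def map_phase(document: str) -> list[tuple[str, int]]:
    """Map: emit a (word, 1) pair for every word in one document."""
    return [(word.lower(), 1) for word in document.split()]


def shuffle(pairs) -> dict[str, list[int]]:
    """Shuffle: group all emitted values by their key."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(groups: dict[str, list[int]]) -> dict[str, int]:
    """Reduce: collapse each key's values into a single count."""
    return {key: sum(values) for key, values in groups.items()}


def word_count(documents: list[str]) -> dict[str, int]:
    """Run the three phases in sequence over a list of documents."""
    mapped = chain.from_iterable(map_phase(doc) for doc in documents)
    return reduce_phase(shuffle(mapped))


if __name__ == "__main__":
    print(word_count(["big data", "Big deal"]))
```

In a real cluster the map and reduce calls run on many machines and the shuffle moves data across the network. Here they run in one process purely to show the data flow.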
Apache Hadoop
Yahoo adopted Apache Hadoop in January 2006 and made significant
contributions to make it a scalable and stable platform.
Today, Yahoo has the largest footprint of Apache Hadoop, running more than
45,000 servers that manage more than 370 petabytes of data.
Hadoop is an open source system, licensed under the liberal Apache Software
License and governed by the Apache Software Foundation.
The scalability and flexibility of Apache Hadoop prompted growing Internet
companies such as Facebook, Twitter, and LinkedIn to adopt it for their data
infrastructure.
Industrial Internet: The Next Frontier
Big Data use-cases today analyse customer behaviour: buying
patterns, likes and dislikes as expressed in social media, clickstreams,
and location information from mobile devices. Machine-generated data could be
the next frontier for Big Data systems.
For example, in an automobile, thousands of signals are captured by 70+
sensors that generate more than 25 gigabytes of data every hour, and are processed
by 70 on-board computers.
While most of this data is transient and needs to be acted upon in real time,
recognizing patterns within the data to improve the safety and usability of the
automobile implies aggregating and analysing it offline.
Facts!
» CEO Zuckerberg noted that 1 billion pieces of content are shared via Facebook’s
Open Graph daily.
» Over 10 million photographs are uploaded to Facebook every hour, and around 3
billion ‘like’ buttons are pushed every day.
» Google processes more than 24 petabytes of data every day.
» 48 hours of video are uploaded to YouTube every minute, resulting in nearly
8 years of content every day.
» 70% of data is created by individuals, but enterprises are responsible for
storing and managing 80% of it.
» Every day, we create 2.5 quintillion bytes of data — so much that 90% of the
data in the world today has been created in the last two years alone.
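The YouTube figure in the list above follows from simple arithmetic. The sketch below reproduces it, assuming a 365-day year; the constant and function names are illustrative.

```python
MINUTES_PER_DAY = 24 * 60
HOURS_PER_DAY = 24
DAYS_PER_YEAR = 365  # assumption: leap years ignored


def content_years_per_day(hours_uploaded_per_minute: float) -> float:
    """Convert an upload rate (hours of video per minute) into years of video per day."""
    hours_per_day = hours_uploaded_per_minute * MINUTES_PER_DAY
    return hours_per_day / HOURS_PER_DAY / DAYS_PER_YEAR


if __name__ == "__main__":
    # 48 hours per minute is the rate quoted in the slide.
    print(round(content_years_per_day(48), 2))
```

At 48 hours per minute this comes to 69,120 hours, or 2,880 days, of video per day, which is about 7.9 years and matches the "nearly 8 years" quoted.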
Drivers and Opportunities
» Real-time prediction.
» Increase operational and supply chain efficiencies
» Deep insights into customer behaviour based on pattern and purchase
analysis
» Information aggregation
» Better and more scientific customer segmentation for targeted marketing and
product offering.
» Improve productivity and innovation
» McKinsey predicts an increase in job opportunities ranging from 140K to
190K
» Uncover hidden patterns and rapidly respond to changing scenarios.
» Multi-channel and multi-dimensional information aggregation
» Data convergence
CUSTOMER SEGMENTATION
Market Opportunity
Big Data offers bigger opportunities. Here is a snapshot of some of the
predictions made by market research firms:
IDC predicts the Big Data market will grow to $16.9 billion by 2019.
Digital Reasoning estimates that the Big Data market will be worth $48.3 billion in
2019.
Applications
» Better financial data management
» Investment banking using aggregated information from various sources for tasks like
financial forecasting, asset pricing and portfolio management.
» More accurate pricing adjustments based on vast amounts of real-time data
» Stock advice based on analysis of huge amounts of stock data and unstructured
data like social media content.
» Customer segmentation based on previous transactions and profile
information
» Analysis of purchase patterns and tailor-made product offerings
» Unstructured data analysis from social media and multimedia to understand
customer tastes, preferences, and patterns, and to perform sentiment analysis
» Targeted marketing based on user segmentation
Thank you
