2. Big Data is the ocean of information we swim in every day vast
sources of data flowing from our computers, mobile devices, and
machine sensors. Big Data is being generated by everything around
us at all times. Every digital process and social media exchange
produces it, while systems, sensors, and mobile devices transmit it.
New sources of data come from a variety of machines, such as
website interactions, search engine optimizations, and social
business sites by using click-stream data. These changing business
requirements demand that the right information be available at the
right time.[1]
What is Big Data?
3. Big Data Versus Small Data
Small Data
• Usually designed to answer a
specific question or serve a
particular goal.
• Typically, small data is
contained within one
institution, often on one
computer, sometimes in one
file.
• In many cases, the data user
prepares her own data, for her
own purposes.
Big Data
• Usually designed with a goal in
mind, but the goal is flexible
and the questions posed are
protean.
• Typically spread throughout
electronic space, typically
spread through multiple
Internet servers, located
anywhere on earth.
• The data comes from many
differ sources, and it is
prepared by many people.
4. Small Data
• Ordinarily contains highly
structured data. The data
domain is restricted to a
single discipline or sub
discipline.
• Typically, the data is
measured using one
experimental protocol, and
the data can be represented
using one set of standard
units.
Big Data
• Must be capable of
absorbing unstructured
data (e.g., such as free-text
documents, images, motion
pictures, sound recordings,
physical objects).
• Many different types of
data are delivered in many
different electronic formats
by different people.[2]
8. Megabyte
Byte : one grain of rice
Kilobyte : cup of rice
Megabyte : 8 bags of rice
9. Gigabyte
Byte : one grain of rice
Kilobyte : cup of rice
Megabyte : 8 bags of rice
Gigabyte : 3 Semi trucks
10. Terabyte
Byte : one grain of rice
Kilobyte : cup of rice
Megabyte : 8 bags of rice
Gigabyte : 3 Semi trucks
Terabyte : 2 Container Ships
11. Petabyte
Byte : one grain of rice
Kilobyte : cup of rice
Megabyte : 8 bags of rice
Gigabyte : 3 Semi trucks
Terabyte : 2 Container Ships
Petabyte : Blankets Manhattan
12. One ByteExabyte
Byte : one grain of rice
Kilobyte : cup of rice
Megabyte : 8 bags of rice
Gigabyte : 3 Semi trucks
Terabyte : 2 Container Ships
Petabyte : Blankets Manhattan
Exabyte : Blankets west coast states
13. Byte : one grain of rice
Kilobyte : cup of rice
Megabyte : 8 bags of rice
Gigabyte : 3 Semi trucks
Terabyte : 2 Container Ships
Petabyte : Blankets Manhattan
Exabyte : Blankets west coast states
Zettabyte : Fills the Pacific Ocean
Zettabyte
14. Byte : one grain of rice
Kilobyte : cup of rice
Megabyte : 8 bags of rice
Gigabyte : 3 Semi trucks
Terabyte : 2 Container Ships
Petabyte : Blankets Manhattan
Exabyte : Blankets west coast states
Zettabyte : Fills the Pacific Ocean
Yottabyte : A EARTH SIZE RICE BALL! Yottabyte
15. HobbyistByte : one grain of rice
Kilobyte : cup of rice
Megabyte : 8 bags of rice
Gigabyte : 3 Semi trucks
Terabyte : 2 Container Ships
Petabyte : Blankets Manhattan
Exabyte : Blankets west coast states
Zettabyte : Fills the Pacific Ocean
Yottabyte : A EARTH SIZE RICE BALL!
16. Desktop
HobbyistByte : one grain of rice
Kilobyte : cup of rice
Megabyte : 8 bags of rice
Gigabyte : 3 Semi trucks
Terabyte : 2 Container Ships
Petabyte : Blankets Manhattan
Exabyte : Blankets west coast states
Zettabyte : Fills the Pacific Ocean
Yottabyte : A EARTH SIZE RICE BALL!
17. Desktop
Hobbyist
Internet
Byte : one grain of rice
Kilobyte : cup of rice
Megabyte : 8 bags of rice
Gigabyte : 3 Semi trucks
Terabyte : 2 Container Ships
Petabyte : Blankets Manhattan
Exabyte : Blankets west coast states
Zettabyte : Fills the Pacific Ocean
Yottabyte : A EARTH SIZE RICE BALL!
18. Desktop
Hobbyist
Internet
Big Data
Byte : one grain of rice
Kilobyte : cup of rice
Megabyte : 8 bags of rice
Gigabyte : 3 Semi trucks
Terabyte : 2 Container Ships
Petabyte : Blankets Manhattan
Exabyte : Blankets west coast states
Zettabyte : Fills the Pacific Ocean
Yottabyte : A EARTH SIZE RICE BALL!
19. Byte : one grain of rice
Kilobyte : cup of rice
Megabyte : 8 bags of rice
Gigabyte : 3 Semi trucks
Terabyte : 2 Container Ships
Petabyte : Blankets Manhattan
Exabyte : Blankets west coast states
Zettabyte : Fills the Pacific Ocean
Yottabyte : A EARTH SIZE RICE BALL!
20. Desktop
Hobbyist
The Future?[3]
Internet
Big Data
Byte : one grain of rice
Kilobyte : cup of rice
Megabyte : 8 bags of rice
Gigabyte : 3 Semi trucks
Terabyte : 2 Container Ships
Petabyte : Blankets Manhattan
Exabyte : Blankets west coast states
Zettabyte : Fills the Pacific Ocean
Yottabyte : A EARTH SIZE RICE BALL!
21. Volume...
100 terabytes of data are uploaded daily to Facebook ; Akamai
analyses 75 million events a day to target online ads; Walmart
handles 1 million customer transactions every single hour. 90%
of all data ever created was generated in the past 2 years.
Scale is certainly a part of what makes Big Data big. The
internet-mobile revolution, bringing with it a torrent of social
media updates, sensor data from devices and an explosion of e-
commerce, means that every industry is swamped with data-
which can be incredibly valuable, if you know how to use it.
3vs of Big Data
22. Velocity…
In 1999, Wal-Mart’s data warehouse stored 1,000 terabytes (1,000,000
gigabytes) of data. In 2012, it had access to over 2.5 petabytes (2,500,000
gigabytes) of data.
Every minute of every day, we upload 100 hours of video on Youtube, send
over 200 million emails and send 300,000 tweets. ‘Velocity’ refers to the
increasing speed at which this data is created, and the increasing speed at
which the data can be processed, stored and analysed by relational
databases. The possibilities of processing data in real-time is an area of
particular interest, which allows companies to do things like display
personalised ads on the web pages you visit, based on your recent search,
viewing and purchase history.
23. Variety…
Gone are the days when a company’s data could be neatly
slotted into a table and analysed. 90% of data generated is
‘unstructured’, coming in all shapes and forms- from geo-spatial
data, to tweets which can be analysed for content and
sentiment, to visual data such as photos and videos.
The ‘3 V’s’ certainly give us an insight into the almost
unimaginable scale of data, and the break-neck speeds at which
these vast datasets grow and multiply. But only ‘Variety’ really
begins to scratch the surface of the depth- and crucially, the
challenges- of Big Data.[4]
29. [1].Book: "Big Data for Beginners" by Alonzo Williams,Stepanie Foor.
[2].Book: "Principles of Big Data: Preparing, Sharing, and Analyzing Complex
Information" by Jules J. Berman.
[4].Book: "Understanding Big Data: A Beginners Guide to Data Science & the Business
Applications" by Eileen McNulty-Holmes.
[5].http://www.oii.ox.ac.uk/research/project/?id=98.
[6].https://www.google.co.in/#q=applications+of+big+data+wikipedia.
[3]. http://www.slideshare.net/dwellman/what-is-big-data-24401517?qid=6e8e2726-
6681-486c-880b-f973f6b61e2c&v=&b=&from_search=5