2. Refences
[1]. DAMA-DMBOK (2017) Data Management Body of Knowledge (Second Edition)-DAMA
International
[2]. Data Strategy (2017) How to profit from a world of big data, analytics and the internet of things – By
Bernard Marr - Kogan Page
[3]. Big Data Analytics for Entrepreneurial Success (2019) – By Soraya Sedkaoui - IGI Global
[4]. https://www.eckerson.com/
[5]. https://www.lightsondata.com/
[6]. https://www.dataedo.com/
[7]. https://www.linkedin.com/in/denise-harders-4908a967/
[8]. http://www.fabak.ir/
CREDITS: This presentation template was created by
Slidesgo, including icons by Flaticon, and infographics &
images by Freepik and illustrations by Storiesplease inform me if some references was missing.
3. Data
Long-standing definitions of data emphasize its role in
representing facts about the world. In relation to
information technology, data is also understood as
information that has been stored in digital form (though
data is not limited to information that has been digitized
and data management principles apply to data captured
on paper as well as in databases). Still, because today we
can capture so much information electronically, we call
many things ‘data’ that would not have been called
‘data’ in earlier times – things like names, addresses,
birthdates, what one ate for dinner on Saturday, the most
recent book one purchased.
4. Data and Information
Much ink has been spilled over the relationship between
data and information. Data has been called the “raw
material of information” and information has been called
“data in context”. Often a layered pyramid is used to
describe the relationship between data (at the base),
information, knowledge, and wisdom (at the very top).
While the pyramid can be helpful in describing why data
needs to be well-managed, this representation presents
several challenges for data management.
5. BIGDATA
Early efforts to define the meaning of Big Data
characterized it in terms of the Three V’s: Volume,
Velocity, Variety (Laney, 2001). As more organizations
start to leverage the potential of Big Data, the list of V’s
has expanded
6. Volume
Refers to the amount of data. Big Data often has
thousands of entities or elements in billions of records.
7. Data storages challenges
This picture presents a visual
summary of the range of data
that has become available
through Big Data technologies
and the implications on data
storage options.
8. Velocity
Refers to the speed at which data is captured, generated,
or shared. Big Data is often generated and can also be
distributed and even analyzed in real-time.
9. Variety /Variability
Refers to the forms in which data is captured or
delivered. Big Data requires storage of multiple formats;
data structure is often inconsistent within or across data
sets.
10. Structured Data
The term structured data refers to the data that is
identifiable because it is organized or structured.
Structured data concerns all data which can be stored in
database SQL in a table with rows and columns. They
have the relational key and can be easily mapped into
pre-designed fields. It refers to the data that are stored in
relational databases or corporate data warehouses. In
another word, structured data are data which we
established the functional sense and life cycle (creation
of rules, values and possible technical means of
representation). Also, this type of data is relatively
simple to enter, store, query, and analyze.
11. Semi-Structured Data
This comes from the speed of arrival of the data, their
volume, making structuring impossible. The messages in
your mailboxes are a good example of semi-structured
data. Semi-structured data is information that doesn’t
reside in a relational database but that does have some
organizational properties that make it easier to analyze.
So, this kind of data contains a structured part and an
unstructured part, so it combines the two types
(structured and unstructured). And due to unorganized
information, the semi-structured is difficult to retrieve,
analyze and store as compared to structured data. It
requires software framework like Apache Hadoop to
perform all this.
12. Unstructured Data
This is the simplest abstract form, a series of bytes.
Unstructured data is also a description of a reality but
whose codification, meaning, is not directly exploitable
by the machine, for example, audio or video file, a text
contained in a document or an email. Belong to this
category, the data from social networks (Twitter and
Facebook). Such type of data becomes difficult and
requires advanced tools and software to generate value.
Today, there is a predominance of unstructured data
(they represent, according to the big data survey
conducted by New Vantage Partner, more than 85% of
digital data), which must be tried to organize a minimum
(via metadata for example) to give them meaning.
13. Metadata
Describe and enrich unstructured data. In other words,
the absence of structure is mitigated by the metadata.
For example, the title and the tags to describe the videos
on Dailymotion.
14. BIGDATA another v’s
Viscosity: Refers to how difficult the
data is to use or integrate.
Volatility: Refers to how often data
changes occur and therefore how long
the data is useful.
Veracity: Refers to how trustworthy
the data is.
15. WHEREDOBIG DATA COMEFROM?
The answer is clear, they come from everywhere!
Because so much of human activity is executed
electronically, massive amounts of data are
accumulating every day as we move through the
world, interact with each other, and transact
business. Big Data is produced through email,
social media, online orders, and even online
video games. Data is generated not only by
phones and point-of-sale devices, but also by
surveillance systems, sensors in transportation
systems, medical monitoring systems, industrial
and utility monitoring systems, satellites, and
military equipment.
16. Bigdata is itjusta simplebuzzword?
Not at all! Our lives are already concerned in all their
aspects by the uses of big data. Big data is already in
several fields that the author lists as examples.
Companies want to understand the behaviors and
expectations of their customers to better target their
proposals. They create predictive models to anticipate
the departure of a client or sales of a product.
17. BIGDATA APPLICATIONS
▪ Industry
▪ Telecom
▪ Manufacturing
▪ Retail
▪ Sport and Fitness
▪ Insurance and Bank
▪ Tourism
▪ Government
▪ Health
▪ Transport
▪ Energy
▪ Agriculture
▪ Science and Research
18. Data asan Organizational Asset
An asset is an economic resource, that can be
owned or controlled, and that holds or produces
value. Assets can be converted to money. Data
is widely recognized as an enterprise asset,
though understanding of what it means to
manage data as an asset is still evolving.
Today’s organizations rely on their data assets
to make more effective decisions and to operate
more efficiently. Businesses use data to
understand their customers, create new
products and services, and improve operational
efficiency by cutting costs and controlling
risks.