SlideShare une entreprise Scribd logo
1  sur  18
BIG DATA
Presented By : Rameshwari S.
Patil
PRN No : 2020033800117147
WHAT IS DATA?
• Data is represented with the help of characters such as
alphabets (A-Z, a-z), digits (0-9) or special characters
(+,-,/,*,<,>,= etc.).
• This information may be in the form of text documents,
images, audio clips, software programs, or other types of
data.
WHAT IS BIG DATA?
• Big data is also a data but with huge size.
• The term “big data” refers to data that is so large, fast or complex that it’s difficult
or impossible to process using traditional methods.
• The concept of big data gained momentum in the early 2000s when industry analyst
Doug Laney articulated the now-mainstream definition of big data as the five V’s:
• With the development and increase of apps and social media and people and businesses
moving online, there’s been a huge increase in data. If we look at only social media platforms,
they interest and attract over a million users daily, scaling up data more than ever before.
The next question is how exactly is this huge amount of data handled and how is it processed
and stored. This is where Big Data comes into play.
• The term not only refers to the data, but also to the various frameworks, tools, and
techniques involved.
• As you build your big data solution, consider open source software such as Apache
Hadoop, Apache Spark and the entire Hadoop ecosystem as cost-effective, flexible data
processing and storage tools designed to handle the volume of data being generated today.
• Big data is most often stored in computer databases and is analyzed using software
specifically designed to handle large, complex data sets. Many software-as-a-service (SaaS)
companies specialize in managing this type of complex data.
TOP BIG DATA PLATFORMS
SOURCES OF BIG DATA
• Social media : such as Twitter, Facebook, Instagram, Pinterest, and Google+.
• Cloud : public, private, or third party cloud platforms.
• Web : data publically available on the web.
• IOT : data generated from the interconnection of web.(medical devices , video games,
cameras, meters)
• Databases : traditional and modern databases.
HOW DOES IT WORKS?
• Integrate : Big data brings together data from many disparate sources and
applications. During integration, you need to bring in the data, process it, and make
sure it’s formatted and available in a form that your business analysts can get
started with.
• Manage : Big data requires storage. Your storage solution can be in the cloud, on
premises, or both.
• Analyze : Get new clarity with a visual analysis of your varied data sets. Explore
the data further to make new discoveries. Build data models with machine learning
and Artificial Intelligence.
TYPES OF BIG DATA……
1. Structured data
2. Unstructured data
3. Semi-structured data
STRUCTURED DATA
• Any data that can be stored, accessed and processed in the form of fixed format is termed as a
'structured' data.
• Structured data is the easiest to work with.
• Sources : machine generated data(satellite images) & human generated (data entered by user as an
input).
• It’s all quantitative data.
• The ETL process for structured data stores the finished product in what is called a data warehouse.
These databases are highly structured and filtered for specific analytics purpose the initial data was
harvested for.
• Structured data is the easiest type of data to analyze because it requires little to no preparation before
processing.
• A table in a database is an example of structured data.
UN-STRUCTURED DATA
• Any data with unknown form or the structure is classified as unstructured data.
• About 80% of the total account for unstructured big data.
• The Unstructured data is further divided into –
I. Captured data (GPS)
II. User-Generated data (likes, shares, comments, tweets).
• The hardest part of analyzing unstructured data is teaching an application to understand the
information it is extracting. More often than not, this means translating it into some form of
structured data.
• Examples of Un-structured Data is text, video, audio, social media activity, satellite imagery,
email
SEMI-STRUCTURED DATA
• Semi-structured data can contain both the forms of data.
• Most of the time, this translates to unstructured data with metadata attached to it.
• If you send an email, the time sent, email addresses to and from, the IP address
from the device sent from, and other pieces of information are linked to the actual
content of the email.
• Semi-structured data has no set schema.
• Example of semi-structured data is a data represented in an XML file. For example,
NoSQL documents are considered to be semi-structured, since they contain
keywords that can be used to process the document easily.
ADVANTAGES DISADVANTAGES
• Cost savings
• Time reductions
• Better sales insights
• Controls online reputation
• Understand the market
conditions
• Increased productivity
• Fraud detection
• Need for talent
• Incompatible tools
• Cost
• Security & privacy concerns
CONCLUSION
• As more and more data is generated and collected, data analysis requires scalable,
flexible, and high performing tools to provide insights in a timely fashion. However,
organizations are facing a growing big data ecosystem where new tools emerge and
“die” very quickly. Therefore, it can be very difficult to keep pace and choose the
right tools.
• They have also been able to more accurately predict daily weather as well as natural
disasters.
• By using these big data-related systems, engineers and scientists have been able to
more easily design cars, airplanes, and other vehicles.
FUTURE ENHANCEMENT
• Increasing demand for data analytics
• Increasing enterprise adoption of big data
• Big Data finds application across various parallels of the industry. (Professional,
Scientific and Technical Services (25%), Information Technology (17%), Manufacturing
(15%), Finance and Insurance (9%), and Retail Trade (8%)).
• Flexible career options.
• Promises exponential salary growth.
REFERENCES
• www.wikipedia.com
• www.computerhope.com
• www.knowledgdehut.com
• www.javapoint.com
• www.tutorialsponit.com

Contenu connexe

Tendances

Big Data - Applications and Technologies Overview
Big Data - Applications and Technologies OverviewBig Data - Applications and Technologies Overview
Big Data - Applications and Technologies OverviewSivashankar Ganapathy
 
Big data analytics, research report
Big data analytics, research reportBig data analytics, research report
Big data analytics, research reportJULIO GONZALEZ SANZ
 
Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...
Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...
Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...Simplilearn
 
Big Data Applications | Big Data Analytics Use-Cases | Big Data Tutorial for ...
Big Data Applications | Big Data Analytics Use-Cases | Big Data Tutorial for ...Big Data Applications | Big Data Analytics Use-Cases | Big Data Tutorial for ...
Big Data Applications | Big Data Analytics Use-Cases | Big Data Tutorial for ...Edureka!
 
Big data Presentation
Big data PresentationBig data Presentation
Big data PresentationAswadmehar
 
Big Data
Big DataBig Data
Big DataNGDATA
 
Big data by Mithlesh sadh
Big data by Mithlesh sadhBig data by Mithlesh sadh
Big data by Mithlesh sadhMithlesh Sadh
 
Big Data PPT by Rohit Dubey
Big Data PPT by Rohit DubeyBig Data PPT by Rohit Dubey
Big Data PPT by Rohit DubeyRohit Dubey
 
Presentation About Big Data (DBMS)
Presentation About Big Data (DBMS)Presentation About Big Data (DBMS)
Presentation About Big Data (DBMS)SiamAhmed16
 

Tendances (20)

What is big data?
What is big data?What is big data?
What is big data?
 
Big Data
Big DataBig Data
Big Data
 
Big Data
Big DataBig Data
Big Data
 
Big data analytics
Big data analyticsBig data analytics
Big data analytics
 
Big data
Big dataBig data
Big data
 
Big_data_ppt
Big_data_ppt Big_data_ppt
Big_data_ppt
 
Big Data - Applications and Technologies Overview
Big Data - Applications and Technologies OverviewBig Data - Applications and Technologies Overview
Big Data - Applications and Technologies Overview
 
Big data
Big dataBig data
Big data
 
Big data analytics, research report
Big data analytics, research reportBig data analytics, research report
Big data analytics, research report
 
Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...
Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...
Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...
 
Big Data Applications | Big Data Analytics Use-Cases | Big Data Tutorial for ...
Big Data Applications | Big Data Analytics Use-Cases | Big Data Tutorial for ...Big Data Applications | Big Data Analytics Use-Cases | Big Data Tutorial for ...
Big Data Applications | Big Data Analytics Use-Cases | Big Data Tutorial for ...
 
Big data
Big dataBig data
Big data
 
Big data Presentation
Big data PresentationBig data Presentation
Big data Presentation
 
Big Data
Big DataBig Data
Big Data
 
Presentation on Big Data
Presentation on Big DataPresentation on Big Data
Presentation on Big Data
 
Big data by Mithlesh sadh
Big data by Mithlesh sadhBig data by Mithlesh sadh
Big data by Mithlesh sadh
 
Big Data PPT by Rohit Dubey
Big Data PPT by Rohit DubeyBig Data PPT by Rohit Dubey
Big Data PPT by Rohit Dubey
 
Big data
Big dataBig data
Big data
 
Big data
Big dataBig data
Big data
 
Presentation About Big Data (DBMS)
Presentation About Big Data (DBMS)Presentation About Big Data (DBMS)
Presentation About Big Data (DBMS)
 

Similaire à Big data

Similaire à Big data (20)

What is big data
What is big dataWhat is big data
What is big data
 
Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big Data
 
Overview of Big Data
Overview of Big DataOverview of Big Data
Overview of Big Data
 
Data science.chapter-1,2,3
Data science.chapter-1,2,3Data science.chapter-1,2,3
Data science.chapter-1,2,3
 
TOPIC.pptx
TOPIC.pptxTOPIC.pptx
TOPIC.pptx
 
Data science unit1
Data science unit1Data science unit1
Data science unit1
 
M.Florence Dayana
M.Florence DayanaM.Florence Dayana
M.Florence Dayana
 
Unit No2 Introduction to big data.pdf
Unit No2 Introduction to big data.pdfUnit No2 Introduction to big data.pdf
Unit No2 Introduction to big data.pdf
 
big-data-notes1.ppt
big-data-notes1.pptbig-data-notes1.ppt
big-data-notes1.ppt
 
Big Data Analytics Materials, Chapter: 1
Big Data Analytics Materials, Chapter: 1Big Data Analytics Materials, Chapter: 1
Big Data Analytics Materials, Chapter: 1
 
What is Data?
What is Data?What is Data?
What is Data?
 
Big data ppt
Big data pptBig data ppt
Big data ppt
 
Big data seminor
Big data seminorBig data seminor
Big data seminor
 
Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big Data
 
bigdatappt.pptx
bigdatappt.pptxbigdatappt.pptx
bigdatappt.pptx
 
Big-Data-Analytics.8592259.powerpoint.pdf
Big-Data-Analytics.8592259.powerpoint.pdfBig-Data-Analytics.8592259.powerpoint.pdf
Big-Data-Analytics.8592259.powerpoint.pdf
 
Unit-1 -2-3- BDA PIET 6 AIDS.pptx
Unit-1 -2-3- BDA PIET 6 AIDS.pptxUnit-1 -2-3- BDA PIET 6 AIDS.pptx
Unit-1 -2-3- BDA PIET 6 AIDS.pptx
 
Big data
Big dataBig data
Big data
 
Bda assignment can also be used for BDA notes and concept understanding.
Bda assignment can also be used for BDA notes and concept understanding.Bda assignment can also be used for BDA notes and concept understanding.
Bda assignment can also be used for BDA notes and concept understanding.
 
SKILLWISE-BIGDATA ANALYSIS
SKILLWISE-BIGDATA ANALYSISSKILLWISE-BIGDATA ANALYSIS
SKILLWISE-BIGDATA ANALYSIS
 

Dernier

Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rick Flair
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embeddingZilliz
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESmohitsingh558521
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxBkGupta21
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 

Dernier (20)

Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embedding
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptx
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 

Big data

  • 1. BIG DATA Presented By : Rameshwari S. Patil PRN No : 2020033800117147
  • 2. WHAT IS DATA? • Data is represented with the help of characters such as alphabets (A-Z, a-z), digits (0-9) or special characters (+,-,/,*,<,>,= etc.). • This information may be in the form of text documents, images, audio clips, software programs, or other types of data.
  • 3. WHAT IS BIG DATA? • Big data is also a data but with huge size. • The term “big data” refers to data that is so large, fast or complex that it’s difficult or impossible to process using traditional methods. • The concept of big data gained momentum in the early 2000s when industry analyst Doug Laney articulated the now-mainstream definition of big data as the five V’s:
  • 4.
  • 5. • With the development and increase of apps and social media and people and businesses moving online, there’s been a huge increase in data. If we look at only social media platforms, they interest and attract over a million users daily, scaling up data more than ever before. The next question is how exactly is this huge amount of data handled and how is it processed and stored. This is where Big Data comes into play. • The term not only refers to the data, but also to the various frameworks, tools, and techniques involved. • As you build your big data solution, consider open source software such as Apache Hadoop, Apache Spark and the entire Hadoop ecosystem as cost-effective, flexible data processing and storage tools designed to handle the volume of data being generated today. • Big data is most often stored in computer databases and is analyzed using software specifically designed to handle large, complex data sets. Many software-as-a-service (SaaS) companies specialize in managing this type of complex data.
  • 6. TOP BIG DATA PLATFORMS
  • 7. SOURCES OF BIG DATA • Social media : such as Twitter, Facebook, Instagram, Pinterest, and Google+. • Cloud : public, private, or third party cloud platforms. • Web : data publically available on the web. • IOT : data generated from the interconnection of web.(medical devices , video games, cameras, meters) • Databases : traditional and modern databases.
  • 8. HOW DOES IT WORKS? • Integrate : Big data brings together data from many disparate sources and applications. During integration, you need to bring in the data, process it, and make sure it’s formatted and available in a form that your business analysts can get started with. • Manage : Big data requires storage. Your storage solution can be in the cloud, on premises, or both. • Analyze : Get new clarity with a visual analysis of your varied data sets. Explore the data further to make new discoveries. Build data models with machine learning and Artificial Intelligence.
  • 9.
  • 10. TYPES OF BIG DATA…… 1. Structured data 2. Unstructured data 3. Semi-structured data
  • 11. STRUCTURED DATA • Any data that can be stored, accessed and processed in the form of fixed format is termed as a 'structured' data. • Structured data is the easiest to work with. • Sources : machine generated data(satellite images) & human generated (data entered by user as an input). • It’s all quantitative data. • The ETL process for structured data stores the finished product in what is called a data warehouse. These databases are highly structured and filtered for specific analytics purpose the initial data was harvested for. • Structured data is the easiest type of data to analyze because it requires little to no preparation before processing. • A table in a database is an example of structured data.
  • 12. UN-STRUCTURED DATA • Any data with unknown form or the structure is classified as unstructured data. • About 80% of the total account for unstructured big data. • The Unstructured data is further divided into – I. Captured data (GPS) II. User-Generated data (likes, shares, comments, tweets). • The hardest part of analyzing unstructured data is teaching an application to understand the information it is extracting. More often than not, this means translating it into some form of structured data. • Examples of Un-structured Data is text, video, audio, social media activity, satellite imagery, email
  • 13. SEMI-STRUCTURED DATA • Semi-structured data can contain both the forms of data. • Most of the time, this translates to unstructured data with metadata attached to it. • If you send an email, the time sent, email addresses to and from, the IP address from the device sent from, and other pieces of information are linked to the actual content of the email. • Semi-structured data has no set schema. • Example of semi-structured data is a data represented in an XML file. For example, NoSQL documents are considered to be semi-structured, since they contain keywords that can be used to process the document easily.
  • 14. ADVANTAGES DISADVANTAGES • Cost savings • Time reductions • Better sales insights • Controls online reputation • Understand the market conditions • Increased productivity • Fraud detection • Need for talent • Incompatible tools • Cost • Security & privacy concerns
  • 15.
  • 16. CONCLUSION • As more and more data is generated and collected, data analysis requires scalable, flexible, and high performing tools to provide insights in a timely fashion. However, organizations are facing a growing big data ecosystem where new tools emerge and “die” very quickly. Therefore, it can be very difficult to keep pace and choose the right tools. • They have also been able to more accurately predict daily weather as well as natural disasters. • By using these big data-related systems, engineers and scientists have been able to more easily design cars, airplanes, and other vehicles.
  • 17. FUTURE ENHANCEMENT • Increasing demand for data analytics • Increasing enterprise adoption of big data • Big Data finds application across various parallels of the industry. (Professional, Scientific and Technical Services (25%), Information Technology (17%), Manufacturing (15%), Finance and Insurance (9%), and Retail Trade (8%)). • Flexible career options. • Promises exponential salary growth.
  • 18. REFERENCES • www.wikipedia.com • www.computerhope.com • www.knowledgdehut.com • www.javapoint.com • www.tutorialsponit.com