SlideShare une entreprise Scribd logo
1  sur  42
Social Media with Big Data Analytics
Mohammed Zuhair Al-Taie
Big Data Centre - Universiti Teknologi Malaysia - 2016
AGENDA
Web 2.0
Social Media
Big Data
Social Media with Big Data Analytics
Social Network Analysis
Sentiment Analysis
Web 2.0 is
A Complex,
Organic Online
Conversation
WHAT IS WEB 2.0?
Web 2.0 is powered by:
• Social Networks
•News and
Bookmarking
•Blogs
•Microblogging
•Video/Photo-sharing
•Message Boards
•Wikis
•Virtual reality
•Social gaming
•Podcasts
•Real Simple
syndication (RSS)
•Social Media Press
Release
TECHNOLOGY OVERVIEW
Search: The ease of finding information through keyword search
Links: Ad-hoc guides to other relevant information
Authoring: The ability to create constantly updating content over a platform
that is shifted from being the creation of a few to being constantly updated,
interlinked work.
Tags: Categorization of content by creating tags: simple,one-word user-
determined descriptions to facilitate searching and avoid rigid, pre-made
categories
Extensions: Powerful algorithms that leverage the Web as an application
platform as well as a documentserver
Signals: The use of RSS technology to rapidly notify users of content changes
Web 2.0 websites typically include some of the following features/techniques-
SLATES
Social media:
is an umbrella
term that
defines the
various activities
that integrate
technology,
social
interaction, and
the construction
of words,
pictures, videos
and audio.
WEB 2.0 TECHNOLOGIES:
SOCIAL MEDIA
“Creation of web content, by the
people, for the people”
In Simple Language…
SOCIAL MEDIA PLATFORMS
WHAT HAPPENS EVERY 1 MIN?
 Variety of sources from where data is being
generated has also undergone a shift
 The types of data being created has changed
from structured to semi-structured to
unstructured data
Structured
Data
Semi-
Structured
Data
Unstructured
Data Need to manage broad range of data types
 Process analytic queries across numerous data
types
 Need to extract meaningful analysis from this
data has led to several technologies to gain
traction
 Examples include NoSQL databases to store
unstructured data as well as innovative
processing methods like Hadoop and massive
parallel processing (MPP)
Today 80% Of Data Existing In
Any Enterprise Is Unstructured
Data
Unstructured data from social
media has to be approached in a
non traditional manner.
UNSTRUCTURED DATA
Facebook
- User Likes and
Favorites
- Article/Video/Link
Shares
- Views
- Comments
- Location / Geospatial
Twitter
Tweet Characteristics
- Length
- Language Model
- Semantics
- Emoticons
- Location / Geospatial
Google / You Tube
- Blogs
- Comments
- Search Statistics
- Likes vs Dislikes
- Shares / Views /
Comments
IDENTIFYING UNSTRUCTURED DATA
SOURCES
“Big Data”
is data whose
scale, diversity,
and complexity
require new
architecture,
techniques,
algorithms, and
analytics to
manage it and
extract value and
hidden knowledge
from it…
BIG DATA IS…
BIG DATA =
BIG DATA VS USUAL DATA
Implication for an organization
2009 2011 2015 2020
0.8
1.9
7.9
35.0
CAGR
(2009-2020)
41.0%
Zetabytes
THE GLOBAL DATA GROWTH
>3,500
>40
>2,000
>200
>400
 Key verticals: Healthcare,
Manufacturing, Retail, Digital
Marketing
 Demand trend: High demand
of Big Data analytics
>250
 Key verticals: Telecom, Retail, Banking
 Demand trend: Still embryonic; most
organizations have wait and watch approach
 Demand trend: Current demand
appears to be limited, however,
lack of skills may drive
outsourcing of Big Data analytics
 Low awareness levels
 Key verticals: Technology, Financial services,
Oil & Gas, Utilities, Manufacturing
 Demand trend: European MNC’s are still in
the early stages of the adoption cycle
North
America
South America
Europe
Middle East
India
China
Japan
 Key verticals: Manufacturing,
Telecom, Health & Life Sciences
 Demand trend: Demand for BI
to derive operational efficiency
 Key verticals: Telecom, Bioinformatics,
Retail
 Demand trend: Industry is in nascent stage
with demand catching up, particularly in retail
>50
16
NORTH AMERICA & EUROPE DRIVES THE BIG DATA
OPPORTUNITY WITH OVER 85%
OF THE WORLD’S DATA
Tools Description
The Hadoop
Distributed
File System
(HDFS)
HDFS divides the data into smaller parts and distributes
it across the various servers/nodes
SQL Server
Integration
Service
These tools allow posts can be downloaded and loaded
into Hadoop
Apache
Flume
MapReduce
MapReduce is a process that transforms data loaded
into Hadoop into a format that can be used for analysis.
Hive
a runtime Hadoop support architecture that leverages
Structure Query Language (SQL) with the Hadoop
platform.
Jaql Jaql converts high-level queries into low-level queries
and
Zookeeper Zookeeper coordinate parallel processing across big
clusters
HBase HBase is a column-oriented database management
system that sits on top of HDFS by using a non-SQL
approach.
BIG DATA TOOLS
Variety
Veracity
Value
BIG DATA IS OFTEN DESCRIBED USING
FIVE Vs
Volume
refers to the vast amounts of
data generated every second.
We are not talking Terabytes
but Zettabytes or Brontobytes.
If we take all the data
generated in the world
between the beginning of time
and 2008, the same amount of
data will soon be generated
every minute.
This makes most data sets too
large to store and analyse
using traditional database
technology.
Variety
Veracity
Value
BIG DATA: VOLUME
BIG DATA: VELOCITY
Variety
Veracity
Value
Velocity
refers to the speed at which
new data is generated and
the speed at which data
moves around. Just think of
social media messages
going viral in seconds.
Technology allows us now to
analyse the data while it is
being generated
(sometimes referred to as
in-memory analytics),
without ever putting it into
databases.
Variety
Veracity
Value
Variety
refers to the different types
of data we can now use. In
the past we only focused on
structured data that neatly
fitted into tables or
relational databases, such
as financial data. In fact,
80% of the world’s data is
unstructured (text, images,
video, voice, etc.)
BIG DATA: VARIETY
Variety
Veracity
Value
Veracity
refers to the messiness or
trustworthiness of the data.
With many forms of big
data quality and accuracy
are less controllable (just
think of Twitter posts with
hash tags, abbreviations,
typos and colloquial speech
as well as the reliability and
accuracy of content) but
technology now allows us to
work with this type of data.
BIG DATA: VERACITY
Variety
Veracity
Value
VALUE
Then there is another V to
take into account when
looking at Big Data: Value!
Having access to big data is
no good unless we can turn
it into value.
Companies are starting to
generate amazing value
from their big data.
BIG DATA: VALUE
THE INTERSECTION OF SOCIAL MEDIA
AND BIG DATA
 Big Data is also characterized by
velocity or speed i.e. frequency of
data generation or the frequency of
data delivery
 New age communication channels
such as mobile phones, emails, social
networking has increased the rate of
information flows
Examples:
 Telcos adopting location based
marketing based on user location
sensed by mobile towers
 Satellite images can help monitor
and analyze troop movements, a
flood plane, cloud patterns, or forest
fires
 Video analysis systems could monitor
a sensitive or valuable facility,
watching for possible intruders and
alert authorities in real time
Big Data velocity enabling real
time use of data
Data
velocity
per
minute
600+
videos on
YouTube
200
million+
emails sent
2
million+
Google
search
queries
400,000+
minutes of
Skype
calling
400,000+
tweets on
Twitter
US$
300,000+
are spent
on online
shopping
700,000+
Facebook
updates
7,000+
photos on
flickr
1,500+
blog posts
3500+
ticks per
minute in
securities
trading
BIG DATA & REAL TIME USE
BIG DATA FOR SOCIAL MEDIA ANALYTICS
PROCESS MODEL
CONCEPTUAL VIEW OF FRAMEWORK FOR BIG DATA
EXTRACTION, MESSAGING AND STORE
This phase has a composite pattern that is
based on the store-and-explore and focuses on
obtaining and storing the relevant data from
sources outside our establishment.
CONCEPTUAL VIEW OF DISCUSSION TOPIC AND
OPINION ANALYSIS COMPONENT
This phase has a composite pattern that is based on
purposeful-and-predictive analytics to gain advanced
insight.
WHAT IS HADOOP?
*Hadoop is an open source
framework which is used for
storing and processing the
large scale of data sets on
large clusters of hardware.
*The specialty of Hadoop
involves in HDFS which is used
for storing data on large
commodity machines and
provides very huge bandwidth
for the cluster.
CONCEPTUAL VIEW OF APACHE HADOOP
ARCHITECTURE
CONCEPTUAL VIEW OF DATA VISUALIZATION AND
DECISION-MAKING COMPONENT
This project has a composite pattern based on
actionable-analysis with the aim of taking the next best
actions that leads to take appropriate actions by
related customers.
SOCIAL NETWORK ANALYSIS
A GLOBAL SOCIAL NETWORK
NETWORK PERSPECTIVE
WHY SOCIAL NETWORK ANALYSIS
MATTERS?
SOCIAL NETWORK ANALYSIS: THE NEW
SCIENCE OF NETWORKS
Sentiment analysis…
• Analyzes people’s sentiments,
opinions, appraisals, attitudes,
evaluations, and emotions
• Towards entities such as
organizations, products,
services, individuals, topics,
issues, events, and their
attributes
• As presented online via text,
video and other means of
communication.
• These communications can fall
into three broad categories:
positive, neutral or negative.
SENTIMENT ANALYSIS
We can inquire about sentiment at
various linguistic levels:
O Words – objective, positive,
negative, neutral
O Clauses – “going out of my
mind”
O Sentences – possibly multiple
sentiments
O Documents
LEVEL OF ANALYSIS
Elections 2012 Dashboard
FILTER BY:
Facebook
Twitter
Google
Mitt Romney
RepublicanPrimary
Democratic Vote
Republican Vote
Democratic Sentiment
Republican Sentiment
TRUTHY: A SOCIAL MEDIA RESEARCH
PROJECT
Truthy is a research project to study how memes spread on social
media. A meme is a transmissible unit of information, such as a hashtag,
phrase, or link. This website highlights some of the research coming from
this effort and showcases some visualizations, tools, and data resources
demonstrating broader impacts of the project.
Social media with big data analytics

Contenu connexe

Tendances

Big Data PPT by Rohit Dubey
Big Data PPT by Rohit DubeyBig Data PPT by Rohit Dubey
Big Data PPT by Rohit DubeyRohit Dubey
 
Big data visualization
Big data visualizationBig data visualization
Big data visualizationAnurag Gupta
 
Big data lecture notes
Big data lecture notesBig data lecture notes
Big data lecture notesMohit Saini
 
Introduction to data science
Introduction to data scienceIntroduction to data science
Introduction to data scienceMahir Haque
 
Big Data & Analytics (Conceptual and Practical Introduction)
Big Data & Analytics (Conceptual and Practical Introduction)Big Data & Analytics (Conceptual and Practical Introduction)
Big Data & Analytics (Conceptual and Practical Introduction)Yaman Hajja, Ph.D.
 
Finding business value in Big Data
Finding business value in Big DataFinding business value in Big Data
Finding business value in Big DataJames Serra
 
Social Media, Big Data
Social Media, Big Data Social Media, Big Data
Social Media, Big Data robin fay
 
Big Data Architecture
Big Data ArchitectureBig Data Architecture
Big Data ArchitectureGuido Schmutz
 
Slides: Knowledge Graphs vs. Property Graphs
Slides: Knowledge Graphs vs. Property GraphsSlides: Knowledge Graphs vs. Property Graphs
Slides: Knowledge Graphs vs. Property GraphsDATAVERSITY
 
Lecture1 introduction to big data
Lecture1 introduction to big dataLecture1 introduction to big data
Lecture1 introduction to big datahktripathy
 
Applications of Big Data Analytics in Businesses
Applications of Big Data Analytics in BusinessesApplications of Big Data Analytics in Businesses
Applications of Big Data Analytics in BusinessesT.S. Lim
 

Tendances (20)

Big Data
Big DataBig Data
Big Data
 
Big data
Big dataBig data
Big data
 
Big Data PPT by Rohit Dubey
Big Data PPT by Rohit DubeyBig Data PPT by Rohit Dubey
Big Data PPT by Rohit Dubey
 
Big data visualization
Big data visualizationBig data visualization
Big data visualization
 
Big data lecture notes
Big data lecture notesBig data lecture notes
Big data lecture notes
 
Big data ppt
Big  data pptBig  data ppt
Big data ppt
 
Introduction to data science
Introduction to data scienceIntroduction to data science
Introduction to data science
 
Big data ppt
Big data pptBig data ppt
Big data ppt
 
What is big data?
What is big data?What is big data?
What is big data?
 
Big_data_ppt
Big_data_ppt Big_data_ppt
Big_data_ppt
 
Big data analytics
Big data analyticsBig data analytics
Big data analytics
 
Big Data & Analytics (Conceptual and Practical Introduction)
Big Data & Analytics (Conceptual and Practical Introduction)Big Data & Analytics (Conceptual and Practical Introduction)
Big Data & Analytics (Conceptual and Practical Introduction)
 
Finding business value in Big Data
Finding business value in Big DataFinding business value in Big Data
Finding business value in Big Data
 
Social Media, Big Data
Social Media, Big Data Social Media, Big Data
Social Media, Big Data
 
Big Data Architecture
Big Data ArchitectureBig Data Architecture
Big Data Architecture
 
Data analytics
Data analyticsData analytics
Data analytics
 
Slides: Knowledge Graphs vs. Property Graphs
Slides: Knowledge Graphs vs. Property GraphsSlides: Knowledge Graphs vs. Property Graphs
Slides: Knowledge Graphs vs. Property Graphs
 
Lecture1 introduction to big data
Lecture1 introduction to big dataLecture1 introduction to big data
Lecture1 introduction to big data
 
Applications of Big Data Analytics in Businesses
Applications of Big Data Analytics in BusinessesApplications of Big Data Analytics in Businesses
Applications of Big Data Analytics in Businesses
 
Data analytics
Data analyticsData analytics
Data analytics
 

Similaire à Social media with big data analytics

Big data analytics with Apache Hadoop
Big data analytics with Apache  HadoopBig data analytics with Apache  Hadoop
Big data analytics with Apache HadoopSuman Saurabh
 
How Big Data ,Cloud Computing ,Data Science can help business
How Big Data ,Cloud Computing ,Data Science can help businessHow Big Data ,Cloud Computing ,Data Science can help business
How Big Data ,Cloud Computing ,Data Science can help businessAjay Ohri
 
Big data data lake and beyond
Big data data lake and beyond Big data data lake and beyond
Big data data lake and beyond Rajesh Kumar
 
Big data and Hadoop overview
Big data and Hadoop overviewBig data and Hadoop overview
Big data and Hadoop overviewNitesh Ghosh
 
Big-Data-Analytics.8592259.powerpoint.pdf
Big-Data-Analytics.8592259.powerpoint.pdfBig-Data-Analytics.8592259.powerpoint.pdf
Big-Data-Analytics.8592259.powerpoint.pdfrajsharma159890
 
Big Data and BI Tools - BI Reporting for Bay Area Startups User Group
Big Data and BI Tools - BI Reporting for Bay Area Startups User GroupBig Data and BI Tools - BI Reporting for Bay Area Startups User Group
Big Data and BI Tools - BI Reporting for Bay Area Startups User GroupScott Mitchell
 
Big data an elephant business opportunities
Big data an elephant   business opportunitiesBig data an elephant   business opportunities
Big data an elephant business opportunitiesBigdata Meetup Kochi
 
Making Internet Of Things Device Data Just Work!
Making Internet Of Things Device Data Just Work!Making Internet Of Things Device Data Just Work!
Making Internet Of Things Device Data Just Work!Memoori
 
Oh! Session on Introduction to BIG Data
Oh! Session on Introduction to BIG DataOh! Session on Introduction to BIG Data
Oh! Session on Introduction to BIG DataPrakalp Agarwal
 
BAR360 open data platform presentation at DAMA, Sydney
BAR360 open data platform presentation at DAMA, SydneyBAR360 open data platform presentation at DAMA, Sydney
BAR360 open data platform presentation at DAMA, SydneySai Paravastu
 
TDWI checklist - Evolving to Modern DW
TDWI checklist - Evolving to Modern DWTDWI checklist - Evolving to Modern DW
TDWI checklist - Evolving to Modern DWJeannette Browning
 
Big Data Testing Using Hadoop Platform
Big Data Testing Using Hadoop PlatformBig Data Testing Using Hadoop Platform
Big Data Testing Using Hadoop PlatformIRJET Journal
 
Introduction to BIG DATA
Introduction to BIG DATA Introduction to BIG DATA
Introduction to BIG DATA Zeeshan Khan
 
Cloud and Bid data Dr.VK.pdf
Cloud and Bid data Dr.VK.pdfCloud and Bid data Dr.VK.pdf
Cloud and Bid data Dr.VK.pdfkalai75
 
Real-time Analytics in Big data
Real-time Analytics in Big dataReal-time Analytics in Big data
Real-time Analytics in Big dataPratiksha Manan
 

Similaire à Social media with big data analytics (20)

Big data
Big dataBig data
Big data
 
Big data analytics with Apache Hadoop
Big data analytics with Apache  HadoopBig data analytics with Apache  Hadoop
Big data analytics with Apache Hadoop
 
How Big Data ,Cloud Computing ,Data Science can help business
How Big Data ,Cloud Computing ,Data Science can help businessHow Big Data ,Cloud Computing ,Data Science can help business
How Big Data ,Cloud Computing ,Data Science can help business
 
Big data data lake and beyond
Big data data lake and beyond Big data data lake and beyond
Big data data lake and beyond
 
Big data and Hadoop overview
Big data and Hadoop overviewBig data and Hadoop overview
Big data and Hadoop overview
 
Big Data
Big DataBig Data
Big Data
 
Big-Data-Analytics.8592259.powerpoint.pdf
Big-Data-Analytics.8592259.powerpoint.pdfBig-Data-Analytics.8592259.powerpoint.pdf
Big-Data-Analytics.8592259.powerpoint.pdf
 
Big Data and BI Tools - BI Reporting for Bay Area Startups User Group
Big Data and BI Tools - BI Reporting for Bay Area Startups User GroupBig Data and BI Tools - BI Reporting for Bay Area Startups User Group
Big Data and BI Tools - BI Reporting for Bay Area Startups User Group
 
Big data an elephant business opportunities
Big data an elephant   business opportunitiesBig data an elephant   business opportunities
Big data an elephant business opportunities
 
Bigdata overview
Bigdata overviewBigdata overview
Bigdata overview
 
BigData
BigDataBigData
BigData
 
Making Internet Of Things Device Data Just Work!
Making Internet Of Things Device Data Just Work!Making Internet Of Things Device Data Just Work!
Making Internet Of Things Device Data Just Work!
 
Oh! Session on Introduction to BIG Data
Oh! Session on Introduction to BIG DataOh! Session on Introduction to BIG Data
Oh! Session on Introduction to BIG Data
 
BAR360 open data platform presentation at DAMA, Sydney
BAR360 open data platform presentation at DAMA, SydneyBAR360 open data platform presentation at DAMA, Sydney
BAR360 open data platform presentation at DAMA, Sydney
 
TDWI checklist - Evolving to Modern DW
TDWI checklist - Evolving to Modern DWTDWI checklist - Evolving to Modern DW
TDWI checklist - Evolving to Modern DW
 
Big Data Testing Using Hadoop Platform
Big Data Testing Using Hadoop PlatformBig Data Testing Using Hadoop Platform
Big Data Testing Using Hadoop Platform
 
Introduction to BIG DATA
Introduction to BIG DATA Introduction to BIG DATA
Introduction to BIG DATA
 
Cloud and Bid data Dr.VK.pdf
Cloud and Bid data Dr.VK.pdfCloud and Bid data Dr.VK.pdf
Cloud and Bid data Dr.VK.pdf
 
Big data Presentation
Big data PresentationBig data Presentation
Big data Presentation
 
Real-time Analytics in Big data
Real-time Analytics in Big dataReal-time Analytics in Big data
Real-time Analytics in Big data
 

Plus de Universiti Technologi Malaysia (UTM)

A self organizing communication model for disaster risk management
A self organizing communication model for disaster risk managementA self organizing communication model for disaster risk management
A self organizing communication model for disaster risk managementUniversiti Technologi Malaysia (UTM)
 
Scientific theory of state and society parities and disparities between the p...
Scientific theory of state and society parities and disparities between the p...Scientific theory of state and society parities and disparities between the p...
Scientific theory of state and society parities and disparities between the p...Universiti Technologi Malaysia (UTM)
 
Explanations in Recommender Systems: Overview and Research Approaches
Explanations in Recommender Systems: Overview and Research ApproachesExplanations in Recommender Systems: Overview and Research Approaches
Explanations in Recommender Systems: Overview and Research ApproachesUniversiti Technologi Malaysia (UTM)
 
Factors disrupting a successful implementation of e-commerce in iraq
Factors disrupting a successful implementation of e-commerce in iraqFactors disrupting a successful implementation of e-commerce in iraq
Factors disrupting a successful implementation of e-commerce in iraqUniversiti Technologi Malaysia (UTM)
 

Plus de Universiti Technologi Malaysia (UTM) (11)

A self organizing communication model for disaster risk management
A self organizing communication model for disaster risk managementA self organizing communication model for disaster risk management
A self organizing communication model for disaster risk management
 
Spark Working Environment in Windows OS
Spark Working Environment in Windows OSSpark Working Environment in Windows OS
Spark Working Environment in Windows OS
 
Python networkx library quick start guide
Python networkx library quick start guidePython networkx library quick start guide
Python networkx library quick start guide
 
Python 3.x quick syntax guide
Python 3.x quick syntax guidePython 3.x quick syntax guide
Python 3.x quick syntax guide
 
Predicting the relevance of search results for e-commerce systems
Predicting the relevance of search results for e-commerce systemsPredicting the relevance of search results for e-commerce systems
Predicting the relevance of search results for e-commerce systems
 
Scientific theory of state and society parities and disparities between the p...
Scientific theory of state and society parities and disparities between the p...Scientific theory of state and society parities and disparities between the p...
Scientific theory of state and society parities and disparities between the p...
 
Nation building current trends of technology use in da’wah
Nation building current trends of technology use in da’wahNation building current trends of technology use in da’wah
Nation building current trends of technology use in da’wah
 
Flight MH370 community structure
Flight MH370 community structureFlight MH370 community structure
Flight MH370 community structure
 
Visualization of explanations in recommender systems
Visualization of explanations in recommender systemsVisualization of explanations in recommender systems
Visualization of explanations in recommender systems
 
Explanations in Recommender Systems: Overview and Research Approaches
Explanations in Recommender Systems: Overview and Research ApproachesExplanations in Recommender Systems: Overview and Research Approaches
Explanations in Recommender Systems: Overview and Research Approaches
 
Factors disrupting a successful implementation of e-commerce in iraq
Factors disrupting a successful implementation of e-commerce in iraqFactors disrupting a successful implementation of e-commerce in iraq
Factors disrupting a successful implementation of e-commerce in iraq
 

Dernier

Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxolyaivanovalion
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionfulawalesam
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxolyaivanovalion
 
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...Suhani Kapoor
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfMarinCaroMartnezBerg
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfRachmat Ramadhan H
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationshipsccctableauusergroup
 
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxBPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxMohammedJunaid861692
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Callshivangimorya083
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130Suhani Kapoor
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Delhi Call girls
 
Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSAishani27
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxolyaivanovalion
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusTimothy Spann
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat
 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxfirstjob4
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingNeil Barnes
 

Dernier (20)

Sampling (random) method and Non random.ppt
Sampling (random) method and Non random.pptSampling (random) method and Non random.ppt
Sampling (random) method and Non random.ppt
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptx
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interaction
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFx
 
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships
 
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxBPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
 
Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICS
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptx
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and Milvus
 
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptx
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data Storytelling
 
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
 

Social media with big data analytics

  • 1. Social Media with Big Data Analytics Mohammed Zuhair Al-Taie Big Data Centre - Universiti Teknologi Malaysia - 2016
  • 2. AGENDA Web 2.0 Social Media Big Data Social Media with Big Data Analytics Social Network Analysis Sentiment Analysis
  • 3. Web 2.0 is A Complex, Organic Online Conversation WHAT IS WEB 2.0? Web 2.0 is powered by: • Social Networks •News and Bookmarking •Blogs •Microblogging •Video/Photo-sharing •Message Boards •Wikis •Virtual reality •Social gaming •Podcasts •Real Simple syndication (RSS) •Social Media Press Release
  • 4. TECHNOLOGY OVERVIEW Search: The ease of finding information through keyword search Links: Ad-hoc guides to other relevant information Authoring: The ability to create constantly updating content over a platform that is shifted from being the creation of a few to being constantly updated, interlinked work. Tags: Categorization of content by creating tags: simple,one-word user- determined descriptions to facilitate searching and avoid rigid, pre-made categories Extensions: Powerful algorithms that leverage the Web as an application platform as well as a documentserver Signals: The use of RSS technology to rapidly notify users of content changes Web 2.0 websites typically include some of the following features/techniques- SLATES
  • 5. Social media: is an umbrella term that defines the various activities that integrate technology, social interaction, and the construction of words, pictures, videos and audio. WEB 2.0 TECHNOLOGIES: SOCIAL MEDIA
  • 6. “Creation of web content, by the people, for the people” In Simple Language…
  • 9.  Variety of sources from where data is being generated has also undergone a shift  The types of data being created has changed from structured to semi-structured to unstructured data Structured Data Semi- Structured Data Unstructured Data Need to manage broad range of data types  Process analytic queries across numerous data types  Need to extract meaningful analysis from this data has led to several technologies to gain traction  Examples include NoSQL databases to store unstructured data as well as innovative processing methods like Hadoop and massive parallel processing (MPP) Today 80% Of Data Existing In Any Enterprise Is Unstructured Data Unstructured data from social media has to be approached in a non traditional manner. UNSTRUCTURED DATA
  • 10. Facebook - User Likes and Favorites - Article/Video/Link Shares - Views - Comments - Location / Geospatial Twitter Tweet Characteristics - Length - Language Model - Semantics - Emoticons - Location / Geospatial Google / You Tube - Blogs - Comments - Search Statistics - Likes vs Dislikes - Shares / Views / Comments IDENTIFYING UNSTRUCTURED DATA SOURCES
  • 11.
  • 12.
  • 13. “Big Data” is data whose scale, diversity, and complexity require new architecture, techniques, algorithms, and analytics to manage it and extract value and hidden knowledge from it… BIG DATA IS… BIG DATA =
  • 14. BIG DATA VS USUAL DATA
  • 15. Implication for an organization 2009 2011 2015 2020 0.8 1.9 7.9 35.0 CAGR (2009-2020) 41.0% Zetabytes THE GLOBAL DATA GROWTH
  • 16. >3,500 >40 >2,000 >200 >400  Key verticals: Healthcare, Manufacturing, Retail, Digital Marketing  Demand trend: High demand of Big Data analytics >250  Key verticals: Telecom, Retail, Banking  Demand trend: Still embryonic; most organizations have wait and watch approach  Demand trend: Current demand appears to be limited, however, lack of skills may drive outsourcing of Big Data analytics  Low awareness levels  Key verticals: Technology, Financial services, Oil & Gas, Utilities, Manufacturing  Demand trend: European MNC’s are still in the early stages of the adoption cycle North America South America Europe Middle East India China Japan  Key verticals: Manufacturing, Telecom, Health & Life Sciences  Demand trend: Demand for BI to derive operational efficiency  Key verticals: Telecom, Bioinformatics, Retail  Demand trend: Industry is in nascent stage with demand catching up, particularly in retail >50 16 NORTH AMERICA & EUROPE DRIVES THE BIG DATA OPPORTUNITY WITH OVER 85% OF THE WORLD’S DATA
  • 17. Tools Description The Hadoop Distributed File System (HDFS) HDFS divides the data into smaller parts and distributes it across the various servers/nodes SQL Server Integration Service These tools allow posts can be downloaded and loaded into Hadoop Apache Flume MapReduce MapReduce is a process that transforms data loaded into Hadoop into a format that can be used for analysis. Hive a runtime Hadoop support architecture that leverages Structure Query Language (SQL) with the Hadoop platform. Jaql Jaql converts high-level queries into low-level queries and Zookeeper Zookeeper coordinate parallel processing across big clusters HBase HBase is a column-oriented database management system that sits on top of HDFS by using a non-SQL approach. BIG DATA TOOLS
  • 18. Variety Veracity Value BIG DATA IS OFTEN DESCRIBED USING FIVE Vs
  • 19. Volume refers to the vast amounts of data generated every second. We are not talking Terabytes but Zettabytes or Brontobytes. If we take all the data generated in the world between the beginning of time and 2008, the same amount of data will soon be generated every minute. This makes most data sets too large to store and analyse using traditional database technology. Variety Veracity Value BIG DATA: VOLUME
  • 20. BIG DATA: VELOCITY Variety Veracity Value Velocity refers to the speed at which new data is generated and the speed at which data moves around. Just think of social media messages going viral in seconds. Technology allows us now to analyse the data while it is being generated (sometimes referred to as in-memory analytics), without ever putting it into databases.
  • 21. Variety Veracity Value Variety refers to the different types of data we can now use. In the past we only focused on structured data that neatly fitted into tables or relational databases, such as financial data. In fact, 80% of the world’s data is unstructured (text, images, video, voice, etc.) BIG DATA: VARIETY
  • 22. Variety Veracity Value Veracity refers to the messiness or trustworthiness of the data. With many forms of big data quality and accuracy are less controllable (just think of Twitter posts with hash tags, abbreviations, typos and colloquial speech as well as the reliability and accuracy of content) but technology now allows us to work with this type of data. BIG DATA: VERACITY
  • 23. Variety Veracity Value VALUE Then there is another V to take into account when looking at Big Data: Value! Having access to big data is no good unless we can turn it into value. Companies are starting to generate amazing value from their big data. BIG DATA: VALUE
  • 24.
  • 25. THE INTERSECTION OF SOCIAL MEDIA AND BIG DATA
  • 26.  Big Data is also characterized by velocity or speed i.e. frequency of data generation or the frequency of data delivery  New age communication channels such as mobile phones, emails, social networking has increased the rate of information flows Examples:  Telcos adopting location based marketing based on user location sensed by mobile towers  Satellite images can help monitor and analyze troop movements, a flood plane, cloud patterns, or forest fires  Video analysis systems could monitor a sensitive or valuable facility, watching for possible intruders and alert authorities in real time Big Data velocity enabling real time use of data Data velocity per minute 600+ videos on YouTube 200 million+ emails sent 2 million+ Google search queries 400,000+ minutes of Skype calling 400,000+ tweets on Twitter US$ 300,000+ are spent on online shopping 700,000+ Facebook updates 7,000+ photos on flickr 1,500+ blog posts 3500+ ticks per minute in securities trading BIG DATA & REAL TIME USE
  • 27. BIG DATA FOR SOCIAL MEDIA ANALYTICS PROCESS MODEL
  • 28. CONCEPTUAL VIEW OF FRAMEWORK FOR BIG DATA EXTRACTION, MESSAGING AND STORE This phase has a composite pattern that is based on the store-and-explore and focuses on obtaining and storing the relevant data from sources outside our establishment.
  • 29. CONCEPTUAL VIEW OF DISCUSSION TOPIC AND OPINION ANALYSIS COMPONENT This phase has a composite pattern that is based on purposeful-and-predictive analytics to gain advanced insight.
  • 30. WHAT IS HADOOP? *Hadoop is an open source framework which is used for storing and processing the large scale of data sets on large clusters of hardware. *The specialty of Hadoop involves in HDFS which is used for storing data on large commodity machines and provides very huge bandwidth for the cluster.
  • 31. CONCEPTUAL VIEW OF APACHE HADOOP ARCHITECTURE
  • 32. CONCEPTUAL VIEW OF DATA VISUALIZATION AND DECISION-MAKING COMPONENT This project has a composite pattern based on actionable-analysis with the aim of taking the next best actions that leads to take appropriate actions by related customers.
  • 34. A GLOBAL SOCIAL NETWORK
  • 36. WHY SOCIAL NETWORK ANALYSIS MATTERS?
  • 37. SOCIAL NETWORK ANALYSIS: THE NEW SCIENCE OF NETWORKS
  • 38. Sentiment analysis… • Analyzes people’s sentiments, opinions, appraisals, attitudes, evaluations, and emotions • Towards entities such as organizations, products, services, individuals, topics, issues, events, and their attributes • As presented online via text, video and other means of communication. • These communications can fall into three broad categories: positive, neutral or negative. SENTIMENT ANALYSIS
  • 39. We can inquire about sentiment at various linguistic levels: O Words – objective, positive, negative, neutral O Clauses – “going out of my mind” O Sentences – possibly multiple sentiments O Documents LEVEL OF ANALYSIS
  • 40. Elections 2012 Dashboard FILTER BY: Facebook Twitter Google Mitt Romney RepublicanPrimary Democratic Vote Republican Vote Democratic Sentiment Republican Sentiment
  • 41. TRUTHY: A SOCIAL MEDIA RESEARCH PROJECT Truthy is a research project to study how memes spread on social media. A meme is a transmissible unit of information, such as a hashtag, phrase, or link. This website highlights some of the research coming from this effort and showcases some visualizations, tools, and data resources demonstrating broader impacts of the project.