SlideShare a Scribd company logo
1 of 20
Social Web: Where are the Semantics?
ESWC 2014
Miriam Fernández, Victor Rodríguez,
Andrés García-Silva, Oscar Corcho
Ontology Engineering Group, UPM, Spain
Knowledge Media Institute, The Open University
Outline
2
•  Part 1: Understanding Social Media
–  Theory: background & applications described in this tutorial
–  Hands on: data extraction from Twitter and Facebook
•  Part 2: Using semantics to represent data from SNS
–  Theory: Using SW to represent content, users and relations
–  Hands on: applying and extending SIOC
•  Part 3: Using semantics to understand social media conversations
–  Theory: Using semantics to understand topics in social media
–  Hands on: using LDA to extract topics from social media
•  Part 4: Using semantics to understand user behaviour
Implicit vs. Explicit Semantics
•  Implicit Semantics
–  Implicit, also called statistical semantics, focus on extracting word
sense by studying the patterns of human word usage in massive
collections of text or other human generated data
–  It does not rely on an explicit formalisation/conceptualisation of
knowledge
•  Explicit Semantics
–  Explicit semantics focus on the analysis of content by using the
support of explicit conceptualisations in the form of ontologies and
knowledge bases
ESWC 2014 Social Web: Where are the Semantics? 3
Explicit Semantics
Structured
Unstructured
From the Web of human generated content
The Web of unstructured text (Posts / Documents)
and Links
To the Web of machine understandable content
The Web of Objects and Relations
•  The annotators extract entities (classes / individuals) and relations
from the text and link them to object URIs
Obtaining explicit semantics from social media content
Using Semantics to Analyse Topic Evolution
•  LDA topics are identified by a set of keywords
–  Difficult to assess their meaning and evolution
•  Use explicit semantics to characterise topics as concrete entities
6
!
!
Using Semantics to Analyse Topic Evolution
ESWC 2014 Social Web: Where are the Semantics? 7
•  Analyse concepts appearance
–  Within a group
–  Across groups
–  Over time
•  Type filtering
•  Interlinking with other datasets
(data.open.ac.uk)
Using Semantics To Analyse Sentiment
•  Sentiment analysis on social media
–  Offers a fast and cheap access to publics’ feelings towards brands,
business, people, etc.
–  Comes with additional challenges
–  Current approaches
•  Lexical-based
•  Machine Learning
–  Explicit semantics are often neglected
ESWC 2014 Social Web: Where are the Semantics? 8
Using Semantics to Analyse Sentiment
•  Add semantics as additional features into the training set
•  Results
–  Incorporating semantics increases accuracy by 6.5% for negative
sentiment and by 4.8% for positive sentiment
–  The use of explicit semantics is more appropriate when the datasets
being analysed are large and cover a wide range of topics
Saif, Hassan, He, Yulan, Alani, Harith (2012). Semantic sentiment analysis of twitter. In: 11th
International Semantic Web Conference (ISWC 2012)
“Words that occur in
similar context tend
to have similar
meaning”
Wittgenstein (1953)
Using Semantics To Analyse Sentiment
•  SentiCircles
–  Integrates implicit and explicit semantics to analyse sentiment
–  Outperforms other lexicon labeling methods and overtakes the state-of-the-
art SentiStrength approach in accuracy, with a marginal drop in F-measure
ESWC 2014 Social Web: Where are the Semantics? 10
Saif, Hassan, Fernandez, Miriam, He, Yulan, Alani, Harith (2014). SentiCircles for Tweet-level
Sentiment Analysis (ESWC 2014) -> conference presentation on the 27, 14:00!!
Using Semantics To Analyse Sentiment
ESWC 2014 Social Web: Where are the Semantics? 11
Using Semantics to Analyse User Behaviour
•  Goal
–  Monitor and capture member activities
–  Analyse emerging behaviour over time
–  Understand the correlation of behaviour with community evolution
•  Approach
–  Identify behavioural features and behaviour roles
–  Create an ontology to model behavioural roles and behaviour
features
–  Use semantic rules to infer user roles in online communities
–  Study role composition patterns
ESWC 2014 Social Web: Where are the Semantics? 12
Angeletou, S., Rowe, M. and Alani, H. (2011) Modelling and Analysis of User Behaviour in Online
Communities, 10th International Semantic Web Conference (ISWC 2011), Bonn, Germany
Rowe, Matthew; Fernandez, Miriam; Angeletou, Sofia and Alani, Harith (2013). Community analysis through
semantic rules and role composition derivation. Journal of Web Semantics: Science, Services and Agents on
the World Wide Web, 18(1) pp. 31–47
Behavioural roles and features
ESWC 2014 Social Web: Where are the Semantics? 13
Table 1. Roles and the feature-to-level mappings
Role Feature Level
Elitist In-Degree Ratio low
Bi-directional Threads Ratio high
Bi-directional Neighbours Ratio low
Grunt Bi-directional Threads Ratio med
Bi-directional Neighbours Ratio med
Average Posts per Thread low
STD of Posts per Thread low
Joining Conversationalist Thread Initiation Ratio low
Average Posts per Thread high
STD of Posts per Thread high
Popular Initiator In-Degree Ratio high
Thread Initiation Ratio high
Popular Participants In-Degree Ratio high
Thread Initiation Ratio low
Average Posts per Thread med
STD of Posts per Thread med
Supporter In-Degree Ratio med
Bi-directional Threads Ratio med
Bi-directional Neighbours Ratio med
Taciturn Bi-directional Threads Ratio low
Bi-directional Neighbours Ratio low
Average Posts per Thread low
STD of Posts per Thread low
Ignored Posts Replied Ratio low
Jeffrey Chan, Conor Hayes, and Elizabeth Daly. Decomposing discussion forums using
common user roles. In Proc. Web Science Conf. (WebSci10), Raleigh, NC: US, 2010.
Modelling user features and interactions
ESWC 2014 Social Web: Where are the Semantics? 14
http://purl.org/net/oubo/0.3•  OUBO: The OU Behaviour Ontology
Encoding Rules in Ontologies with SPIN
ESWC 2014 Social Web: Where are the Semantics? 15
Apply rules to infer user roles over time
ESWC 2014 Social Web: Where are the Semantics? 16
1.- Construct features for community
users at a given time step
2.- Derive bings using equal
frequency binning
Popularity-low cutoff = 0.5
Initiation-high cutoff = 0.4
3.- Use skeleton rule base to construct
rules using bin levels
Popularity=low, Initiation=high ->roleA
Popularity<0.5, Initiation > 0.4 -> roleA
4.- Apply rules to infer user roles and
community composition
5.- Repeat 1-4 following time steps
Analyse the role composition of the community
ESWC 2014 Social Web: Where are the Semantics? 17
•  Investigate the correlation between the role composition and the
students’ performance
Analyse the role composition of the community
•  Allow Policy Makers to focus on a smaller set of users, with whom
they may want to engage more closely
ESWC 2014 Social Web: Where are the Semantics? 18
Analyse the role composition of the community
•  Development of models to predict community health based on role
compositions and evolution of user behaviour
–  Health Indicators
•  Churn Rate: proportion of users who leave the network in a given time segment
•  User Count: number of users who posted at least once
•  Seeds / Non seeds: proportion of posts that get responses vs. those that don’t
•  Clustering coefficient: measures the cohesion within the network
–  Results
•  Accurate detection of community health is possible using role composition
information
•  There is no “one size fits all” model
ESWC 2014 Social Web: Where are the Semantics? 19
Rowe, M. and Alani, H. (2012) What Makes Communities Tick? Community Health Analysis using Role
compositions. International Conference on Social Computing, 2012
0.0 0.2 0.4 0.6 0.8 1.0
0.00.20.40.60.81.0
Churn Rate
FPR
TPR
0.0 0.2 0.4 0.6 0.8 1.0
0.00.20.40.60.81.0
User Count
FPR
TPR
0.0 0.2 0.4 0.6 0.8 1.0
0.00.20.40.60.81.0
Seeds / Non−seeds Prop
FPR
TPR
0.0 0.2 0.4 0.6 0.8 1.0
0.00.20.40.60.81.0
Clustering Coefficient
FPR
TPR
Challenges: How would you address them?
•  Scalability
–  Communities exceed millions of users
–  Infrastructures must support hundreds of millions discussion threads
•  Growth (real-time analysis)
–  Speed of new incoming data / stream processing
•  Concept vs. keyword based data acquisition/pre-processing
–  How to filter certain tags?
–  Which new topics emerge?
–  How topics evolve over time?
–  Authorship in social media, who copies who?
•  Multilingualism
–  We all speak different languages
•  Understanding the user and acting accordingly
–  We all have different personalities, behaviours and preferences
ESWC 2014 Social Web: Where are the Semantics? 20

More Related Content

What's hot

Social Media Use by Canadian Academic Librarians
Social Media Use by Canadian Academic LibrariansSocial Media Use by Canadian Academic Librarians
Social Media Use by Canadian Academic LibrariansCARLsurvey2010
 
04 Diffusion and Peer Influence
04 Diffusion and Peer Influence04 Diffusion and Peer Influence
04 Diffusion and Peer Influencednac
 
11 Network Experiments and Interventions
11 Network Experiments and Interventions11 Network Experiments and Interventions
11 Network Experiments and Interventionsdnac
 
Categorize balanced dataset for troll detection
Categorize balanced dataset for troll detectionCategorize balanced dataset for troll detection
Categorize balanced dataset for troll detectionvivatechijri
 
05 Communities in Network
05 Communities in Network05 Communities in Network
05 Communities in Networkdnac
 
Predicting cyber bullying on t witter using machine learning
Predicting cyber bullying on t witter using machine learningPredicting cyber bullying on t witter using machine learning
Predicting cyber bullying on t witter using machine learningMirXahid1
 
WSDM'16 Relational Learning with Social Status Analysis
WSDM'16 Relational Learning with Social Status AnalysisWSDM'16 Relational Learning with Social Status Analysis
WSDM'16 Relational Learning with Social Status AnalysisArizona State University
 
Published Paper
Published PaperPublished Paper
Published PaperFaeza Noor
 
CARL ABRC social media environmental scan 2011
CARL ABRC social media environmental scan 2011CARL ABRC social media environmental scan 2011
CARL ABRC social media environmental scan 2011CARLsurvey2010
 
Social Network Analysis (SNA) 2018
Social Network Analysis  (SNA) 2018Social Network Analysis  (SNA) 2018
Social Network Analysis (SNA) 2018Arsalan Khan
 
Social Network Analysis: applications for education research
Social Network Analysis: applications for education researchSocial Network Analysis: applications for education research
Social Network Analysis: applications for education researchChristian Bokhove
 
10 More than a Pretty Picture: Visual Thinking in Network Studies
10 More than a Pretty Picture: Visual Thinking in Network Studies10 More than a Pretty Picture: Visual Thinking in Network Studies
10 More than a Pretty Picture: Visual Thinking in Network Studiesdnac
 
Learning Analytics - CET Seminar 2012
Learning Analytics - CET Seminar 2012Learning Analytics - CET Seminar 2012
Learning Analytics - CET Seminar 2012Andrew Deacon
 
CARL ABRC Survey Results april 2011
CARL ABRC Survey Results april 2011CARL ABRC Survey Results april 2011
CARL ABRC Survey Results april 2011CARLsurvey2010
 
Social Network Analysis (Part 1)
Social Network Analysis (Part 1)Social Network Analysis (Part 1)
Social Network Analysis (Part 1)Vala Ali Rohani
 
Political prediction analysis using text mining and deep learning
Political prediction analysis using text mining and deep learningPolitical prediction analysis using text mining and deep learning
Political prediction analysis using text mining and deep learningVishwambhar Deshpande
 
Social media for libraries
Social media for librariesSocial media for libraries
Social media for librariesNaomi Bates
 
Taylor & Francis: Use of Social Media by the Library
Taylor & Francis: Use of Social Media by the LibraryTaylor & Francis: Use of Social Media by the Library
Taylor & Francis: Use of Social Media by the LibrarySIBiUSP
 
Who creates trends in online social media
Who creates trends in online social mediaWho creates trends in online social media
Who creates trends in online social mediaAmir Razmjou
 
Social computing meet & greet
Social computing meet & greetSocial computing meet & greet
Social computing meet & greetAngela Brandt
 

What's hot (20)

Social Media Use by Canadian Academic Librarians
Social Media Use by Canadian Academic LibrariansSocial Media Use by Canadian Academic Librarians
Social Media Use by Canadian Academic Librarians
 
04 Diffusion and Peer Influence
04 Diffusion and Peer Influence04 Diffusion and Peer Influence
04 Diffusion and Peer Influence
 
11 Network Experiments and Interventions
11 Network Experiments and Interventions11 Network Experiments and Interventions
11 Network Experiments and Interventions
 
Categorize balanced dataset for troll detection
Categorize balanced dataset for troll detectionCategorize balanced dataset for troll detection
Categorize balanced dataset for troll detection
 
05 Communities in Network
05 Communities in Network05 Communities in Network
05 Communities in Network
 
Predicting cyber bullying on t witter using machine learning
Predicting cyber bullying on t witter using machine learningPredicting cyber bullying on t witter using machine learning
Predicting cyber bullying on t witter using machine learning
 
WSDM'16 Relational Learning with Social Status Analysis
WSDM'16 Relational Learning with Social Status AnalysisWSDM'16 Relational Learning with Social Status Analysis
WSDM'16 Relational Learning with Social Status Analysis
 
Published Paper
Published PaperPublished Paper
Published Paper
 
CARL ABRC social media environmental scan 2011
CARL ABRC social media environmental scan 2011CARL ABRC social media environmental scan 2011
CARL ABRC social media environmental scan 2011
 
Social Network Analysis (SNA) 2018
Social Network Analysis  (SNA) 2018Social Network Analysis  (SNA) 2018
Social Network Analysis (SNA) 2018
 
Social Network Analysis: applications for education research
Social Network Analysis: applications for education researchSocial Network Analysis: applications for education research
Social Network Analysis: applications for education research
 
10 More than a Pretty Picture: Visual Thinking in Network Studies
10 More than a Pretty Picture: Visual Thinking in Network Studies10 More than a Pretty Picture: Visual Thinking in Network Studies
10 More than a Pretty Picture: Visual Thinking in Network Studies
 
Learning Analytics - CET Seminar 2012
Learning Analytics - CET Seminar 2012Learning Analytics - CET Seminar 2012
Learning Analytics - CET Seminar 2012
 
CARL ABRC Survey Results april 2011
CARL ABRC Survey Results april 2011CARL ABRC Survey Results april 2011
CARL ABRC Survey Results april 2011
 
Social Network Analysis (Part 1)
Social Network Analysis (Part 1)Social Network Analysis (Part 1)
Social Network Analysis (Part 1)
 
Political prediction analysis using text mining and deep learning
Political prediction analysis using text mining and deep learningPolitical prediction analysis using text mining and deep learning
Political prediction analysis using text mining and deep learning
 
Social media for libraries
Social media for librariesSocial media for libraries
Social media for libraries
 
Taylor & Francis: Use of Social Media by the Library
Taylor & Francis: Use of Social Media by the LibraryTaylor & Francis: Use of Social Media by the Library
Taylor & Francis: Use of Social Media by the Library
 
Who creates trends in online social media
Who creates trends in online social mediaWho creates trends in online social media
Who creates trends in online social media
 
Social computing meet & greet
Social computing meet & greetSocial computing meet & greet
Social computing meet & greet
 

Viewers also liked

Viewers also liked (8)

SocInfo2014 CityLabs Workshop
SocInfo2014 CityLabs WorkshopSocInfo2014 CityLabs Workshop
SocInfo2014 CityLabs Workshop
 
ESWC 2014 Tutorial part 1
ESWC 2014 Tutorial part 1ESWC 2014 Tutorial part 1
ESWC 2014 Tutorial part 1
 
Dealers Program
Dealers ProgramDealers Program
Dealers Program
 
DealersProgram
DealersProgramDealersProgram
DealersProgram
 
ESWC 2014 Tutorial part 2
ESWC 2014 Tutorial part 2ESWC 2014 Tutorial part 2
ESWC 2014 Tutorial part 2
 
CAEPIA 2011
CAEPIA 2011CAEPIA 2011
CAEPIA 2011
 
Toyota slides candi's revision
Toyota slides  candi's revisionToyota slides  candi's revision
Toyota slides candi's revision
 
ESWC 2014 Tutorial part 3
ESWC 2014 Tutorial part 3ESWC 2014 Tutorial part 3
ESWC 2014 Tutorial part 3
 

Similar to ESWC 2014 Tutorial Part 4

2015 03 19 (EDUCON2015) eMadrid UPM Towards a Learning Analytics Approach for...
2015 03 19 (EDUCON2015) eMadrid UPM Towards a Learning Analytics Approach for...2015 03 19 (EDUCON2015) eMadrid UPM Towards a Learning Analytics Approach for...
2015 03 19 (EDUCON2015) eMadrid UPM Towards a Learning Analytics Approach for...eMadrid network
 
Click and Connect: Social Media Affordances & their influence on user partici...
Click and Connect: Social Media Affordances & their influence on user partici...Click and Connect: Social Media Affordances & their influence on user partici...
Click and Connect: Social Media Affordances & their influence on user partici...Global OER Graduate Network
 
Social Network Analysis based on MOOC's (Massive Open Online Classes)
Social Network Analysis based on MOOC's (Massive Open Online Classes)Social Network Analysis based on MOOC's (Massive Open Online Classes)
Social Network Analysis based on MOOC's (Massive Open Online Classes)ShankarPrasaadRajama
 
Working with Social Media Data: Ethics & good practice around collecting, usi...
Working with Social Media Data: Ethics & good practice around collecting, usi...Working with Social Media Data: Ethics & good practice around collecting, usi...
Working with Social Media Data: Ethics & good practice around collecting, usi...Nicola Osborne
 
Making More Sense Out of Social Data
Making More Sense Out of Social DataMaking More Sense Out of Social Data
Making More Sense Out of Social DataThe Open University
 
Social information Access2012
Social information Access2012Social information Access2012
Social information Access2012Peter Brusilovsky
 
SampleLiteratureReviewTemplate_IVBTechIISEM_MajorProject.pptx
SampleLiteratureReviewTemplate_IVBTechIISEM_MajorProject.pptxSampleLiteratureReviewTemplate_IVBTechIISEM_MajorProject.pptx
SampleLiteratureReviewTemplate_IVBTechIISEM_MajorProject.pptx20211a05p7
 
Empirical user studies in Semantic Web contexts
Empirical user studies in Semantic Web contextsEmpirical user studies in Semantic Web contexts
Empirical user studies in Semantic Web contextsCatia Pesquita
 
Influencing the MOOC agenda - analysis of #MOOC Twitter Data
Influencing the MOOC agenda - analysis of #MOOC Twitter Data  Influencing the MOOC agenda - analysis of #MOOC Twitter Data
Influencing the MOOC agenda - analysis of #MOOC Twitter Data Mairéad Nic Giolla Mhichíl
 
Social media and researchers: Josipa Crnic Deakin University
Social media and researchers: Josipa Crnic Deakin University Social media and researchers: Josipa Crnic Deakin University
Social media and researchers: Josipa Crnic Deakin University therese nolan-brown
 
Global Redirective Practices: an online workshop for a client
Global Redirective Practices: an online workshop for a clientGlobal Redirective Practices: an online workshop for a client
Global Redirective Practices: an online workshop for a clientSean Connolly
 
Stabilization of Black Cotton Soil with Red Mud and Formulation of Linear Reg...
Stabilization of Black Cotton Soil with Red Mud and Formulation of Linear Reg...Stabilization of Black Cotton Soil with Red Mud and Formulation of Linear Reg...
Stabilization of Black Cotton Soil with Red Mud and Formulation of Linear Reg...IRJET Journal
 
An Ensemble Model for Cross-Domain Polarity Classification on Twitter
An Ensemble Model for Cross-Domain Polarity Classification on TwitterAn Ensemble Model for Cross-Domain Polarity Classification on Twitter
An Ensemble Model for Cross-Domain Polarity Classification on TwitterSymeon Papadopoulos
 
Social Media Analytics with a pinch of semantics
Social Media Analytics with a pinch of semanticsSocial Media Analytics with a pinch of semantics
Social Media Analytics with a pinch of semanticsThe Open University
 

Similar to ESWC 2014 Tutorial Part 4 (20)

Ifip wg-galway-
Ifip wg-galway-Ifip wg-galway-
Ifip wg-galway-
 
2015 03 19 (EDUCON2015) eMadrid UPM Towards a Learning Analytics Approach for...
2015 03 19 (EDUCON2015) eMadrid UPM Towards a Learning Analytics Approach for...2015 03 19 (EDUCON2015) eMadrid UPM Towards a Learning Analytics Approach for...
2015 03 19 (EDUCON2015) eMadrid UPM Towards a Learning Analytics Approach for...
 
Network Awareness Tool - Learning Analytics in the workplace: 
Detecting and ...
Network Awareness Tool - Learning Analytics in the workplace: 
Detecting and ...Network Awareness Tool - Learning Analytics in the workplace: 
Detecting and ...
Network Awareness Tool - Learning Analytics in the workplace: 
Detecting and ...
 
Click and Connect: Social Media Affordances & their influence on user partici...
Click and Connect: Social Media Affordances & their influence on user partici...Click and Connect: Social Media Affordances & their influence on user partici...
Click and Connect: Social Media Affordances & their influence on user partici...
 
Social Network Analysis based on MOOC's (Massive Open Online Classes)
Social Network Analysis based on MOOC's (Massive Open Online Classes)Social Network Analysis based on MOOC's (Massive Open Online Classes)
Social Network Analysis based on MOOC's (Massive Open Online Classes)
 
Working with Social Media Data: Ethics & good practice around collecting, usi...
Working with Social Media Data: Ethics & good practice around collecting, usi...Working with Social Media Data: Ethics & good practice around collecting, usi...
Working with Social Media Data: Ethics & good practice around collecting, usi...
 
CIC Networked Learning Practices Workshop - Caroline Haythornthwaite
CIC Networked Learning Practices Workshop - Caroline HaythornthwaiteCIC Networked Learning Practices Workshop - Caroline Haythornthwaite
CIC Networked Learning Practices Workshop - Caroline Haythornthwaite
 
Making More Sense Out of Social Data
Making More Sense Out of Social DataMaking More Sense Out of Social Data
Making More Sense Out of Social Data
 
Qs1 group a
Qs1 group a Qs1 group a
Qs1 group a
 
The Future of Research Communications and e-Scholarship: Are we there yet?
The Future of Research Communications and e-Scholarship: Are we there yet?The Future of Research Communications and e-Scholarship: Are we there yet?
The Future of Research Communications and e-Scholarship: Are we there yet?
 
Social information Access2012
Social information Access2012Social information Access2012
Social information Access2012
 
SampleLiteratureReviewTemplate_IVBTechIISEM_MajorProject.pptx
SampleLiteratureReviewTemplate_IVBTechIISEM_MajorProject.pptxSampleLiteratureReviewTemplate_IVBTechIISEM_MajorProject.pptx
SampleLiteratureReviewTemplate_IVBTechIISEM_MajorProject.pptx
 
Empirical user studies in Semantic Web contexts
Empirical user studies in Semantic Web contextsEmpirical user studies in Semantic Web contexts
Empirical user studies in Semantic Web contexts
 
Influencing the MOOC agenda - analysis of #MOOC Twitter Data
Influencing the MOOC agenda - analysis of #MOOC Twitter Data  Influencing the MOOC agenda - analysis of #MOOC Twitter Data
Influencing the MOOC agenda - analysis of #MOOC Twitter Data
 
Social Multimedia as Sensors
Social Multimedia as SensorsSocial Multimedia as Sensors
Social Multimedia as Sensors
 
Social media and researchers: Josipa Crnic Deakin University
Social media and researchers: Josipa Crnic Deakin University Social media and researchers: Josipa Crnic Deakin University
Social media and researchers: Josipa Crnic Deakin University
 
Global Redirective Practices: an online workshop for a client
Global Redirective Practices: an online workshop for a clientGlobal Redirective Practices: an online workshop for a client
Global Redirective Practices: an online workshop for a client
 
Stabilization of Black Cotton Soil with Red Mud and Formulation of Linear Reg...
Stabilization of Black Cotton Soil with Red Mud and Formulation of Linear Reg...Stabilization of Black Cotton Soil with Red Mud and Formulation of Linear Reg...
Stabilization of Black Cotton Soil with Red Mud and Formulation of Linear Reg...
 
An Ensemble Model for Cross-Domain Polarity Classification on Twitter
An Ensemble Model for Cross-Domain Polarity Classification on TwitterAn Ensemble Model for Cross-Domain Polarity Classification on Twitter
An Ensemble Model for Cross-Domain Polarity Classification on Twitter
 
Social Media Analytics with a pinch of semantics
Social Media Analytics with a pinch of semanticsSocial Media Analytics with a pinch of semantics
Social Media Analytics with a pinch of semantics
 

More from Miriam Fernandez

Biases in Social Media Research (NoBias EU project)
Biases in Social Media Research (NoBias EU project)Biases in Social Media Research (NoBias EU project)
Biases in Social Media Research (NoBias EU project)Miriam Fernandez
 
Research seminar Queen Mary University of London (CogSci)
Research seminar Queen Mary University of London (CogSci)Research seminar Queen Mary University of London (CogSci)
Research seminar Queen Mary University of London (CogSci)Miriam Fernandez
 
Vision track october_2020_fernandez_v5
Vision track october_2020_fernandez_v5Vision track october_2020_fernandez_v5
Vision track october_2020_fernandez_v5Miriam Fernandez
 
On the Application of Social Data Science to Address Societal Challenges
On the Application of Social Data Science to Address Societal ChallengesOn the Application of Social Data Science to Address Societal Challenges
On the Application of Social Data Science to Address Societal ChallengesMiriam Fernandez
 
Online radicalisation: work, challenges and future directions
Online radicalisation: work, challenges and future directionsOnline radicalisation: work, challenges and future directions
Online radicalisation: work, challenges and future directionsMiriam Fernandez
 
Mining Social Media Data For Policing
Mining Social Media Data For PolicingMining Social Media Data For Policing
Mining Social Media Data For PolicingMiriam Fernandez
 
Introduction to Mining Social Media Data
Introduction to Mining Social Media DataIntroduction to Mining Social Media Data
Introduction to Mining Social Media DataMiriam Fernandez
 
Online Misinformation: Challenges and Future Directions
Online Misinformation: Challenges and Future DirectionsOnline Misinformation: Challenges and Future Directions
Online Misinformation: Challenges and Future DirectionsMiriam Fernandez
 
Slides 28-feb-2018-v2.pptx
Slides 28-feb-2018-v2.pptxSlides 28-feb-2018-v2.pptx
Slides 28-feb-2018-v2.pptxMiriam Fernandez
 
Artificial Intelligence for Policing
Artificial Intelligence for PolicingArtificial Intelligence for Policing
Artificial Intelligence for PolicingMiriam Fernandez
 
OUSocial OUSocMed conference
OUSocial OUSocMed conference OUSocial OUSocMed conference
OUSocial OUSocMed conference Miriam Fernandez
 
On the use of social media for evidence-based policing
On the use of social media for evidence-based policingOn the use of social media for evidence-based policing
On the use of social media for evidence-based policingMiriam Fernandez
 
ECSM2014: Using Social Media To Inform Policy Making: To whom are we listenin...
ECSM2014: Using Social Media To Inform Policy Making: To whom are we listenin...ECSM2014: Using Social Media To Inform Policy Making: To whom are we listenin...
ECSM2014: Using Social Media To Inform Policy Making: To whom are we listenin...Miriam Fernandez
 
ESWC 2014 Tutorial Handson 1: Collect Data from Facebook
ESWC 2014 Tutorial Handson 1: Collect Data from FacebookESWC 2014 Tutorial Handson 1: Collect Data from Facebook
ESWC 2014 Tutorial Handson 1: Collect Data from FacebookMiriam Fernandez
 
Wm unit1.6-slides-semantic web-final
Wm unit1.6-slides-semantic web-finalWm unit1.6-slides-semantic web-final
Wm unit1.6-slides-semantic web-finalMiriam Fernandez
 
Iswc 2011: Linking Data Across Universities: An Integrated Video Lectures Dat...
Iswc 2011: Linking Data Across Universities: An Integrated Video Lectures Dat...Iswc 2011: Linking Data Across Universities: An Integrated Video Lectures Dat...
Iswc 2011: Linking Data Across Universities: An Integrated Video Lectures Dat...Miriam Fernandez
 

More from Miriam Fernandez (16)

Biases in Social Media Research (NoBias EU project)
Biases in Social Media Research (NoBias EU project)Biases in Social Media Research (NoBias EU project)
Biases in Social Media Research (NoBias EU project)
 
Research seminar Queen Mary University of London (CogSci)
Research seminar Queen Mary University of London (CogSci)Research seminar Queen Mary University of London (CogSci)
Research seminar Queen Mary University of London (CogSci)
 
Vision track october_2020_fernandez_v5
Vision track october_2020_fernandez_v5Vision track october_2020_fernandez_v5
Vision track october_2020_fernandez_v5
 
On the Application of Social Data Science to Address Societal Challenges
On the Application of Social Data Science to Address Societal ChallengesOn the Application of Social Data Science to Address Societal Challenges
On the Application of Social Data Science to Address Societal Challenges
 
Online radicalisation: work, challenges and future directions
Online radicalisation: work, challenges and future directionsOnline radicalisation: work, challenges and future directions
Online radicalisation: work, challenges and future directions
 
Mining Social Media Data For Policing
Mining Social Media Data For PolicingMining Social Media Data For Policing
Mining Social Media Data For Policing
 
Introduction to Mining Social Media Data
Introduction to Mining Social Media DataIntroduction to Mining Social Media Data
Introduction to Mining Social Media Data
 
Online Misinformation: Challenges and Future Directions
Online Misinformation: Challenges and Future DirectionsOnline Misinformation: Challenges and Future Directions
Online Misinformation: Challenges and Future Directions
 
Slides 28-feb-2018-v2.pptx
Slides 28-feb-2018-v2.pptxSlides 28-feb-2018-v2.pptx
Slides 28-feb-2018-v2.pptx
 
Artificial Intelligence for Policing
Artificial Intelligence for PolicingArtificial Intelligence for Policing
Artificial Intelligence for Policing
 
OUSocial OUSocMed conference
OUSocial OUSocMed conference OUSocial OUSocMed conference
OUSocial OUSocMed conference
 
On the use of social media for evidence-based policing
On the use of social media for evidence-based policingOn the use of social media for evidence-based policing
On the use of social media for evidence-based policing
 
ECSM2014: Using Social Media To Inform Policy Making: To whom are we listenin...
ECSM2014: Using Social Media To Inform Policy Making: To whom are we listenin...ECSM2014: Using Social Media To Inform Policy Making: To whom are we listenin...
ECSM2014: Using Social Media To Inform Policy Making: To whom are we listenin...
 
ESWC 2014 Tutorial Handson 1: Collect Data from Facebook
ESWC 2014 Tutorial Handson 1: Collect Data from FacebookESWC 2014 Tutorial Handson 1: Collect Data from Facebook
ESWC 2014 Tutorial Handson 1: Collect Data from Facebook
 
Wm unit1.6-slides-semantic web-final
Wm unit1.6-slides-semantic web-finalWm unit1.6-slides-semantic web-final
Wm unit1.6-slides-semantic web-final
 
Iswc 2011: Linking Data Across Universities: An Integrated Video Lectures Dat...
Iswc 2011: Linking Data Across Universities: An Integrated Video Lectures Dat...Iswc 2011: Linking Data Across Universities: An Integrated Video Lectures Dat...
Iswc 2011: Linking Data Across Universities: An Integrated Video Lectures Dat...
 

Recently uploaded

Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersNicole Novielli
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsNathaniel Shimoni
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxLoriGlavin3
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demoHarshalMandlekar2
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxBkGupta21
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 

Recently uploaded (20)

Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software Developers
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directions
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demo
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptx
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 

ESWC 2014 Tutorial Part 4

  • 1. Social Web: Where are the Semantics? ESWC 2014 Miriam Fernández, Victor Rodríguez, Andrés García-Silva, Oscar Corcho Ontology Engineering Group, UPM, Spain Knowledge Media Institute, The Open University
  • 2. Outline 2 •  Part 1: Understanding Social Media –  Theory: background & applications described in this tutorial –  Hands on: data extraction from Twitter and Facebook •  Part 2: Using semantics to represent data from SNS –  Theory: Using SW to represent content, users and relations –  Hands on: applying and extending SIOC •  Part 3: Using semantics to understand social media conversations –  Theory: Using semantics to understand topics in social media –  Hands on: using LDA to extract topics from social media •  Part 4: Using semantics to understand user behaviour
  • 3. Implicit vs. Explicit Semantics •  Implicit Semantics –  Implicit, also called statistical semantics, focus on extracting word sense by studying the patterns of human word usage in massive collections of text or other human generated data –  It does not rely on an explicit formalisation/conceptualisation of knowledge •  Explicit Semantics –  Explicit semantics focus on the analysis of content by using the support of explicit conceptualisations in the form of ontologies and knowledge bases ESWC 2014 Social Web: Where are the Semantics? 3
  • 4. Explicit Semantics Structured Unstructured From the Web of human generated content The Web of unstructured text (Posts / Documents) and Links To the Web of machine understandable content The Web of Objects and Relations
  • 5. •  The annotators extract entities (classes / individuals) and relations from the text and link them to object URIs Obtaining explicit semantics from social media content
  • 6. Using Semantics to Analyse Topic Evolution •  LDA topics are identified by a set of keywords –  Difficult to assess their meaning and evolution •  Use explicit semantics to characterise topics as concrete entities 6
  • 7. ! ! Using Semantics to Analyse Topic Evolution ESWC 2014 Social Web: Where are the Semantics? 7 •  Analyse concepts appearance –  Within a group –  Across groups –  Over time •  Type filtering •  Interlinking with other datasets (data.open.ac.uk)
  • 8. Using Semantics To Analyse Sentiment •  Sentiment analysis on social media –  Offers a fast and cheap access to publics’ feelings towards brands, business, people, etc. –  Comes with additional challenges –  Current approaches •  Lexical-based •  Machine Learning –  Explicit semantics are often neglected ESWC 2014 Social Web: Where are the Semantics? 8
  • 9. Using Semantics to Analyse Sentiment •  Add semantics as additional features into the training set •  Results –  Incorporating semantics increases accuracy by 6.5% for negative sentiment and by 4.8% for positive sentiment –  The use of explicit semantics is more appropriate when the datasets being analysed are large and cover a wide range of topics Saif, Hassan, He, Yulan, Alani, Harith (2012). Semantic sentiment analysis of twitter. In: 11th International Semantic Web Conference (ISWC 2012)
  • 10. “Words that occur in similar context tend to have similar meaning” Wittgenstein (1953) Using Semantics To Analyse Sentiment •  SentiCircles –  Integrates implicit and explicit semantics to analyse sentiment –  Outperforms other lexicon labeling methods and overtakes the state-of-the- art SentiStrength approach in accuracy, with a marginal drop in F-measure ESWC 2014 Social Web: Where are the Semantics? 10 Saif, Hassan, Fernandez, Miriam, He, Yulan, Alani, Harith (2014). SentiCircles for Tweet-level Sentiment Analysis (ESWC 2014) -> conference presentation on the 27, 14:00!!
  • 11. Using Semantics To Analyse Sentiment ESWC 2014 Social Web: Where are the Semantics? 11
  • 12. Using Semantics to Analyse User Behaviour •  Goal –  Monitor and capture member activities –  Analyse emerging behaviour over time –  Understand the correlation of behaviour with community evolution •  Approach –  Identify behavioural features and behaviour roles –  Create an ontology to model behavioural roles and behaviour features –  Use semantic rules to infer user roles in online communities –  Study role composition patterns ESWC 2014 Social Web: Where are the Semantics? 12 Angeletou, S., Rowe, M. and Alani, H. (2011) Modelling and Analysis of User Behaviour in Online Communities, 10th International Semantic Web Conference (ISWC 2011), Bonn, Germany Rowe, Matthew; Fernandez, Miriam; Angeletou, Sofia and Alani, Harith (2013). Community analysis through semantic rules and role composition derivation. Journal of Web Semantics: Science, Services and Agents on the World Wide Web, 18(1) pp. 31–47
  • 13. Behavioural roles and features ESWC 2014 Social Web: Where are the Semantics? 13 Table 1. Roles and the feature-to-level mappings Role Feature Level Elitist In-Degree Ratio low Bi-directional Threads Ratio high Bi-directional Neighbours Ratio low Grunt Bi-directional Threads Ratio med Bi-directional Neighbours Ratio med Average Posts per Thread low STD of Posts per Thread low Joining Conversationalist Thread Initiation Ratio low Average Posts per Thread high STD of Posts per Thread high Popular Initiator In-Degree Ratio high Thread Initiation Ratio high Popular Participants In-Degree Ratio high Thread Initiation Ratio low Average Posts per Thread med STD of Posts per Thread med Supporter In-Degree Ratio med Bi-directional Threads Ratio med Bi-directional Neighbours Ratio med Taciturn Bi-directional Threads Ratio low Bi-directional Neighbours Ratio low Average Posts per Thread low STD of Posts per Thread low Ignored Posts Replied Ratio low Jeffrey Chan, Conor Hayes, and Elizabeth Daly. Decomposing discussion forums using common user roles. In Proc. Web Science Conf. (WebSci10), Raleigh, NC: US, 2010.
  • 14. Modelling user features and interactions ESWC 2014 Social Web: Where are the Semantics? 14 http://purl.org/net/oubo/0.3•  OUBO: The OU Behaviour Ontology
  • 15. Encoding Rules in Ontologies with SPIN ESWC 2014 Social Web: Where are the Semantics? 15
  • 16. Apply rules to infer user roles over time ESWC 2014 Social Web: Where are the Semantics? 16 1.- Construct features for community users at a given time step 2.- Derive bings using equal frequency binning Popularity-low cutoff = 0.5 Initiation-high cutoff = 0.4 3.- Use skeleton rule base to construct rules using bin levels Popularity=low, Initiation=high ->roleA Popularity<0.5, Initiation > 0.4 -> roleA 4.- Apply rules to infer user roles and community composition 5.- Repeat 1-4 following time steps
  • 17. Analyse the role composition of the community ESWC 2014 Social Web: Where are the Semantics? 17 •  Investigate the correlation between the role composition and the students’ performance
  • 18. Analyse the role composition of the community •  Allow Policy Makers to focus on a smaller set of users, with whom they may want to engage more closely ESWC 2014 Social Web: Where are the Semantics? 18
  • 19. Analyse the role composition of the community •  Development of models to predict community health based on role compositions and evolution of user behaviour –  Health Indicators •  Churn Rate: proportion of users who leave the network in a given time segment •  User Count: number of users who posted at least once •  Seeds / Non seeds: proportion of posts that get responses vs. those that don’t •  Clustering coefficient: measures the cohesion within the network –  Results •  Accurate detection of community health is possible using role composition information •  There is no “one size fits all” model ESWC 2014 Social Web: Where are the Semantics? 19 Rowe, M. and Alani, H. (2012) What Makes Communities Tick? Community Health Analysis using Role compositions. International Conference on Social Computing, 2012 0.0 0.2 0.4 0.6 0.8 1.0 0.00.20.40.60.81.0 Churn Rate FPR TPR 0.0 0.2 0.4 0.6 0.8 1.0 0.00.20.40.60.81.0 User Count FPR TPR 0.0 0.2 0.4 0.6 0.8 1.0 0.00.20.40.60.81.0 Seeds / Non−seeds Prop FPR TPR 0.0 0.2 0.4 0.6 0.8 1.0 0.00.20.40.60.81.0 Clustering Coefficient FPR TPR
  • 20. Challenges: How would you address them? •  Scalability –  Communities exceed millions of users –  Infrastructures must support hundreds of millions discussion threads •  Growth (real-time analysis) –  Speed of new incoming data / stream processing •  Concept vs. keyword based data acquisition/pre-processing –  How to filter certain tags? –  Which new topics emerge? –  How topics evolve over time? –  Authorship in social media, who copies who? •  Multilingualism –  We all speak different languages •  Understanding the user and acting accordingly –  We all have different personalities, behaviours and preferences ESWC 2014 Social Web: Where are the Semantics? 20