SlideShare une entreprise Scribd logo
1  sur  67
CS6010 - Social Network Analysis
Unit III – Extraction and Mining
Communities in Web Social Networks
Kaviya.P
AP/IT
Kamaraj College of Engineering & Technology
1
Unit III – Extraction and Mining
Communities in Web Social Networks
Extracting evolution of Web Community from a Series of Web Archive
Detecting communities in social networks - Definition of community -
Evaluating communities - Methods for community detection and mining -
Applications of community mining algorithms - Tools for detecting
communities social network infrastructures and communities -
Decentralized online social networks - Multi-Relational characterization of
dynamic social network communities.
2
Extracting Evolution of Web Community
from a Series of Web Archive
3
Extracting Evolution of Web Community from a
Series of Web Archive
• The extraction of Web community utilizes Web community chart
– A graph of communities, in which related communities are connected by
weighted edges.
• The main advantage of the Web community chart is existence of relevance between
communities.
• Notations Used
– t1, t2, ..., tn: Time when each archive crawled. Currently, a month is used as the
unit time.
– W(tk): The Web archive at time tk.
– C(tk): The Web community chart at time tk.
– c(tk), d(tk), e(tk), ...: Communities in C(tk).
4
Extracting evolution of Web Community from a
Series of Web Archive
Types of Changes
• Emerge
– A community c(tk) emerges in C(tk), when c(tk) shares no URLs with any
community in C(tk−1).
• Dissolve
– A community c(tk−1) in C(tk) has dissolved, when c(tk−1) shares no URLs
with any community in C(tk)
• Growth and Shrink
– The community grows when new URLs are appeared in c(tk), and shrinks
when URLs disappeared from c(tk−1).
• Split
– c(tk−1) shares URLs with multiple communities in C(tk)
• Merge
– When multiple communities (c(tk−1)), d(tk−1), ...) share URLs with a single
community e(tk), these communities are merged into e(tk)
5
Extracting evolution of Web Community from a
Series of Web Archive
Evolution Metrics
• Evolution metrics measure how a particular community c(tk) has evolved.
• The metrics are defined by differences between c(tk) and its corresponding
community c(tk−1).
• Attributes Used:
– N(c(tk)): Number of URLs in the c(tk).
– Nsh(c(tk−1), c(tk)): Number of URLs shared by c(tk−1) and c(tk).
– Ndis(c(tk−1)): Number of disappeared URLs from c(tk−1) that exist in c(tk−1)
but do not exist in any community in C(tk)).
– Nsp(c(tk1), c(tk)): Number of URLs split from c(tk−1) to communities at tk
other than c(tk).
– Nap(c(tk)): Number of newly appeared URLs in c(tk)) that exist in c(tk) but do
not exist in any community C(tk−1).
– Nmg(c(tk−1), c(tk)): Number of URLs merged into c(tk)) from communities at
tk−1 other than c(tk−1).
6
Extracting evolution of Web Community from a
Series of Web Archive
Evolution Metrics
• Evolution metrics measure how a particular community c(tk) has evolved.
• Growth Rate
– The growth rate, Rgrow(c(tk−1), c(tk)), represents the increase of URLs per unit time.
– It allows us to find most growing or shrinking communities.
• Stability
– Represents the amount of disappeared, appeared, merged and split URLs per unit time.
– A stable community on a topic is the best starting point for finding interesting changes
around the topic.
7
Extracting evolution of Web Community from a
Series of Web Archive
Evolution Metrics
• Disappearance Rate
– The number of disappeared URLs from c(tk−1) per unit time.
– Higher disappear rate means that the community has lost URLs mainly by disappearance.
• Merge Rate
– The number of absorbed URLs from other communities by merging per unit time.
– Higher merge rate means that the community has obtained URLs mainly by merging.
8
Extracting evolution of Web Community from a
Series of Web Archive
Evolution Metrics
• Split Rate
– The split rate, Rsplit (c(tk−1), c(tk)), is the number of split URLs from c(tk−1) per unit
time.
– When the split rate is low, c(tk) is larger than other split communities. Otherwise, c(tk) is
smaller than other split communities.
• Other Metrics
– The novelty metrics of a main line (c(ti), c(ti+1), ..., c(t j)) is calculated as follows.
9
Extracting evolution of Web Community from a
Series of Web Archive
Evolution Metrics
• By combining these metrics, some complex evolution patterns can be represented.
– A community has stably grown when its growth rate is positive, and its
disappearance and split rates are low.
– Similar evolution patterns can be defined for shrinkage.
10
Extracting evolution of Web Community from a
Series of Web Archive
Web Archives and Graphs
• Web archiving is the process of collecting portions of the Web to ensure the
information is preserved in an archive
• Web crawlers are used for automated capture due to the massive size and amount of
information on the Web.
• From each archive, a Web graph is built with URLs and links by extracting anchors
from all pages in the archive.
• The graph included not only URLs inside the archive, but also URLs outside pointed
to by inside URLs.
• By comparing these graphs, the Web was extremely dynamic.
11
Extracting evolution of Web Community from a
Series of Web Archive
Evolution of Web Community Charts
• The size distribution of communities also follows the power law and its
exponent did not change so much over time.
• Although the size distribution of communities is stable, the structure of
communities changes dynamically.
• The structure of the chart changes mainly by split and merge, in which more
than half of communities are involved.
• Split and Merged Communities
– Both distributions roughly follow the power law, and show that split or
merge rate is small in most cases.
– Their shapes and scales are also similar.
– This symmetry is part of the reason why the size distribution of
communities does not change so much.
12
Extracting evolution of Web Community from a
Series of Web Archive
Evolution of Web Community Charts
• Emerged and Dissolved Communities
– The size distributions of emerged and dissolved communities also follow the
power law
– Contribute to preserve the size distribution of communities.
– Small communities are easy to emerge and dissolve.
• Growth Rate
– The growth rate is small for most of communities, and the graph has clear y-axis
symmetry.
– Size distribution of communities is preserved over time.
13
Detecting communities in social networks
Definition of community - Evaluating
Communities
14
Detecting communities in social networks
Detecting communities from given social networks are practically important for the
following reasons:
• Communities can be used for information recommendation because members of
the communities often have similar tastes and preferences. Membership of detected
communities will be the basis of collaborative filtering.
• Communities will help us understand the structures of given social networks.
Communities are regarded as components of given social networks, and they will
clarify the functions and properties of the networks.
• Communities will play important roles when we visualize large-scale social
networks. Relations of the communities clarify the processes of information sharing
and information diffusions, and they may give us some insights for the growth the
networks in the future.
15
Detecting communities in social networks
Definition of community
• “Community” means a subnetwork whose edges connecting inside of it are
denser than the edges connecting outside of it .
• Three categories:
– Local definitions
– Global definitions
– Definitions based on vertex similarity.
16
Detecting communities in social networks
Definition of community
Local definitions
• The attention is focused on the vertices of the subnetwork under investigation and on
its immediate neighborhood.
• Local definitions of community can be further divided into self-referring ones and
comparative ones.
• The former considers the subnetwork alone, and the latter compares mutual
connections of the vertices of the subnetwork with the connections with external
neighbors.
• Self referring definitions are clique (a maximal subnetworks where each vertex is
adjacent to all the others), n-clique (a maximal subnetwork such that the distance of
each pair of vertices is not larger than n), and k-plex (a maximal subnetwork such that
each vertex is adjacent to all the others except at most k of them).
17
Detecting communities in social networks
Definition of community
Local definitions
• Comparative definitions are LS set (a subnetwork where each vertex has more neighbors inside
than outside of the subnetwork), and weak community (the total degrees of the vertices inside
the community exceeds the number of edges lying between the community and the rest of the
network).
Global Definitions
• Global definitions of community characterize a subnetwork with respect to the network as a
whole.
• These definitions usually starts from a null model, in another words, a network which matches
the original network in some of its topological features, but which does not display community
structure.
• The most popular null model consists of a randomized version of the original network, where
edges are rewired at random under the constraint that each vertex keeps its degree. This null
model is the basic concept behind the definition of modularity.
18
Detecting communities in social networks
Definition of community
Definitions based on vertex similarity
• Communities are groups of vertices which are similar to each other.
• Some quantitative criterion is employed to evaluate the similarity between each pair
of vertices.
• Hierarchical clustering is a way to find several layers of communities that are
composed of vertices similar to each other.
• Repetitive merges of similar vertices based on some quantitative similarity measures
will generate a structure shown in Fig.
• This structure is called dendrogram, and highly similar vertices are connected in the
lower part of the dendrogram.
• Subtrees obtained by cutting the dendrogram with horizontal line correspond to
communities.
• Communities of different granurality will be obtained by changing the position of the
horizontal line
19
Detecting communities in social networks
Evaluating Communities
• It is necessary to establish which partition exhibit a real community structure.
Therefore, a quality function for evaluating how good a partition is needed.
• The most popular quality function is the modularity of Newman and Girivan:
Where A is the adjacency matrix, ki is the degree of vertex i , m is the total number
of edges of the network.
• Modularity can be rewritten as follows:
Where nm is the number of communities, ls is the total number of edges joining
vertices of community s, ds is the sum of the degrees of the vertices of s
20
Detecting communities in social networks
Evaluating Communities
Meaning of modularity
• Subnetwork is a community if the number of edges inside it is larger than the expected
number in modularity’s null model
21
Methods for Community Detection
and Mining
22
Methods for Community Detection and Mining
• Native methods for dividing given networks into subnetworks, such as graph
partitioning, hierarchical clustering, and k-means clustering.
• However, these methods needs to provide the numbers of clusters or their
size.
• It is desirable to devise methods that have abilities of extracting a complete
information about the community structure of networks.
• The methods for detecting communities are roughly classified into the
following categories: (1) divisive algorithms, (2) modularity optimization, (3)
spectral algorithms, and (4) other algorithms.
23
Methods for Community Detection and Mining
Divisive algorithms
• A simple way to identify communities in a network is to detect the edges that
connect vertices of different communities and remove them, so that the
communities get disconnected from each other.
• The most popular algorithm is that proposed by Girvan and Newman.
• In this algorithm, edges are selected according to the values of measures of edge
centrality, estimating the importance of edges according to some property on the
network.
• The steps of the algorithm are as follows:
– (1) Computation of the centrality of all edges,
– (2) Removal of edge with largest centrality,
– (3) Recalculation of centralities on the running network
– (4) Iteration of the cycle from step (2).
24
Methods for Community Detection and Mining
Divisive algorithms
• Girvan and Newman focuses on the concept of edge betweenness.
• Edge betweenness is the number of shortest paths between all vertex pairs
that run along the edge.
25
Methods for Community Detection and Mining
Modularity Optimization
• Modularity is a quality function for evaluating partitions.
• Therefore, the partition corresponding to its maximum value on a given network
should be the best one. This is the main idea for modularity optimization.
• It has been proved that modularity optimization is an NP-hard problem.
• However, there are currently several algorithms that are able to find fairly good
approximations of the modularity maximum in a reasonable time.
• One of the famous algorithms for modularity optimization is CNM algorithm.
• Another examples of the algorithms are greedy algorithms and simulated
annealing.
26
Methods for Community Detection and Mining
Spectral Algorithms
• Spectral algorithms are to cut given network into pieces so that the number of
edges to be cut will be minimized.
• One of the basic algorithm is spectral graph bipartitioning.
• The Laplacian matrix L of a network is an n * n symmetric matrix, with one
row and column for each vertex.
• Laplacian matrix is defined as L =D - A , where A is the adjacency matrix and
D is the diagonal degree matrix with
27
Methods for Community Detection and Mining
Spectral Algorithms
• All eigenvalues of L are real and non-negative, and L has a full set of n real
and orthogonal eigenvectors.
• In order to minimize the above cut, vertices are partitioned based on the signs
of the eigenvector that corresponds to the second smallest eigenvalue of L.
• In general, community detection based on repetative bipartitioning is
relatively fast.
28
Methods for Community Detection and Mining
Other Algorithms
• There are many other algorithms for detecting communities, such as the
methods focusing on random walk, and the ones searching for overlapping
cliques.
• Danon compares the computational costs and their accuracies of major
community detection methods.
29
Applications of Community Mining
Algorithms
30
Applications of Community Mining Algorithms
• Network Reduction
• Discovering Scientific Collaboration Groups from Social
Networks
• Mining Communities from Distributed and Dynamic Networks
31
Applications of Community Mining Algorithms
1. Network Reduction
• Network reduction is an important step in analyzing social networks.
• Example
– The bibliography contains 360 papers written by 314 authors.
• Community structure is detected using a community mining
algorithm, called ICS.
• Most of the detected communities are self-connected components.
32
Applications of Community Mining Algorithms
1. Network Reduction
33
The bibliography network for the book entitled
“graph products: structure and recognition”
Applications of Community Mining Algorithms
1. Network Reduction
34
The community structure of the bibliography
network as detected using ICS
Applications of Community Mining Algorithms
1. Network Reduction
35
The coauthor network corresponding to the
biggest component
Applications of Community Mining Algorithms
1. Network Reduction - The reduction of a coauthor network
• The clustered coauthor network can be reduced into a much smaller one by
condensing each community as one node.
• The top-level condensed network corresponding to a 3-community structure
is constructed by using ICS from the condensed network.
• In this way, a dendrogram corresponding to the original coauthor network can
be built.
• The dendrogram of the coauthor network
36
Applications of Community Mining Algorithms
1. Network Reduction
37
The reduction of a coauthor – The community structure of the network,
The condensed network, The top-level condensed network
Applications of Community Mining Algorithms
2. Discovering Scientific Collaboration Groups from Social Networks
• Flink is a social network that describes the scientific collaborations among
681 semantic Web researchers.
• From the perspective of social network analysis, one may be especially
interested in such questions as:
1. Among all researchers, which ones would more likely to collaborate
with each other?
2. What are the main reasons that bind them together?
• Apply the community mining techniques.
38
Applications of Community Mining Algorithms
2. Discovering Scientific Collaboration Groups from Social Networks -
Mining a scientific collaboration network
39
Applications of Community Mining Algorithms
2. Discovering Scientific Collaboration Groups from Social Networks –
Results Observed
• The self-organized communities would provide the answer to the first
question.
• By referring to them, one can know the specific collaboration activities
among these researchers.
• After manually checking the profiles of members within different
communities, an interesting fact has been confirmed.
– Most of communities are organized according to the locations or the
research interests of their respective members.
• Answers the second question, i.e., researchers in adjacent locations and with
common interests prefer to intensively collaborate with each other. 40
Applications of Community Mining Algorithms
3.Mining Communities from Distributed and Dynamic Networks
• Applications involve distributed and dynamically-evolving networks.
– Resources and controls are not only decentralized but also updated
frequently.
• We need find a way to solve a more challenging NCMP.
– To adaptively mine hidden communities from distributed and dynamic
networks.
• Solution is based on an Autonomy-Oriented Computing (AOC) approach.
41
Applications of Community Mining Algorithms
3.Mining Communities from Distributed and Dynamic Networks
• In AOC approach, a group of self-organizing agents are utilized.
• The agents will rely only on their locally acquired information about
networks.
• Example: In Intelligent Portable Digital Assistants
– Selecting individuals who have frequently contacted or been contacted
with the user during a certain period of time
– Taking the selected individuals as the input to an AOC-based algorithm
– Ranking and recommending new persons
42
Tools for Detecting Communities in
Social Network
43
Tools for Detecting Communities in Social Network
• Several tools have been developed for detecting communities.
• These are roughly classified into the following two categories:
– Detecting communities from large-scale networks
– Interactively analyzing communities from small networks
44
Tools for Detecting Communities in Social Network
Interactively analyzing communities from small networks
• JUNG - http://jung.sourceforge.net/
• Netminer - http://www.netminer.com/NetMiner/overview 01.jsp
• Pajek - http://vlado.fmf.unilj.si/pub/networks/pajek/
• igraph - http://igraph.sourceforge.net/ Demo - igraph
• SONIVIS - http://www.sonivis.org/
• Commetrix - http://www.commetrix.de/
• NetworkWorkbench - http://nwb.slis.indiana.edu/
• visone - http://visone.info/
• Cfinder - http://www.cfinder.org/
45
Social Network Infrastructures and Communities
Decentralized Online Social Networks
46
Decentralized Online Social Networks
Online social network (OSN) is an online platform that
• Provides services for a user to build a public profile and to explicitly
declare the connection between his or her profile with those of the other
users;
• Enables a user to share information and content with the chosen users or
public; and
• Supports the development and usage of social applications with which the
user can interact and collaborate with both friends and strangers.
47
Decentralized Online Social Networks
• Current online social networks are extended in two main directions towards
the capabilities of the provided services and the decentralization of the
supporting infrastructures,
48
Decentralized Online Social Networks
• Disadvantages of Centralized Management of a Social Network
– Performance scalability issues,
– Frequent down-time, slowness and unresponsiveness
– Increasing cost of management and maintaining the infrastructures
– Lack of proper privacy preserving schemes
• A decentralized online social network is an online social network
implemented on a distributed information management platform, such as a
network of trusted servers or a peer-to-peer systems.
• In contrast to centralized OSNs where the vendor bears all the cost in
providing the services, a distributed or peer-to-peer OSN offers a cost-
effective alternative.
49
Decentralized Online Social Networks
Challenges for DOSNs
• Storage - Where should content be stored? Should they be stored exclusively at nodes run
by friends, or be encrypted and stored at random nodes?. The requirement for redundancy
to provide availability of data depends to a large extent on the duration and distribution of
time peers are online.
• Updates - How can we deal with updates, e.g., status updates of friends?. In peer
collaboration systems, updates, e.g., of a workplace, are sent to a small group of peers via a
decentralized synchronization mechanism.
• Topology - In pure file-sharing networks, the topology does not depend on whether the
peers know each other and nodes exchange content with any other nodes in the network. At
the other end of the spectrum, for collaboration or media sharing, they tend to consist of
collaborative groups that are relatively closed circles, e.g., using a “ring of trust” or
darknets. In contrast, online social networking services have overlapping circles.
50
Decentralized Online Social Networks
Challenges for DOSNs
• Search, Addressing - One just needs to find some peer with the content it is looking
for. Traditionally, peer identity is tied with an IP address which clearly is not
sufficient.
• Openness to new applications - One of the most alluring features of current online
social networks is that they are open to third-party applications, which enables a
constant change of what a social networking service provides to the users. In a
decentralized environment, if some users choose to enable a third-party application,
their choice should not affect other users or even users connected directly to them.
• Security - For distributed storage with other peers that the user not necessarily wants
to access data, the content has to be encrypted. To manage access to encrypted data,
key distribution and maintenance have to be handled such that the social network
group can access data.
51
Decentralized Online Social Networks
Challenges for DOSNs
• Robustness - In a centralized system, one can turn to the provider in case of user misbehavior,
there is usually a process defined for dealing with such complaints. In a decentralized system,
there is no authority that can ban users for misbehavior or remove content. Robustness against
free-riding. Once access to content is granted, it is difficult to revoke that right. When a user
allows a friend to see a message, the friend can store the message and keep access to it even after
a change of key. Trust has to be at least equal to assigned access rights, due to this difficulty.
• Limited Peers - Two-tier system is that users can then participate in the social network with
resource constrained (e.g., mobile) devices, which they may use as an auxiliary, even when they
contribute resources to the core of the system with their primary device.
• Locality - A distributed architecture also enables us to take advantage of geographic proximity
and its correlation with local interests. The routers provide availability of data of local interest.
This local interest can arise from the locality of events but also from the locality of typical real-
life social networks of friends and neighbors.
52
Decentralized Online Social Networks
General Purpose DOSNs
53
Decentralized Online Social Networks
General Purpose DOSNs
• The reference architecture consists of six layers and provides an architectural
abstraction of variety of current related approaches to decentralized social
networking in the research literature.
• Physical communication network, which can be the Internet or a (mobile)
ad hoc network (in case we consider a mobile online social network).
• Distributed or P2P overlay management provides core functionalities to
manage resources in the supporting infrastructure of the system, which can be
a distributed network of trusted servers or a P2P overlay. Specifically, this
layer provides higher layers the capabilities of looking up resources, routing
messages, and retrieving information reliably and effectively among nodes in
the overlay. 54
Decentralized Online Social Networks
General Purpose DOSNs
• Decentralized data management layer, which implements functionalities of
a distributed or peer-to-peer information system to query, insert, and update
various persistent objects to the systems.
• The capability to search the system (Distributed search) for relevant
information, the management of users and shared space (User account and
share space management), the management of security and access control
issues (Trust management, Access control and security), the coordination
and management of social applications developed by third parties
(Application management).
55
Decentralized Online Social Networks
General Purpose DOSNs
• It is expected that the social networking layer exposes and implements an
application programming interface (API) to support the development of
new applications as well as to enable the customization of the social network
service to suit various preferences of the user.
• The top layer of the architecture includes the user interface to the system and
various applications built on top of the development platform provided by the
DOSN.
• Applications can be either implemented by the DOSN provider or developed
by third-parties, and can be installed or removed from the system according
to user’s preferences.
56
Decentralized Online Social Networks
Proposed DOSN Approaches
• Safebook, an approach to provide a decentralized general purpose OSN,
follows the main objective of protecting its users’ privacy. It considers adverse or
erroneous behavior of a centralized service provider, possible adversaries which
are misusing the functions of the social networking service, as well as external
adversaries that could eavesdrop or modify data on the networking layer.
• FOAF presents another practical approach to decentralize management of
current social networks. The framework enables users to export their FOAF4
profiles, store them on dedicated trusted servers. Users query and manage these
profiles through open Web-based protocols such as WebDAV5 or
SPARQL/Update.
57
Decentralized Online Social Networks
Proposed DOSN Approaches
• NEPOMUK7 is an on-going EU project with close relation to DOSN. The goal of
NEPOMUK is to develop a middleware for sharing users’ desktops with friends for
online collaborations and sharing of knowledge by exploiting Semantic Web
technologies.
• LifeSocial primarily aims at keeping social networking services scalable by
distributing the load to their users’ resources. The main functional components of
OSNs are data storage and interaction.
• PeerSoN aims at keeping the features of OSNs but overcoming two limitations:
privacy issues and the requirement of Internet connectivity for all transactions. It also
enables direct exchange of data between devices. The main properties of PeerSoN are
encryption, decentralization, and direct data exchange.
58
Multi-Relational characterization of dynamic
social network communities
59
Multi-Relational characterization of dynamic
social network communities
• The characterization of communities in online social media is presented using
computational approaches grounded on the observations from social science.
Motivation: human community as meaning-making eco-system
• The semantics is an emergent artifact of human activity that evolves over time.
• Human activity is mostly social, and the social networks of human are
conceivable loci for the construction of meaning.
• Hence, it is crucial to identify real human networks as communities of people
interacting with each other through meaningful social activities, and producing
stable associations between concepts and artifacts in a coherent manner.
Motivating applications
• As new concepts emerge and evolve around real human networks, community
discovery can result in new knowledge and provoke advancements in information
search and decision-making. Example applications include:
• Context-sensitive information search and recommendation: The discovered
community around an information seeker can provide context (including objects,
activities, time) that help identify most relevant information.
• Content organization, tracking and monitoring: Community structure may be
used to reflect the social sharing practice and facilitate the organization, tracking
and monitoring of user-generated social media content.
• Behavioral prediction: Community structure that accounts for inherent
dependencies between individuals embedded in a social network can help
understand and predict the behavioral dynamics of individuals.
Multi-Relational characterization of dynamic
social network communities
• Data characteristics and challenges – Large volumes of social media data are
being generated from various social media platforms including blogs,
FaceBook, Twitter, Digg, Flickr. The key characteristics of online social media
data include:
• Voluminous: The amount of data and the rate of data production can be
enormous.
• Dynamic: Users’ online actions are constantly archived with timestamps.
These online activity records enable a fine-grained observation on the dynamics
of human interactions and interests.
• Context-rich: Most social media platforms allow a wide array of actions for
managing and sharing media objects
Multi-Relational characterization of dynamic
social network communities
1. Mutual awareness
• It is a bi-directional relationship indicating how well a pair of bloggers is
aware of each other, as fundamental property of a community.
• The amount of mutual awareness is captured expanding on the entire
network using a random walk based distance measure, commute time,
which estimates the probability that two bloggers are aware of each other
on the network.
• The experimental results for community extraction in terms of standard
evaluation metrics are promising.
Multi-Relational characterization of dynamic social
network communities - Approaches to three problems
2. FacetNet:
• The community structure at a given timestep is determined both by the
observed networked data and by the prior distribution given by historic
community structures.
• The experimental results suggest that this technique is scalable and is able
to extract meaningful communities based on social media context.
Multi-Relational characterization of dynamic social
network communities - Approaches to three problems
3. MetaFac:
• MetaFac is the first graph-based tensor factorization framework for
analyzing the dynamics of heterogeneous social networks.
• In this framework, metagraph, is a novel relational hypergraph
representation for modeling multi-relational and multi-dimensional social
data.
• Extensive experiments on large-scale real-world social media data and
from the enterprise data suggest that this technique is able to extract
meaningful communities that are adaptive to social media context.
Multi-Relational characterization of dynamic social
network communities - Approaches to three problems
(a) Mutual awareness – a bi-directional relationship indicating how well a
pair of bloggers is aware of each other, as fundamental property of a
community.
(b) Mutual awareness expansion – a random walk based distance measure
which estimates the probability that two bloggers are aware of each other
on the network.
(c) FacetNet – for analyzing communities and their evolutions in a unified
process.
(d) MetaFac – the first graph-based multi-tensor factorization framework for
analyzing the dynamics of heterogeneous social networks.
Multi-Relational characterization of dynamic social
network communities - Approaches to three problems
Multi-Relational characterization of dynamic social
network communities - Approaches to three problems

Contenu connexe

Tendances

Community detection algorithms
Community detection algorithmsCommunity detection algorithms
Community detection algorithmsAlireza Andalib
 
Community detection in social networks
Community detection in social networksCommunity detection in social networks
Community detection in social networksFrancisco Restivo
 
Social Media Mining - Chapter 9 (Recommendation in Social Media)
Social Media Mining - Chapter 9 (Recommendation in Social Media)Social Media Mining - Chapter 9 (Recommendation in Social Media)
Social Media Mining - Chapter 9 (Recommendation in Social Media)SocialMediaMining
 
Community detection in graphs
Community detection in graphsCommunity detection in graphs
Community detection in graphsNicola Barbieri
 
Social Network Analysis (SNA) 2018
Social Network Analysis  (SNA) 2018Social Network Analysis  (SNA) 2018
Social Network Analysis (SNA) 2018Arsalan Khan
 
Introduction to Social Network Analysis
Introduction to Social Network AnalysisIntroduction to Social Network Analysis
Introduction to Social Network AnalysisPremsankar Chakkingal
 
NE7012- SOCIAL NETWORK ANALYSIS
NE7012- SOCIAL NETWORK ANALYSISNE7012- SOCIAL NETWORK ANALYSIS
NE7012- SOCIAL NETWORK ANALYSISrathnaarul
 
Community Detection
Community Detection Community Detection
Community Detection Kanika Kanwal
 
Social Network Analysis power point presentation
Social Network Analysis power point presentation Social Network Analysis power point presentation
Social Network Analysis power point presentation Ratnesh Shah
 
Social Network Analysis To Blog Based Online Communities
Social Network Analysis To Blog Based Online CommunitiesSocial Network Analysis To Blog Based Online Communities
Social Network Analysis To Blog Based Online Communitiessubby88
 
Probabilistic Relational Models for Link Prediction Problem
Probabilistic Relational Models for Link Prediction ProblemProbabilistic Relational Models for Link Prediction Problem
Probabilistic Relational Models for Link Prediction ProblemSina Sajadmanesh
 
Information retrieval dynamic indexing
Information retrieval dynamic indexingInformation retrieval dynamic indexing
Information retrieval dynamic indexingNadia Nahar
 
Community Detection in Social Networks: A Brief Overview
Community Detection in Social Networks: A Brief OverviewCommunity Detection in Social Networks: A Brief Overview
Community Detection in Social Networks: A Brief OverviewSatyaki Sikdar
 
Congestion avoidance in TCP
Congestion avoidance in TCPCongestion avoidance in TCP
Congestion avoidance in TCPselvakumar_b1985
 

Tendances (20)

Community detection algorithms
Community detection algorithmsCommunity detection algorithms
Community detection algorithms
 
Community detection in social networks
Community detection in social networksCommunity detection in social networks
Community detection in social networks
 
Social Media Mining - Chapter 9 (Recommendation in Social Media)
Social Media Mining - Chapter 9 (Recommendation in Social Media)Social Media Mining - Chapter 9 (Recommendation in Social Media)
Social Media Mining - Chapter 9 (Recommendation in Social Media)
 
Community detection in graphs
Community detection in graphsCommunity detection in graphs
Community detection in graphs
 
Cluster Analysis
Cluster AnalysisCluster Analysis
Cluster Analysis
 
06 Community Detection
06 Community Detection06 Community Detection
06 Community Detection
 
Social Network Analysis (SNA) 2018
Social Network Analysis  (SNA) 2018Social Network Analysis  (SNA) 2018
Social Network Analysis (SNA) 2018
 
Introduction to Social Network Analysis
Introduction to Social Network AnalysisIntroduction to Social Network Analysis
Introduction to Social Network Analysis
 
NE7012- SOCIAL NETWORK ANALYSIS
NE7012- SOCIAL NETWORK ANALYSISNE7012- SOCIAL NETWORK ANALYSIS
NE7012- SOCIAL NETWORK ANALYSIS
 
Social Network Analysis
Social Network AnalysisSocial Network Analysis
Social Network Analysis
 
Distributed DBMS - Unit 5 - Semantic Data Control
Distributed DBMS - Unit 5 - Semantic Data ControlDistributed DBMS - Unit 5 - Semantic Data Control
Distributed DBMS - Unit 5 - Semantic Data Control
 
Community Detection
Community Detection Community Detection
Community Detection
 
Social Network Analysis power point presentation
Social Network Analysis power point presentation Social Network Analysis power point presentation
Social Network Analysis power point presentation
 
Social Network Analysis To Blog Based Online Communities
Social Network Analysis To Blog Based Online CommunitiesSocial Network Analysis To Blog Based Online Communities
Social Network Analysis To Blog Based Online Communities
 
CS8080 IRT UNIT I NOTES.pdf
CS8080 IRT UNIT I  NOTES.pdfCS8080 IRT UNIT I  NOTES.pdf
CS8080 IRT UNIT I NOTES.pdf
 
Probabilistic Relational Models for Link Prediction Problem
Probabilistic Relational Models for Link Prediction ProblemProbabilistic Relational Models for Link Prediction Problem
Probabilistic Relational Models for Link Prediction Problem
 
Information retrieval dynamic indexing
Information retrieval dynamic indexingInformation retrieval dynamic indexing
Information retrieval dynamic indexing
 
Community Detection in Social Networks: A Brief Overview
Community Detection in Social Networks: A Brief OverviewCommunity Detection in Social Networks: A Brief Overview
Community Detection in Social Networks: A Brief Overview
 
Congestion avoidance in TCP
Congestion avoidance in TCPCongestion avoidance in TCP
Congestion avoidance in TCP
 
Trends in distributed systems
Trends in distributed systemsTrends in distributed systems
Trends in distributed systems
 

Similaire à CS6010 Social Network Analysis Unit III

2016 Cytoscape 3.3 Tutorial
2016 Cytoscape 3.3 Tutorial2016 Cytoscape 3.3 Tutorial
2016 Cytoscape 3.3 TutorialAlexander Pico
 
Data Mining In Social Networks Using K-Means Clustering Algorithm
Data Mining In Social Networks Using K-Means Clustering AlgorithmData Mining In Social Networks Using K-Means Clustering Algorithm
Data Mining In Social Networks Using K-Means Clustering Algorithmnishant24894
 
Network sampling, community detection
Network sampling, community detectionNetwork sampling, community detection
Network sampling, community detectionroberval mariano
 
Lessons Learned with Distributed Systems at Bitly
Lessons Learned with Distributed Systems at BitlyLessons Learned with Distributed Systems at Bitly
Lessons Learned with Distributed Systems at BitlyAll Things Open
 
Architectural design1
Architectural design1Architectural design1
Architectural design1Zahid Hussain
 
Architectural design1
Architectural design1Architectural design1
Architectural design1Zahid Hussain
 
Community Structure, Interaction and Evolution Analysis of Online Social Netw...
Community Structure, Interaction and Evolution Analysis of Online Social Netw...Community Structure, Interaction and Evolution Analysis of Online Social Netw...
Community Structure, Interaction and Evolution Analysis of Online Social Netw...Symeon Papadopoulos
 
SCALABLE LOCAL COMMUNITY DETECTION WITH MAPREDUCE FOR LARGE NETWORKS
SCALABLE LOCAL COMMUNITY DETECTION WITH MAPREDUCE FOR LARGE NETWORKSSCALABLE LOCAL COMMUNITY DETECTION WITH MAPREDUCE FOR LARGE NETWORKS
SCALABLE LOCAL COMMUNITY DETECTION WITH MAPREDUCE FOR LARGE NETWORKSIJDKP
 
Scalable Local Community Detection with Mapreduce for Large Networks
Scalable Local Community Detection with Mapreduce for Large NetworksScalable Local Community Detection with Mapreduce for Large Networks
Scalable Local Community Detection with Mapreduce for Large NetworksIJDKP
 
Community analysis using graph representation learning on social networks
Community analysis using graph representation learning on social networksCommunity analysis using graph representation learning on social networks
Community analysis using graph representation learning on social networksMarco Brambilla
 
network mining and representation learning
network mining and representation learningnetwork mining and representation learning
network mining and representation learningsun peiyuan
 
Community Detection with Networkx
Community Detection with NetworkxCommunity Detection with Networkx
Community Detection with NetworkxErika Fille Legara
 
Community Detection in Social Media
Community Detection in Social MediaCommunity Detection in Social Media
Community Detection in Social Mediarezahk
 
Mining the Social Web - Lecture 2 - T61.6020
Mining the Social Web - Lecture 2 - T61.6020Mining the Social Web - Lecture 2 - T61.6020
Mining the Social Web - Lecture 2 - T61.6020Michael Mathioudakis
 
community Detection.pptx
community Detection.pptxcommunity Detection.pptx
community Detection.pptxBhuvana97
 
Measuring the Topical Specificity of Online Communities
Measuring the Topical Specificity of Online CommunitiesMeasuring the Topical Specificity of Online Communities
Measuring the Topical Specificity of Online CommunitiesMatthew Rowe
 
2011 IEEE Social Computing Nodexl: Group-In-A-Box
2011 IEEE Social Computing Nodexl: Group-In-A-Box2011 IEEE Social Computing Nodexl: Group-In-A-Box
2011 IEEE Social Computing Nodexl: Group-In-A-BoxMarc Smith
 
Taxonomy and survey of community
Taxonomy and survey of communityTaxonomy and survey of community
Taxonomy and survey of communityIJCSES Journal
 
Sylva workshop.gt that camp.2012
Sylva workshop.gt that camp.2012Sylva workshop.gt that camp.2012
Sylva workshop.gt that camp.2012CameliaN
 

Similaire à CS6010 Social Network Analysis Unit III (20)

2016 Cytoscape 3.3 Tutorial
2016 Cytoscape 3.3 Tutorial2016 Cytoscape 3.3 Tutorial
2016 Cytoscape 3.3 Tutorial
 
Data Mining In Social Networks Using K-Means Clustering Algorithm
Data Mining In Social Networks Using K-Means Clustering AlgorithmData Mining In Social Networks Using K-Means Clustering Algorithm
Data Mining In Social Networks Using K-Means Clustering Algorithm
 
Chapter 3.pdf
Chapter 3.pdfChapter 3.pdf
Chapter 3.pdf
 
Network sampling, community detection
Network sampling, community detectionNetwork sampling, community detection
Network sampling, community detection
 
Lessons Learned with Distributed Systems at Bitly
Lessons Learned with Distributed Systems at BitlyLessons Learned with Distributed Systems at Bitly
Lessons Learned with Distributed Systems at Bitly
 
Architectural design1
Architectural design1Architectural design1
Architectural design1
 
Architectural design1
Architectural design1Architectural design1
Architectural design1
 
Community Structure, Interaction and Evolution Analysis of Online Social Netw...
Community Structure, Interaction and Evolution Analysis of Online Social Netw...Community Structure, Interaction and Evolution Analysis of Online Social Netw...
Community Structure, Interaction and Evolution Analysis of Online Social Netw...
 
SCALABLE LOCAL COMMUNITY DETECTION WITH MAPREDUCE FOR LARGE NETWORKS
SCALABLE LOCAL COMMUNITY DETECTION WITH MAPREDUCE FOR LARGE NETWORKSSCALABLE LOCAL COMMUNITY DETECTION WITH MAPREDUCE FOR LARGE NETWORKS
SCALABLE LOCAL COMMUNITY DETECTION WITH MAPREDUCE FOR LARGE NETWORKS
 
Scalable Local Community Detection with Mapreduce for Large Networks
Scalable Local Community Detection with Mapreduce for Large NetworksScalable Local Community Detection with Mapreduce for Large Networks
Scalable Local Community Detection with Mapreduce for Large Networks
 
Community analysis using graph representation learning on social networks
Community analysis using graph representation learning on social networksCommunity analysis using graph representation learning on social networks
Community analysis using graph representation learning on social networks
 
network mining and representation learning
network mining and representation learningnetwork mining and representation learning
network mining and representation learning
 
Community Detection with Networkx
Community Detection with NetworkxCommunity Detection with Networkx
Community Detection with Networkx
 
Community Detection in Social Media
Community Detection in Social MediaCommunity Detection in Social Media
Community Detection in Social Media
 
Mining the Social Web - Lecture 2 - T61.6020
Mining the Social Web - Lecture 2 - T61.6020Mining the Social Web - Lecture 2 - T61.6020
Mining the Social Web - Lecture 2 - T61.6020
 
community Detection.pptx
community Detection.pptxcommunity Detection.pptx
community Detection.pptx
 
Measuring the Topical Specificity of Online Communities
Measuring the Topical Specificity of Online CommunitiesMeasuring the Topical Specificity of Online Communities
Measuring the Topical Specificity of Online Communities
 
2011 IEEE Social Computing Nodexl: Group-In-A-Box
2011 IEEE Social Computing Nodexl: Group-In-A-Box2011 IEEE Social Computing Nodexl: Group-In-A-Box
2011 IEEE Social Computing Nodexl: Group-In-A-Box
 
Taxonomy and survey of community
Taxonomy and survey of communityTaxonomy and survey of community
Taxonomy and survey of community
 
Sylva workshop.gt that camp.2012
Sylva workshop.gt that camp.2012Sylva workshop.gt that camp.2012
Sylva workshop.gt that camp.2012
 

Plus de pkaviya

IT2255 Web Essentials - Unit V Servlets and Database Connectivity
IT2255 Web Essentials - Unit V Servlets and Database ConnectivityIT2255 Web Essentials - Unit V Servlets and Database Connectivity
IT2255 Web Essentials - Unit V Servlets and Database Connectivitypkaviya
 
IT2255 Web Essentials - Unit IV Server-Side Processing and Scripting - PHP.pdf
IT2255 Web Essentials - Unit IV Server-Side Processing and Scripting - PHP.pdfIT2255 Web Essentials - Unit IV Server-Side Processing and Scripting - PHP.pdf
IT2255 Web Essentials - Unit IV Server-Side Processing and Scripting - PHP.pdfpkaviya
 
IT2255 Web Essentials - Unit III Client-Side Processing and Scripting
IT2255 Web Essentials - Unit III Client-Side Processing and ScriptingIT2255 Web Essentials - Unit III Client-Side Processing and Scripting
IT2255 Web Essentials - Unit III Client-Side Processing and Scriptingpkaviya
 
IT2255 Web Essentials - Unit II Web Designing
IT2255 Web Essentials - Unit II  Web DesigningIT2255 Web Essentials - Unit II  Web Designing
IT2255 Web Essentials - Unit II Web Designingpkaviya
 
IT2255 Web Essentials - Unit I Website Basics
IT2255 Web Essentials - Unit I  Website BasicsIT2255 Web Essentials - Unit I  Website Basics
IT2255 Web Essentials - Unit I Website Basicspkaviya
 
BT2252 - ETBT - UNIT 3 - Enzyme Immobilization.pdf
BT2252 - ETBT - UNIT 3 - Enzyme Immobilization.pdfBT2252 - ETBT - UNIT 3 - Enzyme Immobilization.pdf
BT2252 - ETBT - UNIT 3 - Enzyme Immobilization.pdfpkaviya
 
OIT552 Cloud Computing Material
OIT552 Cloud Computing MaterialOIT552 Cloud Computing Material
OIT552 Cloud Computing Materialpkaviya
 
OIT552 Cloud Computing - Question Bank
OIT552 Cloud Computing - Question BankOIT552 Cloud Computing - Question Bank
OIT552 Cloud Computing - Question Bankpkaviya
 
CS8791 Cloud Computing - Question Bank
CS8791 Cloud Computing - Question BankCS8791 Cloud Computing - Question Bank
CS8791 Cloud Computing - Question Bankpkaviya
 
CS8592 Object Oriented Analysis & Design - UNIT V
CS8592 Object Oriented Analysis & Design - UNIT V CS8592 Object Oriented Analysis & Design - UNIT V
CS8592 Object Oriented Analysis & Design - UNIT V pkaviya
 
CS8592 Object Oriented Analysis & Design - UNIT IV
CS8592 Object Oriented Analysis & Design - UNIT IV CS8592 Object Oriented Analysis & Design - UNIT IV
CS8592 Object Oriented Analysis & Design - UNIT IV pkaviya
 
CS8592 Object Oriented Analysis & Design - UNIT III
CS8592 Object Oriented Analysis & Design - UNIT III CS8592 Object Oriented Analysis & Design - UNIT III
CS8592 Object Oriented Analysis & Design - UNIT III pkaviya
 
CS8592 Object Oriented Analysis & Design - UNIT II
CS8592 Object Oriented Analysis & Design - UNIT IICS8592 Object Oriented Analysis & Design - UNIT II
CS8592 Object Oriented Analysis & Design - UNIT IIpkaviya
 
CS8592 Object Oriented Analysis & Design - UNIT I
CS8592 Object Oriented Analysis & Design - UNIT ICS8592 Object Oriented Analysis & Design - UNIT I
CS8592 Object Oriented Analysis & Design - UNIT Ipkaviya
 
Cs8591 Computer Networks - UNIT V
Cs8591 Computer Networks - UNIT VCs8591 Computer Networks - UNIT V
Cs8591 Computer Networks - UNIT Vpkaviya
 
CS8591 Computer Networks - Unit IV
CS8591 Computer Networks - Unit IVCS8591 Computer Networks - Unit IV
CS8591 Computer Networks - Unit IVpkaviya
 
CS8591 Computer Networks - Unit III
CS8591 Computer Networks - Unit IIICS8591 Computer Networks - Unit III
CS8591 Computer Networks - Unit IIIpkaviya
 
CS8591 Computer Networks - Unit II
CS8591 Computer Networks - Unit II CS8591 Computer Networks - Unit II
CS8591 Computer Networks - Unit II pkaviya
 
CS8591 Computer Networks - Unit I
CS8591 Computer Networks - Unit ICS8591 Computer Networks - Unit I
CS8591 Computer Networks - Unit Ipkaviya
 
IT8602 Mobile Communication - Unit V
IT8602 Mobile Communication - Unit V IT8602 Mobile Communication - Unit V
IT8602 Mobile Communication - Unit V pkaviya
 

Plus de pkaviya (20)

IT2255 Web Essentials - Unit V Servlets and Database Connectivity
IT2255 Web Essentials - Unit V Servlets and Database ConnectivityIT2255 Web Essentials - Unit V Servlets and Database Connectivity
IT2255 Web Essentials - Unit V Servlets and Database Connectivity
 
IT2255 Web Essentials - Unit IV Server-Side Processing and Scripting - PHP.pdf
IT2255 Web Essentials - Unit IV Server-Side Processing and Scripting - PHP.pdfIT2255 Web Essentials - Unit IV Server-Side Processing and Scripting - PHP.pdf
IT2255 Web Essentials - Unit IV Server-Side Processing and Scripting - PHP.pdf
 
IT2255 Web Essentials - Unit III Client-Side Processing and Scripting
IT2255 Web Essentials - Unit III Client-Side Processing and ScriptingIT2255 Web Essentials - Unit III Client-Side Processing and Scripting
IT2255 Web Essentials - Unit III Client-Side Processing and Scripting
 
IT2255 Web Essentials - Unit II Web Designing
IT2255 Web Essentials - Unit II  Web DesigningIT2255 Web Essentials - Unit II  Web Designing
IT2255 Web Essentials - Unit II Web Designing
 
IT2255 Web Essentials - Unit I Website Basics
IT2255 Web Essentials - Unit I  Website BasicsIT2255 Web Essentials - Unit I  Website Basics
IT2255 Web Essentials - Unit I Website Basics
 
BT2252 - ETBT - UNIT 3 - Enzyme Immobilization.pdf
BT2252 - ETBT - UNIT 3 - Enzyme Immobilization.pdfBT2252 - ETBT - UNIT 3 - Enzyme Immobilization.pdf
BT2252 - ETBT - UNIT 3 - Enzyme Immobilization.pdf
 
OIT552 Cloud Computing Material
OIT552 Cloud Computing MaterialOIT552 Cloud Computing Material
OIT552 Cloud Computing Material
 
OIT552 Cloud Computing - Question Bank
OIT552 Cloud Computing - Question BankOIT552 Cloud Computing - Question Bank
OIT552 Cloud Computing - Question Bank
 
CS8791 Cloud Computing - Question Bank
CS8791 Cloud Computing - Question BankCS8791 Cloud Computing - Question Bank
CS8791 Cloud Computing - Question Bank
 
CS8592 Object Oriented Analysis & Design - UNIT V
CS8592 Object Oriented Analysis & Design - UNIT V CS8592 Object Oriented Analysis & Design - UNIT V
CS8592 Object Oriented Analysis & Design - UNIT V
 
CS8592 Object Oriented Analysis & Design - UNIT IV
CS8592 Object Oriented Analysis & Design - UNIT IV CS8592 Object Oriented Analysis & Design - UNIT IV
CS8592 Object Oriented Analysis & Design - UNIT IV
 
CS8592 Object Oriented Analysis & Design - UNIT III
CS8592 Object Oriented Analysis & Design - UNIT III CS8592 Object Oriented Analysis & Design - UNIT III
CS8592 Object Oriented Analysis & Design - UNIT III
 
CS8592 Object Oriented Analysis & Design - UNIT II
CS8592 Object Oriented Analysis & Design - UNIT IICS8592 Object Oriented Analysis & Design - UNIT II
CS8592 Object Oriented Analysis & Design - UNIT II
 
CS8592 Object Oriented Analysis & Design - UNIT I
CS8592 Object Oriented Analysis & Design - UNIT ICS8592 Object Oriented Analysis & Design - UNIT I
CS8592 Object Oriented Analysis & Design - UNIT I
 
Cs8591 Computer Networks - UNIT V
Cs8591 Computer Networks - UNIT VCs8591 Computer Networks - UNIT V
Cs8591 Computer Networks - UNIT V
 
CS8591 Computer Networks - Unit IV
CS8591 Computer Networks - Unit IVCS8591 Computer Networks - Unit IV
CS8591 Computer Networks - Unit IV
 
CS8591 Computer Networks - Unit III
CS8591 Computer Networks - Unit IIICS8591 Computer Networks - Unit III
CS8591 Computer Networks - Unit III
 
CS8591 Computer Networks - Unit II
CS8591 Computer Networks - Unit II CS8591 Computer Networks - Unit II
CS8591 Computer Networks - Unit II
 
CS8591 Computer Networks - Unit I
CS8591 Computer Networks - Unit ICS8591 Computer Networks - Unit I
CS8591 Computer Networks - Unit I
 
IT8602 Mobile Communication - Unit V
IT8602 Mobile Communication - Unit V IT8602 Mobile Communication - Unit V
IT8602 Mobile Communication - Unit V
 

Dernier

DeepFakes presentation : brief idea of DeepFakes
DeepFakes presentation : brief idea of DeepFakesDeepFakes presentation : brief idea of DeepFakes
DeepFakes presentation : brief idea of DeepFakesMayuraD1
 
Double Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torqueDouble Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torqueBhangaleSonal
 
Thermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - VThermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - VDineshKumar4165
 
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXssuser89054b
 
Moment Distribution Method For Btech Civil
Moment Distribution Method For Btech CivilMoment Distribution Method For Btech Civil
Moment Distribution Method For Btech CivilVinayVitekari
 
Orlando’s Arnold Palmer Hospital Layout Strategy-1.pptx
Orlando’s Arnold Palmer Hospital Layout Strategy-1.pptxOrlando’s Arnold Palmer Hospital Layout Strategy-1.pptx
Orlando’s Arnold Palmer Hospital Layout Strategy-1.pptxMuhammadAsimMuhammad6
 
Thermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.pptThermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.pptDineshKumar4165
 
A CASE STUDY ON CERAMIC INDUSTRY OF BANGLADESH.pptx
A CASE STUDY ON CERAMIC INDUSTRY OF BANGLADESH.pptxA CASE STUDY ON CERAMIC INDUSTRY OF BANGLADESH.pptx
A CASE STUDY ON CERAMIC INDUSTRY OF BANGLADESH.pptxmaisarahman1
 
Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...
Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...
Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...Call Girls Mumbai
 
Unleashing the Power of the SORA AI lastest leap
Unleashing the Power of the SORA AI lastest leapUnleashing the Power of the SORA AI lastest leap
Unleashing the Power of the SORA AI lastest leapRishantSharmaFr
 
Introduction to Serverless with AWS Lambda
Introduction to Serverless with AWS LambdaIntroduction to Serverless with AWS Lambda
Introduction to Serverless with AWS LambdaOmar Fathy
 
Computer Lecture 01.pptxIntroduction to Computers
Computer Lecture 01.pptxIntroduction to ComputersComputer Lecture 01.pptxIntroduction to Computers
Computer Lecture 01.pptxIntroduction to ComputersMairaAshraf6
 
School management system project Report.pdf
School management system project Report.pdfSchool management system project Report.pdf
School management system project Report.pdfKamal Acharya
 
NO1 Top No1 Amil Baba In Azad Kashmir, Kashmir Black Magic Specialist Expert ...
NO1 Top No1 Amil Baba In Azad Kashmir, Kashmir Black Magic Specialist Expert ...NO1 Top No1 Amil Baba In Azad Kashmir, Kashmir Black Magic Specialist Expert ...
NO1 Top No1 Amil Baba In Azad Kashmir, Kashmir Black Magic Specialist Expert ...Amil baba
 
A Study of Urban Area Plan for Pabna Municipality
A Study of Urban Area Plan for Pabna MunicipalityA Study of Urban Area Plan for Pabna Municipality
A Study of Urban Area Plan for Pabna MunicipalityMorshed Ahmed Rahath
 
Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...
Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...
Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...Arindam Chakraborty, Ph.D., P.E. (CA, TX)
 
kiln thermal load.pptx kiln tgermal load
kiln thermal load.pptx kiln tgermal loadkiln thermal load.pptx kiln tgermal load
kiln thermal load.pptx kiln tgermal loadhamedmustafa094
 
HAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKAR
HAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKARHAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKAR
HAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKARKOUSTAV SARKAR
 

Dernier (20)

DeepFakes presentation : brief idea of DeepFakes
DeepFakes presentation : brief idea of DeepFakesDeepFakes presentation : brief idea of DeepFakes
DeepFakes presentation : brief idea of DeepFakes
 
Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7
Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7
Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7
 
Double Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torqueDouble Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torque
 
Thermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - VThermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - V
 
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
 
Moment Distribution Method For Btech Civil
Moment Distribution Method For Btech CivilMoment Distribution Method For Btech Civil
Moment Distribution Method For Btech Civil
 
Orlando’s Arnold Palmer Hospital Layout Strategy-1.pptx
Orlando’s Arnold Palmer Hospital Layout Strategy-1.pptxOrlando’s Arnold Palmer Hospital Layout Strategy-1.pptx
Orlando’s Arnold Palmer Hospital Layout Strategy-1.pptx
 
Thermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.pptThermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.ppt
 
A CASE STUDY ON CERAMIC INDUSTRY OF BANGLADESH.pptx
A CASE STUDY ON CERAMIC INDUSTRY OF BANGLADESH.pptxA CASE STUDY ON CERAMIC INDUSTRY OF BANGLADESH.pptx
A CASE STUDY ON CERAMIC INDUSTRY OF BANGLADESH.pptx
 
Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...
Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...
Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...
 
Unleashing the Power of the SORA AI lastest leap
Unleashing the Power of the SORA AI lastest leapUnleashing the Power of the SORA AI lastest leap
Unleashing the Power of the SORA AI lastest leap
 
Introduction to Serverless with AWS Lambda
Introduction to Serverless with AWS LambdaIntroduction to Serverless with AWS Lambda
Introduction to Serverless with AWS Lambda
 
Computer Lecture 01.pptxIntroduction to Computers
Computer Lecture 01.pptxIntroduction to ComputersComputer Lecture 01.pptxIntroduction to Computers
Computer Lecture 01.pptxIntroduction to Computers
 
FEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced Loads
FEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced LoadsFEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced Loads
FEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced Loads
 
School management system project Report.pdf
School management system project Report.pdfSchool management system project Report.pdf
School management system project Report.pdf
 
NO1 Top No1 Amil Baba In Azad Kashmir, Kashmir Black Magic Specialist Expert ...
NO1 Top No1 Amil Baba In Azad Kashmir, Kashmir Black Magic Specialist Expert ...NO1 Top No1 Amil Baba In Azad Kashmir, Kashmir Black Magic Specialist Expert ...
NO1 Top No1 Amil Baba In Azad Kashmir, Kashmir Black Magic Specialist Expert ...
 
A Study of Urban Area Plan for Pabna Municipality
A Study of Urban Area Plan for Pabna MunicipalityA Study of Urban Area Plan for Pabna Municipality
A Study of Urban Area Plan for Pabna Municipality
 
Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...
Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...
Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...
 
kiln thermal load.pptx kiln tgermal load
kiln thermal load.pptx kiln tgermal loadkiln thermal load.pptx kiln tgermal load
kiln thermal load.pptx kiln tgermal load
 
HAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKAR
HAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKARHAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKAR
HAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKAR
 

CS6010 Social Network Analysis Unit III

  • 1. CS6010 - Social Network Analysis Unit III – Extraction and Mining Communities in Web Social Networks Kaviya.P AP/IT Kamaraj College of Engineering & Technology 1
  • 2. Unit III – Extraction and Mining Communities in Web Social Networks Extracting evolution of Web Community from a Series of Web Archive Detecting communities in social networks - Definition of community - Evaluating communities - Methods for community detection and mining - Applications of community mining algorithms - Tools for detecting communities social network infrastructures and communities - Decentralized online social networks - Multi-Relational characterization of dynamic social network communities. 2
  • 3. Extracting Evolution of Web Community from a Series of Web Archive 3
  • 4. Extracting Evolution of Web Community from a Series of Web Archive • The extraction of Web community utilizes Web community chart – A graph of communities, in which related communities are connected by weighted edges. • The main advantage of the Web community chart is existence of relevance between communities. • Notations Used – t1, t2, ..., tn: Time when each archive crawled. Currently, a month is used as the unit time. – W(tk): The Web archive at time tk. – C(tk): The Web community chart at time tk. – c(tk), d(tk), e(tk), ...: Communities in C(tk). 4
  • 5. Extracting evolution of Web Community from a Series of Web Archive Types of Changes • Emerge – A community c(tk) emerges in C(tk), when c(tk) shares no URLs with any community in C(tk−1). • Dissolve – A community c(tk−1) in C(tk) has dissolved, when c(tk−1) shares no URLs with any community in C(tk) • Growth and Shrink – The community grows when new URLs are appeared in c(tk), and shrinks when URLs disappeared from c(tk−1). • Split – c(tk−1) shares URLs with multiple communities in C(tk) • Merge – When multiple communities (c(tk−1)), d(tk−1), ...) share URLs with a single community e(tk), these communities are merged into e(tk) 5
  • 6. Extracting evolution of Web Community from a Series of Web Archive Evolution Metrics • Evolution metrics measure how a particular community c(tk) has evolved. • The metrics are defined by differences between c(tk) and its corresponding community c(tk−1). • Attributes Used: – N(c(tk)): Number of URLs in the c(tk). – Nsh(c(tk−1), c(tk)): Number of URLs shared by c(tk−1) and c(tk). – Ndis(c(tk−1)): Number of disappeared URLs from c(tk−1) that exist in c(tk−1) but do not exist in any community in C(tk)). – Nsp(c(tk1), c(tk)): Number of URLs split from c(tk−1) to communities at tk other than c(tk). – Nap(c(tk)): Number of newly appeared URLs in c(tk)) that exist in c(tk) but do not exist in any community C(tk−1). – Nmg(c(tk−1), c(tk)): Number of URLs merged into c(tk)) from communities at tk−1 other than c(tk−1). 6
  • 7. Extracting evolution of Web Community from a Series of Web Archive Evolution Metrics • Evolution metrics measure how a particular community c(tk) has evolved. • Growth Rate – The growth rate, Rgrow(c(tk−1), c(tk)), represents the increase of URLs per unit time. – It allows us to find most growing or shrinking communities. • Stability – Represents the amount of disappeared, appeared, merged and split URLs per unit time. – A stable community on a topic is the best starting point for finding interesting changes around the topic. 7
  • 8. Extracting evolution of Web Community from a Series of Web Archive Evolution Metrics • Disappearance Rate – The number of disappeared URLs from c(tk−1) per unit time. – Higher disappear rate means that the community has lost URLs mainly by disappearance. • Merge Rate – The number of absorbed URLs from other communities by merging per unit time. – Higher merge rate means that the community has obtained URLs mainly by merging. 8
  • 9. Extracting evolution of Web Community from a Series of Web Archive Evolution Metrics • Split Rate – The split rate, Rsplit (c(tk−1), c(tk)), is the number of split URLs from c(tk−1) per unit time. – When the split rate is low, c(tk) is larger than other split communities. Otherwise, c(tk) is smaller than other split communities. • Other Metrics – The novelty metrics of a main line (c(ti), c(ti+1), ..., c(t j)) is calculated as follows. 9
  • 10. Extracting evolution of Web Community from a Series of Web Archive Evolution Metrics • By combining these metrics, some complex evolution patterns can be represented. – A community has stably grown when its growth rate is positive, and its disappearance and split rates are low. – Similar evolution patterns can be defined for shrinkage. 10
  • 11. Extracting evolution of Web Community from a Series of Web Archive Web Archives and Graphs • Web archiving is the process of collecting portions of the Web to ensure the information is preserved in an archive • Web crawlers are used for automated capture due to the massive size and amount of information on the Web. • From each archive, a Web graph is built with URLs and links by extracting anchors from all pages in the archive. • The graph included not only URLs inside the archive, but also URLs outside pointed to by inside URLs. • By comparing these graphs, the Web was extremely dynamic. 11
  • 12. Extracting evolution of Web Community from a Series of Web Archive Evolution of Web Community Charts • The size distribution of communities also follows the power law and its exponent did not change so much over time. • Although the size distribution of communities is stable, the structure of communities changes dynamically. • The structure of the chart changes mainly by split and merge, in which more than half of communities are involved. • Split and Merged Communities – Both distributions roughly follow the power law, and show that split or merge rate is small in most cases. – Their shapes and scales are also similar. – This symmetry is part of the reason why the size distribution of communities does not change so much. 12
  • 13. Extracting evolution of Web Community from a Series of Web Archive Evolution of Web Community Charts • Emerged and Dissolved Communities – The size distributions of emerged and dissolved communities also follow the power law – Contribute to preserve the size distribution of communities. – Small communities are easy to emerge and dissolve. • Growth Rate – The growth rate is small for most of communities, and the graph has clear y-axis symmetry. – Size distribution of communities is preserved over time. 13
  • 14. Detecting communities in social networks Definition of community - Evaluating Communities 14
  • 15. Detecting communities in social networks Detecting communities from given social networks are practically important for the following reasons: • Communities can be used for information recommendation because members of the communities often have similar tastes and preferences. Membership of detected communities will be the basis of collaborative filtering. • Communities will help us understand the structures of given social networks. Communities are regarded as components of given social networks, and they will clarify the functions and properties of the networks. • Communities will play important roles when we visualize large-scale social networks. Relations of the communities clarify the processes of information sharing and information diffusions, and they may give us some insights for the growth the networks in the future. 15
  • 16. Detecting communities in social networks Definition of community • “Community” means a subnetwork whose edges connecting inside of it are denser than the edges connecting outside of it . • Three categories: – Local definitions – Global definitions – Definitions based on vertex similarity. 16
  • 17. Detecting communities in social networks Definition of community Local definitions • The attention is focused on the vertices of the subnetwork under investigation and on its immediate neighborhood. • Local definitions of community can be further divided into self-referring ones and comparative ones. • The former considers the subnetwork alone, and the latter compares mutual connections of the vertices of the subnetwork with the connections with external neighbors. • Self referring definitions are clique (a maximal subnetworks where each vertex is adjacent to all the others), n-clique (a maximal subnetwork such that the distance of each pair of vertices is not larger than n), and k-plex (a maximal subnetwork such that each vertex is adjacent to all the others except at most k of them). 17
  • 18. Detecting communities in social networks Definition of community Local definitions • Comparative definitions are LS set (a subnetwork where each vertex has more neighbors inside than outside of the subnetwork), and weak community (the total degrees of the vertices inside the community exceeds the number of edges lying between the community and the rest of the network). Global Definitions • Global definitions of community characterize a subnetwork with respect to the network as a whole. • These definitions usually starts from a null model, in another words, a network which matches the original network in some of its topological features, but which does not display community structure. • The most popular null model consists of a randomized version of the original network, where edges are rewired at random under the constraint that each vertex keeps its degree. This null model is the basic concept behind the definition of modularity. 18
  • 19. Detecting communities in social networks Definition of community Definitions based on vertex similarity • Communities are groups of vertices which are similar to each other. • Some quantitative criterion is employed to evaluate the similarity between each pair of vertices. • Hierarchical clustering is a way to find several layers of communities that are composed of vertices similar to each other. • Repetitive merges of similar vertices based on some quantitative similarity measures will generate a structure shown in Fig. • This structure is called dendrogram, and highly similar vertices are connected in the lower part of the dendrogram. • Subtrees obtained by cutting the dendrogram with horizontal line correspond to communities. • Communities of different granurality will be obtained by changing the position of the horizontal line 19
  • 20. Detecting communities in social networks Evaluating Communities • It is necessary to establish which partition exhibit a real community structure. Therefore, a quality function for evaluating how good a partition is needed. • The most popular quality function is the modularity of Newman and Girivan: Where A is the adjacency matrix, ki is the degree of vertex i , m is the total number of edges of the network. • Modularity can be rewritten as follows: Where nm is the number of communities, ls is the total number of edges joining vertices of community s, ds is the sum of the degrees of the vertices of s 20
  • 21. Detecting communities in social networks Evaluating Communities Meaning of modularity • Subnetwork is a community if the number of edges inside it is larger than the expected number in modularity’s null model 21
  • 22. Methods for Community Detection and Mining 22
  • 23. Methods for Community Detection and Mining • Native methods for dividing given networks into subnetworks, such as graph partitioning, hierarchical clustering, and k-means clustering. • However, these methods needs to provide the numbers of clusters or their size. • It is desirable to devise methods that have abilities of extracting a complete information about the community structure of networks. • The methods for detecting communities are roughly classified into the following categories: (1) divisive algorithms, (2) modularity optimization, (3) spectral algorithms, and (4) other algorithms. 23
  • 24. Methods for Community Detection and Mining Divisive algorithms • A simple way to identify communities in a network is to detect the edges that connect vertices of different communities and remove them, so that the communities get disconnected from each other. • The most popular algorithm is that proposed by Girvan and Newman. • In this algorithm, edges are selected according to the values of measures of edge centrality, estimating the importance of edges according to some property on the network. • The steps of the algorithm are as follows: – (1) Computation of the centrality of all edges, – (2) Removal of edge with largest centrality, – (3) Recalculation of centralities on the running network – (4) Iteration of the cycle from step (2). 24
  • 25. Methods for Community Detection and Mining Divisive algorithms • Girvan and Newman focuses on the concept of edge betweenness. • Edge betweenness is the number of shortest paths between all vertex pairs that run along the edge. 25
  • 26. Methods for Community Detection and Mining Modularity Optimization • Modularity is a quality function for evaluating partitions. • Therefore, the partition corresponding to its maximum value on a given network should be the best one. This is the main idea for modularity optimization. • It has been proved that modularity optimization is an NP-hard problem. • However, there are currently several algorithms that are able to find fairly good approximations of the modularity maximum in a reasonable time. • One of the famous algorithms for modularity optimization is CNM algorithm. • Another examples of the algorithms are greedy algorithms and simulated annealing. 26
  • 27. Methods for Community Detection and Mining Spectral Algorithms • Spectral algorithms are to cut given network into pieces so that the number of edges to be cut will be minimized. • One of the basic algorithm is spectral graph bipartitioning. • The Laplacian matrix L of a network is an n * n symmetric matrix, with one row and column for each vertex. • Laplacian matrix is defined as L =D - A , where A is the adjacency matrix and D is the diagonal degree matrix with 27
  • 28. Methods for Community Detection and Mining Spectral Algorithms • All eigenvalues of L are real and non-negative, and L has a full set of n real and orthogonal eigenvectors. • In order to minimize the above cut, vertices are partitioned based on the signs of the eigenvector that corresponds to the second smallest eigenvalue of L. • In general, community detection based on repetative bipartitioning is relatively fast. 28
  • 29. Methods for Community Detection and Mining Other Algorithms • There are many other algorithms for detecting communities, such as the methods focusing on random walk, and the ones searching for overlapping cliques. • Danon compares the computational costs and their accuracies of major community detection methods. 29
  • 30. Applications of Community Mining Algorithms 30
  • 31. Applications of Community Mining Algorithms • Network Reduction • Discovering Scientific Collaboration Groups from Social Networks • Mining Communities from Distributed and Dynamic Networks 31
  • 32. Applications of Community Mining Algorithms 1. Network Reduction • Network reduction is an important step in analyzing social networks. • Example – The bibliography contains 360 papers written by 314 authors. • Community structure is detected using a community mining algorithm, called ICS. • Most of the detected communities are self-connected components. 32
  • 33. Applications of Community Mining Algorithms 1. Network Reduction 33 The bibliography network for the book entitled “graph products: structure and recognition”
  • 34. Applications of Community Mining Algorithms 1. Network Reduction 34 The community structure of the bibliography network as detected using ICS
  • 35. Applications of Community Mining Algorithms 1. Network Reduction 35 The coauthor network corresponding to the biggest component
  • 36. Applications of Community Mining Algorithms 1. Network Reduction - The reduction of a coauthor network • The clustered coauthor network can be reduced into a much smaller one by condensing each community as one node. • The top-level condensed network corresponding to a 3-community structure is constructed by using ICS from the condensed network. • In this way, a dendrogram corresponding to the original coauthor network can be built. • The dendrogram of the coauthor network 36
  • 37. Applications of Community Mining Algorithms 1. Network Reduction 37 The reduction of a coauthor – The community structure of the network, The condensed network, The top-level condensed network
  • 38. Applications of Community Mining Algorithms 2. Discovering Scientific Collaboration Groups from Social Networks • Flink is a social network that describes the scientific collaborations among 681 semantic Web researchers. • From the perspective of social network analysis, one may be especially interested in such questions as: 1. Among all researchers, which ones would more likely to collaborate with each other? 2. What are the main reasons that bind them together? • Apply the community mining techniques. 38
  • 39. Applications of Community Mining Algorithms 2. Discovering Scientific Collaboration Groups from Social Networks - Mining a scientific collaboration network 39
  • 40. Applications of Community Mining Algorithms 2. Discovering Scientific Collaboration Groups from Social Networks – Results Observed • The self-organized communities would provide the answer to the first question. • By referring to them, one can know the specific collaboration activities among these researchers. • After manually checking the profiles of members within different communities, an interesting fact has been confirmed. – Most of communities are organized according to the locations or the research interests of their respective members. • Answers the second question, i.e., researchers in adjacent locations and with common interests prefer to intensively collaborate with each other. 40
  • 41. Applications of Community Mining Algorithms 3.Mining Communities from Distributed and Dynamic Networks • Applications involve distributed and dynamically-evolving networks. – Resources and controls are not only decentralized but also updated frequently. • We need find a way to solve a more challenging NCMP. – To adaptively mine hidden communities from distributed and dynamic networks. • Solution is based on an Autonomy-Oriented Computing (AOC) approach. 41
  • 42. Applications of Community Mining Algorithms 3.Mining Communities from Distributed and Dynamic Networks • In AOC approach, a group of self-organizing agents are utilized. • The agents will rely only on their locally acquired information about networks. • Example: In Intelligent Portable Digital Assistants – Selecting individuals who have frequently contacted or been contacted with the user during a certain period of time – Taking the selected individuals as the input to an AOC-based algorithm – Ranking and recommending new persons 42
  • 43. Tools for Detecting Communities in Social Network 43
  • 44. Tools for Detecting Communities in Social Network • Several tools have been developed for detecting communities. • These are roughly classified into the following two categories: – Detecting communities from large-scale networks – Interactively analyzing communities from small networks 44
  • 45. Tools for Detecting Communities in Social Network Interactively analyzing communities from small networks • JUNG - http://jung.sourceforge.net/ • Netminer - http://www.netminer.com/NetMiner/overview 01.jsp • Pajek - http://vlado.fmf.unilj.si/pub/networks/pajek/ • igraph - http://igraph.sourceforge.net/ Demo - igraph • SONIVIS - http://www.sonivis.org/ • Commetrix - http://www.commetrix.de/ • NetworkWorkbench - http://nwb.slis.indiana.edu/ • visone - http://visone.info/ • Cfinder - http://www.cfinder.org/ 45
  • 46. Social Network Infrastructures and Communities Decentralized Online Social Networks 46
  • 47. Decentralized Online Social Networks Online social network (OSN) is an online platform that • Provides services for a user to build a public profile and to explicitly declare the connection between his or her profile with those of the other users; • Enables a user to share information and content with the chosen users or public; and • Supports the development and usage of social applications with which the user can interact and collaborate with both friends and strangers. 47
  • 48. Decentralized Online Social Networks • Current online social networks are extended in two main directions towards the capabilities of the provided services and the decentralization of the supporting infrastructures, 48
  • 49. Decentralized Online Social Networks • Disadvantages of Centralized Management of a Social Network – Performance scalability issues, – Frequent down-time, slowness and unresponsiveness – Increasing cost of management and maintaining the infrastructures – Lack of proper privacy preserving schemes • A decentralized online social network is an online social network implemented on a distributed information management platform, such as a network of trusted servers or a peer-to-peer systems. • In contrast to centralized OSNs where the vendor bears all the cost in providing the services, a distributed or peer-to-peer OSN offers a cost- effective alternative. 49
  • 50. Decentralized Online Social Networks Challenges for DOSNs • Storage - Where should content be stored? Should they be stored exclusively at nodes run by friends, or be encrypted and stored at random nodes?. The requirement for redundancy to provide availability of data depends to a large extent on the duration and distribution of time peers are online. • Updates - How can we deal with updates, e.g., status updates of friends?. In peer collaboration systems, updates, e.g., of a workplace, are sent to a small group of peers via a decentralized synchronization mechanism. • Topology - In pure file-sharing networks, the topology does not depend on whether the peers know each other and nodes exchange content with any other nodes in the network. At the other end of the spectrum, for collaboration or media sharing, they tend to consist of collaborative groups that are relatively closed circles, e.g., using a “ring of trust” or darknets. In contrast, online social networking services have overlapping circles. 50
  • 51. Decentralized Online Social Networks Challenges for DOSNs • Search, Addressing - One just needs to find some peer with the content it is looking for. Traditionally, peer identity is tied with an IP address which clearly is not sufficient. • Openness to new applications - One of the most alluring features of current online social networks is that they are open to third-party applications, which enables a constant change of what a social networking service provides to the users. In a decentralized environment, if some users choose to enable a third-party application, their choice should not affect other users or even users connected directly to them. • Security - For distributed storage with other peers that the user not necessarily wants to access data, the content has to be encrypted. To manage access to encrypted data, key distribution and maintenance have to be handled such that the social network group can access data. 51
  • 52. Decentralized Online Social Networks Challenges for DOSNs • Robustness - In a centralized system, one can turn to the provider in case of user misbehavior, there is usually a process defined for dealing with such complaints. In a decentralized system, there is no authority that can ban users for misbehavior or remove content. Robustness against free-riding. Once access to content is granted, it is difficult to revoke that right. When a user allows a friend to see a message, the friend can store the message and keep access to it even after a change of key. Trust has to be at least equal to assigned access rights, due to this difficulty. • Limited Peers - Two-tier system is that users can then participate in the social network with resource constrained (e.g., mobile) devices, which they may use as an auxiliary, even when they contribute resources to the core of the system with their primary device. • Locality - A distributed architecture also enables us to take advantage of geographic proximity and its correlation with local interests. The routers provide availability of data of local interest. This local interest can arise from the locality of events but also from the locality of typical real- life social networks of friends and neighbors. 52
  • 53. Decentralized Online Social Networks General Purpose DOSNs 53
  • 54. Decentralized Online Social Networks General Purpose DOSNs • The reference architecture consists of six layers and provides an architectural abstraction of variety of current related approaches to decentralized social networking in the research literature. • Physical communication network, which can be the Internet or a (mobile) ad hoc network (in case we consider a mobile online social network). • Distributed or P2P overlay management provides core functionalities to manage resources in the supporting infrastructure of the system, which can be a distributed network of trusted servers or a P2P overlay. Specifically, this layer provides higher layers the capabilities of looking up resources, routing messages, and retrieving information reliably and effectively among nodes in the overlay. 54
  • 55. Decentralized Online Social Networks General Purpose DOSNs • Decentralized data management layer, which implements functionalities of a distributed or peer-to-peer information system to query, insert, and update various persistent objects to the systems. • The capability to search the system (Distributed search) for relevant information, the management of users and shared space (User account and share space management), the management of security and access control issues (Trust management, Access control and security), the coordination and management of social applications developed by third parties (Application management). 55
  • 56. Decentralized Online Social Networks General Purpose DOSNs • It is expected that the social networking layer exposes and implements an application programming interface (API) to support the development of new applications as well as to enable the customization of the social network service to suit various preferences of the user. • The top layer of the architecture includes the user interface to the system and various applications built on top of the development platform provided by the DOSN. • Applications can be either implemented by the DOSN provider or developed by third-parties, and can be installed or removed from the system according to user’s preferences. 56
  • 57. Decentralized Online Social Networks Proposed DOSN Approaches • Safebook, an approach to provide a decentralized general purpose OSN, follows the main objective of protecting its users’ privacy. It considers adverse or erroneous behavior of a centralized service provider, possible adversaries which are misusing the functions of the social networking service, as well as external adversaries that could eavesdrop or modify data on the networking layer. • FOAF presents another practical approach to decentralize management of current social networks. The framework enables users to export their FOAF4 profiles, store them on dedicated trusted servers. Users query and manage these profiles through open Web-based protocols such as WebDAV5 or SPARQL/Update. 57
  • 58. Decentralized Online Social Networks Proposed DOSN Approaches • NEPOMUK7 is an on-going EU project with close relation to DOSN. The goal of NEPOMUK is to develop a middleware for sharing users’ desktops with friends for online collaborations and sharing of knowledge by exploiting Semantic Web technologies. • LifeSocial primarily aims at keeping social networking services scalable by distributing the load to their users’ resources. The main functional components of OSNs are data storage and interaction. • PeerSoN aims at keeping the features of OSNs but overcoming two limitations: privacy issues and the requirement of Internet connectivity for all transactions. It also enables direct exchange of data between devices. The main properties of PeerSoN are encryption, decentralization, and direct data exchange. 58
  • 59. Multi-Relational characterization of dynamic social network communities 59
  • 60. Multi-Relational characterization of dynamic social network communities • The characterization of communities in online social media is presented using computational approaches grounded on the observations from social science. Motivation: human community as meaning-making eco-system • The semantics is an emergent artifact of human activity that evolves over time. • Human activity is mostly social, and the social networks of human are conceivable loci for the construction of meaning. • Hence, it is crucial to identify real human networks as communities of people interacting with each other through meaningful social activities, and producing stable associations between concepts and artifacts in a coherent manner.
  • 61. Motivating applications • As new concepts emerge and evolve around real human networks, community discovery can result in new knowledge and provoke advancements in information search and decision-making. Example applications include: • Context-sensitive information search and recommendation: The discovered community around an information seeker can provide context (including objects, activities, time) that help identify most relevant information. • Content organization, tracking and monitoring: Community structure may be used to reflect the social sharing practice and facilitate the organization, tracking and monitoring of user-generated social media content. • Behavioral prediction: Community structure that accounts for inherent dependencies between individuals embedded in a social network can help understand and predict the behavioral dynamics of individuals. Multi-Relational characterization of dynamic social network communities
  • 62. • Data characteristics and challenges – Large volumes of social media data are being generated from various social media platforms including blogs, FaceBook, Twitter, Digg, Flickr. The key characteristics of online social media data include: • Voluminous: The amount of data and the rate of data production can be enormous. • Dynamic: Users’ online actions are constantly archived with timestamps. These online activity records enable a fine-grained observation on the dynamics of human interactions and interests. • Context-rich: Most social media platforms allow a wide array of actions for managing and sharing media objects Multi-Relational characterization of dynamic social network communities
  • 63. 1. Mutual awareness • It is a bi-directional relationship indicating how well a pair of bloggers is aware of each other, as fundamental property of a community. • The amount of mutual awareness is captured expanding on the entire network using a random walk based distance measure, commute time, which estimates the probability that two bloggers are aware of each other on the network. • The experimental results for community extraction in terms of standard evaluation metrics are promising. Multi-Relational characterization of dynamic social network communities - Approaches to three problems
  • 64. 2. FacetNet: • The community structure at a given timestep is determined both by the observed networked data and by the prior distribution given by historic community structures. • The experimental results suggest that this technique is scalable and is able to extract meaningful communities based on social media context. Multi-Relational characterization of dynamic social network communities - Approaches to three problems
  • 65. 3. MetaFac: • MetaFac is the first graph-based tensor factorization framework for analyzing the dynamics of heterogeneous social networks. • In this framework, metagraph, is a novel relational hypergraph representation for modeling multi-relational and multi-dimensional social data. • Extensive experiments on large-scale real-world social media data and from the enterprise data suggest that this technique is able to extract meaningful communities that are adaptive to social media context. Multi-Relational characterization of dynamic social network communities - Approaches to three problems
  • 66. (a) Mutual awareness – a bi-directional relationship indicating how well a pair of bloggers is aware of each other, as fundamental property of a community. (b) Mutual awareness expansion – a random walk based distance measure which estimates the probability that two bloggers are aware of each other on the network. (c) FacetNet – for analyzing communities and their evolutions in a unified process. (d) MetaFac – the first graph-based multi-tensor factorization framework for analyzing the dynamics of heterogeneous social networks. Multi-Relational characterization of dynamic social network communities - Approaches to three problems
  • 67. Multi-Relational characterization of dynamic social network communities - Approaches to three problems