We propose a framework to address an important challenge in the context of the ongoing adoption of the “Web 2.0” in science and research, often referred to as “Research 2.0”. Microblogging is one of the trends with increasing leverage. The challenge in this thesis is to connect users of microblogging services such as Twitter based on specific common entities that are representative of and truly matter to them. We investigated the possibilities of using social data for locating an expert who shares a very specific research topic. To enrich and verify this social data we link such content to existing open data provided by the online community. We use semantic technologies (RDF, SPARQL), common ontologies (SIOC, FOAF, Dublin Core, SWRC) and Linked Data (DBpedia, GeoNames, CoLinDa) to extract and mine data about scientific conferences out of the context of microblogs. We identify users related to each other based on entities such as topics (tags), events, time, locations and persons (mentions). As a proof of concept we explain, implement and evaluate such a researcher profiling use case. It involves the development of a framework that focuses on suggesting researchers based on topics and conferences they have in common. This framework provides an API that allows quick access to the analyzed information. A demonstration application, the “Researcher Affinity Browser”, shows how the API supports developers in building rich internet applications for Research 2.0. This application also introduces the concept of “affinity”, which exposes the implicit proximity between entities and users based on the content users produced. The usability of the demonstration application and the usefulness of the framework itself are investigated with an explicit evaluation questionnaire. This user feedback led to important conclusions about successful achievements and opportunities to further improve this effort.
Researcher Profiling based on Semantic Analysis in Social Networks
1. Researcher Profiling based on
Semantic Analysis in Social Networks
Laurens De Vocht
Supervisors: Gonzalo Parra, Selver Softic
Promotors: Erik Duval, Martin Ebner
July 1, 2011
4. Definitions
Profiling
“Inferring unobservable information about users from observable
information about them, that is their actions or their utterances.”
(Zukerman and Albrecht, 2001)
Semantic Analysis
“A technique using semantic-based tools and ontologies in
order to gain a deeper understanding of the information being
stored and manipulated in an existing system” (McComb, 2004)
5. Problem Statement
Web users generate a massive unstructured information flow.
Who has scientific information relevant for me?
6. Problem Statement
Connecting researchers based on shared scientific events (conferences)
[Diagram: Scientific Profiling — a Profiler/Analyzer links researchers (users), modeled with a user model and an event model, to scientific conferences and related resources.]
7. The Social Semantic Web
[Diagram: the Social Web (human process) meets the Semantic Web (machine process). A community of researchers with conference experience produces semi-structured information through (micro)blogging, sharing, tagging and discussion. A system turns this into clustered and analyzed data, which a (faceted) search engine and a recommendation engine expose to the larger population of people interested in scientific conferences.]
(Gruber, 2007)
12. The Social Semantic Web
‣Hashtags as Identifiers
‣not always strong or consistent enough
‣properties of good hashtags formalized
‣helpful in assessment of valuable identifiers
(Laniado and Mika, 2010)
‣Expert Search/Profiling with Linked Data
‣aggregate and analyze certain types of data
‣need to surpass limits of closed data sets
‣LOD delivers multi-purpose data
(Stankovic et al., 2010)
13. Scope & Value of the Study
‣Bridging research areas
Human-Computer Interaction & Semantic Analysis
‣Mining usable data
out of social networks (microblogs)
‣Integration
Social network data and linked open data
‣Framework driven methodology
based upon current state-of-the-art semantic tools
‣Evaluation
proof-of-concept Research 2.0 application
14. Solution
[Diagram: annotate data from social networks with community-approved ontologies (FOAF, SIOC), interlink it with Linked Open Data, and let applications on top of the Scientific Profiling Framework connect people and resources that share scientific affinities.]
16. Framework: Overview
[Diagram: the framework aggregates data from social networks (Twitter, via Grabeeter) and archived/cached linked data, annotates it, interlinks it with the Linked Open Data Cloud (DBpedia, CoLinDa, GeoNames), analyses the resulting Semantic Profiling Network, and publishes scientific information through the Scientific Profiling API in JSON and RDF (XML) output formats.]
32. Evaluation: Usability
‣Definitely useful application
‣Use of the map view makes sense
‣People - Event split confusing
‣View of own profile
‣not a suitable starting point
‣only useful in comparison
‣shouldn’t be always visible
‣Person-specific affinities
‣too much hidden
35. Evaluation: Usefulness
‣Relevance
Test users rate their search results
‣Satisfaction questionnaire
Targeted questions about usefulness
Allow comments on user interface
36. Evaluation: Usefulness
[Bar chart: number of users (0–4) reporting each share of relevant results, from 0% (None), 1–20% (A few), 21–40% (Less than one half), 41–60% (About one half), 61–80% (More than one half), 81–99% (Almost all), to 100% (All).]
37. Evaluation: Usefulness — Usefulness Questionnaire Results
[Chart: agreement ratings (1–5) for thirteen statements:]
‣ Concept affinity
‣ Clear view of affinities between people
‣ Map & plot combination understood
‣ Deactivating filter fast enough
‣ Activating filter fast enough
‣ Never usability glitches
‣ Convention between views understood
‣ Information display not overwhelming (confusing)
‣ Relevant detailed person info
‣ Shown details correspond with ‘real life’ activities
‣ Enough relevant (new) persons
‣ Daily updating of information obvious
‣ Twitter data made more useful for researchers
38. Evaluation: Discussion
‣ Affinities exposed in an engaging way
‣ Relevant users rating: either many common entities trigger a positive rating, or common entities start a deeper investigation
‣ Reliability of person details hard to verify
‣ UI satisfaction user dependent
‣ What does the user expect from “Affinity Browser”?
‣ Test different scenarios to identify usage types?
39. Future work
‣ Rank tags
by importance, not just frequency of use
‣ Visualization
improve viewing of links between users and entities
‣ Multiple Resources
better reliability and more verification of data
40. Conclusion
‣ Framework could support many social semantic-based applications
‣ Realized with current state-of-the-art technologies
‣ Interlinking with Linked Open Data Cloud enriches social network
data
‣ Researcher Affinity Browser
‣ Exposes affinities between users
‣ User feedback positively affirms the new view on social data
‣ Hashtags identified as conferences provide consistent links
Editor's notes
To make progress in research it is important to get in touch and share ideas with people who share affinities. One of the most visible trends on the internet is the emergence of “Social Web” sites. Current online community sites are isolated from one another. The main reason for this lack of interoperability is the fact that common standards for data interchange still have to arise.
We propose a framework to address an important issue in the context of the ongoing adoption of the “Web 2.0” in science and research, often referred to as “Science 2.0” or “Research 2.0”. A growing number of people are linked via acquaintances, and online social networks such as Twitter allow indirect access to a huge amount of ideas. These ideas are contained in a massive human information flow. That users of these networks produce relevant data has been shown in many studies. The problem, however, lies in discovering and verifying such a stream of unstructured data items. Another related problem is locating an expert who could provide an answer to a very specific research question.
The goal is to build a semantic profiling framework that can support applications and services that try to improve the connecting of researchers.
The main use case and application that the framework has to support is illustrated by what could be called “the conference case”. Scientists and researchers are interested in very specific topics; this is best verified by the conferences they are attending. Another trend is that they all blog and tweet about these events [14][10]. This creates huge opportunities for profiling. The attendees tweet about what they notice and what they remark as interesting for their own projects. What if we could connect these users using this information? We could call an application that does just that “Scientific Profiling”. This approach comes from the concept that the data produced in social networks can have true value if properly annotated and interlinked [5]. A second requirement is to create a suitable context in which this information can get meaning. This is very important to identify which ontologies should be used.
Social Semantic Web Application - A Collective Knowledge System.
The essential difference between the classic Web and the Semantic Web is that structured data is exposed in a structured way. For example, the classic Web might have a document that mentions a place, "Paris". The conventional way to find this document on the Web is to search for the term "Paris" in a search engine. Similarly, to find out more about the place one would plow through the search results on the term "Paris" and manually pick out the pages that seem to have something to do with the place. The heuristics employed by today's search engines for inferring what one means by the string "Paris" are biased by popularity, which means that one will encounter many pages about a celebrity heiress en route to the French capital.
The Semantic Web vision is to point to a representation of the entity, in this case a city, rather than its surface manifestation. Thus to find the city Paris, one would search for things known to be cities for entities whose names match "Paris", possibly limiting the results to cities of a certain size or in a particular country. Then one might look for information of the desired type about the city, such as maps, travel guides, restaurants, or famous people who lived in Paris during some period of history. The heuristics for searching the Semantic Web depend on conventions about how to represent things like cities (such as those specified in ontologies), and the availability of data which use these conventions. Such data is not available for most user contributions in the Social Web. To move to the next level of collective knowledge systems, it would be nice to get the benefits of structured data from the systems that give rise to the Social Web.
Gruber argues that the Social Web and the Semantic Web should be combined, and that collective knowledge systems are the "killer applications" of this integration. The keys to getting the most from collective knowledge systems, toward true collective intelligence, are tightly integrating user-contributed content and machine-gathered data, and harvesting the knowledge from this combination of unstructured and structured information.
Laniado and Mika found that not all hashtags are used in the same way: not all of them aggregate messages around a community or a topic, not all of them endure in time, and not all of them have an actual meaning. In this work they addressed the issue of evaluating Twitter hashtags as strong identifiers, as a first step towards bridging the gap between Twitter and the Semantic Web. The first contribution of this paper stands in the formalization of the problem, and in the elaboration of a number of desired properties for a good hashtag to serve as a URI: frequency, specificity, consistency in usage, and stability over time. Based on these data, they tested the results obtained with the algorithms described in their paper, showing how a combination of the proposed measures can help in the task of assessing which tags are more likely to represent valuable identifiers. These results are promising with respect to the perspective of anchoring Twitter hashtags to Semantic Web URIs, and of detecting concepts and entities worth treating as new identifiers.
The authors concluded that expert search and profiling systems aggregate and analyze certain types of data depending on the types of expertise hypotheses they use. Traditional approaches tend to retrieve their data from closed or limited data corpuses. LOD on the other hand allows querying the whole Web like a huge database, thus surpassing the limits of closed data sets and closed online communities. They believe that this opens new possibilities for traditional expert search and profiling systems, which usually rely only on data from their local and limited databases or on unstructured data gathered from the Web. LOD also holds great promise to deliver multi-purpose data that can be used to find experts in many domains and with many different expertise hypotheses.
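Two of the desired hashtag properties, frequency and stability over time, can be sketched as simple measures over a tweet stream. The following is an illustrative sketch only, not Laniado and Mika's actual algorithm; the function name and the equal weighting of the two measures are assumptions.

```python
from collections import Counter

def hashtag_scores(tweets):
    """Score hashtags on frequency (share of all tag uses) and stability
    (share of distinct days in which the tag appears).
    `tweets` is a list of (day, set_of_hashtags) pairs.
    The 50/50 weighting is an illustrative assumption."""
    freq = Counter()
    days = {}
    for day, tags in tweets:
        for tag in tags:
            freq[tag] += 1
            days.setdefault(tag, set()).add(day)
    total_uses = sum(freq.values()) or 1
    total_days = len({d for d, _ in tweets}) or 1
    return {
        tag: 0.5 * (freq[tag] / total_uses) + 0.5 * (len(days[tag]) / total_days)
        for tag in freq
    }

tweets = [
    (1, {"eswc2011", "linkeddata"}),
    (2, {"eswc2011"}),
    (3, {"eswc2011", "coffee"}),
]
scores = hashtag_scores(tweets)
# A tag that is both frequent and stable scores highest
best = max(scores, key=scores.get)
```

A conference tag used across all days of an event would thus outrank a tag that only spiked once, matching the intuition that good identifiers endure in time.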
In this paper they have explored the potentials and drawbacks of LOD in comparison to traditional data sources used for expert search. They have not only asked what LOD can do, but also what one can do for LOD to make it an even better source of expertise evidence.
The study spans two main areas: semantic analysis and usability. The current state-of-the-art Semantic Web standards and processes are used as a foundation for this study. Researcher profiling applications integrate human-computer interaction (HCI) and expert finding. Everybody who is interested in the Semantic Web, microblogging and profiling might find some parts of this thesis relevant.
The approach presented aims at gaining more knowledge and mining usable data out of social networks, especially microblogs, with a framework-driven methodology based upon Semantic Web standards and tools. Introducing the interesting aspects of microblogs, this thesis tries to answer how far they correspond with ideas from other research areas like Science 2.0, Research 2.0, the Semantic Web or Linked Data, and to outline the importance and relevance of such or similar efforts with examples and arguments from current research and current work.
It is to be noted that neither the literature study nor the software architecture aims to give a broad overview of current Semantic Web and microblogging services. It is targeted as a carefully considered selection of articles that allows the development of researcher profiling applications. The architecture of the framework is designed only with the problem statement in mind. At this time it is not part of the research to find out how this could be extended to other resources or targets (e.g. mobile applications). It focuses on the integration of user data from a microblogging service and domain knowledge from scientific conferences.
The Semantic Web technology stack is well defined, and applying frameworks such as SIOC (Semantically Interlinked Online Communities) [4] and FOAF (Friend-Of-A-Friend) [2] can lead to an interlinked and semantically rich knowledge source. This knowledge source will be built with user profiles and the content they produce on various social networks as a basis.
Twitter contains information on: people, organisations, locations, trends, …
The LOD Cloud contains billions of triples about: geolocations, data about science, government, common knowledge, persons, news, …
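The annotation step that SIOC and FOAF enable can be sketched with plain N-Triples output using only the standard library. The base URIs and the exact property mapping (sioc:Post, sioc:has_creator, foaf:Agent) below are illustrative assumptions, not the framework's actual mapping.

```python
def tweet_to_ntriples(user, tweet_id, text):
    """Annotate a single tweet as a sioc:Post created by a foaf:Agent.
    The example.org base URIs are hypothetical placeholders."""
    post = f"<http://example.org/tweet/{tweet_id}>"
    agent = f"<http://example.org/user/{user}>"
    # Minimal N-Triples string escaping for the literal content
    escaped = text.replace("\\", "\\\\").replace('"', '\\"')
    return [
        f"{post} <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://rdfs.org/sioc/ns#Post> .",
        f'{post} <http://rdfs.org/sioc/ns#content> "{escaped}" .',
        f"{post} <http://rdfs.org/sioc/ns#has_creator> {agent} .",
        f"{agent} <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://xmlns.com/foaf/0.1/Agent> .",
    ]

triples = tweet_to_ntriples("laurens", "42", "Great keynote at #eswc2011")
```

Once many tweets are triplified this way, the result can be loaded into any triple store and queried together with LOD Cloud data.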
The idea is to design, develop and implement a framework that collects data from social networks and uses community-approved ontologies and linked open data to analyze and verify the data.
Aggregate your tweets and search in your tweets offline using the Grabeeter client. Grabeeter [45] is an application that allows you to search the tweets of a single Twitter user online and offline. In contrast to the Twitter API, Grabeeter provides all stored tweets and imposes no restriction over time. The Grabeeter web application uses the Twitter API to retrieve tweets of predefined users. Tweets are stored in the Grabeeter database and on the file system as an Apache Lucene [2] index. In order to ensure an efficient search, tweets must be indexed.
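Grabeeter's offline search rests on an inverted index (Apache Lucene). A minimal in-memory sketch of that idea, purely for illustration and in no way Grabeeter's actual code, could look like:

```python
from collections import defaultdict

class TweetIndex:
    """A tiny inverted index: every stored tweet stays searchable
    with no time restriction, mirroring the Grabeeter idea."""

    def __init__(self):
        self._postings = defaultdict(set)  # term -> set of tweet ids
        self._tweets = {}                  # tweet id -> original text

    def add(self, tweet_id, text):
        self._tweets[tweet_id] = text
        for term in text.lower().split():
            self._postings[term].add(tweet_id)

    def search(self, term):
        ids = sorted(self._postings.get(term.lower(), set()))
        return [self._tweets[i] for i in ids]

idx = TweetIndex()
idx.add(1, "Attending ESWC2011 in Crete")
idx.add(2, "Linked Data tutorial was great")
hits = idx.search("eswc2011")
```

A real Lucene index adds tokenization, ranking and on-disk persistence, but the lookup structure is the same.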
The semantic profiling framework has to support a Scientific Profiling application as was explained in the problem statement in chapter 1. The framework architecture consists of three layers:
1. Extraction layer: extracts data from various resources and annotates it using relevant ontologies for that specific data context.
2. Interlinking layer: is fed with annotated data (triples) and creates a SPARQL endpoint for it. It is responsible for requesting more data if needed for a certain information query. It parses high-level queries and translates them to SPARQL queries. The results are then returned.
3. Analysis layer: here user information needs are translated into high-level queries that the interlinking layer understands. It also contains some metrics to rank and evaluate the returned results.
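The translation the Interlinking layer performs, from a high-level request to a SPARQL query, can be sketched as follows. The property URI and the query shape are illustrative assumptions, not the framework's actual queries.

```python
def related_persons_query(user_uri, limit=10):
    """Translate the high-level request 'who shares tags with this user?'
    into SPARQL. The http://example.org/hasTag property is a hypothetical
    placeholder for whatever tagging property the triple store uses."""
    return (
        "SELECT DISTINCT ?other WHERE {\n"
        f"  <{user_uri}> <http://example.org/hasTag> ?tag .\n"
        "  ?other <http://example.org/hasTag> ?tag .\n"
        f"  FILTER (?other != <{user_uri}>)\n"
        f"}} LIMIT {limit}"
    )

q = related_persons_query("http://example.org/user/laurens", limit=5)
```

The Analysis layer would send such a query to the SPARQL endpoint and rank the returned bindings with its metrics.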
The user profile
Related entities for a user
Suggested conferences for a user
Suggested users & info for a specific event
The test application is deployed on the Google App Engine server. This makes the maintenance straightforward and the deployment simple.
Grabeeter consists of several scripts that crawl the registered users' Twitter accounts. Everything is stored in a MySQL database. Requests from the Semantic Profiling network query the Grabeeter MySQL database directly.
The semantic profiling server has several scripts that maintain the high-level functionality. Two scripts are run periodically to keep the linked data network up to date. Other scripts realize the API functionality.
The “provider” script checks the Grabeeter database for new users. If there are new users, their data is passed on to the Extraction module for annotation and triplification. For existing users, new tweets are fetched and triplified.
The “interlinking” script goes through all tags and first compares them against the CoLinDa repository. Any found conference tags are annotated appropriately. Secondly, the script checks whether tags represent a location or a common knowledge entity.
The scripts “person”, “event” and “discovery” implement the API functionality. They use the arguments given by the REST call. Each call returns a JSON object containing the result. The script “allusers” returns a JSON array that contains all users currently in the system.
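A client of these API scripts would parse the returned JSON object. The payload below is a made-up example; its field names ("user", "affinities", "score") are assumptions for illustration and the real API may differ.

```python
import json

# Hypothetical response of the "person" API script
raw = json.dumps({
    "user": "laurens",
    "affinities": [
        {"entity": "eswc2011", "type": "conference", "score": 0.8},
        {"entity": "ghent", "type": "location", "score": 0.4},
    ],
})

def top_affinity(payload):
    """Pick the strongest affinity out of a person-call JSON object."""
    data = json.loads(payload)
    return max(data["affinities"], key=lambda a: a["score"])["entity"]

best = top_affinity(raw)
```

An application such as the Researcher Affinity Browser would use calls like this to decide which entities to show most prominently for a person.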
Avoid the user interface becoming an issue (hide it). The focus should be on the data.
Fixed data: users are presented a role and have to find a good matching conference and expert.
Affinities as a starting point. Affinities are now facets to filter the result list of people. Instead of popup windows, tabs with details about each user appear at the bottom.
Positive agreement among users:
1: Concept affinity
3: Understandable combination with affinity plot
7: Convention between views understood
13: Twitter data is made more useful for researchers

No agreement among users:
2: Clear view of affinities between people
4, 5: Filter (de)activation
6: Never usability glitches
8: Information display not overwhelming/confusing
12: Daily updates of information obvious
The more resources, the more types of entities can be interlinked to improve the verifiability of the results. The framework can easily be enriched with additional RDF resources; a new handle in the Interlinking module suffices. Somewhat more effort is needed to add data from a source that is not yet available as RDF. In that case it is necessary to write an additional Model class for the Extraction module and a handle in the Annotator class that includes data from that module by annotating it appropriately. This process is completely comparable to the extraction of Twitter data presented in this thesis. On the high level, new functionality can easily be added by proper translation into SPARQL queries. As more different data models and resources become available, it might be of interest to extend the API as such. Again the same approach can be used as for the discovery and presentation of persons and scientific events.
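The extension point described above, one handle per resource type, can be sketched as a small registry. The class and handle names and the triple shape are illustrative assumptions, not the framework's actual Annotator.

```python
class Annotator:
    """Sketch of a pluggable annotator: each data source registers a
    handle that turns its raw records into triples. Adding a new
    (non-RDF) source then only requires registering one extra handle."""

    def __init__(self):
        self._handles = {}

    def register(self, source, handle):
        self._handles[source] = handle

    def annotate(self, source, record):
        return self._handles[source](record)

annotator = Annotator()
# Hypothetical Twitter handle: one sioc:content triple per tweet record
annotator.register(
    "twitter",
    lambda r: [("tweet:" + r["id"], "sioc:content", r["text"])],
)
triples = annotator.annotate("twitter", {"id": "42", "text": "hello"})
```

A handle for, say, a publication database would be registered the same way, which is what makes the process "completely comparable" across sources.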
The framework serves as a powerful backend for a web service. In the requirements of the current framework we focused on the ability to extract, annotate and interlink data from Twitter and to make the linked data available as a SPARQL endpoint and a web service that allows high-level requests. The architecture is based on state-of-the-art technologies and brings in a novel approach to the usage and dissemination of knowledge accumulated in social networks. It uses semantic tools and techniques for application domains like Research 2.0.
The web service behaves as a REST API and can support applications that want to propose interesting people or interesting scientific events to their users. It is possible to create an application that connects people who attend or mention the same scientific conference, as soon as they both have made their social data available to the system. We have shown that the enrichment of social network data with linked data leads to a verifiable user profile that allows comparison with others alike.
The demonstration application introduces the concept “affinity”. The concept has only been used a few times before, but for a similar purpose: to expose an otherwise hidden proximity to or liking for specific aspects. The usefulness of this approach and its presentation has been reviewed positively by test users from the target group, researchers. They appreciated the use of affinities. Their feedback exposed what we learned in theory from the literature study: the use of linked data shapes a whole new view on existing social data. By interlinking tags to scientific conferences we are able to display verified entities. We noted for example that the choice for hash tags lead to enough identified