Vector Databases 101 - An introduction to the world of Vector Databases
Design and Prototyping of a Social Media Observatory
1. Design and Prototyping of
a Social Media Observatory
Karissa McKelvey and Filippo Menczer
Center for Complex Networks and Systems Research
School of Informatics and Computing
Indiana University, Bloomington
1
2. Can we use social media as
laboratories for social
science?
truthy.indiana.edu 2
3. Political Polarization on Twitter Michael Conover, Jacob Ratkiewicz, Bruno Gonçalves, Alessandro Flammini
& Filippo Menczer International Conference on Weblogs and Social Media 2011
truthy.indiana.edu 3
15. Reliability
• Spam and misinformation
• Cleansing and tagging by social, algorithmic,
or other means
• Sampling bias
truthy.indiana.edu 15
16. Data Collection
• Twitter Streaming API, random sample
• August, 2010 – present
• 5TB Compressed
• Real-time access to data from last 9 months
related to 3 themes: US Politics, Social
Movements, News
truthy.indiana.edu 16
19. Model
• Events
– Post on a social media site
• Users
– Actors in the post
– Sender, receiver, forwarder, etc
• Meme
– Discernable unit of information transfer
– Eg, hashtag, URL, user, phrase…
truthy.indiana.edu 19
26. Open Access
• A public and free -- or low-cost -- social
observatory enables access to large-scale
social media data analytics for non-profit
endeavors.
truthy.indiana.edu 26
27. API & Website
• Filterable and
searchable interface
to find memes of
interest
• Endpoints for users
to access data
programmatically
• Visualizations
truthy.indiana.edu/apidoc
truthy.indiana.edu 27
31. Thanks!
Papers at cnets.indiana.edu/groups/nan/truthy
S
andro Flammini
Bruno Conçalves
J
acob Ratkiewicz
LilianWeng
Mike Conover
J
ohan Bollen
KarissaMcKelvey
Przemek Grabowicz
Mark Meiss
AlexVespignani
Alex Rudnick
LucaAiello
Fil Menczer
Mohsen J
afari-Asbagh
Onur Varol
Emilio Ferrara
Wednesday, September 26, 12
Notes de l'éditeur
Are there well-defined communication behaviors that characterize
the activities of influential actors?
What is the role of bridging users in facilitating information
transfer between ideologically opposed communities?
Who are the opinion leaders, and how do
they engage in frame-making and agenda-setting?