SlideShare une entreprise Scribd logo
1  sur  22
Télécharger pour lire hors ligne
Leveraging Microblogs
                                               for Resource Ranking

Tomáš Majer, Marián Šimko
tomasmajer@gmail.com, simko@fiit.stuba.sk


23.01.2012, SOFSEM 2012
Institute of Informatics and Software Engineering
Faculty of Informatics and Information Technologies
Slovak University of Technology in Bratislava
Microblog
• a brief form of a blog with limited size of a post,
  typically 140 characters of length
• sharing experiences, opinions, comments, links
• Twitter, Identi.ca, Jaiku,   (Google+, Facebook)


• Twitter:
  ▫ over 300 millions of users   (100 mil. at the time of writing paper)
  ▫ over 300 millions of tweets/day     (100 mil. --||--)
Microblog – why important?
• „Read-Write Web“ vision
• user-generated data
 ▫ unbiased
 ▫ not moderated


• huge amount of data – text, links
• valuable source of information
 ▫ data mining
Microblog – why important?
• „Read-Write Web“ vision
• user-generated data
 ▫ unbiased
 ▫ not moderated
                                      22 % of posts


• huge amount of data – text, links
• valuable source of information
 ▫ data mining
State-of-the-Art
• topic identification/keyword extraction             (Ramage et al., 2010)

• opinion mining    (Pendey, Iyer, 2009)

• twitter search, tweets ranking           (Teevan et al., 2011)

• user ranking   (Gayo-Avello, 2010)

• user modeling    (Abel et al., 2011)

•…
• challenge: resource ranking
Twitter Graph
                              T1
                                          R1
       U1
                         T2
                                    R2

               T3
       U2                                  R3
                               T4

                    T5
                                     R4

       U3
                          T6
                                          R5
                    T7


       Users         Tweets         Resources
Twitter Graph
                                  T1
                                              R1
 posts     U1
                             T2                     contains
                                        R2

                   T3
           U2                                  R3
                                   T4

                        T5
                                         R4

           U3                                       re-posts
 follows                      T6
                                              R5
                        T7


           Users         Tweets         Resources
Resource Ranking: Overview
• TweetRank
 ▫ ranking of a resource based on Twitter graph analysis


• Computation
 1. UserRank
 2. TweetRelevance
 3. TweetRank
Resource Ranking: Overview
• TweetRank
 ▫ ranking of a resource based on Twitter graph analysis


• Computation                                   T1
                                                           R1


 1. UserRank
                            U1
                                           T2
                                                      R2

                                 T3


 2. TweetRelevance
                            U2                              R3
                                                 T4

                                      T5
                                                      R4



 3. TweetRank
                            U3
                                            T6
                                                           R5
                                      T7
UserRank



                                1+γ (u)UserRank ( f )
 UserRank (u) = ∑
               f ∈ followers(u)   ∣ followers( f )∣
UserRank
                                        ∣ followers(u)∣
                                   γ(u)=
                                           ∣tweets(u)∣


                                1+γ (u)UserRank ( f )
 UserRank (u) = ∑
               f ∈ followers(u)   ∣ followers( f )∣
TweetRelevance, TweetRank


                      UserRank ( Author (t))
 TweetRelevance (t) =
                       ∣tweets( Author (t ))∣
TweetRelevance, TweetRank


                      UserRank ( Author (t))
 TweetRelevance (t) =                         = TR(t)
                       ∣tweets( Author (t ))∣


 TweetRank (r) =      ∑
                   t ∈tweets(r )   (   TR(t)+        ∑
                                                rt ∈retweets(t)
                                                                  TR(t)TR(rt)
                                                                                )
Evaluation
1. TweetRank ranking vs. explicit user ranking (YouTube )
2. Search results ranking study (Search)


• Data:
 ▫ 1,997,466 tweets from 367,824 users
    85 % in English
 ▫ 1,468,365,182 connections between 40,103,281 users
 ▫ 1,150,168 unique web links
    3 % of them: YouTube
Computing TweetRank
• TweetRank computed for each resource
 ▫ power-law distribution
   # of resources




                            TweetRank intervals
Experiment YouTube
• YouTube – explicit user rating (Y1Rank)
 ▫ positive/negative vote, normalized


• TweetRank vs. Y1Rank
                             TweetRank, Y1Rank
 ▫ correlation coefficient
   r = 0.02




                                                 YouTube videos
Experiment YouTube 2
• YouTube – application-collected user rating (Y2Rank)
 ▫ „how do you like the video“?
 ▫ 5 degree scale (1-best, 5-worst), 70 participants


• TweetRank vs. Y2Rank
 ▫ Kendall rank correlation coefficient τ= 0.125


• Relative video ranking
Experiment YouTube 2
 ▫ 5-tuplets
Experiment YouTube 2
 ▫ pairs
Experiment Search
• 20,000 resources
• indexing: SOLR (Apache Lucene)
• searching: resource ranking extended with TweetRank


• search results manual comparison
 ▫ 20 randomly selected queries (e.g.: „apple“)
 ▫ analyzed top-k results
Experiment Search
• findings:
  ▫ in general, „newer“ resources rank better
  ▫ ranking does not reflect chronological ordering of
    resources


• suitable for sorting search results within a predefined
  time window
Conclusions
• microblog
 ▫ perspective source of data, information, knowledge
• we proposed novel method for resource ranking
  leveraging microblog network
• an important additional knowledge from the crowd
 ▫ a form of indirect explicit user rating
• great potential for search improvement
 ▫ reflects temporal characteristics (not linearly)
 ▫ sorting results within a predefined time window

Contenu connexe

Dernier

Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfNeo4j
 
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterMydbops
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersNicole Novielli
 
React Native vs Ionic - The Best Mobile App Framework
React Native vs Ionic - The Best Mobile App FrameworkReact Native vs Ionic - The Best Mobile App Framework
React Native vs Ionic - The Best Mobile App FrameworkPixlogix Infotech
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationKnoldus Inc.
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfpanagenda
 
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality AssuranceInflectra
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
Glenn Lazarus- Why Your Observability Strategy Needs Security Observability
Glenn Lazarus- Why Your Observability Strategy Needs Security ObservabilityGlenn Lazarus- Why Your Observability Strategy Needs Security Observability
Glenn Lazarus- Why Your Observability Strategy Needs Security Observabilityitnewsafrica
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Strongerpanagenda
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Farhan Tariq
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI AgeCprime
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesThousandEyes
 
Design pattern talk by Kaya Weers - 2024 (v2)
Design pattern talk by Kaya Weers - 2024 (v2)Design pattern talk by Kaya Weers - 2024 (v2)
Design pattern talk by Kaya Weers - 2024 (v2)Kaya Weers
 
2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch TuesdayIvanti
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesKari Kakkonen
 
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotesMuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotesManik S Magar
 
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...Nikki Chapple
 

Dernier (20)

Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdf
 
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL Router
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software Developers
 
React Native vs Ionic - The Best Mobile App Framework
React Native vs Ionic - The Best Mobile App FrameworkReact Native vs Ionic - The Best Mobile App Framework
React Native vs Ionic - The Best Mobile App Framework
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog Presentation
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
 
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
Glenn Lazarus- Why Your Observability Strategy Needs Security Observability
Glenn Lazarus- Why Your Observability Strategy Needs Security ObservabilityGlenn Lazarus- Why Your Observability Strategy Needs Security Observability
Glenn Lazarus- Why Your Observability Strategy Needs Security Observability
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI Age
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
 
Design pattern talk by Kaya Weers - 2024 (v2)
Design pattern talk by Kaya Weers - 2024 (v2)Design pattern talk by Kaya Weers - 2024 (v2)
Design pattern talk by Kaya Weers - 2024 (v2)
 
2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch Tuesday
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examples
 
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotesMuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
 
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
 

En vedette

AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfmarketingartwork
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024Neil Kimberley
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)contently
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024Albert Qian
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsKurio // The Social Media Age(ncy)
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Search Engine Journal
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summarySpeakerHub
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next Tessa Mero
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentLily Ray
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best PracticesVit Horky
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project managementMindGenius
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...RachelPearson36
 
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Applitools
 
12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at WorkGetSmarter
 

En vedette (20)

AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
 
Skeleton Culture Code
Skeleton Culture CodeSkeleton Culture Code
Skeleton Culture Code
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
 
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
 
12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work
 
ChatGPT webinar slides
ChatGPT webinar slidesChatGPT webinar slides
ChatGPT webinar slides
 
More than Just Lines on a Map: Best Practices for U.S Bike Routes
More than Just Lines on a Map: Best Practices for U.S Bike RoutesMore than Just Lines on a Map: Best Practices for U.S Bike Routes
More than Just Lines on a Map: Best Practices for U.S Bike Routes
 

Leveraging microblogs for resource ranking

  • 1. Leveraging Microblogs for Resource Ranking Tomáš Majer, Marián Šimko tomasmajer@gmail.com, simko@fiit.stuba.sk 23.01.2012, SOFSEM 2012 Institute of Informatics and Software Engineering Faculty of Informatics and Information Technologies Slovak University of Technology in Bratislava
  • 2. Microblog • a brief form of a blog with limited size of a post, typically 140 characters of length • sharing experiences, opinions, comments, links • Twitter, Identi.ca, Jaiku, (Google+, Facebook) • Twitter: ▫ over 300 millions of users (100 mil. at the time of writing paper) ▫ over 300 millions of tweets/day (100 mil. --||--)
  • 3. Microblog – why important? • „Read-Write Web“ vision • user-generated data ▫ unbiased ▫ not moderated • huge amount of data – text, links • valuable source of information ▫ data mining
  • 4. Microblog – why important? • „Read-Write Web“ vision • user-generated data ▫ unbiased ▫ not moderated 22 % of posts • huge amount of data – text, links • valuable source of information ▫ data mining
  • 5. State-of-the-Art • topic identification/keyword extraction (Ramage et al., 2010) • opinion mining (Pendey, Iyer, 2009) • twitter search, tweets ranking (Teevan et al., 2011) • user ranking (Gayo-Avello, 2010) • user modeling (Abel et al., 2011) •… • challenge: resource ranking
  • 6. Twitter Graph T1 R1 U1 T2 R2 T3 U2 R3 T4 T5 R4 U3 T6 R5 T7 Users Tweets Resources
  • 7. Twitter Graph T1 R1 posts U1 T2 contains R2 T3 U2 R3 T4 T5 R4 U3 re-posts follows T6 R5 T7 Users Tweets Resources
  • 8. Resource Ranking: Overview • TweetRank ▫ ranking of a resource based on Twitter graph analysis • Computation 1. UserRank 2. TweetRelevance 3. TweetRank
  • 9. Resource Ranking: Overview • TweetRank ▫ ranking of a resource based on Twitter graph analysis • Computation T1 R1 1. UserRank U1 T2 R2 T3 2. TweetRelevance U2 R3 T4 T5 R4 3. TweetRank U3 T6 R5 T7
  • 10. UserRank 1+γ (u)UserRank ( f ) UserRank (u) = ∑ f ∈ followers(u) ∣ followers( f )∣
  • 11. UserRank ∣ followers(u)∣ γ(u)= ∣tweets(u)∣ 1+γ (u)UserRank ( f ) UserRank (u) = ∑ f ∈ followers(u) ∣ followers( f )∣
  • 12. TweetRelevance, TweetRank UserRank ( Author (t)) TweetRelevance (t) = ∣tweets( Author (t ))∣
  • 13. TweetRelevance, TweetRank UserRank ( Author (t)) TweetRelevance (t) = = TR(t) ∣tweets( Author (t ))∣ TweetRank (r) = ∑ t ∈tweets(r ) ( TR(t)+ ∑ rt ∈retweets(t) TR(t)TR(rt) )
  • 14. Evaluation 1. TweetRank ranking vs. explicit user ranking (YouTube ) 2. Search results ranking study (Search) • Data: ▫ 1,997,466 tweets from 367,824 users  85 % in English ▫ 1,468,365,182 connections between 40,103,281 users ▫ 1,150,168 unique web links  3 % of them: YouTube
  • 15. Computing TweetRank • TweetRank computed for each resource ▫ power-law distribution # of resources TweetRank intervals
  • 16. Experiment YouTube • YouTube – explicit user rating (Y1Rank) ▫ positive/negative vote, normalized • TweetRank vs. Y1Rank TweetRank, Y1Rank ▫ correlation coefficient r = 0.02 YouTube videos
  • 17. Experiment YouTube 2 • YouTube – application-collected user rating (Y2Rank) ▫ „how do you like the video“? ▫ 5 degree scale (1-best, 5-worst), 70 participants • TweetRank vs. Y2Rank ▫ Kendall rank correlation coefficient τ= 0.125 • Relative video ranking
  • 18. Experiment YouTube 2 ▫ 5-tuplets
  • 19. Experiment YouTube 2 ▫ pairs
  • 20. Experiment Search • 20,000 resources • indexing: SOLR (Apache Lucene) • searching: resource ranking extended with TweetRank • search results manual comparison ▫ 20 randomly selected queries (e.g.: „apple“) ▫ analyzed top-k results
  • 21. Experiment Search • findings: ▫ in general, „newer“ resources rank better ▫ ranking does not reflect chronological ordering of resources • suitable for sorting search results within a predefined time window
  • 22. Conclusions • microblog ▫ perspective source of data, information, knowledge • we proposed novel method for resource ranking leveraging microblog network • an important additional knowledge from the crowd ▫ a form of indirect explicit user rating • great potential for search improvement ▫ reflects temporal characteristics (not linearly) ▫ sorting results within a predefined time window