SlideShare a Scribd company logo
1 of 7
Download to read offline
Twitter is noisy
but you can find some diamonds
Pew Internet report:


“75% of online news consumers say they get
  news forwarded through email or posts on
  social networking sites
and 52% say they share links to news with
  others via those means.”
Twitter Lists
• Filtering main friends timeline is a bad idea
• Twitter Lists: manually created set of users who
  often post on a certain topic
• For example:
   – @huffingtonpost/apple-news
   – @IndieFlix/film-people-to-follow
   – @alisohani/bigdata-analytics
• A Twitter user can be included into different lists.
• Me for example:
  http://twitter.com/mariagrineva/lists/memberships
What kind of noise?
• People tweet on other topics too, including
  personal stuff



• Global news widely spread, often really
  annoying: IPad launch, ash clouds, Christmas,
  Michael Jackson
Our Approach
• Identifying niche topic of Twitter list
  automatically, at real-time
• Improve the niche topic with respect to the
  Global Twitter Stream
  – If there is a burst related to Apple, IPad => check
    maybe all Twitter is talking about that
Filtering = Classification
• Traditional approaches to filter news use only
  textual features
• We use both textual and social features for
  classification
  – Twitter lists is a community of interconnected
    users => see who is the center and who is an
    outsider
What is done
• Method for identification list’s topic
  signature with respect to Global Twitter
  Stream
• Social features identification
• Evaluation framework

More Related Content

Viewers also liked

Analytics for the Real-Time Web
Analytics for the Real-Time WebAnalytics for the Real-Time Web
Analytics for the Real-Time Web
maria.grineva
 
Semantic Data Search and Analysis Using Web-based User-Generated Knowledge Bases
Semantic Data Search and Analysis Using Web-based User-Generated Knowledge BasesSemantic Data Search and Analysis Using Web-based User-Generated Knowledge Bases
Semantic Data Search and Analysis Using Web-based User-Generated Knowledge Bases
maria.grineva
 
XQuery Triggers in Native XML Database Sedna
XQuery Triggers in Native XML Database SednaXQuery Triggers in Native XML Database Sedna
XQuery Triggers in Native XML Database Sedna
maria.grineva
 
Architecture of Native XML Database Sedna
Architecture of Native XML Database SednaArchitecture of Native XML Database Sedna
Architecture of Native XML Database Sedna
maria.grineva
 
Tqr 2013 probes proxies
Tqr 2013 probes proxies Tqr 2013 probes proxies
Tqr 2013 probes proxies
An Jacobs
 
Analytics for the Real-Time Web
Analytics for the Real-Time WebAnalytics for the Real-Time Web
Analytics for the Real-Time Web
maria.grineva
 

Viewers also liked (12)

Analytics for the Real-Time Web
Analytics for the Real-Time WebAnalytics for the Real-Time Web
Analytics for the Real-Time Web
 
Semantic Data Search and Analysis Using Web-based User-Generated Knowledge Bases
Semantic Data Search and Analysis Using Web-based User-Generated Knowledge BasesSemantic Data Search and Analysis Using Web-based User-Generated Knowledge Bases
Semantic Data Search and Analysis Using Web-based User-Generated Knowledge Bases
 
XQuery Triggers in Native XML Database Sedna
XQuery Triggers in Native XML Database SednaXQuery Triggers in Native XML Database Sedna
XQuery Triggers in Native XML Database Sedna
 
Architecture of Native XML Database Sedna
Architecture of Native XML Database SednaArchitecture of Native XML Database Sedna
Architecture of Native XML Database Sedna
 
Tqr 2013 probes proxies
Tqr 2013 probes proxies Tqr 2013 probes proxies
Tqr 2013 probes proxies
 
Строителство Градът
Строителство ГрадътСтроителство Градът
Строителство Градът
 
Using Social Media to Build Your Business
Using Social Media to Build Your BusinessUsing Social Media to Build Your Business
Using Social Media to Build Your Business
 
Marketing Success in 7 Steps Guaranteed
Marketing Success in 7 Steps GuaranteedMarketing Success in 7 Steps Guaranteed
Marketing Success in 7 Steps Guaranteed
 
Development and Testing of a Conceptual Framework for Asking about Intoxication
Development and Testing of a Conceptual Framework for Asking about IntoxicationDevelopment and Testing of a Conceptual Framework for Asking about Intoxication
Development and Testing of a Conceptual Framework for Asking about Intoxication
 
The Social Research Group
The Social Research GroupThe Social Research Group
The Social Research Group
 
Defining the User Experience of Emotion & Content
Defining the User Experience of Emotion & ContentDefining the User Experience of Emotion & Content
Defining the User Experience of Emotion & Content
 
Analytics for the Real-Time Web
Analytics for the Real-Time WebAnalytics for the Real-Time Web
Analytics for the Real-Time Web
 

Recently uploaded

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

Recently uploaded (20)

Spring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUKSpring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
Cyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdfCyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdf
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 

Filtering Twitter

  • 1. Twitter is noisy but you can find some diamonds
  • 2. Pew Internet report: “75% of online news consumers say they get news forwarded through email or posts on social networking sites and 52% say they share links to news with others via those means.”
  • 3. Twitter Lists • Filtering main friends timeline is a bad idea • Twitter Lists: manually created set of users who often post on a certain topic • For example: – @huffingtonpost/apple-news – @IndieFlix/film-people-to-follow – @alisohani/bigdata-analytics • A Twitter user can be included into different lists. • Me for example: http://twitter.com/mariagrineva/lists/memberships
  • 4. What kind of noise? • People tweet on other topics too, including personal stuff • Global news widely spread, often really annoying: IPad launch, ash clouds, Christmas, Michael Jackson
  • 5. Our Approach • Identifying niche topic of Twitter list automatically, at real-time • Improve the niche topic with respect to the Global Twitter Stream – If there is a burst related to Apple, IPad => check maybe all Twitter is talking about that
  • 6. Filtering = Classification • Traditional approaches to filter news use only textual features • We use both textual and social features for classification – Twitter lists is a community of interconnected users => see who is the center and who is an outsider
  • 7. What is done • Method for identification list’s topic signature with respect to Global Twitter Stream • Social features identification • Evaluation framework