Detection of Spam Tipping Behaviour on Foursquare
Anupama Aggarwal¶, Prof P. Kumaraguru “PK”¶, Prof J. Almeida*
¶ Indraprastha Institute of Information Technology (IIIT-Delhi, India)
* Universidade Federal de Minas Gerais (UFMG, Brazil)

1
Foursquare 101
‣ Location Based Social Network
‣ 33 Million Users *
‣ 3.5 Billion check-ins *
‣ 31% of mobile social media users use Foursquare *

* As of January 2013

2
Foursquare 101
[Screenshots: Foursquare as a location-sharing OSN, showing friends' activity, your last check-in, venue, friends suggestions, and venue suggestions]
Tip: suggested activity for a venue
Tip can be Liked or Saved
Spam Tips
‣ Advertising / Marketing
‣ Tips unrelated to Venue
‣ Scam / Phishing

4
Spam according to Foursquare ToS
‣ Tips with links to websites selling software, realtor contact
info, a listing for your business, or other promotion
‣ Tips with inappropriate language or negativity directed at
another person
‣ Unauthorized or unsolicited advertising, junk
5
Contributions
‣ Characterizing irregular user behaviour
  ‣ We observed different categories of spam users
  ‣ We characterize features distinguishing these spam users
‣ Automatic detection of spammers
  ‣ Distinguish between spam and legitimate Foursquare users
  ‣ Cluster spam users into different categories according to their behaviour

6
Data Crawling

2,400,594 tips
613,298 users

7
Observed Categories of Spam Users
‣ Marketing: These users post tips to promote and advertise a specific product / brand / venue / external URL
‣ Malicious: These users post external URLs in tips that lead to spam / phishing / malware websites
‣ Abusive / Derogatory: These users try to deface or bad-mouth another person
‣ Self Promotion: These users try to draw attention to themselves
8
Ground Truth Data
Annotation Portal

2,000 Legitimate users
1,900 Spammers
9
Features used to detect Spammers
‣ User Attributes
  ‣ Properties of the Foursquare user's profile and check-ins
‣ Social Attributes
  ‣ Friends network of the Foursquare user under inspection
‣ Content Attributes
  ‣ Details about Tips posted by the Foursquare user
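As a rough illustration of content attributes, the sketch below derives a few per-user values from a list of tip strings: number of URLs, average words and characters per tip, and average phone numbers per tip. The regular expressions and the function name `content_features` are assumptions made for illustration; the slides do not state how these values were extracted.

```python
import re

# Assumed patterns for illustration; real tips would need more robust matching.
URL_RE = re.compile(r"https?://\S+|www\.\S+", re.IGNORECASE)
PHONE_RE = re.compile(r"\+?\d[\d\s().-]{7,}\d")

def content_features(tips):
    """Return simple content attributes for one user's list of tip strings."""
    n = len(tips)
    if n == 0:
        return {"num_urls": 0, "avg_words": 0.0, "avg_chars": 0.0,
                "avg_phone_numbers": 0.0}
    num_urls = sum(len(URL_RE.findall(t)) for t in tips)
    num_phones = sum(len(PHONE_RE.findall(t)) for t in tips)
    return {
        "num_urls": num_urls,
        "avg_words": sum(len(t.split()) for t in tips) / n,
        "avg_chars": sum(len(t) for t in tips) / n,
        "avg_phone_numbers": num_phones / n,
    }

# Hypothetical tips, not taken from the crawled data.
tips = [
    "Great coffee, try the espresso",
    "Cheap software at http://example.com call 555-123-4567",
]
features = content_features(tips)
print(features)
```

Each value is computed per user, so the resulting dictionary can serve as one row of a feature matrix.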
10
Features used
Category            χ2 rank  Feature
User Attributes     1        Number of Tips
User Attributes     3        Ratio of Check-ins and Tips
User Attributes     4        Number of Check-ins
User Attributes     5        Number of Badges
User Attributes     11       Number of Mayorships
User Attributes     12       Ratio of Check-ins and Badges
User Attributes     15       Number of Photos posted
Social Attributes   6        Number of Friends
Content Attributes  2        Similarity score of Tips
Content Attributes  7        Number of URLs posted
Content Attributes  8        Average number of words in Tips
Content Attributes  9        Average number of characters in Tips
Content Attributes  10       Ratio of number of likes and number of Tips
Content Attributes  13       Average number of spam words in Tips
Content Attributes  14       Average number of phone-numbers posted in Tips
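To make the χ2 ranking concrete, here is a minimal sketch that scores binary features against a spam/legitimate label using the chi-square statistic of a 2x2 contingency table, then sorts them. It assumes each feature has already been binarized (for example, by thresholding); the feature names and toy values are hypothetical, and the slides do not say which tool or discretization produced the ranks above.

```python
def chi2_2x2(feature_flags, labels):
    """Chi-square statistic for a binary feature against a binary label."""
    n = len(labels)
    counts = [[0, 0], [0, 0]]  # counts[feature value][label]
    for f, y in zip(feature_flags, labels):
        counts[int(f)][int(y)] += 1
    row = [sum(counts[0]), sum(counts[1])]
    col = [counts[0][0] + counts[1][0], counts[0][1] + counts[1][1]]
    stat = 0.0
    for i in range(2):
        for j in range(2):
            expected = row[i] * col[j] / n
            if expected > 0:
                stat += (counts[i][j] - expected) ** 2 / expected
    return stat

# Hypothetical toy data: 1 = spammer, 0 = legitimate user.
labels = [1, 1, 1, 1, 0, 0, 0, 0]
features = {
    "many_tips":   [1, 1, 1, 1, 0, 0, 0, 0],  # perfectly aligned with the label
    "few_friends": [1, 1, 1, 0, 0, 0, 0, 1],  # partially aligned
    "has_photos":  [1, 0, 1, 0, 1, 0, 1, 0],  # independent of the label
}
scores = {name: chi2_2x2(flags, labels) for name, flags in features.items()}
ranked = sorted(scores, key=scores.get, reverse=True)
print(ranked, scores)
```

A higher statistic means the feature's distribution differs more between the two classes, which is why it ranks earlier.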

11
Few Observations
‣ Spammers post same/similar Tips on multiple venues
‣ A large fraction of spam Tips contain URLs
‣ Spam Tips may also have phone numbers
‣ Legitimate users have more Friends
‣ Spammers have very few Friends but large number of Tips
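The first observation suggests a per-user similarity score over tips. The slides do not define how that score is computed, so the sketch below assumes one plausible choice: the mean pairwise Jaccard similarity between the word sets of a user's tips. The function names and example tips are hypothetical.

```python
from itertools import combinations

def jaccard(a, b):
    """Jaccard similarity between the word sets of two tips."""
    wa, wb = set(a.lower().split()), set(b.lower().split())
    if not wa and not wb:
        return 1.0
    return len(wa & wb) / len(wa | wb)

def tip_similarity_score(tips):
    """Mean pairwise Jaccard similarity across one user's tips (0.0 if fewer than two)."""
    pairs = list(combinations(tips, 2))
    if not pairs:
        return 0.0
    return sum(jaccard(a, b) for a, b in pairs) / len(pairs)

# Hypothetical tips: one user repeats the same text, the other does not.
spam_like = ["visit our shop for deals"] * 3
varied = ["great espresso here", "parking is hard to find", "ask for the window table"]
print(tip_similarity_score(spam_like), tip_similarity_score(varied))
```

A user who posts the same tip at many venues scores close to 1, while a user with unrelated tips scores close to 0.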

12
Relation b/w Tips and Check-ins
[Figure: Tips plotted against Check-ins, annotated "Irregular User Behaviour"]
Tips Distribution
[Figure: distribution of Tips for legitimate users and for spammers]
14
Classification Results
Classification Algorithm  Precision (Spam)  Precision (Safe)  Recall (Spam)  Recall (Safe)  Accuracy
KNN                       83.2%             86.6%             86.3%          83.5%          84.89%
Decision Tree             88.1%             89.2%             88.3%          85.8%          89.53%
Random Forest             89.3%             90.2%             88.3%          90.3%          89.76%
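For readers less familiar with the column headings, the following sketch shows how per-class precision and recall and overall accuracy follow from a 2x2 confusion matrix. The counts are hypothetical and are not taken from the experiments; the function name `binary_metrics` is likewise an assumption.

```python
def binary_metrics(tp, fp, fn, tn):
    """Per-class precision and recall plus accuracy from a 2x2 confusion matrix.

    tp: spammers predicted spam      fn: spammers predicted safe
    fp: legitimate predicted spam    tn: legitimate predicted safe
    """
    return {
        "precision_spam": tp / (tp + fp),
        "precision_safe": tn / (tn + fn),
        "recall_spam": tp / (tp + fn),
        "recall_safe": tn / (tn + fp),
        "accuracy": (tp + tn) / (tp + fp + fn + tn),
    }

# Hypothetical counts, for illustration only.
m = binary_metrics(tp=80, fp=10, fn=20, tn=90)
print(m)
```

Precision (Spam) asks how many users flagged as spammers really are spammers, while Recall (Spam) asks how many of the true spammers were flagged.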

15
Detection of Spam Classes
‣ Expectation-Maximization (EM) clustering
‣ Spammers Categories
  ‣ Advertising / Marketing
  ‣ Self Promotion
  ‣ Abusive
  ‣ Malicious
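To show what EM clustering does, here is a minimal sketch that fits a two-component Gaussian mixture to a single feature and assigns each user to the more probable component. It is a simplified stand-in, not the setup used in the study: the one-dimensional feature, its values, the initial means, and the function names are all assumptions, and the actual clustering used the full feature set.

```python
import math

def responsibilities(xs, weights, means, variances):
    """E-step: posterior probability of each component for each point."""
    resp = []
    for x in xs:
        dens = [
            w * math.exp(-((x - m) ** 2) / (2 * v)) / math.sqrt(2 * math.pi * v)
            for w, m, v in zip(weights, means, variances)
        ]
        total = sum(dens)
        resp.append([d / total for d in dens])
    return resp

def em_gmm_1d(xs, init_means, iters=50):
    """Fit a 1-D Gaussian mixture by EM; returns (weights, means, variances)."""
    k, n = len(init_means), len(xs)
    means = list(init_means)
    weights = [1.0 / k] * k
    overall_mean = sum(xs) / n
    overall_var = sum((x - overall_mean) ** 2 for x in xs) / n
    variances = [overall_var] * k  # start every component at the data variance
    for _ in range(iters):
        resp = responsibilities(xs, weights, means, variances)
        # M-step: re-estimate each component from its weighted points.
        for j in range(k):
            nj = sum(r[j] for r in resp)
            weights[j] = nj / n
            means[j] = sum(r[j] * x for r, x in zip(resp, xs)) / nj
            var = sum(r[j] * (x - means[j]) ** 2 for r, x in zip(resp, xs)) / nj
            variances[j] = max(var, 1e-6)  # floor keeps a component from collapsing
    return weights, means, variances

# Hypothetical feature: fraction of each spammer's tips that contain a URL.
url_fraction = [0.10, 0.20, 0.15, 0.90, 0.95, 0.85]
weights, means, variances = em_gmm_1d(url_fraction, init_means=[0.0, 1.0])
resp = responsibilities(url_fraction, weights, means, variances)
clusters = [max(range(len(means)), key=lambda j: r[j]) for r in resp]
print(means, clusters)
```

Unlike k-means, EM keeps a soft membership probability for every user, and the hard cluster label is only taken at the end.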

16
Detection of Spam Classes
‣ Clustering accuracy for spammer categories
Category        Accuracy
Advertising     88.23%
Self-Promotion  87.23%
Abusive         78.88%
Malicious       0%

17
Conclusion
‣ Analyzed spammers' behaviour on Foursquare
‣ We obtained an accuracy of 89.76% with the Random Forest
classifier to distinguish spammers from legitimate users
‣ We classified the spammers into four broad categories
‣ We were able to detect users belonging to the Advertising,
Self-promotion and Abusive categories with accuracies of
88.23%, 87.23% and 78.88%, respectively
18
Future Work
‣ Refine our methodology by use of other classification
algorithms
‣ Use multiclass classification to detect users in any of the
spam categories
‣ Correlation of content and the URLs posted by different
users can help us in identifying several spam campaigns on
Foursquare
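One way the URL correlation idea might start is sketched below: group users by the domain of the URLs they post and keep domains shared by several users as candidate campaigns. This is only an assumed starting point, not a method from this work; the user names, URLs, and the `min_users` threshold are hypothetical.

```python
from collections import defaultdict
from urllib.parse import urlparse

def candidate_campaigns(user_urls, min_users=2):
    """Map each URL domain to the users who posted it, keeping shared domains."""
    users_by_domain = defaultdict(set)
    for user, urls in user_urls.items():
        for url in urls:
            domain = urlparse(url).netloc.lower()
            if domain:
                users_by_domain[domain].add(user)
    return {d: sorted(u) for d, u in users_by_domain.items() if len(u) >= min_users}

# Hypothetical users and URLs, for illustration only.
user_urls = {
    "user_a": ["http://deals.example.com/offer1"],
    "user_b": ["http://deals.example.com/offer2", "http://blog.example.org/post"],
    "user_c": ["http://deals.example.com/offer1"],
}
campaigns = candidate_campaigns(user_urls)
print(campaigns)
```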

19
Thank You!
Questions ?

20
For any further information, please write to
pk@iiitd.ac.in
precog.iiitd.edu.in

21


  • 1. Detection of Spam Tipping Behaviour on Foursquare
      Anupama Aggarwal¶, Prof P. Kumaraguru “PK”¶, Prof J. Almeida*
      ¶ Indraprastha Institute of Information Technology (IIIT-Delhi, India)
      * Universidade Federal de Minas Gerais (UFMG, Brazil)
  • 2. Foursquare 101
      ‣ Location Based Social Network
      ‣ 33 Million Users *
      ‣ 3.5 Billion checkins *
      ‣ 31% of mobile social media users use Foursquare *
      * As of January 2013
  • 3. Foursquare 101: Location Sharing OSN
      [Screenshot labels: Friends Activity, Your Last Checkin, Venue, Friends Suggestions, Venue Suggestions]
      ‣ Tip: Suggested Activity for a Venue
      ‣ Tip can be Liked or Saved
  • 4. Spam Tips
      ‣ Advertising / Marketing
      ‣ Tips unrelated to Venue
      ‣ Scam / Phishing
  • 5. Spam according to Foursquare ToS
      ‣ Tips with links to websites selling software, realtor contact info, a listing for your business, or other promotion
      ‣ Tips with inappropriate language or negativity directed at another person
      ‣ Unauthorized or unsolicited advertising, junk
  • 6. Contributions
      ‣ Characterizing irregular user behaviour
          ‣ We observed different categories of spam users
          ‣ We characterize features distinguishing these spam users
      ‣ Automatic detection of spammers
          ‣ Distinguish between spam and legitimate Foursquare users
          ‣ Cluster spam users into different categories according to their behaviour
  • 8. Observed Categories of Spam Users
      ‣ Marketing: These users post tips to promote and advertise a specific product / brand / venue / external URL
      ‣ Malicious: Such Foursquare users post external URLs in Tips which direct to spam / phishing / malware websites
      ‣ Abusive / Derogatory: These users try to deface or bad-mouth another person
      ‣ Self Promotion: These users try to draw attention to themselves
  • 9. Ground Truth Data: Annotation Portal
      ‣ 2,000 Legitimate users
      ‣ 1,900 Spammers
  • 10. Features used to detect Spammers
      ‣ User Attributes
          ‣ Properties of the Foursquare user profile and his checkins
      ‣ Social Attributes
          ‣ Friends network of the Foursquare user under inspection
      ‣ Content Attributes
          ‣ Details about Tips posted by the Foursquare user
  • 11. Features used
      Category             χ2 rank   Feature
      User Attributes      1         Number of Tips
                           3         Ratio of Check-ins and Tips
                           4         Number of Check-ins
                           5         Number of Badges
                           11        Number of Mayorships
                           12        Ratio of Check-ins and Badges
                           15        Number of Photos posted
      Social Attributes    6         Number of Friends
      Content Attributes   2         Similarity score of Tips
                           7         Number of URLs posted
                           8         Average number of words in Tips
                           9         Average number of characters in Tips
                           10        Ratio of number of likes and number of Tips
                           13        Average number of spam words in Tips
                           14        Average number of phone-numbers posted in Tips
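
The slides give the χ2 rank of each feature but not how the statistic was computed. As a minimal sketch, assuming each numeric feature is binarized at its median and scored against the spam label with Pearson's chi-squared on a 2x2 table, a ranking could be produced as follows. The feature names (`num_tips`, `num_friends`, `num_photos`) and the toy users are hypothetical and are not data from the study.

```python
from statistics import median

def chi2_binary(flags, labels):
    """Pearson chi-squared statistic for a boolean feature vs. a boolean label."""
    n = len(labels)
    obs = [[0, 0], [0, 0]]  # obs[feature][label] counts
    for f, l in zip(flags, labels):
        obs[int(f)][int(l)] += 1
    row = [sum(obs[0]), sum(obs[1])]
    col = [obs[0][0] + obs[1][0], obs[0][1] + obs[1][1]]
    stat = 0.0
    for i in range(2):
        for j in range(2):
            expected = row[i] * col[j] / n
            if expected > 0:
                stat += (obs[i][j] - expected) ** 2 / expected
    return stat

def rank_features(users, labels, feature_names):
    """Rank features by chi-squared score (assumption: median split per feature)."""
    scores = {}
    for name in feature_names:
        values = [u[name] for u in users]
        cut = median(values)
        scores[name] = chi2_binary([v > cut for v in values], labels)
    ranking = sorted(feature_names, key=lambda name: scores[name], reverse=True)
    return ranking, scores

# Hypothetical toy data: three spammers followed by three legitimate users.
users = [
    {"num_tips": 40, "num_friends": 2, "num_photos": 1},
    {"num_tips": 35, "num_friends": 1, "num_photos": 3},
    {"num_tips": 50, "num_friends": 3, "num_photos": 0},
    {"num_tips": 5, "num_friends": 30, "num_photos": 2},
    {"num_tips": 3, "num_friends": 45, "num_photos": 4},
    {"num_tips": 8, "num_friends": 25, "num_photos": 1},
]
labels = [True, True, True, False, False, False]
ranking, scores = rank_features(users, labels, ["num_tips", "num_friends", "num_photos"])
```

On this toy data the number of tips and the number of friends separate the two classes perfectly and score highest, while the number of photos scores lowest. A median split is only one of several possible discretizations.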
  • 12. Few Observations
      ‣ Spammers post same/similar Tips on multiple venues
      ‣ A large fraction of spam Tips contain URLs
      ‣ Spam Tips may also have phone numbers
      ‣ Legitimate users have more Friends
      ‣ Spammers have very few Friends but large number of Tips
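
The content-related observations above (similar Tips, URLs, phone numbers) map onto features that can be computed from a user's Tips. The following is a rough sketch only: the slides do not define the similarity score, so average pairwise Jaccard overlap of word sets is used here as an assumption, and the URL and phone-number patterns are simple illustrative heuristics rather than the ones used in the study.

```python
import re
from itertools import combinations

# Illustrative patterns (assumptions, not taken from the slides).
URL_RE = re.compile(r"https?://\S+|www\.\S+", re.IGNORECASE)
PHONE_RE = re.compile(r"\+?\d[\d\s().-]{6,}\d")

def jaccard(a, b):
    """Jaccard overlap between the lower-cased word sets of two tips."""
    ta, tb = set(a.lower().split()), set(b.lower().split())
    if not ta and not tb:
        return 1.0
    return len(ta & tb) / len(ta | tb)

def content_features(tips):
    """Compute a few content attributes for one user's list of tip strings."""
    n = len(tips)
    if n == 0:
        return {"num_urls": 0, "avg_words": 0.0, "avg_chars": 0.0,
                "avg_phone_numbers": 0.0, "tip_similarity": 0.0}
    pairs = list(combinations(tips, 2))
    similarity = sum(jaccard(a, b) for a, b in pairs) / len(pairs) if pairs else 0.0
    return {
        "num_urls": sum(len(URL_RE.findall(t)) for t in tips),
        "avg_words": sum(len(t.split()) for t in tips) / n,
        "avg_chars": sum(len(t) for t in tips) / n,
        "avg_phone_numbers": sum(len(PHONE_RE.findall(t)) for t in tips) / n,
        "tip_similarity": similarity,
    }

# Hypothetical tips: a user repeating one advert, and a user with varied tips.
spam = content_features([
    "Cheap loans call 555-123-4567 http://example.com",
    "Cheap loans call 555-123-4567 http://example.com",
])
legit = content_features(["Try the masala dosa", "Great coffee and free wifi"])
```

A user who posts the same Tip everywhere gets a similarity of 1.0 under this definition, while users with unrelated Tips score close to 0.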
  • 13. Relation between Tips and Check-ins
      [Figure: axes Tips and Check-ins; annotation: Irregular User Behaviour]
  • 16. Detection of Spam Classes
      ‣ Expectation-Maximization (EM) clustering
      ‣ Spammer Categories
          ‣ Advertising / Marketing
          ‣ Self Promotion
          ‣ Abusive
          ‣ Malicious
  • 17. Detection of Spam Classes
      ‣ Clustering accuracy for spammer categories:
          Advertising      88.23%
          Self-Promotion   87.23%
          Abusive          78.88%
          Malicious        0%
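
The slides name Expectation-Maximization (EM) clustering but give no implementation details. As a simplified illustration of the technique, and not the authors' setup, the sketch below fits a two-component Gaussian mixture to a single numeric feature. The study clusters spammers into four categories over several features; the single feature, the two components, the initialisation and the toy values here are all assumptions made to keep the example short.

```python
import math

def gaussian_pdf(x, mean, var):
    """Density of a 1-D normal distribution."""
    return math.exp(-((x - mean) ** 2) / (2 * var)) / math.sqrt(2 * math.pi * var)

def em_two_gaussians(xs, iterations=50, min_var=1e-6):
    """Fit a two-component 1-D Gaussian mixture with EM; return parameters and labels."""
    n = len(xs)
    # Deterministic start: one mean at each extreme, shared overall variance.
    means = [min(xs), max(xs)]
    centre = sum(xs) / n
    start_var = max(sum((x - centre) ** 2 for x in xs) / n, min_var)
    variances = [start_var, start_var]
    weights = [0.5, 0.5]
    resp = [[0.5, 0.5] for _ in xs]
    for _ in range(iterations):
        # E-step: responsibility of each component for each point.
        resp = []
        for x in xs:
            dens = [weights[k] * gaussian_pdf(x, means[k], variances[k]) for k in range(2)]
            total = sum(dens)
            resp.append([d / total for d in dens] if total > 0 else [0.5, 0.5])
        # M-step: re-estimate weight, mean and variance of each component.
        for k in range(2):
            nk = sum(r[k] for r in resp)
            if nk == 0:
                continue
            weights[k] = nk / n
            means[k] = sum(r[k] * x for r, x in zip(resp, xs)) / nk
            spread = sum(r[k] * (x - means[k]) ** 2 for r, x in zip(resp, xs)) / nk
            variances[k] = max(spread, min_var)
    labels = [0 if r[0] >= r[1] else 1 for r in resp]
    return means, variances, weights, labels

# Hypothetical per-user values of one feature (for example, a tip similarity score).
values = [0.10, 0.12, 0.09, 0.90, 0.88, 0.93]
means, variances, weights, labels = em_two_gaussians(values)
```

Each point is assigned to the component with the larger responsibility. Per-category accuracy figures such as those on slide 17 would then require comparing cluster assignments with annotated categories, a step this sketch does not cover.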
  • 18. Conclusion
      ‣ Analyzed spammers' behaviour on Foursquare
      ‣ We obtained an accuracy of 89.76% with a Random Forest classifier to distinguish spammers from legitimate users
      ‣ We classified the spammers into four broad categories
      ‣ We were able to detect users belonging to the Advertising, Self-promotion and Abusive categories with accuracies of 88.23%, 87.23% and 78.88%, respectively
  • 19. Future Work
      ‣ Refine our methodology by use of other classification algorithms
      ‣ Use multiclass classification to detect users in any of the spam categories
      ‣ Correlation of content and the URLs posted by different users can help us in identifying several spam campaigns on Foursquare
  • 21. For any further information, please write to pk@iiitd.ac.in
      precog.iiitd.edu.in