SlideShare une entreprise Scribd logo
1  sur  23
Télécharger pour lire hors ligne
Diving into Twitter dataon consumer electronic brands
Which brands get tweeted about most? Is it mainly positive or negative?
15.3 gbof JSON data downloaded from Twitter’s Streaming API 
between 13 –25 May using Python
Before processing, tweets were in raw JSON format 
Time Created 
Tweet text/status 
Username 
Tweet location (if available) 
No. of followers 
No. of people followed 
No. of statuses 
Language 
Data should be optimized as only a fraction of the data used for analysis— optimization improves performance in models and saves cost and time
The same tweet we saw previously 
By optimizing the data, 
15.3 gbof jsonwas converted to 757 mbof csv (5% of original size) 
After processing, only some fields retained and converted to CSV format
Brand 
Positive Sentiment 
Brand 
Negative Sentiment 
Brand 
Mixed Sentiment 
The list of words for sentiment analysis is adapted from 
the Harvard General Inquirer dictionaries 
Source: http://www.wjh.harvard.edu/~inquirer/homecat.htm, downloaded on 28 May 2014 
Tweets are then tagged for brand and sentiment in R
Initially, collected tweets based on 17 keywords 
Samsung 
S4 
Xperia 
HTC 
Huawei 
BlackBerry 
Apple 
S5 
Sony 
Nokia 
Note 3 
Lumia 
q5 
iPhone 
q10 
z10 
Motorala
“Apple” and “iPhone” accounted for 87% of tweet volume 
Removed from keywords during actual data collection to focus on other brands (, save space, and reduce bandwidth usage) 
A trial was conducted with 16 keywords on 11 May, 8 –9am 
1 gbof JSON data was collected in a hour 
During a one hour trial, “Apple” and “iPhone” had 87% share of tweets
Samsung 
Sony 
Nokia 
HTC 
Huawei 
BlackBerry 
Motorola 
Tweets containing seven keywords were collected from 13 –25 May
4% of tweets mentioned > 2 brands; they were excluded from analysis 
8% of tweets had mixed sentiment (i.e., positive and negative sentiment); they were excluded from analysis 
92% of tweets remained, each only mentioning 1 brand with either “positive”, “negative”, or “neutral” sentiment 
3,681,942 tweets were collected 
After processing, 3,234,678 tweets remained for analysis
Samsung leads in twitter buzz, followed by Sony and Nokia 
Together, they make up 75% of twitter buzz 
Samsung is the clear leader in twitter buzz, followed by Sony and Nokia 
However, Samsung and Sony have wider product offerings relative to the rest that mainly focus on phones 
Also, Huawei’s users may mainly be on Weibo, Renren, etc
Most brands have roughly 1:1 ratio of positive to negative tweets 
Samsung is the exception with ratio of roughly 3:2 
Brands have equal ratio of positive to negative tweets
Dip due to connectivity issues 
Brands’ share of tweets is roughly consistent over time
Spikes in tweet volume coincide with product launches
Spikes in tweet volume coincide with product launches
Users who tweet about BlackBerry tend to be better connected (i.e., higher median of followers and people followed)* 
* Excluding outliers 
Across brands, there is not much difference in user connectedness 
The median user has around 250 followers and also follows 250 people
50th–75thpercentile of users who tweet about Sony, HTC, and Motorola have very high numbers of all time tweets (spam bots perhaps?)* 
While Nokia is 3rdin twitter buzz share (14%), users who tweet about Nokia have least numbers of all time tweets 
Suggests that tweets likely to come from real users and not bots (or maybe less active bots) 
* Excluding outliers 
However, there is a large difference between users’ all time tweets
12833979 
followers 
11796709 
followers 
CNN’s tweet on Obama’s BlackBerry was “seen” by most followers
1753696 tweets 
1730006 
tweets 
A bot that retweets on farts has the highest all time tweets
1753696 tweets 
1730006 
tweets 
A bot that retweets on farts has the highest all time tweets
Initially, BlackBerry tweets showed 100% negative sentiment 
Culprit was the word “lack”—it was removed 
However, removing it reduced negative sentiment for other brands by 2 –3 % 
An interesting error led to BlackBerry having 100% negative sentiment
Track brands’ managed twitter accounts and conversations to measure engagement 
Which brands have better engagement with users and why? 
Track general message of tweets 
Are tweets of a brand mainly about sales, reviews, complaints, or news? 
Network analysis to identify users with high centrality and influence 
Which users have high influence and what are they tweeting about my brand? 
Geospatial analysis of tweets 
Are there differences in brand buzz, sentiment, and engagement across regions? 
Where do we go from here?
Code available on GitHub: https://github.com/eugeneyan/Twitter-SMA 
Python script to download tweets in JSON format 
Python scripts to convert tweets from JSON to CSV (with & without regular expressions filtering) 
R script and sentiment analysis list of words 
R script and sentiment analysis list of words to reproduce BlackBerry error

Contenu connexe

En vedette

Nielsen x DataScience SG Meetup (Apr 2015)
Nielsen x DataScience SG Meetup (Apr 2015)Nielsen x DataScience SG Meetup (Apr 2015)
Nielsen x DataScience SG Meetup (Apr 2015)Eugene Yan Ziyou
 
Competition Improves Performance: Only when Competition Form matches Goal Ori...
Competition Improves Performance: Only when Competition Form matches Goal Ori...Competition Improves Performance: Only when Competition Form matches Goal Ori...
Competition Improves Performance: Only when Competition Form matches Goal Ori...Eugene Yan Ziyou
 
AXA x DSSG Meetup Sharing (Feb 2016)
AXA x DSSG Meetup Sharing (Feb 2016)AXA x DSSG Meetup Sharing (Feb 2016)
AXA x DSSG Meetup Sharing (Feb 2016)Eugene Yan Ziyou
 
Kaggle Otto Challenge: How we achieved 85th out of 3,514 and what we learnt
Kaggle Otto Challenge: How we achieved 85th out of 3,514 and what we learntKaggle Otto Challenge: How we achieved 85th out of 3,514 and what we learnt
Kaggle Otto Challenge: How we achieved 85th out of 3,514 and what we learntEugene Yan Ziyou
 
Statistical inference: Probability and Distribution
Statistical inference: Probability and DistributionStatistical inference: Probability and Distribution
Statistical inference: Probability and DistributionEugene Yan Ziyou
 
Statistical inference: Hypothesis Testing and t-tests
Statistical inference: Hypothesis Testing and t-testsStatistical inference: Hypothesis Testing and t-tests
Statistical inference: Hypothesis Testing and t-testsEugene Yan Ziyou
 
Sharing about my data science journey and what I do at Lazada
Sharing about my data science journey and what I do at LazadaSharing about my data science journey and what I do at Lazada
Sharing about my data science journey and what I do at LazadaEugene Yan Ziyou
 
How Lazada ranks products to improve customer experience and conversion
How Lazada ranks products to improve customer experience and conversionHow Lazada ranks products to improve customer experience and conversion
How Lazada ranks products to improve customer experience and conversionEugene Yan Ziyou
 

En vedette (8)

Nielsen x DataScience SG Meetup (Apr 2015)
Nielsen x DataScience SG Meetup (Apr 2015)Nielsen x DataScience SG Meetup (Apr 2015)
Nielsen x DataScience SG Meetup (Apr 2015)
 
Competition Improves Performance: Only when Competition Form matches Goal Ori...
Competition Improves Performance: Only when Competition Form matches Goal Ori...Competition Improves Performance: Only when Competition Form matches Goal Ori...
Competition Improves Performance: Only when Competition Form matches Goal Ori...
 
AXA x DSSG Meetup Sharing (Feb 2016)
AXA x DSSG Meetup Sharing (Feb 2016)AXA x DSSG Meetup Sharing (Feb 2016)
AXA x DSSG Meetup Sharing (Feb 2016)
 
Kaggle Otto Challenge: How we achieved 85th out of 3,514 and what we learnt
Kaggle Otto Challenge: How we achieved 85th out of 3,514 and what we learntKaggle Otto Challenge: How we achieved 85th out of 3,514 and what we learnt
Kaggle Otto Challenge: How we achieved 85th out of 3,514 and what we learnt
 
Statistical inference: Probability and Distribution
Statistical inference: Probability and DistributionStatistical inference: Probability and Distribution
Statistical inference: Probability and Distribution
 
Statistical inference: Hypothesis Testing and t-tests
Statistical inference: Hypothesis Testing and t-testsStatistical inference: Hypothesis Testing and t-tests
Statistical inference: Hypothesis Testing and t-tests
 
Sharing about my data science journey and what I do at Lazada
Sharing about my data science journey and what I do at LazadaSharing about my data science journey and what I do at Lazada
Sharing about my data science journey and what I do at Lazada
 
How Lazada ranks products to improve customer experience and conversion
How Lazada ranks products to improve customer experience and conversionHow Lazada ranks products to improve customer experience and conversion
How Lazada ranks products to improve customer experience and conversion
 

Similaire à Diving into Twitter data on consumer electronic brands

Making sense of Twitter - MSM 2010 London
Making sense of Twitter - MSM 2010 LondonMaking sense of Twitter - MSM 2010 London
Making sense of Twitter - MSM 2010 LondonNicolasSaintagne
 
Geek Retreat 2011
Geek Retreat 2011Geek Retreat 2011
Geek Retreat 2011BrandsEye
 
Data Analytics on Twitter Feeds
Data Analytics on Twitter FeedsData Analytics on Twitter Feeds
Data Analytics on Twitter FeedsEu Jin Lok
 
Digital immersion for elt 3 29-10
Digital immersion for elt 3 29-10Digital immersion for elt 3 29-10
Digital immersion for elt 3 29-10Murilo Cappucci
 
How to get started with social media
How to get started with social mediaHow to get started with social media
How to get started with social mediaEric Ritter
 
Social Media Strategy PRSA NCC
Social Media Strategy PRSA NCCSocial Media Strategy PRSA NCC
Social Media Strategy PRSA NCCSally Falkow
 
Chase oppfor brands_11.1.11_condensed
Chase oppfor brands_11.1.11_condensedChase oppfor brands_11.1.11_condensed
Chase oppfor brands_11.1.11_condenseddborstein
 
PeopleBrowsr Platform - Monitoring What's Hot
PeopleBrowsr Platform - Monitoring What's HotPeopleBrowsr Platform - Monitoring What's Hot
PeopleBrowsr Platform - Monitoring What's HotPeopleBrowsr
 
Making Sense of Twitter - Spotter - Nicolas Saintagne
Making Sense of Twitter - Spotter - Nicolas SaintagneMaking Sense of Twitter - Spotter - Nicolas Saintagne
Making Sense of Twitter - Spotter - Nicolas SaintagneInfluence People
 
Social Media Convergence - The ARF/Adweek 2009
Social Media Convergence - The ARF/Adweek 2009Social Media Convergence - The ARF/Adweek 2009
Social Media Convergence - The ARF/Adweek 2009Lynne d Johnson
 
Twitter overview
Twitter overviewTwitter overview
Twitter overviewJason Cross
 
2015 Consumer State of Sponsored Social
2015 Consumer State of Sponsored Social2015 Consumer State of Sponsored Social
2015 Consumer State of Sponsored SocialTed Murphy
 
Indian Mobile Phone Brand Social Media Analysis & Competitors Study
Indian Mobile Phone Brand Social Media Analysis & Competitors Study Indian Mobile Phone Brand Social Media Analysis & Competitors Study
Indian Mobile Phone Brand Social Media Analysis & Competitors Study Arjoon Mehra
 
PeopleBrowsr Super Bowl Deck
PeopleBrowsr Super Bowl DeckPeopleBrowsr Super Bowl Deck
PeopleBrowsr Super Bowl DeckPeopleBrowsr
 
Tips to Optimize Your Twitter Strategy
Tips to Optimize Your Twitter StrategyTips to Optimize Your Twitter Strategy
Tips to Optimize Your Twitter StrategyBrickfish
 

Similaire à Diving into Twitter data on consumer electronic brands (20)

Making sense of Twitter - MSM 2010 London
Making sense of Twitter - MSM 2010 LondonMaking sense of Twitter - MSM 2010 London
Making sense of Twitter - MSM 2010 London
 
Geek Retreat 2011
Geek Retreat 2011Geek Retreat 2011
Geek Retreat 2011
 
Data Analytics on Twitter Feeds
Data Analytics on Twitter FeedsData Analytics on Twitter Feeds
Data Analytics on Twitter Feeds
 
Social 360
Social 360Social 360
Social 360
 
Digital immersion for elt 3 29-10
Digital immersion for elt 3 29-10Digital immersion for elt 3 29-10
Digital immersion for elt 3 29-10
 
How to get started with social media
How to get started with social mediaHow to get started with social media
How to get started with social media
 
Web 2.0 sf 2011 metrics
Web 2.0 sf 2011 metricsWeb 2.0 sf 2011 metrics
Web 2.0 sf 2011 metrics
 
Social Media Strategy PRSA NCC
Social Media Strategy PRSA NCCSocial Media Strategy PRSA NCC
Social Media Strategy PRSA NCC
 
Chase oppfor brands_11.1.11_condensed
Chase oppfor brands_11.1.11_condensedChase oppfor brands_11.1.11_condensed
Chase oppfor brands_11.1.11_condensed
 
PeopleBrowsr Platform - Monitoring What's Hot
PeopleBrowsr Platform - Monitoring What's HotPeopleBrowsr Platform - Monitoring What's Hot
PeopleBrowsr Platform - Monitoring What's Hot
 
Making Sense of Twitter - Spotter - Nicolas Saintagne
Making Sense of Twitter - Spotter - Nicolas SaintagneMaking Sense of Twitter - Spotter - Nicolas Saintagne
Making Sense of Twitter - Spotter - Nicolas Saintagne
 
Social Media Convergence - The ARF/Adweek 2009
Social Media Convergence - The ARF/Adweek 2009Social Media Convergence - The ARF/Adweek 2009
Social Media Convergence - The ARF/Adweek 2009
 
Coffee Twalk Three: Measurement and Analytics for Social Media
Coffee Twalk Three: Measurement and Analytics for Social MediaCoffee Twalk Three: Measurement and Analytics for Social Media
Coffee Twalk Three: Measurement and Analytics for Social Media
 
Gravity Summit 2010 PeopleBrowsr
Gravity Summit 2010 PeopleBrowsrGravity Summit 2010 PeopleBrowsr
Gravity Summit 2010 PeopleBrowsr
 
Twitter overview
Twitter overviewTwitter overview
Twitter overview
 
2015 Consumer State of Sponsored Social
2015 Consumer State of Sponsored Social2015 Consumer State of Sponsored Social
2015 Consumer State of Sponsored Social
 
Indian Mobile Phone Brand Social Media Analysis & Competitors Study
Indian Mobile Phone Brand Social Media Analysis & Competitors Study Indian Mobile Phone Brand Social Media Analysis & Competitors Study
Indian Mobile Phone Brand Social Media Analysis & Competitors Study
 
PeopleBrowsr Super Bowl Deck
PeopleBrowsr Super Bowl DeckPeopleBrowsr Super Bowl Deck
PeopleBrowsr Super Bowl Deck
 
Tips to Optimize Your Twitter Strategy
Tips to Optimize Your Twitter StrategyTips to Optimize Your Twitter Strategy
Tips to Optimize Your Twitter Strategy
 
Coffee Twalk Two: Communication Strategies
Coffee Twalk Two: Communication StrategiesCoffee Twalk Two: Communication Strategies
Coffee Twalk Two: Communication Strategies
 

Plus de Eugene Yan Ziyou

System design for recommendations and search
System design for recommendations and searchSystem design for recommendations and search
System design for recommendations and searchEugene Yan Ziyou
 
Recommender Systems: Beyond the user-item matrix
Recommender Systems: Beyond the user-item matrixRecommender Systems: Beyond the user-item matrix
Recommender Systems: Beyond the user-item matrixEugene Yan Ziyou
 
Predicting Hospital Bills at Pre-admission
Predicting Hospital Bills at Pre-admissionPredicting Hospital Bills at Pre-admission
Predicting Hospital Bills at Pre-admissionEugene Yan Ziyou
 
OLX Group Prod Tech 2019 Keynote: Asia's Tech Giants
OLX Group Prod Tech 2019 Keynote: Asia's Tech GiantsOLX Group Prod Tech 2019 Keynote: Asia's Tech Giants
OLX Group Prod Tech 2019 Keynote: Asia's Tech GiantsEugene Yan Ziyou
 
Data Science Challenges and Impact at Lazada (Big Data and Analytics Innovati...
Data Science Challenges and Impact at Lazada (Big Data and Analytics Innovati...Data Science Challenges and Impact at Lazada (Big Data and Analytics Innovati...
Data Science Challenges and Impact at Lazada (Big Data and Analytics Innovati...Eugene Yan Ziyou
 
INSEAD Sharing on Lazada Data Science and my Journey
INSEAD Sharing on Lazada Data Science and my JourneyINSEAD Sharing on Lazada Data Science and my Journey
INSEAD Sharing on Lazada Data Science and my JourneyEugene Yan Ziyou
 
SMU BIA Sharing on Data Science
SMU BIA Sharing on Data ScienceSMU BIA Sharing on Data Science
SMU BIA Sharing on Data ScienceEugene Yan Ziyou
 
Culture at Lazada Data Science
Culture at Lazada Data ScienceCulture at Lazada Data Science
Culture at Lazada Data ScienceEugene Yan Ziyou
 
Garuda Robotics x DataScience SG Meetup (Sep 2015)
Garuda Robotics x DataScience SG Meetup (Sep 2015)Garuda Robotics x DataScience SG Meetup (Sep 2015)
Garuda Robotics x DataScience SG Meetup (Sep 2015)Eugene Yan Ziyou
 
DataKind SG sharing of our first DataDive
DataKind SG sharing of our first DataDiveDataKind SG sharing of our first DataDive
DataKind SG sharing of our first DataDiveEugene Yan Ziyou
 
Social network analysis and growth recommendations for DataScience SG community
Social network analysis and growth recommendations for DataScience SG communitySocial network analysis and growth recommendations for DataScience SG community
Social network analysis and growth recommendations for DataScience SG communityEugene Yan Ziyou
 
Statistical inference: Statistical Power, ANOVA, and Post Hoc tests
Statistical inference: Statistical Power, ANOVA, and Post Hoc testsStatistical inference: Statistical Power, ANOVA, and Post Hoc tests
Statistical inference: Statistical Power, ANOVA, and Post Hoc testsEugene Yan Ziyou
 
A Study on the Relationship between Education and Income in the US
A Study on the Relationship between Education and Income in the USA Study on the Relationship between Education and Income in the US
A Study on the Relationship between Education and Income in the USEugene Yan Ziyou
 

Plus de Eugene Yan Ziyou (13)

System design for recommendations and search
System design for recommendations and searchSystem design for recommendations and search
System design for recommendations and search
 
Recommender Systems: Beyond the user-item matrix
Recommender Systems: Beyond the user-item matrixRecommender Systems: Beyond the user-item matrix
Recommender Systems: Beyond the user-item matrix
 
Predicting Hospital Bills at Pre-admission
Predicting Hospital Bills at Pre-admissionPredicting Hospital Bills at Pre-admission
Predicting Hospital Bills at Pre-admission
 
OLX Group Prod Tech 2019 Keynote: Asia's Tech Giants
OLX Group Prod Tech 2019 Keynote: Asia's Tech GiantsOLX Group Prod Tech 2019 Keynote: Asia's Tech Giants
OLX Group Prod Tech 2019 Keynote: Asia's Tech Giants
 
Data Science Challenges and Impact at Lazada (Big Data and Analytics Innovati...
Data Science Challenges and Impact at Lazada (Big Data and Analytics Innovati...Data Science Challenges and Impact at Lazada (Big Data and Analytics Innovati...
Data Science Challenges and Impact at Lazada (Big Data and Analytics Innovati...
 
INSEAD Sharing on Lazada Data Science and my Journey
INSEAD Sharing on Lazada Data Science and my JourneyINSEAD Sharing on Lazada Data Science and my Journey
INSEAD Sharing on Lazada Data Science and my Journey
 
SMU BIA Sharing on Data Science
SMU BIA Sharing on Data ScienceSMU BIA Sharing on Data Science
SMU BIA Sharing on Data Science
 
Culture at Lazada Data Science
Culture at Lazada Data ScienceCulture at Lazada Data Science
Culture at Lazada Data Science
 
Garuda Robotics x DataScience SG Meetup (Sep 2015)
Garuda Robotics x DataScience SG Meetup (Sep 2015)Garuda Robotics x DataScience SG Meetup (Sep 2015)
Garuda Robotics x DataScience SG Meetup (Sep 2015)
 
DataKind SG sharing of our first DataDive
DataKind SG sharing of our first DataDiveDataKind SG sharing of our first DataDive
DataKind SG sharing of our first DataDive
 
Social network analysis and growth recommendations for DataScience SG community
Social network analysis and growth recommendations for DataScience SG communitySocial network analysis and growth recommendations for DataScience SG community
Social network analysis and growth recommendations for DataScience SG community
 
Statistical inference: Statistical Power, ANOVA, and Post Hoc tests
Statistical inference: Statistical Power, ANOVA, and Post Hoc testsStatistical inference: Statistical Power, ANOVA, and Post Hoc tests
Statistical inference: Statistical Power, ANOVA, and Post Hoc tests
 
A Study on the Relationship between Education and Income in the US
A Study on the Relationship between Education and Income in the USA Study on the Relationship between Education and Income in the US
A Study on the Relationship between Education and Income in the US
 

Dernier

BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxolyaivanovalion
 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxolyaivanovalion
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationshipsccctableauusergroup
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxolyaivanovalion
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSAishani27
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfRachmat Ramadhan H
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingNeil Barnes
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxolyaivanovalion
 
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改atducpo
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz1
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfLars Albertsson
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxolyaivanovalion
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfMarinCaroMartnezBerg
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfLars Albertsson
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxStephen266013
 

Dernier (20)

BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptx
 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFx
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships
 
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptx
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICS
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data Storytelling
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptx
 
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signals
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdf
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptx
 
Sampling (random) method and Non random.ppt
Sampling (random) method and Non random.pptSampling (random) method and Non random.ppt
Sampling (random) method and Non random.ppt
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdf
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docx
 

Diving into Twitter data on consumer electronic brands

  • 1. Diving into Twitter dataon consumer electronic brands
  • 2. Which brands get tweeted about most? Is it mainly positive or negative?
  • 3. 15.3 gbof JSON data downloaded from Twitter’s Streaming API between 13 –25 May using Python
  • 4. Before processing, tweets were in raw JSON format Time Created Tweet text/status Username Tweet location (if available) No. of followers No. of people followed No. of statuses Language Data should be optimized as only a fraction of the data used for analysis— optimization improves performance in models and saves cost and time
  • 5. The same tweet we saw previously By optimizing the data, 15.3 gbof jsonwas converted to 757 mbof csv (5% of original size) After processing, only some fields retained and converted to CSV format
  • 6. Brand Positive Sentiment Brand Negative Sentiment Brand Mixed Sentiment The list of words for sentiment analysis is adapted from the Harvard General Inquirer dictionaries Source: http://www.wjh.harvard.edu/~inquirer/homecat.htm, downloaded on 28 May 2014 Tweets are then tagged for brand and sentiment in R
  • 7. Initially, collected tweets based on 17 keywords Samsung S4 Xperia HTC Huawei BlackBerry Apple S5 Sony Nokia Note 3 Lumia q5 iPhone q10 z10 Motorala
  • 8. “Apple” and “iPhone” accounted for 87% of tweet volume Removed from keywords during actual data collection to focus on other brands (, save space, and reduce bandwidth usage) A trial was conducted with 16 keywords on 11 May, 8 –9am 1 gbof JSON data was collected in a hour During a one hour trial, “Apple” and “iPhone” had 87% share of tweets
  • 9. Samsung Sony Nokia HTC Huawei BlackBerry Motorola Tweets containing seven keywords were collected from 13 –25 May
  • 10. 4% of tweets mentioned > 2 brands; they were excluded from analysis 8% of tweets had mixed sentiment (i.e., positive and negative sentiment); they were excluded from analysis 92% of tweets remained, each only mentioning 1 brand with either “positive”, “negative”, or “neutral” sentiment 3,681,942 tweets were collected After processing, 3,234,678 tweets remained for analysis
  • 11. Samsung leads in twitter buzz, followed by Sony and Nokia Together, they make up 75% of twitter buzz Samsung is the clear leader in twitter buzz, followed by Sony and Nokia However, Samsung and Sony have wider product offerings relative to the rest that mainly focus on phones Also, Huawei’s users may mainly be on Weibo, Renren, etc
  • 12. Most brands have roughly 1:1 ratio of positive to negative tweets Samsung is the exception with ratio of roughly 3:2 Brands have equal ratio of positive to negative tweets
  • 13. Dip due to connectivity issues Brands’ share of tweets is roughly consistent over time
  • 14. Spikes in tweet volume coincide with product launches
  • 15. Spikes in tweet volume coincide with product launches
  • 16. Users who tweet about BlackBerry tend to be better connected (i.e., higher median of followers and people followed)* * Excluding outliers Across brands, there is not much difference in user connectedness The median user has around 250 followers and also follows 250 people
  • 17. 50th–75thpercentile of users who tweet about Sony, HTC, and Motorola have very high numbers of all time tweets (spam bots perhaps?)* While Nokia is 3rdin twitter buzz share (14%), users who tweet about Nokia have least numbers of all time tweets Suggests that tweets likely to come from real users and not bots (or maybe less active bots) * Excluding outliers However, there is a large difference between users’ all time tweets
  • 18. 12833979 followers 11796709 followers CNN’s tweet on Obama’s BlackBerry was “seen” by most followers
  • 19. 1753696 tweets 1730006 tweets A bot that retweets on farts has the highest all time tweets
  • 20. 1753696 tweets 1730006 tweets A bot that retweets on farts has the highest all time tweets
  • 21. Initially, BlackBerry tweets showed 100% negative sentiment Culprit was the word “lack”—it was removed However, removing it reduced negative sentiment for other brands by 2 –3 % An interesting error led to BlackBerry having 100% negative sentiment
  • 22. Track brands’ managed twitter accounts and conversations to measure engagement Which brands have better engagement with users and why? Track general message of tweets Are tweets of a brand mainly about sales, reviews, complaints, or news? Network analysis to identify users with high centrality and influence Which users have high influence and what are they tweeting about my brand? Geospatial analysis of tweets Are there differences in brand buzz, sentiment, and engagement across regions? Where do we go from here?
  • 23. Code available on GitHub: https://github.com/eugeneyan/Twitter-SMA Python script to download tweets in JSON format Python scripts to convert tweets from JSON to CSV (with & without regular expressions filtering) R script and sentiment analysis list of words R script and sentiment analysis list of words to reproduce BlackBerry error