SlideShare une entreprise Scribd logo
Collective Sensing 
Opinion Mining 
Group members : 
Mahdi Kianirad , Maryam Daneshfar , Éva Balázs, Fabian Berndt 
1
• Introduction 
• History 
• Application 
• Methods and Approaches 
• Case Study 
2
Introduction 
• Sentiment analysis (also known as opinion mining) refers to the use of 
natural language processing, text analysis and computational linguistics to 
identify and extract subjective information in source materials. 
• Methods to extract, identify, or otherwise characterize the sentiment 
content of a text unit, Sometimes referred to as opinion mining, although 
the emphasis in this case is on extraction. 
• Aims to determine the attitude of a speaker or a writer with respect to 
some topic or the overall contextual polarity of a document. 
3
History 
• Early work in this area includes different methods for detecting the 
polarity of product reviews and movie reviews respectively (document 
level) 
• For example : Rotten Tomatoes movie review dataset 
Label the reviews : 
0 – negative 
1 – somewhat negative 
2 – neutral 
3 – somewhat positive 
4 – positive 
4
Application 
• Business 
• Politics/political science 
• Law/policy making 
• Sociology 
• Psychology 
5
Methods and Approaches 
• keyword spotting 
• lexical affinity 
• statistical methods (Machine learning) 
– latent semantic analysis 
assumes that words that are close in meaning will occur in similar pieces of text 
– support vector machines 
builds a model that assigns new examples into one category or the other (Positive or Negative) 
– bag of words 
(frequency of) occurrence of each word is used as a feature for training a classifier. Example usage: 
spam filtering 
• concept-level techniques 
6
Case Study 
7
• Introduction 
• Sentiment Analysis 
• Method 
• Using Bag of words 
– Disadvantages 
• Using keyword spotting 
– Advantages and Disadvantages 
• Validation 
• Conclusion 
8
Introduction 
• Twitter is a social networking and micro blogging service that allows users 
to post real time messages, called twits. Twits are restricted to 140 
characters in length. 
• We introduce two resources for pre-processing twitter data to determine 
the polarity of sentiment 
– Bag of Words 
– Keyword Spotting (Using Sad and happy emoticons) 
• We delineate our data to London bounding box 
– Most twitter users in Europe 
– The language is English 
For each of them we will show the results and compare these two methods. 
9
Sentiment Analysis 
• In order to text mining there are many solutions by many platforms 
– “Tm” Package for R 
– NLTK package for Python 
– LingPipe library for java 
– … 
• NLTK (Natural Language Toolkit) 
– a leading platform for building Python programs to work with human language data 
– easy-to-use 
– over 50 corpora and lexical resources 
– suite of text processing libraries for classification, tokenization, stemming, tagging, 
parsing, and semantic reasoning 
10
Method 
• NLTK 
Very strong to slicing sentences : 
Detect contractions , punctuation a and emoticons 
11
Using Bag of words 
• Defining to wordlist 
– Positive, consist of 2029 words 
– Negative , consist of 4783 words 
• Approach 
– For each Tokenized part of a twit check whether it is positive or negative 
– Rate the whole twit based of ratio of positive and negative words frequency 
– Each twit will get a rating between 0 and 1(Float number) 
12
13 
Sentiment (sa)
Disadvantages 
– Tend to generate false positive 
Near 70 % of records (from 10 million records) have got positive score 
(between 0.75 to 1) 
– Very dependent on definition of word bag 
Results will be deferent with another word bag 
– Can not detect implicit attitudes 
sarcasm or wit 
14
Complete Positive twits 
Low density areas were eliminated in order to have more readable map 
15
Complete Negative twits 
Low density areas were eliminated in order to have more readable map 
16
Using keyword spotting 
• Defining the keyword 
Olympic 
Low density areas were eliminated in order to have more readable map 
17
Using keyword spotting in opinion mining 
• Defining the key words 
Happy : :-) :) :o) :] :3 :c) … 
Sad : :-( :( :-< :-/ :/ … 
• Approach 
– For each list of Tokenized twits check whether it contains happy or sad emoticon 
– Rate the whole twit based of appearance of sad or happy emoticons 
18
Advantages 
– Less ambiguity of results in comparison with “bag of words” 
method 
Work only with twits that contain emoticon (explicit emption) 
Disadvantage 
– Smaller data to analyze 
750,000 records out of 10,000,000 records 
19
Happy emoticon twits  
Low density areas were eliminated in order to have more readable map 
20
Sad emoticon twits  
Low density areas were eliminated in order to have more readable map 
21
Validation 
• Validation is performed by user 
We examined 4000 twitts to determine whether the algorithm works correctly or not. 
It reveals that for bag of words method the algorithm work properly in 60% of cases 
No validation performed for emoticon spotting 
Conclusion 
• In opinion mining when different keywords are matter of concern the distribution 
of twitts will be different respectively but in term of mood analysis in an area the 
distribution and density of different moods (different moods in twitts) will depend 
on distribution of the whole population (in this case the concentration of positive 
and negative twits do not differ from each other ) 
22

Contenu connexe

Tendances

The sarcasm detection with the method of logistic regression
The sarcasm detection with the method of logistic regressionThe sarcasm detection with the method of logistic regression
The sarcasm detection with the method of logistic regressionEditorIJAERD
 
How Sentiment Analysis works
How Sentiment Analysis worksHow Sentiment Analysis works
How Sentiment Analysis worksCJ Jenkins
 
Sarcasm Detection: Achilles Heel of sentiment analysis
Sarcasm Detection: Achilles Heel of sentiment analysisSarcasm Detection: Achilles Heel of sentiment analysis
Sarcasm Detection: Achilles Heel of sentiment analysisAnuj Gupta
 
Data Acquisition for Sentiment Analysis
Data Acquisition for Sentiment AnalysisData Acquisition for Sentiment Analysis
Data Acquisition for Sentiment AnalysisAli BELCAID
 
Welocalize EAMT 2014 Presentation Assumptions, Expectations and Outliers in P...
Welocalize EAMT 2014 Presentation Assumptions, Expectations and Outliers in P...Welocalize EAMT 2014 Presentation Assumptions, Expectations and Outliers in P...
Welocalize EAMT 2014 Presentation Assumptions, Expectations and Outliers in P...Welocalize
 
Sentiment Analysis
Sentiment AnalysisSentiment Analysis
Sentiment Analysisharit66
 
Approaches to Sentiment Analysis
Approaches to Sentiment AnalysisApproaches to Sentiment Analysis
Approaches to Sentiment AnalysisNihar Suryawanshi
 
Big Data & Sentiment Analysis
Big Data & Sentiment AnalysisBig Data & Sentiment Analysis
Big Data & Sentiment AnalysisMichel Bruley
 
Sarcasm Detection and User Behaviour Analysis
Sarcasm Detection and User Behaviour AnalysisSarcasm Detection and User Behaviour Analysis
Sarcasm Detection and User Behaviour Analysisrahulmonikasharma
 
Best Practices for Sentiment Analysis Webinar
Best Practices for Sentiment Analysis Webinar Best Practices for Sentiment Analysis Webinar
Best Practices for Sentiment Analysis Webinar Mechanical Turk
 
IRJET- Sentiment Analysis of Customer Reviews on Laptop Products for Flip...
IRJET-  	  Sentiment Analysis of Customer Reviews on Laptop Products for Flip...IRJET-  	  Sentiment Analysis of Customer Reviews on Laptop Products for Flip...
IRJET- Sentiment Analysis of Customer Reviews on Laptop Products for Flip...IRJET Journal
 
Sentiment analyzer and opinion mining
Sentiment analyzer and opinion miningSentiment analyzer and opinion mining
Sentiment analyzer and opinion miningAnkush Mehta
 
Reaction Paper Discussing Articles in Fields of Outlier Detection & Sentiment...
Reaction Paper Discussing Articles in Fields of Outlier Detection & Sentiment...Reaction Paper Discussing Articles in Fields of Outlier Detection & Sentiment...
Reaction Paper Discussing Articles in Fields of Outlier Detection & Sentiment...Khan Mostafa
 
Introduction to Sentiment Analysis
Introduction to Sentiment AnalysisIntroduction to Sentiment Analysis
Introduction to Sentiment AnalysisMakrand Patil
 
RCOMM 2011 - Sentiment Classification with RapidMiner
RCOMM 2011 - Sentiment Classification with RapidMinerRCOMM 2011 - Sentiment Classification with RapidMiner
RCOMM 2011 - Sentiment Classification with RapidMinerbohanairl
 

Tendances (18)

The sarcasm detection with the method of logistic regression
The sarcasm detection with the method of logistic regressionThe sarcasm detection with the method of logistic regression
The sarcasm detection with the method of logistic regression
 
sentiment analysis
sentiment analysis sentiment analysis
sentiment analysis
 
How Sentiment Analysis works
How Sentiment Analysis worksHow Sentiment Analysis works
How Sentiment Analysis works
 
Sarcasm Detection: Achilles Heel of sentiment analysis
Sarcasm Detection: Achilles Heel of sentiment analysisSarcasm Detection: Achilles Heel of sentiment analysis
Sarcasm Detection: Achilles Heel of sentiment analysis
 
Evaluation Datasets for Twitter Sentiment Analysis: A survey and a new datase...
Evaluation Datasets for Twitter Sentiment Analysis: A survey and a new datase...Evaluation Datasets for Twitter Sentiment Analysis: A survey and a new datase...
Evaluation Datasets for Twitter Sentiment Analysis: A survey and a new datase...
 
Amazon seniment
Amazon senimentAmazon seniment
Amazon seniment
 
Data Acquisition for Sentiment Analysis
Data Acquisition for Sentiment AnalysisData Acquisition for Sentiment Analysis
Data Acquisition for Sentiment Analysis
 
Welocalize EAMT 2014 Presentation Assumptions, Expectations and Outliers in P...
Welocalize EAMT 2014 Presentation Assumptions, Expectations and Outliers in P...Welocalize EAMT 2014 Presentation Assumptions, Expectations and Outliers in P...
Welocalize EAMT 2014 Presentation Assumptions, Expectations and Outliers in P...
 
Sentiment Analysis
Sentiment AnalysisSentiment Analysis
Sentiment Analysis
 
Approaches to Sentiment Analysis
Approaches to Sentiment AnalysisApproaches to Sentiment Analysis
Approaches to Sentiment Analysis
 
Big Data & Sentiment Analysis
Big Data & Sentiment AnalysisBig Data & Sentiment Analysis
Big Data & Sentiment Analysis
 
Sarcasm Detection and User Behaviour Analysis
Sarcasm Detection and User Behaviour AnalysisSarcasm Detection and User Behaviour Analysis
Sarcasm Detection and User Behaviour Analysis
 
Best Practices for Sentiment Analysis Webinar
Best Practices for Sentiment Analysis Webinar Best Practices for Sentiment Analysis Webinar
Best Practices for Sentiment Analysis Webinar
 
IRJET- Sentiment Analysis of Customer Reviews on Laptop Products for Flip...
IRJET-  	  Sentiment Analysis of Customer Reviews on Laptop Products for Flip...IRJET-  	  Sentiment Analysis of Customer Reviews on Laptop Products for Flip...
IRJET- Sentiment Analysis of Customer Reviews on Laptop Products for Flip...
 
Sentiment analyzer and opinion mining
Sentiment analyzer and opinion miningSentiment analyzer and opinion mining
Sentiment analyzer and opinion mining
 
Reaction Paper Discussing Articles in Fields of Outlier Detection & Sentiment...
Reaction Paper Discussing Articles in Fields of Outlier Detection & Sentiment...Reaction Paper Discussing Articles in Fields of Outlier Detection & Sentiment...
Reaction Paper Discussing Articles in Fields of Outlier Detection & Sentiment...
 
Introduction to Sentiment Analysis
Introduction to Sentiment AnalysisIntroduction to Sentiment Analysis
Introduction to Sentiment Analysis
 
RCOMM 2011 - Sentiment Classification with RapidMiner
RCOMM 2011 - Sentiment Classification with RapidMinerRCOMM 2011 - Sentiment Classification with RapidMiner
RCOMM 2011 - Sentiment Classification with RapidMiner
 

Similaire à Collective sensing

Sentiment Analysis with NVivo 11 Plus
Sentiment Analysis with NVivo 11 PlusSentiment Analysis with NVivo 11 Plus
Sentiment Analysis with NVivo 11 PlusShalin Hai-Jew
 
Sld-Natural-Language-Processing-for-large-volumes-of-human-text-data-Sozzi-Br...
Sld-Natural-Language-Processing-for-large-volumes-of-human-text-data-Sozzi-Br...Sld-Natural-Language-Processing-for-large-volumes-of-human-text-data-Sozzi-Br...
Sld-Natural-Language-Processing-for-large-volumes-of-human-text-data-Sozzi-Br...hajinouha0
 
Glide - Extracting Meaning from Social Media - Keith Woods-Holder
Glide - Extracting Meaning from Social Media - Keith Woods-HolderGlide - Extracting Meaning from Social Media - Keith Woods-Holder
Glide - Extracting Meaning from Social Media - Keith Woods-HolderInfluence People
 
Multimodal Learning Analytics
Multimodal Learning AnalyticsMultimodal Learning Analytics
Multimodal Learning AnalyticsXavier Ochoa
 
NLP Bootcamp 2018 : Representation Learning of text for NLP
NLP Bootcamp 2018 : Representation Learning of text for NLPNLP Bootcamp 2018 : Representation Learning of text for NLP
NLP Bootcamp 2018 : Representation Learning of text for NLPAnuj Gupta
 
02 naive bays classifier and sentiment analysis
02 naive bays classifier and sentiment analysis02 naive bays classifier and sentiment analysis
02 naive bays classifier and sentiment analysisSubhas Kumar Ghosh
 
Data Analysis in Research for Social Study
Data Analysis in Research for Social StudyData Analysis in Research for Social Study
Data Analysis in Research for Social StudyLisaneworkSileshi
 
Statistics for linguistics
Statistics for linguisticsStatistics for linguistics
Statistics for linguisticsaiaioo
 
Do you Mean what you say? Recognizing Emotions.
Do you Mean what you say? Recognizing Emotions.Do you Mean what you say? Recognizing Emotions.
Do you Mean what you say? Recognizing Emotions.Sunil Kumar Kopparapu
 
Natural Language Processing: From Human-Robot Interaction to Alzheimer’s Dete...
Natural Language Processing: From Human-Robot Interaction to Alzheimer’s Dete...Natural Language Processing: From Human-Robot Interaction to Alzheimer’s Dete...
Natural Language Processing: From Human-Robot Interaction to Alzheimer’s Dete...Jekaterina Novikova, PhD
 
IRJET - Sentiment Analysis for Marketing and Product Review using a Hybrid Ap...
IRJET - Sentiment Analysis for Marketing and Product Review using a Hybrid Ap...IRJET - Sentiment Analysis for Marketing and Product Review using a Hybrid Ap...
IRJET - Sentiment Analysis for Marketing and Product Review using a Hybrid Ap...IRJET Journal
 
A Gentle Introduction to Text Analysis I
A Gentle Introduction to Text Analysis IA Gentle Introduction to Text Analysis I
A Gentle Introduction to Text Analysis IUNCResearchHub
 
Sentiment Analysis using Machine Learning.pdf
Sentiment Analysis using Machine Learning.pdfSentiment Analysis using Machine Learning.pdf
Sentiment Analysis using Machine Learning.pdfOmSatpathy
 
Presentation on Sentiment Analysis
Presentation on Sentiment AnalysisPresentation on Sentiment Analysis
Presentation on Sentiment AnalysisRebecca Williams
 

Similaire à Collective sensing (20)

Fypca4
Fypca4Fypca4
Fypca4
 
Fypca4
Fypca4Fypca4
Fypca4
 
Fypca4
Fypca4Fypca4
Fypca4
 
Sentiment Analysis with NVivo 11 Plus
Sentiment Analysis with NVivo 11 PlusSentiment Analysis with NVivo 11 Plus
Sentiment Analysis with NVivo 11 Plus
 
Sld-Natural-Language-Processing-for-large-volumes-of-human-text-data-Sozzi-Br...
Sld-Natural-Language-Processing-for-large-volumes-of-human-text-data-Sozzi-Br...Sld-Natural-Language-Processing-for-large-volumes-of-human-text-data-Sozzi-Br...
Sld-Natural-Language-Processing-for-large-volumes-of-human-text-data-Sozzi-Br...
 
Glide - Extracting Meaning from Social Media - Keith Woods-Holder
Glide - Extracting Meaning from Social Media - Keith Woods-HolderGlide - Extracting Meaning from Social Media - Keith Woods-Holder
Glide - Extracting Meaning from Social Media - Keith Woods-Holder
 
Multimodal Learning Analytics
Multimodal Learning AnalyticsMultimodal Learning Analytics
Multimodal Learning Analytics
 
NLP Bootcamp 2018 : Representation Learning of text for NLP
NLP Bootcamp 2018 : Representation Learning of text for NLPNLP Bootcamp 2018 : Representation Learning of text for NLP
NLP Bootcamp 2018 : Representation Learning of text for NLP
 
02 naive bays classifier and sentiment analysis
02 naive bays classifier and sentiment analysis02 naive bays classifier and sentiment analysis
02 naive bays classifier and sentiment analysis
 
Data Analysis in Research for Social Study
Data Analysis in Research for Social StudyData Analysis in Research for Social Study
Data Analysis in Research for Social Study
 
Statistics for linguistics
Statistics for linguisticsStatistics for linguistics
Statistics for linguistics
 
NLP Bootcamp
NLP BootcampNLP Bootcamp
NLP Bootcamp
 
Do you Mean what you say? Recognizing Emotions.
Do you Mean what you say? Recognizing Emotions.Do you Mean what you say? Recognizing Emotions.
Do you Mean what you say? Recognizing Emotions.
 
Natural Language Processing: From Human-Robot Interaction to Alzheimer’s Dete...
Natural Language Processing: From Human-Robot Interaction to Alzheimer’s Dete...Natural Language Processing: From Human-Robot Interaction to Alzheimer’s Dete...
Natural Language Processing: From Human-Robot Interaction to Alzheimer’s Dete...
 
Analyzing Movie Reviews : Machine learning project
Analyzing Movie Reviews : Machine learning projectAnalyzing Movie Reviews : Machine learning project
Analyzing Movie Reviews : Machine learning project
 
Sentiment Analysis.pptx
Sentiment Analysis.pptxSentiment Analysis.pptx
Sentiment Analysis.pptx
 
IRJET - Sentiment Analysis for Marketing and Product Review using a Hybrid Ap...
IRJET - Sentiment Analysis for Marketing and Product Review using a Hybrid Ap...IRJET - Sentiment Analysis for Marketing and Product Review using a Hybrid Ap...
IRJET - Sentiment Analysis for Marketing and Product Review using a Hybrid Ap...
 
A Gentle Introduction to Text Analysis I
A Gentle Introduction to Text Analysis IA Gentle Introduction to Text Analysis I
A Gentle Introduction to Text Analysis I
 
Sentiment Analysis using Machine Learning.pdf
Sentiment Analysis using Machine Learning.pdfSentiment Analysis using Machine Learning.pdf
Sentiment Analysis using Machine Learning.pdf
 
Presentation on Sentiment Analysis
Presentation on Sentiment AnalysisPresentation on Sentiment Analysis
Presentation on Sentiment Analysis
 

Dernier

2024 Q1 Tableau User Group Leader Quarterly Call
2024 Q1 Tableau User Group Leader Quarterly Call2024 Q1 Tableau User Group Leader Quarterly Call
2024 Q1 Tableau User Group Leader Quarterly Calllward7
 
Supply chain analytics to combat the effects of Ukraine-Russia-conflict
Supply chain analytics to combat the effects of Ukraine-Russia-conflictSupply chain analytics to combat the effects of Ukraine-Russia-conflict
Supply chain analytics to combat the effects of Ukraine-Russia-conflictJack Cole
 
how can i exchange pi coins for others currency like Bitcoin
how can i exchange pi coins for others currency like Bitcoinhow can i exchange pi coins for others currency like Bitcoin
how can i exchange pi coins for others currency like BitcoinDOT TECH
 
一比一原版纽卡斯尔大学毕业证成绩单如何办理
一比一原版纽卡斯尔大学毕业证成绩单如何办理一比一原版纽卡斯尔大学毕业证成绩单如何办理
一比一原版纽卡斯尔大学毕业证成绩单如何办理cyebo
 
Artificial_General_Intelligence__storm_gen_article.pdf
Artificial_General_Intelligence__storm_gen_article.pdfArtificial_General_Intelligence__storm_gen_article.pdf
Artificial_General_Intelligence__storm_gen_article.pdfscitechtalktv
 
Atlantic Grupa Case Study (Mintec Data AI)
Atlantic Grupa Case Study (Mintec Data AI)Atlantic Grupa Case Study (Mintec Data AI)
Atlantic Grupa Case Study (Mintec Data AI)Jon Hansen
 
AI Imagen for data-storytelling Infographics.pdf
AI Imagen for data-storytelling Infographics.pdfAI Imagen for data-storytelling Infographics.pdf
AI Imagen for data-storytelling Infographics.pdfMichaelSenkow
 
2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...
2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...
2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...elinavihriala
 
How I opened a fake bank account and didn't go to prison
How I opened a fake bank account and didn't go to prisonHow I opened a fake bank account and didn't go to prison
How I opened a fake bank account and didn't go to prisonPayment Village
 
一比一原版(Monash毕业证书)莫纳什大学毕业证成绩单如何办理
一比一原版(Monash毕业证书)莫纳什大学毕业证成绩单如何办理一比一原版(Monash毕业证书)莫纳什大学毕业证成绩单如何办理
一比一原版(Monash毕业证书)莫纳什大学毕业证成绩单如何办理pyhepag
 
Machine Learning For Career Growth..pptx
Machine Learning For Career Growth..pptxMachine Learning For Career Growth..pptx
Machine Learning For Career Growth..pptxbenishzehra469
 
一比一原版西悉尼大学毕业证成绩单如何办理
一比一原版西悉尼大学毕业证成绩单如何办理一比一原版西悉尼大学毕业证成绩单如何办理
一比一原版西悉尼大学毕业证成绩单如何办理pyhepag
 
一比一原版阿德莱德大学毕业证成绩单如何办理
一比一原版阿德莱德大学毕业证成绩单如何办理一比一原版阿德莱德大学毕业证成绩单如何办理
一比一原版阿德莱德大学毕业证成绩单如何办理pyhepag
 
Jpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization SampleJpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization SampleJames Polillo
 
How can I successfully sell my pi coins in Philippines?
How can I successfully sell my pi coins in Philippines?How can I successfully sell my pi coins in Philippines?
How can I successfully sell my pi coins in Philippines?DOT TECH
 
Business update Q1 2024 Lar España Real Estate SOCIMI
Business update Q1 2024 Lar España Real Estate SOCIMIBusiness update Q1 2024 Lar España Real Estate SOCIMI
Business update Q1 2024 Lar España Real Estate SOCIMIAlejandraGmez176757
 
MALL CUSTOMER SEGMENTATION USING K-MEANS CLUSTERING.pptx
MALL CUSTOMER SEGMENTATION USING K-MEANS CLUSTERING.pptxMALL CUSTOMER SEGMENTATION USING K-MEANS CLUSTERING.pptx
MALL CUSTOMER SEGMENTATION USING K-MEANS CLUSTERING.pptxNidaFaviankaNawawi
 
2024 Q2 Orange County (CA) Tableau User Group Meeting
2024 Q2 Orange County (CA) Tableau User Group Meeting2024 Q2 Orange County (CA) Tableau User Group Meeting
2024 Q2 Orange County (CA) Tableau User Group MeetingAlison Pitt
 

Dernier (20)

2024 Q1 Tableau User Group Leader Quarterly Call
2024 Q1 Tableau User Group Leader Quarterly Call2024 Q1 Tableau User Group Leader Quarterly Call
2024 Q1 Tableau User Group Leader Quarterly Call
 
Supply chain analytics to combat the effects of Ukraine-Russia-conflict
Supply chain analytics to combat the effects of Ukraine-Russia-conflictSupply chain analytics to combat the effects of Ukraine-Russia-conflict
Supply chain analytics to combat the effects of Ukraine-Russia-conflict
 
how can i exchange pi coins for others currency like Bitcoin
how can i exchange pi coins for others currency like Bitcoinhow can i exchange pi coins for others currency like Bitcoin
how can i exchange pi coins for others currency like Bitcoin
 
一比一原版纽卡斯尔大学毕业证成绩单如何办理
一比一原版纽卡斯尔大学毕业证成绩单如何办理一比一原版纽卡斯尔大学毕业证成绩单如何办理
一比一原版纽卡斯尔大学毕业证成绩单如何办理
 
Artificial_General_Intelligence__storm_gen_article.pdf
Artificial_General_Intelligence__storm_gen_article.pdfArtificial_General_Intelligence__storm_gen_article.pdf
Artificial_General_Intelligence__storm_gen_article.pdf
 
Atlantic Grupa Case Study (Mintec Data AI)
Atlantic Grupa Case Study (Mintec Data AI)Atlantic Grupa Case Study (Mintec Data AI)
Atlantic Grupa Case Study (Mintec Data AI)
 
AI Imagen for data-storytelling Infographics.pdf
AI Imagen for data-storytelling Infographics.pdfAI Imagen for data-storytelling Infographics.pdf
AI Imagen for data-storytelling Infographics.pdf
 
2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...
2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...
2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...
 
How I opened a fake bank account and didn't go to prison
How I opened a fake bank account and didn't go to prisonHow I opened a fake bank account and didn't go to prison
How I opened a fake bank account and didn't go to prison
 
一比一原版(Monash毕业证书)莫纳什大学毕业证成绩单如何办理
一比一原版(Monash毕业证书)莫纳什大学毕业证成绩单如何办理一比一原版(Monash毕业证书)莫纳什大学毕业证成绩单如何办理
一比一原版(Monash毕业证书)莫纳什大学毕业证成绩单如何办理
 
Machine Learning For Career Growth..pptx
Machine Learning For Career Growth..pptxMachine Learning For Career Growth..pptx
Machine Learning For Career Growth..pptx
 
一比一原版西悉尼大学毕业证成绩单如何办理
一比一原版西悉尼大学毕业证成绩单如何办理一比一原版西悉尼大学毕业证成绩单如何办理
一比一原版西悉尼大学毕业证成绩单如何办理
 
Abortion pills in Dammam Saudi Arabia// +966572737505 // buy cytotec
Abortion pills in Dammam Saudi Arabia// +966572737505 // buy cytotecAbortion pills in Dammam Saudi Arabia// +966572737505 // buy cytotec
Abortion pills in Dammam Saudi Arabia// +966572737505 // buy cytotec
 
一比一原版阿德莱德大学毕业证成绩单如何办理
一比一原版阿德莱德大学毕业证成绩单如何办理一比一原版阿德莱德大学毕业证成绩单如何办理
一比一原版阿德莱德大学毕业证成绩单如何办理
 
Jpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization SampleJpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization Sample
 
How can I successfully sell my pi coins in Philippines?
How can I successfully sell my pi coins in Philippines?How can I successfully sell my pi coins in Philippines?
How can I successfully sell my pi coins in Philippines?
 
Business update Q1 2024 Lar España Real Estate SOCIMI
Business update Q1 2024 Lar España Real Estate SOCIMIBusiness update Q1 2024 Lar España Real Estate SOCIMI
Business update Q1 2024 Lar España Real Estate SOCIMI
 
MALL CUSTOMER SEGMENTATION USING K-MEANS CLUSTERING.pptx
MALL CUSTOMER SEGMENTATION USING K-MEANS CLUSTERING.pptxMALL CUSTOMER SEGMENTATION USING K-MEANS CLUSTERING.pptx
MALL CUSTOMER SEGMENTATION USING K-MEANS CLUSTERING.pptx
 
2024 Q2 Orange County (CA) Tableau User Group Meeting
2024 Q2 Orange County (CA) Tableau User Group Meeting2024 Q2 Orange County (CA) Tableau User Group Meeting
2024 Q2 Orange County (CA) Tableau User Group Meeting
 
Slip-and-fall Injuries: Top Workers' Comp Claims
Slip-and-fall Injuries: Top Workers' Comp ClaimsSlip-and-fall Injuries: Top Workers' Comp Claims
Slip-and-fall Injuries: Top Workers' Comp Claims
 

Collective sensing

  • 1. Collective Sensing Opinion Mining Group members : Mahdi Kianirad , Maryam Daneshfar , Éva Balázs, Fabian Berndt 1
  • 2. • Introduction • History • Application • Methods and Approaches • Case Study 2
  • 3. Introduction • Sentiment analysis (also known as opinion mining) refers to the use of natural language processing, text analysis and computational linguistics to identify and extract subjective information in source materials. • Methods to extract, identify, or otherwise characterize the sentiment content of a text unit, Sometimes referred to as opinion mining, although the emphasis in this case is on extraction. • Aims to determine the attitude of a speaker or a writer with respect to some topic or the overall contextual polarity of a document. 3
  • 4. History • Early work in this area includes different methods for detecting the polarity of product reviews and movie reviews respectively (document level) • For example : Rotten Tomatoes movie review dataset Label the reviews : 0 – negative 1 – somewhat negative 2 – neutral 3 – somewhat positive 4 – positive 4
  • 5. Application • Business • Politics/political science • Law/policy making • Sociology • Psychology 5
  • 6. Methods and Approaches • keyword spotting • lexical affinity • statistical methods (Machine learning) – latent semantic analysis assumes that words that are close in meaning will occur in similar pieces of text – support vector machines builds a model that assigns new examples into one category or the other (Positive or Negative) – bag of words (frequency of) occurrence of each word is used as a feature for training a classifier. Example usage: spam filtering • concept-level techniques 6
  • 8. • Introduction • Sentiment Analysis • Method • Using Bag of words – Disadvantages • Using keyword spotting – Advantages and Disadvantages • Validation • Conclusion 8
  • 9. Introduction • Twitter is a social networking and micro blogging service that allows users to post real time messages, called twits. Twits are restricted to 140 characters in length. • We introduce two resources for pre-processing twitter data to determine the polarity of sentiment – Bag of Words – Keyword Spotting (Using Sad and happy emoticons) • We delineate our data to London bounding box – Most twitter users in Europe – The language is English For each of them we will show the results and compare these two methods. 9
  • 10. Sentiment Analysis • In order to text mining there are many solutions by many platforms – “Tm” Package for R – NLTK package for Python – LingPipe library for java – … • NLTK (Natural Language Toolkit) – a leading platform for building Python programs to work with human language data – easy-to-use – over 50 corpora and lexical resources – suite of text processing libraries for classification, tokenization, stemming, tagging, parsing, and semantic reasoning 10
  • 11. Method • NLTK Very strong to slicing sentences : Detect contractions , punctuation a and emoticons 11
  • 12. Using Bag of words • Defining to wordlist – Positive, consist of 2029 words – Negative , consist of 4783 words • Approach – For each Tokenized part of a twit check whether it is positive or negative – Rate the whole twit based of ratio of positive and negative words frequency – Each twit will get a rating between 0 and 1(Float number) 12
  • 14. Disadvantages – Tend to generate false positive Near 70 % of records (from 10 million records) have got positive score (between 0.75 to 1) – Very dependent on definition of word bag Results will be deferent with another word bag – Can not detect implicit attitudes sarcasm or wit 14
  • 15. Complete Positive twits Low density areas were eliminated in order to have more readable map 15
  • 16. Complete Negative twits Low density areas were eliminated in order to have more readable map 16
  • 17. Using keyword spotting • Defining the keyword Olympic Low density areas were eliminated in order to have more readable map 17
  • 18. Using keyword spotting in opinion mining • Defining the key words Happy : :-) :) :o) :] :3 :c) … Sad : :-( :( :-< :-/ :/ … • Approach – For each list of Tokenized twits check whether it contains happy or sad emoticon – Rate the whole twit based of appearance of sad or happy emoticons 18
  • 19. Advantages – Less ambiguity of results in comparison with “bag of words” method Work only with twits that contain emoticon (explicit emption) Disadvantage – Smaller data to analyze 750,000 records out of 10,000,000 records 19
  • 20. Happy emoticon twits  Low density areas were eliminated in order to have more readable map 20
  • 21. Sad emoticon twits  Low density areas were eliminated in order to have more readable map 21
  • 22. Validation • Validation is performed by user We examined 4000 twitts to determine whether the algorithm works correctly or not. It reveals that for bag of words method the algorithm work properly in 60% of cases No validation performed for emoticon spotting Conclusion • In opinion mining when different keywords are matter of concern the distribution of twitts will be different respectively but in term of mood analysis in an area the distribution and density of different moods (different moods in twitts) will depend on distribution of the whole population (in this case the concentration of positive and negative twits do not differ from each other ) 22