SlideShare une entreprise Scribd logo
1  sur  28
Télécharger pour lire hors ligne
Taras Zagibalov
             T.Zagibalov@sussex.ac.uk


    PhD candidate at University of Sussex
                Brighton, UK
Ford Foundation International Fellowship fellow
Natural languages: Russian, English, Mandarin
          Programming: Java, Prolog


                  Taras Zagibalov© 2009
Unsupervised Sentiment Analysis


     Listening to the Word of Mouth



                                          What is it?
                                  How does it work?
                                 How can it be used?


               Taras Zagibalov© 2009
Outline

   What is Sentiment Analysis
   Application of Sentiment Analysis
   Who's in the business?
   Unsolved Problems
   Why unsupervised?
   Is it effective?



                       Taras Zagibalov© 2009
Sentiment Analysis



  Sentiment Analysis (or Opinion Mining) is a
  relatively new research area in Information
 Retrieval and Natural Language Processing,
which is concerned not with a document's topic,
      but with what opinion it expresses



                  Taras Zagibalov© 2009
What is Sentiment Analysis

     Subjectivity Classification
     Orientation Detection
     Opinion Holder and Target Extraction
     Feature-Based Opinion Mining




                 Taras Zagibalov© 2009
What is Sentiment Analysis

     Subjectivity Classification
     Orientation Detection
     Opinion Holder and Target Extraction
     quot;Feature-Based Opinion Miningquot;


               A car has four wheels.
                                 vs
                    It's a good car.

                Taras Zagibalov© 2009
What is Sentiment Analysis

     Subjectivity Classification
     Orientation Detection
     Opinion Holder and Target Extraction
     quot;Feature-Based Opinion Miningquot;


                     It's a good car.
                                  vs
                       It's a bad car.

                 Taras Zagibalov© 2009
What is Sentiment Analysis

     Subjectivity Classification
     Orientation Detection
     Opinion Holder and Target Extraction
     quot;Feature-Based Opinion Miningquot;


               Ian says it's a good car.




                 Taras Zagibalov© 2009
What is Sentiment Analysis

     Subjectivity Classification
     Orientation Detection
     Opinion Holder and Target Extraction
     quot;Feature-Based Opinion Miningquot;


      The wheels are good, but all the rest is just
       unusable.


                 Taras Zagibalov© 2009
Application of Sentiment
         Analysis

     Where opinions can be found?

    News feeds (Google, Yahoo, Reuters etc)
    Blogs (LJ, Technorati etc)
    Social Networks (Twitter, Facebook...)
    Customer review sites (Amazon, eBay...)




               Taras Zagibalov© 2009
Application of Sentiment
         Analysis

    Marketing Research
            Product Reviews Analysis
            Brand Tracking
            Influence Analysis
    Public Opinion Tracking
    Customer correspondence analysis




                 Taras Zagibalov© 2009
Application of Sentiment
         Analysis

     What questions can be answered by
      Sentiment analysis system?
    What do customers think about our product?
    Which of our customers are unsatisfied?
    What features of our product are the worst?
    Who and how influences our image?
    What is public reaction to (some event or
      some person)?
    and so on...
               Taras Zagibalov© 2009
Example 1

On-line (blogs, mass-media) monitoring of a product
promotion campaigns
   10

    9

    8

    7

    6

    5

    4

    3

    2

    1

    0
                    A                           B


Promotional campaign A is successful as most of on-line
reviews are positive.
Promotional campaign B needs immediate actions as most of
on-line reviews are negative.
                        Taras Zagibalov© 2009
Example 2

New product release as it mirrored in customer on-line
reviews
   8

   7

   6

   5

   4

   3

   2

   1

   0
                    A                           B




(A) Product release and add campaign is quite effective as
public opinion is mostly positive. But the sentiment changes as
sales grow (B), more people are unsatisfied and it needs to be
analysed (probably some quality-related issues)
                        Taras Zagibalov© 2009
Example 3

Influence analysis by tracking blogs
   9

   8

   7

   6

   5

   4

   3

   2

   1

   0
                    A                           B



(A) Negative review in a newspaper does not affect a generally
positive sentiment towards a product, although a positive
review in a magazine (B) is quite effective.


                        Taras Zagibalov© 2009
Who's in the business?

    BrandWatch
    Istrategy Labs
    Cataphora
    Scoutlabs
    Lexalytics
    Infonic
    Attensity
    Open Dover
    ...          Taras Zagibalov© 2009
What's the technology?

   Machine Learning
       Manually tagged training data sets
       User-tagged training data sets (“thumbs up” and the
        “ five stars”)
   Knowledge-based Approaches
       Manually created word-lists
       Generic word-lists (like SentiWordNet or sentiment
        vocabularies)
   Manual Processing
                         Taras Zagibalov© 2009
Unsolved Problems

   Domain-dependency
   Unpredictable evaluation language
   Language-dependency




                     Taras Zagibalov© 2009
Unsolved Problems

   Domain-dependency
   Unpredictable evaluation language
   Language-dependency


      quot;The plot was unpredictablequot;
      vs
      quot;the steering was unpredictablequot;


                       Taras Zagibalov© 2009
Unsolved Problems

   Domain-dependency
   Unpredictable evaluation language
   Language-dependency


    “good” == “bad” in eBay
    “3G” (technology for mobile phones) == “good”



                        Taras Zagibalov© 2009
Unsolved Problems

   Domain-dependency
   Unpredictable evaluation language
   Language-dependency


    Culture-related issues (“good” <> “ 好” )
    Language-related issues (SVO vs SOV)



                      Taras Zagibalov© 2009
Why unsupervised?

   Cross-Domain applicability
   Multi-Lingual applicability
   Cheap Start




               Taras Zagibalov© 2009
Why unsupervised?

   Cross-Domain applicability
   Multi-Lingual applicability
   Cheap Start

     No expensive human annotation needed:
     all information is found in the documents
     which needed to be processed.
     All extracted information is domain-
     specific and free from noise produced by
     “generic” word lists and wordnets.
               Taras Zagibalov© 2009
Why unsupervised?

   Cross-Domain applicability
   Multi-Lingual applicability
   Cheap Start

     Unsupervised systems, being data-
     independent, can be easily ported to
     almost any language.




              Taras Zagibalov© 2009
Why unsupervised?

   Cross-Domain applicability
   Multi-Lingual applicability
   Cheap Start

     Once an unsupervised system is
     developed it can be applied to new data
     almost immediately saving costs of data
     labelling and/or rules (word-lists) writing
     up.

               Taras Zagibalov© 2009
Is it effective?

   The unsupervised approach was tested on
    different language corpora (English, Simplified
    Chinese, Traditional Chinese, Japanese) and in
    many cases compared reasonably well with
    supervised methods.
   Results were presented on some major
    international scientific conferences (ACL,
    IJCNLP, COLING, NTCIR).


                      Taras Zagibalov© 2009
Is it effective?

    The approach can be easily combined with
    supervised techniques:
   Unsupervised system can provide initial data
    for in-depth research of the data (building up
    word-lists and rule-sets)
   Automatically extracted information can be
    used for training machine learning systems.



                      Taras Zagibalov© 2009
Conclusion

   Unsupervised Sentiment Analysis is an efficient
    instument of keeping track of public opinion in
    different domains and languages.
   It can be used as an entry point to a new
    domain or language.
   It can be combined with supervised methods to
    increase accuracy.



                      Taras Zagibalov© 2009

Contenu connexe

En vedette

SentiCheNews - Sentiment Analysis on Newspapers and Tweets
SentiCheNews - Sentiment Analysis on Newspapers and TweetsSentiCheNews - Sentiment Analysis on Newspapers and Tweets
SentiCheNews - Sentiment Analysis on Newspapers and Tweets🧑‍💻 Manuel Coppotelli
 
Best Practices for Sentiment Analysis Webinar
Best Practices for Sentiment Analysis Webinar Best Practices for Sentiment Analysis Webinar
Best Practices for Sentiment Analysis Webinar Mechanical Turk
 
Sentiment Analysis Using Hybrid Structure of Machine Learning Algorithms
Sentiment Analysis Using Hybrid Structure of Machine Learning AlgorithmsSentiment Analysis Using Hybrid Structure of Machine Learning Algorithms
Sentiment Analysis Using Hybrid Structure of Machine Learning AlgorithmsSangeeth Nagarajan
 
Sentiment Analysis with NVivo 11 Plus
Sentiment Analysis with NVivo 11 PlusSentiment Analysis with NVivo 11 Plus
Sentiment Analysis with NVivo 11 PlusShalin Hai-Jew
 
Practical Sentiment Analysis
Practical Sentiment AnalysisPractical Sentiment Analysis
Practical Sentiment AnalysisPeople Pattern
 
Can Deep Learning solve the Sentiment Analysis Problem
Can Deep Learning solve the Sentiment Analysis ProblemCan Deep Learning solve the Sentiment Analysis Problem
Can Deep Learning solve the Sentiment Analysis ProblemMark Cieliebak
 
Sentiment Analysis via R Programming
Sentiment Analysis via R ProgrammingSentiment Analysis via R Programming
Sentiment Analysis via R ProgrammingSkillspeed
 
Sentiment analysis using naive bayes classifier
Sentiment analysis using naive bayes classifier Sentiment analysis using naive bayes classifier
Sentiment analysis using naive bayes classifier Dev Sahu
 
Sentiment analysis-by-nltk
Sentiment analysis-by-nltkSentiment analysis-by-nltk
Sentiment analysis-by-nltkWei-Ting Kuo
 
MTech Seminar Presentation [IIT-Bombay]
MTech Seminar Presentation [IIT-Bombay]MTech Seminar Presentation [IIT-Bombay]
MTech Seminar Presentation [IIT-Bombay]Sagar Ahire
 
Blockchain for Business
Blockchain for BusinessBlockchain for Business
Blockchain for BusinessFloyd DCosta
 
Blockchain: the solution for transparency in product supply chains
Blockchain: the solution for transparency in product supply chainsBlockchain: the solution for transparency in product supply chains
Blockchain: the solution for transparency in product supply chainsJamie Sandhu
 
Blockchain in Supply Chain Management: Fad or Potential?
Blockchain in Supply Chain Management: Fad or Potential?Blockchain in Supply Chain Management: Fad or Potential?
Blockchain in Supply Chain Management: Fad or Potential?Xeneta
 
Smart Cities are the Internet of Things
Smart Cities are the Internet of ThingsSmart Cities are the Internet of Things
Smart Cities are the Internet of Thingszdshelby
 
Internet of Things for Smart Cities
Internet of Things for Smart CitiesInternet of Things for Smart Cities
Internet of Things for Smart CitiesMphasis
 
Social media analysis in R using twitter API
Social media analysis in R using twitter API Social media analysis in R using twitter API
Social media analysis in R using twitter API Mohd Shadab Alam
 
Sentiment analysis of tweets
Sentiment analysis of tweetsSentiment analysis of tweets
Sentiment analysis of tweetsVasu Jain
 
Deep Learning for Natural Language Processing
Deep Learning for Natural Language ProcessingDeep Learning for Natural Language Processing
Deep Learning for Natural Language ProcessingDevashish Shanker
 

En vedette (19)

SentiCheNews - Sentiment Analysis on Newspapers and Tweets
SentiCheNews - Sentiment Analysis on Newspapers and TweetsSentiCheNews - Sentiment Analysis on Newspapers and Tweets
SentiCheNews - Sentiment Analysis on Newspapers and Tweets
 
Best Practices for Sentiment Analysis Webinar
Best Practices for Sentiment Analysis Webinar Best Practices for Sentiment Analysis Webinar
Best Practices for Sentiment Analysis Webinar
 
Sentiment Analysis Using Hybrid Structure of Machine Learning Algorithms
Sentiment Analysis Using Hybrid Structure of Machine Learning AlgorithmsSentiment Analysis Using Hybrid Structure of Machine Learning Algorithms
Sentiment Analysis Using Hybrid Structure of Machine Learning Algorithms
 
Sentiment Analysis with NVivo 11 Plus
Sentiment Analysis with NVivo 11 PlusSentiment Analysis with NVivo 11 Plus
Sentiment Analysis with NVivo 11 Plus
 
Practical Sentiment Analysis
Practical Sentiment AnalysisPractical Sentiment Analysis
Practical Sentiment Analysis
 
Can Deep Learning solve the Sentiment Analysis Problem
Can Deep Learning solve the Sentiment Analysis ProblemCan Deep Learning solve the Sentiment Analysis Problem
Can Deep Learning solve the Sentiment Analysis Problem
 
Sentiment Analysis via R Programming
Sentiment Analysis via R ProgrammingSentiment Analysis via R Programming
Sentiment Analysis via R Programming
 
Sentiment analysis using naive bayes classifier
Sentiment analysis using naive bayes classifier Sentiment analysis using naive bayes classifier
Sentiment analysis using naive bayes classifier
 
Sentiment analysis-by-nltk
Sentiment analysis-by-nltkSentiment analysis-by-nltk
Sentiment analysis-by-nltk
 
MTech Seminar Presentation [IIT-Bombay]
MTech Seminar Presentation [IIT-Bombay]MTech Seminar Presentation [IIT-Bombay]
MTech Seminar Presentation [IIT-Bombay]
 
Blockchain for Business
Blockchain for BusinessBlockchain for Business
Blockchain for Business
 
Blockchain: the solution for transparency in product supply chains
Blockchain: the solution for transparency in product supply chainsBlockchain: the solution for transparency in product supply chains
Blockchain: the solution for transparency in product supply chains
 
Blockchain in Supply Chain Management: Fad or Potential?
Blockchain in Supply Chain Management: Fad or Potential?Blockchain in Supply Chain Management: Fad or Potential?
Blockchain in Supply Chain Management: Fad or Potential?
 
Smart Cities are the Internet of Things
Smart Cities are the Internet of ThingsSmart Cities are the Internet of Things
Smart Cities are the Internet of Things
 
Internet of Things for Smart Cities
Internet of Things for Smart CitiesInternet of Things for Smart Cities
Internet of Things for Smart Cities
 
Green Computing
Green ComputingGreen Computing
Green Computing
 
Social media analysis in R using twitter API
Social media analysis in R using twitter API Social media analysis in R using twitter API
Social media analysis in R using twitter API
 
Sentiment analysis of tweets
Sentiment analysis of tweetsSentiment analysis of tweets
Sentiment analysis of tweets
 
Deep Learning for Natural Language Processing
Deep Learning for Natural Language ProcessingDeep Learning for Natural Language Processing
Deep Learning for Natural Language Processing
 

Similaire à Unsupervised Sentiment Analysis

How to manage test automation complexity
How to manage test automation complexityHow to manage test automation complexity
How to manage test automation complexityDana Aonofriesei
 
Competitive Intelligence
Competitive  IntelligenceCompetitive  Intelligence
Competitive IntelligencePallav L
 
State of Automatic Speech Recognition
State of Automatic Speech RecognitionState of Automatic Speech Recognition
State of Automatic Speech Recognition3Play Media
 
User Testing For The Rest Of Us
User Testing For The Rest Of UsUser Testing For The Rest Of Us
User Testing For The Rest Of UsLisa Herrod
 
Mind Of An Analyst- Jennifer Vessenmeyer
Mind Of An Analyst- Jennifer VessenmeyerMind Of An Analyst- Jennifer Vessenmeyer
Mind Of An Analyst- Jennifer VessenmeyerOnline Marketing Summit
 
Site Search Analytics
Site Search Analytics Site Search Analytics
Site Search Analytics Stefan Thies
 
Sprinkle on Just Enough Process
Sprinkle on Just Enough ProcessSprinkle on Just Enough Process
Sprinkle on Just Enough ProcessTechWell
 
What’s Your Leadership IQ?
What’s Your Leadership IQ?What’s Your Leadership IQ?
What’s Your Leadership IQ?TechWell
 
Laptop brand awareness
Laptop brand awarenessLaptop brand awareness
Laptop brand awarenessProlay Ray
 
IRJET- Fake Review Detection using Opinion Mining
IRJET- Fake Review Detection using Opinion MiningIRJET- Fake Review Detection using Opinion Mining
IRJET- Fake Review Detection using Opinion MiningIRJET Journal
 
Localization of iPhone Apps
Localization of iPhone AppsLocalization of iPhone Apps
Localization of iPhone Appskeywords
 
Online Community Best Practices Final
Online Community Best Practices FinalOnline Community Best Practices Final
Online Community Best Practices FinalRoger Pilney
 
Weak Supervision.pdf
Weak Supervision.pdfWeak Supervision.pdf
Weak Supervision.pdfStephenLeo7
 
Why the h# should I use Appium with React Native
Why the h# should I use Appium with React NativeWhy the h# should I use Appium with React Native
Why the h# should I use Appium with React NativeWim Selles
 
Crawford Upa09 Consumer Health Information Seeking
Crawford Upa09 Consumer Health Information SeekingCrawford Upa09 Consumer Health Information Seeking
Crawford Upa09 Consumer Health Information SeekingKath Straub
 
Software Development Quantified
Software Development QuantifiedSoftware Development Quantified
Software Development QuantifiedRiaz Virani
 
7 Factors to Consider When Building Your Mobile Strategy
7 Factors to Consider When Building Your Mobile Strategy7 Factors to Consider When Building Your Mobile Strategy
7 Factors to Consider When Building Your Mobile StrategyiTexico
 

Similaire à Unsupervised Sentiment Analysis (20)

How to manage test automation complexity
How to manage test automation complexityHow to manage test automation complexity
How to manage test automation complexity
 
Competitive Intelligence
Competitive  IntelligenceCompetitive  Intelligence
Competitive Intelligence
 
Usability
UsabilityUsability
Usability
 
Michael Kowalski
Michael KowalskiMichael Kowalski
Michael Kowalski
 
State of Automatic Speech Recognition
State of Automatic Speech RecognitionState of Automatic Speech Recognition
State of Automatic Speech Recognition
 
User Testing For The Rest Of Us
User Testing For The Rest Of UsUser Testing For The Rest Of Us
User Testing For The Rest Of Us
 
Mind Of An Analyst- Jennifer Vessenmeyer
Mind Of An Analyst- Jennifer VessenmeyerMind Of An Analyst- Jennifer Vessenmeyer
Mind Of An Analyst- Jennifer Vessenmeyer
 
Site Search Analytics
Site Search Analytics Site Search Analytics
Site Search Analytics
 
Sprinkle on Just Enough Process
Sprinkle on Just Enough ProcessSprinkle on Just Enough Process
Sprinkle on Just Enough Process
 
What’s Your Leadership IQ?
What’s Your Leadership IQ?What’s Your Leadership IQ?
What’s Your Leadership IQ?
 
Programming trends
Programming trendsProgramming trends
Programming trends
 
Laptop brand awareness
Laptop brand awarenessLaptop brand awareness
Laptop brand awareness
 
IRJET- Fake Review Detection using Opinion Mining
IRJET- Fake Review Detection using Opinion MiningIRJET- Fake Review Detection using Opinion Mining
IRJET- Fake Review Detection using Opinion Mining
 
Localization of iPhone Apps
Localization of iPhone AppsLocalization of iPhone Apps
Localization of iPhone Apps
 
Online Community Best Practices Final
Online Community Best Practices FinalOnline Community Best Practices Final
Online Community Best Practices Final
 
Weak Supervision.pdf
Weak Supervision.pdfWeak Supervision.pdf
Weak Supervision.pdf
 
Why the h# should I use Appium with React Native
Why the h# should I use Appium with React NativeWhy the h# should I use Appium with React Native
Why the h# should I use Appium with React Native
 
Crawford Upa09 Consumer Health Information Seeking
Crawford Upa09 Consumer Health Information SeekingCrawford Upa09 Consumer Health Information Seeking
Crawford Upa09 Consumer Health Information Seeking
 
Software Development Quantified
Software Development QuantifiedSoftware Development Quantified
Software Development Quantified
 
7 Factors to Consider When Building Your Mobile Strategy
7 Factors to Consider When Building Your Mobile Strategy7 Factors to Consider When Building Your Mobile Strategy
7 Factors to Consider When Building Your Mobile Strategy
 

Dernier

TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Jeffrey Haguewood
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...apidays
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsNanddeep Nachan
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontologyjohnbeverley2021
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...apidays
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelDeepika Singh
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...apidays
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdfSandro Moreira
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Victor Rentea
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 

Dernier (20)

TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 

Unsupervised Sentiment Analysis

  • 1. Taras Zagibalov T.Zagibalov@sussex.ac.uk PhD candidate at University of Sussex Brighton, UK Ford Foundation International Fellowship fellow Natural languages: Russian, English, Mandarin Programming: Java, Prolog Taras Zagibalov© 2009
  • 2. Unsupervised Sentiment Analysis Listening to the Word of Mouth What is it? How does it work? How can it be used? Taras Zagibalov© 2009
  • 3. Outline  What is Sentiment Analysis  Application of Sentiment Analysis  Who's in the business?  Unsolved Problems  Why unsupervised?  Is it effective? Taras Zagibalov© 2009
  • 4. Sentiment Analysis Sentiment Analysis (or Opinion Mining) is a relatively new research area in Information Retrieval and Natural Language Processing, which is concerned not with a document's topic, but with what opinion it expresses Taras Zagibalov© 2009
  • 5. What is Sentiment Analysis  Subjectivity Classification  Orientation Detection  Opinion Holder and Target Extraction  Feature-Based Opinion Mining Taras Zagibalov© 2009
  • 6. What is Sentiment Analysis  Subjectivity Classification  Orientation Detection  Opinion Holder and Target Extraction  quot;Feature-Based Opinion Miningquot; A car has four wheels. vs It's a good car. Taras Zagibalov© 2009
  • 7. What is Sentiment Analysis  Subjectivity Classification  Orientation Detection  Opinion Holder and Target Extraction  quot;Feature-Based Opinion Miningquot; It's a good car. vs It's a bad car. Taras Zagibalov© 2009
  • 8. What is Sentiment Analysis  Subjectivity Classification  Orientation Detection  Opinion Holder and Target Extraction  quot;Feature-Based Opinion Miningquot; Ian says it's a good car. Taras Zagibalov© 2009
  • 9. What is Sentiment Analysis  Subjectivity Classification  Orientation Detection  Opinion Holder and Target Extraction  quot;Feature-Based Opinion Miningquot; The wheels are good, but all the rest is just unusable. Taras Zagibalov© 2009
  • 10. Application of Sentiment Analysis Where opinions can be found?  News feeds (Google, Yahoo, Reuters etc)  Blogs (LJ, Technorati etc)  Social Networks (Twitter, Facebook...)  Customer review sites (Amazon, eBay...) Taras Zagibalov© 2009
  • 11. Application of Sentiment Analysis  Marketing Research  Product Reviews Analysis  Brand Tracking  Influence Analysis  Public Opinion Tracking  Customer correspondence analysis Taras Zagibalov© 2009
  • 12. Application of Sentiment Analysis What questions can be answered by Sentiment analysis system?  What do customers think about our product?  Which of our customers are unsatisfied?  What features of our product are the worst?  Who and how influences our image?  What is public reaction to (some event or some person)?  and so on... Taras Zagibalov© 2009
  • 13. Example 1 On-line (blogs, mass-media) monitoring of a product promotion campaigns 10 9 8 7 6 5 4 3 2 1 0 A B Promotional campaign A is successful as most of on-line reviews are positive. Promotional campaign B needs immediate actions as most of on-line reviews are negative. Taras Zagibalov© 2009
  • 14. Example 2 New product release as it mirrored in customer on-line reviews 8 7 6 5 4 3 2 1 0 A B (A) Product release and add campaign is quite effective as public opinion is mostly positive. But the sentiment changes as sales grow (B), more people are unsatisfied and it needs to be analysed (probably some quality-related issues) Taras Zagibalov© 2009
  • 15. Example 3 Influence analysis by tracking blogs 9 8 7 6 5 4 3 2 1 0 A B (A) Negative review in a newspaper does not affect a generally positive sentiment towards a product, although a positive review in a magazine (B) is quite effective. Taras Zagibalov© 2009
  • 16. Who's in the business?  BrandWatch  Istrategy Labs  Cataphora  Scoutlabs  Lexalytics  Infonic  Attensity  Open Dover  ... Taras Zagibalov© 2009
  • 17. What's the technology?  Machine Learning  Manually tagged training data sets  User-tagged training data sets (“thumbs up” and the “ five stars”)  Knowledge-based Approaches  Manually created word-lists  Generic word-lists (like SentiWordNet or sentiment vocabularies)  Manual Processing Taras Zagibalov© 2009
  • 18. Unsolved Problems  Domain-dependency  Unpredictable evaluation language  Language-dependency Taras Zagibalov© 2009
  • 19. Unsolved Problems  Domain-dependency  Unpredictable evaluation language  Language-dependency quot;The plot was unpredictablequot; vs quot;the steering was unpredictablequot; Taras Zagibalov© 2009
  • 20. Unsolved Problems  Domain-dependency  Unpredictable evaluation language  Language-dependency “good” == “bad” in eBay “3G” (technology for mobile phones) == “good” Taras Zagibalov© 2009
  • 21. Unsolved Problems  Domain-dependency  Unpredictable evaluation language  Language-dependency Culture-related issues (“good” <> “ 好” ) Language-related issues (SVO vs SOV) Taras Zagibalov© 2009
  • 22. Why unsupervised?  Cross-Domain applicability  Multi-Lingual applicability  Cheap Start Taras Zagibalov© 2009
  • 23. Why unsupervised?  Cross-Domain applicability  Multi-Lingual applicability  Cheap Start No expensive human annotation needed: all information is found in the documents which needed to be processed. All extracted information is domain- specific and free from noise produced by “generic” word lists and wordnets. Taras Zagibalov© 2009
  • 24. Why unsupervised?  Cross-Domain applicability  Multi-Lingual applicability  Cheap Start Unsupervised systems, being data- independent, can be easily ported to almost any language. Taras Zagibalov© 2009
  • 25. Why unsupervised?  Cross-Domain applicability  Multi-Lingual applicability  Cheap Start Once an unsupervised system is developed it can be applied to new data almost immediately saving costs of data labelling and/or rules (word-lists) writing up. Taras Zagibalov© 2009
  • 26. Is it effective?  The unsupervised approach was tested on different language corpora (English, Simplified Chinese, Traditional Chinese, Japanese) and in many cases compared reasonably well with supervised methods.  Results were presented on some major international scientific conferences (ACL, IJCNLP, COLING, NTCIR). Taras Zagibalov© 2009
  • 27. Is it effective? The approach can be easily combined with supervised techniques:  Unsupervised system can provide initial data for in-depth research of the data (building up word-lists and rule-sets)  Automatically extracted information can be used for training machine learning systems. Taras Zagibalov© 2009
  • 28. Conclusion  Unsupervised Sentiment Analysis is an efficient instument of keeping track of public opinion in different domains and languages.  It can be used as an entry point to a new domain or language.  It can be combined with supervised methods to increase accuracy. Taras Zagibalov© 2009