SlideShare une entreprise Scribd logo
1  sur  15
Télécharger pour lire hors ligne
Expressing Opinion Diversity

Andreea Bizău – Babeș Bolyai University,
Cluj-Napoca, Romania
Delia Rusu, Dunja Mladenić - Jožef Stefan
Institute, Ljubljana, Slovenia


                                  ailab.ijs.si
Motivation: Opinion Diversity




                       ailab.ijs.si
Overview

Terminology
Related work
Domain Driven Opinion Vocabulary
Use Case:
  Twitter Movie Comments
Conclusions and Future Work




                                   ailab.ijs.si
Terminology
opinion - a subjective expression of sentiments,
appraisals or feelings
opinion words - a set of keywords/phrases used
in expressing an opinion
orientation of an opinion word indicates whether
the opinion expressed is positive, negative or
neutral
the totality of opinion words forms an opinion
vocabulary


                                     ailab.ijs.si
Related Work – Opinion
                  Vocabulary
Dictionary based
  Esuli and Sebastiani, 2006 - SentiWordNet: three
  numerical scores (objective, positive, negative)
Corpus based
  Kanayama and Nasukawa, 2006 - context coherency
  (same polarity tend to appear successively)
  Jialin Pan et al, 2010 - feature bipartite graph modeling the
  relationship between domain-specific words and domain-
  independent words.
Dictionary and Corpus based
  Jijkoun et al, 2010 - dependency parsing on a set of
  relevant documents, extracting patterns of the form (clue
  word, syntactic context, target of sentiment)

                                                 ailab.ijs.si
Related Work – Opinion Analysis
Approaches:
  Hatzivassiloglou et al, 1997 – relevance of using
  connectives (conjunctions: and, or, but, etc)
  Kim and Hovy, 2004 – use word seed lists and WordNet
  synsets to determine the strength of the opinion orientation
  for the identified opinion words
  Gamon and Aue, 2005 – assign opinion orientation to
  candidate words, assuming that opinion terms with similar
  orientation tend to co-occur
Twitter Applications:
  Pang and Lee, 2002 – classifying movie reviews by overall
  document sentiment
  Asur and Huberman, 2010 – sentiments extracted from
  Twitter can be used to build a prediction model for box-
  office revenue

                                                ailab.ijs.si
Our Approach
                                                Domain-specific
         IMDb Movie reviews                     opinion
             (sample)                           vocabulary

                                                  Weird, odd,
              2 Clusters                             bad
                Weird,
                 odd       IMDb Movie reviews
                                                   amazing,
               amazing,      (Training data)       awesome,
               awesome                               perfect,
                                                    fantastic

Vocabulary                                                Twitter comments
                                                               analysis
             applied to
                           Movie tweets
                            (Test data)




                                                                ailab.ijs.si
Domain Driven Opinion
                      Vocabulary
Given a positive word seed list and
a negative word seed list, expand
the initial seed lists using synonymy                          Domain-specific
/ antonymy relations (WordNet)                                 opinion
                                                               vocabulary
   the initial words will be assigned a
   score of 1 for positive words and -1 for
   negative words                                                Weird, odd,
Given a corpus, parse and extract                                   bad
                                              2 Clusters
all adjectives and conjunctions –              Weird,
obtain a graph with two types of                odd               amazing,
relationships between nodes:                                      awesome,
                                              amazing,
   same context orientation (words            awesome               perfect,
   connected by and, or, nor) or                                   fantastic
   opposite context orientation (words
   connected by but, yet)
Clean the resulting set of words and
relationship graph by removing stop
words and self-reference relations
                                                           ailab.ijs.si
Domain Driven Opinion
                      Vocabulary
 Some of the characters are fictitious, but not grotesque
 synonym

  fictional
                                          fictitious
    real                                score(“fictitious”) = max(s_syn, s_ant)
                                        score(“fictitious”) = s_syn
 antonym                                score(“fictitious”) = - 0.3874 if f = 0.9

fictitious but grotesque – ContextOpposite relationship

fictitious but not grotesque – ContextSame relationship


                                                              ailab.ijs.si
Domain Driven Opinion
                   Vocabulary
4. Propagate scores through the graph, by
   determining for each word w
     Positivity score sPos
     Negativity score sNeg

      if relij is a ContextSame relation
          sPos(wi) += weigth(relij) * prevSPos(wj)
          sNeg (wi) += weigth(relij) * prevSNeg(wj)
      else if relij is a ContextOpposite relation
          sPos(wi) += weigth(relij) * prevSNeg(wj)
          sNeg (wi) += weigth(relij) * prevSPos(wj)



                                                      ailab.ijs.si
Use Case: Twitter Movie Reviews

domain specific document corpus of 27,886 IMDb
movie reviews
domain specific opinion vocabulary:
   9,318 words: 4,925 have a negative orientation and 4393
   have a positive orientation

            Inception (2010)                  Meet the Spartans (2008)
Positive words: good, great,           Positive words: funny, awesome,
awesome, amazing, favorite, fantastic, great
incredible, thrilling, different,
speechless                             Negative words: bad, stupid, dumb,
                                       weird, silly, common, ridiculous,
Negative words: bad, confusing,        terrible
weird, stupid, dumb, boring,
predictable, horrible, disappointing

                                                              ailab.ijs.si
Use Case: Twitter Movie Reviews

220,387 tweets crawled over a two month
interval, keyed on 84 movies
     Movie               Genre           Tweets   IMDb       Our
                                                  score     score
Inception (2010)      mystery, sci-fi,   19,256    8.9      66.52
                         thriller
  Megamind         animation, comedy,    8,109     7.3      67.71
    (2010)               family
 Unstoppable          drama, thriller    15,349    7        63.67
    (2010)
  Burlesque          drama, music,       1,244     6.2      70.78
    (2010)              romance
   Meet the           comedy, war          44      2.5      40.67
Spartans (2008)

                                                         ailab.ijs.si
Twitter comments
          analysis

•   Sentiment words
    distribution for a movie
•   Sentiment orientation
    evolution per week, day,
    hour
•   Movie comparison




       ailab.ijs.si
Conclusions and Future Work
identifying opinion diversity expressed within text,
with the aid of a domain-specific vocabulary
processing a corpus of IMDb movie reviews,
generated an opinion lexicon and analyzed a
different opinion source corpus, i.e. a tweet collection

further extend our algorithm to include opinion words
expressed by verbs and adverbs, as well as more
complex expressions
conduct experiments in order to
  determine the correlation between positive opinion words
  for a given movie and the IMDb movie rating
  evaluate the opinion lexicon directly

                                              ailab.ijs.si
Thank You for Your Attention!




                       ailab.ijs.si

Contenu connexe

Plus de RENDER project

Diversiweb2011 08 Mining Diverse Views from Related Articles - Ravali Pochamp...
Diversiweb2011 08 Mining Diverse Views from Related Articles - Ravali Pochamp...Diversiweb2011 08 Mining Diverse Views from Related Articles - Ravali Pochamp...
Diversiweb2011 08 Mining Diverse Views from Related Articles - Ravali Pochamp...RENDER project
 
Diversiweb2011 07 Approximate subgraph matching - Mitja Trampus
Diversiweb2011 07 Approximate subgraph matching - Mitja TrampusDiversiweb2011 07 Approximate subgraph matching - Mitja Trampus
Diversiweb2011 07 Approximate subgraph matching - Mitja TrampusRENDER project
 
Diversiweb2011 06 Faceted Approach To Diverse Query Processing - Devika P. Ma...
Diversiweb2011 06 Faceted Approach To Diverse Query Processing - Devika P. Ma...Diversiweb2011 06 Faceted Approach To Diverse Query Processing - Devika P. Ma...
Diversiweb2011 06 Faceted Approach To Diverse Query Processing - Devika P. Ma...RENDER project
 
Diversiweb2011 05 Scalable Detection of Sentiment-Based Contradictions - Mika...
Diversiweb2011 05 Scalable Detection of Sentiment-Based Contradictions - Mika...Diversiweb2011 05 Scalable Detection of Sentiment-Based Contradictions - Mika...
Diversiweb2011 05 Scalable Detection of Sentiment-Based Contradictions - Mika...RENDER project
 
Diversiweb2011 03 Towards a Knowledge Diversity Model - Denny Vrandecic
Diversiweb2011 03 Towards a Knowledge Diversity Model - Denny VrandecicDiversiweb2011 03 Towards a Knowledge Diversity Model - Denny Vrandecic
Diversiweb2011 03 Towards a Knowledge Diversity Model - Denny VrandecicRENDER project
 
Diversiweb2011 01 Opening - Elena Simperl
Diversiweb2011 01 Opening - Elena SimperlDiversiweb2011 01 Opening - Elena Simperl
Diversiweb2011 01 Opening - Elena SimperlRENDER project
 
Data Collection and Integration, Linked Data Management
Data Collection and Integration, Linked Data ManagementData Collection and Integration, Linked Data Management
Data Collection and Integration, Linked Data ManagementRENDER project
 
Render Project introduction and overview
Render Project introduction and overviewRender Project introduction and overview
Render Project introduction and overviewRENDER project
 

Plus de RENDER project (11)

Diversiweb2011 08 Mining Diverse Views from Related Articles - Ravali Pochamp...
Diversiweb2011 08 Mining Diverse Views from Related Articles - Ravali Pochamp...Diversiweb2011 08 Mining Diverse Views from Related Articles - Ravali Pochamp...
Diversiweb2011 08 Mining Diverse Views from Related Articles - Ravali Pochamp...
 
Diversiweb2011 07 Approximate subgraph matching - Mitja Trampus
Diversiweb2011 07 Approximate subgraph matching - Mitja TrampusDiversiweb2011 07 Approximate subgraph matching - Mitja Trampus
Diversiweb2011 07 Approximate subgraph matching - Mitja Trampus
 
Diversiweb2011 06 Faceted Approach To Diverse Query Processing - Devika P. Ma...
Diversiweb2011 06 Faceted Approach To Diverse Query Processing - Devika P. Ma...Diversiweb2011 06 Faceted Approach To Diverse Query Processing - Devika P. Ma...
Diversiweb2011 06 Faceted Approach To Diverse Query Processing - Devika P. Ma...
 
Diversiweb2011 05 Scalable Detection of Sentiment-Based Contradictions - Mika...
Diversiweb2011 05 Scalable Detection of Sentiment-Based Contradictions - Mika...Diversiweb2011 05 Scalable Detection of Sentiment-Based Contradictions - Mika...
Diversiweb2011 05 Scalable Detection of Sentiment-Based Contradictions - Mika...
 
Diversiweb2011 03 Towards a Knowledge Diversity Model - Denny Vrandecic
Diversiweb2011 03 Towards a Knowledge Diversity Model - Denny VrandecicDiversiweb2011 03 Towards a Knowledge Diversity Model - Denny Vrandecic
Diversiweb2011 03 Towards a Knowledge Diversity Model - Denny Vrandecic
 
Diversiweb2011 01 Opening - Elena Simperl
Diversiweb2011 01 Opening - Elena SimperlDiversiweb2011 01 Opening - Elena Simperl
Diversiweb2011 01 Opening - Elena Simperl
 
Data Collection and Integration, Linked Data Management
Data Collection and Integration, Linked Data ManagementData Collection and Integration, Linked Data Management
Data Collection and Integration, Linked Data Management
 
Diversity toolkit
Diversity toolkitDiversity toolkit
Diversity toolkit
 
RENDER Telefonica
RENDER TelefonicaRENDER Telefonica
RENDER Telefonica
 
Defining Diversity
Defining DiversityDefining Diversity
Defining Diversity
 
Render Project introduction and overview
Render Project introduction and overviewRender Project introduction and overview
Render Project introduction and overview
 

Dernier

EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWERMadyBayot
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsNanddeep Nachan
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native ApplicationsWSO2
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...Zilliz
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024The Digital Insurer
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...apidays
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businesspanagenda
 
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...apidays
 

Dernier (20)

EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
 

Diversiweb2011 04 Expressing Opinion Diversity - Delia Rusu

  • 1. Expressing Opinion Diversity Andreea Bizău – Babeș Bolyai University, Cluj-Napoca, Romania Delia Rusu, Dunja Mladenić - Jožef Stefan Institute, Ljubljana, Slovenia ailab.ijs.si
  • 3. Overview Terminology Related work Domain Driven Opinion Vocabulary Use Case: Twitter Movie Comments Conclusions and Future Work ailab.ijs.si
  • 4. Terminology opinion - a subjective expression of sentiments, appraisals or feelings opinion words - a set of keywords/phrases used in expressing an opinion orientation of an opinion word indicates whether the opinion expressed is positive, negative or neutral the totality of opinion words forms an opinion vocabulary ailab.ijs.si
  • 5. Related Work – Opinion Vocabulary Dictionary based Esuli and Sebastiani, 2006 - SentiWordNet: three numerical scores (objective, positive, negative) Corpus based Kanayama and Nasukawa, 2006 - context coherency (same polarity tend to appear successively) Jialin Pan et al, 2010 - feature bipartite graph modeling the relationship between domain-specific words and domain- independent words. Dictionary and Corpus based Jijkoun et al, 2010 - dependency parsing on a set of relevant documents, extracting patterns of the form (clue word, syntactic context, target of sentiment) ailab.ijs.si
  • 6. Related Work – Opinion Analysis Approaches: Hatzivassiloglou et al, 1997 – relevance of using connectives (conjunctions: and, or, but, etc) Kim and Hovy, 2004 – use word seed lists and WordNet synsets to determine the strength of the opinion orientation for the identified opinion words Gamon and Aue, 2005 – assign opinion orientation to candidate words, assuming that opinion terms with similar orientation tend to co-occur Twitter Applications: Pang and Lee, 2002 – classifying movie reviews by overall document sentiment Asur and Huberman, 2010 – sentiments extracted from Twitter can be used to build a prediction model for box- office revenue ailab.ijs.si
  • 7. Our Approach Domain-specific IMDb Movie reviews opinion (sample) vocabulary Weird, odd, 2 Clusters bad Weird, odd IMDb Movie reviews amazing, amazing, (Training data) awesome, awesome perfect, fantastic Vocabulary Twitter comments analysis applied to Movie tweets (Test data) ailab.ijs.si
  • 8. Domain Driven Opinion Vocabulary Given a positive word seed list and a negative word seed list, expand the initial seed lists using synonymy Domain-specific / antonymy relations (WordNet) opinion vocabulary the initial words will be assigned a score of 1 for positive words and -1 for negative words Weird, odd, Given a corpus, parse and extract bad 2 Clusters all adjectives and conjunctions – Weird, obtain a graph with two types of odd amazing, relationships between nodes: awesome, amazing, same context orientation (words awesome perfect, connected by and, or, nor) or fantastic opposite context orientation (words connected by but, yet) Clean the resulting set of words and relationship graph by removing stop words and self-reference relations ailab.ijs.si
  • 9. Domain Driven Opinion Vocabulary Some of the characters are fictitious, but not grotesque synonym fictional fictitious real score(“fictitious”) = max(s_syn, s_ant) score(“fictitious”) = s_syn antonym score(“fictitious”) = - 0.3874 if f = 0.9 fictitious but grotesque – ContextOpposite relationship fictitious but not grotesque – ContextSame relationship ailab.ijs.si
  • 10. Domain Driven Opinion Vocabulary 4. Propagate scores through the graph, by determining for each word w Positivity score sPos Negativity score sNeg if relij is a ContextSame relation sPos(wi) += weigth(relij) * prevSPos(wj) sNeg (wi) += weigth(relij) * prevSNeg(wj) else if relij is a ContextOpposite relation sPos(wi) += weigth(relij) * prevSNeg(wj) sNeg (wi) += weigth(relij) * prevSPos(wj) ailab.ijs.si
  • 11. Use Case: Twitter Movie Reviews domain specific document corpus of 27,886 IMDb movie reviews domain specific opinion vocabulary: 9,318 words: 4,925 have a negative orientation and 4393 have a positive orientation Inception (2010) Meet the Spartans (2008) Positive words: good, great, Positive words: funny, awesome, awesome, amazing, favorite, fantastic, great incredible, thrilling, different, speechless Negative words: bad, stupid, dumb, weird, silly, common, ridiculous, Negative words: bad, confusing, terrible weird, stupid, dumb, boring, predictable, horrible, disappointing ailab.ijs.si
  • 12. Use Case: Twitter Movie Reviews 220,387 tweets crawled over a two month interval, keyed on 84 movies Movie Genre Tweets IMDb Our score score Inception (2010) mystery, sci-fi, 19,256 8.9 66.52 thriller Megamind animation, comedy, 8,109 7.3 67.71 (2010) family Unstoppable drama, thriller 15,349 7 63.67 (2010) Burlesque drama, music, 1,244 6.2 70.78 (2010) romance Meet the comedy, war 44 2.5 40.67 Spartans (2008) ailab.ijs.si
  • 13. Twitter comments analysis • Sentiment words distribution for a movie • Sentiment orientation evolution per week, day, hour • Movie comparison ailab.ijs.si
  • 14. Conclusions and Future Work identifying opinion diversity expressed within text, with the aid of a domain-specific vocabulary processing a corpus of IMDb movie reviews, generated an opinion lexicon and analyzed a different opinion source corpus, i.e. a tweet collection further extend our algorithm to include opinion words expressed by verbs and adverbs, as well as more complex expressions conduct experiments in order to determine the correlation between positive opinion words for a given movie and the IMDb movie rating evaluate the opinion lexicon directly ailab.ijs.si
  • 15. Thank You for Your Attention! ailab.ijs.si