SlideShare a Scribd company logo
1 of 2
Download to read offline
eCare: Online Sentiment Monitoring
    Thijs Westerveld, Stefan de Bruijn, Arthur van Bunningen, Rolf Schellenberger, and Sylvain
                                              Perenes

                                                                      Teezir
                                                                 Kanaalweg 17 L-E
                                                                  3526 KL Utrecht
                                                                 +31 30 267 9648
                                                                 info@teezir.com

ABSTRACT                                                                     2. eCare Technology
This paper presents eCare, a web-based dashboard for online                  Two important aspects of eCare as an online reputation
reputation monitoring. eCare allows companies to monitor what is             monitoring tool are the flexibility of the tool and the quality of the
said online about their brands and products, to gauge the online             results. Since eCare’s data collection is not centred on pre-defined
sentiment, to identify the opinion leaders in the sector, and to find        search terms, users are free to query for their own topics. This
the sources where their brands and products are actively                     way, even new users have instant access to a wealth of historic
discussed.                                                                   information that has been collected over the years. Moreover,
                                                                             users can construct their own dashboards choosing from a variety
Categories and Subject Descriptors                                           of result presentations.
H.3.4 [Information Storage and Retrieval]: Systems and
Software                                                                     2.1 Crawling
                                                                             The content in eCare is collected by continuously monitoring a
General Terms                                                                fixed set of the most important blogs, forums, news sites and
Algorithms, Experimentation, Verification.                                   social media platforms in the Dutch language domain. Specific
                                                                             crawlers have been trained to recognize the patterns of the various
Keywords                                                                     source types (news, blogs, and forums). Based on a small set of
Online reputation management, Sentiment analysis, Web crawling               training examples, the crawlers have learned the characteristics of
                                                                             the links to follow and the content elements to store. This way
1. INTRODUCTION                                                              content can be extracted cleanly from both known and new
People are influenced by their peers. Recommendations from                   sources: individual posts and their metadata are collected while
friends, family and colleagues are an important factor in deciding           menu structures, advertisements and other distractors that may be
where to eat, what places to visit, which movies to see and what to          present on a webpage are ignored. Additional content is collected
buy. We get spontaneous, unsolicited advice from people around               from RSS feeds and social media APIs.
us all the time; often we trust the information and act on it. In fact,
                                                                             While the set of sources monitored for eCare is not the complete
word of mouth is one of the most important factors in consumers’
                                                                             Dutch internet, the carefully selected set provides a good picture
behavior. When consumers are asked which forms of
                                                                             of the topics that are discussed online and of the corresponding
advertisement they trust most, recommendations from other users
                                                                             sentiments. Still, in some cases it is useful to know whether more
and opinions posted online are amongst the top answers [1].
                                                                             discussion takes place outside these sources or whether important
Traditionally, word of mouth has been limited to face-to-face,               sources are missing. To identify these blind spots, eCare makes
spoken communication, but today, online forms of                             use of external search engine APIs. A change in the volume of
communication have become equally important. Blogs, forums                   relevant content for a specific site can lead to further study of the
and social media (Twitter, Hyves, Facebook, etc.) support word of            site by the eCare user, or eventually to adding this site to the set of
mouth product recommendation and become more and more                        monitored sources from which clean content is collected.
important to determine consumer’s behaviour. This demo shows
eCare, Teezir’s online sentiment monitoring dashboard [2]. eCare             2.2 Sentiment Analysis
is a flexible tool to monitor the online communication around                All collected content is automatically analysed to determine the
brands, products and topics. This paper describes the underlying             predominant sentiment in the document. The main sentiment
technology.                                                                  expressed in a document is formed by the words and sentences of
                                                                             the document. Based on lists of terms that are known to have a
                                                                             clear positive or negative connotation, the overall sentiment of
                                                                             phrases, sentences and documents is determined. We use part of
 Permission to make digital or hard copies of all or part of this work for   speech tagging to take the context of a term into account. For
 personal or classroom use is granted without fee provided that copies are
                                                                             example, the term sound can be opinionating or neutral depending
 not made or distributed for profit or commercial advantage and that
 copies bear this notice and the full citation on the first page. To copy    on its context. Compare for example the following phrases: “their
 otherwise, or republish, to post on servers or to redistribute to lists,    judgment is always sound”, “…with sound foundations”, “I have
 requires prior specific permission and/or a fee.                            a 16bit sound card…” The first two are clear positive statements,
 DIR’11, February 2, 2011, Amsterdam, The Netherlands.                       the last is neutral.
 Copyright 2010 ACM 1-58113-000-0/00/0010…$10.00.
Context is also analysed for modifiers that strengthen, weaken or     with the authors of blog or forum posts and administer these
reverse the sentiment of a phrase. This way we can deal with          actions in the eCare dashboards.
negations and subtleties like “…the mouse itself is not exactly
ergonomically shaped…”, “…simple, but somewhat awkward…”
and “…extremely solid and easy to use…”. The final sentiment
expressed in a document is a function of the weights of the
document’s opinionating expressions.
The base list of positive and negative sentiment terms originates
from the Instituut voor de Nederlandse Lexicografie. This list is
constantly adapted to the language encountered in our data
sources. In regular tuning sessions we manually adapt and extend
the lists to keep up with the evolving language use in social media
and other online sources.
2.3 Index
To efficiently combine relevance and sentiment scores, we
developed our own inverted-file based indexing structures. These
indexes contain the collected content, their metadata and the
computed sentiment scores.
Adding newly collected content and searching previously indexed
content happens on the same files. This way there is no need to
swap indexes to be able to access new data. Newly added data is
initially kept in memory and only committed to the on-disk
indexing structures later. A carefully designed recovery process is
in-place to be able to reconstruct the non-committed indexing data    Figure 1 - Timeline comparing the volume of results for Ziggo
when something goes wrong during the indexing process.                and KPN between January and November 2010
The eCare indexing structures are designed for efficient
computation of both relevance and sentiment scores. We need to
be able to efficiently rank documents on either of these scores. At
the same time computing aggregate volume, sentiment and
relevance scores for sets of documents should be efficient.
2.4 Dashboard
eCare allows its users to slice and dice the collected content, and
learn what people say, either at the very aggregated level: “What
is the share of positive versus negative views about our new
product?”, or at the very detailed level: “Which sources reflect
this negative sentiment, and what exactly are people saying?”.
Choosing from a range of available widgets, users can compose
their own dashboards and share them with team members.
A query can be formulated using terms, phrases and Boolean
operators. On top of that, filters can be applied to zoom in on
specific timeframes, sources or authors. The available views of
the data include the following:
     •    Document results: a relevance ranking of the matching
          documents
     •    Sentiment overview: an indication of the overall
          sentiment in the matching documents                         Figure 2- Term cloud showing the most distinguishing terms
     •    Timeline: showing the development of volume and             in the context of Ziggo, including the context in which they
          sentiment over time (Figure 1)                              appear (green for positive, red for negative)
     •    Related terms: showing the most distinguishing terms in
          the matching documents as well as whether they appear
          mostly in positive or negative documents (Figure 2)         3. REFERENCES
     •    A breakdown of volume and sentiment by source type          [1] The Nielsen Company. 2009. Nielsen Online Consumer
          (news/blogs/forums/etc.), by individual source or by            Survey.
          author                                                          http://blog.nielsen.com/nielsenwire/consumer/global-
eCare sends email alerts when the sentiment drops below a user-           advertising-consumers-trust-real-friends-and-virtual-
defined threshold. Users can export results to further analyse or         strangers-the-most
combine with external data. Finally, users can directly engage        [2] http://www.ecarewebcare.nl/

More Related Content

Viewers also liked

Icv cc2014 ausstellerbroschuere
Icv cc2014 ausstellerbroschuereIcv cc2014 ausstellerbroschuere
Icv cc2014 ausstellerbroschuereICV_eV
 
Der Spiegel 2012 23 mit grosser Hells Angel Reportage
Der Spiegel 2012 23 mit grosser Hells Angel ReportageDer Spiegel 2012 23 mit grosser Hells Angel Reportage
Der Spiegel 2012 23 mit grosser Hells Angel ReportageThe Skunks
 
Mobile Marketing im Einzelhandel (DACH) - Eine Studie von aquarius consulting...
Mobile Marketing im Einzelhandel (DACH) - Eine Studie von aquarius consulting...Mobile Marketing im Einzelhandel (DACH) - Eine Studie von aquarius consulting...
Mobile Marketing im Einzelhandel (DACH) - Eine Studie von aquarius consulting...Aquarius Digital GmbH
 
Similarities of Email and Social Marketing
Similarities of Email and Social MarketingSimilarities of Email and Social Marketing
Similarities of Email and Social Marketingadverteaze.com
 
Carrier Neutral Data Center Services and Best Practices
Carrier Neutral Data Center Services and Best PracticesCarrier Neutral Data Center Services and Best Practices
Carrier Neutral Data Center Services and Best PracticesInternet Society
 
Ocean view condotel noi ly tuong don anh binh minh
Ocean view condotel noi ly tuong don anh binh minh Ocean view condotel noi ly tuong don anh binh minh
Ocean view condotel noi ly tuong don anh binh minh Nguyễn Trang
 
Waiting Line Management
Waiting Line ManagementWaiting Line Management
Waiting Line ManagementRaman Kumar
 

Viewers also liked (14)

Icv cc2014 ausstellerbroschuere
Icv cc2014 ausstellerbroschuereIcv cc2014 ausstellerbroschuere
Icv cc2014 ausstellerbroschuere
 
Ashutosh rubber-pvt-ltd
Ashutosh rubber-pvt-ltdAshutosh rubber-pvt-ltd
Ashutosh rubber-pvt-ltd
 
Vaio catalogue-2011
Vaio catalogue-2011Vaio catalogue-2011
Vaio catalogue-2011
 
Growth roadmap Q3 2014
Growth roadmap   Q3 2014Growth roadmap   Q3 2014
Growth roadmap Q3 2014
 
Der Spiegel 2012 23 mit grosser Hells Angel Reportage
Der Spiegel 2012 23 mit grosser Hells Angel ReportageDer Spiegel 2012 23 mit grosser Hells Angel Reportage
Der Spiegel 2012 23 mit grosser Hells Angel Reportage
 
Mobile Marketing im Einzelhandel (DACH) - Eine Studie von aquarius consulting...
Mobile Marketing im Einzelhandel (DACH) - Eine Studie von aquarius consulting...Mobile Marketing im Einzelhandel (DACH) - Eine Studie von aquarius consulting...
Mobile Marketing im Einzelhandel (DACH) - Eine Studie von aquarius consulting...
 
Erp und bi
Erp und biErp und bi
Erp und bi
 
PAIDEIA
PAIDEIAPAIDEIA
PAIDEIA
 
Presentation CRM In The Contact Center
Presentation   CRM In The Contact CenterPresentation   CRM In The Contact Center
Presentation CRM In The Contact Center
 
Similarities of Email and Social Marketing
Similarities of Email and Social MarketingSimilarities of Email and Social Marketing
Similarities of Email and Social Marketing
 
Carrier Neutral Data Center Services and Best Practices
Carrier Neutral Data Center Services and Best PracticesCarrier Neutral Data Center Services and Best Practices
Carrier Neutral Data Center Services and Best Practices
 
Ocean view condotel noi ly tuong don anh binh minh
Ocean view condotel noi ly tuong don anh binh minh Ocean view condotel noi ly tuong don anh binh minh
Ocean view condotel noi ly tuong don anh binh minh
 
Waiting Line Management
Waiting Line ManagementWaiting Line Management
Waiting Line Management
 
CV writing
CV writingCV writing
CV writing
 

Similar to DIR 2011 teezir paper

NOW! Get the internet to work for you!
NOW! Get the internet to work for you!NOW! Get the internet to work for you!
NOW! Get the internet to work for you!Philip Hannah
 
SENTIMENT ANALYSIS-AN OBJECTIVE VIEW
SENTIMENT ANALYSIS-AN OBJECTIVE VIEWSENTIMENT ANALYSIS-AN OBJECTIVE VIEW
SENTIMENT ANALYSIS-AN OBJECTIVE VIEWJournal For Research
 
TOWARDS AUTOMATIC DETECTION OF SENTIMENTS IN CUSTOMER REVIEWS
TOWARDS AUTOMATIC DETECTION OF SENTIMENTS IN CUSTOMER REVIEWSTOWARDS AUTOMATIC DETECTION OF SENTIMENTS IN CUSTOMER REVIEWS
TOWARDS AUTOMATIC DETECTION OF SENTIMENTS IN CUSTOMER REVIEWSijistjournal
 
TOWARDS AUTOMATIC DETECTION OF SENTIMENTS IN CUSTOMER REVIEWS
TOWARDS AUTOMATIC DETECTION OF SENTIMENTS IN CUSTOMER REVIEWSTOWARDS AUTOMATIC DETECTION OF SENTIMENTS IN CUSTOMER REVIEWS
TOWARDS AUTOMATIC DETECTION OF SENTIMENTS IN CUSTOMER REVIEWSijistjournal
 
OpinionMiner: A Novel Machine Learning System for Web Opinion Mining and Extr...
OpinionMiner: A Novel Machine Learning System for Web Opinion Mining and Extr...OpinionMiner: A Novel Machine Learning System for Web Opinion Mining and Extr...
OpinionMiner: A Novel Machine Learning System for Web Opinion Mining and Extr...Content Savvy
 
Live Twitter Sentiment Analysis and Interactive Visualizations with PyLDAvis ...
Live Twitter Sentiment Analysis and Interactive Visualizations with PyLDAvis ...Live Twitter Sentiment Analysis and Interactive Visualizations with PyLDAvis ...
Live Twitter Sentiment Analysis and Interactive Visualizations with PyLDAvis ...IRJET Journal
 
Co-Extracting Opinions from Online Reviews
Co-Extracting Opinions from Online ReviewsCo-Extracting Opinions from Online Reviews
Co-Extracting Opinions from Online ReviewsEditor IJCATR
 
Dictionary Based Approach to Sentiment Analysis - A Review
Dictionary Based Approach to Sentiment Analysis - A ReviewDictionary Based Approach to Sentiment Analysis - A Review
Dictionary Based Approach to Sentiment Analysis - A ReviewINFOGAIN PUBLICATION
 
Combining Lexicon based and Machine Learning based Methods for Twitter Sentim...
Combining Lexicon based and Machine Learning based Methods for Twitter Sentim...Combining Lexicon based and Machine Learning based Methods for Twitter Sentim...
Combining Lexicon based and Machine Learning based Methods for Twitter Sentim...IRJET Journal
 
Sentiment Analysis in Social Media and Its Operations
Sentiment Analysis in Social Media and Its OperationsSentiment Analysis in Social Media and Its Operations
Sentiment Analysis in Social Media and Its OperationsIRJET Journal
 
Elqano - Where Knowledge Finds People
Elqano - Where Knowledge Finds PeopleElqano - Where Knowledge Finds People
Elqano - Where Knowledge Finds PeopleGuillermo Garcia
 
OpenAthens Conference 2018 - Tim Lull and Chad Smith - Cultivating your onlin...
OpenAthens Conference 2018 - Tim Lull and Chad Smith - Cultivating your onlin...OpenAthens Conference 2018 - Tim Lull and Chad Smith - Cultivating your onlin...
OpenAthens Conference 2018 - Tim Lull and Chad Smith - Cultivating your onlin...OpenAthens
 
Social media analytics tool new v
Social media analytics tool new vSocial media analytics tool new v
Social media analytics tool new vMahesh Panchal
 
International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)IJERD Editor
 
International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)IJERD Editor
 
Managing Electronic Resources for Public Libraries, Part 1
Managing Electronic Resources for Public Libraries, Part 1Managing Electronic Resources for Public Libraries, Part 1
Managing Electronic Resources for Public Libraries, Part 1ALATechSource
 
THE ANALYSIS FOR CUSTOMER REVIEWS THROUGH TWEETS, BASED ON DEEP LEARNING
THE ANALYSIS FOR CUSTOMER REVIEWS THROUGH TWEETS, BASED ON DEEP LEARNINGTHE ANALYSIS FOR CUSTOMER REVIEWS THROUGH TWEETS, BASED ON DEEP LEARNING
THE ANALYSIS FOR CUSTOMER REVIEWS THROUGH TWEETS, BASED ON DEEP LEARNINGIRJET Journal
 

Similar to DIR 2011 teezir paper (20)

NOW! Get the internet to work for you!
NOW! Get the internet to work for you!NOW! Get the internet to work for you!
NOW! Get the internet to work for you!
 
SENTIMENT ANALYSIS-AN OBJECTIVE VIEW
SENTIMENT ANALYSIS-AN OBJECTIVE VIEWSENTIMENT ANALYSIS-AN OBJECTIVE VIEW
SENTIMENT ANALYSIS-AN OBJECTIVE VIEW
 
DB_Assgn 3
DB_Assgn 3DB_Assgn 3
DB_Assgn 3
 
TOWARDS AUTOMATIC DETECTION OF SENTIMENTS IN CUSTOMER REVIEWS
TOWARDS AUTOMATIC DETECTION OF SENTIMENTS IN CUSTOMER REVIEWSTOWARDS AUTOMATIC DETECTION OF SENTIMENTS IN CUSTOMER REVIEWS
TOWARDS AUTOMATIC DETECTION OF SENTIMENTS IN CUSTOMER REVIEWS
 
TOWARDS AUTOMATIC DETECTION OF SENTIMENTS IN CUSTOMER REVIEWS
TOWARDS AUTOMATIC DETECTION OF SENTIMENTS IN CUSTOMER REVIEWSTOWARDS AUTOMATIC DETECTION OF SENTIMENTS IN CUSTOMER REVIEWS
TOWARDS AUTOMATIC DETECTION OF SENTIMENTS IN CUSTOMER REVIEWS
 
Sentiment analysis on unstructured review
Sentiment analysis on unstructured reviewSentiment analysis on unstructured review
Sentiment analysis on unstructured review
 
OpinionMiner: A Novel Machine Learning System for Web Opinion Mining and Extr...
OpinionMiner: A Novel Machine Learning System for Web Opinion Mining and Extr...OpinionMiner: A Novel Machine Learning System for Web Opinion Mining and Extr...
OpinionMiner: A Novel Machine Learning System for Web Opinion Mining and Extr...
 
Live Twitter Sentiment Analysis and Interactive Visualizations with PyLDAvis ...
Live Twitter Sentiment Analysis and Interactive Visualizations with PyLDAvis ...Live Twitter Sentiment Analysis and Interactive Visualizations with PyLDAvis ...
Live Twitter Sentiment Analysis and Interactive Visualizations with PyLDAvis ...
 
Co-Extracting Opinions from Online Reviews
Co-Extracting Opinions from Online ReviewsCo-Extracting Opinions from Online Reviews
Co-Extracting Opinions from Online Reviews
 
Dictionary Based Approach to Sentiment Analysis - A Review
Dictionary Based Approach to Sentiment Analysis - A ReviewDictionary Based Approach to Sentiment Analysis - A Review
Dictionary Based Approach to Sentiment Analysis - A Review
 
Combining Lexicon based and Machine Learning based Methods for Twitter Sentim...
Combining Lexicon based and Machine Learning based Methods for Twitter Sentim...Combining Lexicon based and Machine Learning based Methods for Twitter Sentim...
Combining Lexicon based and Machine Learning based Methods for Twitter Sentim...
 
Sentiment Analysis in Social Media and Its Operations
Sentiment Analysis in Social Media and Its OperationsSentiment Analysis in Social Media and Its Operations
Sentiment Analysis in Social Media and Its Operations
 
Ieee format 5th nccci_a study on factors influencing as a best practice for...
Ieee format 5th nccci_a study on factors influencing as  a  best practice for...Ieee format 5th nccci_a study on factors influencing as  a  best practice for...
Ieee format 5th nccci_a study on factors influencing as a best practice for...
 
Elqano - Where Knowledge Finds People
Elqano - Where Knowledge Finds PeopleElqano - Where Knowledge Finds People
Elqano - Where Knowledge Finds People
 
OpenAthens Conference 2018 - Tim Lull and Chad Smith - Cultivating your onlin...
OpenAthens Conference 2018 - Tim Lull and Chad Smith - Cultivating your onlin...OpenAthens Conference 2018 - Tim Lull and Chad Smith - Cultivating your onlin...
OpenAthens Conference 2018 - Tim Lull and Chad Smith - Cultivating your onlin...
 
Social media analytics tool new v
Social media analytics tool new vSocial media analytics tool new v
Social media analytics tool new v
 
International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)
 
International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)
 
Managing Electronic Resources for Public Libraries, Part 1
Managing Electronic Resources for Public Libraries, Part 1Managing Electronic Resources for Public Libraries, Part 1
Managing Electronic Resources for Public Libraries, Part 1
 
THE ANALYSIS FOR CUSTOMER REVIEWS THROUGH TWEETS, BASED ON DEEP LEARNING
THE ANALYSIS FOR CUSTOMER REVIEWS THROUGH TWEETS, BASED ON DEEP LEARNINGTHE ANALYSIS FOR CUSTOMER REVIEWS THROUGH TWEETS, BASED ON DEEP LEARNING
THE ANALYSIS FOR CUSTOMER REVIEWS THROUGH TWEETS, BASED ON DEEP LEARNING
 

Recently uploaded

Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 

Recently uploaded (20)

Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 

DIR 2011 teezir paper

  • 1. eCare: Online Sentiment Monitoring Thijs Westerveld, Stefan de Bruijn, Arthur van Bunningen, Rolf Schellenberger, and Sylvain Perenes Teezir Kanaalweg 17 L-E 3526 KL Utrecht +31 30 267 9648 info@teezir.com ABSTRACT 2. eCare Technology This paper presents eCare, a web-based dashboard for online Two important aspects of eCare as an online reputation reputation monitoring. eCare allows companies to monitor what is monitoring tool are the flexibility of the tool and the quality of the said online about their brands and products, to gauge the online results. Since eCare’s data collection is not centred on pre-defined sentiment, to identify the opinion leaders in the sector, and to find search terms, users are free to query for their own topics. This the sources where their brands and products are actively way, even new users have instant access to a wealth of historic discussed. information that has been collected over the years. Moreover, users can construct their own dashboards choosing from a variety Categories and Subject Descriptors of result presentations. H.3.4 [Information Storage and Retrieval]: Systems and Software 2.1 Crawling The content in eCare is collected by continuously monitoring a General Terms fixed set of the most important blogs, forums, news sites and Algorithms, Experimentation, Verification. social media platforms in the Dutch language domain. Specific crawlers have been trained to recognize the patterns of the various Keywords source types (news, blogs, and forums). Based on a small set of Online reputation management, Sentiment analysis, Web crawling training examples, the crawlers have learned the characteristics of the links to follow and the content elements to store. This way 1. INTRODUCTION content can be extracted cleanly from both known and new People are influenced by their peers. Recommendations from sources: individual posts and their metadata are collected while friends, family and colleagues are an important factor in deciding menu structures, advertisements and other distractors that may be where to eat, what places to visit, which movies to see and what to present on a webpage are ignored. Additional content is collected buy. We get spontaneous, unsolicited advice from people around from RSS feeds and social media APIs. us all the time; often we trust the information and act on it. In fact, While the set of sources monitored for eCare is not the complete word of mouth is one of the most important factors in consumers’ Dutch internet, the carefully selected set provides a good picture behavior. When consumers are asked which forms of of the topics that are discussed online and of the corresponding advertisement they trust most, recommendations from other users sentiments. Still, in some cases it is useful to know whether more and opinions posted online are amongst the top answers [1]. discussion takes place outside these sources or whether important Traditionally, word of mouth has been limited to face-to-face, sources are missing. To identify these blind spots, eCare makes spoken communication, but today, online forms of use of external search engine APIs. A change in the volume of communication have become equally important. Blogs, forums relevant content for a specific site can lead to further study of the and social media (Twitter, Hyves, Facebook, etc.) support word of site by the eCare user, or eventually to adding this site to the set of mouth product recommendation and become more and more monitored sources from which clean content is collected. important to determine consumer’s behaviour. This demo shows eCare, Teezir’s online sentiment monitoring dashboard [2]. eCare 2.2 Sentiment Analysis is a flexible tool to monitor the online communication around All collected content is automatically analysed to determine the brands, products and topics. This paper describes the underlying predominant sentiment in the document. The main sentiment technology. expressed in a document is formed by the words and sentences of the document. Based on lists of terms that are known to have a clear positive or negative connotation, the overall sentiment of phrases, sentences and documents is determined. We use part of Permission to make digital or hard copies of all or part of this work for speech tagging to take the context of a term into account. For personal or classroom use is granted without fee provided that copies are example, the term sound can be opinionating or neutral depending not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. To copy on its context. Compare for example the following phrases: “their otherwise, or republish, to post on servers or to redistribute to lists, judgment is always sound”, “…with sound foundations”, “I have requires prior specific permission and/or a fee. a 16bit sound card…” The first two are clear positive statements, DIR’11, February 2, 2011, Amsterdam, The Netherlands. the last is neutral. Copyright 2010 ACM 1-58113-000-0/00/0010…$10.00.
  • 2. Context is also analysed for modifiers that strengthen, weaken or with the authors of blog or forum posts and administer these reverse the sentiment of a phrase. This way we can deal with actions in the eCare dashboards. negations and subtleties like “…the mouse itself is not exactly ergonomically shaped…”, “…simple, but somewhat awkward…” and “…extremely solid and easy to use…”. The final sentiment expressed in a document is a function of the weights of the document’s opinionating expressions. The base list of positive and negative sentiment terms originates from the Instituut voor de Nederlandse Lexicografie. This list is constantly adapted to the language encountered in our data sources. In regular tuning sessions we manually adapt and extend the lists to keep up with the evolving language use in social media and other online sources. 2.3 Index To efficiently combine relevance and sentiment scores, we developed our own inverted-file based indexing structures. These indexes contain the collected content, their metadata and the computed sentiment scores. Adding newly collected content and searching previously indexed content happens on the same files. This way there is no need to swap indexes to be able to access new data. Newly added data is initially kept in memory and only committed to the on-disk indexing structures later. A carefully designed recovery process is in-place to be able to reconstruct the non-committed indexing data Figure 1 - Timeline comparing the volume of results for Ziggo when something goes wrong during the indexing process. and KPN between January and November 2010 The eCare indexing structures are designed for efficient computation of both relevance and sentiment scores. We need to be able to efficiently rank documents on either of these scores. At the same time computing aggregate volume, sentiment and relevance scores for sets of documents should be efficient. 2.4 Dashboard eCare allows its users to slice and dice the collected content, and learn what people say, either at the very aggregated level: “What is the share of positive versus negative views about our new product?”, or at the very detailed level: “Which sources reflect this negative sentiment, and what exactly are people saying?”. Choosing from a range of available widgets, users can compose their own dashboards and share them with team members. A query can be formulated using terms, phrases and Boolean operators. On top of that, filters can be applied to zoom in on specific timeframes, sources or authors. The available views of the data include the following: • Document results: a relevance ranking of the matching documents • Sentiment overview: an indication of the overall sentiment in the matching documents Figure 2- Term cloud showing the most distinguishing terms • Timeline: showing the development of volume and in the context of Ziggo, including the context in which they sentiment over time (Figure 1) appear (green for positive, red for negative) • Related terms: showing the most distinguishing terms in the matching documents as well as whether they appear mostly in positive or negative documents (Figure 2) 3. REFERENCES • A breakdown of volume and sentiment by source type [1] The Nielsen Company. 2009. Nielsen Online Consumer (news/blogs/forums/etc.), by individual source or by Survey. author http://blog.nielsen.com/nielsenwire/consumer/global- eCare sends email alerts when the sentiment drops below a user- advertising-consumers-trust-real-friends-and-virtual- defined threshold. Users can export results to further analyse or strangers-the-most combine with external data. Finally, users can directly engage [2] http://www.ecarewebcare.nl/