SlideShare une entreprise Scribd logo
1  sur  32
7 NLP Must Haves
for Customer Feedback Analysis
Alyona Medelyan
alyona@getthematic.com
quora.com/What-are-the-best-customer-feedback-analysis-tools
Current Customer Feedback Analysis suck because they focus on scores, not reasons!
consumers: scores > comments
businesses: comments > scores
How do customer insight professionals
use people’s comments?
Price increase
New product feature
Marketing campaign
What happened?
Comments = Reasons behind scores & richer insights
Comments = Answers to who should follow up
Comments = Answers to strategic questions
So, which functionality is crucial
when you need to
understand customer comments?
Capture many ways
people talk about
the same thing
1
How many ways are there to complain about a wet delivered news paper?
paper
papers
newspaper
news paper
newspapers
news papers
wet
dripping
soaking
soaked
damp
drenched
+
Failure to capture dozens of ways issues can be expressed
leads to misrepresentations and poor decisions
vs
Synonyms can be dataset-specific
Autocomplete can mess up the meaning of a word!
People typed “airpoint” but were auto-completed to “airport”!
One size will not fit all!
The ideal solution should learn
data-set specific synonyms!
Capture positive & negative
attributes separately2
teaching
not helpful teachers bad learning style
good learning stylehelpful teachers
The lecturers aren’t particularly helpful and the learning style is far from perfect.
I have always found the lecturers to be very helpful and the learning style is perfect.
Same nouns & adjectives, but different feedback!
Purposes of Negation
• Reversing polarity
I did not like the learning style → dislike it
• Emphasising negativeness or positiveness
There is nothing I did not like about the learning style → love it
• Make weaker claims
The learning style is not bad → it’s ok
The ideal solution should
handle negation!
Capture
emerging themes3
✘ ✓
Supervised categorisation fails as customer comments change over time
54%
Other
8%
Other
The ideal solution should allow for themes to
emerge from data,
instead of be pre-defined!
Link to original
for verification & action4
1. Pull out all comments on a specific theme 2. Verify 3. Action
Ensure transparency
and ability to edit5
rugby world cup soccer world cupfootball world cup
Two themes?
Or one theme?
Often there is no right or wrong. Themes must be customisable.
Work well
on small dataset6
How can an NLP solution work on a small dataset?
• Industry-specific dictionaries & rules
But: How to avoid ambiguity errors?
• Pre-defined static categories
But: How to capture emerging themes?
• Creative data gathering
• Re-purpose survey data from related companies
• Re-purpose company-own resources
Example of a related dataset used to model specifics of word meanings
Provide
actionable insight7
Immediately
actionable theme
Repeated
but has no meaning
Trivial,
Already knew
Insightful,
new knowledge
Aspect or general
category of business
Ideal output
from NLP analysis
Most NLP Solutions
1h Prototype
with open-source tool
Suspected,
Data verified
Price increase
New product feature
Marketing campaign
What happened?
✓
Themes changing over time explain the reasons behind drops!
1
2
3
4
5
6
7
Capture ways people talk about the same thing
Capture positive & negative attributes separately
Capture emerging themes
Link to original for verification & action
Ensure transparency and ability to edit
Work well on small datasets
Provide actionable insights
Alyona Medelyan
alyona@getthematic.com
Need to make sense
of customer comments?
Get in touch!

Contenu connexe

En vedette

Build your first messenger bot
Build your first messenger botBuild your first messenger bot
Build your first messenger botNowa Labs Pte Ltd
 
An Introduction To Chat Bots
An Introduction To Chat BotsAn Introduction To Chat Bots
An Introduction To Chat BotsSohan Maheshwar
 
Chatbot Artificial Intelligence
Chatbot Artificial IntelligenceChatbot Artificial Intelligence
Chatbot Artificial IntelligenceMd. Mahedi Mahfuj
 
Introduction to Chatbots
Introduction to ChatbotsIntroduction to Chatbots
Introduction to ChatbotsDaden Limited
 
The Chatbots Are Coming: A Guide to Chatbots, AI and Conversational Interfaces
The Chatbots Are Coming: A Guide to Chatbots, AI and Conversational InterfacesThe Chatbots Are Coming: A Guide to Chatbots, AI and Conversational Interfaces
The Chatbots Are Coming: A Guide to Chatbots, AI and Conversational InterfacesTWG
 
AI Agent and Chatbot Trends For Enterprises
AI Agent and Chatbot Trends For EnterprisesAI Agent and Chatbot Trends For Enterprises
AI Agent and Chatbot Trends For EnterprisesTeewee Ang
 
Voice Interfaces Usergroup Berlin - 05-10-2016 : Kay Lerch on Morse-Coder skill
Voice Interfaces Usergroup Berlin - 05-10-2016 : Kay Lerch on Morse-Coder skillVoice Interfaces Usergroup Berlin - 05-10-2016 : Kay Lerch on Morse-Coder skill
Voice Interfaces Usergroup Berlin - 05-10-2016 : Kay Lerch on Morse-Coder skillKay Lerch
 
Speech Recognition, Text to Speech, and Voice Interfaces
Speech Recognition, Text to Speech, and Voice InterfacesSpeech Recognition, Text to Speech, and Voice Interfaces
Speech Recognition, Text to Speech, and Voice InterfacesChristiana Vasquez
 
How to Succeed With Rewarded Video Ads
How to Succeed With Rewarded Video AdsHow to Succeed With Rewarded Video Ads
How to Succeed With Rewarded Video AdsSohan Maheshwar
 
Mobile Gaming Monetization Trends in 2016
Mobile Gaming Monetization Trends in 2016Mobile Gaming Monetization Trends in 2016
Mobile Gaming Monetization Trends in 2016Sohan Maheshwar
 
Hacking the Mind: NLP and Influence by Mystic
Hacking the Mind: NLP and Influence by MysticHacking the Mind: NLP and Influence by Mystic
Hacking the Mind: NLP and Influence by MysticJacky Lim
 
Open nlp presentationss
Open nlp presentationssOpen nlp presentationss
Open nlp presentationssChandan Deb
 
Natural Language Processing Tools for the Digital Humanities
Natural Language Processing Tools for the Digital HumanitiesNatural Language Processing Tools for the Digital Humanities
Natural Language Processing Tools for the Digital HumanitiesXiang Li
 
SearchLove San Diego 2017 | Hana Abaza | Aiming for Impact: A Conversion-Cent...
SearchLove San Diego 2017 | Hana Abaza | Aiming for Impact: A Conversion-Cent...SearchLove San Diego 2017 | Hana Abaza | Aiming for Impact: A Conversion-Cent...
SearchLove San Diego 2017 | Hana Abaza | Aiming for Impact: A Conversion-Cent...Distilled
 
Designing a Conversational Intelligent Bot which can cook
Designing a Conversational Intelligent Bot which can cookDesigning a Conversational Intelligent Bot which can cook
Designing a Conversational Intelligent Bot which can cookKaushik Das
 
Speech recognition techniques
Speech recognition techniquesSpeech recognition techniques
Speech recognition techniquessonukumar142
 

En vedette (18)

Build your first messenger bot
Build your first messenger botBuild your first messenger bot
Build your first messenger bot
 
An Introduction To Chat Bots
An Introduction To Chat BotsAn Introduction To Chat Bots
An Introduction To Chat Bots
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
 
Chatbot Artificial Intelligence
Chatbot Artificial IntelligenceChatbot Artificial Intelligence
Chatbot Artificial Intelligence
 
Introduction to Chatbots
Introduction to ChatbotsIntroduction to Chatbots
Introduction to Chatbots
 
The Chatbots Are Coming: A Guide to Chatbots, AI and Conversational Interfaces
The Chatbots Are Coming: A Guide to Chatbots, AI and Conversational InterfacesThe Chatbots Are Coming: A Guide to Chatbots, AI and Conversational Interfaces
The Chatbots Are Coming: A Guide to Chatbots, AI and Conversational Interfaces
 
AI Agent and Chatbot Trends For Enterprises
AI Agent and Chatbot Trends For EnterprisesAI Agent and Chatbot Trends For Enterprises
AI Agent and Chatbot Trends For Enterprises
 
Voice Interfaces Usergroup Berlin - 05-10-2016 : Kay Lerch on Morse-Coder skill
Voice Interfaces Usergroup Berlin - 05-10-2016 : Kay Lerch on Morse-Coder skillVoice Interfaces Usergroup Berlin - 05-10-2016 : Kay Lerch on Morse-Coder skill
Voice Interfaces Usergroup Berlin - 05-10-2016 : Kay Lerch on Morse-Coder skill
 
Speech Recognition, Text to Speech, and Voice Interfaces
Speech Recognition, Text to Speech, and Voice InterfacesSpeech Recognition, Text to Speech, and Voice Interfaces
Speech Recognition, Text to Speech, and Voice Interfaces
 
How to Succeed With Rewarded Video Ads
How to Succeed With Rewarded Video AdsHow to Succeed With Rewarded Video Ads
How to Succeed With Rewarded Video Ads
 
Mobile Gaming Monetization Trends in 2016
Mobile Gaming Monetization Trends in 2016Mobile Gaming Monetization Trends in 2016
Mobile Gaming Monetization Trends in 2016
 
Hacking the Mind: NLP and Influence by Mystic
Hacking the Mind: NLP and Influence by MysticHacking the Mind: NLP and Influence by Mystic
Hacking the Mind: NLP and Influence by Mystic
 
Open nlp presentationss
Open nlp presentationssOpen nlp presentationss
Open nlp presentationss
 
Google voice
Google voice Google voice
Google voice
 
Natural Language Processing Tools for the Digital Humanities
Natural Language Processing Tools for the Digital HumanitiesNatural Language Processing Tools for the Digital Humanities
Natural Language Processing Tools for the Digital Humanities
 
SearchLove San Diego 2017 | Hana Abaza | Aiming for Impact: A Conversion-Cent...
SearchLove San Diego 2017 | Hana Abaza | Aiming for Impact: A Conversion-Cent...SearchLove San Diego 2017 | Hana Abaza | Aiming for Impact: A Conversion-Cent...
SearchLove San Diego 2017 | Hana Abaza | Aiming for Impact: A Conversion-Cent...
 
Designing a Conversational Intelligent Bot which can cook
Designing a Conversational Intelligent Bot which can cookDesigning a Conversational Intelligent Bot which can cook
Designing a Conversational Intelligent Bot which can cook
 
Speech recognition techniques
Speech recognition techniquesSpeech recognition techniques
Speech recognition techniques
 

Dernier

Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 217djon017
 
Real-Time AI Streaming - AI Max Princeton
Real-Time AI  Streaming - AI Max PrincetonReal-Time AI  Streaming - AI Max Princeton
Real-Time AI Streaming - AI Max PrincetonTimothy Spann
 
Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Seán Kennedy
 
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdfEnglish-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdfblazblazml
 
Networking Case Study prepared by teacher.pptx
Networking Case Study prepared by teacher.pptxNetworking Case Study prepared by teacher.pptx
Networking Case Study prepared by teacher.pptxHimangsuNath
 
Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our WorldEduminds Learning
 
Cyber awareness ppt on the recorded data
Cyber awareness ppt on the recorded dataCyber awareness ppt on the recorded data
Cyber awareness ppt on the recorded dataTecnoIncentive
 
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptx
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptxThe Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptx
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptxTasha Penwell
 
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBoston Institute of Analytics
 
Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Seán Kennedy
 
convolutional neural network and its applications.pdf
convolutional neural network and its applications.pdfconvolutional neural network and its applications.pdf
convolutional neural network and its applications.pdfSubhamKumar3239
 
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Thomas Poetter
 
Decoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis ProjectDecoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis ProjectBoston Institute of Analytics
 
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxmodul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxaleedritatuxx
 
wepik-insightful-infographics-a-data-visualization-overview-20240401133220kwr...
wepik-insightful-infographics-a-data-visualization-overview-20240401133220kwr...wepik-insightful-infographics-a-data-visualization-overview-20240401133220kwr...
wepik-insightful-infographics-a-data-visualization-overview-20240401133220kwr...KarteekMane1
 
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...Dr Arash Najmaei ( Phd., MBA, BSc)
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPTBoston Institute of Analytics
 
Principles and Practices of Data Visualization
Principles and Practices of Data VisualizationPrinciples and Practices of Data Visualization
Principles and Practices of Data VisualizationKianJazayeri1
 
What To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptxWhat To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptxSimranPal17
 
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...Boston Institute of Analytics
 

Dernier (20)

Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2
 
Real-Time AI Streaming - AI Max Princeton
Real-Time AI  Streaming - AI Max PrincetonReal-Time AI  Streaming - AI Max Princeton
Real-Time AI Streaming - AI Max Princeton
 
Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...
 
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdfEnglish-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
 
Networking Case Study prepared by teacher.pptx
Networking Case Study prepared by teacher.pptxNetworking Case Study prepared by teacher.pptx
Networking Case Study prepared by teacher.pptx
 
Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our World
 
Cyber awareness ppt on the recorded data
Cyber awareness ppt on the recorded dataCyber awareness ppt on the recorded data
Cyber awareness ppt on the recorded data
 
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptx
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptxThe Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptx
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptx
 
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
 
Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...
 
convolutional neural network and its applications.pdf
convolutional neural network and its applications.pdfconvolutional neural network and its applications.pdf
convolutional neural network and its applications.pdf
 
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
 
Decoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis ProjectDecoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis Project
 
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxmodul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
 
wepik-insightful-infographics-a-data-visualization-overview-20240401133220kwr...
wepik-insightful-infographics-a-data-visualization-overview-20240401133220kwr...wepik-insightful-infographics-a-data-visualization-overview-20240401133220kwr...
wepik-insightful-infographics-a-data-visualization-overview-20240401133220kwr...
 
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
 
Principles and Practices of Data Visualization
Principles and Practices of Data VisualizationPrinciples and Practices of Data Visualization
Principles and Practices of Data Visualization
 
What To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptxWhat To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptx
 
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
 

7 NLP Must Haves for Customer Feedback Analysis

Notes de l'éditeur

  1. Today I want to talk about customer feedback analysis. We all here agree that sentiment analysis plays an important role in understanding customer feedback. But I found there is a disconnect to what’s actually happening in the industry.
  2. If you google ‘Customer Feedback Analysis software’, what you find is an overview of tools that collect people’s scores and then presenting them as pretty dashboards. Or here are the answers from Quora on ‘What’s the best customer analysis tool’. Most focus on scores and not people’s comments.
  3. And sure, if you a consumer, a quick summary of competitors by score may be all what you need. For example, to find the best restaurant. But as the owner of a poor restaurant with a 3 score rating, how would you know what do? Would you rather have 100 scores or 10 customer comments on why they gave you that score?
  4. We found that comments are quite important to customer insight professionals and this is how they use them.
  5. Comments that change over time, with scores, are particularly valuable. They can explain why the scores rise and drop, and if scores stay the same, provide a richer insight.
  6. By looking deeper into the comments, you can find out who should be following up with the customer. Imagine for example capturing all people who want to cancel a service.
  7. And also, if you have done any changes to your offering, for example use a new recipe, did that actually get noticed and affected the score. To summarise, applying NLP on people’s comments helps get a deeper insight and get to the action of improving customer experience faster.
  8. My background is in NLP but over the past 2 years we’ve spent a lot of time talking to customer insight team. I noticed that many current NLP solutions do not actually provide functionality that matters to them. Therefore, today I would like to share with you the needs that we discovered while building our NLP solution at Thematic. We may not have cracked all of them yet, but we do believe that they are Must Haves. If you own an NLP solution to CX or plan to build one, feel free to use the Must Haves as a guide. If you are looking to buy a solution, or implement one using open-source, send me an email and I will share with you a report that we found valuable while evaluating different options.
  9. The first Must-Have is about capturing many ways people may be referring to the same thing.
  10. Imagine you have paid for a newspaper delivered to your door. It rained. As you are unsticking the wet pages you are frustrated that you cannot read it. How many ways do you think there are to complain about a wet news paper?
  11. There are dozens possibilities! And if an NLP solution cannot capture them accurately, the importance of this issue may be misrepresented. Many solution out there use industry dictionaries or worse WordNet. But customer comments are messy and synonyms will be specific to your business. For example, ‘paper’ and ‘newspaper’ is rarely a synonym pair outside of publishing. And we found that ‘build’ and ‘buy’ could be either synonyms or antonyms depending on the context: real estate or software.
  12. At Thematic we learn synonyms from the data itself. And once, we came across an unusual, and at the first glance incorrect pair. Airport is the frequent flyer currency of AirNZ, airport is usually a very different thing. After examining the results closely we found that the system was right. Autocomplete did not know about ‘airpoint’ and autocorrected it to ‘airport’, which meant that this was a dataset specific synonym pair.
  13. This is why one size will not fit all.
  14. While you need to capture many different ways people are talking about the same thing, when it comes to attributes, e.g. good coffee/bad coffee, often Customer Insight professionals prefer if you capture them separately. This may be relatively easy, if the attributes are clear antonyms, e.g. ‘fast service’ vs. ‘slow service’. But negation makes everything much harder.
  15. Here is an actual example from manual categories chosen by a human tagger. An NLP system for customer feedback analysis should ideally be able to capture that the two sentences while using the same nouns and adjectives actually should be categorised differently.
  16. Most NLP solutions do not deal with negations. Those who do, simply reverse polarity: did not like = dislike. But there are other purposes, like the emphasis: nothing I did not like means loved it. Or making a weaker claim. So ‘not bad’ does not necessarily means ‘good’, most likely it means a rather neutral statement. When dealing with negation, parsing will help determine its focus and scope. But the next step is to actually merge negated statements with non-negated ones correctly. For this, you’ll need some sort of antonym detection. Only then, a solution can help accurately determine how many people liked or disliked a certain aspect of the business.
  17. This is why one size will not fit all.
  18. A common approach to summarising feedback, even when done manually, is to use a static set of categories or themes. The first problem with this, is that it reflects the bias of the person who created them. The second problem is that it is, well, static. It’s the nature of doing a business that there are always changes. There may be changes in pricing structure or in competition. If you want to capture people’s reaction to these changes, you need a solution where themes can emerge over time.
  19. If you do not do this, and let’s say use supervised categorisation, over time, what can happen is that you end up with a very large ‘Other’ category because comments would not fit into any of the pre-defined ones. You will always have people commenting on things that are different to others. But as a rule of thumb, your ‘other’ category should not be more than 20%. This is an actual examples from one company’s data we worked with, where we helped them reduce ‘Other’ to 8% compared to a 54% of a home-baked code.
  20. This is why one size will not fit all.
  21. My next NLP Must Have is about the necessity of having a clear link to the original comment. Context is king, as they say, and without context it is hard to interpret, understand and act upon the results. I have seen several NLP solutions that do not provide that option.
  22. Verification can be painful. Thematic once was tested against a human coder Kate. We identified that one of the key things students wished was improved at a university was the quality of food. Kate found the same issue, but at much lower frequency. By being able to pull out all comments on this topic, we verified them, and found that Kate was tagging only key issues in each comment, wheres we tagged all of them. As a result, the university could act upon this problem and increase student satisfaction simply by improving the situation with food.
  23. Transparency in how the algorithm came to particular results is also important, because only then we can give somebody like Kate a chance work with algorithm to benefit from both of their strengths. Kate knows the domain, what’s important to track and what can be ignored.
  24. Sometimes there is a wrong and right answer. For example, soccer world cup is in many countries the same as football world cup. But in other cases, it depends on the customer’s priorities whether they want to track rugby world cup separately from soccer/football world cup, or as the same thing. And they need to be able to make changes to how the system decided to do the grouping.
  25. Small datasets are a big pain for data-driven algorithms. You can’t build a language model on Wikipedia or IMDB reviews, because words mean different things in different context. And a model built on a small dataset won’t work. Solutions are: create industry-specific rules, repurpose data from different clients, or get creative.
  26. At Thematic, we get creative quite often. One of our customers is a DJ software company Serato. They have thousands of users, but only get a few hundred of short comments per month. So to help them, we built a language model from their community forums, that turned out to have millions of threads, and learned about things like processors, controllers, playback etc.
  27. Finally, the result of NLP analysis should provide information that’s not trivial and easy to act on. Let’s say, an NLP system analysed 500 comments of a software company and returned that key categories like ‘product’, ‘customer service’, and the name of the company. This is not insightful. Similarly, knowing that customer service has poor sentiment is not actionable.
  28. Keeping this in mind, NLP solutions can be evaluated according to this diagram. On the one axis we have language knowledge categorised by how actionable it is. On the other axis, we have trivial, suspected, but needed to verify using data, and finally new insightful knowledge. For example, we can easily guess which words will repeat in customer comments. These words will have zero meaning. 90% of NLP solutions that I’ve seen in the market capture general aspect of what’s in the comment and do not return any actionable results. Ideally, an NLP solution should return a mixture of themes, some of which should be insightful and actionable. Perhaps, only customer insight managers can judge if something is an insight to them or not, but in general this is where we want to be.
  29. Coming back to our diagram from the beginning of this talk, the correct answer is ‘New product feature’. If the NLP solution works correctly, as you are moving from one month to another, you should be able to see a change in the trending themes for that month. In this particular case, the trending keyword was ‘hard to read’, and the company fixed it by changing the font in the UI.
  30. Here they are again. If I have missed something or you disagree, let’s discuss! If you would like a report comparing different NLP methods against these Must Haves, please send me an email.