SlideShare une entreprise Scribd logo
1  sur  14
Language Intelligence
Why Sentiment Analysis is a Market for
Lemons … and How to Fix it
Robert Munro
With thanks!
Gary King & Jana Thompson:
<- other Idibon people here:
Michelle Casbon & Nick Gaylord
What is a market for lemons?
• Information asymmetry between
buyers and sellers, leaving only
"lemons" behind. George Akerlof
• Buyers cannot distinguish good
from bad products
• Prices are equally low for all
products
• The buyer's price adverse
selection problem drives the
high-quality products from the
market
Competition is not increasing accuracy
• 100+ companies
offering some
form of sentiment
analysis
• Accuracy hovering
around 70% for
real-world
applications for
almost a decade
The most honest sentiment analysis results you will
see
Accuracy F-Score Recall Precision F-Score
Positive Negative Neutral Positive Negative Neutral Positive Negative Neutral
Semantria 0.59 0.59 0.56 0.47 0.78 0.68 0.80 0.45 0.62 0.59 0.57
MonkeyLearn 0.50 0.38* 0.84 0.54 0.00 0.45 0.60 0.00 0.59 0.57 0.00
MetaMind 0.66 0.66 0.68 0.46 0.88 0.78 0.88 0.50 0.73 0.60 0.64
Idibon Public 0.68 0.67 0.76 0.75 0.49 0.66 0.69 0.72 0.71 0.72 0.58
• Even within the best results for one domain, there is no clear
leader when broken down by category
• All systems could have best results in other domains
• All could adapt here: Monkey Learn had errors with the ‘Neutral’
category, but we are sure they could update their models
Source: Sentiment 140 corpus, 3-way sentiment on social data:
http://cs.stanford.edu/people/alecmgo/trainingandtestdata.zip
Data beats algorithms; feedback beats data
0.457 0.473
0.615
0.948
0%
10%
20%
30%
40%
50%
60%
70%
80%
90%
100%
Linear model Deep Learning In-domain
training
10mins analyst
feedback
precision
recall
F-value
Distinguishing the correct ‘Ford’
Distinguishing “Ford” the company from people called “Ford”
Consumers are
uncertain
• When consumers try out-
of-domain analysis, they
lose confidence from the
poor results.
• Domain-dependence
means that even bad
models will be accurate
in some areas
• Consumers can only
evaluate anecdotally or
by precision, not recall
• Uncertainty prevails
Market forces are not breeding innovation
• Can’t innovate
through code alone
• More training data!
• But low price-points
means low margins
• Lack of capital to
find & label enough
training data
The Solution
• A different economic
models for useful
sentiment analysis:
• Data-sharing for more
accurate training data
• Protecting sensitive data
from public release
Machine
learning
Optimization
Human
annotation
Cloud
prediction
engine
Actionable
intelligence
On-site
prediction
engine
Copy & Sync Models
App Requests
Ambiguous, Novel & Interesting Items
Internal Data Flow
Hybrid Model Data Flow
Application Data Flow
firewall
The Benefits
• Multiple organizations can share in the benefits of better
sentiment analysis, without sacrificing privacy
• Single point of human-contact: no expensive duplicate
manual labeling of data
• Keeps lemons out of the market
Idibon Public: our implementation
• Free product, offered in addition to our enterprise
Idibon Studio and Idibon Terminal solutions
Applies to NLP and Machine
Learning more broadly
Every human communication
• Any task can be bundled this way
• Allows margins for use cases that
were not otherwise viable
• … including the full diversity of
languages, priced out when
everyone started in English
Language Intelligence
Why Sentiment Analysis is a Market for
Lemons … and How to Fix it
QUESTIONS?
Robert Munro

Contenu connexe

En vedette

Realtime crowdsourced translation for emergency response and beyond
Realtime crowdsourced translation for emergency response and beyondRealtime crowdsourced translation for emergency response and beyond
Realtime crowdsourced translation for emergency response and beyondRobert Munro
 
Subword and spatiotemporal models for identifying actionable information in ...
Subword and spatiotemporal models for identifying actionable information in ...Subword and spatiotemporal models for identifying actionable information in ...
Subword and spatiotemporal models for identifying actionable information in ...Robert Munro
 
Crowdsourcing and Natural Language Processing for Humanitarian Response
Crowdsourcing and Natural Language Processing for Humanitarian Response Crowdsourcing and Natural Language Processing for Humanitarian Response
Crowdsourcing and Natural Language Processing for Humanitarian Response Robert Munro
 
Bringing Data Science to the Speakers of Every Language
Bringing Data Science to the Speakers of Every Language Bringing Data Science to the Speakers of Every Language
Bringing Data Science to the Speakers of Every Language Robert Munro
 
Understanding Community Needs: Scalable SMS Processing for UNICEF Nigeria and...
Understanding Community Needs: Scalable SMS Processing for UNICEF Nigeria and...Understanding Community Needs: Scalable SMS Processing for UNICEF Nigeria and...
Understanding Community Needs: Scalable SMS Processing for UNICEF Nigeria and...Idibon1
 
Talking to the crowd in 7,000 languages
Talking to the crowd in 7,000 languages �Talking to the crowd in 7,000 languages �
Talking to the crowd in 7,000 languages Robert Munro
 
Processing short-message communications in low-resource languages
Processing short-message communications in low-resource languages�Processing short-message communications in low-resource languages�
Processing short-message communications in low-resource languages Robert Munro
 
Energy for Opportunity, Presentation for E-Discuss
Energy for Opportunity, Presentation for E-DiscussEnergy for Opportunity, Presentation for E-Discuss
Energy for Opportunity, Presentation for E-DiscussRobert Munro
 
Tracking Epidemics with Natural Language Processing and Crowdsourcing
Tracking Epidemics with Natural Language Processing and Crowdsourcing�Tracking Epidemics with Natural Language Processing and Crowdsourcing�
Tracking Epidemics with Natural Language Processing and CrowdsourcingRobert Munro
 

En vedette (10)

Realtime crowdsourced translation for emergency response and beyond
Realtime crowdsourced translation for emergency response and beyondRealtime crowdsourced translation for emergency response and beyond
Realtime crowdsourced translation for emergency response and beyond
 
Subword and spatiotemporal models for identifying actionable information in ...
Subword and spatiotemporal models for identifying actionable information in ...Subword and spatiotemporal models for identifying actionable information in ...
Subword and spatiotemporal models for identifying actionable information in ...
 
Crowdsourcing and Natural Language Processing for Humanitarian Response
Crowdsourcing and Natural Language Processing for Humanitarian Response Crowdsourcing and Natural Language Processing for Humanitarian Response
Crowdsourcing and Natural Language Processing for Humanitarian Response
 
Bringing Data Science to the Speakers of Every Language
Bringing Data Science to the Speakers of Every Language Bringing Data Science to the Speakers of Every Language
Bringing Data Science to the Speakers of Every Language
 
Understanding Community Needs: Scalable SMS Processing for UNICEF Nigeria and...
Understanding Community Needs: Scalable SMS Processing for UNICEF Nigeria and...Understanding Community Needs: Scalable SMS Processing for UNICEF Nigeria and...
Understanding Community Needs: Scalable SMS Processing for UNICEF Nigeria and...
 
Talking to the crowd in 7,000 languages
Talking to the crowd in 7,000 languages �Talking to the crowd in 7,000 languages �
Talking to the crowd in 7,000 languages
 
Processing short-message communications in low-resource languages
Processing short-message communications in low-resource languages�Processing short-message communications in low-resource languages�
Processing short-message communications in low-resource languages
 
Energy for Opportunity, Presentation for E-Discuss
Energy for Opportunity, Presentation for E-DiscussEnergy for Opportunity, Presentation for E-Discuss
Energy for Opportunity, Presentation for E-Discuss
 
Crowdring
CrowdringCrowdring
Crowdring
 
Tracking Epidemics with Natural Language Processing and Crowdsourcing
Tracking Epidemics with Natural Language Processing and Crowdsourcing�Tracking Epidemics with Natural Language Processing and Crowdsourcing�
Tracking Epidemics with Natural Language Processing and Crowdsourcing
 

Similaire à Why Sentiment Analysis is a Market for Lemons … and How to Fix it

Do's and Don'ts of Data Driven Marketing
Do's and Don'ts of Data Driven MarketingDo's and Don'ts of Data Driven Marketing
Do's and Don'ts of Data Driven MarketingSparkPost
 
Search Engine Results: The Best Measure?
Search Engine Results: The Best Measure? Search Engine Results: The Best Measure?
Search Engine Results: The Best Measure? Fan Foundry
 
A change manifesto for the CIO: A business perspective by Hugh Terry
A change manifesto for the CIO: A business perspective by Hugh Terry A change manifesto for the CIO: A business perspective by Hugh Terry
A change manifesto for the CIO: A business perspective by Hugh Terry The Digital Insurer
 
The future of Data Management
The future of Data ManagementThe future of Data Management
The future of Data ManagementThom. Poole
 
Andrew Grant
Andrew GrantAndrew Grant
Andrew GrantMRS
 
Winning the New Digital Consumer with Hyper-Relevance
Winning the New Digital Consumer with Hyper-RelevanceWinning the New Digital Consumer with Hyper-Relevance
Winning the New Digital Consumer with Hyper-RelevanceJoseph M Bradley
 
10 Ways to Leverage the Power of Marketing Automation
10 Ways to Leverage the Power of Marketing Automation10 Ways to Leverage the Power of Marketing Automation
10 Ways to Leverage the Power of Marketing Automationedynamic
 
From data to business intelligence
From data to business intelligenceFrom data to business intelligence
From data to business intelligencenjaffe
 
AI Foundations Course Module 1 - An AI Transformation Journey
AI Foundations Course Module 1 - An AI Transformation JourneyAI Foundations Course Module 1 - An AI Transformation Journey
AI Foundations Course Module 1 - An AI Transformation JourneySri Ambati
 
Information transparency, michael paulson, sic, 2011 11 03
Information transparency, michael paulson, sic, 2011 11 03Information transparency, michael paulson, sic, 2011 11 03
Information transparency, michael paulson, sic, 2011 11 03Michael Paulson
 
Your CRM is a DEAD-END!
Your CRM is a DEAD-END!Your CRM is a DEAD-END!
Your CRM is a DEAD-END!Tenbound
 
Media vs Data: Why the Double Standard?
Media vs Data: Why the Double Standard?Media vs Data: Why the Double Standard?
Media vs Data: Why the Double Standard?MediaPost
 
Does big data = big insights?
Does big data = big insights?Does big data = big insights?
Does big data = big insights?Colin Strong
 
eCommerce expo- Jasper Bell
eCommerce expo- Jasper BelleCommerce expo- Jasper Bell
eCommerce expo- Jasper Bellamazeplc
 
Artificial Intelligence: How Enterprises Can Crush It With Apache Spark: Keyn...
Artificial Intelligence: How Enterprises Can Crush It With Apache Spark: Keyn...Artificial Intelligence: How Enterprises Can Crush It With Apache Spark: Keyn...
Artificial Intelligence: How Enterprises Can Crush It With Apache Spark: Keyn...Spark Summit
 
7 Deadly Sins of Programmatic | Gil Snir, Benchmarketing | GC Brisbane 2016
7 Deadly Sins of Programmatic | Gil Snir, Benchmarketing | GC Brisbane 20167 Deadly Sins of Programmatic | Gil Snir, Benchmarketing | GC Brisbane 2016
7 Deadly Sins of Programmatic | Gil Snir, Benchmarketing | GC Brisbane 2016Bench
 
Tyler Garns - Automation That Converts
Tyler Garns - Automation That ConvertsTyler Garns - Automation That Converts
Tyler Garns - Automation That ConvertsInfusionsoft
 
"Ready or Not, Here Comes 2015: Marketing Trends to Master" TrendLab Webinar
"Ready or Not, Here Comes 2015: Marketing Trends to Master" TrendLab Webinar"Ready or Not, Here Comes 2015: Marketing Trends to Master" TrendLab Webinar
"Ready or Not, Here Comes 2015: Marketing Trends to Master" TrendLab WebinarBluespire Marketing
 
Triangle AMA Marketing Workshop
Triangle AMA Marketing WorkshopTriangle AMA Marketing Workshop
Triangle AMA Marketing WorkshopJon Barlow
 

Similaire à Why Sentiment Analysis is a Market for Lemons … and How to Fix it (20)

Do's and Don'ts of Data Driven Marketing
Do's and Don'ts of Data Driven MarketingDo's and Don'ts of Data Driven Marketing
Do's and Don'ts of Data Driven Marketing
 
Search Engine Results: The Best Measure?
Search Engine Results: The Best Measure? Search Engine Results: The Best Measure?
Search Engine Results: The Best Measure?
 
A change manifesto for the CIO: A business perspective by Hugh Terry
A change manifesto for the CIO: A business perspective by Hugh Terry A change manifesto for the CIO: A business perspective by Hugh Terry
A change manifesto for the CIO: A business perspective by Hugh Terry
 
The future of Data Management
The future of Data ManagementThe future of Data Management
The future of Data Management
 
Andrew Grant
Andrew GrantAndrew Grant
Andrew Grant
 
Winning the New Digital Consumer with Hyper-Relevance
Winning the New Digital Consumer with Hyper-RelevanceWinning the New Digital Consumer with Hyper-Relevance
Winning the New Digital Consumer with Hyper-Relevance
 
10 Ways to Leverage the Power of Marketing Automation
10 Ways to Leverage the Power of Marketing Automation10 Ways to Leverage the Power of Marketing Automation
10 Ways to Leverage the Power of Marketing Automation
 
From data to business intelligence
From data to business intelligenceFrom data to business intelligence
From data to business intelligence
 
AI Foundations Course Module 1 - An AI Transformation Journey
AI Foundations Course Module 1 - An AI Transformation JourneyAI Foundations Course Module 1 - An AI Transformation Journey
AI Foundations Course Module 1 - An AI Transformation Journey
 
Information transparency, michael paulson, sic, 2011 11 03
Information transparency, michael paulson, sic, 2011 11 03Information transparency, michael paulson, sic, 2011 11 03
Information transparency, michael paulson, sic, 2011 11 03
 
Michael paulson-sic-2011
Michael paulson-sic-2011Michael paulson-sic-2011
Michael paulson-sic-2011
 
Your CRM is a DEAD-END!
Your CRM is a DEAD-END!Your CRM is a DEAD-END!
Your CRM is a DEAD-END!
 
Media vs Data: Why the Double Standard?
Media vs Data: Why the Double Standard?Media vs Data: Why the Double Standard?
Media vs Data: Why the Double Standard?
 
Does big data = big insights?
Does big data = big insights?Does big data = big insights?
Does big data = big insights?
 
eCommerce expo- Jasper Bell
eCommerce expo- Jasper BelleCommerce expo- Jasper Bell
eCommerce expo- Jasper Bell
 
Artificial Intelligence: How Enterprises Can Crush It With Apache Spark: Keyn...
Artificial Intelligence: How Enterprises Can Crush It With Apache Spark: Keyn...Artificial Intelligence: How Enterprises Can Crush It With Apache Spark: Keyn...
Artificial Intelligence: How Enterprises Can Crush It With Apache Spark: Keyn...
 
7 Deadly Sins of Programmatic | Gil Snir, Benchmarketing | GC Brisbane 2016
7 Deadly Sins of Programmatic | Gil Snir, Benchmarketing | GC Brisbane 20167 Deadly Sins of Programmatic | Gil Snir, Benchmarketing | GC Brisbane 2016
7 Deadly Sins of Programmatic | Gil Snir, Benchmarketing | GC Brisbane 2016
 
Tyler Garns - Automation That Converts
Tyler Garns - Automation That ConvertsTyler Garns - Automation That Converts
Tyler Garns - Automation That Converts
 
"Ready or Not, Here Comes 2015: Marketing Trends to Master" TrendLab Webinar
"Ready or Not, Here Comes 2015: Marketing Trends to Master" TrendLab Webinar"Ready or Not, Here Comes 2015: Marketing Trends to Master" TrendLab Webinar
"Ready or Not, Here Comes 2015: Marketing Trends to Master" TrendLab Webinar
 
Triangle AMA Marketing Workshop
Triangle AMA Marketing WorkshopTriangle AMA Marketing Workshop
Triangle AMA Marketing Workshop
 

Dernier

Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Google AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGGoogle AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGSujit Pal
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...gurkirankumar98700
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 

Dernier (20)

Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Google AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGGoogle AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAG
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 

Why Sentiment Analysis is a Market for Lemons … and How to Fix it

  • 1. Language Intelligence Why Sentiment Analysis is a Market for Lemons … and How to Fix it Robert Munro
  • 2. With thanks! Gary King & Jana Thompson: <- other Idibon people here: Michelle Casbon & Nick Gaylord
  • 3. What is a market for lemons? • Information asymmetry between buyers and sellers, leaving only "lemons" behind. George Akerlof • Buyers cannot distinguish good from bad products • Prices are equally low for all products • The buyer's price adverse selection problem drives the high-quality products from the market
  • 4. Competition is not increasing accuracy • 100+ companies offering some form of sentiment analysis • Accuracy hovering around 70% for real-world applications for almost a decade
  • 5. The most honest sentiment analysis results you will see Accuracy F-Score Recall Precision F-Score Positive Negative Neutral Positive Negative Neutral Positive Negative Neutral Semantria 0.59 0.59 0.56 0.47 0.78 0.68 0.80 0.45 0.62 0.59 0.57 MonkeyLearn 0.50 0.38* 0.84 0.54 0.00 0.45 0.60 0.00 0.59 0.57 0.00 MetaMind 0.66 0.66 0.68 0.46 0.88 0.78 0.88 0.50 0.73 0.60 0.64 Idibon Public 0.68 0.67 0.76 0.75 0.49 0.66 0.69 0.72 0.71 0.72 0.58 • Even within the best results for one domain, there is no clear leader when broken down by category • All systems could have best results in other domains • All could adapt here: Monkey Learn had errors with the ‘Neutral’ category, but we are sure they could update their models Source: Sentiment 140 corpus, 3-way sentiment on social data: http://cs.stanford.edu/people/alecmgo/trainingandtestdata.zip
  • 6. Data beats algorithms; feedback beats data 0.457 0.473 0.615 0.948 0% 10% 20% 30% 40% 50% 60% 70% 80% 90% 100% Linear model Deep Learning In-domain training 10mins analyst feedback precision recall F-value Distinguishing the correct ‘Ford’ Distinguishing “Ford” the company from people called “Ford”
  • 7. Consumers are uncertain • When consumers try out- of-domain analysis, they lose confidence from the poor results. • Domain-dependence means that even bad models will be accurate in some areas • Consumers can only evaluate anecdotally or by precision, not recall • Uncertainty prevails
  • 8. Market forces are not breeding innovation • Can’t innovate through code alone • More training data! • But low price-points means low margins • Lack of capital to find & label enough training data
  • 9. The Solution • A different economic models for useful sentiment analysis: • Data-sharing for more accurate training data • Protecting sensitive data from public release
  • 10. Machine learning Optimization Human annotation Cloud prediction engine Actionable intelligence On-site prediction engine Copy & Sync Models App Requests Ambiguous, Novel & Interesting Items Internal Data Flow Hybrid Model Data Flow Application Data Flow firewall
  • 11. The Benefits • Multiple organizations can share in the benefits of better sentiment analysis, without sacrificing privacy • Single point of human-contact: no expensive duplicate manual labeling of data • Keeps lemons out of the market
  • 12. Idibon Public: our implementation • Free product, offered in addition to our enterprise Idibon Studio and Idibon Terminal solutions
  • 13. Applies to NLP and Machine Learning more broadly Every human communication • Any task can be bundled this way • Allows margins for use cases that were not otherwise viable • … including the full diversity of languages, priced out when everyone started in English
  • 14. Language Intelligence Why Sentiment Analysis is a Market for Lemons … and How to Fix it QUESTIONS? Robert Munro