Master’s Thesis Defense
22nd January, 2020
Nikhil Khullar
MBA IBC (184856)
2
Research Details
 Title: Consumer perception and reviews on mobile phones:
An analysis using sentiment models in machine learning
 Primary Supervisor: Prof. Dr. rer. nat. Thomas Wenger
 Secondary Supervisor: Prof. Dr. rer. nat. Tobias Hagen
 MBA Program Director: Prof. Dr. rer. pol. Rainer Fischer
 Date of submission: 15th January, 2020
 Date of presentation: 22nd January, 2020
3
About Me
 Prior to the MBA (IBC), I worked for more than 7 years in various
positions across the web and mobile application development spectrum in Japan,
South Korea, the US, the UK, India, Iceland and Germany.
 Software engineer by profession, and a hobbyist musician.
 Recently concluded an internship at Accenture in Burghausen, Germany.
 Interested in understanding how people perceive products and
services; keenly follow the development of smartphones.
 Fascinated by the craftsmanship behind deriving actionable
insights from huge datasets, which can help businesses considerably.
4
Agenda
1. Objectives: Introduction, Domain, Target data and Goals
2. Methodology: Research phases and techniques followed
3. Results: Findings from the analyses
4. Conclusions: Summary, Business interpretations and Q&A
5
Target Domain
 Mobile phone industry
 one of the fastest growing sectors
 defines success of consumer electronics firms today
 global smartphone sales revenue: 522 billion USD
 1.56 billion units being sold each year
 Constant existential threat to players in the market due to innovations as
well as new entrants.
 Amazon – one of the biggest online marketplaces, only matched by Alibaba.
 Fast changing landscape in terms of users’ needs and habits.
Source: Statista
6
Why Online Reviews?
 Significance of online reviews:
 90% of consumers read online reviews before visiting a business.
 Online reviews have been shown to impact 67.7% of purchasing decisions.
 84% of people trust online reviews as much as a personal recommendation.
 Businesses risk losing 22% of business when potential customers find one negative article
on the first page of their search results; this risk grows to 44% with two negative articles
and to almost 60% with three.
 Why sentiment analysis?
 Scalar ratings (typically 1-5) alone are of limited help because:
 The “why” behind a rating, or behind a metric such as the average rating, cannot be determined.
 Numeric ratings are not comparable across segments and devices.
Source: Forbes
7
Introduction
 Text analytics
 Machine learning
 Unstructured raw data
 Extracting human sentiments from written text
 Sentiment analysis as a classification problem
 Qualitative sentiment analysis
 Overall aim: gaining actionable insights from customers’ voices
 Polarity: discrete or continuous
 Subjective and Objective sentiment analysis
 ML-based models and lexicon-based VADER
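The difference between continuous and discrete polarity can be shown with a minimal sketch. The function name is hypothetical, and the ±0.05 cut-offs are the thresholds conventionally used with VADER compound scores, assumed here for illustration rather than taken from the thesis.

```python
def to_discrete_polarity(score: float, threshold: float = 0.05) -> str:
    """Map a continuous polarity score in [-1, 1] to a discrete class."""
    if not -1.0 <= score <= 1.0:
        raise ValueError("score must lie in [-1, 1]")
    if score >= threshold:
        return "positive"
    if score <= -threshold:
        return "negative"
    return "neutral"


# Invented example scores, one per class.
for s in (0.82, 0.01, -0.40):
    print(s, to_discrete_polarity(s))
```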
8
Target Dataset
 Gathered by PromptCloud Web Scraping Service
 Long-term data until 2018 made available under a Creative Commons
license with all copyrights waived.
 Recent review data, covering mid-2018 to July 2019,
purchased from PromptCloud for this research.
 Purchased from the Data Stock shop via:
 https://datastock.shop
 After selection and pre-processing phases of the pipeline:
 99708 long-term reviews and 49484 recent reviews were retained.
 after de-duping, brand name harmonisation etc.
 Datasets intentionally not unified to perform separate analyses.
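De-duplication and brand name harmonisation could look roughly like the sketch below. The alias table, field names and sample reviews are invented for illustration; the actual rules used in the pipeline are not given on the slide.

```python
# Hypothetical alias table; the real harmonisation rules are not in the slides.
BRAND_ALIASES = {
    "samsung": "Samsung",
    "samsung electronics": "Samsung",
    "apple": "Apple",
    "apple inc.": "Apple",
}


def harmonise_brand(raw: str) -> str:
    """Map spelling variants of a brand to one canonical name."""
    key = " ".join(raw.lower().split())
    return BRAND_ALIASES.get(key, raw.strip())


def clean_reviews(reviews: list[dict]) -> list[dict]:
    """Harmonise brand names and drop duplicate reviews, keeping first occurrences."""
    seen = set()
    cleaned = []
    for review in reviews:
        brand = harmonise_brand(review["brand"])
        text = review["text"].strip()
        key = (brand, text.lower())  # case-insensitive duplicate check
        if key in seen:
            continue
        seen.add(key)
        cleaned.append({"brand": brand, "text": text})
    return cleaned


sample = [
    {"brand": "SAMSUNG", "text": "Great battery life"},
    {"brand": "Samsung Electronics", "text": "great battery life "},
    {"brand": "apple inc.", "text": "Screen cracked quickly"},
]
print(clean_reviews(sample))
```

Harmonising before de-duplicating matters: the first two sample reviews only become recognisable duplicates once both brands map to "Samsung".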
9
Goals
 Exploratory statistical analysis
 Comparing performance of models for sentiment classification:
 Logistic Regression
 Support Vector Machine (with linear kernel)
 k-Nearest Neighbours
 naïve Bayes (Gaussian)
 Random Forest and Ensemble Methods
 VADER
 Compound sentiment analysis using VADER and qualitative analysis on specific
target subsets.
 Business use-cases and interpretation of findings.
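To make the idea of a compound sentiment score concrete, here is a heavily simplified lexicon-based sketch. It is not the VADER library: the lexicon and its valence values are invented, and negation, intensifiers and punctuation emphasis are ignored. Only the final squashing step, x / sqrt(x² + alpha) with alpha = 15, follows the normalisation used in the open-source VADER implementation.

```python
import math

# Toy lexicon with made-up valence scores; VADER's real lexicon is far larger.
TOY_LEXICON = {
    "great": 3.1, "love": 3.2, "good": 1.9,
    "bad": -2.5, "useless": -1.8, "crash": -1.6,
}


def compound_score(text: str, alpha: float = 15.0) -> float:
    """Sum word valences and squash the total into the open interval (-1, 1)."""
    total = sum(
        TOY_LEXICON.get(word.strip(".,!?").lower(), 0.0) for word in text.split()
    )
    return total / math.sqrt(total * total + alpha)


print(round(compound_score("Great phone, love the camera!"), 4))
print(round(compound_score("Useless bloatware and constant crash."), 4))
print(compound_score("It arrived on Tuesday."))
```

A text with no lexicon words scores exactly 0, while strongly worded texts approach but never reach -1 or +1.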
11
Research Methodology
 Data selection and sanitisation
 Exploratory Statistical Analysis
 Counts, mean values, distribution of ratings among the reviews,
correlation between review length and perceived helpfulness
 Word clouds
 Sentiment Analysis
 Comparative Analysis
 Compound Sentiment Analysis
 Qualitative Sentiment Analysis
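The correlation between review length and perceived helpfulness mentioned above can be sketched with a plain Pearson coefficient. The sample reviews and vote counts are invented, and measuring length in words is an assumption; the slides do not say how length was measured.

```python
import math


def pearson(xs: list[float], ys: list[float]) -> float:
    """Pearson correlation coefficient of two equally long samples."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two samples of equal length >= 2")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sample")
    return cov / math.sqrt(var_x * var_y)


# Invented toy data: (review text, helpful votes).
reviews = [
    ("Good", 0),
    ("Battery lasts two days", 3),
    ("Camera is sharp, screen is bright, and it charges fast", 7),
    ("Bad", 1),
]
lengths = [len(text.split()) for text, _ in reviews]
votes = [v for _, v in reviews]
print(round(pearson(lengths, votes), 3))
```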
12
Data Encoding & Splitting
 One-hot encoding
 no ordinal relationship exists in unstructured textual data
 binary values are preferred over integer encoding
 Label encoding
 typically used for normalising a set of labels, or for transforming
non-numerical labels to numerical ones; our use case: KNN
 Data splitting
 hyper-parameter optimisation
 training, testing, hold-out validation
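The two encodings and the three-way split can be illustrated in a few lines of plain Python. This is a sketch under assumptions: the 70/15/15 proportions, the fixed seed and all function names are illustrative and not taken from the thesis.

```python
import random


def label_encode(labels):
    """Map each distinct label to an integer code (sorted label order)."""
    classes = sorted(set(labels))
    index = {label: i for i, label in enumerate(classes)}
    return [index[label] for label in labels], classes


def one_hot_encode(labels):
    """Represent each label as a binary vector containing a single 1."""
    codes, classes = label_encode(labels)
    return [[1 if i == code else 0 for i in range(len(classes))] for code in codes]


def split(items, train_pct=70, validation_pct=15, seed=42):
    """Shuffle, then split into training, hold-out validation and test parts."""
    shuffled = list(items)
    random.Random(seed).shuffle(shuffled)
    n_train = len(shuffled) * train_pct // 100
    n_val = len(shuffled) * validation_pct // 100
    return (
        shuffled[:n_train],
        shuffled[n_train:n_train + n_val],
        shuffled[n_train + n_val:],
    )


brands = ["Samsung", "Apple", "Nokia", "Apple"]
print(label_encode(brands))
print(one_hot_encode(brands))
train_set, val_set, test_set = split(range(20))
print(len(train_set), len(val_set), len(test_set))
```

Integer codes imply an order (Apple < Nokia < Samsung) that does not exist in the data, which is why one-hot vectors are preferred where a model could misread that order. The validation part is the one used for hyper-parameter optimisation, leaving the test part untouched for the final comparison.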
14
Distribution Plots
15
Word Clouds
16
Confusion Matrix
 Business Implications of metrics
 True negatives carry far lower business costs than false negatives
and false positives.
 Example: missing a very receptive market for a social video game in Taiwan
because tests used only the English localisation, leading to a false negative.
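The four cells of a binary confusion matrix can be counted as in the sketch below. The labels and predictions are invented, and treating "positive" as the positive class is an assumption.

```python
from collections import Counter


def confusion_counts(actual, predicted, positive="positive"):
    """Count TP, FP, FN and TN for a binary sentiment task."""
    counts = Counter()
    for a, p in zip(actual, predicted):
        if p == positive:
            counts["TP" if a == positive else "FP"] += 1
        else:
            counts["FN" if a == positive else "TN"] += 1
    return {key: counts[key] for key in ("TP", "FP", "FN", "TN")}


# Invented toy labels.
actual = ["positive", "positive", "negative", "negative", "positive"]
predicted = ["positive", "negative", "negative", "positive", "positive"]
print(confusion_counts(actual, predicted))
```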
17
Model Evaluation Metrics
 Accuracy
 percentage of correct predictions among total predictions.
 Precision
 when the model predicts positive, how often is it correct.
 Recall
 when the actual outcome is positive, how often does the model say so.
 F1 Score
 harmonic mean of precision and recall.
 better measure to seek a balance, based on business costs.
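The four metrics follow directly from the confusion-matrix counts. The sketch below uses invented counts; returning 0.0 when a denominator is zero is a convention chosen here, not something stated in the slides.

```python
def classification_metrics(tp: int, fp: int, fn: int, tn: int) -> dict:
    """Compute accuracy, precision, recall and F1 from confusion-matrix counts."""
    total = tp + fp + fn + tn
    accuracy = (tp + tn) / total if total else 0.0
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    # F1 is the harmonic mean of precision and recall.
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return {"accuracy": accuracy, "precision": precision, "recall": recall, "f1": f1}


# Invented counts for illustration.
metrics = classification_metrics(tp=80, fp=10, fn=20, tn=90)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

Because the harmonic mean is pulled towards the smaller of its two inputs, F1 drops sharply when either precision or recall is poor, which is what makes it useful when both kinds of error carry business costs.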
18
Comparative Results
19
Comparative Results
20
VADER Compound Analysis
Apple
Samsung
21
Qualitative results
 Insights leading to most positives recently on Android phones:
 wireless charging, image stabilisation, curved edge
 quad-core, heart rate, super AMOLED, battery life, snapdragon
 Insights leading to most negatives recently on Android phones:
 unlocking, Bixby, phone heat, Android crash
 pants pocket, bloatware apps, useless features
 Phone perception linking to general brand image:
 many highly positive iPhone reviews refer to MacBook Pro, Air, iPad Pro
 post-sale customer service also seems to impact product perception
 trends from rolling and expanding means on time series coincide with events
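Rolling and expanding means over a sentiment time series can be sketched as follows. The monthly scores and the window of 3 are invented for illustration; the window sizes used in the thesis are not stated on the slide.

```python
def rolling_mean(values, window):
    """Mean of the last `window` values at each position (None until the window fills)."""
    out = []
    for i in range(len(values)):
        if i + 1 < window:
            out.append(None)
        else:
            chunk = values[i + 1 - window:i + 1]
            out.append(sum(chunk) / window)
    return out


def expanding_mean(values):
    """Mean of all values seen so far at each position."""
    out, running = [], 0.0
    for i, v in enumerate(values, start=1):
        running += v
        out.append(running / i)
    return out


# Invented monthly average compound scores for one brand.
scores = [0.6, 0.4, 0.2, 0.8, 0.0, 0.4]
print(rolling_mean(scores, window=3))
print(expanding_mean(scores))
```

The rolling mean reacts to recent shifts, such as a product launch or a recall, while the expanding mean shows the long-run level that those shifts move against.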
23
Business use-cases
 Descriptive, predictive and prescriptive analytics
 data-driven decision making
 forecasting
 uses abound in industries ranging from video games and stock markets to medicine
 Examples from currently thriving start-ups:
 Gavagai – instant operational insights
 Talkwalker – empowering brands socially
 Aspectiva – acquired by Walmart for recommendation engine
 Smartmunk – improving customer loyalty
 Revuze – text mining on call center feedback, online CX, social media etc.
24
“A breakthrough in machine learning would be worth ten Microsofts”
~ Bill Gates

Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
 
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
 
Charbagh + Female Escorts Service in Lucknow | Starting ₹,5K To @25k with A/C...
Charbagh + Female Escorts Service in Lucknow | Starting ₹,5K To @25k with A/C...Charbagh + Female Escorts Service in Lucknow | Starting ₹,5K To @25k with A/C...
Charbagh + Female Escorts Service in Lucknow | Starting ₹,5K To @25k with A/C...
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Research
 

Defense nikhil khullar

  • 1. Master’s Thesis Defense, 22nd January, 2020. Nikhil Khullar, MBA IBC (184856)
  • 2. Research Details. Title: Consumer perception and reviews on mobile phones: An analysis using sentiment models in machine learning; Primary Supervisor: Prof. Dr. rer. nat. Thomas Wenger; Secondary Supervisor: Prof. Dr. rer. nat. Tobias Hagen; MBA Program Director: Prof. Dr. rer. pol. Rainer Fischer; Date of submission: 15th January, 2020; Date of presentation: 22nd January, 2020
  • 3. About Me. Prior to the MBA (IBC), I worked for more than 7 years in various positions across the web and mobile application development spectrum in Japan, South Korea, US, UK, India, Iceland and Germany; I have been a software engineer by profession, and am a hobbyist musician; I recently concluded an internship at Accenture in Burghausen, Germany; I find it interesting to comprehend how people perceive products and services, and keenly follow the development of smartphones; I am fascinated by the craftsmanship behind deriving actionable insights from huge data sets, which can help businesses considerably.
  • 4. Agenda. (1) Objectives: Introduction, Domain, Target data and Goals; (2) Methodology: Research phases and techniques followed; (3) Results: Findings from the analyses; (4) Conclusions: Summary, Business interpretations and Q&A
  • 5. Target Domain. Mobile phone industry: one of the fastest growing sectors; defines success of consumer electronics firms today; global smartphone sales revenue: 522 billion USD; 1.56 billion units sold each year. Constant existential threat to players in the market due to innovations as well as new entrants. Amazon: one of the biggest online marketplaces, matched only by Alibaba. Fast-changing landscape in terms of users’ needs and habits. Source: Statista
  • 6. Why Online Reviews? Significance of online reviews: 90% of consumers read online reviews before visiting a business; online reviews have been shown to impact 67.7% of purchasing decisions; 84% of people trust online reviews as much as a personal recommendation; businesses risk losing 22% of business when potential customers find one negative article on the first page of their search results, and this risk grows to 44% with two negative articles and to almost 60% with three. Why sentiment analysis? Scalar ratings (typically 1-5) are not very helpful because: the “why” behind a rating, or behind a metric like average rating, can’t be determined; numeric ratings are not comparable across segments and devices. Source: Forbes
  • 7. Introduction. Text analytics; Machine learning; Unstructured raw data; Extracting human sentiments from written text; Sentiment analysis as a classification problem; Qualitative sentiment analysis; Overall aim: gaining actionable insights from customers’ voices; Polarity: discrete or continuous; Subjective and Objective sentiment analysis; ML-based models and lexicon-based VADER
  • 8. Target Dataset. Gathered by PromptCloud Web Scraping Service. Long-term data until 2018 made available under a Creative Commons license with all copyrights waived. Recent reviews data, covering mid 2018 to July 2019, purchased from PromptCloud for this research via the Data Stock shop: https://datastock.shop. After the selection and pre-processing phases of the pipeline (de-duping, brand name harmonisation etc.), 99708 long-term reviews and 49484 recent reviews were retained. Datasets intentionally not unified, so that separate analyses could be performed.
  • 9. Goals. Exploratory statistical analysis. Comparing performance of models for sentiment classification: Logistic Regression; Support Vector Machine (with linear kernel); k-Nearest Neighbours; Naïve Bayes (Gaussian); Random Forest and Ensemble Methods; VADER. Compound sentiment analysis using VADER and qualitative analysis on specific target subsets. Business use-cases and interpretation of findings.
  • 10. Agenda. (1) Objectives: Introduction, Domain, Target data and Goals; (2) Methodology: Research phases and techniques followed; (3) Results: Findings from the analyses; (4) Conclusions: Summary, Business interpretations and Q&A
  • 11. Research Methodology. Data selection and sanitisation. Exploratory Statistical Analysis: counts, mean values, distribution of ratings among the reviews, correlation between review length and perceived helpfulness; word clouds. Sentiment Analysis: Comparative Analysis; Compound Sentiment Analysis; Qualitative Sentiment Analysis
  • 12. Data Encoding & Splitting. One-hot encoding: no ordinal relationship exists in unstructured textual data; binary values are preferred over integer encoding. Label encoding: typically used for normalising a set of labels, or for transforming non-numerical labels to numerical ones; our use case: KNN. Data splitting: hyper-parameter optimisation; training, testing, hold-out validation
  • 13. Agenda. (1) Objectives: Introduction, Domain, Target data and Goals; (2) Methodology: Research phases and techniques followed; (3) Results: Findings from the analyses; (4) Conclusions: Summary, Business interpretations and Q&A
  • 16. Confusion Matrix. Business implications of metrics: true negatives carry far lower business costs than false negatives and false positives. Example of a false negative: missing a very receptive market for a social video game in Taiwan because the tests used English localisation.
  • 17. Model Evaluation Metrics. Accuracy: percentage of correct predictions among total predictions. Precision: when the model predicts positive, how often it is correct. Recall: when the outcome is positive, how often the model says so. F1 Score: harmonic mean of precision and recall; a better measure when seeking a balance, based on business costs.
  • 21. Qualitative results. Insights leading to most positives recently on Android phones: wireless charging, image stabilisation, curved edge, quad-core, heart rate, super AMOLED, battery life, snapdragon. Insights leading to most negatives recently on Android phones: unlocking, Bixby, phone heat, Android crash, pants pocket, bloatware apps, useless features. Phone perception linking to general brand image: many highly positive iPhone reviews refer to MacBook Pro, Air, iPad Pro; post-sale customer service also seems to impact product perception; trends from rolling and expanding means on time series coincide with events
  • 22. Agenda. (1) Objectives: Introduction, Domain, Target data and Goals; (2) Methodology: Research phases and techniques followed; (3) Results: Findings from the analyses; (4) Conclusions: Summary, Business interpretations and Q&A
  • 23. Business use-cases. Descriptive, predictive and prescriptive analytics: data-driven decision making; forecasting; uses abound in industries from video games and stock markets to medicine. Examples from currently thriving start-ups: Gavagai (instant operational insights); Talkwalker (empowering brands socially); Aspectiva (acquired by Walmart for recommendation engine); Smartmunk (improving customer loyalty); Revuze (text mining on call center feedback, online CX, social media etc.)
  • 25. “A breakthrough in machine learning would be worth ten Microsofts” ~ Bill Gates
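
The evaluation metrics listed on slide 17 can be illustrated with a minimal sketch in plain Python. This is not the code used in the thesis; the function names and the toy labels (1 = positive review, 0 = negative review) are assumptions made purely for illustration.

```python
def confusion_counts(y_true, y_pred, positive=1):
    """Count true/false positives and negatives for a binary problem."""
    tp = fp = tn = fn = 0
    for truth, pred in zip(y_true, y_pred):
        if pred == positive and truth == positive:
            tp += 1
        elif pred == positive:
            fp += 1
        elif truth == positive:
            fn += 1
        else:
            tn += 1
    return tp, fp, tn, fn


def evaluation_metrics(y_true, y_pred, positive=1):
    """Return accuracy, precision, recall and F1 as a dictionary."""
    tp, fp, tn, fn = confusion_counts(y_true, y_pred, positive)
    total = tp + fp + tn + fn
    accuracy = (tp + tn) / total if total else 0.0
    # Precision: of everything predicted positive, how much was correct.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    # Recall: of everything actually positive, how much was found.
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    # F1: harmonic mean of precision and recall.
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1}


# Hypothetical toy labels, not data from the thesis.
y_true = [1, 1, 1, 0, 0, 1, 0, 1]
y_pred = [1, 0, 1, 0, 1, 1, 0, 1]
metrics = evaluation_metrics(y_true, y_pred)
print(metrics)
```

With these toy labels there are 4 true positives, 1 false positive, 2 true negatives and 1 false negative, so accuracy is 0.75 while precision, recall and F1 are each 0.8. When false negatives and false positives carry different business costs, precision and recall can be weighed separately rather than relying on accuracy alone.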
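
The encoding and splitting steps named on slide 12 can be sketched in a few lines of plain Python. The slides do not state the split ratios or the label set, so the 60/20/20 split, the three sentiment labels and all function names below are hypothetical choices for illustration only.

```python
import random


def label_encode(labels):
    """Map each distinct label to an integer code (alphabetical order)."""
    classes = sorted(set(labels))
    index = {label: code for code, label in enumerate(classes)}
    return [index[label] for label in labels], classes


def one_hot_encode(labels):
    """Turn each label into a binary vector with a single 1."""
    codes, classes = label_encode(labels)
    vectors = [[1 if i == code else 0 for i in range(len(classes))]
               for code in codes]
    return vectors, classes


def split_dataset(rows, train=0.6, test=0.2, seed=42):
    """Shuffle and split into training, testing and hold-out validation.

    The fractions are assumed values; whatever remains after the
    training and testing parts becomes the hold-out set.
    """
    shuffled = list(rows)
    random.Random(seed).shuffle(shuffled)
    n_train = round(len(shuffled) * train)
    n_test = round(len(shuffled) * test)
    return (shuffled[:n_train],
            shuffled[n_train:n_train + n_test],
            shuffled[n_train + n_test:])


# Hypothetical labels, not taken from the thesis data.
labels = ["positive", "negative", "neutral", "positive"]
codes, classes = label_encode(labels)
vectors, _ = one_hot_encode(labels)
train_rows, test_rows, holdout_rows = split_dataset(range(10))
print(classes, codes, vectors)
print(len(train_rows), len(test_rows), len(holdout_rows))
```

Label encoding yields a single integer per label, which imposes an artificial order; one-hot encoding avoids that by giving each class its own binary column, in line with the slide's remark that no ordinal relationship exists in the textual data.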