Performance Analysis of Supervised Machine Learning Techniques for Sentiment Analysis

Performance Analysis of Supervised Machine
Learning Techniques for Sentiment Analysis
Presented By : BiswaRanja Samal
Software Engineer @ Acesocloud
IEEE 3rd International Conference on Sensing, Signal Processing & Security
Authors :
1. Anil Kumar Behera
2. Mrutyunjaya Panda
P.G. Department of Computer Science and Applications
UtkalUniversity,Vani Vihar,Bhubaneswar-751004, India
Contents
• Introduction
• Motivation
• Proposed Methodology
• Experimental Results
• Conclusions and Future Scope
• References
Introduction
>> What is Machine Learning?
>> Types of Machine Learning.
>>> Supervised Machine Learning
>>> Unsupervised Machine Learning
>>> Reinforcement Learning
>>> Transduction
>>> Semi-supervised Machine Learning
>> What is Sentiment Analysis?
>> Why Sentiment Analysis?
Expressing the emotions and feelings with the help of words
makes human beings unique[19]. These feelings are known as
the sentiments and the process of analyzing these statements is
known as the Sentiment Analysis.
Sentiment analysis along with the machine learning
techniques can result in the building of a high-performance
intelligent system and can proof its expertise in the area of
artificial intelligence [16].
Motivation
sometimes it becomes a very complex job for the researchers to
select an appropriate machine learning technique according to
their requirement which leads them to improper result with very
poor accuracy and performance of the model.
This motivated us towards doing an investigation on performance
analysis of available machine learning techniques for sentiment
analysis. We have considered only the supervised machine
learning techniques and have tried to do a comparison in each
criterion of this technique.
Proposed Methodology
Algorithm overflow of the proposed methodology
Step1 : Start
Step2 : for each dataset present in dataSetList
Step2.1: Clean the data set
Step2.2: Prepare training data set
Step2.3: Prepare testing data set
Step2.4: for each classifier present in classifierList
Step2.4.1: train classifier with training data set
Step2.4.2: test classifier with testing data set
Step2.4.3: Obtain the accuracy percent from result
Step3: Finish
Methodology in Details
>> Collecting Movie Review Data Sets
We have collected various size of movie review data sets such as
10600, 25000, 35600, 50000 and 85600.
>> Cleaning the Data Sets
Movie review data set consists of characters, numbers, special
characters and unrecognized characters. Which may create
hazard for our classifier, that’s why after collecting the data
sets we have undertaken the data set cleaning procedure
>> Data Categorization
Data Set Size Positive Feedbacks Negative Feedbacks
10,600 5,300 5,300
25,000 12,500 12,500
35,600 17,800 17,800
50,000 25,000 25,000
85,600 42,800 42,8001
Data sets showing numbers of positive and negative feedback
>> Preparing Training and Testing Data Sets
It’s a common convenience to use 70% of the data set for the
training purpose and use rest 30% of the data for testing the
model we have also followed it.
Data Set Size Training Data Set Size Testing Data Set Size
10,600 7420 3180
25,000 17500 7500
35,600 24920 10680
50,000 35000 15000
85,600 59918 25680
Representing number of training and testing reviews
>> Training the Model with Training Data Sets
>> Testing the Model with Testing Data Sets
Experimental Results
Representing used classifiers performance with highlighting the classifier
which has performed very accurately .
Representing the graphical representation of all classifiers performance .
Conclusions and Future Works
>> In this paper, a simple yet novel approach on sentiment
analysis of movie reviews is performed using seven
promising supervised machine learning algorithms.
>> The results obtained concludes linear SVC/SVM as the
best classifier among others in achieving 100% accuracy
for large number of movie reviews.
>> In future, we try to investigate its effectiveness
considering big datasets using the unsupervised and
semi supervised machine learning techniques.
References
[1] Scikit-learn: Machine Learning in Python, Pedregosa et al., JMLR 12, pp. 2825-2830,
2011.
[2] TaiwoOladipupoAyodele . Types of Machine Learning Algorithms, New Advances in
Machine Learning, Yagang Zhang (Ed.), InTech,2010, DOI: 10.5772/9385.
[3]https://www.jasondavies.com/wordcloud/
[4]CagatayCatal, Mehmet Nangir, A Sentiment
Classification Model Based On Multiple Classifiers, Applied Soft Computing Journal
http://dx.doi.org/10.1016/j.asoc.2016.11.022.
[5] S.C. Satapathy et al. (eds.), Proceedings of the 5th International Conference on
Frontiersin Intelligent Computing: Theory and Applications, Advances in Intelligent
Systemsand Computing 516, DOI 10.1007/978-981-10-3156-4_39.
[6] Springer International Publishing Switzerland 2017L. Igual and S. Seguí, Introduction
to Data Science,Undergraduate Topics in Computer Science, DOI 10.1007/978-3-319-
50017-1_10.
[7] S.V.Solai Ananth1, Chandu PMSS, Live Twitter Knowledge as a Corpus for
Sentiment Analysis and Opinion Mining , International Journal of Engineering Science
and Computing, January 2017.
[8] Singh, J.P., et al., Predicting the “helpfulness” of online consumer reviews, Journal
of Business Research (2016), http://dx.doi.org/10.1016/j.jbusres.2016.08.008.
[9]http:/northcampus.uok.edu.in/downloads/20161105144024077.pdf
[10] Bing Liu, Xiaoli Li, Wee Sun Lee and Philip S. Yu, “Text Classification by Labeling
Words” , American Association for Artificial Intelligence. 2004.[11]Semi-Supervised
Learning—O. Chapelle, B. Schölkopf, andA. Zien, Eds. (London, U.K.: MIT Press, 2006,
pp. 508, ISBN:978-0-262-03358-9). Reviewed by Philippe Thomas.
[12] Trevor Hastie, Robert Tibshirani, Jerome Friedman, The Elements of Statistical Learning:
Data Mining, Inference, and Prediction (2nd edition) (Springer Series in Statistics), 2009.
[13] Sebastian B. Thrun, Efficient Exploration In Reinforcement Learning (1992).
[14]Stiglitz, Joseph E. "Learning to learn, localized learning and technological progress."
Economic policy and technological performance (1987): 125-153.
[15] Freitag, Dayne. "Machine learning for information extraction in informal domains."
Machine learning 39.2-3 (2000): 169-202.
[16] Bing Liu. Sentiment Analysis and Opinion Mining, Morgan & Claypool Publishers, May
2012.
[17]Timothy et al (Timothy Jason Shepard, 1998).
[18] Maas, Andrew L., et al. "Learning word vectors for sentiment analysis." Proceedings of
the 49th Annual Meeting of the Association for Computational Linguistics: Human Language
Technologies-Volume 1. Association for Computational Linguistics, 2011.
[19] BiswaRanjanSamal, Mrutyunjaya Panda, HumanBeing Character Analysis from Their
SocialNetworking Profiles A Semisupervised Machine Learning Approach, (IJCSIS)
International Journal of Computer Science and Information Security, Vol. 14, No. 5, May 2016
[20] Murphy, Kevin P. "Naive bayes classifiers." University of British Columbia (2006)..
[21]McCallum, Andrew, and Kamal Nigam. "A comparison of event models for naive bayes
text classification." AAAI-98 workshop on learning for text categorization. Vol. 752. 1998.
[22]Meena, M. Janaki, and K. R. Chandran. "Naive Bayes text classification with positive
features selected by statistical method." Advanced Computing, 2009. ICAC 2009. First.
[23]Kurt, Imran, MevlutTure, and A. TurhanKurum. "Comparing performances of logistic
regression, classification and regression tree, and neural networks for predicting coronary.
[24]Bottou, Léon. "Large-scale machine learning with stochastic gradient descent."
Proceedings of COMPSTAT'2010. Physica-Verlag HD, 2010. 177-186.
[25]Alfaro, René, et al. "Forests for the New Millennium-Making Forests Work for People and
Nature." Selected Books 1 (2005).
Thank You
1 sur 20

Recommandé

Optimized Active Learning for User’s Behavior Modelling based on Non-Intrusiv... par
Optimized Active Learning for User’s Behavior Modelling based on Non-Intrusiv...Optimized Active Learning for User’s Behavior Modelling based on Non-Intrusiv...
Optimized Active Learning for User’s Behavior Modelling based on Non-Intrusiv...IJECEIAES
9 vues8 diapositives
IRJET- Survey on Various Techniques of Attendance marking and Attention D... par
IRJET-  	  Survey on Various Techniques of Attendance marking and Attention D...IRJET-  	  Survey on Various Techniques of Attendance marking and Attention D...
IRJET- Survey on Various Techniques of Attendance marking and Attention D...IRJET Journal
10 vues7 diapositives
IRJET- Automated Student’s Attendance Management using Convolutional Neural N... par
IRJET- Automated Student’s Attendance Management using Convolutional Neural N...IRJET- Automated Student’s Attendance Management using Convolutional Neural N...
IRJET- Automated Student’s Attendance Management using Convolutional Neural N...IRJET Journal
9 vues4 diapositives
Machine learning Summer Training report par
Machine learning Summer Training reportMachine learning Summer Training report
Machine learning Summer Training reportSubhadip Mondal
29K vues49 diapositives
IRJET- Syllabus and Timetable Generation System par
IRJET- Syllabus and Timetable Generation SystemIRJET- Syllabus and Timetable Generation System
IRJET- Syllabus and Timetable Generation SystemIRJET Journal
139 vues4 diapositives
IRJET - Review on Machine Learning par
IRJET -  	  Review on Machine LearningIRJET -  	  Review on Machine Learning
IRJET - Review on Machine LearningIRJET Journal
8 vues3 diapositives

Contenu connexe

Tendances

ENHANCING THE HUMAN EMOTION RECOGNITION WITH FEATURE EXTRACTION TECHNIQUES par
ENHANCING THE HUMAN EMOTION RECOGNITION WITH FEATURE EXTRACTION TECHNIQUESENHANCING THE HUMAN EMOTION RECOGNITION WITH FEATURE EXTRACTION TECHNIQUES
ENHANCING THE HUMAN EMOTION RECOGNITION WITH FEATURE EXTRACTION TECHNIQUESIAEME Publication
70 vues10 diapositives
PRE-PROCESSING TECHNIQUES FOR FACIAL EMOTION RECOGNITION SYSTEM par
PRE-PROCESSING TECHNIQUES FOR FACIAL EMOTION RECOGNITION SYSTEMPRE-PROCESSING TECHNIQUES FOR FACIAL EMOTION RECOGNITION SYSTEM
PRE-PROCESSING TECHNIQUES FOR FACIAL EMOTION RECOGNITION SYSTEMIAEME Publication
329 vues9 diapositives
Face Recognition Based Attendance System with Auto Alert to Guardian using Ca... par
Face Recognition Based Attendance System with Auto Alert to Guardian using Ca...Face Recognition Based Attendance System with Auto Alert to Guardian using Ca...
Face Recognition Based Attendance System with Auto Alert to Guardian using Ca...ijtsrd
218 vues4 diapositives
ANALYSIS OF SYSTEM ON CHIP DESIGN USING ARTIFICIAL INTELLIGENCE par
ANALYSIS OF SYSTEM ON CHIP DESIGN USING ARTIFICIAL INTELLIGENCEANALYSIS OF SYSTEM ON CHIP DESIGN USING ARTIFICIAL INTELLIGENCE
ANALYSIS OF SYSTEM ON CHIP DESIGN USING ARTIFICIAL INTELLIGENCEijesajournal
352 vues8 diapositives
Automatic Attendance System using Deep Learning Framework par
Automatic Attendance System using Deep Learning FrameworkAutomatic Attendance System using Deep Learning Framework
Automatic Attendance System using Deep Learning FrameworkPinaki Ranjan Sarkar
629 vues34 diapositives
An optimal face recoginition tool par
An optimal face recoginition toolAn optimal face recoginition tool
An optimal face recoginition tooleSAT Publishing House
515 vues7 diapositives

Tendances(20)

ENHANCING THE HUMAN EMOTION RECOGNITION WITH FEATURE EXTRACTION TECHNIQUES par IAEME Publication
ENHANCING THE HUMAN EMOTION RECOGNITION WITH FEATURE EXTRACTION TECHNIQUESENHANCING THE HUMAN EMOTION RECOGNITION WITH FEATURE EXTRACTION TECHNIQUES
ENHANCING THE HUMAN EMOTION RECOGNITION WITH FEATURE EXTRACTION TECHNIQUES
PRE-PROCESSING TECHNIQUES FOR FACIAL EMOTION RECOGNITION SYSTEM par IAEME Publication
PRE-PROCESSING TECHNIQUES FOR FACIAL EMOTION RECOGNITION SYSTEMPRE-PROCESSING TECHNIQUES FOR FACIAL EMOTION RECOGNITION SYSTEM
PRE-PROCESSING TECHNIQUES FOR FACIAL EMOTION RECOGNITION SYSTEM
Face Recognition Based Attendance System with Auto Alert to Guardian using Ca... par ijtsrd
Face Recognition Based Attendance System with Auto Alert to Guardian using Ca...Face Recognition Based Attendance System with Auto Alert to Guardian using Ca...
Face Recognition Based Attendance System with Auto Alert to Guardian using Ca...
ijtsrd218 vues
ANALYSIS OF SYSTEM ON CHIP DESIGN USING ARTIFICIAL INTELLIGENCE par ijesajournal
ANALYSIS OF SYSTEM ON CHIP DESIGN USING ARTIFICIAL INTELLIGENCEANALYSIS OF SYSTEM ON CHIP DESIGN USING ARTIFICIAL INTELLIGENCE
ANALYSIS OF SYSTEM ON CHIP DESIGN USING ARTIFICIAL INTELLIGENCE
ijesajournal352 vues
Automatic Attendance System using Deep Learning Framework par Pinaki Ranjan Sarkar
Automatic Attendance System using Deep Learning FrameworkAutomatic Attendance System using Deep Learning Framework
Automatic Attendance System using Deep Learning Framework
IRJET- An Overview on Automated Emotion Recognition System par IRJET Journal
IRJET-  	  An Overview on Automated Emotion Recognition SystemIRJET-  	  An Overview on Automated Emotion Recognition System
IRJET- An Overview on Automated Emotion Recognition System
IRJET Journal34 vues
Computer simulations par SolamanC
Computer simulationsComputer simulations
Computer simulations
SolamanC378 vues
NLP-based personal learning assistant for school education par IJECEIAES
NLP-based personal learning assistant for school education NLP-based personal learning assistant for school education
NLP-based personal learning assistant for school education
IJECEIAES55 vues
City i-Tick: The android based mobile application for students’ attendance at... par journalBEEI
City i-Tick: The android based mobile application for students’ attendance at...City i-Tick: The android based mobile application for students’ attendance at...
City i-Tick: The android based mobile application for students’ attendance at...
journalBEEI111 vues
An interactive image segmentation using multiple user input’s par eSAT Publishing House
An interactive image segmentation using multiple user input’sAn interactive image segmentation using multiple user input’s
An interactive image segmentation using multiple user input’s
LEMON : THE LEARNING EFFICIENCY COMPUTATION MODEL FOR ASSESSING LEARNER CONTE... par IJITE
LEMON : THE LEARNING EFFICIENCY COMPUTATION MODEL FOR ASSESSING LEARNER CONTE...LEMON : THE LEARNING EFFICIENCY COMPUTATION MODEL FOR ASSESSING LEARNER CONTE...
LEMON : THE LEARNING EFFICIENCY COMPUTATION MODEL FOR ASSESSING LEARNER CONTE...
IJITE550 vues
Automatic Emotion Recognition Using Facial Expression: A Review par IRJET Journal
Automatic Emotion Recognition Using Facial Expression: A ReviewAutomatic Emotion Recognition Using Facial Expression: A Review
Automatic Emotion Recognition Using Facial Expression: A Review
IRJET Journal155 vues
Automated attendance system based on facial recognition par Dhanush Kasargod
Automated attendance system based on facial recognitionAutomated attendance system based on facial recognition
Automated attendance system based on facial recognition
Dhanush Kasargod19.3K vues
IRJET- Spot Me - A Smart Attendance System based on Face Recognition par IRJET Journal
IRJET- Spot Me - A Smart Attendance System based on Face RecognitionIRJET- Spot Me - A Smart Attendance System based on Face Recognition
IRJET- Spot Me - A Smart Attendance System based on Face Recognition
IRJET Journal70 vues
IRJET - A Review on Face Recognition using Deep Learning Algorithm par IRJET Journal
IRJET -  	  A Review on Face Recognition using Deep Learning AlgorithmIRJET -  	  A Review on Face Recognition using Deep Learning Algorithm
IRJET - A Review on Face Recognition using Deep Learning Algorithm
IRJET Journal32 vues
IRJET - A Review on: Face Recognition using Laplacianface par IRJET Journal
IRJET - A Review on: Face Recognition using LaplacianfaceIRJET - A Review on: Face Recognition using Laplacianface
IRJET - A Review on: Face Recognition using Laplacianface
IRJET Journal8 vues
F03403031040 par theijes
F03403031040F03403031040
F03403031040
theijes304 vues
IRJET- An Innovative Approach for Interviewer to Judge State of Mind of an In... par IRJET Journal
IRJET- An Innovative Approach for Interviewer to Judge State of Mind of an In...IRJET- An Innovative Approach for Interviewer to Judge State of Mind of an In...
IRJET- An Innovative Approach for Interviewer to Judge State of Mind of an In...
IRJET Journal21 vues

Similaire à Performance Analysis of Supervised Machine Learning Techniques for Sentiment Analysis

Ew36913917 par
Ew36913917Ew36913917
Ew36913917IJERA Editor
960 vues5 diapositives
A survey on Machine Learning and Artificial Neural Networks par
A survey on Machine Learning and Artificial Neural NetworksA survey on Machine Learning and Artificial Neural Networks
A survey on Machine Learning and Artificial Neural NetworksIRJET Journal
7 vues5 diapositives
Artificial Intelligence Question Bank par
Artificial Intelligence Question BankArtificial Intelligence Question Bank
Artificial Intelligence Question BankSpardhavijetha2DrKMs
92 vues11 diapositives
Machine Learning Ch 1.ppt par
Machine Learning Ch 1.pptMachine Learning Ch 1.ppt
Machine Learning Ch 1.pptARVIND SARDAR
20 vues78 diapositives
A Survey on Machine Learning Algorithms par
A Survey on Machine Learning AlgorithmsA Survey on Machine Learning Algorithms
A Survey on Machine Learning AlgorithmsAM Publications
349 vues9 diapositives
IRJET- Design an Approach for Prediction of Human Activity Recognition us... par
IRJET-  	  Design an Approach for Prediction of Human Activity Recognition us...IRJET-  	  Design an Approach for Prediction of Human Activity Recognition us...
IRJET- Design an Approach for Prediction of Human Activity Recognition us...IRJET Journal
87 vues3 diapositives

Similaire à Performance Analysis of Supervised Machine Learning Techniques for Sentiment Analysis(20)

A survey on Machine Learning and Artificial Neural Networks par IRJET Journal
A survey on Machine Learning and Artificial Neural NetworksA survey on Machine Learning and Artificial Neural Networks
A survey on Machine Learning and Artificial Neural Networks
IRJET Journal7 vues
A Survey on Machine Learning Algorithms par AM Publications
A Survey on Machine Learning AlgorithmsA Survey on Machine Learning Algorithms
A Survey on Machine Learning Algorithms
AM Publications349 vues
IRJET- Design an Approach for Prediction of Human Activity Recognition us... par IRJET Journal
IRJET-  	  Design an Approach for Prediction of Human Activity Recognition us...IRJET-  	  Design an Approach for Prediction of Human Activity Recognition us...
IRJET- Design an Approach for Prediction of Human Activity Recognition us...
IRJET Journal87 vues
Intro/Overview on Machine Learning Presentation par Ankit Gupta
Intro/Overview on Machine Learning PresentationIntro/Overview on Machine Learning Presentation
Intro/Overview on Machine Learning Presentation
Ankit Gupta10.6K vues
MACHINE LEARNING(R17A0534).pdf par FayyoOlani
MACHINE LEARNING(R17A0534).pdfMACHINE LEARNING(R17A0534).pdf
MACHINE LEARNING(R17A0534).pdf
FayyoOlani53 vues
Email Spam Detection Using Machine Learning par IRJET Journal
Email Spam Detection Using Machine LearningEmail Spam Detection Using Machine Learning
Email Spam Detection Using Machine Learning
IRJET Journal44 vues
MACHINE LEARNING AND DEEP LEARNING TECHNIQUES FOR DETECTING ABUSIVE CONTENT O... par IRJET Journal
MACHINE LEARNING AND DEEP LEARNING TECHNIQUES FOR DETECTING ABUSIVE CONTENT O...MACHINE LEARNING AND DEEP LEARNING TECHNIQUES FOR DETECTING ABUSIVE CONTENT O...
MACHINE LEARNING AND DEEP LEARNING TECHNIQUES FOR DETECTING ABUSIVE CONTENT O...
IRJET Journal2 vues
IRJET- A Study on Automated Attendance System using Facial Recognition par IRJET Journal
IRJET- A Study on Automated Attendance System using Facial RecognitionIRJET- A Study on Automated Attendance System using Facial Recognition
IRJET- A Study on Automated Attendance System using Facial Recognition
IRJET Journal62 vues
A Study on Machine Learning and Its Working par IJMTST Journal
A Study on Machine Learning and Its WorkingA Study on Machine Learning and Its Working
A Study on Machine Learning and Its Working
IJMTST Journal22 vues
IRJET- Face Recognition based Mobile Automatic Classroom Attendance Manag... par IRJET Journal
IRJET-  	  Face Recognition based Mobile Automatic Classroom Attendance Manag...IRJET-  	  Face Recognition based Mobile Automatic Classroom Attendance Manag...
IRJET- Face Recognition based Mobile Automatic Classroom Attendance Manag...
IRJET Journal37 vues
IRJET- Sentiment Analysis to Segregate Attributes using Machine Learning Tech... par IRJET Journal
IRJET- Sentiment Analysis to Segregate Attributes using Machine Learning Tech...IRJET- Sentiment Analysis to Segregate Attributes using Machine Learning Tech...
IRJET- Sentiment Analysis to Segregate Attributes using Machine Learning Tech...
IRJET Journal18 vues
IRJET - E-Assistant: An Interactive Bot for Banking Sector using NLP Process par IRJET Journal
IRJET -  	  E-Assistant: An Interactive Bot for Banking Sector using NLP ProcessIRJET -  	  E-Assistant: An Interactive Bot for Banking Sector using NLP Process
IRJET - E-Assistant: An Interactive Bot for Banking Sector using NLP Process
IRJET Journal11 vues
Sentiment Analysis on Twitter Data par IRJET Journal
Sentiment Analysis on Twitter DataSentiment Analysis on Twitter Data
Sentiment Analysis on Twitter Data
IRJET Journal56 vues
ANALYSIS OF SYSTEM ON CHIP DESIGN USING ARTIFICIAL INTELLIGENCE par ijesajournal
ANALYSIS OF SYSTEM ON CHIP DESIGN USING ARTIFICIAL INTELLIGENCEANALYSIS OF SYSTEM ON CHIP DESIGN USING ARTIFICIAL INTELLIGENCE
ANALYSIS OF SYSTEM ON CHIP DESIGN USING ARTIFICIAL INTELLIGENCE
ijesajournal37 vues
ANALYSIS OF SYSTEM ON CHIP DESIGN USING ARTIFICIAL INTELLIGENCE par ijesajournal
ANALYSIS OF SYSTEM ON CHIP DESIGN USING ARTIFICIAL INTELLIGENCEANALYSIS OF SYSTEM ON CHIP DESIGN USING ARTIFICIAL INTELLIGENCE
ANALYSIS OF SYSTEM ON CHIP DESIGN USING ARTIFICIAL INTELLIGENCE
ijesajournal35 vues

Dernier

20231129 - Platform @ localhost 2023 - Application-driven infrastructure with... par
20231129 - Platform @ localhost 2023 - Application-driven infrastructure with...20231129 - Platform @ localhost 2023 - Application-driven infrastructure with...
20231129 - Platform @ localhost 2023 - Application-driven infrastructure with...sparkfabrik
8 vues46 diapositives
Team Transformation Tactics for Holistic Testing and Quality (Japan Symposium... par
Team Transformation Tactics for Holistic Testing and Quality (Japan Symposium...Team Transformation Tactics for Holistic Testing and Quality (Japan Symposium...
Team Transformation Tactics for Holistic Testing and Quality (Japan Symposium...Lisi Hocke
35 vues124 diapositives
The Era of Large Language Models.pptx par
The Era of Large Language Models.pptxThe Era of Large Language Models.pptx
The Era of Large Language Models.pptxAbdulVahedShaik
7 vues9 diapositives
predicting-m3-devopsconMunich-2023.pptx par
predicting-m3-devopsconMunich-2023.pptxpredicting-m3-devopsconMunich-2023.pptx
predicting-m3-devopsconMunich-2023.pptxTier1 app
7 vues24 diapositives
Using Qt under LGPL-3.0 par
Using Qt under LGPL-3.0Using Qt under LGPL-3.0
Using Qt under LGPL-3.0Burkhard Stubert
13 vues11 diapositives
The Path to DevOps par
The Path to DevOpsThe Path to DevOps
The Path to DevOpsJohn Valentino
5 vues6 diapositives

Dernier(20)

20231129 - Platform @ localhost 2023 - Application-driven infrastructure with... par sparkfabrik
20231129 - Platform @ localhost 2023 - Application-driven infrastructure with...20231129 - Platform @ localhost 2023 - Application-driven infrastructure with...
20231129 - Platform @ localhost 2023 - Application-driven infrastructure with...
sparkfabrik8 vues
Team Transformation Tactics for Holistic Testing and Quality (Japan Symposium... par Lisi Hocke
Team Transformation Tactics for Holistic Testing and Quality (Japan Symposium...Team Transformation Tactics for Holistic Testing and Quality (Japan Symposium...
Team Transformation Tactics for Holistic Testing and Quality (Japan Symposium...
Lisi Hocke35 vues
predicting-m3-devopsconMunich-2023.pptx par Tier1 app
predicting-m3-devopsconMunich-2023.pptxpredicting-m3-devopsconMunich-2023.pptx
predicting-m3-devopsconMunich-2023.pptx
Tier1 app7 vues
Electronic AWB - Electronic Air Waybill par Freightoscope
Electronic AWB - Electronic Air Waybill Electronic AWB - Electronic Air Waybill
Electronic AWB - Electronic Air Waybill
360 graden fabriek par info33492
360 graden fabriek360 graden fabriek
360 graden fabriek
info33492143 vues
Top-5-production-devconMunich-2023-v2.pptx par Tier1 app
Top-5-production-devconMunich-2023-v2.pptxTop-5-production-devconMunich-2023-v2.pptx
Top-5-production-devconMunich-2023-v2.pptx
Tier1 app6 vues
Bootstrapping vs Venture Capital.pptx par Zeljko Svedic
Bootstrapping vs Venture Capital.pptxBootstrapping vs Venture Capital.pptx
Bootstrapping vs Venture Capital.pptx
Zeljko Svedic14 vues
ADDO_2022_CICID_Tom_Halpin.pdf par TomHalpin9
ADDO_2022_CICID_Tom_Halpin.pdfADDO_2022_CICID_Tom_Halpin.pdf
ADDO_2022_CICID_Tom_Halpin.pdf
TomHalpin95 vues
Gen Apps on Google Cloud PaLM2 and Codey APIs in Action par Márton Kodok
Gen Apps on Google Cloud PaLM2 and Codey APIs in ActionGen Apps on Google Cloud PaLM2 and Codey APIs in Action
Gen Apps on Google Cloud PaLM2 and Codey APIs in Action
Márton Kodok15 vues
JioEngage_Presentation.pptx par admin125455
JioEngage_Presentation.pptxJioEngage_Presentation.pptx
JioEngage_Presentation.pptx
admin1254556 vues
predicting-m3-devopsconMunich-2023-v2.pptx par Tier1 app
predicting-m3-devopsconMunich-2023-v2.pptxpredicting-m3-devopsconMunich-2023-v2.pptx
predicting-m3-devopsconMunich-2023-v2.pptx
Tier1 app9 vues

Performance Analysis of Supervised Machine Learning Techniques for Sentiment Analysis

  • 1. Performance Analysis of Supervised Machine Learning Techniques for Sentiment Analysis Presented By : BiswaRanja Samal Software Engineer @ Acesocloud IEEE 3rd International Conference on Sensing, Signal Processing & Security Authors : 1. Anil Kumar Behera 2. Mrutyunjaya Panda P.G. Department of Computer Science and Applications UtkalUniversity,Vani Vihar,Bhubaneswar-751004, India
  • 2. Contents • Introduction • Motivation • Proposed Methodology • Experimental Results • Conclusions and Future Scope • References
  • 4. >> What is Machine Learning? >> Types of Machine Learning. >>> Supervised Machine Learning >>> Unsupervised Machine Learning >>> Reinforcement Learning >>> Transduction >>> Semi-supervised Machine Learning
  • 5. >> What is Sentiment Analysis? >> Why Sentiment Analysis? Expressing the emotions and feelings with the help of words makes human beings unique[19]. These feelings are known as the sentiments and the process of analyzing these statements is known as the Sentiment Analysis. Sentiment analysis along with the machine learning techniques can result in the building of a high-performance intelligent system and can proof its expertise in the area of artificial intelligence [16].
  • 7. sometimes it becomes a very complex job for the researchers to select an appropriate machine learning technique according to their requirement which leads them to improper result with very poor accuracy and performance of the model. This motivated us towards doing an investigation on performance analysis of available machine learning techniques for sentiment analysis. We have considered only the supervised machine learning techniques and have tried to do a comparison in each criterion of this technique.
  • 9. Algorithm overflow of the proposed methodology Step1 : Start Step2 : for each dataset present in dataSetList Step2.1: Clean the data set Step2.2: Prepare training data set Step2.3: Prepare testing data set Step2.4: for each classifier present in classifierList Step2.4.1: train classifier with training data set Step2.4.2: test classifier with testing data set Step2.4.3: Obtain the accuracy percent from result Step3: Finish
  • 10. Methodology in Details >> Collecting Movie Review Data Sets We have collected various size of movie review data sets such as 10600, 25000, 35600, 50000 and 85600. >> Cleaning the Data Sets Movie review data set consists of characters, numbers, special characters and unrecognized characters. Which may create hazard for our classifier, that’s why after collecting the data sets we have undertaken the data set cleaning procedure >> Data Categorization Data Set Size Positive Feedbacks Negative Feedbacks 10,600 5,300 5,300 25,000 12,500 12,500 35,600 17,800 17,800 50,000 25,000 25,000 85,600 42,800 42,8001 Data sets showing numbers of positive and negative feedback
  • 11. >> Preparing Training and Testing Data Sets It’s a common convenience to use 70% of the data set for the training purpose and use rest 30% of the data for testing the model we have also followed it. Data Set Size Training Data Set Size Testing Data Set Size 10,600 7420 3180 25,000 17500 7500 35,600 24920 10680 50,000 35000 15000 85,600 59918 25680 Representing number of training and testing reviews
  • 12. >> Training the Model with Training Data Sets >> Testing the Model with Testing Data Sets
  • 14. Representing used classifiers performance with highlighting the classifier which has performed very accurately .
  • 15. Representing the graphical representation of all classifiers performance .
  • 17. >> In this paper, a simple yet novel approach on sentiment analysis of movie reviews is performed using seven promising supervised machine learning algorithms. >> The results obtained concludes linear SVC/SVM as the best classifier among others in achieving 100% accuracy for large number of movie reviews. >> In future, we try to investigate its effectiveness considering big datasets using the unsupervised and semi supervised machine learning techniques.
  • 18. References [1] Scikit-learn: Machine Learning in Python, Pedregosa et al., JMLR 12, pp. 2825-2830, 2011. [2] TaiwoOladipupoAyodele . Types of Machine Learning Algorithms, New Advances in Machine Learning, Yagang Zhang (Ed.), InTech,2010, DOI: 10.5772/9385. [3]https://www.jasondavies.com/wordcloud/ [4]CagatayCatal, Mehmet Nangir, A Sentiment Classification Model Based On Multiple Classifiers, Applied Soft Computing Journal http://dx.doi.org/10.1016/j.asoc.2016.11.022. [5] S.C. Satapathy et al. (eds.), Proceedings of the 5th International Conference on Frontiersin Intelligent Computing: Theory and Applications, Advances in Intelligent Systemsand Computing 516, DOI 10.1007/978-981-10-3156-4_39. [6] Springer International Publishing Switzerland 2017L. Igual and S. Seguí, Introduction to Data Science,Undergraduate Topics in Computer Science, DOI 10.1007/978-3-319- 50017-1_10. [7] S.V.Solai Ananth1, Chandu PMSS, Live Twitter Knowledge as a Corpus for Sentiment Analysis and Opinion Mining , International Journal of Engineering Science and Computing, January 2017. [8] Singh, J.P., et al., Predicting the “helpfulness” of online consumer reviews, Journal of Business Research (2016), http://dx.doi.org/10.1016/j.jbusres.2016.08.008. [9]http:/northcampus.uok.edu.in/downloads/20161105144024077.pdf [10] Bing Liu, Xiaoli Li, Wee Sun Lee and Philip S. Yu, “Text Classification by Labeling Words” , American Association for Artificial Intelligence. 2004.[11]Semi-Supervised Learning—O. Chapelle, B. Schölkopf, andA. Zien, Eds. (London, U.K.: MIT Press, 2006, pp. 508, ISBN:978-0-262-03358-9). Reviewed by Philippe Thomas.
  • 19. [12] Trevor Hastie, Robert Tibshirani, Jerome Friedman, The Elements of Statistical Learning: Data Mining, Inference, and Prediction (2nd edition) (Springer Series in Statistics), 2009. [13] Sebastian B. Thrun, Efficient Exploration In Reinforcement Learning (1992). [14]Stiglitz, Joseph E. "Learning to learn, localized learning and technological progress." Economic policy and technological performance (1987): 125-153. [15] Freitag, Dayne. "Machine learning for information extraction in informal domains." Machine learning 39.2-3 (2000): 169-202. [16] Bing Liu. Sentiment Analysis and Opinion Mining, Morgan & Claypool Publishers, May 2012. [17]Timothy et al (Timothy Jason Shepard, 1998). [18] Maas, Andrew L., et al. "Learning word vectors for sentiment analysis." Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies-Volume 1. Association for Computational Linguistics, 2011. [19] BiswaRanjanSamal, Mrutyunjaya Panda, HumanBeing Character Analysis from Their SocialNetworking Profiles A Semisupervised Machine Learning Approach, (IJCSIS) International Journal of Computer Science and Information Security, Vol. 14, No. 5, May 2016 [20] Murphy, Kevin P. "Naive bayes classifiers." University of British Columbia (2006).. [21]McCallum, Andrew, and Kamal Nigam. "A comparison of event models for naive bayes text classification." AAAI-98 workshop on learning for text categorization. Vol. 752. 1998. [22]Meena, M. Janaki, and K. R. Chandran. "Naive Bayes text classification with positive features selected by statistical method." Advanced Computing, 2009. ICAC 2009. First. [23]Kurt, Imran, MevlutTure, and A. TurhanKurum. "Comparing performances of logistic regression, classification and regression tree, and neural networks for predicting coronary. [24]Bottou, Léon. "Large-scale machine learning with stochastic gradient descent." Proceedings of COMPSTAT'2010. Physica-Verlag HD, 2010. 177-186. [25]Alfaro, René, et al. "Forests for the New Millennium-Making Forests Work for People and Nature." Selected Books 1 (2005).