Welcome!
Why NLP? 
- We have to adapt to how the computer wants its data,
- and we still adapt to the way the computer gives back information.
- NLP helps computers understand one of the most powerful human interfaces: language.
- Apple Siri and Google Now are cutting-edge examples of how NLP helps computers adapt to humans.
- More details: http://www.slideshare.net/yourfrienddhruv/apps-with-ears-and-eyes
Google Now vs. Siri vs. Cortana 
https://www.stonetemple.com/great-knowledge-box-showdown/
Cutting edge NLP! 
http://www.ibm.com/smarterplanet/us/en/ibmwatson/developercloud/ 
https://news.ycombinator.com/item?id=8428007 
https://news.ycombinator.com/item?id=8426148
Cutting edge NLP! 
https://news.ycombinator.com/item?id=8428418 
AI Websites That Design Themselves
thegrid.io
NLP in today's session 
In this session we will focus more on how we 
can deal with written language in software 
products.
NLP for text analysis 
- Knowledge is a fundamental requirement for any problem solving.
- An intelligent decision-making system needs three major things:
  - A) Lots of relevant knowledge
  - B) A way to represent that knowledge so that it fits the current problem/question at hand
  - C) A way to represent the answer in human language.
General Architecture of NLP systems 
- Basic systems
  - Tokenization -> [lemmatization] -> tagging -> chunking -> domain mapping
  - NLP systems require pre-created domain-specific corpora (dictionary + rule set handcrafted by humans)
  - Details: http://www.nltk.org/book/ch05.html
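The basic pipeline above can be sketched in plain Python. This is a toy illustration, not how NLTK implements these steps: the LEMMAS, TAGS and DOMAIN dictionaries are hypothetical, hand-made stand-ins for the domain-specific corpus, and the chunker only groups determiner/adjective/noun runs.

```python
import re

# Hypothetical hand-made resources standing in for a domain-specific corpus.
LEMMAS = {"flights": "flight", "leaving": "leave", "cities": "city"}
TAGS = {"show": "VERB", "me": "PRON", "cheap": "ADJ", "flight": "NOUN",
        "from": "PREP", "to": "PREP", "the": "DET"}
DOMAIN = {"flight": "PRODUCT", "cheap": "PRICE_FILTER"}


def tokenize(text):
    """Split text into lowercase word tokens."""
    return re.findall(r"[a-z]+", text.lower())


def lemmatize(tokens):
    """Replace each token by its dictionary form when one is known."""
    return [LEMMAS.get(t, t) for t in tokens]


def tag(tokens):
    """Attach a part-of-speech tag; unknown words fall back to NOUN."""
    return [(t, TAGS.get(t, "NOUN")) for t in tokens]


def chunk(tagged):
    """Group consecutive DET/ADJ/NOUN tokens into noun-phrase chunks."""
    chunks, current = [], []
    for word, pos in tagged:
        if pos in {"DET", "ADJ", "NOUN"}:
            current.append(word)
        elif current:
            chunks.append(current)
            current = []
    if current:
        chunks.append(current)
    return chunks


def map_domain(chunks):
    """Label each chunk with the domain concepts its words map to."""
    result = []
    for words in chunks:
        labels = sorted({DOMAIN[w] for w in words if w in DOMAIN})
        result.append((" ".join(words), labels))
    return result


def analyze(text):
    """Run the whole pipeline: tokenize -> lemmatize -> tag -> chunk -> map."""
    return map_domain(chunk(tag(lemmatize(tokenize(text)))))
```

Calling analyze("Show me cheap flights from Boston to Denver") yields three chunks, of which only "cheap flight" carries domain labels. The handcrafted dictionaries are exactly the part that costs human effort in a real system.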
General Architecture of NLP systems 
- Advanced systems
http://nlp.stanford.edu/software/patternslearning.shtml
Relationship to Machine Learning 
- NLP
  - Algorithms and tooling are targeted at converting text/data to values
- ML
  - Algorithms and tooling are targeted at consuming values and producing meaningful values/vectors
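A minimal sketch of that split, assuming a bag-of-words representation and a 1-nearest-neighbour classifier (both chosen here for illustration only): vectorize is the "NLP" half that turns text into values, and nearest_label is the "ML" half that consumes those values. The function names and the example vocabulary are hypothetical.

```python
import math
import re
from collections import Counter


def vectorize(text, vocabulary):
    """NLP side: turn text into a vector of word counts over a fixed vocabulary."""
    counts = Counter(re.findall(r"[a-z]+", text.lower()))
    return [counts[word] for word in vocabulary]


def cosine(a, b):
    """Cosine similarity between two vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def nearest_label(text, examples, vocabulary):
    """ML side: return the label of the most similar (text, label) example."""
    vector = vectorize(text, vocabulary)
    best = max(examples, key=lambda ex: cosine(vector, vectorize(ex[0], vocabulary)))
    return best[1]
```

With a vocabulary such as ["refund", "money", "login", "password"] and one labeled example per class, nearest_label routes a new support message to the closest class. Real systems would use a toolkit such as scikit-learn for both halves.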
A few popular NLP toolkits
- Python
  - http://www.nltk.org
  - http://scikit-learn.org/
  - https://textblob.readthedocs.org
- Java
  - http://nlp.stanford.edu/software/index.shtml
  - https://gate.ac.uk/overview.html
  - https://opennlp.apache.org/
- R
  - http://cran.r-project.org/web/views/NaturalLanguageProcessing.html
Interesting applications 
- Covered in this session
  1) Information summarization
  2) Information extraction
  3) Sentiment analysis
  4) Dialog-based systems
1) Information summarization 
- Creates a summary of a large text.
  - http://summly.com/
- You can create a highly personalized summary of the same content per user.
  - http://automatedinsights.com/wordsmith/
- The race is on between 'plagiarism detection' and 'automatic paraphrasing'.
  - http://copyscape.com/
  - https://oaps.eu/project/overview/
  - http://plagcontrol.com
- Handy code:
  - Python and related: https://github.com/miso-belica/sumy
  - Java/Scala: https://github.com/MojoJolo/textteaser
- Basics:
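One basic approach, assumed here for illustration rather than taken from the slides or from the libraries linked above, is extractive summarization: score each sentence by how frequent its words are in the whole text and keep the top-scoring sentences. The stop-word list and function names below are hypothetical.

```python
import re
from collections import Counter

# Hypothetical, tiny stop-word list; real lists are much longer.
STOP_WORDS = {"the", "a", "an", "is", "are", "of", "to", "and", "in", "it", "that", "this"}


def split_sentences(text):
    """Naive sentence splitter: break after '.', '!' or '?' followed by whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def content_words(text):
    """Lowercase words with stop words removed."""
    return [w for w in re.findall(r"[a-z]+", text.lower()) if w not in STOP_WORDS]


def summarize(text, max_sentences=1):
    """Return the highest-scoring sentences, kept in their original order."""
    sentences = split_sentences(text)
    freq = Counter(content_words(text))

    def score(sentence):
        words = content_words(sentence)
        # Average word frequency, so long sentences are not favoured automatically.
        return sum(freq[w] for w in words) / len(words) if words else 0.0

    top = sorted(sentences, key=score, reverse=True)[:max_sentences]
    return [s for s in sentences if s in top]
```

This only selects existing sentences; it never rewrites them, which is what separates extractive summaries from the generated text of tools like Wordsmith.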
2) Information extraction 
- Named Entity Recognition (NER)
  - Common entity types include ORGANIZATION, PERSON, LOCATION, DATE, TIME, MONEY, and GPE (geo-political entity).
- Relationship extraction
  - Mainly between named entities
  - http://www.cruxbot.com/
- Handy code:
  - http://www.nltk.org/book/ch07.html
- Basics:
  - Find interesting pairs of words (entities), and note the adjoining words to learn the relationship between them.
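The "pair of words plus adjoining words" idea can be sketched with a regular expression. This is a deliberately naive stand-in for real NER: it assumes any run of capitalized words is an entity and treats up to four lowercase words between two entities as the relationship. The names ENTITY, PAIR and extract_relations are hypothetical.

```python
import re

# Naive stand-in for an NER step: a run of capitalized words counts as an entity.
ENTITY = r"[A-Z][a-z]+(?:\s[A-Z][a-z]+)*"
# Two entities joined by one to four lowercase words.
PAIR = re.compile(rf"({ENTITY})\s+((?:[a-z]+\s+){{1,4}})({ENTITY})")


def extract_relations(text):
    """Return (entity, connecting words, entity) triples found in the text."""
    return [(left, relation.strip(), right)
            for left, relation, right in PAIR.findall(text)]
```

For "Steve Jobs was the founder of Apple." this returns the triple ("Steve Jobs", "was the founder of", "Apple"). Capitalized sentence-initial words would be mistaken for entities, which is one reason real systems use a trained tagger and chunker as in the NLTK chapter above.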
2.1) Information Retrieval 
- Large text needs to be searched by keyword.
- Traditional RDBMS indexing doesn't work well for this.
- Use full-text search toolkits, which are good practical examples of NLP implementation.
- Handy code:
  - Solr: Java
  - PostgreSQL: DB
  - http://blog.lostpropertyhq.com/postgres-full-text-search-is-good-enough/
- Basics:
  - While storing large text, remove words that add no value (stop words, e.g. "the", "of") and index only the roots of words.
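A minimal sketch of that indexing idea, assuming a tiny stop-word list and a crude suffix-stripping stemmer (both hypothetical, and far simpler than what Solr or PostgreSQL actually do): each document is reduced to word roots, and an inverted index maps every root to the documents containing it.

```python
import re
from collections import defaultdict

# Hypothetical, tiny stop-word list.
STOP_WORDS = {"the", "a", "an", "of", "to", "and", "in", "is", "are", "for"}


def stem(word):
    """Crude suffix stripping; real engines use e.g. a Porter or Snowball stemmer."""
    for suffix in ("ing", "ed", "es", "s"):
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def terms(text):
    """Lowercase, drop stop words, and reduce each remaining word to its root."""
    return [stem(w) for w in re.findall(r"[a-z]+", text.lower()) if w not in STOP_WORDS]


def build_index(docs):
    """Build an inverted index: root -> set of ids of documents containing it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for term in terms(text):
            index[term].add(doc_id)
    return index


def search(index, query):
    """Return the ids of documents that contain every term of the query."""
    sets = [index.get(term, set()) for term in terms(query)]
    return set.intersection(*sets) if sets else set()
```

Because both documents and queries pass through the same terms function, a query for "indexed documents" matches a document that only says "indexing large documents".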
3) Sentiment Analysis 
- To understand the overall meaning/tone of a text.
  - e.g. neutral vs. polar, positive vs. negative.
- Demo
  - http://text-processing.com/demo/sentiment/
  - http://nlp.stanford.edu:8080/sentiment/rntnDemo.html
- Use:
  - Finding whether a Twitter trend is positive or negative
  - Finding whether the overall review of a product is positive or negative
- Basics:
  - Pick the most interesting phrases and correlate their meaning.
  - Correlate/group things with similar meaning.
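A minimal lexicon-based sketch of the idea, not the method behind either demo linked above: the "interesting" words are those found in a small hand-scored lexicon, and their scores are summed. The LEXICON and NEGATIONS contents are hypothetical.

```python
import re

# Hypothetical miniature lexicon; real lexicons hold thousands of scored entries.
LEXICON = {"good": 1, "great": 2, "love": 2, "bad": -1, "terrible": -2, "hate": -2}
NEGATIONS = {"not", "never", "no"}


def sentiment(text):
    """Return (label, score), where label is 'positive', 'negative' or 'neutral'."""
    score, negate = 0, False
    for word in re.findall(r"[a-z']+", text.lower()):
        if word in NEGATIONS:
            negate = True  # flips the sign of the next scored word
            continue
        value = LEXICON.get(word, 0)
        if value:
            score += -value if negate else value
            negate = False
    label = "positive" if score > 0 else "negative" if score < 0 else "neutral"
    return label, score
```

A score of zero maps to "neutral", which gives the neutral vs. polar split; the sign of a non-zero score gives positive vs. negative.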
4) Dialog based systems 
- Understand input given in natural language.
  - Google search, Siri, Google Now
- Building interactive chatbots to handle customer support.
- Details: http://www.nltk.org/book/ch10.html
- Handy code:
  - We can convert a question to an SQL query!
- Basics:
  - Have English grammar mapped to another grammar for input parsing, and vice versa.
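A minimal sketch of turning a question into an SQL query. It swaps the grammar-based mapping described above for a few regular-expression patterns, so it only recognizes the exact question shapes listed; the table city_table and its columns are a hypothetical schema, and the function names are made up for this example.

```python
import re
import sqlite3

# Hypothetical schema: city_table(City, Country, Population).
PATTERNS = [
    (re.compile(r"^what cities are (?:located )?in (\w+)\??$", re.I),
     "SELECT City FROM city_table WHERE Country = ?"),
    (re.compile(r"^what is the population of (\w+)\??$", re.I),
     "SELECT Population FROM city_table WHERE City = ?"),
]


def question_to_sql(question):
    """Return (sql, params) for a recognized question, or None otherwise."""
    for pattern, sql in PATTERNS:
        match = pattern.match(question.strip())
        if match:
            return sql, (match.group(1).lower(),)
    return None


def answer(conn, question):
    """Run the generated query on a sqlite3 connection; None if not understood."""
    parsed = question_to_sql(question)
    if parsed is None:
        return None
    sql, params = parsed
    return [row[0] for row in conn.execute(sql, params)]
```

The user's words are passed as query parameters rather than pasted into the SQL string, so a question cannot inject SQL. A grammar-based parser, as in the NLTK chapter, generalizes to far more phrasings than fixed patterns can.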
Development & Testing/Verifying of NLP systems 
1) Understand gold set, training set, test set
2) Seen vs. unseen data
3) Accuracy: precision & recall
4) Confusion matrices
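Precision, recall and the confusion matrix can be computed directly from a gold set and the system's predictions. The sketch below uses the standard definitions (precision = TP / (TP + FP), recall = TP / (TP + FN)); the function names are hypothetical.

```python
from collections import Counter


def confusion_matrix(gold, predicted):
    """Count how often each (gold label, predicted label) pair occurs."""
    return Counter(zip(gold, predicted))


def precision_recall(gold, predicted, label):
    """Precision and recall for one label, computed from the confusion matrix."""
    matrix = confusion_matrix(gold, predicted)
    tp = matrix[(label, label)]
    fp = sum(n for (g, p), n in matrix.items() if p == label and g != label)
    fn = sum(n for (g, p), n in matrix.items() if g == label and p != label)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall
```

The gold labels should come from the held-out test set (unseen data); measuring on the training set only shows how well the system memorized what it has already seen.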
Session Summary 
1) NLP + ML capabilities are the foundation for intelligent systems working with/on consumer data.
2) Domain knowledge is the key differentiator and the MAJOR cost factor.
3) NLP system development requires a different mindset, as it is not creation but evolution of a software system.
4) Lots and lots of academic/research reading is a must.
What Next? Q&A? Are you sure? 
- I have an idea which might require NLP
  - Go reach out to more people:
  - @nikunjness, @yourfrienddhruv
- I want to know how to develop such systems
- I think I want to research more possibilities!
  - Read this: http://www.nltk.org/book/ch01.html
  - Yes, it's Python.
- I think it's too complex.
  - You are not alone.

Natural Language Processing - Basics / Non Technical
