SlideShare une entreprise Scribd logo
1  sur  26
Natural language
processing
(NLP)
Presented By :
Mohamed El-Serngawy
Agenda
 Definition & Introduction
 Steps in NLP
 Statistical NLP
 Real World Application
 Demos with free NLP Application
Definition & Introduction
 Natural language processing (NLP) is a field
of computer science and linguistics concerned
with the interactions between computers and
human (natural) languages
 Why Natural Language Processing ?
Huge amounts of data
◦ Internet = at least 20 billions pages
◦ Intranet
Applications for processing large
amounts of texts
require NLP expertise
Definition & Introduction
 We look at how we can exploit knowledge
about the world, in combination with linguistic
facts, to build computational natural language
systems.
 Natural language generation systems convert
information from computer databases into
readable human language, Natural language
understanding systems convert samples of
human language into more formal
representations such as parse trees or first-
order logic structures that are easier for
computer programs to manipulate
Steps in NLP
 Phonetics, Phonology: how Word are prononce
in termes of sequences of sounds
 Morphological Analysis: Individual words are
analyzed into their components and non word
tokens such as punctuation are separated from
the words.
 Syntactic Analysis: Linear sequences of words
are transformed into structures that show how the
words relate to each other.
 Semantic Analysis: The structures created by the
syntactic analyzer are assigned meanings.
 Discourse integration: The meaning of an
individual sentence may depend on the sentences
that precede it and may influence the meanings of
the sentences that follow it.
 Pragmatic Analysis: The structure representing
what was said is reinterpreted to determine what
was actually meant.
Phonetics
Study of the physical sounds of human speech
◦ /i:/, /ɜ:/, /ɔ:/, /ɑ:/ and /u:/
◦ 'there' => /ðeə/
◦ 'there on the table' => /ðeər ɒn ðə teɪbl /
•Transcription of sounds (IPA)
Phonetic
 Articulory phonetics : production
• Auditory phonetics : speech perception
– McGurk effect
• Acoustics phonetics: properties of sound
waves (frequency and harmonics)
Morphological Analysis
 Suppose we have an english interface to an
operating system and the following sentence is
typed:
◦ I want to print Bill’s .init file.
 Morphological analysis must do the following
things:
◦ Pull apart the word “Bill’s” into proper noun “Bill” and the
possessive suffix “’s”
◦ Recognize the sequence “.init” as a file extension that is
functioning as an adjective in the sentence.
 This process will usually assign syntactic
categories to all the words in the sentence.
 Consider the word “prints”. This word is either a
plural noun or a third person singular verb ( he
prints ).
Syntactic Analysis
 Syntactic analysis must exploit the results of morphological
analysis to build a structural description of the sentence.
 The goal of this process, called parsing, is to convert the
flat list of words that forms the sentence into a structure
that defines the units that are represented by that flat list.
 The important thing here is that a flat sentence has been
converted into a hierarchical structure and that the
structure correspond to meaning units when semantic
analysis is performed.
 Reference markers are shown in the parenthesis in the
parse tree
 Each one corresponds to some entity that has been
mentioned in the sentence.
Syntactic Analysis
 Syntactic Processing :
Almost all the systems that are
actually used have two main
components:
◦ A declarative representation, called a
grammar, of the syntactic facts about the
language.
◦ A procedure, called parser, that compares
the grammar against input sentences to
produce parsed structures.
Syntactic Analysis
 Grammars and Parsers :
The most common way to represent grammars is as a set of production rules.
A simple Context-free phrase structure grammar fro English:
 S → NP VP
 NP → the NP1
 NP → PRO
 NP → PN
 NP → NP1
 NP1 → ADJS N
 ADJS → ε | ADJ ADJS
 VP → V
 VP → V NP
 N → file | printer
 PN → Bill
 PRO → I
 ADJ → short | long | fast
 V → printed | created | want
First rule can be read as “ A sentence is composed of a noun phrase followed by Verb
Phrase”; Vertical bar is OR ; ε represnts empty string.
Symbols that are further expanded by rules are called nonterminal symbols.
Symbols that correspond directly to strings that must be found in an input sentence are called
terminal symbols.
Syntactic Analysis
S
NP
PN
Bill
VP
V
printed
NP
the
NP1
ADJS
E
N
file
Bill Printed the file
A Parse tree for a sentence :
Syntactic Analysis
 A parse tree :
John ate the apple.
1. S -> NP VP
2. VP -> V NP
3. NP -> NAME
4. NP -> ART N
5. NAME -> John
6. V -> ate
7. ART-> the
8. N -> apple
S
NP VP
NAME
John
V
ate
NP
ART N
the apple
Semantic Analysis
 Semantic analysis must do two
important things:
◦ It must map individual words into
appropriate objects in the knowledge
base or database
◦ It must create the correct structures to
correspond to the way the meanings of
the individual words combine with each
other.
Semantic Analysis
 Lexical processing :
 The first step in any semantic processing system is to look up
the individual words in a dictionary ( or lexicon) and extract
their meanings.
 Many words have several meanings, and it may not be
possible to choose the correct one just by looking at the word
itself.
 The process of determining the correct meaning of an
individual word is called word sense disambiguation or lexical
disambiguation.
 It is done by associating, with each word in lexicon,
information about the contexts in which each of the word’s
senses may appear.
 Sometimes only very straightforward info about each word
sense is necessary. For example, baseball field interpretation
of diamond could be marked as a LOCATION.
 Some useful semantic markers are :
◦ PHYSICAL-OBJECT
◦ ANIMATE-OBJECT
◦ ABSTRACT-OBJECT
Semantic Analysis
 Word Net (common sense
KnowledgBase) :
A database of lexical relations.
Inspired by current psycholinguistic theories of
human lexical memory.
Synset: a set of synonyms, representing one
underlying lexical concept
◦ Example:
 fool {chump, fish, fool, gull, mark, patsy, fall
guy, sucker, schlemiel, shlemiel, soft touch,
mug}
Relations link the synsets: hypernym, Has-
Member, Member-Of, Antonym, etc.
16
Semantic Analysis
 Example
pu-erh.cs.utexas.edu$ wn bike -partn
Part Meronyms of noun bike
2 senses of bike
Sense 1
motorcycle, bike
HAS PART: mudguard, splashguard
Sense 2
bicycle, bike, wheel
HAS PART: bicycle seat, saddle
HAS PART: bicycle wheel
HAS PART: chain
HAS PART: coaster brake
HAS PART: handlebar
HAS PART: mudguard, splashguard
HAS PART: pedal, treadle, foot lever
HAS PART: sprocket, sprocket wheel
17
• Example
Pu-erh.cs.utexas.edu$wn bike
Information available for noun bike
-hypen Hypernyms
-hypon, -treen Hyponyms & Hyponym Tree
-synsn Synonyms (ordered by frequency)
-partn Has Part Meronyms
-meron All Meronyms
-famln Familiarity & Polysemy Count
-coorn Coordinate Sisters
-simsn Synonyms (grouped by similarity of
meaning)
-hmern Hierarchical Meronyms
-grepn List of Compound Words
-over Overview of Senses
Information available for verb bike
-hypev Hypernyms
-hypov, -treev Hyponyms & Hyponym Tree
-synsv Synonyms (ordered by frequency)
-famlv Familiarity & Polysemy Count
-framv Verb Frames
-simsv Synonyms (grouped by similarity of
meaning)
-grepv List of Compound Words
-over Overview of Senses
Discourse Integration
 Specifically we do not know whom the
pronoun “I” or the proper noun “Bill” refers
to.
 To pin down these references requires an
appeal to a model of the current discourse
context, from which we can learn that the
current user is USER068 and that the only
person named “Bill” about whom we could
be talking is USER073.
 Once the correct referent for Bill is known,
we can also determine exactly which file is
being referred to.
Pragmatic Analysis
 The final step toward effective understanding is to decide
what to do as a results.
 One possible thing to do is to record what was said as a fact
and be done with it.
 For some sentences, whose intended effect is clearly
declarative, that is precisely correct thing to do.
 But for other sentences, including this one, the intended effect
is different.
 We can discover this intended effect by applying a set of rules
that characterize cooperative dialogues.
 The final step in pragmatic processing is to translate, from the
knowledge based representation to a command to be
executed by the system.
 The results of the understanding process is
Pragmatic Analysis
 Knowledge about the kind of actions
that speakers intend by their use of
sentences
◦ REQUEST: HAL, open the pod bay door.
◦ STATEMENT: HAL, the pod bay door is
open.
◦ INFORMATION QUESTION: HAL, is the
pod bay door open?
 Speech act analysis (politeness, irony,
greeting, apologizing...)
Statistical NLP
 Statistical NLP aims to perform
statistical inference for the field of NLP
 Statistical inference consists of taking
some data generated in accordance
with some unknown probability
distribution and making inferences.
Motivations for Statistical NLP
 Cognitive modeling of the human language
processing has not reached a stage where
we can have a complete mapping between
the language signal and the information
contents.
 Complete mapping is not always required.
 Statistical approach provides the flexibility
required for making the modeling of a
language more accurate.
Real World Application
 Automatic summarization
 Foreign language reading aid
 Foreign language writing aid
 Information extraction
 Information retrieval (IR) - IR is concerned with storing, searching
and retrieving information. It is a separate field within computer
science (closer to databases), but IR relies on some NLP methods
(for example, stemming). Some current research and applications
seek to bridge the gap between IR and NLP.
 Machine translation - Automatically translating from one human
language to another.
 Named entity recognition (NER) - Given a stream of text,
determining which items in the text map to proper names, such as
people or places. Although in English, named entities are marked
with capitalized words, many other languages do not use
capitalization to distinguish named entities.
 Natural language generation
 Natural language search
Real World Application
 Natural language understanding
 Optical character recognition
 anaphora resolution
 Query expansion
 Question answering - Given a human language question, the
task of producing a human-language answer. The question
may be a closed-ended (such as "What is the capital of
Canada?") or open-ended (such as "What is the meaning of
life?").
 Speech recognition - Given a sound clip of a person or
people speaking, the task of producing a text dictation of the
speaker(s). (The opposite of text to speech.)
 Spoken dialogue system
 Stemming
 Text simplification
 Text-to-speech
 Text-proofing
Demos with free NLP Application
DEMO
THANKS

Contenu connexe

Tendances

Natural Language Processing in AI
Natural Language Processing in AINatural Language Processing in AI
Natural Language Processing in AISaurav Shrestha
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language ProcessingIla Group
 
Natural language processing (nlp)
Natural language processing (nlp)Natural language processing (nlp)
Natural language processing (nlp)Kuppusamy P
 
Natural language processing
Natural language processingNatural language processing
Natural language processingYogendra Tamang
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language ProcessingYasir Khan
 
Natural language processing (NLP)
Natural language processing (NLP) Natural language processing (NLP)
Natural language processing (NLP) ASWINKP11
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language ProcessingRishikese MR
 
Natural Language Processing
Natural Language Processing Natural Language Processing
Natural Language Processing Adarsh Saxena
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language ProcessingMariana Soffer
 
Natural language processing
Natural language processingNatural language processing
Natural language processingBasha Chand
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language ProcessingToine Bogers
 
natural language processing help at myassignmenthelp.net
natural language processing  help at myassignmenthelp.netnatural language processing  help at myassignmenthelp.net
natural language processing help at myassignmenthelp.netwww.myassignmenthelp.net
 
Introduction to natural language processing
Introduction to natural language processingIntroduction to natural language processing
Introduction to natural language processingMinh Pham
 
Natural language processing
Natural language processingNatural language processing
Natural language processingHansi Thenuwara
 
Introduction to Natural Language Processing
Introduction to Natural Language ProcessingIntroduction to Natural Language Processing
Introduction to Natural Language Processingrohitnayak
 

Tendances (20)

Natural Language Processing in AI
Natural Language Processing in AINatural Language Processing in AI
Natural Language Processing in AI
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
 
Natural language processing (nlp)
Natural language processing (nlp)Natural language processing (nlp)
Natural language processing (nlp)
 
Natural language processing
Natural language processingNatural language processing
Natural language processing
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
 
Natural language processing (NLP)
Natural language processing (NLP) Natural language processing (NLP)
Natural language processing (NLP)
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
 
Natural Language Processing
Natural Language Processing Natural Language Processing
Natural Language Processing
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
 
NLP
NLPNLP
NLP
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
 
NLP.pptx
NLP.pptxNLP.pptx
NLP.pptx
 
Intro to nlp
Intro to nlpIntro to nlp
Intro to nlp
 
Natural language processing
Natural language processingNatural language processing
Natural language processing
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
 
natural language processing help at myassignmenthelp.net
natural language processing  help at myassignmenthelp.netnatural language processing  help at myassignmenthelp.net
natural language processing help at myassignmenthelp.net
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
 
Introduction to natural language processing
Introduction to natural language processingIntroduction to natural language processing
Introduction to natural language processing
 
Natural language processing
Natural language processingNatural language processing
Natural language processing
 
Introduction to Natural Language Processing
Introduction to Natural Language ProcessingIntroduction to Natural Language Processing
Introduction to Natural Language Processing
 

En vedette

Літературна галявина від Галини Малик
Літературна галявина  від Галини МаликЛітературна галявина  від Галини Малик
Літературна галявина від Галини МаликНБУ для дітей
 
Practical Natural Language Processing
Practical Natural Language ProcessingPractical Natural Language Processing
Practical Natural Language ProcessingJaganadh Gopinadhan
 
Natural Language Processing with Python
Natural Language Processing with PythonNatural Language Processing with Python
Natural Language Processing with PythonBenjamin Bengfort
 
NLTK - Natural Language Processing in Python
NLTK - Natural Language Processing in PythonNLTK - Natural Language Processing in Python
NLTK - Natural Language Processing in Pythonshanbady
 
Generations of Programming Languages
Generations of Programming LanguagesGenerations of Programming Languages
Generations of Programming Languagesjocleph
 
Introduction and architecture of expert system
Introduction  and architecture of expert systemIntroduction  and architecture of expert system
Introduction and architecture of expert systempremdeshmane
 

En vedette (8)

Літературна галявина від Галини Малик
Літературна галявина  від Галини МаликЛітературна галявина  від Галини Малик
Літературна галявина від Галини Малик
 
Practical Natural Language Processing
Practical Natural Language ProcessingPractical Natural Language Processing
Practical Natural Language Processing
 
Natural Language Processing with Python
Natural Language Processing with PythonNatural Language Processing with Python
Natural Language Processing with Python
 
NLTK - Natural Language Processing in Python
NLTK - Natural Language Processing in PythonNLTK - Natural Language Processing in Python
NLTK - Natural Language Processing in Python
 
NLTK in 20 minutes
NLTK in 20 minutesNLTK in 20 minutes
NLTK in 20 minutes
 
Generations of Programming Languages
Generations of Programming LanguagesGenerations of Programming Languages
Generations of Programming Languages
 
Introduction and architecture of expert system
Introduction  and architecture of expert systemIntroduction  and architecture of expert system
Introduction and architecture of expert system
 
AI: Learning in AI
AI: Learning in AI AI: Learning in AI
AI: Learning in AI
 

Similaire à NLP

Natural Language Processing Course in AI
Natural Language Processing Course in AINatural Language Processing Course in AI
Natural Language Processing Course in AISATHYANARAYANAKB
 
Artificial Intelligence Notes Unit 4
Artificial Intelligence Notes Unit 4Artificial Intelligence Notes Unit 4
Artificial Intelligence Notes Unit 4DigiGurukul
 
Natural Language Processing (NLP)
Natural Language Processing (NLP)Natural Language Processing (NLP)
Natural Language Processing (NLP)Abdullah al Mamun
 
Natural Language Processing_in semantic web.pptx
Natural Language Processing_in semantic web.pptxNatural Language Processing_in semantic web.pptx
Natural Language Processing_in semantic web.pptxAlyaaMachi
 
Computational linguistics
Computational linguisticsComputational linguistics
Computational linguisticsshrey bhate
 
NLP pipeline in machine translation
NLP pipeline in machine translationNLP pipeline in machine translation
NLP pipeline in machine translationMarcis Pinnis
 
Chapter 2: Text Operation in information stroage and retrieval
Chapter 2: Text Operation in information stroage and retrievalChapter 2: Text Operation in information stroage and retrieval
Chapter 2: Text Operation in information stroage and retrievalcaptainmactavish1996
 
Chapter 2 Text Operation and Term Weighting.pdf
Chapter 2 Text Operation and Term Weighting.pdfChapter 2 Text Operation and Term Weighting.pdf
Chapter 2 Text Operation and Term Weighting.pdfJemalNesre1
 
Natural Language Processing: A comprehensive overview
Natural Language Processing: A comprehensive overviewNatural Language Processing: A comprehensive overview
Natural Language Processing: A comprehensive overviewBenjaminlapid1
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language ProcessingBhavya Chawla
 
Natural language processing with python and amharic syntax parse tree by dani...
Natural language processing with python and amharic syntax parse tree by dani...Natural language processing with python and amharic syntax parse tree by dani...
Natural language processing with python and amharic syntax parse tree by dani...Daniel Adenew
 
MACHINE-DRIVEN TEXT ANALYSIS
MACHINE-DRIVEN TEXT ANALYSISMACHINE-DRIVEN TEXT ANALYSIS
MACHINE-DRIVEN TEXT ANALYSISMassimo Schenone
 
AI Chapter VIIProblem Solving Using Searching .pptx
AI Chapter VIIProblem Solving Using Searching .pptxAI Chapter VIIProblem Solving Using Searching .pptx
AI Chapter VIIProblem Solving Using Searching .pptxwekineheshete
 
Open nlp presentationss
Open nlp presentationssOpen nlp presentationss
Open nlp presentationssChandan Deb
 

Similaire à NLP (20)

REPORT.doc
REPORT.docREPORT.doc
REPORT.doc
 
Natural Language Processing Course in AI
Natural Language Processing Course in AINatural Language Processing Course in AI
Natural Language Processing Course in AI
 
Artificial Intelligence Notes Unit 4
Artificial Intelligence Notes Unit 4Artificial Intelligence Notes Unit 4
Artificial Intelligence Notes Unit 4
 
Natural Language Processing (NLP)
Natural Language Processing (NLP)Natural Language Processing (NLP)
Natural Language Processing (NLP)
 
NLP
NLPNLP
NLP
 
Nlp (1)
Nlp (1)Nlp (1)
Nlp (1)
 
Natural Language Processing_in semantic web.pptx
Natural Language Processing_in semantic web.pptxNatural Language Processing_in semantic web.pptx
Natural Language Processing_in semantic web.pptx
 
Computational linguistics
Computational linguisticsComputational linguistics
Computational linguistics
 
NLP todo
NLP todoNLP todo
NLP todo
 
NLP-my-lecture (3).ppt
NLP-my-lecture (3).pptNLP-my-lecture (3).ppt
NLP-my-lecture (3).ppt
 
NLP pipeline in machine translation
NLP pipeline in machine translationNLP pipeline in machine translation
NLP pipeline in machine translation
 
Chapter 2: Text Operation in information stroage and retrieval
Chapter 2: Text Operation in information stroage and retrievalChapter 2: Text Operation in information stroage and retrieval
Chapter 2: Text Operation in information stroage and retrieval
 
Chapter 2 Text Operation and Term Weighting.pdf
Chapter 2 Text Operation and Term Weighting.pdfChapter 2 Text Operation and Term Weighting.pdf
Chapter 2 Text Operation and Term Weighting.pdf
 
Nlp
NlpNlp
Nlp
 
Natural Language Processing: A comprehensive overview
Natural Language Processing: A comprehensive overviewNatural Language Processing: A comprehensive overview
Natural Language Processing: A comprehensive overview
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
 
Natural language processing with python and amharic syntax parse tree by dani...
Natural language processing with python and amharic syntax parse tree by dani...Natural language processing with python and amharic syntax parse tree by dani...
Natural language processing with python and amharic syntax parse tree by dani...
 
MACHINE-DRIVEN TEXT ANALYSIS
MACHINE-DRIVEN TEXT ANALYSISMACHINE-DRIVEN TEXT ANALYSIS
MACHINE-DRIVEN TEXT ANALYSIS
 
AI Chapter VIIProblem Solving Using Searching .pptx
AI Chapter VIIProblem Solving Using Searching .pptxAI Chapter VIIProblem Solving Using Searching .pptx
AI Chapter VIIProblem Solving Using Searching .pptx
 
Open nlp presentationss
Open nlp presentationssOpen nlp presentationss
Open nlp presentationss
 

Dernier

Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsTechSoup
 
Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Disha Kariya
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Krashi Coaching
 
9548086042 for call girls in Indira Nagar with room service
9548086042  for call girls in Indira Nagar  with room service9548086042  for call girls in Indira Nagar  with room service
9548086042 for call girls in Indira Nagar with room servicediscovermytutordmt
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfciinovamais
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphThiyagu K
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdfQucHHunhnh
 
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...fonyou31
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfchloefrazer622
 
Russian Call Girls in Andheri Airport Mumbai WhatsApp 9167673311 💞 Full Nigh...
Russian Call Girls in Andheri Airport Mumbai WhatsApp  9167673311 💞 Full Nigh...Russian Call Girls in Andheri Airport Mumbai WhatsApp  9167673311 💞 Full Nigh...
Russian Call Girls in Andheri Airport Mumbai WhatsApp 9167673311 💞 Full Nigh...Pooja Nehwal
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactdawncurless
 
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...Sapna Thakur
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxheathfieldcps1
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxSayali Powar
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfJayanti Pande
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingTechSoup
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introductionMaksud Ahmed
 

Dernier (20)

Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
 
9548086042 for call girls in Indira Nagar with room service
9548086042  for call girls in Indira Nagar  with room service9548086042  for call girls in Indira Nagar  with room service
9548086042 for call girls in Indira Nagar with room service
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot Graph
 
Advance Mobile Application Development class 07
Advance Mobile Application Development class 07Advance Mobile Application Development class 07
Advance Mobile Application Development class 07
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdf
 
Russian Call Girls in Andheri Airport Mumbai WhatsApp 9167673311 💞 Full Nigh...
Russian Call Girls in Andheri Airport Mumbai WhatsApp  9167673311 💞 Full Nigh...Russian Call Girls in Andheri Airport Mumbai WhatsApp  9167673311 💞 Full Nigh...
Russian Call Girls in Andheri Airport Mumbai WhatsApp 9167673311 💞 Full Nigh...
 
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdf
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 

NLP

  • 2. Agenda  Definition & Introduction  Steps in NLP  Statistical NLP  Real World Application  Demos with free NLP Application
  • 3. Definition & Introduction  Natural language processing (NLP) is a field of computer science and linguistics concerned with the interactions between computers and human (natural) languages  Why Natural Language Processing ? Huge amounts of data ◦ Internet = at least 20 billions pages ◦ Intranet Applications for processing large amounts of texts require NLP expertise
  • 4. Definition & Introduction  We look at how we can exploit knowledge about the world, in combination with linguistic facts, to build computational natural language systems.  Natural language generation systems convert information from computer databases into readable human language, Natural language understanding systems convert samples of human language into more formal representations such as parse trees or first- order logic structures that are easier for computer programs to manipulate
  • 5. Steps in NLP  Phonetics, Phonology: how Word are prononce in termes of sequences of sounds  Morphological Analysis: Individual words are analyzed into their components and non word tokens such as punctuation are separated from the words.  Syntactic Analysis: Linear sequences of words are transformed into structures that show how the words relate to each other.  Semantic Analysis: The structures created by the syntactic analyzer are assigned meanings.  Discourse integration: The meaning of an individual sentence may depend on the sentences that precede it and may influence the meanings of the sentences that follow it.  Pragmatic Analysis: The structure representing what was said is reinterpreted to determine what was actually meant.
  • 6. Phonetics Study of the physical sounds of human speech ◦ /i:/, /ɜ:/, /ɔ:/, /ɑ:/ and /u:/ ◦ 'there' => /ðeə/ ◦ 'there on the table' => /ðeər ɒn ðə teɪbl / •Transcription of sounds (IPA)
  • 7. Phonetic  Articulory phonetics : production • Auditory phonetics : speech perception – McGurk effect • Acoustics phonetics: properties of sound waves (frequency and harmonics)
  • 8. Morphological Analysis  Suppose we have an english interface to an operating system and the following sentence is typed: ◦ I want to print Bill’s .init file.  Morphological analysis must do the following things: ◦ Pull apart the word “Bill’s” into proper noun “Bill” and the possessive suffix “’s” ◦ Recognize the sequence “.init” as a file extension that is functioning as an adjective in the sentence.  This process will usually assign syntactic categories to all the words in the sentence.  Consider the word “prints”. This word is either a plural noun or a third person singular verb ( he prints ).
  • 9. Syntactic Analysis  Syntactic analysis must exploit the results of morphological analysis to build a structural description of the sentence.  The goal of this process, called parsing, is to convert the flat list of words that forms the sentence into a structure that defines the units that are represented by that flat list.  The important thing here is that a flat sentence has been converted into a hierarchical structure and that the structure correspond to meaning units when semantic analysis is performed.  Reference markers are shown in the parenthesis in the parse tree  Each one corresponds to some entity that has been mentioned in the sentence.
  • 10. Syntactic Analysis  Syntactic Processing : Almost all the systems that are actually used have two main components: ◦ A declarative representation, called a grammar, of the syntactic facts about the language. ◦ A procedure, called parser, that compares the grammar against input sentences to produce parsed structures.
  • 11. Syntactic Analysis  Grammars and Parsers : The most common way to represent grammars is as a set of production rules. A simple Context-free phrase structure grammar fro English:  S → NP VP  NP → the NP1  NP → PRO  NP → PN  NP → NP1  NP1 → ADJS N  ADJS → ε | ADJ ADJS  VP → V  VP → V NP  N → file | printer  PN → Bill  PRO → I  ADJ → short | long | fast  V → printed | created | want First rule can be read as “ A sentence is composed of a noun phrase followed by Verb Phrase”; Vertical bar is OR ; ε represnts empty string. Symbols that are further expanded by rules are called nonterminal symbols. Symbols that correspond directly to strings that must be found in an input sentence are called terminal symbols.
  • 13. Syntactic Analysis  A parse tree : John ate the apple. 1. S -> NP VP 2. VP -> V NP 3. NP -> NAME 4. NP -> ART N 5. NAME -> John 6. V -> ate 7. ART-> the 8. N -> apple S NP VP NAME John V ate NP ART N the apple
  • 14. Semantic Analysis  Semantic analysis must do two important things: ◦ It must map individual words into appropriate objects in the knowledge base or database ◦ It must create the correct structures to correspond to the way the meanings of the individual words combine with each other.
  • 15. Semantic Analysis  Lexical processing :  The first step in any semantic processing system is to look up the individual words in a dictionary ( or lexicon) and extract their meanings.  Many words have several meanings, and it may not be possible to choose the correct one just by looking at the word itself.  The process of determining the correct meaning of an individual word is called word sense disambiguation or lexical disambiguation.  It is done by associating, with each word in lexicon, information about the contexts in which each of the word’s senses may appear.  Sometimes only very straightforward info about each word sense is necessary. For example, baseball field interpretation of diamond could be marked as a LOCATION.  Some useful semantic markers are : ◦ PHYSICAL-OBJECT ◦ ANIMATE-OBJECT ◦ ABSTRACT-OBJECT
  • 16. Semantic Analysis  Word Net (common sense KnowledgBase) : A database of lexical relations. Inspired by current psycholinguistic theories of human lexical memory. Synset: a set of synonyms, representing one underlying lexical concept ◦ Example:  fool {chump, fish, fool, gull, mark, patsy, fall guy, sucker, schlemiel, shlemiel, soft touch, mug} Relations link the synsets: hypernym, Has- Member, Member-Of, Antonym, etc. 16
  • 17. Semantic Analysis  Example pu-erh.cs.utexas.edu$ wn bike -partn Part Meronyms of noun bike 2 senses of bike Sense 1 motorcycle, bike HAS PART: mudguard, splashguard Sense 2 bicycle, bike, wheel HAS PART: bicycle seat, saddle HAS PART: bicycle wheel HAS PART: chain HAS PART: coaster brake HAS PART: handlebar HAS PART: mudguard, splashguard HAS PART: pedal, treadle, foot lever HAS PART: sprocket, sprocket wheel 17 • Example Pu-erh.cs.utexas.edu$wn bike Information available for noun bike -hypen Hypernyms -hypon, -treen Hyponyms & Hyponym Tree -synsn Synonyms (ordered by frequency) -partn Has Part Meronyms -meron All Meronyms -famln Familiarity & Polysemy Count -coorn Coordinate Sisters -simsn Synonyms (grouped by similarity of meaning) -hmern Hierarchical Meronyms -grepn List of Compound Words -over Overview of Senses Information available for verb bike -hypev Hypernyms -hypov, -treev Hyponyms & Hyponym Tree -synsv Synonyms (ordered by frequency) -famlv Familiarity & Polysemy Count -framv Verb Frames -simsv Synonyms (grouped by similarity of meaning) -grepv List of Compound Words -over Overview of Senses
  • 18. Discourse Integration  Specifically we do not know whom the pronoun “I” or the proper noun “Bill” refers to.  To pin down these references requires an appeal to a model of the current discourse context, from which we can learn that the current user is USER068 and that the only person named “Bill” about whom we could be talking is USER073.  Once the correct referent for Bill is known, we can also determine exactly which file is being referred to.
  • 19. Pragmatic Analysis  The final step toward effective understanding is to decide what to do as a results.  One possible thing to do is to record what was said as a fact and be done with it.  For some sentences, whose intended effect is clearly declarative, that is precisely correct thing to do.  But for other sentences, including this one, the intended effect is different.  We can discover this intended effect by applying a set of rules that characterize cooperative dialogues.  The final step in pragmatic processing is to translate, from the knowledge based representation to a command to be executed by the system.  The results of the understanding process is
  • 20. Pragmatic Analysis  Knowledge about the kind of actions that speakers intend by their use of sentences ◦ REQUEST: HAL, open the pod bay door. ◦ STATEMENT: HAL, the pod bay door is open. ◦ INFORMATION QUESTION: HAL, is the pod bay door open?  Speech act analysis (politeness, irony, greeting, apologizing...)
  • 21. Statistical NLP  Statistical NLP aims to perform statistical inference for the field of NLP  Statistical inference consists of taking some data generated in accordance with some unknown probability distribution and making inferences.
  • 22. Motivations for Statistical NLP  Cognitive modeling of the human language processing has not reached a stage where we can have a complete mapping between the language signal and the information contents.  Complete mapping is not always required.  Statistical approach provides the flexibility required for making the modeling of a language more accurate.
  • 23. Real World Application  Automatic summarization  Foreign language reading aid  Foreign language writing aid  Information extraction  Information retrieval (IR) - IR is concerned with storing, searching and retrieving information. It is a separate field within computer science (closer to databases), but IR relies on some NLP methods (for example, stemming). Some current research and applications seek to bridge the gap between IR and NLP.  Machine translation - Automatically translating from one human language to another.  Named entity recognition (NER) - Given a stream of text, determining which items in the text map to proper names, such as people or places. Although in English, named entities are marked with capitalized words, many other languages do not use capitalization to distinguish named entities.  Natural language generation  Natural language search
  • 24. Real World Application  Natural language understanding  Optical character recognition  anaphora resolution  Query expansion  Question answering - Given a human language question, the task of producing a human-language answer. The question may be a closed-ended (such as "What is the capital of Canada?") or open-ended (such as "What is the meaning of life?").  Speech recognition - Given a sound clip of a person or people speaking, the task of producing a text dictation of the speaker(s). (The opposite of text to speech.)  Spoken dialogue system  Stemming  Text simplification  Text-to-speech  Text-proofing
  • 25. Demos with free NLP Application DEMO