SlideShare une entreprise Scribd logo
1  sur  47
Télécharger pour lire hors ligne
Artificial Intelligence
Palacode Narayana Iyer Anantharaman
Gmail id: narayana dot anantharaman
9 Aug 2017
What is this course about?
• Ultimate AI dream: Build a machine that is
indistinguishable from humans.
• Google’s CEO Sundar Pichai: “AI First”
• This course will cover Applied AI using
modern techniques
• After taking this course you can:
• Develop a formal understanding of AI techniques
• Build products that require cutting edge
techniques that include and not limited to deep
learning techniques
Copyright 2016 JNResearch, All Rights Reserved
Human Cognitive tasks and AI applications
Humans Can: Typical AI Applications
See Image classification, Face Recognition
Speak Text to speech
Understand speech Speech Recognition : Speech to text
Write Handwriting generation, Draw
Understand written text Handwriting Recognition, Digit
Recognition
Drive Self Driving cars
Physical tasks: Walk, Run, … Robotics
What AI can do?
AI Definition
AI : Working Definition for our course (Ref: Russel and Norvig)
Four schools of thought (Ref: Russel and Norvig)
• Think humanly
• Understand how humans think and model this process
• Cognitive Science
• Act humanly
• Turing Test
• Knowledge, Reasoning, Language, Understanding, learning
• Think rationally
• Rational implies thinking or doing the right thing
• Use logic to encode the right thing and process inputs with
this framework
• Act rationally
• Define the right thing as: “Maximizing the goal
achievement”
• A rational agent achieves the best or optimal outcome
So, what is an agent?
Agent
EnvironmentSensors
Actuators
Internal StateInternal State
• Agent perceives the environment (Percepts) and acts upon the environment in order to
maximize achievement of the required goal (Actions)
• Looked at from this perspective, an agent is a function that maps the percepts to actions
Intelligent Agents
• Intelligent agents interact with the Environment
• Interactions through:
• Sensors
• Actuators
• The function f(.) that maps the sensor to actuators is the control policy
• Determines how the agent makes the decisions
• Those decisions take place many many times in a loop: Perception-Action cycle
• The concept of intelligent agents is abstract. One can cast any real world problem
using this model – for instance, it is possible to apply this model to search engine
design.
Fig Credit: Sebastian Thrun, Udacity
Modelling real world problems
Can we model the following examples? Identify what the sensors are, what are the
actuators and how would you describe the agent?
• Home Security Systems: Suppose we have a system that can take pictures or
record video on a continuous basis and we are interested in detecting an intruder.
• The speech recognizer: Receive the speech input and transcribe
• Receive the inputs from location sensors and turn the steering wheel
• Look at the last 4 hours data on stock trend of a set of companies and buy the
stock of the most promising company
What distinguishes AI programs?
• Algorithms that perform complex mathematical
operations do these much faster than a human
• If a student is perceived more “intelligent” if he
solves these problems faster, then why not these
programs be termed “intelligent”?
• In short, when do we call a program a AI
program?
Terminology
• Fully versus Partial Observability
• Deterministic versus Stochastic
• Discrete versus continuous
• Benign versus Adversarial
Observability
• Fully versus Partially Observable
• If what the agent senses momentarily at any time
from the environment is completely sufficient to
make the decisions it is fully observable
• Examples: Chess game, Tennis service,
• Environment has an internal state
• The agent may be able to fully observe the
state or partial
• The agent needs internal memory when
dealing with partially observable situations
• Markov models help us to structure such a
memory
Fig Credit: Sebastian Thrun, Udacity
Deterministic versus Stochastic
• Deterministic: chess moves – outcome is pre
determined
• Stochastic: Moving the dice – the outcome is not
pre determined
Discrete versus Continuous
• Examples of actions modelled as a continuous
variable
• Throwing a dart : infinitely many ways to angle the dart
• Turning the steering by an angle (0 to 360 deg, real
valued)
• Magnitude of acceleration to apply
• Actions that are Discrete Variables
• Choosing a gear (1, 2, 3, 4, Top, Reverse) in a car with
manual transmission system
• Deciding (buy, sell, wait) on stocks of a finite, small set
of companies and acting
• Deciding which elective course to sign up based on the
data available and aptitude
Benign versus Adversarial
• A benign system is not attempting to defeat the
agent
• Weather may affect the actions of a self driving car –
e.g: reduced visibility and hence increased
uncertainty of actions. But it is not an adversary
• An adversarial system attempts to score over
the agent. It tries to win over and not allow the
agent to succeed.
• A Generative Adversarial Network (GAN) is
modelled as an adversarial game.
AI as uncertainty management
• What to do when you don’t know what to do?
• Reasons for uncertainty
• Sensor faults, limitations
• Adversaries : we don’t know what it will do
• Stochastic Environments
• Ignorance
Uncertainty Management : NLP example
Exercise
Analyze wrt Observability, Stochasticity, Discrete/Continuous, Benign/Adversarial,
Stationary/Dynamic environment
• A “teacher agent” that decides the action to take (Go deeper, Give a break, Ask questions,
Repeat the concept) based on its observation of the environment (classroom).
• Game of chess
• Self Driving car
• Face recognition
Modelling
Fig Credit: Percy Liang, Stanford
Modelling AI Problems
Different types of problems may require different types of approaches
• Some problems can be easily represented using state spaces
• E.g. Robot navigation through the maze
• Problems that can be solved using Machine Learning techniques
• E.g. Face recognition
• Probabilistic Graphical Models such as Bayes Networks, HMMs
• E.g Speech Recognition
• Problems that can be well addressed using deductive logic
• Given certain propositions and input, perform logical inference – e.g. imagine a chatbot that
encodes some knowledge and can reason with the user
Example#1 : Modelling as a Graph Search
• Real World Problem
• Suppose you are to reach the Bangalore Airport from PESIT (your current location). You prefer
the route that leads you to the destination fastest. You are given the map of Bangalore and
are provided with information on the traffic congestion along each route and distance.
(Assume this doesn’t change till you reach the destination)
• Model
• Represent the landmarks as nodes (states) of a graph, the goal state being the Airport. Edges
represent the connection between the landmarks. Edges are annotated with the time cost of
moving from one landmark to the next.
• Algorithm
• Graph search algorithms such as BFS, DFS, Uniform Cost Search, A* Search etc
Example#2 – Modelling as a ML Problem
• Real World Problem
• Suppose you are to perform handwriting recognition, where the input is a English text written by
hand.
• Model
• Two possibilities: If the handwriting is recorded using a sensor that captures the strokes, we can
model the problem as a time series analysis problem and choose a suitable ML Classifier (HMM,
RNN, …)
• If the input is an image, we can model the problem as a text recognition in an image, where the
input is a tensor and choose a convolutional neural network
• Algorithm
• Training: Supervised learning using stochastic gradient descent for the type of classifier chosen
Artificial Intelligence, Machine Learning and Deep Learning
• The goal of AI is to build human-like intelligence on machines
• ML is a core approach to achieve this goal.
• Key idea behind ML: Learning from data
• ML is narrower in scope relative to AI
• DL is a suite of techniques that form a sub set of a broad suite
of ML techniques
• ML includes a broad variety of techniques like Probabilistic
Graphical Models, Decision Trees, Neural Networks etc. The
models can be shallow or deep.
• Deep learning uses a large number of computing layers stacked
vertically (output of one feeds in to the input of the next).
• The depth can be spatial or temporal
Copyright 2016 JNResearch, All Rights Reserved
AI
ML DL
Machine Learning
• During the first edition of DARPA self driving cars challenge (2004), none of the
participants succeeded
• In the next edition (2005), 5 cars succeeded, with Stanford Stanley bagging the
first position
• Machine Learning made all the difference
Copyright 2016 JNResearch, All Rights Reserved
When to apply Machine Learning?
• Data
• No data, no machine learning!
• Patterns that exist in the data
• If the data doesn’t contain definitive patterns, there is nothing to learn from
• No satisfactory algorithm exists
• If a satisfactory algorithm exists, no need to do statistical learning
Copyright 2016 JNResearch, All Rights Reserved
Machine Learning
Copyright 2016 JNResearch, All Rights Reserved
ML Approach Advantages
• E.g hte-> hate versus hte->the
• Example: tweets to correct words (Edit Distance > 2)
• Potential hands on project
Learning From Data
Fig Credit: Percy Liang, Stanford
ML – Some taxonomy
Copyright 2016 JNResearch, All Rights Reserved
Learning Taxonomy
What is “learnt”? Parameters, Structure, Hidden Concepts
Technique Supervised, Unsupervised, Reinforcement
What for? Prediction, Diagnostics, Summarization
How? Passive, Active, Online, Offline
Outputs Classification, Regression
Model Paradigm Generative, Discriminative
Can we learn better?
• Can we achieve a human-like performance at least for some narrowly defined tasks?
• If I happen to have lots of data, can my learning scale with data size?
• If the problem solved by a machine learning classifier is narrowly scoped, how to use
a ML approach to solve large, complex problems?
• How much of domain expertise we need to have in order to apply ML to our
problem? E.g. Should I be an expert in signal processing in order to design a speech
recognition system? Should I be a linguist knowledgeable on Kannada in order to
develop an English to Kannada machine translator?
Copyright 2016 JNResearch, All Rights Reserved
Deep Learning
• Large number of layers forming a deep network
• The depth can be spatial or temporal
• More complex models but less dependency on
human experts crafting the best features
• Due to the model’s higher capacity, can leverage
the data better – more the data you give, better
can be the learning
Copyright 2016 JNResearch, All Rights Reserved
Feature Learning (fig from Y Bengio)
• Representation Learning
• Automatically learn the “right”
features at each hidden layer
• Learn multiple levels of
representations increasing in
abstraction
• Allow effective sharing of the
learned parameters across
different tasks: Multitask learning
Three reasons to use deep learning
• Performance
• The difference between 93% to 96% can make all the difference
• Make cool technologies usable for a common man.
• Broad Applicability (Domain independence)
• Not limited to a narrow set of problems
• Minimize the need for domain specialized feature engineering
• New class of applications
• Applications that require higher level semantics as opposed to routine classification
• Multimodal fusion
• Generative models
Copyright 2016 JNResearch, All Rights Reserved
ML Everywhere!: Text, Speech, Image, Video
Copyright 2016 JNResearch, All Rights Reserved
Reflex Agents
• Actions of a reflex agent depends only on the current inputs
• The current percept determines the action
• Example:
• Imagine driving fast on a road in Bangalore. You suddenly notice a speed hump that is not
painted, nor there was any sign board on the road to caution you. The moment you notice
the hump that springs up unexpected, you just apply the brakes!
State Space Models
• Real world problems are modelled as a graph
• Example: Finding the low cost path between 2 cities when there are many
paths that are possible
• Solutions are represented as paths through the graph.
• Our goal is to find the optimal path
Examples
• Chinese words are written without spaces
• Arabic written without vowels
• English:rtfclntllgnc
State Space Models
Fig Credit: Percy Liang, Stanford
Variable Based Models
Fig Credit: Percy Liang,
Stanford
Course Plan
• We will use the broad framework used by
Stanford AI course
• We also draw heavily on the Sebastian
Thrun’s courses on Self Driving Cars
• We differ in the following aspects:
• Deeper deep learning as a tool to build
intelligent agents
• Hands on: Projects based on TensorFlow
• Emphasis on Applied AI : Core computer vision
problems, Core NLP problems and discussions
on self driving cars
Fig Credit: Percy Liang, Stanford
Course Contents
• Unit 1: Classical AI techniques
• Unit 2: Machine Learning and Deep Learning
• Unit 3: Applied AI: Computer Vision
• Unit 4: Applied AI: Natural Language Processing
• Unit 5: Applied AI: Robotics and Self Driving Cars
Why should I take this course?
• AI is a hot topic in the industry
• Every major technology company (Google, Apple, Facebook, Microsoft, Adobe, …) has made
huge bets on AI
• Large number of start ups working in this space and well funded
• AI courses and degree programs are highly sought after in the academia
• This course with focus on both sound theoretical principles as well as hands on
development helps you master the basics
Course Assessment
What are the pre-requisites?
• Technical
• Probability theory
• Calculus
• Linear Algebra
• Python Programming
• Aptitude
• Aptitude for AI, Machine Learning and willingness to experiment and a strong commitment to
class policy.
How will be the course experience?
• Is this an introductory course or a rigorous one?
• Rigorous – the course is going to cover both classical AI as well as modern applied AI.
• What way it will be rigorous?
• Contemporary Research topics will be covered in addition to traditional approaches
• Lab work will be intensive
• Total effort you will put in during the lab/evaluations/final exam will be sizable.
• Will it burn me out?
• If you have the right aptitude, it will not. On the contrary you will find the course thrilling.
(The best way is to find out if this is worth it is by consulting your seniors who have taken NLP
or AML in the recent times!)
Course Timings
• We will have 2 classes per week, each lasting 2 hours. One of the two classes will
be on Saturday.
• As I work in the industry, there may be a few rescheduling of the classes on some
occasions. We will keep such disruptions to a minimum.

Contenu connexe

Tendances

Intro/Overview on Machine Learning Presentation
Intro/Overview on Machine Learning PresentationIntro/Overview on Machine Learning Presentation
Intro/Overview on Machine Learning PresentationAnkit Gupta
 
Introduction to Artificial Intelligence.pptx
Introduction to Artificial Intelligence.pptxIntroduction to Artificial Intelligence.pptx
Introduction to Artificial Intelligence.pptxRSAISHANKAR
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine LearningEng Teong Cheah
 
Artificial intelligence introduction
Artificial intelligence introductionArtificial intelligence introduction
Artificial intelligence introductionBHAGYAPRASADBUGGE
 
Machine learning seminar ppt
Machine learning seminar pptMachine learning seminar ppt
Machine learning seminar pptRAHUL DANGWAL
 
Machine Learning: Applications, Process and Techniques
Machine Learning: Applications, Process and TechniquesMachine Learning: Applications, Process and Techniques
Machine Learning: Applications, Process and TechniquesRui Pedro Paiva
 
The fundamentals of Machine Learning
The fundamentals of Machine LearningThe fundamentals of Machine Learning
The fundamentals of Machine LearningHichem Felouat
 
Artificial Intelligence
Artificial IntelligenceArtificial Intelligence
Artificial Intelligencefalepiz
 
The Top Trends in Artificial Intelligence
The Top Trends in Artificial IntelligenceThe Top Trends in Artificial Intelligence
The Top Trends in Artificial IntelligenceErling Hesselberg
 
Machine learning ppt
Machine learning ppt Machine learning ppt
Machine learning ppt Poojamanic
 
Introduction to-machine-learning
Introduction to-machine-learningIntroduction to-machine-learning
Introduction to-machine-learningBabu Priyavrat
 
Machine learning
Machine learningMachine learning
Machine learningWes Eklund
 
Machine Learning
Machine LearningMachine Learning
Machine LearningShrey Malik
 

Tendances (20)

Intro/Overview on Machine Learning Presentation
Intro/Overview on Machine Learning PresentationIntro/Overview on Machine Learning Presentation
Intro/Overview on Machine Learning Presentation
 
Introduction to Artificial Intelligence.pptx
Introduction to Artificial Intelligence.pptxIntroduction to Artificial Intelligence.pptx
Introduction to Artificial Intelligence.pptx
 
Machine learning
Machine learningMachine learning
Machine learning
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine Learning
 
Machine learning
Machine learningMachine learning
Machine learning
 
Machine Can Think
Machine Can ThinkMachine Can Think
Machine Can Think
 
Artificial intelligence introduction
Artificial intelligence introductionArtificial intelligence introduction
Artificial intelligence introduction
 
Machine learning seminar ppt
Machine learning seminar pptMachine learning seminar ppt
Machine learning seminar ppt
 
Deep learning
Deep learningDeep learning
Deep learning
 
Machine Learning: Applications, Process and Techniques
Machine Learning: Applications, Process and TechniquesMachine Learning: Applications, Process and Techniques
Machine Learning: Applications, Process and Techniques
 
The fundamentals of Machine Learning
The fundamentals of Machine LearningThe fundamentals of Machine Learning
The fundamentals of Machine Learning
 
Artificial Intelligence
Artificial IntelligenceArtificial Intelligence
Artificial Intelligence
 
Machine Learning
Machine LearningMachine Learning
Machine Learning
 
The Top Trends in Artificial Intelligence
The Top Trends in Artificial IntelligenceThe Top Trends in Artificial Intelligence
The Top Trends in Artificial Intelligence
 
Machine learning ppt
Machine learning ppt Machine learning ppt
Machine learning ppt
 
Introduction to-machine-learning
Introduction to-machine-learningIntroduction to-machine-learning
Introduction to-machine-learning
 
Machine learning
Machine learningMachine learning
Machine learning
 
Turing test
Turing testTuring test
Turing test
 
Artificial intelligence
Artificial intelligenceArtificial intelligence
Artificial intelligence
 
Machine Learning
Machine LearningMachine Learning
Machine Learning
 

Similaire à Introduction to Artificial Intelligence

Artificial Intelligence and The Complexity
Artificial Intelligence and The ComplexityArtificial Intelligence and The Complexity
Artificial Intelligence and The ComplexityHendri Karisma
 
Introduction of machine learning.pptx
Introduction of machine learning.pptxIntroduction of machine learning.pptx
Introduction of machine learning.pptxDr.Shweta
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine LearningCloudxLab
 
Introduction To Applied Machine Learning
Introduction To Applied Machine LearningIntroduction To Applied Machine Learning
Introduction To Applied Machine Learningananth
 
Deep learning with tensorflow
Deep learning with tensorflowDeep learning with tensorflow
Deep learning with tensorflowCharmi Chokshi
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine LearningSSSSSS354882
 
Deep Learning Overview
Deep Learning OverviewDeep Learning Overview
Deep Learning OverviewCloudxLab
 
W1_Lec01_Lec02_Introduction.pptx
W1_Lec01_Lec02_Introduction.pptxW1_Lec01_Lec02_Introduction.pptx
W1_Lec01_Lec02_Introduction.pptxJavaid Iqbal
 
Machine learning basics by akanksha bali
Machine learning basics by akanksha baliMachine learning basics by akanksha bali
Machine learning basics by akanksha baliAkanksha Bali
 
Machine learning basics
Machine learning basics Machine learning basics
Machine learning basics Akanksha Bali
 
Engineering Intelligent Systems using Machine Learning
Engineering Intelligent Systems using Machine Learning Engineering Intelligent Systems using Machine Learning
Engineering Intelligent Systems using Machine Learning Saurabh Kaushik
 
AILABS - Lecture Series - Is AI the New Electricity? - Advances In Machine Le...
AILABS - Lecture Series - Is AI the New Electricity? - Advances In Machine Le...AILABS - Lecture Series - Is AI the New Electricity? - Advances In Machine Le...
AILABS - Lecture Series - Is AI the New Electricity? - Advances In Machine Le...AILABS Academy
 
[DSC Europe 22] On the Aspects of Artificial Intelligence and Robotic Autonom...
[DSC Europe 22] On the Aspects of Artificial Intelligence and Robotic Autonom...[DSC Europe 22] On the Aspects of Artificial Intelligence and Robotic Autonom...
[DSC Europe 22] On the Aspects of Artificial Intelligence and Robotic Autonom...DataScienceConferenc1
 
Machine Learning Basics - By Animesh Sinha
Machine Learning Basics - By Animesh Sinha Machine Learning Basics - By Animesh Sinha
Machine Learning Basics - By Animesh Sinha Animesh Sinha
 

Similaire à Introduction to Artificial Intelligence (20)

Artificial Intelligence and The Complexity
Artificial Intelligence and The ComplexityArtificial Intelligence and The Complexity
Artificial Intelligence and The Complexity
 
Introduction of machine learning.pptx
Introduction of machine learning.pptxIntroduction of machine learning.pptx
Introduction of machine learning.pptx
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine Learning
 
Introduction To Applied Machine Learning
Introduction To Applied Machine LearningIntroduction To Applied Machine Learning
Introduction To Applied Machine Learning
 
Deep learning with tensorflow
Deep learning with tensorflowDeep learning with tensorflow
Deep learning with tensorflow
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine Learning
 
Deep Learning Overview
Deep Learning OverviewDeep Learning Overview
Deep Learning Overview
 
W1_Lec01_Lec02_Introduction.pptx
W1_Lec01_Lec02_Introduction.pptxW1_Lec01_Lec02_Introduction.pptx
W1_Lec01_Lec02_Introduction.pptx
 
Machine learning basics by akanksha bali
Machine learning basics by akanksha baliMachine learning basics by akanksha bali
Machine learning basics by akanksha bali
 
Machine learning basics
Machine learning basics Machine learning basics
Machine learning basics
 
Engineering Intelligent Systems using Machine Learning
Engineering Intelligent Systems using Machine Learning Engineering Intelligent Systems using Machine Learning
Engineering Intelligent Systems using Machine Learning
 
Lecture 1 introduction
Lecture 1   introductionLecture 1   introduction
Lecture 1 introduction
 
machine learning
machine learningmachine learning
machine learning
 
tensorflow.pptx
tensorflow.pptxtensorflow.pptx
tensorflow.pptx
 
AILABS - Lecture Series - Is AI the New Electricity? - Advances In Machine Le...
AILABS - Lecture Series - Is AI the New Electricity? - Advances In Machine Le...AILABS - Lecture Series - Is AI the New Electricity? - Advances In Machine Le...
AILABS - Lecture Series - Is AI the New Electricity? - Advances In Machine Le...
 
Ai introduction ppt
Ai introduction pptAi introduction ppt
Ai introduction ppt
 
[DSC Europe 22] On the Aspects of Artificial Intelligence and Robotic Autonom...
[DSC Europe 22] On the Aspects of Artificial Intelligence and Robotic Autonom...[DSC Europe 22] On the Aspects of Artificial Intelligence and Robotic Autonom...
[DSC Europe 22] On the Aspects of Artificial Intelligence and Robotic Autonom...
 
Deep learning
Deep learningDeep learning
Deep learning
 
AI CH 1d.pptx
AI CH 1d.pptxAI CH 1d.pptx
AI CH 1d.pptx
 
Machine Learning Basics - By Animesh Sinha
Machine Learning Basics - By Animesh Sinha Machine Learning Basics - By Animesh Sinha
Machine Learning Basics - By Animesh Sinha
 

Plus de ananth

Generative Adversarial Networks : Basic architecture and variants
Generative Adversarial Networks : Basic architecture and variantsGenerative Adversarial Networks : Basic architecture and variants
Generative Adversarial Networks : Basic architecture and variantsananth
 
Convolutional Neural Networks : Popular Architectures
Convolutional Neural Networks : Popular ArchitecturesConvolutional Neural Networks : Popular Architectures
Convolutional Neural Networks : Popular Architecturesananth
 
Foundations: Artificial Neural Networks
Foundations: Artificial Neural NetworksFoundations: Artificial Neural Networks
Foundations: Artificial Neural Networksananth
 
Overview of Convolutional Neural Networks
Overview of Convolutional Neural NetworksOverview of Convolutional Neural Networks
Overview of Convolutional Neural Networksananth
 
Artificial Intelligence Course: Linear models
Artificial Intelligence Course: Linear models Artificial Intelligence Course: Linear models
Artificial Intelligence Course: Linear models ananth
 
An Overview of Naïve Bayes Classifier
An Overview of Naïve Bayes Classifier An Overview of Naïve Bayes Classifier
An Overview of Naïve Bayes Classifier ananth
 
Mathematical Background for Artificial Intelligence
Mathematical Background for Artificial IntelligenceMathematical Background for Artificial Intelligence
Mathematical Background for Artificial Intelligenceananth
 
Search problems in Artificial Intelligence
Search problems in Artificial IntelligenceSearch problems in Artificial Intelligence
Search problems in Artificial Intelligenceananth
 
Word representation: SVD, LSA, Word2Vec
Word representation: SVD, LSA, Word2VecWord representation: SVD, LSA, Word2Vec
Word representation: SVD, LSA, Word2Vecananth
 
Deep Learning For Speech Recognition
Deep Learning For Speech RecognitionDeep Learning For Speech Recognition
Deep Learning For Speech Recognitionananth
 
Overview of TensorFlow For Natural Language Processing
Overview of TensorFlow For Natural Language ProcessingOverview of TensorFlow For Natural Language Processing
Overview of TensorFlow For Natural Language Processingananth
 
Convolutional Neural Networks: Part 1
Convolutional Neural Networks: Part 1Convolutional Neural Networks: Part 1
Convolutional Neural Networks: Part 1ananth
 
Machine Learning Lecture 3 Decision Trees
Machine Learning Lecture 3 Decision TreesMachine Learning Lecture 3 Decision Trees
Machine Learning Lecture 3 Decision Treesananth
 
Machine Learning Lecture 2 Basics
Machine Learning Lecture 2 BasicsMachine Learning Lecture 2 Basics
Machine Learning Lecture 2 Basicsananth
 
Recurrent Neural Networks, LSTM and GRU
Recurrent Neural Networks, LSTM and GRURecurrent Neural Networks, LSTM and GRU
Recurrent Neural Networks, LSTM and GRUananth
 
MaxEnt (Loglinear) Models - Overview
MaxEnt (Loglinear) Models - OverviewMaxEnt (Loglinear) Models - Overview
MaxEnt (Loglinear) Models - Overviewananth
 
An overview of Hidden Markov Models (HMM)
An overview of Hidden Markov Models (HMM)An overview of Hidden Markov Models (HMM)
An overview of Hidden Markov Models (HMM)ananth
 
L06 stemmer and edit distance
L06 stemmer and edit distanceL06 stemmer and edit distance
L06 stemmer and edit distanceananth
 
L05 language model_part2
L05 language model_part2L05 language model_part2
L05 language model_part2ananth
 
L05 word representation
L05 word representationL05 word representation
L05 word representationananth
 

Plus de ananth (20)

Generative Adversarial Networks : Basic architecture and variants
Generative Adversarial Networks : Basic architecture and variantsGenerative Adversarial Networks : Basic architecture and variants
Generative Adversarial Networks : Basic architecture and variants
 
Convolutional Neural Networks : Popular Architectures
Convolutional Neural Networks : Popular ArchitecturesConvolutional Neural Networks : Popular Architectures
Convolutional Neural Networks : Popular Architectures
 
Foundations: Artificial Neural Networks
Foundations: Artificial Neural NetworksFoundations: Artificial Neural Networks
Foundations: Artificial Neural Networks
 
Overview of Convolutional Neural Networks
Overview of Convolutional Neural NetworksOverview of Convolutional Neural Networks
Overview of Convolutional Neural Networks
 
Artificial Intelligence Course: Linear models
Artificial Intelligence Course: Linear models Artificial Intelligence Course: Linear models
Artificial Intelligence Course: Linear models
 
An Overview of Naïve Bayes Classifier
An Overview of Naïve Bayes Classifier An Overview of Naïve Bayes Classifier
An Overview of Naïve Bayes Classifier
 
Mathematical Background for Artificial Intelligence
Mathematical Background for Artificial IntelligenceMathematical Background for Artificial Intelligence
Mathematical Background for Artificial Intelligence
 
Search problems in Artificial Intelligence
Search problems in Artificial IntelligenceSearch problems in Artificial Intelligence
Search problems in Artificial Intelligence
 
Word representation: SVD, LSA, Word2Vec
Word representation: SVD, LSA, Word2VecWord representation: SVD, LSA, Word2Vec
Word representation: SVD, LSA, Word2Vec
 
Deep Learning For Speech Recognition
Deep Learning For Speech RecognitionDeep Learning For Speech Recognition
Deep Learning For Speech Recognition
 
Overview of TensorFlow For Natural Language Processing
Overview of TensorFlow For Natural Language ProcessingOverview of TensorFlow For Natural Language Processing
Overview of TensorFlow For Natural Language Processing
 
Convolutional Neural Networks: Part 1
Convolutional Neural Networks: Part 1Convolutional Neural Networks: Part 1
Convolutional Neural Networks: Part 1
 
Machine Learning Lecture 3 Decision Trees
Machine Learning Lecture 3 Decision TreesMachine Learning Lecture 3 Decision Trees
Machine Learning Lecture 3 Decision Trees
 
Machine Learning Lecture 2 Basics
Machine Learning Lecture 2 BasicsMachine Learning Lecture 2 Basics
Machine Learning Lecture 2 Basics
 
Recurrent Neural Networks, LSTM and GRU
Recurrent Neural Networks, LSTM and GRURecurrent Neural Networks, LSTM and GRU
Recurrent Neural Networks, LSTM and GRU
 
MaxEnt (Loglinear) Models - Overview
MaxEnt (Loglinear) Models - OverviewMaxEnt (Loglinear) Models - Overview
MaxEnt (Loglinear) Models - Overview
 
An overview of Hidden Markov Models (HMM)
An overview of Hidden Markov Models (HMM)An overview of Hidden Markov Models (HMM)
An overview of Hidden Markov Models (HMM)
 
L06 stemmer and edit distance
L06 stemmer and edit distanceL06 stemmer and edit distance
L06 stemmer and edit distance
 
L05 language model_part2
L05 language model_part2L05 language model_part2
L05 language model_part2
 
L05 word representation
L05 word representationL05 word representation
L05 word representation
 

Dernier

Call Us ≽ 8377877756 ≼ Call Girls In Shastri Nagar (Delhi)
Call Us ≽ 8377877756 ≼ Call Girls In Shastri Nagar (Delhi)Call Us ≽ 8377877756 ≼ Call Girls In Shastri Nagar (Delhi)
Call Us ≽ 8377877756 ≼ Call Girls In Shastri Nagar (Delhi)dollysharma2066
 
Arduino_CSE ece ppt for working and principal of arduino.ppt
Arduino_CSE ece ppt for working and principal of arduino.pptArduino_CSE ece ppt for working and principal of arduino.ppt
Arduino_CSE ece ppt for working and principal of arduino.pptSAURABHKUMAR892774
 
Instrumentation, measurement and control of bio process parameters ( Temperat...
Instrumentation, measurement and control of bio process parameters ( Temperat...Instrumentation, measurement and control of bio process parameters ( Temperat...
Instrumentation, measurement and control of bio process parameters ( Temperat...121011101441
 
Why does (not) Kafka need fsync: Eliminating tail latency spikes caused by fsync
Why does (not) Kafka need fsync: Eliminating tail latency spikes caused by fsyncWhy does (not) Kafka need fsync: Eliminating tail latency spikes caused by fsync
Why does (not) Kafka need fsync: Eliminating tail latency spikes caused by fsyncssuser2ae721
 
Application of Residue Theorem to evaluate real integrations.pptx
Application of Residue Theorem to evaluate real integrations.pptxApplication of Residue Theorem to evaluate real integrations.pptx
Application of Residue Theorem to evaluate real integrations.pptx959SahilShah
 
Class 1 | NFPA 72 | Overview Fire Alarm System
Class 1 | NFPA 72 | Overview Fire Alarm SystemClass 1 | NFPA 72 | Overview Fire Alarm System
Class 1 | NFPA 72 | Overview Fire Alarm Systemirfanmechengr
 
Transport layer issues and challenges - Guide
Transport layer issues and challenges - GuideTransport layer issues and challenges - Guide
Transport layer issues and challenges - GuideGOPINATHS437943
 
Concrete Mix Design - IS 10262-2019 - .pptx
Concrete Mix Design - IS 10262-2019 - .pptxConcrete Mix Design - IS 10262-2019 - .pptx
Concrete Mix Design - IS 10262-2019 - .pptxKartikeyaDwivedi3
 
US Department of Education FAFSA Week of Action
US Department of Education FAFSA Week of ActionUS Department of Education FAFSA Week of Action
US Department of Education FAFSA Week of ActionMebane Rash
 
Unit7-DC_Motors nkkjnsdkfnfcdfknfdgfggfg
Unit7-DC_Motors nkkjnsdkfnfcdfknfdgfggfgUnit7-DC_Motors nkkjnsdkfnfcdfknfdgfggfg
Unit7-DC_Motors nkkjnsdkfnfcdfknfdgfggfgsaravananr517913
 
Oxy acetylene welding presentation note.
Oxy acetylene welding presentation note.Oxy acetylene welding presentation note.
Oxy acetylene welding presentation note.eptoze12
 
CCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdf
CCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdfCCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdf
CCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdfAsst.prof M.Gokilavani
 
Call Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile serviceCall Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile servicerehmti665
 
UNIT III ANALOG ELECTRONICS (BASIC ELECTRONICS)
UNIT III ANALOG ELECTRONICS (BASIC ELECTRONICS)UNIT III ANALOG ELECTRONICS (BASIC ELECTRONICS)
UNIT III ANALOG ELECTRONICS (BASIC ELECTRONICS)Dr SOUNDIRARAJ N
 
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube ExchangerStudy on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube ExchangerAnamika Sarkar
 
welding defects observed during the welding
welding defects observed during the weldingwelding defects observed during the welding
welding defects observed during the weldingMuhammadUzairLiaqat
 

Dernier (20)

Call Us ≽ 8377877756 ≼ Call Girls In Shastri Nagar (Delhi)
Call Us ≽ 8377877756 ≼ Call Girls In Shastri Nagar (Delhi)Call Us ≽ 8377877756 ≼ Call Girls In Shastri Nagar (Delhi)
Call Us ≽ 8377877756 ≼ Call Girls In Shastri Nagar (Delhi)
 
Arduino_CSE ece ppt for working and principal of arduino.ppt
Arduino_CSE ece ppt for working and principal of arduino.pptArduino_CSE ece ppt for working and principal of arduino.ppt
Arduino_CSE ece ppt for working and principal of arduino.ppt
 
Instrumentation, measurement and control of bio process parameters ( Temperat...
Instrumentation, measurement and control of bio process parameters ( Temperat...Instrumentation, measurement and control of bio process parameters ( Temperat...
Instrumentation, measurement and control of bio process parameters ( Temperat...
 
Why does (not) Kafka need fsync: Eliminating tail latency spikes caused by fsync
Why does (not) Kafka need fsync: Eliminating tail latency spikes caused by fsyncWhy does (not) Kafka need fsync: Eliminating tail latency spikes caused by fsync
Why does (not) Kafka need fsync: Eliminating tail latency spikes caused by fsync
 
Application of Residue Theorem to evaluate real integrations.pptx
Application of Residue Theorem to evaluate real integrations.pptxApplication of Residue Theorem to evaluate real integrations.pptx
Application of Residue Theorem to evaluate real integrations.pptx
 
Class 1 | NFPA 72 | Overview Fire Alarm System
Class 1 | NFPA 72 | Overview Fire Alarm SystemClass 1 | NFPA 72 | Overview Fire Alarm System
Class 1 | NFPA 72 | Overview Fire Alarm System
 
Transport layer issues and challenges - Guide
Transport layer issues and challenges - GuideTransport layer issues and challenges - Guide
Transport layer issues and challenges - Guide
 
Design and analysis of solar grass cutter.pdf
Design and analysis of solar grass cutter.pdfDesign and analysis of solar grass cutter.pdf
Design and analysis of solar grass cutter.pdf
 
Concrete Mix Design - IS 10262-2019 - .pptx
Concrete Mix Design - IS 10262-2019 - .pptxConcrete Mix Design - IS 10262-2019 - .pptx
Concrete Mix Design - IS 10262-2019 - .pptx
 
US Department of Education FAFSA Week of Action
US Department of Education FAFSA Week of ActionUS Department of Education FAFSA Week of Action
US Department of Education FAFSA Week of Action
 
POWER SYSTEMS-1 Complete notes examples
POWER SYSTEMS-1 Complete notes  examplesPOWER SYSTEMS-1 Complete notes  examples
POWER SYSTEMS-1 Complete notes examples
 
Unit7-DC_Motors nkkjnsdkfnfcdfknfdgfggfg
Unit7-DC_Motors nkkjnsdkfnfcdfknfdgfggfgUnit7-DC_Motors nkkjnsdkfnfcdfknfdgfggfg
Unit7-DC_Motors nkkjnsdkfnfcdfknfdgfggfg
 
young call girls in Green Park🔝 9953056974 🔝 escort Service
young call girls in Green Park🔝 9953056974 🔝 escort Serviceyoung call girls in Green Park🔝 9953056974 🔝 escort Service
young call girls in Green Park🔝 9953056974 🔝 escort Service
 
Oxy acetylene welding presentation note.
Oxy acetylene welding presentation note.Oxy acetylene welding presentation note.
Oxy acetylene welding presentation note.
 
CCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdf
CCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdfCCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdf
CCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdf
 
Call Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile serviceCall Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile service
 
UNIT III ANALOG ELECTRONICS (BASIC ELECTRONICS)
UNIT III ANALOG ELECTRONICS (BASIC ELECTRONICS)UNIT III ANALOG ELECTRONICS (BASIC ELECTRONICS)
UNIT III ANALOG ELECTRONICS (BASIC ELECTRONICS)
 
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube ExchangerStudy on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
 
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptxExploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
 
welding defects observed during the welding
welding defects observed during the weldingwelding defects observed during the welding
welding defects observed during the welding
 

Introduction to Artificial Intelligence

  • 1. Artificial Intelligence Palacode Narayana Iyer Anantharaman Gmail id: narayana dot anantharaman 9 Aug 2017
  • 2. What is this course about? • Ultimate AI dream: Build a machine that is indistinguishable from humans. • Google’s CEO Sundar Pichai: “AI First” • This course will cover Applied AI using modern techniques • After taking this course you can: • Develop a formal understanding of AI techniques • Build products that require cutting edge techniques that include and not limited to deep learning techniques Copyright 2016 JNResearch, All Rights Reserved
  • 3. Human Cognitive tasks and AI applications Humans Can: Typical AI Applications See Image classification, Face Recognition Speak Text to speech Understand speech Speech Recognition : Speech to text Write Handwriting generation, Draw Understand written text Handwriting Recognition, Digit Recognition Drive Self Driving cars Physical tasks: Walk, Run, … Robotics
  • 6. AI : Working Definition for our course (Ref: Russel and Norvig)
  • 7. Four schools of thought (Ref: Russel and Norvig) • Think humanly • Understand how humans think and model this process • Cognitive Science • Act humanly • Turing Test • Knowledge, Reasoning, Language, Understanding, learning • Think rationally • Rational implies thinking or doing the right thing • Use logic to encode the right thing and process inputs with this framework • Act rationally • Define the right thing as: “Maximizing the goal achievement” • A rational agent achieves the best or optimal outcome
  • 8. So, what is an agent? Agent EnvironmentSensors Actuators Internal StateInternal State • Agent perceives the environment (Percepts) and acts upon the environment in order to maximize achievement of the required goal (Actions) • Looked at from this perspective, an agent is a function that maps the percepts to actions
  • 9. Intelligent Agents • Intelligent agents interact with the Environment • Interactions through: • Sensors • Actuators • The function f(.) that maps the sensor to actuators is the control policy • Determines how the agent makes the decisions • Those decisions take place many many times in a loop: Perception-Action cycle • The concept of intelligent agents is abstract. One can cast any real world problem using this model – for instance, it is possible to apply this model to search engine design. Fig Credit: Sebastian Thrun, Udacity
  • 10. Modelling real world problems Can we model the following examples? Identify what the sensors are, what are the actuators and how would you describe the agent? • Home Security Systems: Suppose we have a system that can take pictures or record video on a continuous basis and we are interested in detecting an intruder. • The speech recognizer: Receive the speech input and transcribe • Receive the inputs from location sensors and turn the steering wheel • Look at the last 4 hours data on stock trend of a set of companies and buy the stock of the most promising company
  • 11. What distinguishes AI programs? • Algorithms that perform complex mathematical operations do these much faster than a human • If a student is perceived more “intelligent” if he solves these problems faster, then why not these programs be termed “intelligent”? • In short, when do we call a program a AI program?
  • 12. Terminology • Fully versus Partial Observability • Deterministic versus Stochastic • Discrete versus continuous • Benign versus Adversarial
  • 13. Observability • Fully versus Partially Observable • If what the agent senses momentarily at any time from the environment is completely sufficient to make the decisions it is fully observable • Examples: Chess game, Tennis service, • Environment has an internal state • The agent may be able to fully observe the state or partial • The agent needs internal memory when dealing with partially observable situations • Markov models help us to structure such a memory Fig Credit: Sebastian Thrun, Udacity
  • 14. Deterministic versus Stochastic • Deterministic: chess moves – outcome is pre determined • Stochastic: Moving the dice – the outcome is not pre determined
  • 15. Discrete versus Continuous • Examples of actions modelled as a continuous variable • Throwing a dart : infinitely many ways to angle the dart • Turning the steering by an angle (0 to 360 deg, real valued) • Magnitude of acceleration to apply • Actions that are Discrete Variables • Choosing a gear (1, 2, 3, 4, Top, Reverse) in a car with manual transmission system • Deciding (buy, sell, wait) on stocks of a finite, small set of companies and acting • Deciding which elective course to sign up based on the data available and aptitude
  • 16. Benign versus Adversarial • A benign system is not attempting to defeat the agent • Weather may affect the actions of a self driving car – e.g: reduced visibility and hence increased uncertainty of actions. But it is not an adversary • An adversarial system attempts to score over the agent. It tries to win over and not allow the agent to succeed. • A Generative Adversarial Network (GAN) is modelled as an adversarial game.
  • 17. AI as uncertainty management • What to do when you don’t know what to do? • Reasons for uncertainty • Sensor faults, limitations • Adversaries : we don’t know what it will do • Stochastic Environments • Ignorance
  • 19. Exercise Analyze wrt Observability, Stochasticity, Discrete/Continuous, Benign/Adversarial, Stationary/Dynamic environment • A “teacher agent” that decides the action to take (Go deeper, Give a break, Ask questions, Repeat the concept) based on its observation of the environment (classroom). • Game of chess • Self Driving car • Face recognition
  • 20. Modelling Fig Credit: Percy Liang, Stanford
  • 21. Modelling AI Problems Different types of problems may require different types of approaches • Some problems can be easily represented using state spaces • E.g. Robot navigation through the maze • Problems that can be solved using Machine Learning techniques • E.g. Face recognition • Probabilistic Graphical Models such as Bayes Networks, HMMs • E.g Speech Recognition • Problems that can be well addressed using deductive logic • Given certain propositions and input, perform logical inference – e.g. imagine a chatbot that encodes some knowledge and can reason with the user
  • 22. Example#1 : Modelling as a Graph Search • Real World Problem • Suppose you are to reach the Bangalore Airport from PESIT (your current location). You prefer the route that leads you to the destination fastest. You are given the map of Bangalore and are provided with information on the traffic congestion along each route and distance. (Assume this doesn’t change till you reach the destination) • Model • Represent the landmarks as nodes (states) of a graph, the goal state being the Airport. Edges represent the connection between the landmarks. Edges are annotated with the time cost of moving from one landmark to the next. • Algorithm • Graph search algorithms such as BFS, DFS, Uniform Cost Search, A* Search etc
  • 23. Example#2 – Modelling as a ML Problem • Real World Problem • Suppose you are to perform handwriting recognition, where the input is a English text written by hand. • Model • Two possibilities: If the handwriting is recorded using a sensor that captures the strokes, we can model the problem as a time series analysis problem and choose a suitable ML Classifier (HMM, RNN, …) • If the input is an image, we can model the problem as a text recognition in an image, where the input is a tensor and choose a convolutional neural network • Algorithm • Training: Supervised learning using stochastic gradient descent for the type of classifier chosen
  • 24. Artificial Intelligence, Machine Learning and Deep Learning • The goal of AI is to build human-like intelligence on machines • ML is a core approach to achieve this goal. • Key idea behind ML: Learning from data • ML is narrower in scope relative to AI • DL is a suite of techniques that form a sub set of a broad suite of ML techniques • ML includes a broad variety of techniques like Probabilistic Graphical Models, Decision Trees, Neural Networks etc. The models can be shallow or deep. • Deep learning uses a large number of computing layers stacked vertically (output of one feeds in to the input of the next). • The depth can be spatial or temporal Copyright 2016 JNResearch, All Rights Reserved AI ML DL
  • 25. Machine Learning • During the first edition of DARPA self driving cars challenge (2004), none of the participants succeeded • In the next edition (2005), 5 cars succeeded, with Stanford Stanley bagging the first position • Machine Learning made all the difference Copyright 2016 JNResearch, All Rights Reserved
  • 26. When to apply Machine Learning? • Data • No data, no machine learning! • Patterns that exist in the data • If the data doesn’t contain definitive patterns, there is nothing to learn from • No satisfactory algorithm exists • If a satisfactory algorithm exists, no need to do statistical learning Copyright 2016 JNResearch, All Rights Reserved
  • 27. Machine Learning Copyright 2016 JNResearch, All Rights Reserved
  • 28. ML Approach Advantages • E.g hte-> hate versus hte->the • Example: tweets to correct words (Edit Distance > 2) • Potential hands on project
  • 29. Learning From Data Fig Credit: Percy Liang, Stanford
  • 30. ML – Some taxonomy Copyright 2016 JNResearch, All Rights Reserved Learning Taxonomy What is “learnt”? Parameters, Structure, Hidden Concepts Technique Supervised, Unsupervised, Reinforcement What for? Prediction, Diagnostics, Summarization How? Passive, Active, Online, Offline Outputs Classification, Regression Model Paradigm Generative, Discriminative
  • 31. Can we learn better? • Can we achieve a human-like performance at least for some narrowly defined tasks? • If I happen to have lots of data, can my learning scale with data size? • If the problem solved by a machine learning classifier is narrowly scoped, how to use a ML approach to solve large, complex problems? • How much of domain expertise we need to have in order to apply ML to our problem? E.g. Should I be an expert in signal processing in order to design a speech recognition system? Should I be a linguist knowledgeable on Kannada in order to develop an English to Kannada machine translator? Copyright 2016 JNResearch, All Rights Reserved
  • 32. Deep Learning • Large number of layers forming a deep network • The depth can be spatial or temporal • More complex models but less dependency on human experts crafting the best features • Due to the model’s higher capacity, can leverage the data better – more the data you give, better can be the learning Copyright 2016 JNResearch, All Rights Reserved
  • 33. Feature Learning (fig from Y Bengio) • Representation Learning • Automatically learn the “right” features at each hidden layer • Learn multiple levels of representations increasing in abstraction • Allow effective sharing of the learned parameters across different tasks: Multitask learning
  • 34. Three reasons to use deep learning • Performance • The difference between 93% to 96% can make all the difference • Make cool technologies usable for a common man. • Broad Applicability (Domain independence) • Not limited to a narrow set of problems • Minimize the need for domain specialized feature engineering • New class of applications • Applications that require higher level semantics as opposed to routine classification • Multimodal fusion • Generative models Copyright 2016 JNResearch, All Rights Reserved
  • 35. ML Everywhere!: Text, Speech, Image, Video Copyright 2016 JNResearch, All Rights Reserved
  • 36. Reflex Agents • Actions of a reflex agent depends only on the current inputs • The current percept determines the action • Example: • Imagine driving fast on a road in Bangalore. You suddenly notice a speed hump that is not painted, nor there was any sign board on the road to caution you. The moment you notice the hump that springs up unexpected, you just apply the brakes!
  • 37. State Space Models • Real world problems are modelled as a graph • Example: Finding the low cost path between 2 cities when there are many paths that are possible • Solutions are represented as paths through the graph. • Our goal is to find the optimal path
  • 38. Examples • Chinese words are written without spaces • Arabic written without vowels • English:rtfclntllgnc
  • 39. State Space Models Fig Credit: Percy Liang, Stanford
  • 40. Variable Based Models Fig Credit: Percy Liang, Stanford
  • 41. Course Plan • We will use the broad framework used by Stanford AI course • We also draw heavily on the Sebastian Thrun’s courses on Self Driving Cars • We differ in the following aspects: • Deeper deep learning as a tool to build intelligent agents • Hands on: Projects based on TensorFlow • Emphasis on Applied AI : Core computer vision problems, Core NLP problems and discussions on self driving cars Fig Credit: Percy Liang, Stanford
  • 42. Course Contents • Unit 1: Classical AI techniques • Unit 2: Machine Learning and Deep Learning • Unit 3: Applied AI: Computer Vision • Unit 4: Applied AI: Natural Language Processing • Unit 5: Applied AI: Robotics and Self Driving Cars
  • 43. Why should I take this course? • AI is a hot topic in the industry • Every major technology company (Google, Apple, Facebook, Microsoft, Adobe, …) has made huge bets on AI • Large number of start ups working in this space and well funded • AI courses and degree programs are highly sought after in the academia • This course with focus on both sound theoretical principles as well as hands on development helps you master the basics
  • 45. What are the pre-requisites? • Technical • Probability theory • Calculus • Linear Algebra • Python Programming • Aptitude • Aptitude for AI, Machine Learning and willingness to experiment and a strong commitment to class policy.
  • 46. How will be the course experience? • Is this an introductory course or a rigorous one? • Rigorous – the course is going to cover both classical AI as well as modern applied AI. • What way it will be rigorous? • Contemporary Research topics will be covered in addition to traditional approaches • Lab work will be intensive • Total effort you will put in during the lab/evaluations/final exam will be sizable. • Will it burn me out? • If you have the right aptitude, it will not. On the contrary you will find the course thrilling. (The best way is to find out if this is worth it is by consulting your seniors who have taken NLP or AML in the recent times!)
  • 47. Course Timings • We will have 2 classes per week, each lasting 2 hours. One of the two classes will be on Saturday. • As I work in the industry, there may be a few rescheduling of the classes on some occasions. We will keep such disruptions to a minimum.