SlideShare une entreprise Scribd logo
1  sur  33
Télécharger pour lire hors ligne
Practical Machine Learning
Jaganadh G
jaganadhg@gmail.com
BarCamp Kerala 9
Amrita Vishwa Vidyapeetham
Karunagapally
14 November 2010
Jaganadh G Practical Machine Learning
About me !!
Working in Natural Language Processing, Machine Learning,
Data Mining etc...
Passionate about Free and Open source :-)
When gets free time teaches Python and blogs at
http://jaganadhg.freeflux.net/blog
Working as Project Lead (NLP) 365Media Pvt. Ltd.
Coimbatore
I am a computational linguist / Linguist and Indologist
Now Software Engineer by Profession
Jaganadh G Practical Machine Learning
Machine Learning
Machine Learning
Machine learning is a subfield of artificial intelligence (AI)
concerned with algorithms that allow computers to learn.
Jaganadh G Practical Machine Learning
Machine Learning
Machine Learning
Machine learning is a subfield of artificial intelligence (AI)
concerned with algorithms that allow computers to learn.
Jaganadh G Practical Machine Learning
Machine Learning
Machine Learning
Machine learning is a subfield of artificial intelligence (AI)
concerned with algorithms that allow computers to learn.
This talk is not aimed to give introduction about Machine
Learning
Jaganadh G Practical Machine Learning
Machine Learning
Machine Learning
Machine learning is a subfield of artificial intelligence (AI)
concerned with algorithms that allow computers to learn.
This talk is not aimed to give introduction about Machine
Learning
Dont expect some mathy equations here
Jaganadh G Practical Machine Learning
Machine Learning and Our Life
Do you think that Machine Learning has any impact in our life
??
Jaganadh G Practical Machine Learning
Machine Learning and Our Life
Do you think that Machine Learning has any impact in our life
??
Yes
Jaganadh G Practical Machine Learning
Machine Learning and Our Life
Do you think that Machine Learning has any impact in our life
??
Yes
In our day to day life we may use many Machine Learning
powered tools
Jaganadh G Practical Machine Learning
Machine Learning and Our Life
Do you think that Machine Learning has any impact in our life
??
Yes
In our day to day life we may use many Machine Learning
powered tools
E-mail spam filtering , product recommendations etc ..
Jaganadh G Practical Machine Learning
Machine Learning and Our Life
Do you think that Machine Learning has any impact in our life
??
Yes
In our day to day life we may use many Machine Learning
powered tools
E-mail spam filtering , product recommendations etc ..
Fraud detection
Jaganadh G Practical Machine Learning
Examples
Jaganadh G Practical Machine Learning
Examples
Jaganadh G Practical Machine Learning
Examples
Jaganadh G Practical Machine Learning
Tool for building Machine Learning powerd product/service
Apache Mahout
Apache Mahout is a scalable machine learning library that supports
large data sets. Apache Mahout’s goal is to build scalable machine
learning libraries.
Commercially friendly licence
Well documented
Healthy community
Targeted to developers
Jaganadh G Practical Machine Learning
Algorithms in Apache Mahout
Jaganadh G Practical Machine Learning
Algorithms in Apache Mahout
Collaborative Filtering
Jaganadh G Practical Machine Learning
Algorithms in Apache Mahout
Collaborative Filtering
User and Item based recommenders
Jaganadh G Practical Machine Learning
Algorithms in Apache Mahout
Collaborative Filtering
User and Item based recommenders
K-Means, Fuzzy K-Means clustering
Jaganadh G Practical Machine Learning
Algorithms in Apache Mahout
Collaborative Filtering
User and Item based recommenders
K-Means, Fuzzy K-Means clustering
Mean Shift clustering
Jaganadh G Practical Machine Learning
Algorithms in Apache Mahout
Collaborative Filtering
User and Item based recommenders
K-Means, Fuzzy K-Means clustering
Mean Shift clustering
Dirichlet process clustering
Jaganadh G Practical Machine Learning
Algorithms in Apache Mahout
Collaborative Filtering
User and Item based recommenders
K-Means, Fuzzy K-Means clustering
Mean Shift clustering
Dirichlet process clustering
Latent Dirichlet Allocation
Jaganadh G Practical Machine Learning
Algorithms in Apache Mahout
Collaborative Filtering
User and Item based recommenders
K-Means, Fuzzy K-Means clustering
Mean Shift clustering
Dirichlet process clustering
Latent Dirichlet Allocation
Singular value decomposition
Jaganadh G Practical Machine Learning
Algorithms in Apache Mahout
Collaborative Filtering
User and Item based recommenders
K-Means, Fuzzy K-Means clustering
Mean Shift clustering
Dirichlet process clustering
Latent Dirichlet Allocation
Singular value decomposition
Parallel Frequent Pattern mining
Jaganadh G Practical Machine Learning
Algorithms in Apache Mahout
Collaborative Filtering
User and Item based recommenders
K-Means, Fuzzy K-Means clustering
Mean Shift clustering
Dirichlet process clustering
Latent Dirichlet Allocation
Singular value decomposition
Parallel Frequent Pattern mining
Complementary Naive Bayes classifier
Jaganadh G Practical Machine Learning
Algorithms in Apache Mahout
Collaborative Filtering
User and Item based recommenders
K-Means, Fuzzy K-Means clustering
Mean Shift clustering
Dirichlet process clustering
Latent Dirichlet Allocation
Singular value decomposition
Parallel Frequent Pattern mining
Complementary Naive Bayes classifier
Random forest decision tree based classifier
Jaganadh G Practical Machine Learning
Demo
Building recommendations engines with Mahout
Document Classification with Mahout
Some Python stuff on Machine Learning
Jaganadh G Practical Machine Learning
Reference
Jaganadh G Practical Machine Learning
Reference
Mahout in Action - Book by Sean Owen and Robin Anil,
published by Manning Publications.
Taming Text - By Grant Ingersoll and Tom Morton, published
by Manning Publications.
Introducing Apache Mahout - Grant Ingersoll - Intro to
Apache Mahout focused on clustering, classification and
collaborative filtering.
https://www.ibm.com/developerworks/java/library/j-
mahout/index.html
Programming Collective Intelligence: Building Smart Web 2.0
Applications
http://www.amazon.com/Programming-Collective-
Intelligence-Building-Applications/dp/0596529325
Jaganadh G Practical Machine Learning
Useful Resources
Apache Mahout Site http://mahout.apache.org/
Apache Mahout Mailing List user@mahout.apache.org
The code which I used for Mahout demo is available at
http://bitbucket.org/jaganadhg/blog/src/tip/bck9/java/
Twenty News Group data set
http://people.csail.mit.edu/jrennie/20Newsgroups/20news-
bydate.tar.gz
Jaganadh G Practical Machine Learning
Questions ??
Jaganadh G Practical Machine Learning
Acknowledgments
Thanks to :
Manning Publications for Review Copy of the book ”Mahout
in Action”
Apache Mahout mailing list members
Ted Dunning and Robin Anil for suggestions
Sreejith S and Biju B for Java help
@chelakkandupoda for review and criticism
Mukundhanchari R&D Director 365Media Pvt. Ltd. for
support and encouragement
Jaganadh G Practical Machine Learning
Finally
Jaganadh G Practical Machine Learning

Contenu connexe

Similaire à Practical Machine Learning

Practical Natural Language Processing
Practical Natural Language ProcessingPractical Natural Language Processing
Practical Natural Language Processing
Jaganadh Gopinadhan
 
Machine Learning Engineer Salary, Roles And Responsibilities, Skills and Resu...
Machine Learning Engineer Salary, Roles And Responsibilities, Skills and Resu...Machine Learning Engineer Salary, Roles And Responsibilities, Skills and Resu...
Machine Learning Engineer Salary, Roles And Responsibilities, Skills and Resu...
Simplilearn
 

Similaire à Practical Machine Learning (20)

BotConf..pptx
BotConf..pptxBotConf..pptx
BotConf..pptx
 
Practical Natural Language Processing
Practical Natural Language ProcessingPractical Natural Language Processing
Practical Natural Language Processing
 
Machine Learning
Machine LearningMachine Learning
Machine Learning
 
Python Machine Learning Tutorial
Python Machine Learning TutorialPython Machine Learning Tutorial
Python Machine Learning Tutorial
 
Building Data Apps with Python
Building Data Apps with PythonBuilding Data Apps with Python
Building Data Apps with Python
 
ChatGPT in academic settings H2.de
ChatGPT in academic settings H2.deChatGPT in academic settings H2.de
ChatGPT in academic settings H2.de
 
Advances in ML learning process require. ppt.pptx
Advances in ML learning process require. ppt.pptxAdvances in ML learning process require. ppt.pptx
Advances in ML learning process require. ppt.pptx
 
Advances in ML. ppt.pptx
Advances in ML. ppt.pptxAdvances in ML. ppt.pptx
Advances in ML. ppt.pptx
 
How to start a career in AI and Machine Learning?
How to start a career in AI and Machine Learning?How to start a career in AI and Machine Learning?
How to start a career in AI and Machine Learning?
 
How to Start a career in AI and Machine Learning?
How to Start a career in AI and Machine Learning?How to Start a career in AI and Machine Learning?
How to Start a career in AI and Machine Learning?
 
Cognitive Automation - Your AI Coworker
Cognitive Automation - Your AI CoworkerCognitive Automation - Your AI Coworker
Cognitive Automation - Your AI Coworker
 
OpenAI Chatgpt.pptx
OpenAI Chatgpt.pptxOpenAI Chatgpt.pptx
OpenAI Chatgpt.pptx
 
Lambda architecture for real time big data
Lambda architecture for real time big dataLambda architecture for real time big data
Lambda architecture for real time big data
 
Екатерина Гордиенко (Serpstat)
Екатерина Гордиенко (Serpstat)Екатерина Гордиенко (Serpstat)
Екатерина Гордиенко (Serpstat)
 
Overview of Artificial Intelligence & Machine Learning
Overview of Artificial Intelligence & Machine LearningOverview of Artificial Intelligence & Machine Learning
Overview of Artificial Intelligence & Machine Learning
 
Agile Network India | Agility Day @Noida | Enterprise agility through enginee...
Agile Network India | Agility Day @Noida | Enterprise agility through enginee...Agile Network India | Agility Day @Noida | Enterprise agility through enginee...
Agile Network India | Agility Day @Noida | Enterprise agility through enginee...
 
Webinar - AI Powered Recommendation Engine for Businesses
Webinar - AI Powered Recommendation Engine for BusinessesWebinar - AI Powered Recommendation Engine for Businesses
Webinar - AI Powered Recommendation Engine for Businesses
 
ML vs AI
ML vs AIML vs AI
ML vs AI
 
Machine Learning Engineer Salary, Roles And Responsibilities, Skills and Resu...
Machine Learning Engineer Salary, Roles And Responsibilities, Skills and Resu...Machine Learning Engineer Salary, Roles And Responsibilities, Skills and Resu...
Machine Learning Engineer Salary, Roles And Responsibilities, Skills and Resu...
 
Academic Integrity and Gen AI -Basic Concepts and Skills
Academic Integrity and Gen AI -Basic Concepts and SkillsAcademic Integrity and Gen AI -Basic Concepts and Skills
Academic Integrity and Gen AI -Basic Concepts and Skills
 

Plus de Jaganadh Gopinadhan

Natural Language Processing with Per
Natural Language Processing with PerNatural Language Processing with Per
Natural Language Processing with Per
Jaganadh Gopinadhan
 
Indian Language Spellchecker Development for OpenOffice.org
Indian Language Spellchecker Development for OpenOffice.org Indian Language Spellchecker Development for OpenOffice.org
Indian Language Spellchecker Development for OpenOffice.org
Jaganadh Gopinadhan
 
Sanskrit and Computational Linguistic
Sanskrit and Computational Linguistic Sanskrit and Computational Linguistic
Sanskrit and Computational Linguistic
Jaganadh Gopinadhan
 
Script to Sentiment : on future of Language TechnologyMysore latest
Script to Sentiment : on future of Language TechnologyMysore latestScript to Sentiment : on future of Language TechnologyMysore latest
Script to Sentiment : on future of Language TechnologyMysore latest
Jaganadh Gopinadhan
 

Plus de Jaganadh Gopinadhan (19)

Introduction to Sentiment Analysis
Introduction to Sentiment AnalysisIntroduction to Sentiment Analysis
Introduction to Sentiment Analysis
 
Elements of Text Mining Part - I
Elements of Text Mining Part - IElements of Text Mining Part - I
Elements of Text Mining Part - I
 
Natural Language Processing with Per
Natural Language Processing with PerNatural Language Processing with Per
Natural Language Processing with Per
 
Indian Language Spellchecker Development for OpenOffice.org
Indian Language Spellchecker Development for OpenOffice.org Indian Language Spellchecker Development for OpenOffice.org
Indian Language Spellchecker Development for OpenOffice.org
 
Sanskrit and Computational Linguistic
Sanskrit and Computational Linguistic Sanskrit and Computational Linguistic
Sanskrit and Computational Linguistic
 
Script to Sentiment : on future of Language TechnologyMysore latest
Script to Sentiment : on future of Language TechnologyMysore latestScript to Sentiment : on future of Language TechnologyMysore latest
Script to Sentiment : on future of Language TechnologyMysore latest
 
A tutorial on Machine Translation
A tutorial on Machine TranslationA tutorial on Machine Translation
A tutorial on Machine Translation
 
Linguistic localization framework for Ooo
Linguistic localization framework for OooLinguistic localization framework for Ooo
Linguistic localization framework for Ooo
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
 
Ilucbe python v1.2
Ilucbe python v1.2Ilucbe python v1.2
Ilucbe python v1.2
 
Social Media Analytics
Social Media Analytics Social Media Analytics
Social Media Analytics
 
Success Factor
Success Factor Success Factor
Success Factor
 
ntroduction to GNU/Linux Linux Installation and Basic Commands
ntroduction to GNU/Linux Linux Installation and Basic Commands ntroduction to GNU/Linux Linux Installation and Basic Commands
ntroduction to GNU/Linux Linux Installation and Basic Commands
 
Let’s Learn Python An introduction to Python
Let’s Learn Python An introduction to Python Let’s Learn Python An introduction to Python
Let’s Learn Python An introduction to Python
 
Introduction to Free and Open Source Software
Introduction to Free and Open Source Software Introduction to Free and Open Source Software
Introduction to Free and Open Source Software
 
Opinion Mining and Sentiment Analysis Issues and Challenges
Opinion Mining and Sentiment Analysis Issues and Challenges Opinion Mining and Sentiment Analysis Issues and Challenges
Opinion Mining and Sentiment Analysis Issues and Challenges
 
What they think about my brand/product ?!?!? An Introduction to Sentiment Ana...
What they think about my brand/product ?!?!? An Introduction to Sentiment Ana...What they think about my brand/product ?!?!? An Introduction to Sentiment Ana...
What they think about my brand/product ?!?!? An Introduction to Sentiment Ana...
 
Hdfs
HdfsHdfs
Hdfs
 
Will Foss get me a Job?
Will Foss get me a Job?Will Foss get me a Job?
Will Foss get me a Job?
 

Dernier

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 

Dernier (20)

presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
 
Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 

Practical Machine Learning

  • 1. Practical Machine Learning Jaganadh G jaganadhg@gmail.com BarCamp Kerala 9 Amrita Vishwa Vidyapeetham Karunagapally 14 November 2010 Jaganadh G Practical Machine Learning
  • 2. About me !! Working in Natural Language Processing, Machine Learning, Data Mining etc... Passionate about Free and Open source :-) When gets free time teaches Python and blogs at http://jaganadhg.freeflux.net/blog Working as Project Lead (NLP) 365Media Pvt. Ltd. Coimbatore I am a computational linguist / Linguist and Indologist Now Software Engineer by Profession Jaganadh G Practical Machine Learning
  • 3. Machine Learning Machine Learning Machine learning is a subfield of artificial intelligence (AI) concerned with algorithms that allow computers to learn. Jaganadh G Practical Machine Learning
  • 4. Machine Learning Machine Learning Machine learning is a subfield of artificial intelligence (AI) concerned with algorithms that allow computers to learn. Jaganadh G Practical Machine Learning
  • 5. Machine Learning Machine Learning Machine learning is a subfield of artificial intelligence (AI) concerned with algorithms that allow computers to learn. This talk is not aimed to give introduction about Machine Learning Jaganadh G Practical Machine Learning
  • 6. Machine Learning Machine Learning Machine learning is a subfield of artificial intelligence (AI) concerned with algorithms that allow computers to learn. This talk is not aimed to give introduction about Machine Learning Dont expect some mathy equations here Jaganadh G Practical Machine Learning
  • 7. Machine Learning and Our Life Do you think that Machine Learning has any impact in our life ?? Jaganadh G Practical Machine Learning
  • 8. Machine Learning and Our Life Do you think that Machine Learning has any impact in our life ?? Yes Jaganadh G Practical Machine Learning
  • 9. Machine Learning and Our Life Do you think that Machine Learning has any impact in our life ?? Yes In our day to day life we may use many Machine Learning powered tools Jaganadh G Practical Machine Learning
  • 10. Machine Learning and Our Life Do you think that Machine Learning has any impact in our life ?? Yes In our day to day life we may use many Machine Learning powered tools E-mail spam filtering , product recommendations etc .. Jaganadh G Practical Machine Learning
  • 11. Machine Learning and Our Life Do you think that Machine Learning has any impact in our life ?? Yes In our day to day life we may use many Machine Learning powered tools E-mail spam filtering , product recommendations etc .. Fraud detection Jaganadh G Practical Machine Learning
  • 12. Examples Jaganadh G Practical Machine Learning
  • 13. Examples Jaganadh G Practical Machine Learning
  • 14. Examples Jaganadh G Practical Machine Learning
  • 15. Tool for building Machine Learning powerd product/service Apache Mahout Apache Mahout is a scalable machine learning library that supports large data sets. Apache Mahout’s goal is to build scalable machine learning libraries. Commercially friendly licence Well documented Healthy community Targeted to developers Jaganadh G Practical Machine Learning
  • 16. Algorithms in Apache Mahout Jaganadh G Practical Machine Learning
  • 17. Algorithms in Apache Mahout Collaborative Filtering Jaganadh G Practical Machine Learning
  • 18. Algorithms in Apache Mahout Collaborative Filtering User and Item based recommenders Jaganadh G Practical Machine Learning
  • 19. Algorithms in Apache Mahout Collaborative Filtering User and Item based recommenders K-Means, Fuzzy K-Means clustering Jaganadh G Practical Machine Learning
  • 20. Algorithms in Apache Mahout Collaborative Filtering User and Item based recommenders K-Means, Fuzzy K-Means clustering Mean Shift clustering Jaganadh G Practical Machine Learning
  • 21. Algorithms in Apache Mahout Collaborative Filtering User and Item based recommenders K-Means, Fuzzy K-Means clustering Mean Shift clustering Dirichlet process clustering Jaganadh G Practical Machine Learning
  • 22. Algorithms in Apache Mahout Collaborative Filtering User and Item based recommenders K-Means, Fuzzy K-Means clustering Mean Shift clustering Dirichlet process clustering Latent Dirichlet Allocation Jaganadh G Practical Machine Learning
  • 23. Algorithms in Apache Mahout Collaborative Filtering User and Item based recommenders K-Means, Fuzzy K-Means clustering Mean Shift clustering Dirichlet process clustering Latent Dirichlet Allocation Singular value decomposition Jaganadh G Practical Machine Learning
  • 24. Algorithms in Apache Mahout Collaborative Filtering User and Item based recommenders K-Means, Fuzzy K-Means clustering Mean Shift clustering Dirichlet process clustering Latent Dirichlet Allocation Singular value decomposition Parallel Frequent Pattern mining Jaganadh G Practical Machine Learning
  • 25. Algorithms in Apache Mahout Collaborative Filtering User and Item based recommenders K-Means, Fuzzy K-Means clustering Mean Shift clustering Dirichlet process clustering Latent Dirichlet Allocation Singular value decomposition Parallel Frequent Pattern mining Complementary Naive Bayes classifier Jaganadh G Practical Machine Learning
  • 26. Algorithms in Apache Mahout Collaborative Filtering User and Item based recommenders K-Means, Fuzzy K-Means clustering Mean Shift clustering Dirichlet process clustering Latent Dirichlet Allocation Singular value decomposition Parallel Frequent Pattern mining Complementary Naive Bayes classifier Random forest decision tree based classifier Jaganadh G Practical Machine Learning
  • 27. Demo Building recommendations engines with Mahout Document Classification with Mahout Some Python stuff on Machine Learning Jaganadh G Practical Machine Learning
  • 28. Reference Jaganadh G Practical Machine Learning
  • 29. Reference Mahout in Action - Book by Sean Owen and Robin Anil, published by Manning Publications. Taming Text - By Grant Ingersoll and Tom Morton, published by Manning Publications. Introducing Apache Mahout - Grant Ingersoll - Intro to Apache Mahout focused on clustering, classification and collaborative filtering. https://www.ibm.com/developerworks/java/library/j- mahout/index.html Programming Collective Intelligence: Building Smart Web 2.0 Applications http://www.amazon.com/Programming-Collective- Intelligence-Building-Applications/dp/0596529325 Jaganadh G Practical Machine Learning
  • 30. Useful Resources Apache Mahout Site http://mahout.apache.org/ Apache Mahout Mailing List user@mahout.apache.org The code which I used for Mahout demo is available at http://bitbucket.org/jaganadhg/blog/src/tip/bck9/java/ Twenty News Group data set http://people.csail.mit.edu/jrennie/20Newsgroups/20news- bydate.tar.gz Jaganadh G Practical Machine Learning
  • 31. Questions ?? Jaganadh G Practical Machine Learning
  • 32. Acknowledgments Thanks to : Manning Publications for Review Copy of the book ”Mahout in Action” Apache Mahout mailing list members Ted Dunning and Robin Anil for suggestions Sreejith S and Biju B for Java help @chelakkandupoda for review and criticism Mukundhanchari R&D Director 365Media Pvt. Ltd. for support and encouragement Jaganadh G Practical Machine Learning
  • 33. Finally Jaganadh G Practical Machine Learning