MongoDB use cases and setup
involving Elasticsearch
MongoDB Meetup @hikeapp Gurgaon
Bharvi Dixit
@d_bharvi
13th February 2015
Agenda
• About Me and Orkash.
• Why we chose MongoDB.
• Our use cases and setup of MongoDB.
• Better Than Apple: MongoDB-Elasticsearch.
• Elasticsearch: An Overview.
• The most common issues.
• Mongo University: Learn from the masters.
About Me
• Software Engineer @Orkash.
• Organizer and Speaker @Delhi Elasticsearch Meetup.
• Loves Java, Data, Elasticsearch, MongoDB, Eclipse.
• Interested in all things scale, search, security & DevOps.
• Working with NoSQL databases for more than a year.
• Social Media and News Media Intelligence. (Complex
schemas & query designs)
About Orkash
• Founded in 2007 by Ashish Sonal.
• An R&D-driven company that provides a Big Data Automated Intelligence
Platform with a focus on the following areas:
– Counter-terrorism, Security intelligence and Risk management.
– Political Consulting And Homeland Security.
– Decision Support Systems.
– Market/Brand intelligence.
• We create the FOUR pillars of Automated intelligence:
– Information Extraction and Monitoring.
– Semantic and Link Analysis.
– Geo-Spatial Analysis.
– Data Mining & Forensics.
Everything starts with a problem..!!
• Data Driven Decisions
• Logfiles for scaling up/down
• Warehouse withdrawal triggers orders
• History for fraud detection
• Internet of Things and Smart Cities.
... data explosion
Everything starts with a problem..!!
Better decisions == more data
And NoSQL adds more problems
Data
Big Data
BIG DATA
Big Data Problem goes on..
• I need BIG DATA.
• I need to analyze this data.
• I need to enrich this big data & make it even bigger.
• I need fast searching.
• I need real-time analytics.
• Ohh wait.. I need relational queries on this big data to get
more insights..
Why we chose MongoDB
• It does the impossible. (Can incorporate any kind of data)
• Document model.
• Distributed computing.
• Awesome sharding and replication.
• Scales big (horizontally) on commodity hardware.
• Powerful analytics with the aggregation framework.
• High persistence and read-write performance.
• Awesome security features.
• OS-Managed memory management.
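As a rough illustration of the "analytics with the aggregation framework" point, here is a minimal sketch of a pipeline that counts documents per source. The field names (`source`, `lang`) and the sample documents are assumptions made up for this example, not taken from the talk. With a driver such as PyMongo, the `pipeline` list would be passed to `collection.aggregate(pipeline)`; the pure-Python function below only mimics what the pipeline computes so the idea can be checked without a server.

```python
from collections import Counter

# Hypothetical sample documents; field names are assumptions for illustration.
articles = [
    {"source": "twitter", "lang": "en"},
    {"source": "news", "lang": "en"},
    {"source": "twitter", "lang": "hi"},
    {"source": "twitter", "lang": "en"},
]

# Pipeline stages: filter, group by source with a count, sort by count descending.
pipeline = [
    {"$match": {"lang": "en"}},
    {"$group": {"_id": "$source", "count": {"$sum": 1}}},
    {"$sort": {"count": -1}},
]


def count_by_source(docs, lang):
    """Pure-Python stand-in for the pipeline above (illustration only)."""
    counts = Counter(d["source"] for d in docs if d["lang"] == lang)
    return [{"_id": src, "count": n} for src, n in counts.most_common()]


result = count_by_source(articles, "en")
```

The point of the design is that the grouping runs inside the database, next to the data, instead of pulling every document into the application first.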
Our use cases and setup of MongoDB.
• A primary data store for collecting and storing humongous
amounts of unstructured/semi-structured text.
• Building GIS applications for government and security agencies
using geospatial features.
• Data analytics.
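For the geospatial use case, a proximity search is the typical building block. The sketch below builds a `$near` filter for a field that stores GeoJSON points; it assumes the collection has a `2dsphere` index on that field. The field name `location` and the helper `near_query` are hypothetical, chosen only for this example.

```python
def near_query(lon, lat, max_meters, field="location"):
    """Build a $near filter for a field holding GeoJSON points.

    `field` is a hypothetical field name; the collection is assumed to
    have a 2dsphere index on it.
    """
    if not (-180 <= lon <= 180 and -90 <= lat <= 90):
        raise ValueError("longitude/latitude out of range")
    return {
        field: {
            "$near": {
                # GeoJSON order is longitude first, then latitude.
                "$geometry": {"type": "Point", "coordinates": [lon, lat]},
                # Distance in meters for GeoJSON points.
                "$maxDistance": max_meters,
            }
        }
    }


# Roughly Gurgaon, within 5 km.
query = near_query(77.03, 28.46, 5000)
```

The resulting dictionary is what a driver's `find()` call would receive as its filter. Swapping latitude and longitude is the most common mistake here, which is why the helper validates the ranges.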
Our use cases and setup of MongoDB.
Our current production setup has 14 nodes:
Node type              # of nodes   Hardware specification (per node)
Data nodes             5            20 GB RAM, 8-core CPU
Mongos (VMs)           4            4 GB RAM, 4-core CPU
Arbiter nodes (VMs)    2            1 GB RAM, 1-core CPU
Config servers (VMs)   3            4 GB RAM, 2-core CPU
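As a quick sanity check on the figures above, the topology can be written down as data and totalled. This is only arithmetic over the numbers on the slide; the dictionary keys are names invented for this sketch.

```python
# Node counts and per-node specs as listed on the slide.
# The keys ("data", "mongos", ...) are labels chosen for this sketch.
nodes = {
    "data":    {"count": 5, "ram_gb": 20, "cores": 8},
    "mongos":  {"count": 4, "ram_gb": 4,  "cores": 4},
    "arbiter": {"count": 2, "ram_gb": 1,  "cores": 1},
    "config":  {"count": 3, "ram_gb": 4,  "cores": 2},
}

total_nodes = sum(n["count"] for n in nodes.values())
total_ram_gb = sum(n["count"] * n["ram_gb"] for n in nodes.values())
total_cores = sum(n["count"] * n["cores"] for n in nodes.values())

print(total_nodes, total_ram_gb, total_cores)  # prints: 14 130 64
```

Most of the memory (100 of 130 GB) sits on the five data nodes; the routers, arbiters, and config servers are small enough to run as VMs.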
Better Than Apple: MongoDB-Elasticsearch
• One of the greatest
combinations this era has
seen.
• Continuous improvements
• Each fills in the other's
missing features.
• Both have very similar
concepts and data types.
• Both keep cloud in mind.
• Driven by Open-Source
community, knowledge
sharing, and High
collaboration with users.
Better Than Apple: MongoDB-Elasticsearch
Sources: Twitter
Elasticsearch Overview
What is Elasticsearch:
• “you know, for search”
• Schema-free, REST & JSON based distributed full-text
search engine & document store.
• Written in Java & built on top of Lucene.
• Highly reliable, scalable, fault tolerant.
• Supports distributed indexing, replication, and load-
balanced querying.
• Powerful geospatial queries.
• Latest release: 1.4.2
Wait..!! Schema Free?? The real gotcha.. Mongo-ES breakup :(
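One plausible reading of this gotcha: MongoDB will store a field as a number in one document and as an embedded object in the next, while Elasticsearch fixes a field's type in the index mapping the first time it sees the field. The toy model below is not Elasticsearch code; `ToyIndex` and `infer_type` are invented for this sketch, and it is stricter than the real engine, which coerces some values (for example, numeric strings into numeric fields).

```python
def infer_type(value):
    """Very rough stand-in for dynamic type detection (simplified assumption)."""
    if isinstance(value, bool):
        return "boolean"
    if isinstance(value, int):
        return "long"
    if isinstance(value, float):
        return "double"
    if isinstance(value, dict):
        return "object"
    return "string"


class ToyIndex:
    """Toy index: the first document to use a field locks that field's type."""

    def __init__(self):
        self.mapping = {}
        self.docs = []

    def index(self, doc):
        inferred = {field: infer_type(value) for field, value in doc.items()}
        # Validate every field before changing any state.
        for field, kind in inferred.items():
            locked = self.mapping.get(field, kind)
            if kind != locked:
                raise ValueError(
                    f"field '{field}' is mapped as {locked}, got {kind}"
                )
        self.mapping.update(inferred)
        self.docs.append(doc)


idx = ToyIndex()
idx.index({"user": "a", "age": 31})          # locks age as a number
try:
    idx.index({"user": "b", "age": {"years": 31}})  # same field, now an object
    rejected = False
except ValueError:
    rejected = True
```

Both documents are perfectly valid in one MongoDB collection, so a sync from MongoDB to Elasticsearch can fail on data that MongoDB accepted without complaint.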
Elasticsearch Overview
What does it add to Lucene:
• REST service: JSON APIs over HTTP
• High Availability & Performance: Clustering & Replication
• A Powerful query DSL.
• Interoperation with non-Java/JVM languages.
• More and more Resilience.
• Multitenancy
• And the best one: it lets you maintain relationships
among documents.
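To make "JSON APIs over HTTP" and "query DSL" concrete, here is a minimal sketch that assembles a search request by hand. It builds a `match` query and targets the `_search` endpoint of an index. The index name `articles`, the field `title`, and the helper `build_search_request` are assumptions for this example; nothing is sent over the network.

```python
import json


def build_search_request(index, field, text, size=10):
    """Return (method, path, body) for a full-text match query.

    `index` and `field` are caller-supplied; the names used below are
    hypothetical examples.
    """
    body = {
        "query": {"match": {field: text}},
        "from": 0,
        "size": size,
    }
    return "POST", f"/{index}/_search", json.dumps(body)


method, path, payload = build_search_request(
    "articles", "title", "mongodb sharding", size=5
)
```

Because the whole request is a method, a path, and a JSON body, any language with an HTTP client can talk to the cluster, which is the "interoperation with non-Java/JVM languages" point.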
The Elasticsearch Open Source Model
Understanding Elasticsearch Structure with respect to
MongoDB
The most common issues..
1. Distributed computing comes with two problems:
Node failures and Network Bottlenecks
Node failures can be handled by MongoDB very easily, but
network bottlenecks/partitions won't let you sleep at night
because of replica set failovers and rollbacks.
Separate networks for reads and writes.
2. Assuring Business continuity plan
mongodump is not a good fit for backing up large datasets.
3. Data Modeling
4. Keeping a close eye on connections
5. Importing embedded documents in CSV
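On issue 5, CSV is flat while MongoDB documents nest, so something has to map one shape onto the other. A common convention is dotted column names (`geo.lat`), with array positions as numeric path segments. The sketch below shows the flattening direction only; the helpers `flatten` and `to_csv` and the sample documents are invented for this example, and the talk does not say which approach the author used.

```python
import csv
import io


def flatten(doc, prefix=""):
    """Flatten nested dicts and lists into a single dict with dotted keys."""
    flat = {}
    for key, value in doc.items():
        path = f"{prefix}{key}"
        if isinstance(value, dict):
            flat.update(flatten(value, path + "."))
        elif isinstance(value, list):
            for i, item in enumerate(value):
                if isinstance(item, dict):
                    flat.update(flatten(item, f"{path}.{i}."))
                else:
                    flat[f"{path}.{i}"] = item
        else:
            flat[path] = value
    return flat


def to_csv(docs):
    """Write flattened documents as CSV; missing columns become empty cells."""
    rows = [flatten(d) for d in docs]
    columns = sorted({column for row in rows for column in row})
    out = io.StringIO()
    writer = csv.DictWriter(
        out, fieldnames=columns, restval="", lineterminator="\n"
    )
    writer.writeheader()
    writer.writerows(rows)
    return out.getvalue()


# Hypothetical sample documents.
docs = [
    {"name": "a", "geo": {"lat": 28.46, "lon": 77.03}, "tags": ["x", "y"]},
    {"name": "b", "geo": {"lat": 19.08, "lon": 72.88}},
]
csv_text = to_csv(docs)
```

The cost of this convention shows in the second row: documents with differing shapes produce sparse columns, and arrays of varying length widen the header.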
Mongo University: Learn from the masters..!!
Thank You for Listening
bharvidixit@yahoo.com
https://twitter.com/d_bharvi
http://www.meetup.com/Delhi-Elasticsearch-Meetup/
http://www.slideshare.net/bharvidixit/
