SlideShare une entreprise Scribd logo
1  sur  18
Introduction To NOSQL
Agenda
 Overview of NoSQL
 Why NoSQL?
 NoSQL Market Overview
 Categories of NoSQL databases
 Hadoop – Overview
Overview of NoSQL
A term which stands for
Overview of NoSQL (Contd…)
 NoSQL doesn’t mean to stop using SQL or SQL won’t be used.
 The term refers to those databases that differ from relational databases.
 Simply Non-relational databases.
 NoSQL is a non-relational database management systems, different from
traditional relational database management systems in some significant ways.
 It is designed for distributed data stores where very large scale of data storing
needs (for example Google or Facebook which collects terabits of data every
day for their users). These type of data storing may not require fixed schema,
avoid join operations and typically scale horizontally.
NoSQL databases are eventually consistent / CAP (not ACID).
CAP theorem:
 Consistency - This means that the data in the database remains consistent
after the execution of an operation. For example after an update operation all
clients see the same data.
 Availability - This means that the system is always on (service guarantee
availability), no downtime.
Node failures do not prevent survivors from continuing to operate
 Partition Tolerance - This means that the system continues to function even
the communication among the servers is unreliable, i.e. the servers may be
partitioned into multiple groups that cannot communicate with one another.
Overview of NoSQL (Contd…)
Overview of NoSQL (Contd…)
NoSQL Features:
1. Scalability
To maintain performance.
 Horizontal Scalability:
To increase the number of machines but maintaining proportional
performance.
Vertical scalability:
To add more resources to your single machine to optimize
performance
2. Open Source
Most of the NoSQL Projects are Open source. So any one can use, modify
it, like
 Cassandra by facebook.
 Bigtable by Google but only allowed for Google application.
3. Schema Freeness
 NoSQL databases doesn’t use any fixed schema like relational database.
 Internal schema
 External schema etc
 The original intention of NoSQL is the modern web-scale databases.
There are large number of companies using NoSQL. To name a few :
• Google
• Facebook
• Mozilla
• Adobe
Overview of NoSQL (Contd…)
• Foursquare
• LinkedIn
• Digg
• McGraw-Hill Education
WHY NOSQL?
Benefits of NOSQL:
1. Scaling
RDBs weren’t easy to scale out.
On the other hand NoSQL DBs are specially designed to scale out.
2. Big data
Single RDBMS is almost unable to handle today’s huge amount of data and
the transaction on that data.
But
Non-Relational databases are specially designed to handle big data.
Data is becoming easier to capture and access through third parties such as
Facebook, D&B, and others. Personal user information, geo location data,
social graphs, user-generated content, machine logging data, and sensor-
generated data are just a few examples of the ever-expanding array of data
being captured.
3. Needs no Expert DBAs
Although RDMS vendors claim that RDBMS provide management facilities
but it still need an expert DBA to operate it.
In contrast NoSQL DBs don’t need expert DBAs, as it provides automatic
repair, data distribution, and simpler data models, which lead to lower
administration.
WHY NOSQL? (CONTD…)
4. Economics
RDBMS requires expensive components for providing efficient service.
NoSQL uses cheap commodity servers to manage the same amount of
data for which RDBMS needs expensive server. So NoSQL is economical
as well.
5. Flexibility of data models
There can occur changes in the requirements of an organization with the
passage of time. Changes in RDBMS after its deployment creates
many problems and also affects its services or some time it’s even almost
impossible to make changes. NoSQL database can be changed at
any instance, i.e. existing columns can be altered and new can be added.
WHY NOSQL? (CONTD…)
Scale up with relational technology: limitations at the database tier
Source: http://www.couchbase.com/why-nosql/nosql-database
WHY NOSQL? (CONTD…)
Source: http://www.couchbase.com/why-nosql/nosql-database
Scale out with NoSQL technology at the database tier
NOSQL MARKET OVERVIEW
Source: Wikibon 2013 (http://wikibon.org/wiki/v/Hadoop-
NoSQL_Software_and_Services_Market_Forecast_2012-2017)
Hadoop/NoSQL Software and Services Marketshare, 2012
NOSQL MARKET OVERVIEW (CONTD…)
Hadoop/NoSQL Software and Services Market Forecast, 2012-2017
Source: Wikibon 2013 (http://wikibon.org/wiki/v/Hadoop-
NoSQL_Software_and_Services_Market_Forecast_2012-2017)
CATEGORIES OF NOSQL DATABASES
There is a variety of types:
• Column Store – Each storage block contains data from only one column
• Document Store – stores documents made up of tagged elements
• Key-Value Store – Hash table of keys
1. Column Store
• Each storage block contains data from only one column
• Example: Hadoop/Hbase
 http://hadoop.apache.org/
 Clients : Yahoo, Facebook
• Example: Ingres VectorWise
 Column Store integrated with an SQL database
• More efficient than row (or document) store if:
 Multiple row/record/documents are inserted at the same time so updates of
column blocks can be aggregated
 Retrievals access only some of the columns in a row/record/document
CATEGORIES OF NOSQL DATABASES (CONTD…)
2. Document Store:
• It stores documents made up of tagged elements.
• Example: CouchDB
 http://couchdb.apache.org/
 Clients - BBC
• Example: MongoDB
 http://www.mongodb.org/
 Clients - Foursquare, Shutterfly
CATEGORIES OF NOSQL DATABASES (CONTD…)
3. Key-Value Store:
• Hash table of keys
• Values stored with Keys
• Fast access to small data values
• Example – Project-Voldemort
 http://www.project-voldemort.com/
 Clients : Linkedin
• Example – MemCacheDB
 http://memcachedb.org/
HADOOP - OVERVIEW
 The Apache Hadoop software library is a framework that allows for the distributed
processing of large data sets across clusters of computers using simple
programming models.
 It is designed to scale up from single servers to thousands of machines, each
offering local computation and storage.
 Rather than rely on hardware to deliver high-availability, the library itself is designed
to detect and handle failures at the application layer, so delivering a highly-available
service on top of a cluster of computers, each of which may be prone to failures.
The Apache Hadoop framework is composed of the following modules :
 Hadoop Common - contains libraries and utilities needed by other Hadoop modules
 Hadoop Distributed File System (HDFS) - a distributed file-system that stores data
on the commodity machines, providing very high aggregate bandwidth across the
cluster.
 Hadoop YARN - a resource-management platform responsible for managing
compute resources in clusters and using them for scheduling of users' applications.
 Hadoop MapReduce - a programming model for large scale data processing.
Thank You

Contenu connexe

Tendances

NoSQL databases - An introduction
NoSQL databases - An introductionNoSQL databases - An introduction
NoSQL databases - An introductionPooyan Mehrparvar
 
Nonrelational Databases
Nonrelational DatabasesNonrelational Databases
Nonrelational DatabasesUdi Bauman
 
introduction to NOSQL Database
introduction to NOSQL Databaseintroduction to NOSQL Database
introduction to NOSQL Databasenehabsairam
 
NOSQL Databases types and Uses
NOSQL Databases types and UsesNOSQL Databases types and Uses
NOSQL Databases types and UsesSuvradeep Rudra
 
NOSQL- Presentation on NoSQL
NOSQL- Presentation on NoSQLNOSQL- Presentation on NoSQL
NOSQL- Presentation on NoSQLRamakant Soni
 
Non relational databases-no sql
Non relational databases-no sqlNon relational databases-no sql
Non relational databases-no sqlRam kumar
 
Appache Cassandra
Appache Cassandra  Appache Cassandra
Appache Cassandra nehabsairam
 
Introduction to NOSQL databases
Introduction to NOSQL databasesIntroduction to NOSQL databases
Introduction to NOSQL databasesAshwani Kumar
 
NoSql Data Management
NoSql Data ManagementNoSql Data Management
NoSql Data Managementsameerfaizan
 
NoSQL - 05March2014 Seminar
NoSQL - 05March2014 SeminarNoSQL - 05March2014 Seminar
NoSQL - 05March2014 SeminarJainul Musani
 
NoSQL Slideshare Presentation
NoSQL Slideshare Presentation NoSQL Slideshare Presentation
NoSQL Slideshare Presentation Ericsson Labs
 
NoSQL Data Architecture Patterns
NoSQL Data ArchitecturePatternsNoSQL Data ArchitecturePatterns
NoSQL Data Architecture PatternsMaynooth University
 
Sql vs NoSQL-Presentation
 Sql vs NoSQL-Presentation Sql vs NoSQL-Presentation
Sql vs NoSQL-PresentationShubham Tomar
 

Tendances (20)

NoSQL databases
NoSQL databasesNoSQL databases
NoSQL databases
 
NoSQL databases - An introduction
NoSQL databases - An introductionNoSQL databases - An introduction
NoSQL databases - An introduction
 
Introduction to NoSQL
Introduction to NoSQLIntroduction to NoSQL
Introduction to NoSQL
 
Nonrelational Databases
Nonrelational DatabasesNonrelational Databases
Nonrelational Databases
 
introduction to NOSQL Database
introduction to NOSQL Databaseintroduction to NOSQL Database
introduction to NOSQL Database
 
NOSQL Databases types and Uses
NOSQL Databases types and UsesNOSQL Databases types and Uses
NOSQL Databases types and Uses
 
NOSQL- Presentation on NoSQL
NOSQL- Presentation on NoSQLNOSQL- Presentation on NoSQL
NOSQL- Presentation on NoSQL
 
Non relational databases-no sql
Non relational databases-no sqlNon relational databases-no sql
Non relational databases-no sql
 
NoSQL databases
NoSQL databasesNoSQL databases
NoSQL databases
 
Appache Cassandra
Appache Cassandra  Appache Cassandra
Appache Cassandra
 
Nosql databases
Nosql databasesNosql databases
Nosql databases
 
Introduction to NOSQL databases
Introduction to NOSQL databasesIntroduction to NOSQL databases
Introduction to NOSQL databases
 
the rising no sql technology
the rising no sql technologythe rising no sql technology
the rising no sql technology
 
NoSql Data Management
NoSql Data ManagementNoSql Data Management
NoSql Data Management
 
Mongo db
Mongo dbMongo db
Mongo db
 
NoSQL - 05March2014 Seminar
NoSQL - 05March2014 SeminarNoSQL - 05March2014 Seminar
NoSQL - 05March2014 Seminar
 
NoSQL Consepts
NoSQL ConseptsNoSQL Consepts
NoSQL Consepts
 
NoSQL Slideshare Presentation
NoSQL Slideshare Presentation NoSQL Slideshare Presentation
NoSQL Slideshare Presentation
 
NoSQL Data Architecture Patterns
NoSQL Data ArchitecturePatternsNoSQL Data ArchitecturePatterns
NoSQL Data Architecture Patterns
 
Sql vs NoSQL-Presentation
 Sql vs NoSQL-Presentation Sql vs NoSQL-Presentation
Sql vs NoSQL-Presentation
 

En vedette

SO YOU WANT TO REFURBISH COMPUTERS
SO YOU WANT TO REFURBISH COMPUTERSSO YOU WANT TO REFURBISH COMPUTERS
SO YOU WANT TO REFURBISH COMPUTERSPeter Sobotta
 
Ppt laptop repair
Ppt laptop repairPpt laptop repair
Ppt laptop repairJo Polancos
 
Docker from basics to orchestration (PHPConfBr2015)
Docker from basics to orchestration (PHPConfBr2015)Docker from basics to orchestration (PHPConfBr2015)
Docker from basics to orchestration (PHPConfBr2015)Wellington Silva
 
Basic Concepts of OOPs (Object Oriented Programming in Java)
Basic Concepts of OOPs (Object Oriented Programming in Java)Basic Concepts of OOPs (Object Oriented Programming in Java)
Basic Concepts of OOPs (Object Oriented Programming in Java)Michelle Anne Meralpis
 
Docker: The basics - Including a demo with an awesome full-stack JS app
Docker: The basics - Including a demo with an awesome full-stack JS appDocker: The basics - Including a demo with an awesome full-stack JS app
Docker: The basics - Including a demo with an awesome full-stack JS appMarcelo Rodrigues
 
oops concept in java | object oriented programming in java
oops concept in java | object oriented programming in javaoops concept in java | object oriented programming in java
oops concept in java | object oriented programming in javaCPD INDIA
 
LAPTOP SERVICING
LAPTOP SERVICING LAPTOP SERVICING
LAPTOP SERVICING Akhil T S
 

En vedette (11)

SO YOU WANT TO REFURBISH COMPUTERS
SO YOU WANT TO REFURBISH COMPUTERSSO YOU WANT TO REFURBISH COMPUTERS
SO YOU WANT TO REFURBISH COMPUTERS
 
Ppt laptop repair
Ppt laptop repairPpt laptop repair
Ppt laptop repair
 
Docker from basics to orchestration (PHPConfBr2015)
Docker from basics to orchestration (PHPConfBr2015)Docker from basics to orchestration (PHPConfBr2015)
Docker from basics to orchestration (PHPConfBr2015)
 
Basics of oops concept
Basics of oops conceptBasics of oops concept
Basics of oops concept
 
Basic Concepts of OOPs (Object Oriented Programming in Java)
Basic Concepts of OOPs (Object Oriented Programming in Java)Basic Concepts of OOPs (Object Oriented Programming in Java)
Basic Concepts of OOPs (Object Oriented Programming in Java)
 
Docker: The basics - Including a demo with an awesome full-stack JS app
Docker: The basics - Including a demo with an awesome full-stack JS appDocker: The basics - Including a demo with an awesome full-stack JS app
Docker: The basics - Including a demo with an awesome full-stack JS app
 
oops concept in java | object oriented programming in java
oops concept in java | object oriented programming in javaoops concept in java | object oriented programming in java
oops concept in java | object oriented programming in java
 
Magnetism
MagnetismMagnetism
Magnetism
 
LAPTOP SERVICING
LAPTOP SERVICING LAPTOP SERVICING
LAPTOP SERVICING
 
Book Keeping For Beginners
Book Keeping For BeginnersBook Keeping For Beginners
Book Keeping For Beginners
 
Magnetism ppt
Magnetism pptMagnetism ppt
Magnetism ppt
 

Similaire à Introduction to NoSQL

Sql vs NO-SQL database differences explained
Sql vs NO-SQL database differences explainedSql vs NO-SQL database differences explained
Sql vs NO-SQL database differences explainedSatya Pal
 
NoSQL Databases Introduction - UTN 2013
NoSQL Databases Introduction - UTN 2013NoSQL Databases Introduction - UTN 2013
NoSQL Databases Introduction - UTN 2013Facundo Farias
 
1. introduction to no sql
1. introduction to no sql1. introduction to no sql
1. introduction to no sqlAnuja Gunale
 
NoSQLDatabases
NoSQLDatabasesNoSQLDatabases
NoSQLDatabasesAdi Challa
 
Introduction to NoSQL and MongoDB
Introduction to NoSQL and MongoDBIntroduction to NoSQL and MongoDB
Introduction to NoSQL and MongoDBAhmed Farag
 
Why no sql_ibm_cloudant
Why no sql_ibm_cloudantWhy no sql_ibm_cloudant
Why no sql_ibm_cloudantPeter Tutty
 
MongoDB Lab Manual (1).pdf used in data science
MongoDB Lab Manual (1).pdf used in data scienceMongoDB Lab Manual (1).pdf used in data science
MongoDB Lab Manual (1).pdf used in data sciencebitragowthamkumar1
 
Challenges Management and Opportunities of Cloud DBA
Challenges Management and Opportunities of Cloud DBAChallenges Management and Opportunities of Cloud DBA
Challenges Management and Opportunities of Cloud DBAinventy
 
SQL vs NoSQL deep dive
SQL vs NoSQL deep diveSQL vs NoSQL deep dive
SQL vs NoSQL deep diveAhmed Shaaban
 
Big Data technology Landscape
Big Data technology LandscapeBig Data technology Landscape
Big Data technology LandscapeShivanandaVSeeri
 
No sql databases explained
No sql databases explainedNo sql databases explained
No sql databases explainedSalil Mehendale
 
No sqlpresentation
No sqlpresentationNo sqlpresentation
No sqlpresentationSalma Gouia
 
Chapter1: NoSQL: It’s about making intelligent choices
Chapter1: NoSQL: It’s about making intelligent choicesChapter1: NoSQL: It’s about making intelligent choices
Chapter1: NoSQL: It’s about making intelligent choicesMaynooth University
 

Similaire à Introduction to NoSQL (20)

No sql database
No sql databaseNo sql database
No sql database
 
NoSQL Basics and MongDB
NoSQL Basics and  MongDBNoSQL Basics and  MongDB
NoSQL Basics and MongDB
 
Sql vs NO-SQL database differences explained
Sql vs NO-SQL database differences explainedSql vs NO-SQL database differences explained
Sql vs NO-SQL database differences explained
 
NoSQL Databases Introduction - UTN 2013
NoSQL Databases Introduction - UTN 2013NoSQL Databases Introduction - UTN 2013
NoSQL Databases Introduction - UTN 2013
 
Unit 3 MongDB
Unit 3 MongDBUnit 3 MongDB
Unit 3 MongDB
 
1. introduction to no sql
1. introduction to no sql1. introduction to no sql
1. introduction to no sql
 
NoSQLDatabases
NoSQLDatabasesNoSQLDatabases
NoSQLDatabases
 
Introduction to NoSQL and MongoDB
Introduction to NoSQL and MongoDBIntroduction to NoSQL and MongoDB
Introduction to NoSQL and MongoDB
 
Why no sql_ibm_cloudant
Why no sql_ibm_cloudantWhy no sql_ibm_cloudant
Why no sql_ibm_cloudant
 
Know what is NOSQL
Know what is NOSQL Know what is NOSQL
Know what is NOSQL
 
NoSQL and MongoDB
NoSQL and MongoDBNoSQL and MongoDB
NoSQL and MongoDB
 
MongoDB Lab Manual (1).pdf used in data science
MongoDB Lab Manual (1).pdf used in data scienceMongoDB Lab Manual (1).pdf used in data science
MongoDB Lab Manual (1).pdf used in data science
 
Challenges Management and Opportunities of Cloud DBA
Challenges Management and Opportunities of Cloud DBAChallenges Management and Opportunities of Cloud DBA
Challenges Management and Opportunities of Cloud DBA
 
Report 2.0.docx
Report 2.0.docxReport 2.0.docx
Report 2.0.docx
 
SQL vs NoSQL deep dive
SQL vs NoSQL deep diveSQL vs NoSQL deep dive
SQL vs NoSQL deep dive
 
Big Data technology Landscape
Big Data technology LandscapeBig Data technology Landscape
Big Data technology Landscape
 
No sql databases explained
No sql databases explainedNo sql databases explained
No sql databases explained
 
No sqlpresentation
No sqlpresentationNo sqlpresentation
No sqlpresentation
 
Chapter1: NoSQL: It’s about making intelligent choices
Chapter1: NoSQL: It’s about making intelligent choicesChapter1: NoSQL: It’s about making intelligent choices
Chapter1: NoSQL: It’s about making intelligent choices
 
No sql
No sqlNo sql
No sql
 

Dernier

"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 

Dernier (20)

"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 

Introduction to NoSQL

  • 2. Agenda  Overview of NoSQL  Why NoSQL?  NoSQL Market Overview  Categories of NoSQL databases  Hadoop – Overview
  • 3. Overview of NoSQL A term which stands for
  • 4. Overview of NoSQL (Contd…)  NoSQL doesn’t mean to stop using SQL or SQL won’t be used.  The term refers to those databases that differ from relational databases.  Simply Non-relational databases.  NoSQL is a non-relational database management systems, different from traditional relational database management systems in some significant ways.  It is designed for distributed data stores where very large scale of data storing needs (for example Google or Facebook which collects terabits of data every day for their users). These type of data storing may not require fixed schema, avoid join operations and typically scale horizontally.
  • 5. NoSQL databases are eventually consistent / CAP (not ACID). CAP theorem:  Consistency - This means that the data in the database remains consistent after the execution of an operation. For example after an update operation all clients see the same data.  Availability - This means that the system is always on (service guarantee availability), no downtime. Node failures do not prevent survivors from continuing to operate  Partition Tolerance - This means that the system continues to function even the communication among the servers is unreliable, i.e. the servers may be partitioned into multiple groups that cannot communicate with one another. Overview of NoSQL (Contd…)
  • 6. Overview of NoSQL (Contd…) NoSQL Features: 1. Scalability To maintain performance.  Horizontal Scalability: To increase the number of machines but maintaining proportional performance. Vertical scalability: To add more resources to your single machine to optimize performance 2. Open Source Most of the NoSQL Projects are Open source. So any one can use, modify it, like  Cassandra by facebook.  Bigtable by Google but only allowed for Google application.
  • 7. 3. Schema Freeness  NoSQL databases doesn’t use any fixed schema like relational database.  Internal schema  External schema etc  The original intention of NoSQL is the modern web-scale databases. There are large number of companies using NoSQL. To name a few : • Google • Facebook • Mozilla • Adobe Overview of NoSQL (Contd…) • Foursquare • LinkedIn • Digg • McGraw-Hill Education
  • 8. WHY NOSQL? Benefits of NOSQL: 1. Scaling RDBs weren’t easy to scale out. On the other hand NoSQL DBs are specially designed to scale out. 2. Big data Single RDBMS is almost unable to handle today’s huge amount of data and the transaction on that data. But Non-Relational databases are specially designed to handle big data. Data is becoming easier to capture and access through third parties such as Facebook, D&B, and others. Personal user information, geo location data, social graphs, user-generated content, machine logging data, and sensor- generated data are just a few examples of the ever-expanding array of data being captured. 3. Needs no Expert DBAs Although RDMS vendors claim that RDBMS provide management facilities but it still need an expert DBA to operate it. In contrast NoSQL DBs don’t need expert DBAs, as it provides automatic repair, data distribution, and simpler data models, which lead to lower administration.
  • 9. WHY NOSQL? (CONTD…) 4. Economics RDBMS requires expensive components for providing efficient service. NoSQL uses cheap commodity servers to manage the same amount of data for which RDBMS needs expensive server. So NoSQL is economical as well. 5. Flexibility of data models There can occur changes in the requirements of an organization with the passage of time. Changes in RDBMS after its deployment creates many problems and also affects its services or some time it’s even almost impossible to make changes. NoSQL database can be changed at any instance, i.e. existing columns can be altered and new can be added.
  • 10. WHY NOSQL? (CONTD…) Scale up with relational technology: limitations at the database tier Source: http://www.couchbase.com/why-nosql/nosql-database
  • 11. WHY NOSQL? (CONTD…) Source: http://www.couchbase.com/why-nosql/nosql-database Scale out with NoSQL technology at the database tier
  • 12. NOSQL MARKET OVERVIEW Source: Wikibon 2013 (http://wikibon.org/wiki/v/Hadoop- NoSQL_Software_and_Services_Market_Forecast_2012-2017) Hadoop/NoSQL Software and Services Marketshare, 2012
  • 13. NOSQL MARKET OVERVIEW (CONTD…) Hadoop/NoSQL Software and Services Market Forecast, 2012-2017 Source: Wikibon 2013 (http://wikibon.org/wiki/v/Hadoop- NoSQL_Software_and_Services_Market_Forecast_2012-2017)
  • 14. CATEGORIES OF NOSQL DATABASES There is a variety of types: • Column Store – Each storage block contains data from only one column • Document Store – stores documents made up of tagged elements • Key-Value Store – Hash table of keys 1. Column Store • Each storage block contains data from only one column • Example: Hadoop/Hbase  http://hadoop.apache.org/  Clients : Yahoo, Facebook • Example: Ingres VectorWise  Column Store integrated with an SQL database • More efficient than row (or document) store if:  Multiple row/record/documents are inserted at the same time so updates of column blocks can be aggregated  Retrievals access only some of the columns in a row/record/document
  • 15. CATEGORIES OF NOSQL DATABASES (CONTD…) 2. Document Store: • It stores documents made up of tagged elements. • Example: CouchDB  http://couchdb.apache.org/  Clients - BBC • Example: MongoDB  http://www.mongodb.org/  Clients - Foursquare, Shutterfly
  • 16. CATEGORIES OF NOSQL DATABASES (CONTD…) 3. Key-Value Store: • Hash table of keys • Values stored with Keys • Fast access to small data values • Example – Project-Voldemort  http://www.project-voldemort.com/  Clients : Linkedin • Example – MemCacheDB  http://memcachedb.org/
  • 17. HADOOP - OVERVIEW  The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using simple programming models.  It is designed to scale up from single servers to thousands of machines, each offering local computation and storage.  Rather than rely on hardware to deliver high-availability, the library itself is designed to detect and handle failures at the application layer, so delivering a highly-available service on top of a cluster of computers, each of which may be prone to failures. The Apache Hadoop framework is composed of the following modules :  Hadoop Common - contains libraries and utilities needed by other Hadoop modules  Hadoop Distributed File System (HDFS) - a distributed file-system that stores data on the commodity machines, providing very high aggregate bandwidth across the cluster.  Hadoop YARN - a resource-management platform responsible for managing compute resources in clusters and using them for scheduling of users' applications.  Hadoop MapReduce - a programming model for large scale data processing.