Microsoft Azure DocumentDB
Overview presentation
James Serra
Big Data Evangelist
Microsoft
JamesSerra3@gmail.com
About Me
• Microsoft, Big Data Evangelist
• In IT for 30 years; worked on many BI and DW projects
• Worked as desktop/web/database developer, DBA, BI and DW architect and developer, MDM architect, PDW/APS developer
• Has been a permanent employee, contractor, consultant, and business owner
• Presenter at PASS Business Analytics Conference, PASS Summit, Enterprise Data World conference
• Certifications: MCSE: Data Platform, Business Intelligence; MS: Architecting Microsoft Azure Solutions, Design and Implement Big Data Analytics Solutions, Design and Implement Cloud Data Platform Solutions
• Blog at JamesSerra.com
• Former SQL Server MVP
• Author of book “Reporting with Microsoft SQL Server 2012”
Agenda
NoSQL Overview
DocumentDB Overview
Today’s application environment
Pricing
DocumentDB basics
Service summary
Development scenarios
Resources and tools
What is NoSQL?
Choose the store that
best fits your needs
A category of databases designed to work around the technical limitations of relational (SQL) databases
Traditional approach: relational stores
Data is stored in tables that comprise:
• Schemas
• Columns
• Rows
Chappell & Associates. “Understanding NoSQL on Microsoft Azure.” 2014. http://www.davidchappell.com/writing/white_papers/Azure-NoSQL-Technologies-v2.0--Chappell.pdf.
Image based on: PricewaterhouseCoopers. “Data models in NoSQL and NewSQL databases.” 2015
http://www.pwc.com/us/en/technology-forecast/2015/remapping-database-landscape/features/assets/data-models-production.pdf
NoSQL approach: various types of stores
NoSQL databases fall into four categories of stores: key-value, wide-column, graph, and document.
Azure DocumentDB
• Uses all but the graph category
• Includes some key-value and columnstore capabilities
PricewaterhouseCoopers. “Data models in NoSQL and NewSQL databases.” 2015
http://www.pwc.com/us/en/technology-forecast/2015/remapping-database-landscape/features/assets/data-models-production.pdf
Key-value stores
Key-value stores offer high speed
through the least-complicated data
model—anything can be stored as
a value, as long as each value is
associated with a key or name.
Key Value
Image based on: PricewaterhouseCoopers. “Data models in NoSQL and NewSQL databases.” 2015
http://www.pwc.com/us/en/technology-forecast/2015/remapping-database-landscape/features/assets/data-models-production.pdf
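As a rough illustration of the key-value model on this slide, a Python dict already behaves like a tiny in-memory key-value store. The key names and values below are invented for the example, and the `put`/`get` helpers are hypothetical; a real store adds persistence and distribution on top of this idea.

```python
# Minimal in-memory key-value store sketch (illustrative only).
# Any value can be stored, as long as it is associated with a key.
store = {}

def put(key, value):
    """Associate a value of any type with a key."""
    store[key] = value

def get(key, default=None):
    """Look up a value by key; return default if the key is absent."""
    return store.get(key, default)

put("user:42:name", "John")              # a string value
put("user:42:prefs", {"theme": "dark"})  # a nested object value
put("page:home:hits", 1017)              # a number value

print(get("user:42:name"))
print(get("missing-key", "not found"))
```

The store never inspects the value, which is why lookups are fast but queries on the contents of a value are not possible without extra machinery.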
Wide-column stores
Wide-column stores are fast and can be almost as simple as key-value stores. They
include a primary key, an optional secondary key, and anything stored as a value.
Image based on: PricewaterhouseCoopers. “Data models in NoSQL and NewSQL databases.” 2015
http://www.pwc.com/us/en/technology-forecast/2015/remapping-database-landscape/features/assets/data-models-production.pdf
[Figure: a wide-column store with a primary key, a secondary key, and values. Keys and values can be sparse or numerous.]
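The wide-column layout can be sketched as a two-level mapping: primary key, then secondary key, then value. This is only a toy model under that assumption; the row names, column names, and the `write`/`read` helpers are made up for illustration.

```python
from collections import defaultdict

# Toy wide-column layout: primary key -> secondary key (column) -> value.
# Rows are sparse: each row stores only the columns it actually has.
table = defaultdict(dict)

def write(primary_key, secondary_key, value):
    """Store one cell under (primary key, secondary key)."""
    table[primary_key][secondary_key] = value

def read(primary_key, secondary_key=None):
    """Return one cell, or a copy of the whole row when no secondary key is given."""
    row = table.get(primary_key, {})
    if secondary_key is None:
        return dict(row)
    return row.get(secondary_key)

write("row1", "name", "Ann")
write("row1", "city", "Oslo")
write("row2", "name", "Bob")  # row2 has no "city" column at all

print(read("row1"))
print(read("row2", "city"))   # absent cell, so None
```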
Graph databases
Image based on: PricewaterhouseCoopers. “Data models in NoSQL and NewSQL databases.” 2015
http://www.pwc.com/us/en/technology-forecast/2015/remapping-database-landscape/features/assets/data-models-production.pdf
[Figure: a graph linking two people (Name: Ian, Name: Alan) to two books (Title: Forgotten Bridges, Title: Mythical Bridges) through "Purchased" relationships dated 03/02/2011, 09/09/2011, and 05/07/2011.]
Document stores
Document stores contain data objects that are inherently hierarchical, tree-like structures (most notably JavaScript Object Notation [JSON] or Extensible Markup Language [XML]).
Note that these are not Microsoft
Word documents!
Image based on: PricewaterhouseCoopers. “Data models in NoSQL and NewSQL databases.” 2015
http://www.pwc.com/us/en/technology-forecast/2015/remapping-database-landscape/features/assets/data-models-production.pdf
NewSQL: another variation
Relational NewSQL stores are
designed for web-scale applications,
but they still require up-front schemas,
joins, and table management that can
be labor intensive.
Image based on: PricewaterhouseCoopers. “Data models in NoSQL and NewSQL databases.” 2015.
http://www.pwc.com/us/en/technology-forecast/2015/remapping-database-landscape/features/assets/data-models-production.pdf.
Why NoSQL evolved
Image based on: PricewaterhouseCoopers. “Data models in NoSQL and NewSQL databases.” 2015
http://www.pwc.com/us/en/technology-forecast/2015/remapping-database-landscape/features/assets/data-models-production.pdf
Drivers
SQL and NoSQL: each has its place
Fully featured RDBMS
Transactional processing
Rich query
Managed as a service
Elastic scale
Internet-accessible HTTP/REST
Schema-free data model
Arbitrary data formats
Azure DocumentDB
Perfect for cloud architects and developers who need an enterprise-ready NoSQL document database
JSON
DOCUMENT1
{
  "name": "John",
  "country": "Canada",
  "age": 43,
  "lastUse": "March 4, 2014"
}
DOCUMENT2
{
  "name": "Eva",
  "country": "Germany",
  "age": 25
}
DOCUMENT3
{
  "name": "Lou",
  "country": "Australia",
  "age": 51,
  "firstUse": "May 8, 2013"
}
DOCUMENT4
{
  "docCount": 3,
  "last": "May 1, 2014"
}
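The four sample documents do not share a schema, which is the point of the slide. A small sketch of what that means for client code, using only Python's standard `json` module (the variable names are illustrative, not part of any DocumentDB SDK):

```python
import json

# The four sample documents from the slide; note they do not share a schema.
raw_documents = [
    '{"name": "John", "country": "Canada", "age": 43, "lastUse": "March 4, 2014"}',
    '{"name": "Eva", "country": "Germany", "age": 25}',
    '{"name": "Lou", "country": "Australia", "age": 51, "firstUse": "May 8, 2013"}',
    '{"docCount": 3, "last": "May 1, 2014"}',
]
documents = [json.loads(text) for text in raw_documents]

# Every property name that appears in at least one document.
all_properties = sorted({key for doc in documents for key in doc})

# Readers must tolerate missing properties; .get() returns None when absent.
names = [doc.get("name") for doc in documents]
people_over_40 = [doc["name"] for doc in documents if doc.get("age", 0) > 40]

print(all_properties)
print(names)
print(people_over_40)
```

Because nothing enforces a common shape, the reading code decides how to treat a missing property, here by falling back to a default.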
A NoSQL document database-as-a-service, fully managed by Azure
{
  "name": "SmugMug",
  "permalink": "smugmug",
  "homepage_url": "http://www.smugmug.com",
  "blog_url": "http://blogs.smugmug.com/",
  "category_code": "photo_video",
  "products": [
    {
      "name": "SmugMug",
      "permalink": "smugmug"
    }
  ],
  "offices": [
    {
      "description": "",
      "address1": "67 E. Evelyn Ave",
      "address2": "",
      "zip_code": "94041",
      "city": "Mountain View",
      "state_code": "CA",
      "country_code": "USA",
      "latitude": 37.390056,
      "longitude": -122.067692
    }
  ]
}
Perfect for: schema-agnostic JSON store for
hierarchical and denormalized data at scale
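To make "hierarchical and denormalized" concrete, the sketch below loads an abridged copy of the SmugMug document and walks into its nested arrays. The `get_path` helper is a hypothetical convenience written for this example; the related product and office data sit inside the one document, so no join is needed to reach them.

```python
import json

# Abridged copy of the SmugMug document from the slide.
company = json.loads("""
{
  "name": "SmugMug",
  "permalink": "smugmug",
  "products": [{"name": "SmugMug", "permalink": "smugmug"}],
  "offices": [
    {"city": "Mountain View", "state_code": "CA", "country_code": "USA",
     "latitude": 37.390056, "longitude": -122.067692}
  ]
}
""")

def get_path(document, path):
    """Follow a list of keys and list indexes down into a nested document."""
    node = document
    for step in path:
        node = node[step]
    return node

first_office_city = get_path(company, ["offices", 0, "city"])
product_names = [product["name"] for product in company["products"]]

print(first_office_city)
print(product_names)
```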
What documents?
Not Word documents
Azure DocumentDB details
Native support for JavaScript, SQL query, and transactions over JSON documents
Ideal for apps designed for the cloud when the following are high priorities:
Reliable and predictable performance
• Tunable consistency
• Elastic scale
Rapid development
• Build with familiar tools: REST, JSON, JavaScript
Rich query and transactions over JSON data
• Query JSON data with no secondary indices
Top Features
Auto-scaling/sharding
• Improved scalability and reliability due to distribution
of large data sets across multiple machines
Automatic indexing
• All document properties are available for queries
• Frees you from relying on schemas or secondary indexes
SQL query language
• Make use of SQL experience and .NET LINQ
Managed service
• Spin up on demand with no setup
• Availability guarantee of 99.95%
• Linear price curve without virtual-machine step functions
• Integration with Azure HDInsight and Azure Search
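The automatic-indexing bullet says every document property is available for queries. One way to picture that is an inverted index keyed by property path and value. The sketch below is a toy model of the idea only, not DocumentDB's actual index structure; the path notation, the helper names, and the sample documents are assumptions made for illustration.

```python
from collections import defaultdict

def flatten(value, prefix=""):
    """Yield (path, scalar) pairs for every property in a JSON-like value."""
    if isinstance(value, dict):
        for key, child in value.items():
            yield from flatten(child, f"{prefix}/{key}")
    elif isinstance(value, list):
        for child in value:
            yield from flatten(child, f"{prefix}/[]")
    else:
        yield prefix, value

def build_index(documents):
    """Map every (path, value) pair to the ids of the documents containing it."""
    index = defaultdict(set)
    for doc_id, doc in documents.items():
        for path, scalar in flatten(doc):
            index[(path, scalar)].add(doc_id)
    return index

# Hypothetical documents with different shapes; no schema is declared anywhere.
documents = {
    "doc1": {"name": "John", "country": "Canada", "age": 43},
    "doc2": {"name": "Eva", "country": "Germany", "age": 25},
    "doc3": {"name": "Lou", "offices": [{"city": "Sydney"}]},
}
index = build_index(documents)

print(index[("/country", "Canada")])
print(index[("/offices/[]/city", "Sydney")])
```

Since every path is indexed as documents arrive, a lookup on any property works without the developer declaring a schema or a secondary index first.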
Top Features
Greater consistency control
• Four consistency levels provide more options for
consistency, availability, and performance requirements
Atomicity, Consistency,
Isolation, and Durability
(ACID) transaction control
• Simpler programming model (compared to state variables)
• Use JavaScript for insert, update, and delete actions
Standards-based open
API with RESTful HTTP
• Uses JSON standard—no mapping of Binary
JSON (BSON) to JSON needed
Granular access rights
• Allows access to all documents and attachments
within collections
Monitor an account
• View performance metrics for a DocumentDB account
• Customize performance metric views for a DocumentDB account
• Create side-by-side performance metric charts
• View usage metrics for a DocumentDB account
• Set up performance metric alerts for a DocumentDB account
Today’s modern apps
• Produce and consume data at a staggering rate
• Require instantaneous response times to
match user expectations
• Are developed iteratively with many versions
supported concurrently
• Are developed with continuously evolving
data models
• Are increasingly complex
• Experience unpredictable and explosive growth
Well-suited for web and mobile apps
• Catalog data
• Preferences and state
• Event store
• User-generated content
• Data exchange
Azure DocumentDB at Microsoft
More than 450 million unique users
Store 20 TB of JSON document data
Under 15 millisecond (ms) writes and
single-digit ms reads
Store for 40+ app/device combinations
Available globally to serve all markets
USER DATA STORE
Standard pricing tier with hourly billing
Azure DocumentDB basics
Resource model
• Entities addressable by logical Uniform Resource
Identifier (URI)
• Partitioned for scale out
• Replicated for high availability
• Entities represented as JSON
• Accounts scale out by moving a slider
Interaction model
• RESTful interaction over HTTPS
• HTTPS and TCP connectivity
• Standard HTTP verbs and semantics
Development
• .NET, Node.js, Python, Java, and JavaScript clients
• SQL for query expression, .NET LINQ
• JavaScript for server-side app logic
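[Figure: resource hierarchy. An Azure DocumentDB account contains databases. Each database contains users (with permissions) and collections. Each collection contains documents (with attachments), stored procedures, triggers, and user-defined functions; the last three are written in JavaScript (JS).]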
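Because every entity in this hierarchy is addressable by a logical URI, a resource's address can be assembled from its ancestors. The sketch below assumes the `dbs/.../colls/.../docs/...` style of path segments as I understand DocumentDB's id-based links; treat the segment names as an assumption to check against the REST reference, and the ids (`appdb`, `users`, `john`) as invented.

```python
# Path segment used for each resource type (assumed, see the note above).
SEGMENTS = {
    "database": "dbs",
    "user": "users",
    "permission": "permissions",
    "collection": "colls",
    "document": "docs",
    "attachment": "attachments",
    "stored_procedure": "sprocs",
    "trigger": "triggers",
    "udf": "udfs",
}

def resource_link(*pairs):
    """Build a logical link from (resource_type, id) pairs, outermost first."""
    return "/".join(f"{SEGMENTS[kind]}/{rid}" for kind, rid in pairs)

doc_link = resource_link(
    ("database", "appdb"), ("collection", "users"), ("document", "john")
)
permission_link = resource_link(
    ("database", "appdb"), ("user", "reader"), ("permission", "read-users")
)

print(doc_link)
print(permission_link)
```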
Azure DocumentDB collections
• Collections != tables
• Unit of partitioning
• Transaction boundary
• No enforced schema, flexible
• Documents that are queried or updated together stay in one collection
• Elasticity to 10 GB
• Request units (RUs) evenly distributed across partitions
Elastic collections
• Collection != single partition
• Partition count dynamic
• Each partition (key) is 10 GB
• Online splits and merges with
full availability
• RUs evenly distributed
across partitions
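The bullets above describe a collection spread over a dynamic number of partitions, with RUs divided evenly among them. A minimal sketch of that arithmetic follows. The hashing scheme is an assumption chosen for the example (the deck does not say how DocumentDB maps keys to partitions), and both function names are hypothetical.

```python
import hashlib

def partition_for(partition_key, partition_count):
    """Pick a partition by hashing the key (stable across runs, unlike hash())."""
    digest = hashlib.sha256(partition_key.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % partition_count

def throughput_per_partition(total_request_units, partition_count):
    """Provisioned RUs are split evenly across partitions."""
    return total_request_units / partition_count

# The same key always lands on the same partition for a given partition count.
print(partition_for("user-1", 4) == partition_for("user-1", 4))
print(throughput_per_partition(10000, 5))
```

One consequence of the even split is that a single hot partition key can exhaust its partition's share while the collection as a whole still has unused throughput.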
Rich query over JSON data
Build modern, scalable apps with robust transactional querying and data processing on JSON documents
• Native JavaScript transactional processing
• Familiar SQL-based query language
• Query on JSON data without specifying secondary indices or constructing views
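To give a feel for the SQL-based query language, the sketch below shows a query in DocumentDB's SQL dialect as plain text, next to an in-memory Python equivalent run over sample documents from earlier in the deck. The query is not executed against any service here, and `select_names_by_country` is a hypothetical helper written only to mirror its meaning.

```python
# A query in DocumentDB's SQL dialect (shown as text only; not executed here).
QUERY_TEXT = "SELECT c.name FROM c WHERE c.country = 'Canada'"

documents = [
    {"name": "John", "country": "Canada", "age": 43},
    {"name": "Eva", "country": "Germany", "age": 25},
    {"name": "Lou", "country": "Australia", "age": 51},
    {"docCount": 3, "last": "May 1, 2014"},  # no "country": never matches
]

def select_names_by_country(docs, country):
    """In-memory equivalent of QUERY_TEXT for an arbitrary country value."""
    return [{"name": d["name"]} for d in docs if d.get("country") == country]

print(select_names_by_country(documents, "Canada"))
```

Note that the filter is on a property no one declared an index for, and that the document without a `country` property is simply skipped rather than causing an error.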
JavaScript transactions
Transactionally process multiple documents
with application-defined stored procedures
and triggers
• JavaScript as the procedural language
• Language integrated
• Execution wrapped in an implicit transaction
• Preregistered and scoped to a collection
• Performed with ACID guarantees
• Triggers invoked as pre- or post-operations
Stored procedures
JS
Triggers
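The "implicit transaction" bullet means a stored procedure either applies all of its changes or none of them. DocumentDB runs such logic as JavaScript on the server; the Python below only imitates that all-or-nothing behavior in memory, so it illustrates atomicity and nothing more. All names and the account data are invented for the example.

```python
import copy

def run_in_transaction(collection, procedure):
    """Run procedure against a working copy; commit only if it does not raise."""
    working_copy = copy.deepcopy(collection)
    procedure(working_copy)          # an exception here skips the commit below
    collection.clear()
    collection.update(working_copy)  # commit: adopt the modified copy

def transfer(amount):
    """Build a procedure that moves `amount` between two hypothetical documents."""
    def procedure(docs):
        docs["alice"]["balance"] -= amount
        if docs["alice"]["balance"] < 0:
            raise ValueError("insufficient funds")
        docs["bob"]["balance"] += amount
    return procedure

accounts = {"alice": {"balance": 50}, "bob": {"balance": 10}}

run_in_transaction(accounts, transfer(30))       # succeeds and commits
try:
    run_in_transaction(accounts, transfer(100))  # raises, so nothing changes
except ValueError:
    pass

print(accounts)
```

The failed transfer had already debited the working copy before raising, yet the committed data shows no trace of it.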
Reliable and predictable performance
Azure DocumentDB was built for the cloud to deliver fast, predictable performance, with reserved resources to meet throughput needs. It offers reliable, tunable consistency so you can trade consistency for performance based on application needs.
• Tunable consistency
• Elastic scale
• Fast, predictable performance: defined throughput levels that scale linearly with application needs
Four consistency levels
• Strong
• Bounded Staleness
• Session
• Eventual
Lower consistency level on read operations
// Read one document at Eventual consistency for this request only.
Document myDoc = await client.ReadDocumentAsync(
    documentLink,
    new RequestOptions { ConsistencyLevel = ConsistencyLevel.Eventual });
Consistency levels enable guarantees
Choose your consistency level and make predictable trade-offs between consistency, availability, and performance
• Strong: data consistency
• Bounded Staleness: total order of propagation of writes
• Session: monotonic reads (on explicit read requests) and writes
• Eventual: lowest latency for reads and writes
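The trade-off between the four levels can be pictured with a toy model: one primary that accepts writes and one replica that lags behind it. This is a deliberately simplified sketch and not how DocumentDB implements consistency; the class, its methods, the `max_lag` bound, and the session token are all assumptions made for illustration. A read is served by the (cheaper, possibly stale) replica whenever the chosen level allows it, and by the primary otherwise.

```python
class ToyReplicaSet:
    """One primary plus one lagging replica holding a prefix of the write log."""

    def __init__(self):
        self.log = []        # ordered writes accepted by the primary
        self.replicated = 0  # number of log entries the replica has applied

    def write(self, value):
        self.log.append(value)
        return len(self.log)  # session token: position of this write

    def replicate(self, count=1):
        """Let the replica apply up to `count` more writes, in order."""
        self.replicated = min(len(self.log), self.replicated + count)

    def read(self, level, max_lag=0, session_token=0):
        lag = len(self.log) - self.replicated
        if level == "strong":
            use_replica = lag == 0                 # only if fully caught up
        elif level == "bounded_staleness":
            use_replica = lag <= max_lag           # at most max_lag writes behind
        elif level == "session":
            use_replica = self.replicated >= session_token  # sees own writes
        elif level == "eventual":
            use_replica = True                     # any prefix is acceptable
        else:
            raise ValueError(f"unknown consistency level: {level}")
        visible = self.replicated if use_replica else len(self.log)
        return self.log[visible - 1] if visible else None

db = ToyReplicaSet()
db.write("v1")
db.replicate()           # the replica now has "v1"
token = db.write("v2")   # the replica has not seen "v2" yet

print(db.read("strong"))
print(db.read("bounded_staleness", max_lag=1))
print(db.read("session", session_token=token))
print(db.read("eventual"))
```

In this model the weaker levels return the older value more often, which is the price paid for being able to answer from the nearest replica.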
Security model
Azure DocumentDB is designed to be secure with:
• Master key
• Access control on resources
• User operations
• Permission operations
• Code execution
Rapid development
Reduce development friction and complexity when building new business-class applications by using familiar tools and industry-standard platforms. Combine Azure DocumentDB with a portfolio of complementary cloud services on the Azure platform, such as the Azure HDInsight Connector and Azure Search Indexer.
• Easy to start and fully managed
• Enterprise-grade Azure platform
• Build with familiar tools: REST, JSON, and JavaScript
Tools
https://azure.microsoft.com/en-us/blog/exploring-azure-documentdb-in-visual-studio/
https://azure.microsoft.com/en-us/documentation/articles/documentdb-import-data/
http://portal.azure.com
Azure DocumentDB service summary
Unique among NoSQL stores:
• Developed for the cloud and for delivery as a service
• Truly queryable JSON store
• Transactional processing through language-integrated JavaScript
• Predictable performance and tunable consistency
Development scenarios
Consider Azure DocumentDB when you need:
• To build new web and mobile cloud-based applications
• Rapid development and high-scalability requirements
• Query and processing of user- and device-generated data
• More query and processing support for your key-value stores
• An alternative to running a document store in virtual machines
• A managed service model
Build your first Azure DocumentDB app today
Get support
Schedule a 1:1 chat directly with
the Azure DocumentDB engineering
team at askdocdb.com
Give feedback
Ask questions through the forum
at http://aka.ms/docdbforum
Suggest an idea and vote to support
other ideas for Azure DocumentDB at
http://aka.ms/docdbideas
On Twitter @documentdb
Get started
Sign up for Azure DocumentDB
at http://aka.ms/docdbstart
Access and configure your account
at http://portal.azure.com
Download an SDK from
http://aka.ms/docdbsdks,
and then build a sample at
http://aka.ms/docdbsample
Using DocumentDB Query Playground
Go to http://www.documentdb.com/sql/demo
Test out sample queries or write your own against the dataset
Learn more
David Chappell NoSQL overview paper on Infopedia
http://www.davidchappell.com/writing/white_papers/Azure-NoSQL-Technologies-v2.0--Chappell.pdf
Seven Databases in Seven Weeks: A Guide to Modern
Databases and the NoSQL Movement [book]
http://www.pdfiles.com/pdf/files/English/Databases/Seven_Databases_In_Seven_Weeks.pdf
Replicated Data Consistency Explained Through Baseball
[paper]
http://research.microsoft.com/apps/pubs/default.aspx?id=206913
Q & A ?
James Serra, Big Data Evangelist
Email me at: JamesSerra3@gmail.com
Follow me at: @JamesSerra
Link to me at: www.linkedin.com/in/JamesSerra
Visit my blog at: JamesSerra.com (where this slide deck will be posted)

Contenu connexe

Tendances

Tendances (20)

Microsoft Data Integration Pipelines: Azure Data Factory and SSIS
Microsoft Data Integration Pipelines: Azure Data Factory and SSISMicrosoft Data Integration Pipelines: Azure Data Factory and SSIS
Microsoft Data Integration Pipelines: Azure Data Factory and SSIS
 
Data Lakehouse Symposium | Day 4
Data Lakehouse Symposium | Day 4Data Lakehouse Symposium | Day 4
Data Lakehouse Symposium | Day 4
 
Azure Data Factory Data Flows Training (Sept 2020 Update)
Azure Data Factory Data Flows Training (Sept 2020 Update)Azure Data Factory Data Flows Training (Sept 2020 Update)
Azure Data Factory Data Flows Training (Sept 2020 Update)
 
Lakehouse in Azure
Lakehouse in AzureLakehouse in Azure
Lakehouse in Azure
 
Introduction to Azure Databricks
Introduction to Azure DatabricksIntroduction to Azure Databricks
Introduction to Azure Databricks
 
Modernizing to a Cloud Data Architecture
Modernizing to a Cloud Data ArchitectureModernizing to a Cloud Data Architecture
Modernizing to a Cloud Data Architecture
 
Running Apache Spark Jobs Using Kubernetes
Running Apache Spark Jobs Using KubernetesRunning Apache Spark Jobs Using Kubernetes
Running Apache Spark Jobs Using Kubernetes
 
Big data architectures and the data lake
Big data architectures and the data lakeBig data architectures and the data lake
Big data architectures and the data lake
 
Learn to Use Databricks for Data Science
Learn to Use Databricks for Data ScienceLearn to Use Databricks for Data Science
Learn to Use Databricks for Data Science
 
Power BI for Big Data and the New Look of Big Data Solutions
Power BI for Big Data and the New Look of Big Data SolutionsPower BI for Big Data and the New Look of Big Data Solutions
Power BI for Big Data and the New Look of Big Data Solutions
 
Azure DataBricks for Data Engineering by Eugene Polonichko
Azure DataBricks for Data Engineering by Eugene PolonichkoAzure DataBricks for Data Engineering by Eugene Polonichko
Azure DataBricks for Data Engineering by Eugene Polonichko
 
Azure Data Platform Overview.pdf
Azure Data Platform Overview.pdfAzure Data Platform Overview.pdf
Azure Data Platform Overview.pdf
 
SQL Server on AWS
SQL Server on AWSSQL Server on AWS
SQL Server on AWS
 
Data Catalog & ETL - Glue & Athena
Data Catalog & ETL - Glue & AthenaData Catalog & ETL - Glue & Athena
Data Catalog & ETL - Glue & Athena
 
Delta lake and the delta architecture
Delta lake and the delta architectureDelta lake and the delta architecture
Delta lake and the delta architecture
 
Azure data platform overview
Azure data platform overviewAzure data platform overview
Azure data platform overview
 
Introduction to AWS Lake Formation.pptx
Introduction to AWS Lake Formation.pptxIntroduction to AWS Lake Formation.pptx
Introduction to AWS Lake Formation.pptx
 
Azure Synapse Analytics Overview (r1)
Azure Synapse Analytics Overview (r1)Azure Synapse Analytics Overview (r1)
Azure Synapse Analytics Overview (r1)
 
Azure Synapse Analytics Overview (r2)
Azure Synapse Analytics Overview (r2)Azure Synapse Analytics Overview (r2)
Azure Synapse Analytics Overview (r2)
 
Modern data warehouse
Modern data warehouseModern data warehouse
Modern data warehouse
 

Similaire à Introducing DocumentDB

Modern Analytics Academy - Data Modeling (1).pptx
Modern Analytics Academy - Data Modeling (1).pptxModern Analytics Academy - Data Modeling (1).pptx
Modern Analytics Academy - Data Modeling (1).pptx
ssuser290967
 
Building Big Data Solutions with Azure Data Lake.10.11.17.pptx
Building Big Data Solutions with Azure Data Lake.10.11.17.pptxBuilding Big Data Solutions with Azure Data Lake.10.11.17.pptx
Building Big Data Solutions with Azure Data Lake.10.11.17.pptx
thando80
 

Similaire à Introducing DocumentDB (20)

QuerySurge Slide Deck for Big Data Testing Webinar
QuerySurge Slide Deck for Big Data Testing WebinarQuerySurge Slide Deck for Big Data Testing Webinar
QuerySurge Slide Deck for Big Data Testing Webinar
 
Prague data management meetup 2018-03-27
Prague data management meetup 2018-03-27Prague data management meetup 2018-03-27
Prague data management meetup 2018-03-27
 
How does Microsoft solve Big Data?
How does Microsoft solve Big Data?How does Microsoft solve Big Data?
How does Microsoft solve Big Data?
 
Azure Days 2019: Business Intelligence auf Azure (Marco Amhof & Yves Mauron)
Azure Days 2019: Business Intelligence auf Azure (Marco Amhof & Yves Mauron)Azure Days 2019: Business Intelligence auf Azure (Marco Amhof & Yves Mauron)
Azure Days 2019: Business Intelligence auf Azure (Marco Amhof & Yves Mauron)
 
Ralph Kemperdick – IT-Tage 2015 – Microsoft Azure als Datenplattform
Ralph Kemperdick – IT-Tage 2015 – Microsoft Azure als DatenplattformRalph Kemperdick – IT-Tage 2015 – Microsoft Azure als Datenplattform
Ralph Kemperdick – IT-Tage 2015 – Microsoft Azure als Datenplattform
 
Benefits of the Azure cloud
Benefits of the Azure cloudBenefits of the Azure cloud
Benefits of the Azure cloud
 
USQL Trivadis Azure Data Lake Event
USQL Trivadis Azure Data Lake EventUSQL Trivadis Azure Data Lake Event
USQL Trivadis Azure Data Lake Event
 
Modern Analytics Academy - Data Modeling (1).pptx
Modern Analytics Academy - Data Modeling (1).pptxModern Analytics Academy - Data Modeling (1).pptx
Modern Analytics Academy - Data Modeling (1).pptx
 
Building Big Data Solutions with Azure Data Lake.10.11.17.pptx
Building Big Data Solutions with Azure Data Lake.10.11.17.pptxBuilding Big Data Solutions with Azure Data Lake.10.11.17.pptx
Building Big Data Solutions with Azure Data Lake.10.11.17.pptx
 
Introduction to Azure Data Lake
Introduction to Azure Data LakeIntroduction to Azure Data Lake
Introduction to Azure Data Lake
 
[WITH THE VISION 2017] IoT/AI時代を生き抜くためのデータ プラットフォーム (Leveraging Azure Data Se...
[WITH THE VISION 2017] IoT/AI時代を生き抜くためのデータ プラットフォーム (Leveraging Azure Data Se...[WITH THE VISION 2017] IoT/AI時代を生き抜くためのデータ プラットフォーム (Leveraging Azure Data Se...
[WITH THE VISION 2017] IoT/AI時代を生き抜くためのデータ プラットフォーム (Leveraging Azure Data Se...
 
Introduction to SQL Server Analysis services 2008
Introduction to SQL Server Analysis services 2008Introduction to SQL Server Analysis services 2008
Introduction to SQL Server Analysis services 2008
 
KoprowskiT_session1_SDNEvent_WASDforBeginners
KoprowskiT_session1_SDNEvent_WASDforBeginnersKoprowskiT_session1_SDNEvent_WASDforBeginners
KoprowskiT_session1_SDNEvent_WASDforBeginners
 
Differentiate Big Data vs Data Warehouse use cases for a cloud solution
Differentiate Big Data vs Data Warehouse use cases for a cloud solutionDifferentiate Big Data vs Data Warehouse use cases for a cloud solution
Differentiate Big Data vs Data Warehouse use cases for a cloud solution
 
Unlocking the Value of Your Data Lake
Unlocking the Value of Your Data LakeUnlocking the Value of Your Data Lake
Unlocking the Value of Your Data Lake
 
Using Data Lakes
Using Data Lakes Using Data Lakes
Using Data Lakes
 
Using Data Lakes
Using Data LakesUsing Data Lakes
Using Data Lakes
 
Self service BI with sql server 2008 R2 and microsoft power pivot short
Self service BI with sql server 2008 R2 and microsoft power pivot shortSelf service BI with sql server 2008 R2 and microsoft power pivot short
Self service BI with sql server 2008 R2 and microsoft power pivot short
 
[「RDB技術者のためのNoSQLガイド」出版記念セミナー] Azure DocumentDB
[「RDB技術者のためのNoSQLガイド」出版記念セミナー] Azure DocumentDB[「RDB技術者のためのNoSQLガイド」出版記念セミナー] Azure DocumentDB
[「RDB技術者のためのNoSQLガイド」出版記念セミナー] Azure DocumentDB
 
Microsoft Data Platform - What's included
Microsoft Data Platform - What's includedMicrosoft Data Platform - What's included
Microsoft Data Platform - What's included
 

Plus de James Serra

Plus de James Serra (18)

Microsoft Fabric Introduction
Microsoft Fabric IntroductionMicrosoft Fabric Introduction
Microsoft Fabric Introduction
 
Data Warehousing Trends, Best Practices, and Future Outlook
Data Warehousing Trends, Best Practices, and Future OutlookData Warehousing Trends, Best Practices, and Future Outlook
Data Warehousing Trends, Best Practices, and Future Outlook
 
Data Lake Overview
Data Lake OverviewData Lake Overview
Data Lake Overview
 
Power BI Overview, Deployment and Governance
Power BI Overview, Deployment and GovernancePower BI Overview, Deployment and Governance
Power BI Overview, Deployment and Governance
 
Power BI Overview
Power BI OverviewPower BI Overview
Power BI Overview
 
Machine Learning and AI
Machine Learning and AIMachine Learning and AI
Machine Learning and AI
 
AI for an intelligent cloud and intelligent edge: Discover, deploy, and manag...
AI for an intelligent cloud and intelligent edge: Discover, deploy, and manag...AI for an intelligent cloud and intelligent edge: Discover, deploy, and manag...
AI for an intelligent cloud and intelligent edge: Discover, deploy, and manag...
 
How to build your career
How to build your careerHow to build your career
How to build your career
 
Is the traditional data warehouse dead?
Is the traditional data warehouse dead?Is the traditional data warehouse dead?
Is the traditional data warehouse dead?
 
What’s new in SQL Server 2017
What’s new in SQL Server 2017What’s new in SQL Server 2017
What’s new in SQL Server 2017
 
Learning to present and becoming good at it
Learning to present and becoming good at itLearning to present and becoming good at it
Learning to present and becoming good at it
 
Microsoft cloud big data strategy
Microsoft cloud big data strategyMicrosoft cloud big data strategy
Microsoft cloud big data strategy
 
Choosing technologies for a big data solution in the cloud
Choosing technologies for a big data solution in the cloudChoosing technologies for a big data solution in the cloud
Choosing technologies for a big data solution in the cloud
 
What's new in SQL Server 2016
What's new in SQL Server 2016What's new in SQL Server 2016
What's new in SQL Server 2016
 
Introduction to PolyBase
Introduction to PolyBaseIntroduction to PolyBase
Introduction to PolyBase
 
Overview on Azure Machine Learning
Overview on Azure Machine LearningOverview on Azure Machine Learning
Overview on Azure Machine Learning
 
Introduction to Microsoft’s Hadoop solution (HDInsight)
Introduction to Microsoft’s Hadoop solution (HDInsight)Introduction to Microsoft’s Hadoop solution (HDInsight)
Introduction to Microsoft’s Hadoop solution (HDInsight)
 
HA/DR options with SQL Server in Azure and hybrid
HA/DR options with SQL Server in Azure and hybridHA/DR options with SQL Server in Azure and hybrid
HA/DR options with SQL Server in Azure and hybrid
 

Dernier

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 

Dernier (20)

Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 

Introducing DocumentDB

  • 1. Microsoft Azure DocumentDB Overview presentation James Serra Big Data Evangelist Microsoft JamesSerra3@gmail.com
  • 2. About Me  Microsoft, Big Data Evangelist  In IT for 30 years, worked on many BI and DW projects  Worked as desktop/web/database developer, DBA, BI and DW architect and developer, MDM architect, PDW/APS developer  Been perm employee, contractor, consultant, business owner  Presenter at PASS Business Analytics Conference, PASS Summit, Enterprise Data World conference  Certifications: MCSE: Data Platform, Business Intelligence; MS: Architecting Microsoft Azure Solutions, Design and Implement Big Data Analytics Solutions, Design and Implement Cloud Data Platform Solutions  Blog at JamesSerra.com  Former SQL Server MVP  Author of book “Reporting with Microsoft SQL Server 2012”
  • 3. Agenda NoSQL Overview DocumentDB Overview Today’s application environment Pricing DocumentDB basics Service summary Development scenarios Resources and tools
  • 4. What is NoSQL? Choose the store that best fits your needs A database solution designed to compensate for the technical limitations of SQL
  • 5. Traditional approach: relational stores Data is stored in tables that comprise: • Schemas • Columns • Rows Chappell & Associates. “Understanding NoSQL on Microsoft Azure.” 2014. http://www.davidchappell.com/writing/white_papers/Azure-NoSQL-Technologies-v2.0--Chappell.pdf. Image based on: PricewaterhouseCoopers. “Data models in NoSQL and NewSQL databases.” 2015 http://www.pwc.com/us/en/technology-forecast/2015/remapping-database-landscape/features/assets/data-models-production.pdf
  • 6. Azure DocumentDB Uses all but graph category Includes some key-value and columnstore capabilities NoSQL approach: various types of stores PricewaterhouseCoopers. “Data models in NoSQL and NewSQL databases.” 2015 http://www.pwc.com/us/en/technology-forecast/2015/remapping-database-landscape/features/assets/data-models-production.pdf A NoSQL database uses four categories of stores:
  • 7. Key-value stores Key-value stores offer high speed through the least-complicated data model—anything can be stored as a value, as long as each value is associated with a key or name. Key Value Image based on: PricewaterhouseCoopers. “Data models in NoSQL and NewSQL databases.” 2015 http://www.pwc.com/us/en/technology-forecast/2015/remapping-database-landscape/features/assets/data-models-production.pdf
  • 8. Wide-column stores Wide-column stores are fast and can be almost as simple as key-value stores. They include a primary key, an optional secondary key, and anything stored as a value. Image based on: PricewaterhouseCoopers. “Data models in NoSQL and NewSQL databases.” 2015 http://www.pwc.com/us/en/technology-forecast/2015/remapping-database-landscape/features/assets/data-models-production.pdf Values Primary key Keys and values can be sparse or numerous Secondary key
  • 9. Graph databases [Figure: example graph with person nodes (Name: Ian; Name: Alan) and book nodes (Title: Forgotten Bridges; Title: Mythical Bridges), connected by purchase relationships dated 03/02/2011, 09/09/2011, and 05/07/2011] Image based on: PricewaterhouseCoopers. “Data models in NoSQL and NewSQL databases.” 2015 http://www.pwc.com/us/en/technology-forecast/2015/remapping-database-landscape/features/assets/data-models-production.pdf
  • 10. Document stores Document stores contain data objects that are inherently hierarchical, tree-like structures (most notably JavaScript Object Notation [JSON] or Extensible Markup Language [XML]). Note that these are not Microsoft Word documents! Image based on: PricewaterhouseCoopers. “Data models in NoSQL and NewSQL databases.” 2015 http://www.pwc.com/us/en/technology-forecast/2015/remapping-database-landscape/features/assets/data-models-production.pdf
  • 11. NewSQL: another variation Relational NewSQL stores are designed for web-scale applications, but they still require up-front schemas, joins, and table management that can be labor intensive. Image based on: PricewaterhouseCoopers. “Data models in NoSQL and NewSQL databases.” 2015. http://www.pwc.com/us/en/technology-forecast/2015/remapping-database-landscape/features/assets/data-models-production.pdf.
  • 12. Why NoSQL evolved Image based on: PricewaterhouseCoopers. “Data models in NoSQL and NewSQL databases.” 2015 http://www.pwc.com/us/en/technology-forecast/2015/remapping-database-landscape/features/assets/data-models-production.pdf Drivers
  • 13. SQL and NoSQL: each has its place • Fully featured RDBMS • Transactional processing • Rich query • Managed as a service • Elastic scale • Internet-accessible HTTP/REST • Schema-free data model • Arbitrary data formats
  • 14. Azure DocumentDB A NoSQL document database-as-a-service, fully managed by Azure. Perfect for cloud architects and developers who need an enterprise-ready NoSQL document database. JSON documents: DOCUMENT1: { "name": "John", "country": "Canada", "age": 43, "lastUse": "March 4, 2014" } DOCUMENT2: { "name": "Eva", "country": "Germany", "age": 25 } DOCUMENT3: { "name": "Lou", "country": "Australia", "age": 51, "firstUse": "May 8, 2013" } DOCUMENT4: { "docCount": 3, "last": "May 1, 2014" }
  • 15. { "name": "SmugMug", "permalink": "smugmug", "homepage_url": "http://www.smugmug.com", "blog_url": "http://blogs.smugmug.com/", "category_code": "photo_video", "products": [ { "name": "SmugMug", "permalink": "smugmug" } ], "offices": [ { "description": "", "address1": "67 E. Evelyn Ave", "address2": "", "zip_code": "94041", "city": "Mountain View", "state_code": "CA", "country_code": "USA", "latitude": 37.390056, "longitude": -122.067692 } ] } Perfect for: schema-agnostic JSON store for hierarchical and denormalized data at scale What documents? Not Word documents
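A minimal sketch of what "hierarchical and denormalized" means in practice, using a trimmed copy of the company document from this slide. The helper name `officeLabels` is hypothetical and not part of any DocumentDB SDK; the code only shows that nested data travels inside the document and that a reader has to tolerate missing fields in a schema-agnostic store.

```javascript
// Trimmed copy of the company document shown on the slide.
const company = {
  name: "SmugMug",
  category_code: "photo_video",
  products: [{ name: "SmugMug", permalink: "smugmug" }],
  offices: [
    {
      city: "Mountain View",
      state_code: "CA",
      country_code: "USA",
      latitude: 37.390056,
      longitude: -122.067692,
    },
  ],
};

// Hypothetical helper: build "city, state" labels from the embedded offices array.
function officeLabels(doc) {
  // No schema is enforced, so the field may be absent; fall back to an empty list.
  return (doc.offices || []).map((office) => `${office.city}, ${office.state_code}`);
}

console.log(officeLabels(company));               // one label: "Mountain View, CA"
console.log(officeLabels({ name: "NoOffices" })); // empty list, no error
```

Because the offices are embedded rather than stored in a separate table, reading them needs no join; the trade-off is that shared data is duplicated across documents.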
  • 16. Azure DocumentDB details Native support for JavaScript, SQL query, and transactions over JSON documents. Ideal for apps designed for the cloud when the following are high priorities: Reliable and predictable performance • Tunable consistency • Elastic scale Rapid development • Build with familiar tools: REST, JSON, JavaScript Rich query and transactions over JSON data • Query JSON data with no secondary indices
  • 17. Top Features Auto-scaling/sharding • Improved scalability and reliability due to distribution of large data sets across multiple machines Automatic indexing • All document properties are available for queries • Frees you from relying on schemas or secondary indexes SQL query language • Make use of SQL experience and .NET LINQ Managed service • Spin up on demand with no setup • Availability guarantee of 99.95% • Linear price curve without virtual-machine step functions • Integration with Azure HDInsight and Azure Search
  • 18. Top Features Greater consistency control • Four consistency levels provide more options for consistency, availability, and performance requirements Atomicity, Consistency, Isolation, and Durability (ACID) transaction control • Simpler programming model (compared to state variables) • Use JavaScript for insert, update, and delete actions Standards-based open API with RESTful HTTP • Uses JSON standard—no mapping of Binary JSON (BSON) to JSON needed Granular access rights • Allows access to all documents and attachments within collections
  • 19. Monitor an account • View performance metrics for a DocumentDB account • Customize performance metric views for a DocumentDB account • Create side-by-side performance metric charts • View usage metrics for a DocumentDB account • Set up performance metric alerts for a DocumentDB account
  • 20. Today’s modern apps • Produce and consume data at a staggering rate • Require instantaneous response times to match user expectations • Are developed iteratively with many versions supported concurrently • Are developed with continuously evolving data models • Are increasingly complex • Experience unpredictable and explosive growth
  • 21. Well-suited for web and mobile apps Catalog data Preferences and state Event store User-generated content Data exchange
  • 22. Azure DocumentDB at Microsoft More than 450 million unique users Store 20 TB of JSON document data Under 15 millisecond (ms) writes and single-digit ms reads Store for 40+ app/device combinations Available globally to serve all markets USER DATA STORE
  • 23. Standard pricing tier with hourly billing
  • 24. Azure DocumentDB basics Resource model • Entities addressable by logical Uniform Resource Identifier (URI) • Partitioned for scale out • Replicated for high availability • Entities represented as JSON • Accounts scale out by moving a slider Interaction model • RESTful interaction over HTTPS • HTTPS and TCP connectivity • Standard HTTP verbs and semantics Development • .NET, Node.js, Python, Java, and JavaScript clients • SQL for query expression, .NET LINQ • JavaScript for server-side app logic [Diagram: resource hierarchy. An Azure DocumentDB account contains databases; a database contains users (with permissions) and collections; a collection contains documents (with attachments), stored procedures, triggers, and user-defined functions written in JavaScript]
  • 25. Azure DocumentDB collections • Collections != tables • Unit of partitioning • Transaction boundary • No enforced schema, flexible • Documents that are queried or updated together stay in one collection • Elasticity to 10 GB • RUs evenly distributed across partitions [Diagram: a collection containing documents (with attachments), stored procedures, triggers, and user-defined functions written in JavaScript]
  • 26. … Elastic collections • Collection != single partition • Partition count dynamic • Each partition (key) is 10 GB • Online splits and merges with full availability • RUs evenly distributed across partitions
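A rough sketch of the two ideas on this slide: a partition key decides which partition a document lands in, and request units (RUs) are split evenly across partitions. The slides do not describe the service's actual hash function, so the FNV-1a-style hash below is an assumption chosen only for illustration, and the names `hashKey`, `partitionFor`, and `throughputPerPartition` are hypothetical.

```javascript
// Assumed hash for illustration only (FNV-1a style over UTF-16 code units).
// The real partitioning hash used by the service is not given in the slides.
function hashKey(text) {
  let hash = 0x811c9dc5;
  for (let i = 0; i < text.length; i++) {
    hash ^= text.charCodeAt(i);
    hash = Math.imul(hash, 0x01000193) >>> 0; // keep the value an unsigned 32-bit integer
  }
  return hash;
}

// Map a partition key to one of partitionCount partitions.
function partitionFor(partitionKey, partitionCount) {
  return hashKey(String(partitionKey)) % partitionCount;
}

// "RUs evenly distributed across partitions": each partition gets an equal share.
function throughputPerPartition(totalRUs, partitionCount) {
  return totalRUs / partitionCount;
}

// The same key always maps to the same partition.
console.log(partitionFor("user-42", 4) === partitionFor("user-42", 4)); // true
console.log(throughputPerPartition(10000, 4)); // 2500
```

One consequence of the even split: if most requests target a single partition key, that partition can exhaust its share while the others sit idle, which is why the choice of partition key matters.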
  • 27. Rich query over JSON data Native JavaScript transactional processing Familiar SQL-based query language Query on JSON data without specifying secondary indices or constructing views Build modern, scalable apps with robust transactional querying and data processing on JSON documents
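To make the query model concrete, here is a hedged sketch that pairs an illustrative query in the SQL dialect with an in-memory JavaScript analogue over the four sample documents from the "Azure DocumentDB" slide. The query text is never sent to a service, and the function name `namesOlderThan` is hypothetical; the code only mimics the result shape locally.

```javascript
// The four sample documents from the earlier slide; note that they do not share a schema.
const documents = [
  { name: "John", country: "Canada", age: 43, lastUse: "March 4, 2014" },
  { name: "Eva", country: "Germany", age: 25 },
  { name: "Lou", country: "Australia", age: 51, firstUse: "May 8, 2013" },
  { docCount: 3, last: "May 1, 2014" },
];

// Illustrative query text only; it is not executed in this sketch.
const queryText = "SELECT c.name FROM c WHERE c.age > 40";

// In-memory analogue: keep documents whose numeric age exceeds minAge, project the name.
function namesOlderThan(docs, minAge) {
  return docs
    .filter((doc) => typeof doc.age === "number" && doc.age > minAge)
    .map((doc) => ({ name: doc.name }));
}

console.log(queryText);
console.log(namesOlderThan(documents, 40)); // John and Lou
```

The fourth document has no `age` property and simply drops out of the result instead of causing an error, which is the behavior a schema-free query model relies on.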
  • 28. JavaScript transactions Transactionally process multiple documents with application-defined stored procedures and triggers • JavaScript as the procedural language • Language integrated • Execution wrapped in an implicit transaction • Preregistered and scoped to a collection • Performed with ACID guarantees • Triggers invoked as pre- or post-operations Stored procedures JS Triggers
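The following sketch shows the shape of a JavaScript stored procedure and the all-or-nothing behavior described above. `bulkInsert` follows the usual server-side pattern (`getContext()`, `getCollection()`, `createDocument`, `getResponse().setBody`). Everything in `runInMockTransaction` is a hypothetical in-memory stand-in written for this example so the code can run locally; it is not the service, and its conflict check and self-link value are assumptions.

```javascript
// Stored procedure: inserts documents one after another inside the implicit transaction.
function bulkInsert(docs) {
  const collection = getContext().getCollection();
  const response = getContext().getResponse();
  let count = 0;
  if (!docs || docs.length === 0) {
    response.setBody(0);
    return;
  }
  tryCreate(docs[count]);

  function tryCreate(doc) {
    const accepted = collection.createDocument(collection.getSelfLink(), doc, onCreated);
    if (!accepted) response.setBody(count); // request not queued; report progress so far
  }

  function onCreated(err) {
    if (err) throw err; // an unhandled error aborts the procedure and rolls back its writes
    count++;
    if (count >= docs.length) response.setBody(count);
    else tryCreate(docs[count]);
  }
}

// Hypothetical local stand-in for the server-side context (assumption, not the service).
function runInMockTransaction(procedure, args, store) {
  const staged = []; // writes become visible in store only if the procedure finishes
  let body;
  const context = {
    getCollection: () => ({
      getSelfLink: () => "dbs/demo/colls/demo",
      createDocument(link, doc, callback) {
        const exists = [...store, ...staged].some((d) => d.id === doc.id);
        if (exists) callback(new Error("Conflict on id " + doc.id));
        else { staged.push(doc); callback(null, doc); }
        return true;
      },
    }),
    getResponse: () => ({ setBody: (value) => { body = value; } }),
  };
  globalThis.getContext = () => context;
  try {
    procedure(...args);
  } catch (error) {
    return { committed: false, error: error.message }; // staged writes are discarded
  } finally {
    delete globalThis.getContext;
  }
  store.push(...staged);
  return { committed: true, body };
}

const store = [];
console.log(runInMockTransaction(bulkInsert, [[{ id: "1" }, { id: "2" }]], store));
console.log(runInMockTransaction(bulkInsert, [[{ id: "3" }, { id: "1" }]], store));
console.log(store.length); // 2: the second batch failed, so id "3" was not kept
```

The second call fails on the duplicate id after document "3" has already been staged, and the stand-in discards that staged write; this mirrors the ACID guarantee on the slide, where a procedure either applies all of its writes or none of them.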
  • 29. Reliable and predictable performance • Tunable consistency • Elastic scale • Fast, predictable performance: defined throughput levels that scale linearly with application needs. Azure DocumentDB is born in the cloud to achieve fast, predictable performance with reserved resources to deliver on throughput needs. It delivers reliable, tunable consistency to increase performance based on application needs.
  • 30. Four consistency levels: Strong, Session, Bounded Staleness, Eventual. Lower consistency level on read operations: Document myDoc = await client.ReadDocumentAsync(documentLink, new RequestOptions { ConsistencyLevel = ConsistencyLevel.Eventual });
  • 31. Consistency levels enable guarantees Choose your consistency level and make predictable trade-offs between consistency, availability, and performance. Choose your level: • Strong: data consistency • Session: monotonic reads (on explicit read requests) and writes • Bounded Staleness: total order of propagation of writes • Eventual: lowest latency for reads and writes
  • 32. Security model Azure DocumentDB is designed to be secure with: • Master key • Access control on resources • User operations • Permission operations • Code execution
  • 33. Rapid development • Easy to start and fully managed • Enterprise-grade Azure platform • Build with familiar tools: REST, JSON, and JavaScript Reduce development friction and complexity when building new business-class applications by using familiar tools and industry-standard platforms. Combine Azure DocumentDB with a portfolio of complementary cloud services on the Azure platform, such as the Azure HDInsight Connector and Azure Search Indexer.
  • 35. Azure DocumentDB service summary Unique among NoSQL stores: • Developed for the cloud and for delivery as a service • Truly queryable JSON store • Transactional processing through language-integrated JavaScript • Predictable performance and tunable consistency
  • 36. Development scenarios Consider Azure DocumentDB when you need: • To build new web and mobile cloud-based applications • Rapid development and high-scalability requirements • Query and processing of user- and device-generated data • More query and processing support for your key-value stores • To run a document store in virtual machines • A managed service model
  • 37. Build your first Azure DocumentDB app today Get support Schedule a 1:1 chat directly with the Azure DocumentDB engineering team at askdocdb.com Give feedback Ask questions through the forum at http://aka.ms/docdbforum Suggest an idea and vote to support other ideas for Azure DocumentDB at http://aka.ms/docdbideas On Twitter @documentdb Get started Sign up for Azure DocumentDB at http://aka.ms/docdbstart Access and configure your account at http://portal.azure.com Download an SDK from http://aka.ms/docdbsdks, and then build a sample at http://aka.ms/docdbsample
  • 38. Go to http://www.documentdb.com/sql/demo Test out sample queries or write your own against the dataset Using DocumentDB Query Playground
  • 39. Learn more David Chappell NoSQL overview paper on Infopedia http://www.davidchappell.com/writing/white_papers/Azure-NoSQL-Technologies-v2.0--Chappell.pdf Seven Databases in Seven Weeks: A Guide to Modern Databases and the NoSQL Movement [book] http://www.pdfiles.com/pdf/files/English/Databases/Seven_Databases_In_Seven_Weeks.pdf Replicated Data Consistency Explained Through Baseball [paper] http://research.microsoft.com/apps/pubs/default.aspx?id=206913
  • 40. Q & A ? James Serra, Big Data Evangelist Email me at: JamesSerra3@gmail.com Follow me at: @JamesSerra Link to me at: www.linkedin.com/in/JamesSerra Visit my blog at: JamesSerra.com (where this slide deck will be posted)