Michael Limcaco
Solutions Architect
Amazon Web Services
NoSQL in the Cloud: Amazon DynamoDB
Fast and durable at any scale
Databases in the Cloud
first a little context
Traditional Database Architecture
App/Web Tier
Client Tier
RDBMS
one database for
all workloads
• key-value access
• complex queries
• transactions
• analytics
Traditional Database Architecture
App/Web Tier
Client Tier
RDBMS
Data Tier
Cache Data Warehouse Blob Store
RDBMS NoSQL Search
Cloud Data Tier Architecture
App/Web Tier
Client Tier
best database for
each workload
Workload Driven Data Store Selection
Data Tier
Cache Data Warehouse Blob Store
RDBMS NoSQL Search
logging
rich search
key/value
simple query
hot reads
analytics
complex queries
& transactions
AWS Services for the Data Tier
Data Tier
Amazon
DynamoDB
Amazon
RDS
Amazon
ElastiCache
Amazon
S3
Amazon
CloudSearch
Amazon
Redshift
logging
rich search
key/value
simple query
hot reads
analytics
complex queries
& transactions
DynamoDB is a managed
NoSQL database service.
Store and retrieve any amount of data
Serve any level of request traffic
Consistent, predictable
performance.
Single digit millisecond latency.
Backed by solid-state drives.
Flexible data model.
Key-attribute pairs. No schema required.
Rich Tooling
SDK/Libraries
JSON-Based Web API
IDE Plugins
CLI
Without the operational
burden.
DynamoDB Customers
DynamoDB
Background
RDBMS = Default Choice
• An Amazon.com page is composed of responses from 1000s of independent services
• Query patterns differ from service to service
  - Catalog service is usually heavy key-value
  - Ordering service is very write intensive (key-value)
  - Catalog search has a different pattern for querying
Relational Era @ Amazon.com
RDBMS
Poor Availability
Limited Scalability
High Cost
Dynamo = NoSQL Technology
• Replicated DHT
• Consistent hashing
• Optimistic replication
• Quorum strategies
• Anti-entropy mechanisms
• Object versioning
Distributed Era @ Amazon.com
lack of strong consistency
every engineer needs to learn distributed systems
operational complexity
DynamoDB = NoSQL Cloud Service
Cloud Era @ Amazon.com
Seamless Scalability
Fast & Predictable Performance
Easy Administration
Streamlined Development
Cost Effective
partitions
1 .. N
table
• DynamoDB automatically partitions data by the hash key
  - Hash key spreads data (& workload) across partitions
• Auto-partitioning occurs with
  - Data set size growth
  - Provisioned capacity increases
Massive and Seamless Scale
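To make the idea of hash-key partitioning concrete, here is a minimal sketch. It assumes a SHA-256 digest taken modulo a fixed partition count; the hash function DynamoDB actually uses is internal to the service, and the `partition_for` and `spread` helpers are hypothetical names chosen for illustration.

```python
import hashlib
from collections import Counter


def partition_for(hash_key: str, num_partitions: int) -> int:
    """Map a hash key to a partition index in [0, num_partitions)."""
    # Assumption: SHA-256 stands in for the service's internal hash function.
    digest = hashlib.sha256(hash_key.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % num_partitions


def spread(hash_keys, num_partitions):
    """Count how many requests land on each partition."""
    return Counter(partition_for(k, num_partitions) for k in hash_keys)


if __name__ == "__main__":
    # Many distinct keys: the workload spreads across all partitions.
    print(spread([f"user-{i}" for i in range(10_000)], 4))
    # One hot key: every request lands on the same partition.
    print(spread(["user-1"] * 10_000, 4))
```

The second call shows why a large number of distinct hash keys matters: a single hot key concentrates the whole workload on one partition regardless of how many partitions exist.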
WRITES
Continuously replicated to 3 Facilities
Quorum acknowledgment
Persisted to disk (SSD)
READS
Strongly or eventually consistent
No trade-off in latency
Durable At Scale
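The quorum acknowledgment step can be sketched as simple arithmetic. The slide does not state the quorum size, so this assumes a plain majority of the three replicas; `quorum_size` and `write_acknowledged` are hypothetical helper names.

```python
def quorum_size(replicas: int) -> int:
    """Smallest number of acknowledgments that forms a majority."""
    return replicas // 2 + 1


def write_acknowledged(acks: int, replicas: int = 3) -> bool:
    """True once a majority of replicas have persisted the write."""
    return acks >= quorum_size(replicas)


if __name__ == "__main__":
    for acks in range(4):
        print(acks, write_acknowledged(acks))
```

Under this assumption a write to three facilities is acknowledged after two of them have persisted it, so one slow or unavailable facility does not block the write.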
Provisioned Throughput
• Request-based capacity provisioning model
• Throughput is declared and updated via the API or the console
  - CreateTable (foo, reads/sec = 100, writes/sec = 150)
  - UpdateTable (foo, reads/sec = 10000, writes/sec = 4500)
• DynamoDB handles the rest
  - Capacity is reserved and available when needed
  - Scaling up triggers repartitioning and reallocation
  - No impact to performance or availability
Predictable Performance
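The CreateTable and UpdateTable calls on the slide are shorthand. A sketch of the request parameters they correspond to is shown below, built as plain dictionaries so it runs without AWS credentials. The hash key name `id` and its string type are assumptions (the slide names only the table `foo`), and the slide's reads/sec and writes/sec are mapped to read and write capacity units, which match requests per second only for small items.

```python
def create_table_params(table_name, hash_key, reads_per_sec, writes_per_sec):
    """Build CreateTable parameters for a table with a string hash key."""
    return {
        "TableName": table_name,
        "AttributeDefinitions": [
            {"AttributeName": hash_key, "AttributeType": "S"},
        ],
        "KeySchema": [
            {"AttributeName": hash_key, "KeyType": "HASH"},
        ],
        "ProvisionedThroughput": {
            "ReadCapacityUnits": reads_per_sec,
            "WriteCapacityUnits": writes_per_sec,
        },
    }


def update_table_params(table_name, reads_per_sec, writes_per_sec):
    """Build UpdateTable parameters that change only the throughput."""
    return {
        "TableName": table_name,
        "ProvisionedThroughput": {
            "ReadCapacityUnits": reads_per_sec,
            "WriteCapacityUnits": writes_per_sec,
        },
    }


if __name__ == "__main__":
    # "id" is a hypothetical key name; the slide does not give one.
    print(create_table_params("foo", "id", 100, 150))
    print(update_table_params("foo", 10000, 4500))
```

With the AWS SDK for Python installed and configured, these dictionaries could be passed as keyword arguments, for example `boto3.client("dynamodb").create_table(**params)`.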
WRITES
Continuously replicated to 3 Facilities
Quorum acknowledgment
Persisted to disk (SSD)
READS
Strongly or eventually consistent
No trade-off in latency
Low Latency At Scale
Making life easier for developers…
• Developers are freed from managing:
  - Performance tuning (latency)
  - 3-way multi-facility replication
  - Scalability (and scaling operations)
  - Security inspections and patches
  - Software upgrades and patches
  - Hardware failover
  - Improvements to the underlying hardware
…and more!
Automated Operations
DynamoDB Primitives
DynamoDB Concepts
table
DynamoDB Concepts
table
items
DynamoDB Concepts
attributes
items
table
schema-less
schema is defined per attribute
DynamoDB Concepts
attributes
items
table
scalar data types
• number, string, and binary
multi-valued types
• string set, number set, and binary set
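As an illustration of these types, the sketch below wraps Python values in the type tags used by the DynamoDB JSON format (S, N, B, SS, NS, BS). The helper name `to_attribute_value` and the `Tags` attribute are hypothetical, and binary values are left as raw bytes here although they are base64-encoded on the wire.

```python
def to_attribute_value(value):
    """Wrap a Python value in its DynamoDB type tag (S, N, B, SS, NS, BS)."""
    if isinstance(value, bool):
        raise TypeError("only numbers, strings, binary and their sets are covered")
    if isinstance(value, str):
        return {"S": value}
    if isinstance(value, (int, float)):
        return {"N": str(value)}  # numbers travel as strings
    if isinstance(value, bytes):
        return {"B": value}
    if isinstance(value, (set, frozenset)) and value:
        members = sorted(value)  # sets are unordered; sort for stable output
        if all(isinstance(m, str) for m in members):
            return {"SS": members}
        if all(isinstance(m, (int, float)) and not isinstance(m, bool)
               for m in members):
            return {"NS": [str(m) for m in members]}
        if all(isinstance(m, bytes) for m in members):
            return {"BS": members}
    raise TypeError(f"unsupported value: {value!r}")


if __name__ == "__main__":
    raw = {"User_ID": "1", "Size": 10000, "Tags": {"photo", "holiday"}}
    item = {name: to_attribute_value(v) for name, v in raw.items()}
    print(item)
```

Because each attribute carries its own type tag, two items in the same table can hold different attributes without any table-level schema change.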
DynamoDB Concepts
hash
hash keys
mandatory for all items in a table
key-value access pattern
PutItem
UpdateItem
DeleteItem
BatchWriteItem
GetItem
BatchGetItem
Hash = Distribution Key
partition 1..N
hash keys
mandatory for all items in a table
key-value access pattern
determines data distribution
Hash = Distribution Key
large number of unique hash keys
uniform distribution of workload
across hash keys
optimal
schema
design
+
Range = Query
range
hash
range keys
model 1:N relationships
enable rich query capabilities
composite primary key
all items for a hash key
==, <, >, >=, <=
“begins with”
“between”
sorted results
counts
top / bottom N values
paged responses
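The range key conditions listed above can be emulated in memory to show why they are cheap: items that share a hash key are kept sorted by range key, so a condition becomes a slice of a sorted list. This is a sketch of the semantics only, not of the service's storage engine; `RangeTable` and the sample keys are hypothetical.

```python
from bisect import bisect_left, bisect_right


class RangeTable:
    """Items grouped by hash key and kept sorted by range key."""

    def __init__(self):
        self._rows = {}  # hash key -> sorted list of (range key, item)

    def put(self, hash_key, range_key, item):
        rows = self._rows.setdefault(hash_key, [])
        keys = [rk for rk, _ in rows]
        i = bisect_left(keys, range_key)
        if i < len(rows) and rows[i][0] == range_key:
            rows[i] = (range_key, item)  # same primary key: replace
        else:
            rows.insert(i, (range_key, item))

    def query(self, hash_key, low=None, high=None, prefix=None,
              limit=None, descending=False):
        """Return (range key, item) pairs for one hash key, in sorted order."""
        rows = self._rows.get(hash_key, [])
        keys = [rk for rk, _ in rows]
        start = 0 if low is None else bisect_left(keys, low)      # >= low
        stop = len(rows) if high is None else bisect_right(keys, high)  # <= high
        hits = rows[start:stop]
        if prefix is not None:                                    # "begins with"
            hits = [row for row in hits if row[0].startswith(prefix)]
        if descending:
            hits = hits[::-1]
        return hits[:limit]                                       # top / bottom N


if __name__ == "__main__":
    table = RangeTable()
    for name in ["video/c.mp4", "img/b.jpg", "img/a.jpg"]:
        table.put("u1", name, {"name": name})
    print(table.query("u1", prefix="img/"))
    print(table.query("u1", limit=1, descending=True))
```

Passing both `low` and `high` gives the "between" condition, and `limit` combined with `descending` gives the top or bottom N values for a hash key.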
Index Options
local secondary indexes (LSI)
alternate range key + same hash key
index and table data is co-located (same partition)
Projected Attributes
KEYS_ONLY
INCLUDE
ALL
Index Options
global secondary
indexes (GSI)
any attribute indexed as
new hash or range key
KEYS_ONLY
INCLUDE
ALL
Example Patterns
access pattern modeling
use case walk-thru
highlighted features
• Method
1. Describe the overall use case – maintain context
2. Identify the individual access patterns of the use case
3. Model each access pattern to its own discrete data set
4. Consolidate data sets into tables and indexes
• Benefits
  - Single table fetch for each query
  - Payloads are minimal for each access
Access Pattern Modeling
Multi-tenant application for file storing and sharing
• User_ID is the unique identifier of each user
• File_ID is the unique identifier of each file, owned by a user
Good PK selection: User_ID (hash) + File_ID (range)
use case access patterns data design
Design Use Case: Media Catalog
1. Users should be able to query all the files they own
2. Search by File Name
3. Search by File Type
4. Search by Date Range
5. Keep track of Shared Files
Design Use Case: Media Catalog
use case access patterns data design
additional (non-PK) attributes
& index candidates
Users
Hash key = User_ID
Attributes= User_Name
Email
Address
User_Files
Hash key = User_ID
Range key = File_ID
Attributes = Name
Size (N)
Date
Type
SharedFlag
Link
DynamoDB Data Model: Main Tables
User has file[]
+ Secondary Indexes
Table Name  Index Name       Attribute to Index  Projected Attributes
User_Files  NameIndex        Name                KEYS
User_Files  TypeIndex        Type                KEYS + Name
User_Files  DateIndex        Date                KEYS + Name
User_Files  SharedFlagIndex  SharedFlag          KEYS + Name
User_Files  SizeIndex        Size                KEYS + Name
example only – required data returned
determines optimal projections
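The index table above can be written out as index definitions in the shape that the CreateTable `LocalSecondaryIndexes` parameter expects. This is a sketch under the assumption that all five are local secondary indexes, since each keeps User_ID as its hash key; the `local_index` helper is a hypothetical name. "KEYS" maps to the KEYS_ONLY projection and "KEYS + Name" to INCLUDE with Name as the extra attribute.

```python
def local_index(index_name, hash_key, range_key, include=None):
    """Describe one local secondary index and its projection."""
    if include:
        projection = {"ProjectionType": "INCLUDE",
                      "NonKeyAttributes": list(include)}
    else:
        projection = {"ProjectionType": "KEYS_ONLY"}
    return {
        "IndexName": index_name,
        "KeySchema": [
            {"AttributeName": hash_key, "KeyType": "HASH"},
            {"AttributeName": range_key, "KeyType": "RANGE"},
        ],
        "Projection": projection,
    }


# One entry per row of the slide's index table.
USER_FILES_INDEXES = [
    local_index("NameIndex", "User_ID", "Name"),
    local_index("TypeIndex", "User_ID", "Type", include=["Name"]),
    local_index("DateIndex", "User_ID", "Date", include=["Name"]),
    local_index("SharedFlagIndex", "User_ID", "SharedFlag", include=["Name"]),
    local_index("SizeIndex", "User_ID", "Size", include=["Name"]),
]

if __name__ == "__main__":
    for index in USER_FILES_INDEXES:
        print(index["IndexName"], index["Projection"])
```

Projecting only the keys plus Name keeps each index small while still answering the "search for file name by ..." patterns without a second fetch from the table.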
• Find all files owned by a user
  - Query User_Files table (User_ID = “2”)
Access Pattern 1
User_ID (hash)  File_ID (range)  Name   Date        Type  SharedFlag  Size     Link
1               1                File1  2013-04-23  JPG               10000    bucket1
1               2                File2  2013-03-10  MP4   Y           1000000  bucket2
2               3                File3  2013-03-10  MP4   Y           2000000  bucket3
2               4                File4  2013-03-10  AVI               3000000  bucket4
3               5                File5  2013-04-10  MP3               40000    bucket5
• Search by File Name
  - Query
    • IndexName = “NameIndex”
    • User_ID = “1”
    • Name = “File1”
Access Pattern 2
User_ID (hash)  Name (range)  File_ID
1               File1         1
1               File2         2
2               File3         3
2               File4         4
3               File5         5
NameIndex
• Search for file name by file Type
  - Query
    • IndexName = “TypeIndex”
    • User_ID = “2”
    • Type = “MP4”
Access Pattern 3
User_ID (hash)  Type (range)  File_ID  Name
1               JPG           1        File1
1               MP4           2        File2
2               AVI           4        File4
2               MP4           3        File3
3               MP3           5        File5
projection
TypeIndex
• Search for file name by Date range
  - Query
    • IndexName = “DateIndex”
    • User_ID = “1”
    • Date between “2013-03-01” and “2013-03-29”
Access Pattern 4
User_ID (hash)  Date (range)  File_ID  Name
1               2013-03-10    2        File2
1               2013-04-23    1        File1
2               2013-03-10    3        File3
2               2013-03-10    4        File4
3               2013-04-10    5        File5
DateIndex
projection
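A sketch of how the date-range query might be expressed as Query request parameters follows. It uses the expression-style parameters of the current Query API rather than whatever form the talk used, assumes User_ID and Date are stored as strings, and uses `#`-prefixed placeholders because attribute names such as Date can collide with reserved words. The function name `date_range_query` is hypothetical.

```python
def date_range_query(user_id, start, end):
    """Build Query parameters for one user's files in a date range."""
    return {
        "TableName": "User_Files",
        "IndexName": "DateIndex",
        "KeyConditionExpression": "#uid = :uid AND #d BETWEEN :start AND :end",
        "ExpressionAttributeNames": {"#uid": "User_ID", "#d": "Date"},
        "ExpressionAttributeValues": {
            ":uid": {"S": user_id},
            ":start": {"S": start},
            ":end": {"S": end},
        },
    }


if __name__ == "__main__":
    print(date_range_query("1", "2013-03-01", "2013-03-29"))
```

Storing dates as ISO 8601 strings (YYYY-MM-DD) is what makes BETWEEN work here, because their lexicographic order matches chronological order.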
• Search for names of Shared files
  - Query
    • IndexName = “SharedFlagIndex”
    • User_ID = “1”
    • SharedFlag = “Y”
Access Pattern 5
User_ID (hash)  SharedFlag (range)  File_ID  Name
1               Y                   2        File2
2               Y                   3        File3
SharedFlagIndex
projection
• Schema-less
  - Only key information needed
  - Individual items can define their own set of attributes
• Consistent Reads
  - Inventory, shopping cart applications
• Atomic Counters
  - Increment and return new value in same operation
• Conditional Writes
  - Expected value before write – fails on mismatch
  - “state machine” use cases
Highlighted Features
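The semantics of atomic counters and conditional writes can be emulated in a few lines. The sketch below is an in-memory stand-in, not the service's implementation: a lock plays the role of the server-side atomicity, and `TinyStore`, `ConditionFailed`, and the order states are hypothetical names.

```python
import threading


class ConditionFailed(Exception):
    """Raised when the expected value does not match the stored value."""


class TinyStore:
    """In-memory stand-in showing counter and conditional-write semantics."""

    def __init__(self):
        self._items = {}
        self._lock = threading.Lock()

    def increment(self, key, attribute, by=1):
        """Atomic counter: add to a number and return the new value."""
        with self._lock:
            item = self._items.setdefault(key, {})
            item[attribute] = item.get(attribute, 0) + by
            return item[attribute]

    def conditional_set(self, key, attribute, expected, new_value):
        """Write new_value only if the attribute currently equals expected.

        expected=None means the attribute must not exist yet.
        """
        with self._lock:
            current = self._items.get(key, {}).get(attribute)
            if current != expected:
                raise ConditionFailed(f"expected {expected!r}, found {current!r}")
            self._items.setdefault(key, {})[attribute] = new_value


if __name__ == "__main__":
    store = TinyStore()
    print(store.increment("item-1", "votes"))
    store.conditional_set("order-1", "state", None, "PLACED")
    store.conditional_set("order-1", "state", "PLACED", "SHIPPED")
    try:
        # A stale writer still believes the order is PLACED.
        store.conditional_set("order-1", "state", "PLACED", "CANCELLED")
    except ConditionFailed as error:
        print("rejected:", error)
```

The rejected write at the end is the "state machine" case: a transition is accepted only from the state the writer expects, so two writers cannot both move the same order out of PLACED.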
Hadoop Integration
+ Amazon Elastic MapReduce (EMR)
Managed Hadoop service for
data-intensive workflows.
Define External Table (Hive)
create external table items_db
  (id string, votes bigint, views bigint)
stored by 'org.apache.hadoop.hive.dynamodb.DynamoDBStorageHandler'
tblproperties (
  "dynamodb.table.name" = "items",
  "dynamodb.column.mapping" = "id:id,votes:votes,views:views"
);
Query It
select id, votes, views
from items_db
order by views desc;
What Else?
autoscaling library
local testing and development
cross-region export / import
• Third party library for automating scaling decisions
• Scale up for service levels, scale down for cost
• CloudFormation template for fast deployment
Autoscaling with Dynamic DynamoDB
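The "scale up for service levels, scale down for cost" rule can be sketched as a threshold check on utilization. This is not Dynamic DynamoDB's actual algorithm or configuration; the function name `next_capacity`, the 80% and 30% thresholds, and the 50% step are all hypothetical values chosen for illustration.

```python
import math


def next_capacity(provisioned, consumed, upper=0.8, lower=0.3,
                  step=0.5, minimum=1):
    """Suggest a new provisioned capacity from current utilization."""
    utilization = consumed / provisioned  # provisioned must be positive
    if utilization > upper:
        # Scale up to protect service levels.
        return math.ceil(provisioned * (1 + step))
    if utilization < lower:
        # Scale down to save cost, never below the minimum.
        return max(minimum, math.floor(provisioned * (1 - step)))
    return provisioned


if __name__ == "__main__":
    print(next_capacity(100, 90))
    print(next_capacity(100, 10))
    print(next_capacity(100, 50))
```

The suggested value would then be applied through an UpdateTable call that changes only the provisioned throughput.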
• Cross-Region Export and Import
• DynamoDB Local
  - Disconnected development with full API support
    • No network
    • No usage costs
    • No SLA
• Geospatial and Transaction Libraries
• Fine-Grained Access Control
  - Direct-to-DynamoDB access for mobile devices
Other Key Features
Get started today!
aws.amazon.com/dynamodb/developer-resources/
Wrap-up
seamless scalability
predictable performance
always durable
automated operations
fast development
cost effective
= Managed NoSQL
Thank You
aws.amazon.com/dynamodb

Data Con LA 2022 - How are NFTs and DeFi Changing Entertainment
 
Data Con LA 2022 - Why Data Quality vigilance requires an End-to-End, Automat...
Data Con LA 2022 - Why Data Quality vigilance requires an End-to-End, Automat...Data Con LA 2022 - Why Data Quality vigilance requires an End-to-End, Automat...
Data Con LA 2022 - Why Data Quality vigilance requires an End-to-End, Automat...
 
Data Con LA 2022-Perfect Viral Ad prediction of Superbowl 2022 using Tease, T...
Data Con LA 2022-Perfect Viral Ad prediction of Superbowl 2022 using Tease, T...Data Con LA 2022-Perfect Viral Ad prediction of Superbowl 2022 using Tease, T...
Data Con LA 2022-Perfect Viral Ad prediction of Superbowl 2022 using Tease, T...
 
Data Con LA 2022- Embedding medical journeys with machine learning to improve...
Data Con LA 2022- Embedding medical journeys with machine learning to improve...Data Con LA 2022- Embedding medical journeys with machine learning to improve...
Data Con LA 2022- Embedding medical journeys with machine learning to improve...
 
Data Con LA 2022 - Data Streaming with Kafka
Data Con LA 2022 - Data Streaming with KafkaData Con LA 2022 - Data Streaming with Kafka
Data Con LA 2022 - Data Streaming with Kafka
 

La big datacamp-2014-aws-dynamodb-overview-michael_limcaco

  • 1. NoSQL in the Cloud: Amazon DynamoDB. Fast and durable at any scale. Michael Limcaco, Solutions Architect, Amazon Web Services.
  • 2. Databases in the Cloud: first, a little context.
  • 3. Traditional Database Architecture: Client Tier, App/Web Tier, RDBMS. One database for all workloads.
  • 4. Traditional Database Architecture: Client Tier, App/Web Tier, RDBMS. The single RDBMS serves key-value access, complex queries, transactions, and analytics.
  • 5. Cloud Data Tier Architecture: Client Tier, App/Web Tier, Data Tier (Cache, Data Warehouse, Blob Store, RDBMS, NoSQL, Search). Best database for each workload.
  • 6. Workload Driven Data Store Selection. Data Tier: Cache, Data Warehouse, Blob Store, RDBMS, NoSQL, Search. Workloads: logging; rich search; key/value and simple query; hot reads; analytics; complex queries & transactions.
  • 7. AWS Services for the Data Tier: Amazon DynamoDB, Amazon RDS, Amazon ElastiCache, Amazon S3, Amazon CloudSearch, Amazon Redshift. Workloads: logging; rich search; key/value and simple query; hot reads; analytics; complex queries & transactions.
  • 9. DynamoDB is a managed NoSQL database service. Store and retrieve any amount of data. Serve any level of request traffic.
  • 10. Consistent, predictable performance. Single-digit millisecond latency. Backed by solid-state drives.
  • 11. Flexible data model. Key-attribute pairs. No schema required.
  • 16. Relational Era @ Amazon.com: RDBMS = Default Choice. An Amazon.com page is composed of responses from thousands of independent services. Query patterns differ between services: the catalog service is usually heavy key-value; the ordering service is very write intensive (key-value); catalog search has a different pattern for querying. RDBMS drawbacks: poor availability, limited scalability, high cost.
  • 17. Distributed Era @ Amazon.com: Dynamo = NoSQL Technology. Replicated DHT; consistent hashing; optimistic replication; quorum strategies; anti-entropy mechanisms; object versioning. Drawbacks: lack of strong consistency; every engineer needs to learn distributed systems; operational complexity.
  • 18. Cloud Era @ Amazon.com: DynamoDB = NoSQL Cloud Service. Seamless scalability; fast & predictable performance; easy administration; streamlined development; cost effective.
  • 19. Massive and Seamless Scale: a table is spread over partitions 1..N. DynamoDB automatically partitions data by the hash key; the hash key spreads data (and workload) across partitions. Auto-partitioning occurs with data set size growth and with provisioned capacity increases.
  • 20. Durable At Scale. Writes: continuously replicated to 3 facilities; quorum acknowledgment; persisted to disk (SSD). Reads: strongly or eventually consistent; no trade-off in latency.
  • 21. Predictable Performance: Provisioned Throughput. Request-based capacity provisioning model. Throughput is declared and updated via the API or the console, for example CreateTable (foo, reads/sec = 100, writes/sec = 150) and UpdateTable (foo, reads/sec = 10000, writes/sec = 4500). DynamoDB handles the rest: capacity is reserved and available when needed; scaling up triggers repartitioning and reallocation; no impact to performance or availability.
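The CreateTable / UpdateTable lines on slide 21 are shorthand. As a rough sketch, the same two requests could be expressed as parameter dictionaries in the shape used by the AWS SDK for Python (boto3). The helper names are hypothetical, the hash key attribute `id` and its string type are assumptions (the slide names only the table `foo`), and nothing here is sent to AWS.

```python
def create_table_params(table, hash_key, reads_per_sec, writes_per_sec):
    """Build CreateTable parameters for a hash-key-only table (hypothetical helper)."""
    return {
        "TableName": table,
        # Assumption: the hash key is a string attribute ("S").
        "AttributeDefinitions": [{"AttributeName": hash_key, "AttributeType": "S"}],
        "KeySchema": [{"AttributeName": hash_key, "KeyType": "HASH"}],
        # Throughput is declared up front as read/write capacity units.
        "ProvisionedThroughput": {
            "ReadCapacityUnits": reads_per_sec,
            "WriteCapacityUnits": writes_per_sec,
        },
    }


def update_table_params(table, reads_per_sec, writes_per_sec):
    """Build UpdateTable parameters that only change the provisioned throughput."""
    return {
        "TableName": table,
        "ProvisionedThroughput": {
            "ReadCapacityUnits": reads_per_sec,
            "WriteCapacityUnits": writes_per_sec,
        },
    }


# The two calls from the slide, expressed as request parameters.
create_req = create_table_params("foo", "id", 100, 150)
update_req = update_table_params("foo", 10000, 4500)
```

With boto3 installed and credentials configured, these dictionaries could be passed on as `client.create_table(**create_req)` and `client.update_table(**update_req)`.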
  • 22. Low Latency At Scale. Writes: continuously replicated to 3 facilities; quorum acknowledgment; persisted to disk (SSD). Reads: strongly or eventually consistent; no trade-off in latency.
  • 23. Automated Operations: making life easier for developers. Developers are freed from managing: performance tuning (latency); 3-way multi-facility replication (automatic); scalability (and scaling operations); security inspections, patches, upgrades; software upgrades, patches; hardware failover (automatic); improving the underlying hardware; ...and more!
  • 28. DynamoDB Concepts: a table contains items; items contain attributes. Scalar data types: number, string, and binary. Multi-valued types: string set, number set, and binary set.
  • 29. DynamoDB Concepts: hash keys. Mandatory for all items in a table; key-value access pattern. Operations: PutItem, UpdateItem, DeleteItem, BatchWriteItem, GetItem, BatchGetItem.
  • 30. Hash = Distribution Key (partitions 1..N). Hash keys are mandatory for all items in a table, give a key-value access pattern, and determine data distribution.
  • 31. Hash = Distribution Key: large number of unique hash keys + uniform distribution of workload across hash keys = optimal schema design.
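To make the distribution point concrete, here is a minimal sketch of hash-based partition assignment. It assumes an MD5-modulo scheme purely for illustration; the slides do not describe DynamoDB's internal hash function, and `partition_for` and `load_per_partition` are hypothetical names.

```python
import hashlib
from collections import Counter


def partition_for(hash_key: str, num_partitions: int) -> int:
    """Map a hash key to a partition number (illustrative scheme, not DynamoDB's)."""
    digest = hashlib.md5(hash_key.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % num_partitions


def load_per_partition(hash_keys, num_partitions):
    """Count how many requests land on each partition."""
    counts = Counter(partition_for(k, num_partitions) for k in hash_keys)
    return [counts.get(p, 0) for p in range(num_partitions)]


# Many unique hash keys: requests spread across all partitions.
many_keys = [f"user-{i}" for i in range(10_000)]
spread = load_per_partition(many_keys, 4)

# One hot hash key: every request lands on the same partition.
hot_key = ["user-1"] * 10_000
skewed = load_per_partition(hot_key, 4)

print("many unique keys:", spread)
print("single hot key:  ", skewed)
```

The second list has all 10,000 requests on one partition, which is the skew that a large number of unique, evenly used hash keys avoids.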
  • 32. Range = Query. Range keys model 1:N relationships and enable rich query capabilities. Composite primary key = hash + range. Supported queries: all items for a hash key; ==, <, >, >=, <=; "begins with"; "between"; sorted results; counts; top / bottom N values; paged responses.
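The query capabilities on slide 32 follow from keeping each hash key's items sorted by range key. The toy in-memory table below is a sketch of that idea only, not of DynamoDB's implementation; the class name, method names, and parameters are hypothetical, and the sample rows reuse the file dates from the media catalog example later in the deck.

```python
from bisect import bisect_left, bisect_right


class RangeKeyedTable:
    """Toy table: hash key -> list of (range_key, item) kept sorted by range key."""

    def __init__(self):
        self._partitions = {}

    def put(self, hash_key, range_key, item):
        rows = self._partitions.setdefault(hash_key, [])
        keys = [rk for rk, _ in rows]
        i = bisect_left(keys, range_key)
        if i < len(rows) and rows[i][0] == range_key:
            rows[i] = (range_key, item)      # same composite key: overwrite
        else:
            rows.insert(i, (range_key, item))

    def query(self, hash_key, low=None, high=None, prefix=None,
              limit=None, descending=False):
        """Return (range_key, item) pairs for one hash key, sorted by range key."""
        rows = self._partitions.get(hash_key, [])
        keys = [rk for rk, _ in rows]
        start = 0 if low is None else bisect_left(keys, low)
        end = len(rows) if high is None else bisect_right(keys, high)
        selected = rows[start:end]            # "between" (inclusive bounds)
        if prefix is not None:                # "begins with"
            selected = [r for r in selected if r[0].startswith(prefix)]
        if descending:                        # bottom N instead of top N
            selected = selected[::-1]
        return selected[:limit]               # limit=None returns everything


table = RangeKeyedTable()
table.put("1", "2013-04-23", "File1")
table.put("1", "2013-03-10", "File2")
table.put("2", "2013-03-10", "File3")

everything = table.query("1")
march = table.query("1", low="2013-03-01", high="2013-03-29")
newest = table.query("1", limit=1, descending=True)
april = table.query("1", prefix="2013-04")
```

Every query names exactly one hash key; the range key conditions only narrow the sorted list inside that key.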
  • 33. Index Options: local secondary indexes (LSI). Alternate range key + same hash key; index and table data are co-located (same partition).
  • 37. Index Options: global secondary indexes (GSI). Any attribute indexed as new hash or range key. Projection options: KEYS_ONLY, INCLUDE, ALL.
  • 38. Example Patterns: access pattern modeling; use case walk-thru; highlighted features.
  • 39. Access Pattern Modeling. Method: (1) describe the overall use case, to maintain context; (2) identify the individual access patterns of the use case; (3) model each access pattern to its own discrete data set; (4) consolidate data sets into tables and indexes. Benefits: single table fetch for each query; payloads are minimal for each access.
  • 40. Design Use Case: Media Catalog (steps: use case, access patterns, data design). Multi-tenant application for file storing and sharing. User_ID is the unique identifier of each user; File_ID is the unique identifier of each file, owned by a user. Good PK selection: User_ID (hash) + File_ID (range).
  • 41. Design Use Case: Media Catalog, access patterns: (1) users should be able to query all the files they own; (2) search by file name; (3) search by file type; (4) search by date range; (5) keep track of shared files.
  • 42. Design Use Case: Media Catalog, data design: the access patterns above point to the additional (non-PK) attributes and index candidates.
  • 43. DynamoDB Data Model: Main Tables (a user has file[]).
        Users:      hash key = User_ID; attributes = User_Name, Email, Address
        User_Files: hash key = User_ID; range key = File_ID; attributes = Name, Size (N), Date, SharedFlag, Link
  • 44. + Secondary Indexes (example only; the data that must be returned determines the optimal projections).
        Table Name   Index Name        Attribute to Index   Projected Attribute
        User_Files   NameIndex         Name                 KEYS
        User_Files   TypeIndex         Type                 KEYS + Name
        User_Files   DateIndex         Date                 KEYS + Name
        User_Files   SharedFlagIndex   SharedFlag           KEYS + Name
        User_Files   SizeIndex         Size                 KEYS + Name
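The index table on slide 44 can be written down as index definitions in the shape accepted by the `LocalSecondaryIndexes` parameter of CreateTable. This is a sketch under one assumption: that all five are local secondary indexes (same hash key `User_ID`, alternate range key), which matches the index layouts shown in the access-pattern slides. The helper `local_index` is a hypothetical name.

```python
def local_index(index_name, range_attr, include=()):
    """Describe one local secondary index on User_Files (hypothetical helper)."""
    if include:
        # "KEYS + Name" on the slide: keys plus the listed non-key attributes.
        projection = {"ProjectionType": "INCLUDE",
                      "NonKeyAttributes": list(include)}
    else:
        # "KEYS" on the slide: only table and index key attributes.
        projection = {"ProjectionType": "KEYS_ONLY"}
    return {
        "IndexName": index_name,
        "KeySchema": [
            {"AttributeName": "User_ID", "KeyType": "HASH"},    # same hash key as the table
            {"AttributeName": range_attr, "KeyType": "RANGE"},  # alternate range key
        ],
        "Projection": projection,
    }


USER_FILES_INDEXES = [
    local_index("NameIndex", "Name"),
    local_index("TypeIndex", "Type", include=["Name"]),
    local_index("DateIndex", "Date", include=["Name"]),
    local_index("SharedFlagIndex", "SharedFlag", include=["Name"]),
    local_index("SizeIndex", "Size", include=["Name"]),
]
```

Projecting only `Name` keeps each index small, at the price of a second fetch from the table whenever a query needs other attributes.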
  • 54. Access Pattern 1: find all files owned by a user. Query the User_Files table (User_ID = "2"). In the table, "-" marks an unset SharedFlag.
        User_ID (hash)   File_ID (range)   Name    Date         Type   SharedFlag   Size      Link
        1                1                 File1   2013-04-23   JPG    -            10000     bucket1
        1                2                 File2   2013-03-10   MP4    Y            1000000   bucket2
        2                3                 File3   2013-03-10   MP4    Y            2000000   bucket3
        2                4                 File4   2013-03-10   AVI    -            3000000   bucket4
        3                5                 File5   2013-04-10   MP3    -            40000     bucket5
  • 56. Access Pattern 2: search by file name. Query with IndexName = "NameIndex", User_ID = "1", Name = "File1".
        NameIndex
        User_ID (hash)   Name (range)   File_ID
        1                File1          1
        1                File2          2
        2                File3          3
        2                File4          4
        3                File5          5
  • 58. Access Pattern 3: search for file name by file type. Query with IndexName = "TypeIndex", User_ID = "2", Type = "MP4". Name is a projected attribute.
        TypeIndex
        User_ID (hash)   Type (range)   File_ID   Name
        1                JPG            1         File1
        1                MP4            2         File2
        2                MP4            3         File3
        2                AVI            4         File4
        3                MP3            5         File5
  • 60. Access Pattern 4: search for file name by date range. Query with IndexName = "DateIndex", User_ID = "1", Date between "2013-03-01" and "2013-03-29". Name is a projected attribute.
        DateIndex
        User_ID (hash)   Date (range)   File_ID   Name
        1                2013-03-10     2         File2
        1                2013-04-23     1         File1
        2                2013-03-10     3         File3
        2                2013-03-10     4         File4
        3                2013-04-10     5         File5
  • 62. Access Pattern 5: search for names of shared files. Query with IndexName = "SharedFlagIndex", User_ID = "1", SharedFlag = "Y". Name is a projected attribute.
        SharedFlagIndex
        User_ID (hash)   SharedFlag (range)   File_ID   Name
        1                Y                    2         File2
        2                Y                    3         File3
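The five access patterns can be sketched as Query request parameters. The builder below is a hypothetical helper that only assembles dictionaries in the shape of the current low-level Query API (`KeyConditionExpression` with name and value placeholders); it sends nothing to AWS. It assumes every key attribute is stored as a string, as the quoted values on the slides suggest. The `#h` / `#r` placeholders also sidestep reserved-word clashes for attribute names such as `Name` and `Date`.

```python
def query_params(user_id, index_name=None, range_attr=None,
                 equals=None, between=None):
    """Build Query parameters for User_Files or one of its indexes (hypothetical helper)."""
    names = {"#h": "User_ID"}
    values = {":h": {"S": user_id}}          # assumption: keys are strings
    condition = "#h = :h"                    # every query names one hash key

    if range_attr is not None and equals is not None:
        names["#r"] = range_attr
        values[":r"] = {"S": equals}
        condition += " AND #r = :r"
    elif range_attr is not None and between is not None:
        names["#r"] = range_attr
        values[":lo"] = {"S": between[0]}
        values[":hi"] = {"S": between[1]}
        condition += " AND #r BETWEEN :lo AND :hi"

    params = {
        "TableName": "User_Files",
        "KeyConditionExpression": condition,
        "ExpressionAttributeNames": names,
        "ExpressionAttributeValues": values,
    }
    if index_name is not None:
        params["IndexName"] = index_name
    return params


pattern1 = query_params("2")                                        # all files of user 2
pattern2 = query_params("1", "NameIndex", "Name", equals="File1")
pattern3 = query_params("2", "TypeIndex", "Type", equals="MP4")
pattern4 = query_params("1", "DateIndex", "Date",
                        between=("2013-03-01", "2013-03-29"))
pattern5 = query_params("1", "SharedFlagIndex", "SharedFlag", equals="Y")
```

Each pattern is a single Query against one table or one index, which is the "single table fetch for each query" benefit named on slide 39.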
  • 64. Highlighted Features. Schema-less: only key information is needed; individual items can define their own set of attributes. Consistent reads: inventory, shopping cart applications. Atomic counters: increment and return the new value in the same operation. Conditional writes: an expected value is checked before the write, which fails on mismatch; useful for "state machine" use cases.
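Atomic counters and conditional writes are easiest to see in a small simulation. The class below is a single-process toy with hypothetical names (`TinyTable`, `increment`, `conditional_update`); it mimics the semantics described on slide 64 and is not the DynamoDB API, where the service itself guarantees atomicity.

```python
class ConditionFailed(Exception):
    """Raised when the expected value does not match the stored value."""


class TinyTable:
    def __init__(self):
        self._items = {}

    def put(self, key, item):
        self._items[key] = dict(item)

    def get(self, key):
        return dict(self._items[key])

    def increment(self, key, attr, delta=1):
        """Atomic counter: add delta and return the new value in one operation."""
        item = self._items.setdefault(key, {})
        item[attr] = item.get(attr, 0) + delta
        return item[attr]

    def conditional_update(self, key, attr, expected, new_value):
        """Conditional write: apply only if the current value equals `expected`."""
        item = self._items.setdefault(key, {})
        if item.get(attr) != expected:
            raise ConditionFailed(f"{attr} is {item.get(attr)!r}, expected {expected!r}")
        item[attr] = new_value


table = TinyTable()
table.put("item-1", {"views": 0, "state": "PENDING"})

first = table.increment("item-1", "views")
second = table.increment("item-1", "views")

# State machine: PENDING -> SHIPPED succeeds once.
table.conditional_update("item-1", "state", "PENDING", "SHIPPED")

# A second writer that still expects PENDING is rejected.
try:
    table.conditional_update("item-1", "state", "PENDING", "CANCELLED")
    rejected = False
except ConditionFailed:
    rejected = True
```

The rejected second transition is what makes conditional writes suitable for state machines: a stale writer cannot overwrite a state it has not seen.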
  • 65. Hadoop Integration: DynamoDB + Amazon Elastic MapReduce (EMR), a managed Hadoop service for data-intensive workflows.
  • 66. Define External Table (Hive):
        create external table items_db (id string, votes bigint, views bigint)
        stored by 'org.apache.hadoop.hive.dynamodb.DynamoDBStorageHandler'
        tblproperties ("dynamodb.table.name" = "items",
                       "dynamodb.column.mapping" = "id:id,votes:votes,views:views");
  • 67. Query It:
        select id, votes, views from items_db order by views desc;
  • 68. What Else? Autoscaling; local testing and development; cross-region export / import; library.
  • 69. Autoscaling with Dynamic DynamoDB: third-party library for automating scaling decisions; scale up for service levels, scale down for cost; CloudFormation template for fast deployment.
  • 70. Other Key Features. Cross-region export and import. DynamoDB Local: disconnected development with full API support (no network, no usage costs, no SLA). Geospatial and transaction libraries. Fine-grained access control: direct-to-DynamoDB access for mobile devices. Get started today! aws.amazon.com/dynamodb/developer-resources/
  • 72. Seamless scalability + predictable performance + always durable + automated operations + fast development + cost effective = Managed NoSQL.