3. About Basho
Our Mission is to Be The Leader in Distributed Systems
• Founded January 2008
• 115+ employees
• Headquartered in Cambridge, with
regional offices in San Francisco,
Washington DC, London and Tokyo
• Makers of Riak & Riak CS
• Thousands of Users Worldwide including
over 20% of the Fortune 50
• 30,000+ downloads per month, up
from 19,500 in Dec 2011
• Strategic Partners include Citrix, IDC
Frontier, Yahoo! Japan, and Microsoft
4. Product Suite
Basho’s distributed systems technologies
Riak: Open Source Distributed Database
• Always-available, scalable, ops-friendly NoSQL database
• Over 35,000 downloads per month
• Thousands of users worldwide
• Version 1.0 unveiled September 2011
Riak EnterpriseDS: Commercial Distributed Database
• Adds multi-data center replication & Basho 24x7 support
• Enables Secondary Search and MapReduce Clusters
• Requires commercial contract and secure download
• Version 1.4 in May 2013
Riak CS: Distributed Cloud Storage Platform
• Expands with multi-tenancy, large object support, metering and Amazon S3 API
• Enables public and private cloud storage
• Software released to open source on March 20th
6. Riak CS
is... enterprise cloud storage built on top of Riak
• S3-compatibility
• multi-tenancy
• per user reporting
• large object storage
7. Enabling you to host your own
PUBLIC & PRIVATE CLOUDS
or... Reliable Storage Behind Apps
8. Riak CS Use Cases
• Public Cloud Storage
• S3 Without AWS
• Cloud Drive (General Content Storage)
• Backup-as-a-Service
• Archival and Preservation
• Storage for Internal Clients / Depts.
17. Riak
Masterless
• No master/slave or different roles
• All nodes are equal
• Write availability and scalability
• All nodes can accept/route requests
18. Riak
No Sharding
• Consistent hashing
• Prevents “hot spots”
• Lowers operational burden of scale
• Data rebalanced automatically
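The consistent-hashing idea above can be sketched in a few lines of Python. This is a simplified illustration, not Riak's actual implementation: the partition count, the round-robin partition assignment, the way the bucket and key are joined before hashing, and the names `partition_for`, `assign_partitions`, and `owner` are all assumptions made for the example.

```python
import hashlib
from collections import Counter

RING_SIZE = 2 ** 160     # size of the SHA-1 output space
NUM_PARTITIONS = 64      # assumed partition count for this sketch


def partition_for(bucket, key):
    """Hash bucket/key onto the ring and return the index of the partition that owns it."""
    digest = hashlib.sha1(f"{bucket}/{key}".encode("utf-8")).digest()
    position = int.from_bytes(digest, "big")
    return position * NUM_PARTITIONS // RING_SIZE


def assign_partitions(nodes):
    """Hand out partitions round-robin so every node owns an equal share of the ring."""
    return {p: nodes[p % len(nodes)] for p in range(NUM_PARTITIONS)}


def owner(ring, bucket, key):
    """Look up the node responsible for a bucket/key pair."""
    return ring[partition_for(bucket, key)]


nodes = ["node1", "node2", "node3", "node4"]
ring = assign_partitions(nodes)

# Spread 10,000 keys over the ring and count how many land on each node.
counts = Counter(owner(ring, "pavan-test", f"object-{i}") for i in range(10_000))
for node in nodes:
    print(node, counts[node])
```

Because the hash spreads keys evenly over the partitions and each node owns the same number of partitions, no node becomes a hot spot, and no one has to choose shard boundaries by hand.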
19. Riak
Availability and Fault-Tolerance
• Automatically replicates data
• Read and write data during
hardware failure and network
partition
• Hinted handoff
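A rough sketch of how replication and hinted handoff fit together is shown below. It assumes a replication factor of 3 and a ring represented as a plain list of node names; `preference_list` and `handoff` are hypothetical helper names for this example, not Riak functions.

```python
N_VAL = 3  # assumed replication factor for this sketch


def preference_list(ring_nodes, start, down=frozenset(), n_val=N_VAL):
    """Return (node, hint) pairs describing where each replica of a write goes.

    The first n_val nodes clockwise from `start` are the primaries. When a
    primary is down, the next healthy node further around the ring accepts
    the replica and records, as a hint, which primary it is standing in for.
    """
    size = len(ring_nodes)
    primaries = [ring_nodes[(start + i) % size] for i in range(n_val)]
    fallbacks = [ring_nodes[(start + i) % size] for i in range(n_val, size)]
    fallbacks = [node for node in fallbacks if node not in down]

    targets = []
    for node in primaries:
        if node not in down:
            targets.append((node, None))               # healthy primary
        elif fallbacks:
            targets.append((fallbacks.pop(0), node))   # fallback holding a hint
    return targets


def handoff(targets, recovered):
    """Once a primary is back, the fallback hands its hinted replica over to it."""
    return [(hint, None) if hint in recovered else (node, hint)
            for node, hint in targets]


nodes = ["node1", "node2", "node3", "node4", "node5"]
targets = preference_list(nodes, start=0, down={"node2"})
print(targets)    # node4 stands in for node2, so the write still succeeds
restored = handoff(targets, recovered={"node2"})
print(restored)   # the replica is back on node2
```

The write is accepted even though one of its three primaries is unreachable, which is what keeps reads and writes available during a hardware failure or network partition.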
23. Basic Operations
S3 API Commands
s3cmd mb s3://pavan-test
s3cmd ls s3://pavan-test
s3cmd put README.txt s3://pavan-test
s3cmd get s3://pavan-test/README.txt
25. Riak CS
Large Object Support
• Started with 5GB / object
• Now have multipart upload
• Content agnostic
26. Riak CS
S3-Compatible API
• Use existing S3 libraries and tools
• RESTful operations
• Multipart upload
• S3-style ACLs for object/bucket
permissions
• S3 authentication scheme
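As an illustration of the S3 authentication scheme, the sketch below builds an `Authorization` header using the HMAC-SHA1 request signing defined by the original S3 REST API (often called signature version 2). That this is the variant in use is an assumption, and the credentials shown are placeholders, not real keys. `sign_request` is a hypothetical helper name.

```python
import base64
import hashlib
import hmac


def sign_request(access_key, secret_key, method, resource, date,
                 content_md5="", content_type="", amz_headers=None):
    """Build an S3-style 'Authorization' header value (signature version 2)."""
    # x-amz-* headers are lowercased, sorted by name, and joined one per line.
    pairs = sorted((name.lower(), value.strip())
                   for name, value in (amz_headers or {}).items())
    canonical_amz = "".join(f"{name}:{value}\n" for name, value in pairs)

    string_to_sign = (f"{method}\n{content_md5}\n{content_type}\n{date}\n"
                      f"{canonical_amz}{resource}")

    digest = hmac.new(secret_key.encode("utf-8"),
                      string_to_sign.encode("utf-8"),
                      hashlib.sha1).digest()
    signature = base64.b64encode(digest).decode("ascii")
    return f"AWS {access_key}:{signature}"


# Placeholder credentials and date, chosen only for this example.
header = sign_request(
    access_key="EXAMPLE-KEY-ID",
    secret_key="example-secret-key",
    method="GET",
    resource="/pavan-test/README.txt",
    date="Tue, 27 Mar 2007 19:36:42 +0000",
)
print(header)
```

Existing S3 libraries and tools perform this signing internally, which is why they can typically be pointed at an S3-compatible endpoint by changing only the host and credentials.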
27. Riak CS
Administration and Users
• Interface for user creation,
deletion, and credentials
• Configure so only admins can
create users
28. Riak CS
New Stuff in Riak CS 1.3
• Multipart upload: parts between
5MB and 5GB
• Restrict access to buckets based on
source IP
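To make the multipart size limits concrete, here is a small sketch that splits an object into upload parts. It assumes the limits are binary megabytes and gigabytes and that, as in S3, only the final part may be smaller than the minimum; `plan_parts` and the 64 MB default part size are choices made for this example.

```python
MIN_PART = 5 * 1024 ** 2   # 5 MB lower bound per part (assumed binary units)
MAX_PART = 5 * 1024 ** 3   # 5 GB upper bound per part


def plan_parts(object_size, part_size=64 * 1024 ** 2):
    """Return (part_number, offset, length) tuples covering the whole object.

    Every part except the last is exactly part_size bytes; the last part
    carries whatever remains and may be smaller than MIN_PART.
    """
    if not MIN_PART <= part_size <= MAX_PART:
        raise ValueError("part_size must be between 5 MB and 5 GB")
    if object_size <= 0:
        raise ValueError("object_size must be positive")

    parts = []
    offset = 0
    number = 1
    while offset < object_size:
        length = min(part_size, object_size - offset)
        parts.append((number, offset, length))
        offset += length
        number += 1
    return parts


# A 150 MB object with 64 MB parts yields two full parts and one 22 MB tail.
parts = plan_parts(150 * 1024 ** 2)
for number, offset, length in parts:
    print(number, offset, length)
```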
29. Multi-Datacenter Replication
• For active backups, availability
zones, disaster recovery, global
traffic
• Real-time or full-sync
• 24/7 support
• Per-node or storage-based pricing
“[deploying Riak CS] reduces the risk of using AWS and allows
customers to store their data in their own data centers, on their
own terms.” - Alex Williams, TechCrunch
34. QUERY USAGE STATS
Storage and access statistics tracked
on per-user basis, as rollups for slices
of time
• Operations, Count, BytesIn,
BytesOut, + system and user error
• Objects, Bytes
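The per-user rollup described above could be computed along these lines. The record layout, the operation names, and the sample numbers are invented for illustration; only the field names Count, BytesIn, and BytesOut come from the slide. `rollup` is a hypothetical helper.

```python
from collections import defaultdict

FIELDS = ("Count", "BytesIn", "BytesOut")


def rollup(samples):
    """Sum access statistics per (user, operation) across all time slices."""
    totals = defaultdict(lambda: dict.fromkeys(FIELDS, 0))
    for sample in samples:
        bucket = totals[(sample["user"], sample["operation"])]
        for field in FIELDS:
            bucket[field] += sample.get(field, 0)
    return dict(totals)


# Made-up samples: each dict stands for one user's activity in one time slice.
samples = [
    {"user": "alice", "operation": "ObjectGet", "Count": 10, "BytesOut": 4096},
    {"user": "alice", "operation": "ObjectGet", "Count": 5, "BytesOut": 1024},
    {"user": "alice", "operation": "ObjectPut", "Count": 2, "BytesIn": 2048},
]

totals = rollup(samples)
for (user, operation), stats in sorted(totals.items()):
    print(user, operation, stats)
```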