SlideShare une entreprise Scribd logo
1  sur  41
How to Tell if Your Business
Needs NoSQL
Robin Schumacher
VP Products
• Founded in April 2010
• The Apache Cassandra™ company
• Home to Apache Cassandra Chair & most committers
• Cassandra is a massively scalable NoSQL database
• Provide enterprise-class big data platform based on
Cassandra
• 270+ customers
• Headquartered in San Francisco Bay area
• Funded by prominent venture firms
Overview of DataStax
Serving Every Industry
Leading in Performance
http://techblog.netflix.com/2011/11/benchmarking-cassandra-scalability-
on.html
Netflix Cloud Benchmark…
“In terms of scalability, there is a clear winner throughout
our experiments. Cassandra achieves the highest
throughput for the maximum number of nodes in all
experiments with a linear increasing throughput.”
Solving Big Data Challenges for Enterprise Application Performance Management, Tilman Rable, et al., August
2013, p. 10. Benchmark paper presented at the Very Large Database Conference, 2013.
http://vldb.org/pvldb/vol5/p1724_tilmannrabl_vldb2013.pdf
End Point Independent NoSQL Benchmark
Highest in throughput…
Lowest in latency…
NoSQL Momentum
“According to analysis by
Wikibon‟s David Floyer (and
highlighted in the Wall
Street Journal), the NoSQL
database market is
expected to grow at a
compound annual growth
rate of nearly 60% between
2011 and 2017. The SQL
slice of the Big Data market,
in contrast, will grow at just
a 26% CAGR during that
same time period.”
NoSQL Momentum
“NoSQL is the stuff of the Internet
Age.”
- Andrew
Oliver, InfoWorld
But Does My Business Need NoSQL…?
Just because a technology
appears to be having
strong adoption in the
market, that doesn‟t mean
it‟s right for your
business…
What is NoSQL…?
• Progressive data management
engines
• Go beyond legacy relational
databases
• Flexible data model
• Horizontal scalability
• Distributed architectures
• Use of languages and
interfaces that are “not only”
SQL
NoSQL Example – Apache Cassandra
Apache Cassandra is a massively scalable NoSQL database that
offers continuous availability and easy data distribution.
NoSQL Example – Apache Cassandra
“Cassandra stands at the front of the NoSQL pack when it
comes to supporting real-time, big data applications.”
– Wikibon
How Can I Tell if NoSQL Can Help Me Run My
Business and Reduce Costs?
©2013 DataStax Confidential. Do not distribute without consent.
NoSQL Business Considerations
• Need scale-out (vs. scale-up)?
• Manage different types of data like social media?
• Lots of data coming in (and fast)?
• Have non-RDBMS, non-ACID transactions?
• Must keep large data volumes online?
• Continuous uptime necessary?
• Wide-scale data distribution needed?
• Need to integrate different systems?
• Cost a factor?
Need Scale-Out (vs. Scale-Up)?
No
• Application does not require multiple machines
• Can scale-up and meet the application’s current and future needs
Yes
• Application demands divide-and-conquer
• Capacity expansion is best/can only be handled via new machines
Key takeaway: If your applications can easily run on one machine, fit all your
data in RAM or can easily expand via new cores/more drives to fulfill current
and future requirements, you may not need NoSQL…
NoSQL Case Study
Ooyala distributes and analyzes video content for companies like
ESPN, Rolling Stone and others. They track about one quarter of all
online video viewers each day and generate 1-2 billion events that are
streaming in real-time through their system.
Manage Different Types of Data?
No
• No non-structured data (all or mostly rigid formats)
• E.G. No social media data
Yes
• All types of data (structured, semi, and unstructured)
• Social media data
Key takeaway: If all your data systems deal with standard RDBMS structured
data and that won‟t be changing, then you may not need NoSQL…
NoSQL Case Study
HealthCare Anytime needs to analyze doctor’s notes and other types
of difficult data to properly bill back Medicare / Medicaid.
NoSQL Case Study
“Cassandra‟s NoSQL data model allows us to insert and query data much more
naturally than what we had previously. The analysts who routinely use this data were
impressed with the flexibility and speed at which the queries came back.”
– CSC/NASA
Lots of Data Coming In (and Fast)?
No
• No high velocity data (e.g. device, sensors, web streaming, etc.)
• No multiple locations
• Little/no concern about write speed
Yes
• High velocity, write intensive
• Multiple locations sending data
• Must consume data as quickly as possible
Key takeaway: Business applications involving rapid time series data, device
„exhaust‟, web or financial streaming data make good use cases for
NoSQL…
NoSQL Case Study
Gnip takes in huge volumes of social media data at high rates of
speed (e.g. 20,000 Tweets per second).
Non-RDBMS, Non-ACID transactions?
No
• Standard RDBMS, Nested, ACID transactions required
• Complex, requiring rollbacks, savepoints, etc., needed
Yes
• “Big Data” transactions OK or are necessary
• Atomic, Isolated, Durable (AID), but eventual or tunable consistency
allowed
Key takeaway: NoSQL databases do transactions, but since they don‟t
support joins or foreign keys, consistency conforms to the CAP theorem vs.
RDBMS ACID styled consistency…
NoSQL Case Study
eBay does transactions, but does not want overhead of RDBMS
ACID-type transactions.
Cassandra and Transactions
Individual or batch transactions with AID and tunable consistency.
Must Keep Large Data Volumes Online?
No
• No application requirement to keep large volumes of data
• System typically purges data older than certain time period
Yes
• Must keep large volumes of data online and available to customers
• Retain both hot and cold data
Key takeaway: Some NoSQL databases like Cassandra can excel over
typical RDBMS‟s when it comes to maintaining large volumes of data online
and meeting stringent performance SLA‟s …
NoSQL Case Study
Easou is the #1 mobile search firm in China. One of their Cassandra
applications stores online video images for retrieval / viewing and is
300TB in size.
Continuous Uptime Necessary?
No
• Applications have no need for constant uptime
• Unplanned downtime can be handled via traditional failover
Yes
• Applications cannot tolerate any downtime
• Standard log shipping, failover, hot backups, won’t do
Key takeaway: Some NoSQL databases like Cassandra are able to
guarantee no downtime because of their architectures…
NoSQL Case Study
Netflix systems are run in the cloud across multiple availability zones
with Cassandra and sport constant uptime.
NoSQL Case Study
Commenting on Amazon outage in Oct 2012: “We configure all our clusters
to use a replication factor of three, with each replica located in a different
Availability Zone. This allowed Cassandra to handle the outage remarkably
well. When a single zone became unavailable, we didn't need to do
anything. Cassandra routed requests around the unavailable zone and when
it recovered, the ring was repaired.”
- Netflix Tech Blog
Wide-Scale Data Distribution Needed?
No
• Application’s data needs are single site only
• No need to distribute data in other locales for any reason
Yes
• Application serves customers in multiple locations
• Data is distributed across multiple data centers / cloud zones for
latency/performance or disaster recover reasons
Key takeaway: Cassandra is the gold standard among NoSQL databases for
multi-data center, data distribution use cases…
NoSQL Case Study
Rightscale keeps its customers in contact with each other all over the
world via Cassandra clusters in 5+ global data centers.
Need to Integrate Different Systems?
No
• Applications use siloed databases
• No need for different data systems to interact with each other
Yes
• Application has different database workloads
• Multiple data domains serve single application
Key takeaway: ETL and simple connectors oftentimes do not do the job.
Instead, what‟s needed is something like DataStax Enterprise, which
provides one database that serves multiple database workloads…
NoSQL Case Study
Datafiniti, which is a search engine for data, needs to consume lots
of data in real time and provide fast search on top of the same data.
Cost a Factor?
No
• Application is small and not cost intensive to operate
• Software license costs not a factor
Yes
• Large scale business applications
• Traditional RDBMS software costs a significant concern
Key takeaway: NoSQL databases costs can oftentimes be 70-80% less than
legacy RDBMS software. Further large operations staff are not required to
manage NoSQL systems.
NoSQL Case Study
Constant Contact found that scaling out with NoSQL vs. an RDBMS
saved them 90% in software costs, and was implemented in 1/3 the
time...
What Strategies Can I Use To Implement
NoSQL in my Business?
©2013 DataStax Confidential. Do not distribute without consent.
NoSQL Implementation Strategies
New Hybrid Replacement
• New big data
applications
• Legacy systems keep
old databases
• NoSQL database
used for heavy lifting /
big data management
• Legacy RDBMS
maintains smaller
parts of database
• Legacy RDBMS
cannot meet
demands of new or
evolving big data
system
• Data models and data
are migrated
DataStax Enterprise – NoSQL for the Enterprise
DataStax Enterprise is a complete big data platform, built on Cassandra, that
is architected to manage real-time, analytic, and enterprise search data all
in the same database cluster.
What You Get With DataStax Enterprise
1. DataStax Enterprise
Database Server
1. OpsCenter Enterprise
Management solution
1. Expert 24x7 support
Use Cases Handled By DataStax Enterprise
Managed by Cassandra Managed by Hadoop Managed by Solr
• Time series data
• Device/Sensor/Data
“exhaust” systems
• Distributed applications
• Media streaming
• Online Web retail
(transactional, shopping
carts, etc.)
• Real-time data analytics
• Social media capture and
analysis
• Web click-stream analysis
• Write-intensive transactional
systems
• Buyer behavior analytics
• Compliance/regulatory
analysis
• Customer
recommendation output
• Fraud detection
• Risk analysis
• Sales program
campaign analysis
• Supply chain analytics
• Batch Web clickstream
analysis
• General Web search
• Web retail faceted
(categorization) search
• Search/hit prioritization
and highlighting
• Application log search and
analysis
• Document (PDF, MS
Word, etc.) search and
analysis
• Geospatial search
• Real estate location and
property search
• Social media match ups
Next Steps
Download DataStax Enterprise and try it in your own
environment.
• Go to www.datastax.com/download
• Download a copy of DataStax
Enterprise
• Installs and configures in minutes
• Completely free for development use;
subscription required for production
deployments
For More Information
Thank You
We power the big data applications
that transform business.
©2013 DataStax Confidential. Do not distribute without consent.

Contenu connexe

Tendances

Webinar: Get On-Demand Education Anytime, Anywhere with Coursera and DataStax
Webinar: Get On-Demand Education Anytime, Anywhere with Coursera and DataStaxWebinar: Get On-Demand Education Anytime, Anywhere with Coursera and DataStax
Webinar: Get On-Demand Education Anytime, Anywhere with Coursera and DataStax
DataStax
 

Tendances (20)

Webinar | Real-time Analytics for Healthcare: How Amara Turned Big Data into ...
Webinar | Real-time Analytics for Healthcare: How Amara Turned Big Data into ...Webinar | Real-time Analytics for Healthcare: How Amara Turned Big Data into ...
Webinar | Real-time Analytics for Healthcare: How Amara Turned Big Data into ...
 
Webinar | From Zero to 1 Million with Google Cloud Platform and DataStax
Webinar | From Zero to 1 Million with Google Cloud Platform and DataStaxWebinar | From Zero to 1 Million with Google Cloud Platform and DataStax
Webinar | From Zero to 1 Million with Google Cloud Platform and DataStax
 
DataStax Training – Everything you need to become a Cassandra Rockstar
DataStax Training – Everything you need to become a Cassandra RockstarDataStax Training – Everything you need to become a Cassandra Rockstar
DataStax Training – Everything you need to become a Cassandra Rockstar
 
Webinar: Get On-Demand Education Anytime, Anywhere with Coursera and DataStax
Webinar: Get On-Demand Education Anytime, Anywhere with Coursera and DataStaxWebinar: Get On-Demand Education Anytime, Anywhere with Coursera and DataStax
Webinar: Get On-Demand Education Anytime, Anywhere with Coursera and DataStax
 
Webinar - Macy’s: Why Your Database Decision Directly Impacts Customer Experi...
Webinar - Macy’s: Why Your Database Decision Directly Impacts Customer Experi...Webinar - Macy’s: Why Your Database Decision Directly Impacts Customer Experi...
Webinar - Macy’s: Why Your Database Decision Directly Impacts Customer Experi...
 
Don’t Get Caught in a PCI Pickle: Meet Compliance and Protect Payment Card Da...
Don’t Get Caught in a PCI Pickle: Meet Compliance and Protect Payment Card Da...Don’t Get Caught in a PCI Pickle: Meet Compliance and Protect Payment Card Da...
Don’t Get Caught in a PCI Pickle: Meet Compliance and Protect Payment Card Da...
 
Building a Digital Bank
Building a Digital BankBuilding a Digital Bank
Building a Digital Bank
 
Transforms Document Management at Scale with Distributed Database Solution wi...
Transforms Document Management at Scale with Distributed Database Solution wi...Transforms Document Management at Scale with Distributed Database Solution wi...
Transforms Document Management at Scale with Distributed Database Solution wi...
 
Webinar: Buckle Up: The Future of the Distributed Database is Here - DataStax...
Webinar: Buckle Up: The Future of the Distributed Database is Here - DataStax...Webinar: Buckle Up: The Future of the Distributed Database is Here - DataStax...
Webinar: Buckle Up: The Future of the Distributed Database is Here - DataStax...
 
How to Successfully Visualize DSE Graph data
How to Successfully Visualize DSE Graph dataHow to Successfully Visualize DSE Graph data
How to Successfully Visualize DSE Graph data
 
Webinar: Transforming Customer Experience Through an Always-On Data Platform
Webinar: Transforming Customer Experience Through an Always-On Data PlatformWebinar: Transforming Customer Experience Through an Always-On Data Platform
Webinar: Transforming Customer Experience Through an Always-On Data Platform
 
Top 5 Considerations for a Big Data Solution
Top 5 Considerations for a Big Data SolutionTop 5 Considerations for a Big Data Solution
Top 5 Considerations for a Big Data Solution
 
Webinar: Become PSD2 ready with DataStax
Webinar: Become PSD2 ready with DataStaxWebinar: Become PSD2 ready with DataStax
Webinar: Become PSD2 ready with DataStax
 
Webinar - DataStax Enterprise 5.1: 3X the operational analytics speed, help f...
Webinar - DataStax Enterprise 5.1: 3X the operational analytics speed, help f...Webinar - DataStax Enterprise 5.1: 3X the operational analytics speed, help f...
Webinar - DataStax Enterprise 5.1: 3X the operational analytics speed, help f...
 
Introduction: Architecting for Scale
Introduction: Architecting for ScaleIntroduction: Architecting for Scale
Introduction: Architecting for Scale
 
Webinar - Delivering Enhanced Message Processing at Scale With an Always-on D...
Webinar - Delivering Enhanced Message Processing at Scale With an Always-on D...Webinar - Delivering Enhanced Message Processing at Scale With an Always-on D...
Webinar - Delivering Enhanced Message Processing at Scale With an Always-on D...
 
Building a scalable analytics environment to support diverse workloads
Building a scalable analytics environment to support diverse workloadsBuilding a scalable analytics environment to support diverse workloads
Building a scalable analytics environment to support diverse workloads
 
Cortana Analytics Workshop: The "Big Data" of the Cortana Analytics Suite, Pa...
Cortana Analytics Workshop: The "Big Data" of the Cortana Analytics Suite, Pa...Cortana Analytics Workshop: The "Big Data" of the Cortana Analytics Suite, Pa...
Cortana Analytics Workshop: The "Big Data" of the Cortana Analytics Suite, Pa...
 
How DataStax Enterprise and Azure Make Your Apps Scale from Day 1
How DataStax Enterprise and Azure Make Your Apps Scale from Day 1How DataStax Enterprise and Azure Make Your Apps Scale from Day 1
How DataStax Enterprise and Azure Make Your Apps Scale from Day 1
 
Announcing Spark Driver for Cassandra
Announcing Spark Driver for CassandraAnnouncing Spark Driver for Cassandra
Announcing Spark Driver for Cassandra
 

En vedette

En vedette (18)

Cassandra Community Webinar: From Mongo to Cassandra, Architectural Lessons
Cassandra Community Webinar: From Mongo to Cassandra, Architectural LessonsCassandra Community Webinar: From Mongo to Cassandra, Architectural Lessons
Cassandra Community Webinar: From Mongo to Cassandra, Architectural Lessons
 
Webinar: Eventual Consistency != Hopeful Consistency
Webinar: Eventual Consistency != Hopeful ConsistencyWebinar: Eventual Consistency != Hopeful Consistency
Webinar: Eventual Consistency != Hopeful Consistency
 
Cassandra Community Webinar: Back to Basics with CQL3
Cassandra Community Webinar: Back to Basics with CQL3Cassandra Community Webinar: Back to Basics with CQL3
Cassandra Community Webinar: Back to Basics with CQL3
 
Webinar: Dyn + DataStax - helping companies deliver exceptional end-user expe...
Webinar: Dyn + DataStax - helping companies deliver exceptional end-user expe...Webinar: Dyn + DataStax - helping companies deliver exceptional end-user expe...
Webinar: Dyn + DataStax - helping companies deliver exceptional end-user expe...
 
Webinar | How Clear Capital Delivers Always-on Appraisals on 122 Million Prop...
Webinar | How Clear Capital Delivers Always-on Appraisals on 122 Million Prop...Webinar | How Clear Capital Delivers Always-on Appraisals on 122 Million Prop...
Webinar | How Clear Capital Delivers Always-on Appraisals on 122 Million Prop...
 
Cassandra Community Webinar | In Case of Emergency Break Glass
Cassandra Community Webinar | In Case of Emergency Break GlassCassandra Community Webinar | In Case of Emergency Break Glass
Cassandra Community Webinar | In Case of Emergency Break Glass
 
Webinar | Introducing DataStax Enterprise 4.6
Webinar | Introducing DataStax Enterprise 4.6Webinar | Introducing DataStax Enterprise 4.6
Webinar | Introducing DataStax Enterprise 4.6
 
Webinar: Getting Started with Apache Cassandra
Webinar: Getting Started with Apache CassandraWebinar: Getting Started with Apache Cassandra
Webinar: Getting Started with Apache Cassandra
 
Cassandra Community Webinar | Practice Makes Perfect: Extreme Cassandra Optim...
Cassandra Community Webinar | Practice Makes Perfect: Extreme Cassandra Optim...Cassandra Community Webinar | Practice Makes Perfect: Extreme Cassandra Optim...
Cassandra Community Webinar | Practice Makes Perfect: Extreme Cassandra Optim...
 
Cassandra TK 2014 - Large Nodes
Cassandra TK 2014 - Large NodesCassandra TK 2014 - Large Nodes
Cassandra TK 2014 - Large Nodes
 
Cassandra Community Webinar | Make Life Easier - An Introduction to Cassandra...
Cassandra Community Webinar | Make Life Easier - An Introduction to Cassandra...Cassandra Community Webinar | Make Life Easier - An Introduction to Cassandra...
Cassandra Community Webinar | Make Life Easier - An Introduction to Cassandra...
 
Webinar: Building Blocks for the Future of Television
Webinar: Building Blocks for the Future of TelevisionWebinar: Building Blocks for the Future of Television
Webinar: Building Blocks for the Future of Television
 
Webinar: DataStax Training - Everything you need to become a Cassandra Rockstar
Webinar: DataStax Training - Everything you need to become a Cassandra RockstarWebinar: DataStax Training - Everything you need to become a Cassandra Rockstar
Webinar: DataStax Training - Everything you need to become a Cassandra Rockstar
 
Webinar: Diagnosing Apache Cassandra Problems in Production
Webinar: Diagnosing Apache Cassandra Problems in ProductionWebinar: Diagnosing Apache Cassandra Problems in Production
Webinar: Diagnosing Apache Cassandra Problems in Production
 
Cassandra Community Webinar | Become a Super Modeler
Cassandra Community Webinar | Become a Super ModelerCassandra Community Webinar | Become a Super Modeler
Cassandra Community Webinar | Become a Super Modeler
 
Cassandra Community Webinar: Apache Cassandra Internals
Cassandra Community Webinar: Apache Cassandra InternalsCassandra Community Webinar: Apache Cassandra Internals
Cassandra Community Webinar: Apache Cassandra Internals
 
Community Webinar: 15 Commandments of Cassandra DBAs
Community Webinar: 15 Commandments of Cassandra DBAsCommunity Webinar: 15 Commandments of Cassandra DBAs
Community Webinar: 15 Commandments of Cassandra DBAs
 
Cassandra Community Webinar | The World's Next Top Data Model
Cassandra Community Webinar | The World's Next Top Data ModelCassandra Community Webinar | The World's Next Top Data Model
Cassandra Community Webinar | The World's Next Top Data Model
 

Similaire à How To Tell if Your Business Needs NoSQL

NoSQLDatabases
NoSQLDatabasesNoSQLDatabases
NoSQLDatabases
Adi Challa
 
Engineering Machine Learning Data Pipelines Series: Streaming New Data as It ...
Engineering Machine Learning Data Pipelines Series: Streaming New Data as It ...Engineering Machine Learning Data Pipelines Series: Streaming New Data as It ...
Engineering Machine Learning Data Pipelines Series: Streaming New Data as It ...
Precisely
 
How to Choose a Host for a Big Data Project
How to Choose a Host for a Big Data ProjectHow to Choose a Host for a Big Data Project
How to Choose a Host for a Big Data Project
Peak Hosting
 
Nosql-Module 1 PPT.pptx
Nosql-Module 1 PPT.pptxNosql-Module 1 PPT.pptx
Nosql-Module 1 PPT.pptx
Radhika R
 

Similaire à How To Tell if Your Business Needs NoSQL (20)

NoSQLDatabases
NoSQLDatabasesNoSQLDatabases
NoSQLDatabases
 
Horses for Courses: Database Roundtable
Horses for Courses: Database RoundtableHorses for Courses: Database Roundtable
Horses for Courses: Database Roundtable
 
Designing a modern data warehouse in azure
Designing a modern data warehouse in azure   Designing a modern data warehouse in azure
Designing a modern data warehouse in azure
 
Designing a modern data warehouse in azure
Designing a modern data warehouse in azure   Designing a modern data warehouse in azure
Designing a modern data warehouse in azure
 
A beginners guide to Cloudera Hadoop
A beginners guide to Cloudera HadoopA beginners guide to Cloudera Hadoop
A beginners guide to Cloudera Hadoop
 
Cloud-Native Data: What data questions to ask when building cloud-native apps
Cloud-Native Data: What data questions to ask when building cloud-native appsCloud-Native Data: What data questions to ask when building cloud-native apps
Cloud-Native Data: What data questions to ask when building cloud-native apps
 
Introduction to NoSQL database technology
Introduction to NoSQL database technologyIntroduction to NoSQL database technology
Introduction to NoSQL database technology
 
IBM Relay 2015: Open for Data
IBM Relay 2015: Open for Data IBM Relay 2015: Open for Data
IBM Relay 2015: Open for Data
 
Engineering Machine Learning Data Pipelines Series: Streaming New Data as It ...
Engineering Machine Learning Data Pipelines Series: Streaming New Data as It ...Engineering Machine Learning Data Pipelines Series: Streaming New Data as It ...
Engineering Machine Learning Data Pipelines Series: Streaming New Data as It ...
 
How to Choose a Host for a Big Data Project
How to Choose a Host for a Big Data ProjectHow to Choose a Host for a Big Data Project
How to Choose a Host for a Big Data Project
 
Key Database Criteria for Cloud Applications
Key Database Criteria for Cloud ApplicationsKey Database Criteria for Cloud Applications
Key Database Criteria for Cloud Applications
 
Webinar How to Achieve True Scalability in SaaS Applications
Webinar How to Achieve True Scalability in SaaS ApplicationsWebinar How to Achieve True Scalability in SaaS Applications
Webinar How to Achieve True Scalability in SaaS Applications
 
Transform your DBMS to drive engagement innovation with Big Data
Transform your DBMS to drive engagement innovation with Big DataTransform your DBMS to drive engagement innovation with Big Data
Transform your DBMS to drive engagement innovation with Big Data
 
Nosql-Module 1 PPT.pptx
Nosql-Module 1 PPT.pptxNosql-Module 1 PPT.pptx
Nosql-Module 1 PPT.pptx
 
Couchbase 3.0.2 d1
Couchbase 3.0.2  d1Couchbase 3.0.2  d1
Couchbase 3.0.2 d1
 
Big Data Boom
Big Data BoomBig Data Boom
Big Data Boom
 
SpringPeople - Introduction to Cloud Computing
SpringPeople - Introduction to Cloud ComputingSpringPeople - Introduction to Cloud Computing
SpringPeople - Introduction to Cloud Computing
 
Oracle big data appliance and solutions
Oracle big data appliance and solutionsOracle big data appliance and solutions
Oracle big data appliance and solutions
 
Introduction to NoSQL
Introduction to NoSQLIntroduction to NoSQL
Introduction to NoSQL
 
NoSQL and Couchbase
NoSQL and CouchbaseNoSQL and Couchbase
NoSQL and Couchbase
 

Plus de DataStax

Plus de DataStax (20)

Is Your Enterprise Ready to Shine This Holiday Season?
Is Your Enterprise Ready to Shine This Holiday Season?Is Your Enterprise Ready to Shine This Holiday Season?
Is Your Enterprise Ready to Shine This Holiday Season?
 
Designing Fault-Tolerant Applications with DataStax Enterprise and Apache Cas...
Designing Fault-Tolerant Applications with DataStax Enterprise and Apache Cas...Designing Fault-Tolerant Applications with DataStax Enterprise and Apache Cas...
Designing Fault-Tolerant Applications with DataStax Enterprise and Apache Cas...
 
Running DataStax Enterprise in VMware Cloud and Hybrid Environments
Running DataStax Enterprise in VMware Cloud and Hybrid EnvironmentsRunning DataStax Enterprise in VMware Cloud and Hybrid Environments
Running DataStax Enterprise in VMware Cloud and Hybrid Environments
 
Best Practices for Getting to Production with DataStax Enterprise Graph
Best Practices for Getting to Production with DataStax Enterprise GraphBest Practices for Getting to Production with DataStax Enterprise Graph
Best Practices for Getting to Production with DataStax Enterprise Graph
 
Webinar | Data Management for Hybrid and Multi-Cloud: A Four-Step Journey
Webinar | Data Management for Hybrid and Multi-Cloud: A Four-Step JourneyWebinar | Data Management for Hybrid and Multi-Cloud: A Four-Step Journey
Webinar | Data Management for Hybrid and Multi-Cloud: A Four-Step Journey
 
Webinar | How to Understand Apache Cassandra™ Performance Through Read/Writ...
Webinar  |  How to Understand Apache Cassandra™ Performance Through Read/Writ...Webinar  |  How to Understand Apache Cassandra™ Performance Through Read/Writ...
Webinar | How to Understand Apache Cassandra™ Performance Through Read/Writ...
 
Webinar | Better Together: Apache Cassandra and Apache Kafka
Webinar  |  Better Together: Apache Cassandra and Apache KafkaWebinar  |  Better Together: Apache Cassandra and Apache Kafka
Webinar | Better Together: Apache Cassandra and Apache Kafka
 
Top 10 Best Practices for Apache Cassandra and DataStax Enterprise
Top 10 Best Practices for Apache Cassandra and DataStax EnterpriseTop 10 Best Practices for Apache Cassandra and DataStax Enterprise
Top 10 Best Practices for Apache Cassandra and DataStax Enterprise
 
Introduction to Apache Cassandra™ + What’s New in 4.0
Introduction to Apache Cassandra™ + What’s New in 4.0Introduction to Apache Cassandra™ + What’s New in 4.0
Introduction to Apache Cassandra™ + What’s New in 4.0
 
Webinar: How Active Everywhere Database Architecture Accelerates Hybrid Cloud...
Webinar: How Active Everywhere Database Architecture Accelerates Hybrid Cloud...Webinar: How Active Everywhere Database Architecture Accelerates Hybrid Cloud...
Webinar: How Active Everywhere Database Architecture Accelerates Hybrid Cloud...
 
Webinar | Aligning GDPR Requirements with Today's Hybrid Cloud Realities
Webinar  |  Aligning GDPR Requirements with Today's Hybrid Cloud RealitiesWebinar  |  Aligning GDPR Requirements with Today's Hybrid Cloud Realities
Webinar | Aligning GDPR Requirements with Today's Hybrid Cloud Realities
 
Designing a Distributed Cloud Database for Dummies
Designing a Distributed Cloud Database for DummiesDesigning a Distributed Cloud Database for Dummies
Designing a Distributed Cloud Database for Dummies
 
How to Power Innovation with Geo-Distributed Data Management in Hybrid Cloud
How to Power Innovation with Geo-Distributed Data Management in Hybrid CloudHow to Power Innovation with Geo-Distributed Data Management in Hybrid Cloud
How to Power Innovation with Geo-Distributed Data Management in Hybrid Cloud
 
How to Evaluate Cloud Databases for eCommerce
How to Evaluate Cloud Databases for eCommerceHow to Evaluate Cloud Databases for eCommerce
How to Evaluate Cloud Databases for eCommerce
 
Webinar: DataStax Enterprise 6: 10 Ways to Multiply the Power of Apache Cassa...
Webinar: DataStax Enterprise 6: 10 Ways to Multiply the Power of Apache Cassa...Webinar: DataStax Enterprise 6: 10 Ways to Multiply the Power of Apache Cassa...
Webinar: DataStax Enterprise 6: 10 Ways to Multiply the Power of Apache Cassa...
 
Webinar: DataStax and Microsoft Azure: Empowering the Right-Now Enterprise wi...
Webinar: DataStax and Microsoft Azure: Empowering the Right-Now Enterprise wi...Webinar: DataStax and Microsoft Azure: Empowering the Right-Now Enterprise wi...
Webinar: DataStax and Microsoft Azure: Empowering the Right-Now Enterprise wi...
 
Webinar - Real-Time Customer Experience for the Right-Now Enterprise featurin...
Webinar - Real-Time Customer Experience for the Right-Now Enterprise featurin...Webinar - Real-Time Customer Experience for the Right-Now Enterprise featurin...
Webinar - Real-Time Customer Experience for the Right-Now Enterprise featurin...
 
Datastax - The Architect's guide to customer experience (CX)
Datastax - The Architect's guide to customer experience (CX)Datastax - The Architect's guide to customer experience (CX)
Datastax - The Architect's guide to customer experience (CX)
 
An Operational Data Layer is Critical for Transformative Banking Applications
An Operational Data Layer is Critical for Transformative Banking ApplicationsAn Operational Data Layer is Critical for Transformative Banking Applications
An Operational Data Layer is Critical for Transformative Banking Applications
 
Becoming a Customer-Centric Enterprise Via Real-Time Data and Design Thinking
Becoming a Customer-Centric Enterprise Via Real-Time Data and Design ThinkingBecoming a Customer-Centric Enterprise Via Real-Time Data and Design Thinking
Becoming a Customer-Centric Enterprise Via Real-Time Data and Design Thinking
 

Dernier

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

Dernier (20)

Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptx
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 

How To Tell if Your Business Needs NoSQL

  • 1. How to Tell if Your Business Needs NoSQL Robin Schumacher VP Products
  • 2. • Founded in April 2010 • The Apache Cassandra™ company • Home to Apache Cassandra Chair & most committers • Cassandra is a massively scalable NoSQL database • Provide enterprise-class big data platform based on Cassandra • 270+ customers • Headquartered in San Francisco Bay area • Funded by prominent venture firms Overview of DataStax
  • 4. Leading in Performance http://techblog.netflix.com/2011/11/benchmarking-cassandra-scalability- on.html Netflix Cloud Benchmark… “In terms of scalability, there is a clear winner throughout our experiments. Cassandra achieves the highest throughput for the maximum number of nodes in all experiments with a linear increasing throughput.” Solving Big Data Challenges for Enterprise Application Performance Management, Tilman Rable, et al., August 2013, p. 10. Benchmark paper presented at the Very Large Database Conference, 2013. http://vldb.org/pvldb/vol5/p1724_tilmannrabl_vldb2013.pdf End Point Independent NoSQL Benchmark Highest in throughput… Lowest in latency…
  • 5. NoSQL Momentum “According to analysis by Wikibon‟s David Floyer (and highlighted in the Wall Street Journal), the NoSQL database market is expected to grow at a compound annual growth rate of nearly 60% between 2011 and 2017. The SQL slice of the Big Data market, in contrast, will grow at just a 26% CAGR during that same time period.”
  • 6. NoSQL Momentum “NoSQL is the stuff of the Internet Age.” - Andrew Oliver, InfoWorld
  • 7. But Does My Business Need NoSQL…? Just because a technology appears to be having strong adoption in the market, that doesn‟t mean it‟s right for your business…
  • 8. What is NoSQL…? • Progressive data management engines • Go beyond legacy relational databases • Flexible data model • Horizontal scalability • Distributed architectures • Use of languages and interfaces that are “not only” SQL
  • 9. NoSQL Example – Apache Cassandra Apache Cassandra is a massively scalable NoSQL database that offers continuous availability and easy data distribution.
  • 10. NoSQL Example – Apache Cassandra “Cassandra stands at the front of the NoSQL pack when it comes to supporting real-time, big data applications.” – Wikibon
  • 11. How Can I Tell if NoSQL Can Help Me Run My Business and Reduce Costs? ©2013 DataStax Confidential. Do not distribute without consent.
  • 12. NoSQL Business Considerations • Need scale-out (vs. scale-up)? • Manage different types of data like social media? • Lots of data coming in (and fast)? • Have non-RDBMS, non-ACID transactions? • Must keep large data volumes online? • Continuous uptime necessary? • Wide-scale data distribution needed? • Need to integrate different systems? • Cost a factor?
  • 13. Need Scale-Out (vs. Scale-Up)? No • Application does not require multiple machines • Can scale-up and meet the application’s current and future needs Yes • Application demands divide-and-conquer • Capacity expansion is best/can only be handled via new machines Key takeaway: If your applications can easily run on one machine, fit all your data in RAM or can easily expand via new cores/more drives to fulfill current and future requirements, you may not need NoSQL…
  • 14. NoSQL Case Study Ooyala distributes and analyzes video content for companies like ESPN, Rolling Stone and others. They track about one quarter of all online video viewers each day and generate 1-2 billion events that are streaming in real-time through their system.
  • 15. Manage Different Types of Data? No • No non-structured data (all or mostly rigid formats) • E.G. No social media data Yes • All types of data (structured, semi, and unstructured) • Social media data Key takeaway: If all your data systems deal with standard RDBMS structured data and that won‟t be changing, then you may not need NoSQL…
  • 16. NoSQL Case Study HealthCare Anytime needs to analyze doctor’s notes and other types of difficult data to properly bill back Medicare / Medicaid.
  • 17. NoSQL Case Study “Cassandra‟s NoSQL data model allows us to insert and query data much more naturally than what we had previously. The analysts who routinely use this data were impressed with the flexibility and speed at which the queries came back.” – CSC/NASA
  • 18. Lots of Data Coming In (and Fast)? No • No high velocity data (e.g. device, sensors, web streaming, etc.) • No multiple locations • Little/no concern about write speed Yes • High velocity, write intensive • Multiple locations sending data • Must consume data as quickly as possible Key takeaway: Business applications involving rapid time series data, device „exhaust‟, web or financial streaming data make good use cases for NoSQL…
  • 19. NoSQL Case Study Gnip takes in huge volumes of social media data at high rates of speed (e.g. 20,000 Tweets per second).
  • 20. Non-RDBMS, Non-ACID transactions? No • Standard RDBMS, Nested, ACID transactions required • Complex, requiring rollbacks, savepoints, etc., needed Yes • “Big Data” transactions OK or are necessary • Atomic, Isolated, Durable (AID), but eventual or tunable consistency allowed Key takeaway: NoSQL databases do transactions, but since they don‟t support joins or foreign keys, consistency conforms to the CAP theorem vs. RDBMS ACID styled consistency…
  • 21. NoSQL Case Study eBay does transactions, but does not want overhead of RDBMS ACID-type transactions.
  • 22. Cassandra and Transactions Individual or batch transactions with AID and tunable consistency.
  • 23. Must Keep Large Data Volumes Online? No • No application requirement to keep large volumes of data • System typically purges data older than certain time period Yes • Must keep large volumes of data online and available to customers • Retain both hot and cold data Key takeaway: Some NoSQL databases like Cassandra can excel over typical RDBMS‟s when it comes to maintaining large volumes of data online and meeting stringent performance SLA‟s …
  • 24. NoSQL Case Study Easou is the #1 mobile search firm in China. One of their Cassandra applications stores online video images for retrieval / viewing and is 300TB in size.
  • 25. Continuous Uptime Necessary? No • Applications have no need for constant uptime • Unplanned downtime can be handled via traditional failover Yes • Applications cannot tolerate any downtime • Standard log shipping, failover, hot backups, won’t do Key takeaway: Some NoSQL databases like Cassandra are able to guarantee no downtime because of their architectures…
  • 26. NoSQL Case Study Netflix systems are run in the cloud across multiple availability zones with Cassandra and sport constant uptime.
  • 27. NoSQL Case Study Commenting on Amazon outage in Oct 2012: “We configure all our clusters to use a replication factor of three, with each replica located in a different Availability Zone. This allowed Cassandra to handle the outage remarkably well. When a single zone became unavailable, we didn't need to do anything. Cassandra routed requests around the unavailable zone and when it recovered, the ring was repaired.” - Netflix Tech Blog
  • 28. Wide-Scale Data Distribution Needed? No • Application’s data needs are single site only • No need to distribute data in other locales for any reason Yes • Application serves customers in multiple locations • Data is distributed across multiple data centers / cloud zones for latency/performance or disaster recover reasons Key takeaway: Cassandra is the gold standard among NoSQL databases for multi-data center, data distribution use cases…
  • 29. NoSQL Case Study Rightscale keeps its customers in contact with each other all over the world via Cassandra clusters in 5+ global data centers.
  • 30. Need to Integrate Different Systems? No • Applications use siloed databases • No need for different data systems to interact with each other Yes • Application has different database workloads • Multiple data domains serve single application Key takeaway: ETL and simple connectors oftentimes do not do the job. Instead, what‟s needed is something like DataStax Enterprise, which provides one database that serves multiple database workloads…
  • 31. NoSQL Case Study Datafiniti, which is a search engine for data, needs to consume lots of data in real time and provide fast search on top of the same data.
  • 32. Cost a Factor? No • Application is small and not cost intensive to operate • Software license costs not a factor Yes • Large scale business applications • Traditional RDBMS software costs a significant concern Key takeaway: NoSQL databases costs can oftentimes be 70-80% less than legacy RDBMS software. Further large operations staff are not required to manage NoSQL systems.
  • 33. NoSQL Case Study Constant Contact found that scaling out with NoSQL vs. an RDBMS saved them 90% in software costs, and was implemented in 1/3 the time...
  • 34. What Strategies Can I Use To Implement NoSQL in my Business? ©2013 DataStax Confidential. Do not distribute without consent.
  • 35. NoSQL Implementation Strategies New Hybrid Replacement • New big data applications • Legacy systems keep old databases • NoSQL database used for heavy lifting / big data management • Legacy RDBMS maintains smaller parts of database • Legacy RDBMS cannot meet demands of new or evolving big data system • Data models and data are migrated
  • 36. DataStax Enterprise – NoSQL for the Enterprise DataStax Enterprise is a complete big data platform, built on Cassandra, that is architected to manage real-time, analytic, and enterprise search data all in the same database cluster.
  • 37. What You Get With DataStax Enterprise 1. DataStax Enterprise Database Server 1. OpsCenter Enterprise Management solution 1. Expert 24x7 support
  • 38. Use Cases Handled By DataStax Enterprise Managed by Cassandra Managed by Hadoop Managed by Solr • Time series data • Device/Sensor/Data “exhaust” systems • Distributed applications • Media streaming • Online Web retail (transactional, shopping carts, etc.) • Real-time data analytics • Social media capture and analysis • Web click-stream analysis • Write-intensive transactional systems • Buyer behavior analytics • Compliance/regulatory analysis • Customer recommendation output • Fraud detection • Risk analysis • Sales program campaign analysis • Supply chain analytics • Batch Web clickstream analysis • General Web search • Web retail faceted (categorization) search • Search/hit prioritization and highlighting • Application log search and analysis • Document (PDF, MS Word, etc.) search and analysis • Geospatial search • Real estate location and property search • Social media match ups
  • 39. Next Steps Download DataStax Enterprise and try it in your own environment. • Go to www.datastax.com/download • Download a copy of DataStax Enterprise • Installs and configures in minutes • Completely free for development use; subscription required for production deployments
  • 41. Thank You We power the big data applications that transform business. ©2013 DataStax Confidential. Do not distribute without consent.

Notes de l'éditeur

  1. http://techblog.netflix.com/2012/10/post-mortem-of-october-222012-aws.html