If you want to stay up to date, subscribe to our newsletter here: https://bit.ly/3tiw1I8
A presentation about the strong competition between open-source vendors and public cloud providers in the Big Data landscape.
5. With or without cloud
● Amazon Elastic MapReduce launched in April 2009
● Most of the companies started on-premise
○ Y!, FB, Twitter, LI, Allegro, ING
● Several companies were happy with the cloud
○ Netflix, Base CRM, Wikia
● A few companies went back and forth
○ Spotify, Foursquare, Synerise
13. Open-source responses
● Self-defense licenses
○ Confluent (Kafka), MongoDB, Redis
● Own cloud-based products
○ Databricks (Spark), Confluent (Kafka), Cloudera
(Hadoop)
● Cloud-first approach
○ That code is released first to their cloud offering
14. Current trends
1. More companies start using or evaluating the
cloud
2. Companies usually avoid or sometimes accept
vendor lock-in
3. Open-source projects become cloud-first and
cloud-ready
4. Kubernetes provides a portability layer
16. Native open-source
services on Google
Cloud Platform
● a seamless sign-up
experience
● an integrated billing
● the 1st
line support
provided directly by
Google Cloud
22. Hybrid
environment
(test and prod)
● Real-time user journey
analytics and interactions
○ Kafka, Flink, Hive,
Docker
● Development in the cloud
● Production on=premise