SlideShare une entreprise Scribd logo
1  sur  4
Télécharger pour lire hors ligne
A couple of days ago I came across the article "Mapping AWS, Google Cloud, Azure Services to Big
Data Warehouse Architecture" here. I do know a bit about data warehousing, and even big data
warehouse architecture. However, what interests me is actually a "map of various cloud services
against the big data warehouse architecture". More precisely, cloud services from "the three most
popular cloud platforms: Microsoft Azure, Google Cloud Platform, and Amazon AWS" are mapped to
their open source origination and/or counterparts. As a technical IBMer, my primary area is Big Data &
Advanced Analytics, but I happen to know a little about the IBM Bluemix platform. So for (more)
completeness, here it comes Bluemix! - Note though, here only Bluemix services involved in big data
warehouse architecture are listed. To explore more, see Bluemix website.
Disclaimer
1.While I'm employed by IBM this article represents completely my personal viewpoints.
Furthermore, I've tried my best but still I can't guarantee the 100% completeness, accuracy,
and/or potential services changes.
2.The original author of the article aforementioned own(s) the copyright and by no means I'm
modifying the content. Neither do I agree nor disagree with the author on the content. However,
for convenience, I'm putting the original table (or map) along with their IBM Bluemix
counterparts side by side.
PS, Due to space limitation, all the open source stuff in the Bluemix column refers to the cloud service
provisioned by IBM Bluemix rather than the original open source software, e.g., HDFS/Hadoop/Hive,
etc. means the individual component within BigInsights for Apache Hadoop or BigInsights for Apache
Hadoop (Subscription) service and PostgreSQL refers to ElephantSQL and/or Compose for
PostgreSQL service.
Open Source Amazon AWS Microsoft Azure Google Cloud IBM Bluemix
Batch Ingest
Sqoop
File Transfer
Flume
StreamSets
AWS Data Transfer
Services (various
options)
Import/Export
Service
Data Factory
Cloud DataFlow
Sqoop
File Transfer
Lift (Aspera)
Flume
Various services
Streaming Ingest
Flume
StreamSets
Amazon Kinesis
Firehose
Event Hubs
IOT Hub
Cloud DataFlow
Flume, Spark
Streaming Analytics
Persistent
Storage
HDFS
RDBMS
S3, Glacier
RDS
Storage Blob
HDFS
SQL Database
Persistent Disk
Google Cloud
Storage
Cloud SQL
HDFS
RDBMS (IBM
Proprietary: Db2,
dashDB, Informix ...
open source: MySQL,
PostgreSQL ...
NoSQL: MongoDB,
Redis, Cloudant ...
Block Storage, Cloud
Object Storage, File
Storage, CDN, etc.
Transient Storage Kafka Kinesis
Event Hubs
IOT Hub
HDInsight (Kafka)
Cloud Pub/Sub
Cloud IoT Core
Kafka, Message Hub
Batch Processing
Hive
Flink, Spark
MapReduce
PostgreSQL
EMR Spark
EMR Hadoop
EMR Presto
AWS Batch
Redshift
Azure Batch
HDInisght
(Spark/Map Reduce)
SQL Data
Warehouse
Data Lake Analytics
Cloud Dataflow
(open source
Apache Beam)
Cloud DataProc
(Spark, Hadoop)
Hive, Spark,
MapReduce, MySQL,
PostgreSQL
Db2, Information Server
on Cloud, etc.
Stream
Processing
Flink
Spark
Beam
Amazon Kinesis
Streams
Amazon Kinesis
Analytics
EMR Spark
Stream Analytics
HDInsight (Storm,
Spark)
Cloud Dataflow
(open source
Apache Beam)
DataProc (Spark,
Hadoop)
Spark
Streaming Analytics
Machine
Learning
Scikit
Tensorflow
Spark MLLib
Lex
Polly
Recognition
Azure ML
Cognitive Services
Natural
Language
SpeechTranslati
Data Science
Experience (includes
TensorFlow
etc.
Huge number
of libraries
Amazon Machine
Learning
on
Vision
Video
ML Engine
support for R, Python
with scikit, TensorFlow,
Spark with MLLib, etc.)
Watson Machine
Learning
Serving Storage
Graph
JanusGraph
N/A Marketplace
Only, e.g. OrientDB
N/A Marketplace
only, e.g OrientDB
N/A IBM Graph
Serving Storage
BI/EDW
Impala +
Kudu
Redshift
Athena
SQL Data
Warehouse
BigQuery
Db2 for Warehouse
BigSQL
Serving Storage
Search (keywords
+ facets)
Solr
Amazon
CloudSearch
Amazon
Elasticsearch
Azure Search
N/A
Marketplace,
e.g. Solr
Solr, Compose for
ElasticSearch
Serving Storage
RDBMS
PostgreSQL RDS SQL DB Cloud SQL
IBM Proprietary: Db2,
dashDB, Informix ...
and open source:
MySQL, PostgreSQL ...
Serving Storage
NoSQL
HBase DynamoDB
HDInsight (HBase)
CosmosDB
BigTable
Spanner
DataStore
NoSQL: HBase,
MongoDB, Redis,
Cloudant, Redis ...
Sandboxes
Notebook
Zeppelin EMR Zeppelin Azure Notebooks Cloud Datalab
Data Science
Experience (Juypter)
Spark
Sandboxes Data
Science or
Preparation
Platform
Dataiku DSS
Community
Edition (not
open source)
N/A Marketplace
only, e.g. Dataiku
DSS
N/A Marketplace
only, e.g. Dataiku
DSS
Cloud DataPrep
(beta). Under the
hood this is
Trifacta.
Data Science
Experience
Clients/Data
Apps
Superset (BI) Quicksight PowerBI
Google Data
Studio
Data Science
Experience
Watson Machine
Learning
Decision Optimization
Orchestration Airflow AWS Data Pipeline Data Factory
N/A
Marketplace
Workload Scheduler (?)
ETL Tool N/A AWS Glue (beta) Data Factory N/A
Marketplace
Data Connect
Information Server on
Cloud
MDM Hub N/A N/A Marketplace N/A Marketplace
N/A
Marketplace
MDM on Cloud
Lineage N/A AWS Glue (beta) N/A N/A
Information Server on
Cloud
Catalog N/A AWS Glue (beta) Data Catalog
N/A
Marketplace
Information Server on
Cloud

Contenu connexe

Dernier

VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...SUHANI PANDEY
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxolyaivanovalion
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Researchmichael115558
 
Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramMoniSankarHazra
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightDelhi Call girls
 
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...amitlee9823
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxolyaivanovalion
 
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceBDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceDelhi Call girls
 
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...amitlee9823
 
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...amitlee9823
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...amitlee9823
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...ZurliaSoop
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFxolyaivanovalion
 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxolyaivanovalion
 
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfadriantubila
 
ELKO dropshipping via API with DroFx.pptx
ELKO dropshipping via API with DroFx.pptxELKO dropshipping via API with DroFx.pptx
ELKO dropshipping via API with DroFx.pptxolyaivanovalion
 

Dernier (20)

Sampling (random) method and Non random.ppt
Sampling (random) method and Non random.pptSampling (random) method and Non random.ppt
Sampling (random) method and Non random.ppt
 
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptx
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Research
 
Predicting Loan Approval: A Data Science Project
Predicting Loan Approval: A Data Science ProjectPredicting Loan Approval: A Data Science Project
Predicting Loan Approval: A Data Science Project
 
Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics Program
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
 
Anomaly detection and data imputation within time series
Anomaly detection and data imputation within time seriesAnomaly detection and data imputation within time series
Anomaly detection and data imputation within time series
 
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptx
 
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceBDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
 
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
 
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFx
 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFx
 
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
 
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
 
ELKO dropshipping via API with DroFx.pptx
ELKO dropshipping via API with DroFx.pptxELKO dropshipping via API with DroFx.pptx
ELKO dropshipping via API with DroFx.pptx
 

En vedette

2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by HubspotMarius Sescu
 
Everything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTEverything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTExpeed Software
 
Product Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsProduct Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsPixeldarts
 
How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthThinkNow
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfmarketingartwork
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024Neil Kimberley
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)contently
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024Albert Qian
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsKurio // The Social Media Age(ncy)
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Search Engine Journal
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summarySpeakerHub
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next Tessa Mero
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentLily Ray
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best PracticesVit Horky
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project managementMindGenius
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...RachelPearson36
 

En vedette (20)

2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot
 
Everything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTEverything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPT
 
Product Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsProduct Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage Engineerings
 
How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental Health
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
 
Skeleton Culture Code
Skeleton Culture CodeSkeleton Culture Code
Skeleton Culture Code
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
 

Cloud platform aws-gcp-azure-bluemix

  • 1. A couple of days ago I came across the article "Mapping AWS, Google Cloud, Azure Services to Big Data Warehouse Architecture" here. I do know a bit about data warehousing, and even big data warehouse architecture. However, what interests me is actually a "map of various cloud services against the big data warehouse architecture". More precisely, cloud services from "the three most popular cloud platforms: Microsoft Azure, Google Cloud Platform, and Amazon AWS" are mapped to their open source origination and/or counterparts. As a technical IBMer, my primary area is Big Data & Advanced Analytics, but I happen to know a little about the IBM Bluemix platform. So for (more) completeness, here it comes Bluemix! - Note though, here only Bluemix services involved in big data warehouse architecture are listed. To explore more, see Bluemix website. Disclaimer 1.While I'm employed by IBM this article represents completely my personal viewpoints. Furthermore, I've tried my best but still I can't guarantee the 100% completeness, accuracy, and/or potential services changes. 2.The original author of the article aforementioned own(s) the copyright and by no means I'm modifying the content. Neither do I agree nor disagree with the author on the content. However, for convenience, I'm putting the original table (or map) along with their IBM Bluemix counterparts side by side. PS, Due to space limitation, all the open source stuff in the Bluemix column refers to the cloud service provisioned by IBM Bluemix rather than the original open source software, e.g., HDFS/Hadoop/Hive, etc. means the individual component within BigInsights for Apache Hadoop or BigInsights for Apache Hadoop (Subscription) service and PostgreSQL refers to ElephantSQL and/or Compose for PostgreSQL service.
  • 2. Open Source Amazon AWS Microsoft Azure Google Cloud IBM Bluemix Batch Ingest Sqoop File Transfer Flume StreamSets AWS Data Transfer Services (various options) Import/Export Service Data Factory Cloud DataFlow Sqoop File Transfer Lift (Aspera) Flume Various services Streaming Ingest Flume StreamSets Amazon Kinesis Firehose Event Hubs IOT Hub Cloud DataFlow Flume, Spark Streaming Analytics Persistent Storage HDFS RDBMS S3, Glacier RDS Storage Blob HDFS SQL Database Persistent Disk Google Cloud Storage Cloud SQL HDFS RDBMS (IBM Proprietary: Db2, dashDB, Informix ... open source: MySQL, PostgreSQL ... NoSQL: MongoDB, Redis, Cloudant ... Block Storage, Cloud Object Storage, File Storage, CDN, etc. Transient Storage Kafka Kinesis Event Hubs IOT Hub HDInsight (Kafka) Cloud Pub/Sub Cloud IoT Core Kafka, Message Hub Batch Processing Hive Flink, Spark MapReduce PostgreSQL EMR Spark EMR Hadoop EMR Presto AWS Batch Redshift Azure Batch HDInisght (Spark/Map Reduce) SQL Data Warehouse Data Lake Analytics Cloud Dataflow (open source Apache Beam) Cloud DataProc (Spark, Hadoop) Hive, Spark, MapReduce, MySQL, PostgreSQL Db2, Information Server on Cloud, etc. Stream Processing Flink Spark Beam Amazon Kinesis Streams Amazon Kinesis Analytics EMR Spark Stream Analytics HDInsight (Storm, Spark) Cloud Dataflow (open source Apache Beam) DataProc (Spark, Hadoop) Spark Streaming Analytics Machine Learning Scikit Tensorflow Spark MLLib Lex Polly Recognition Azure ML Cognitive Services Natural Language SpeechTranslati Data Science Experience (includes
  • 3. TensorFlow etc. Huge number of libraries Amazon Machine Learning on Vision Video ML Engine support for R, Python with scikit, TensorFlow, Spark with MLLib, etc.) Watson Machine Learning Serving Storage Graph JanusGraph N/A Marketplace Only, e.g. OrientDB N/A Marketplace only, e.g OrientDB N/A IBM Graph Serving Storage BI/EDW Impala + Kudu Redshift Athena SQL Data Warehouse BigQuery Db2 for Warehouse BigSQL Serving Storage Search (keywords + facets) Solr Amazon CloudSearch Amazon Elasticsearch Azure Search N/A Marketplace, e.g. Solr Solr, Compose for ElasticSearch Serving Storage RDBMS PostgreSQL RDS SQL DB Cloud SQL IBM Proprietary: Db2, dashDB, Informix ... and open source: MySQL, PostgreSQL ... Serving Storage NoSQL HBase DynamoDB HDInsight (HBase) CosmosDB BigTable Spanner DataStore NoSQL: HBase, MongoDB, Redis, Cloudant, Redis ... Sandboxes Notebook Zeppelin EMR Zeppelin Azure Notebooks Cloud Datalab Data Science Experience (Juypter) Spark Sandboxes Data Science or Preparation Platform Dataiku DSS Community Edition (not open source) N/A Marketplace only, e.g. Dataiku DSS N/A Marketplace only, e.g. Dataiku DSS Cloud DataPrep (beta). Under the hood this is Trifacta. Data Science Experience Clients/Data Apps Superset (BI) Quicksight PowerBI Google Data Studio Data Science Experience Watson Machine Learning Decision Optimization Orchestration Airflow AWS Data Pipeline Data Factory N/A Marketplace Workload Scheduler (?) ETL Tool N/A AWS Glue (beta) Data Factory N/A Marketplace Data Connect Information Server on
  • 4. Cloud MDM Hub N/A N/A Marketplace N/A Marketplace N/A Marketplace MDM on Cloud Lineage N/A AWS Glue (beta) N/A N/A Information Server on Cloud Catalog N/A AWS Glue (beta) Data Catalog N/A Marketplace Information Server on Cloud