SlideShare une entreprise Scribd logo
1  sur  34
Télécharger pour lire hors ligne
© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Architecting Data Lake in Cloud
Driving Insights and Adding Intelligence
Ivan Cheng (鄭志帆)
Solutions Architect | AWS
Stanley Huang (黃士展)
Sr. Manager | Trend Micro
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Answer new questions and move faster
Raymond, Head of ProductLim, Head of Finance
Which regions are the new product selling well?
What is the demand forecast by product category?
What is the social sentiment about our products?
How do quality issues impact cost of production?
Can I look at supplier performance over time?
How can we reduce our inventory holding costs?
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
To answer new questions quickly, we look to a
modern data architecture design
Massive upfront costs
Overprovisioned capacity
Long implementation times
Pay as you go, for what you use
Decoupled pipelines and engines
Experimentation platform
Ingest/
Collect
Consume/
visualize
Store Process/
analyze
1 4
0 9
5
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Ingest ServingData
sources
Modern data architecture
Insights to enhance business applications, new digital services
Transactions
ERP
Data analysts
DATA PIPELINES
Ingest/
Collect
Consume /
visualize
Store Process /
analyze
1 4
0 9
5
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Ingest ServingData
sources
Modern data architecture
Insights to enhance business applications, new digital services
Transactions
ERP
DATA PIPELINES
Data
Lake
expdp
Data Data analysts
Data Warehouse
Amazon Redshift
Direct Query
Amazon Athena
She asks for the SMALLEST amount of data to answer her questions.
If it isn’t good enough, she asks for another small slice to be loaded to the DATA LAKE
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Characteristics of a Data Lake
Future
Proof
Flexible
Access
Dive in
Anywhere
Collect
Anything
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Ingest ServingData
sources
Modern data architecture
Insights to enhance business applications, new digital services
DATA PIPELINES
Data
Lake
He first looks to the DATA LAKE, and imports only the category data he needs
He imports JUST ENOUGH data to see if the market is responding to products.
Business users
Transactions
ERP
Social media
Data
Stream
Capture
Amazon
Kinesis
Events
Amazon
QuickSight
Data Warehouse
Amazon Redshift
Stream Data
Amazon
ElasticSearch
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Common data pipeline configuration
Raw Data
Amazon S3
Highly decoupled configurations scale better, are more fault tolerant, and cost optimized
ETL (Hadoop)
Amazon EMR
Triggered Code
AWS Lambda
Staged Data
(Data Lake)
Amazon S3
ETL & Catalog Management
AWS Glue
Data Warehouse
Amazon Redshift
Triggered Code
AWS Lambda
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Ingest ServingData
sources
Modern data architecture
Insights to enhance business applications, new digital services
Transactions
ERP
Data analysts
Business users
DATA PIPELINES
EVENT PIPELINES
Data
Event
Insights
Data
Lake
Social media
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Direct relationship with consumers
Krzysztof, Data ScientistBala, Head of Marketing
What are our customer segments, based on usage?
Can predict user preference?
How are people using the product?
What kinds of people are in/decreasing usage?
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Ingest ServingData
sources
Modern data architecture
Insights to enhance business applications, new digital services
Transactions
Data scientists
Business users
Connected
devices
DATA PIPELINES
EVENT PIPELINES
Data
Event
Insights
Data
Lake
Sandbox
ML / Analytics / DLWeb logs /
clickstream
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Ingest ServingData
sources
Modern data architecture
Innovate for new revenues - personalization and forecasting
Transactions
ERP
Data analysts
Data scientists
Business users
Connected
devices
DATA PIPELINES
EVENT PIPELINES
Data
Event
Insights
Data
Lake
ML / Analytics
Social media
Web logs /
clickstream
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Ingest ServingData
sources
Modern data architecture
Real-time engagement and interactive customer experiences
Transactions
ERP
Data analysts
Data scientists
Business users
Engagement platformsConnected
devices
Automation / events
DATA PIPELINES
EVENT PIPELINES
Data
Event Action
Insights
Data
Lake
ML / Analytics
Predict /
Recommend
AI Services
Social media
Web logs /
clickstream
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Stanley Huang 黃士展
Sr. Manager, Web Service Engineering (WSE), Consumer, R&D, Trend Micro
Data Lake Journey in
Trend Micro Consumer Products
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Founded
RD HQ
Revenue
Customers
1988, United States
Taipei, Taiwan
$1.2B USD
500,000+ businesses,
Millions of consumers
5800+ Employees in 50+ Countries
100% of the
top 10 automotive
companies.
45 of the top 50
global
corporations.
100% of the
top 10 telecom
companies.
100% of the top
10 banks.
100% of the top
10 oil companies.
A world safe for exchanging digital information
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Deep Security
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Data Lake for Consumer Products
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Home Network Security
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Business Requirement
Real-time
Dashboard
Business Metrics Feature
Comparison
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Architecture 1.0
Analyze & ConsumeCatalog & TransformIngest & StoreData
Source
Amazon
Redshift
AmazonKinesis
Streams
Lambda
function
Lambda
function
Lambda
Amazon EMR
Amazon S3
Transformed Data
Amazon S3
Raw Data
application
backend
Tableau
(Data Visualization)
Lambda
Amazon
Pinpoint
mobile app
AmazonKinesis
Streams
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Timely Updated Business Metrics
Dashboard Business Metrics
Original image source: https://www.interaction-design.org/literature/article/google-s-heart-framework-for-measuring-ux
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Feature Comparison
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Frequent Release Cycles of the Products
High Availability Data Quality Auto-recovery
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Build-Measure-Learn Feedback Loop
Now2017, Sep.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Architecture 2.0
Analyze & ConsumeCatalog & TransformIngest & StoreData
Source
Amazon
Redshift
AmazonKinesis
Streams
Lambda
function
Lambda
function
Lambda
Amazon EMR
Amazon S3
Transformed Data
Amazon S3
Raw Data
application
backend
Amazon SQS
Task queue Lambda
function
Tableau
(Data Visualization)
Amazon SNS
Notification
Lambda
Amazon
Pinpoint
mobile app
AmazonKinesis
Streams
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
The Culture of Data Driven
Exploratory
Data Analysis
Self-service
ETL
Machine Learning
Platform
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Architecture 3.0
Analyze & ConsumeCatalog & TransformIngest & StoreData
Source
Amazon
Redshift
Amazon
Pinpoint
AmazonKinesis
Streams
Lambda
function
Lambda
function
Lambda
Amazon EMR
Amazon S3
Transformed Data
Amazon S3
Raw Data
mobile app
Amazon EMR
application
backend
Amazon S3
Enriched
Data
Redshift
Spectrum
Amazon
Athena
Glue Catalog Glue Catalog
Amazon SQS
Task queue Lambda
function
Tableau
(Data Visualization)
Downstream
Service
Amazon SNS
Notification
Lambda
AmazonKinesis
Streams
Amazon SageMaker
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Feature Engineering for Machine Learning
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
• Focus on Customer’s Value
• Secure and anonymize data
• Real-time Dashboard
• Work with PM and product team at early stage
• Design data pipelines to fit the dashboard
• Frequent Release Cycles
• Store data in raw format
• Ensure data set could be rebuilt by event replaying
• Data-Driven Culture
• Open data to stakeholder who has domain knowledge
• Prepare technical documents for data and tools
Takeaway
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Summary
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Central Storage
Secure, Cost Effective
Storage in S3
S3
Kinesis Direct Connect Snowball DMS
Data Ingestion
Get your data into S3
quickly and securely
Athena Quicksight EMR Redshift
Processing & Analytics
Use predictive and prescriptive
analytics to gain better understanding
Glue ETL
Protect & Secure
Use entitlements to ensure data is secure and users identities are verified
Security Token
Service
Cloudwatch Cloudtrail KMS
Catalog & Search
Access & Search Metadata
DynamoDB Amazon ESGlue Catalog
Access & User Interface
Give your users easy & secure access
API Gateway IAM Cognito
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Ready to build better business from your ideas?
Short list projects that
directly impact
customer engagement
and adoption
Build simple data
pipelines that allow you
to test new ideas, and
fill your data lake
Ask our solution architects
and professional services
teams to help you build
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Please complete the session survey
Thank you!

Contenu connexe

Tendances

Sicurezza e conformità al GDPR con AWS
Sicurezza e conformità al GDPR con AWSSicurezza e conformità al GDPR con AWS
Sicurezza e conformità al GDPR con AWSAmazon Web Services
 
Exploiting IoT & Machine Learning to transform Power and Utilities
Exploiting IoT & Machine Learning to transform Power and UtilitiesExploiting IoT & Machine Learning to transform Power and Utilities
Exploiting IoT & Machine Learning to transform Power and UtilitiesAmazon Web Services
 
AWS IoT: servizi costruiti per migliorare le performance di business
AWS IoT: servizi costruiti per migliorare le performance di businessAWS IoT: servizi costruiti per migliorare le performance di business
AWS IoT: servizi costruiti per migliorare le performance di businessAmazon Web Services
 
Data Analytics를 통한 비지니스 혁신::Craig Stries::AWS Summit Seoul 2018
Data Analytics를 통한 비지니스 혁신::Craig Stries::AWS Summit Seoul 2018Data Analytics를 통한 비지니스 혁신::Craig Stries::AWS Summit Seoul 2018
Data Analytics를 통한 비지니스 혁신::Craig Stries::AWS Summit Seoul 2018Amazon Web Services Korea
 
使用 AWS Step Functions 靈活調度 AWS Lambda (Level:200)
使用 AWS Step Functions 靈活調度 AWS Lambda (Level:200)使用 AWS Step Functions 靈活調度 AWS Lambda (Level:200)
使用 AWS Step Functions 靈活調度 AWS Lambda (Level:200)Amazon Web Services
 
Proven Methodologies for Accelerating Your Cloud Journey (ENT308-S) - AWS re:...
Proven Methodologies for Accelerating Your Cloud Journey (ENT308-S) - AWS re:...Proven Methodologies for Accelerating Your Cloud Journey (ENT308-S) - AWS re:...
Proven Methodologies for Accelerating Your Cloud Journey (ENT308-S) - AWS re:...Amazon Web Services
 
SRV304 IoT Building Blocks From Edge Devices to Analytics in the Cloud
SRV304 IoT Building Blocks From Edge Devices to Analytics in the Cloud SRV304 IoT Building Blocks From Edge Devices to Analytics in the Cloud
SRV304 IoT Building Blocks From Edge Devices to Analytics in the Cloud Amazon Web Services
 
Amazon Cloud Directory Deep Dive (DAT364) - AWS re:Invent 2018
Amazon Cloud Directory Deep Dive (DAT364) - AWS re:Invent 2018Amazon Cloud Directory Deep Dive (DAT364) - AWS re:Invent 2018
Amazon Cloud Directory Deep Dive (DAT364) - AWS re:Invent 2018Amazon Web Services
 
Leadership Session: Using AWS End User Computing Services for Your Modern Wor...
Leadership Session: Using AWS End User Computing Services for Your Modern Wor...Leadership Session: Using AWS End User Computing Services for Your Modern Wor...
Leadership Session: Using AWS End User Computing Services for Your Modern Wor...Amazon Web Services
 
Dissecting Media Asset Management Architecture and Media Archive TCO (MAE301)...
Dissecting Media Asset Management Architecture and Media Archive TCO (MAE301)...Dissecting Media Asset Management Architecture and Media Archive TCO (MAE301)...
Dissecting Media Asset Management Architecture and Media Archive TCO (MAE301)...Amazon Web Services
 
Managed Relational Databases - Amazon RDS
Managed Relational Databases - Amazon RDSManaged Relational Databases - Amazon RDS
Managed Relational Databases - Amazon RDSAmazon Web Services
 
The Future of Enterprise IT - Lessons Learned
The Future of Enterprise IT - Lessons LearnedThe Future of Enterprise IT - Lessons Learned
The Future of Enterprise IT - Lessons LearnedAmazon Web Services
 
Usare la tecnologia Container su AWS
Usare la tecnologia Container su AWSUsare la tecnologia Container su AWS
Usare la tecnologia Container su AWSAmazon Web Services
 
Data freedom: come migrare i carichi di lavoro Big Data su AWS
Data freedom: come migrare i carichi di lavoro Big Data su AWSData freedom: come migrare i carichi di lavoro Big Data su AWS
Data freedom: come migrare i carichi di lavoro Big Data su AWSAmazon Web Services
 
BI & Analytics - A Datalake on AWS
BI & Analytics - A Datalake on AWSBI & Analytics - A Datalake on AWS
BI & Analytics - A Datalake on AWSAmazon Web Services
 
Enable Your Marketing Teams to Engage Users with Relevant & Personalized Cont...
Enable Your Marketing Teams to Engage Users with Relevant & Personalized Cont...Enable Your Marketing Teams to Engage Users with Relevant & Personalized Cont...
Enable Your Marketing Teams to Engage Users with Relevant & Personalized Cont...Amazon Web Services
 

Tendances (20)

Migrating database to cloud
Migrating database to cloudMigrating database to cloud
Migrating database to cloud
 
Sicurezza e conformità al GDPR con AWS
Sicurezza e conformità al GDPR con AWSSicurezza e conformità al GDPR con AWS
Sicurezza e conformità al GDPR con AWS
 
Exploiting IoT & Machine Learning to transform Power and Utilities
Exploiting IoT & Machine Learning to transform Power and UtilitiesExploiting IoT & Machine Learning to transform Power and Utilities
Exploiting IoT & Machine Learning to transform Power and Utilities
 
AWS IoT: servizi costruiti per migliorare le performance di business
AWS IoT: servizi costruiti per migliorare le performance di businessAWS IoT: servizi costruiti per migliorare le performance di business
AWS IoT: servizi costruiti per migliorare le performance di business
 
Data Analytics를 통한 비지니스 혁신::Craig Stries::AWS Summit Seoul 2018
Data Analytics를 통한 비지니스 혁신::Craig Stries::AWS Summit Seoul 2018Data Analytics를 통한 비지니스 혁신::Craig Stries::AWS Summit Seoul 2018
Data Analytics를 통한 비지니스 혁신::Craig Stries::AWS Summit Seoul 2018
 
使用 AWS Step Functions 靈活調度 AWS Lambda (Level:200)
使用 AWS Step Functions 靈活調度 AWS Lambda (Level:200)使用 AWS Step Functions 靈活調度 AWS Lambda (Level:200)
使用 AWS Step Functions 靈活調度 AWS Lambda (Level:200)
 
Proven Methodologies for Accelerating Your Cloud Journey (ENT308-S) - AWS re:...
Proven Methodologies for Accelerating Your Cloud Journey (ENT308-S) - AWS re:...Proven Methodologies for Accelerating Your Cloud Journey (ENT308-S) - AWS re:...
Proven Methodologies for Accelerating Your Cloud Journey (ENT308-S) - AWS re:...
 
SRV304 IoT Building Blocks From Edge Devices to Analytics in the Cloud
SRV304 IoT Building Blocks From Edge Devices to Analytics in the Cloud SRV304 IoT Building Blocks From Edge Devices to Analytics in the Cloud
SRV304 IoT Building Blocks From Edge Devices to Analytics in the Cloud
 
Amazon Cloud Directory Deep Dive (DAT364) - AWS re:Invent 2018
Amazon Cloud Directory Deep Dive (DAT364) - AWS re:Invent 2018Amazon Cloud Directory Deep Dive (DAT364) - AWS re:Invent 2018
Amazon Cloud Directory Deep Dive (DAT364) - AWS re:Invent 2018
 
Leadership Session: Using AWS End User Computing Services for Your Modern Wor...
Leadership Session: Using AWS End User Computing Services for Your Modern Wor...Leadership Session: Using AWS End User Computing Services for Your Modern Wor...
Leadership Session: Using AWS End User Computing Services for Your Modern Wor...
 
New Tools for a New World
New Tools for a New WorldNew Tools for a New World
New Tools for a New World
 
GDPR x AWS 導覽 (Level 200)
GDPR x AWS 導覽 (Level 200)GDPR x AWS 導覽 (Level 200)
GDPR x AWS 導覽 (Level 200)
 
Dissecting Media Asset Management Architecture and Media Archive TCO (MAE301)...
Dissecting Media Asset Management Architecture and Media Archive TCO (MAE301)...Dissecting Media Asset Management Architecture and Media Archive TCO (MAE301)...
Dissecting Media Asset Management Architecture and Media Archive TCO (MAE301)...
 
Managed Relational Databases - Amazon RDS
Managed Relational Databases - Amazon RDSManaged Relational Databases - Amazon RDS
Managed Relational Databases - Amazon RDS
 
The Future of Enterprise IT - Lessons Learned
The Future of Enterprise IT - Lessons LearnedThe Future of Enterprise IT - Lessons Learned
The Future of Enterprise IT - Lessons Learned
 
Usare la tecnologia Container su AWS
Usare la tecnologia Container su AWSUsare la tecnologia Container su AWS
Usare la tecnologia Container su AWS
 
Data freedom: come migrare i carichi di lavoro Big Data su AWS
Data freedom: come migrare i carichi di lavoro Big Data su AWSData freedom: come migrare i carichi di lavoro Big Data su AWS
Data freedom: come migrare i carichi di lavoro Big Data su AWS
 
BI & Analytics - A Datalake on AWS
BI & Analytics - A Datalake on AWSBI & Analytics - A Datalake on AWS
BI & Analytics - A Datalake on AWS
 
Moving forward with AI
Moving forward with AIMoving forward with AI
Moving forward with AI
 
Enable Your Marketing Teams to Engage Users with Relevant & Personalized Cont...
Enable Your Marketing Teams to Engage Users with Relevant & Personalized Cont...Enable Your Marketing Teams to Engage Users with Relevant & Personalized Cont...
Enable Your Marketing Teams to Engage Users with Relevant & Personalized Cont...
 

Similaire à 雲上打造資料湖 (Data Lake):智能化駕馭商機 (Level 300)

Get to Know Your Customers - Build and Innovate with a Modern Data Architecture
Get to Know Your Customers - Build and Innovate with a Modern Data ArchitectureGet to Know Your Customers - Build and Innovate with a Modern Data Architecture
Get to Know Your Customers - Build and Innovate with a Modern Data ArchitectureAmazon Web Services
 
Build and Innovate with a Modern Data Architecture
Build and Innovate with a Modern Data ArchitectureBuild and Innovate with a Modern Data Architecture
Build and Innovate with a Modern Data ArchitectureAmazon Web Services
 
Building a Data Lake in Amazon S3 & Amazon Glacier (STG401-R1) - AWS re:Inven...
Building a Data Lake in Amazon S3 & Amazon Glacier (STG401-R1) - AWS re:Inven...Building a Data Lake in Amazon S3 & Amazon Glacier (STG401-R1) - AWS re:Inven...
Building a Data Lake in Amazon S3 & Amazon Glacier (STG401-R1) - AWS re:Inven...Amazon Web Services
 
Modern Cloud Data Warehousing ft. Equinox Fitness Clubs: Optimize Analytics P...
Modern Cloud Data Warehousing ft. Equinox Fitness Clubs: Optimize Analytics P...Modern Cloud Data Warehousing ft. Equinox Fitness Clubs: Optimize Analytics P...
Modern Cloud Data Warehousing ft. Equinox Fitness Clubs: Optimize Analytics P...Amazon Web Services
 
Better Business from Exploring Ideas - Modern Data Architectures on AWS
Better Business from Exploring Ideas - Modern Data Architectures on AWSBetter Business from Exploring Ideas - Modern Data Architectures on AWS
Better Business from Exploring Ideas - Modern Data Architectures on AWSAmazon Web Services
 
Building Data Lake on AWS | AWS Floor28
Building Data Lake on AWS | AWS Floor28Building Data Lake on AWS | AWS Floor28
Building Data Lake on AWS | AWS Floor28Amazon Web Services
 
AWS Floor 28 - Building Data lake on AWS
AWS Floor 28 - Building Data lake on AWSAWS Floor 28 - Building Data lake on AWS
AWS Floor 28 - Building Data lake on AWSAdir Sharabi
 
AI/ML with Data Lakes: Counterintuitive Consumer Insights in Retail (RET206) ...
AI/ML with Data Lakes: Counterintuitive Consumer Insights in Retail (RET206) ...AI/ML with Data Lakes: Counterintuitive Consumer Insights in Retail (RET206) ...
AI/ML with Data Lakes: Counterintuitive Consumer Insights in Retail (RET206) ...Amazon Web Services
 
Driving Machine Learning and Analytics Use Cases with AWS Storage (STG302) - ...
Driving Machine Learning and Analytics Use Cases with AWS Storage (STG302) - ...Driving Machine Learning and Analytics Use Cases with AWS Storage (STG302) - ...
Driving Machine Learning and Analytics Use Cases with AWS Storage (STG302) - ...Amazon Web Services
 
Building a Data Lake for Your Enterprise, ft. Sysco (STG309) - AWS re:Invent ...
Building a Data Lake for Your Enterprise, ft. Sysco (STG309) - AWS re:Invent ...Building a Data Lake for Your Enterprise, ft. Sysco (STG309) - AWS re:Invent ...
Building a Data Lake for Your Enterprise, ft. Sysco (STG309) - AWS re:Invent ...Amazon Web Services
 
AWS Summit Singapore 2019 | Big Data Analytics Architectural Patterns and Bes...
AWS Summit Singapore 2019 | Big Data Analytics Architectural Patterns and Bes...AWS Summit Singapore 2019 | Big Data Analytics Architectural Patterns and Bes...
AWS Summit Singapore 2019 | Big Data Analytics Architectural Patterns and Bes...AWS Summits
 
Better Business from Exploring Ideas - Modern Data Architectures on AWS
Better Business from Exploring Ideas - Modern Data Architectures on AWSBetter Business from Exploring Ideas - Modern Data Architectures on AWS
Better Business from Exploring Ideas - Modern Data Architectures on AWSAmazon Web Services
 
Social Media Analytics with Amazon QuickSight (ANT370) - AWS re:Invent 2018
Social Media Analytics with Amazon QuickSight (ANT370) - AWS re:Invent 2018Social Media Analytics with Amazon QuickSight (ANT370) - AWS re:Invent 2018
Social Media Analytics with Amazon QuickSight (ANT370) - AWS re:Invent 2018Amazon Web Services
 
SaaS Analytics and Metrics: Capturing and Surfacing the Data That's Fundament...
SaaS Analytics and Metrics: Capturing and Surfacing the Data That's Fundament...SaaS Analytics and Metrics: Capturing and Surfacing the Data That's Fundament...
SaaS Analytics and Metrics: Capturing and Surfacing the Data That's Fundament...Amazon Web Services
 
Better Business From Exploring Ideas - AWS Summit Sydney 2018
Better Business From Exploring Ideas - AWS Summit Sydney 2018Better Business From Exploring Ideas - AWS Summit Sydney 2018
Better Business From Exploring Ideas - AWS Summit Sydney 2018Amazon Web Services
 
Modern Data Platforms - Thinking Data Flywheel on the Cloud
Modern Data Platforms - Thinking Data Flywheel on the CloudModern Data Platforms - Thinking Data Flywheel on the Cloud
Modern Data Platforms - Thinking Data Flywheel on the CloudAlluxio, Inc.
 
Big Data Meets AI - Driving Insights and Adding Intelligence to Your Solutions
 Big Data Meets AI - Driving Insights and Adding Intelligence to Your Solutions Big Data Meets AI - Driving Insights and Adding Intelligence to Your Solutions
Big Data Meets AI - Driving Insights and Adding Intelligence to Your SolutionsAmazon Web Services
 
Connecting the dots - How Amazon Neptune and Graph Databases can transform yo...
Connecting the dots - How Amazon Neptune and Graph Databases can transform yo...Connecting the dots - How Amazon Neptune and Graph Databases can transform yo...
Connecting the dots - How Amazon Neptune and Graph Databases can transform yo...Amazon Web Services
 
Pemahaman Pelanggan & Machine Learning (Level 200 – 300) | Kenali Pelanggan A...
Pemahaman Pelanggan & Machine Learning (Level 200 – 300) | Kenali Pelanggan A...Pemahaman Pelanggan & Machine Learning (Level 200 – 300) | Kenali Pelanggan A...
Pemahaman Pelanggan & Machine Learning (Level 200 – 300) | Kenali Pelanggan A...Amazon Web Services
 

Similaire à 雲上打造資料湖 (Data Lake):智能化駕馭商機 (Level 300) (20)

Get to Know Your Customers - Build and Innovate with a Modern Data Architecture
Get to Know Your Customers - Build and Innovate with a Modern Data ArchitectureGet to Know Your Customers - Build and Innovate with a Modern Data Architecture
Get to Know Your Customers - Build and Innovate with a Modern Data Architecture
 
Build and Innovate with a Modern Data Architecture
Build and Innovate with a Modern Data ArchitectureBuild and Innovate with a Modern Data Architecture
Build and Innovate with a Modern Data Architecture
 
Building a Data Lake in Amazon S3 & Amazon Glacier (STG401-R1) - AWS re:Inven...
Building a Data Lake in Amazon S3 & Amazon Glacier (STG401-R1) - AWS re:Inven...Building a Data Lake in Amazon S3 & Amazon Glacier (STG401-R1) - AWS re:Inven...
Building a Data Lake in Amazon S3 & Amazon Glacier (STG401-R1) - AWS re:Inven...
 
Modern Cloud Data Warehousing ft. Equinox Fitness Clubs: Optimize Analytics P...
Modern Cloud Data Warehousing ft. Equinox Fitness Clubs: Optimize Analytics P...Modern Cloud Data Warehousing ft. Equinox Fitness Clubs: Optimize Analytics P...
Modern Cloud Data Warehousing ft. Equinox Fitness Clubs: Optimize Analytics P...
 
Better Business from Exploring Ideas - Modern Data Architectures on AWS
Better Business from Exploring Ideas - Modern Data Architectures on AWSBetter Business from Exploring Ideas - Modern Data Architectures on AWS
Better Business from Exploring Ideas - Modern Data Architectures on AWS
 
BI & Analytics
BI & AnalyticsBI & Analytics
BI & Analytics
 
Building Data Lake on AWS | AWS Floor28
Building Data Lake on AWS | AWS Floor28Building Data Lake on AWS | AWS Floor28
Building Data Lake on AWS | AWS Floor28
 
AWS Floor 28 - Building Data lake on AWS
AWS Floor 28 - Building Data lake on AWSAWS Floor 28 - Building Data lake on AWS
AWS Floor 28 - Building Data lake on AWS
 
AI/ML with Data Lakes: Counterintuitive Consumer Insights in Retail (RET206) ...
AI/ML with Data Lakes: Counterintuitive Consumer Insights in Retail (RET206) ...AI/ML with Data Lakes: Counterintuitive Consumer Insights in Retail (RET206) ...
AI/ML with Data Lakes: Counterintuitive Consumer Insights in Retail (RET206) ...
 
Driving Machine Learning and Analytics Use Cases with AWS Storage (STG302) - ...
Driving Machine Learning and Analytics Use Cases with AWS Storage (STG302) - ...Driving Machine Learning and Analytics Use Cases with AWS Storage (STG302) - ...
Driving Machine Learning and Analytics Use Cases with AWS Storage (STG302) - ...
 
Building a Data Lake for Your Enterprise, ft. Sysco (STG309) - AWS re:Invent ...
Building a Data Lake for Your Enterprise, ft. Sysco (STG309) - AWS re:Invent ...Building a Data Lake for Your Enterprise, ft. Sysco (STG309) - AWS re:Invent ...
Building a Data Lake for Your Enterprise, ft. Sysco (STG309) - AWS re:Invent ...
 
AWS Summit Singapore 2019 | Big Data Analytics Architectural Patterns and Bes...
AWS Summit Singapore 2019 | Big Data Analytics Architectural Patterns and Bes...AWS Summit Singapore 2019 | Big Data Analytics Architectural Patterns and Bes...
AWS Summit Singapore 2019 | Big Data Analytics Architectural Patterns and Bes...
 
Better Business from Exploring Ideas - Modern Data Architectures on AWS
Better Business from Exploring Ideas - Modern Data Architectures on AWSBetter Business from Exploring Ideas - Modern Data Architectures on AWS
Better Business from Exploring Ideas - Modern Data Architectures on AWS
 
Social Media Analytics with Amazon QuickSight (ANT370) - AWS re:Invent 2018
Social Media Analytics with Amazon QuickSight (ANT370) - AWS re:Invent 2018Social Media Analytics with Amazon QuickSight (ANT370) - AWS re:Invent 2018
Social Media Analytics with Amazon QuickSight (ANT370) - AWS re:Invent 2018
 
SaaS Analytics and Metrics: Capturing and Surfacing the Data That's Fundament...
SaaS Analytics and Metrics: Capturing and Surfacing the Data That's Fundament...SaaS Analytics and Metrics: Capturing and Surfacing the Data That's Fundament...
SaaS Analytics and Metrics: Capturing and Surfacing the Data That's Fundament...
 
Better Business From Exploring Ideas - AWS Summit Sydney 2018
Better Business From Exploring Ideas - AWS Summit Sydney 2018Better Business From Exploring Ideas - AWS Summit Sydney 2018
Better Business From Exploring Ideas - AWS Summit Sydney 2018
 
Modern Data Platforms - Thinking Data Flywheel on the Cloud
Modern Data Platforms - Thinking Data Flywheel on the CloudModern Data Platforms - Thinking Data Flywheel on the Cloud
Modern Data Platforms - Thinking Data Flywheel on the Cloud
 
Big Data Meets AI - Driving Insights and Adding Intelligence to Your Solutions
 Big Data Meets AI - Driving Insights and Adding Intelligence to Your Solutions Big Data Meets AI - Driving Insights and Adding Intelligence to Your Solutions
Big Data Meets AI - Driving Insights and Adding Intelligence to Your Solutions
 
Connecting the dots - How Amazon Neptune and Graph Databases can transform yo...
Connecting the dots - How Amazon Neptune and Graph Databases can transform yo...Connecting the dots - How Amazon Neptune and Graph Databases can transform yo...
Connecting the dots - How Amazon Neptune and Graph Databases can transform yo...
 
Pemahaman Pelanggan & Machine Learning (Level 200 – 300) | Kenali Pelanggan A...
Pemahaman Pelanggan & Machine Learning (Level 200 – 300) | Kenali Pelanggan A...Pemahaman Pelanggan & Machine Learning (Level 200 – 300) | Kenali Pelanggan A...
Pemahaman Pelanggan & Machine Learning (Level 200 – 300) | Kenali Pelanggan A...
 

Plus de Amazon Web Services

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Amazon Web Services
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Amazon Web Services
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateAmazon Web Services
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSAmazon Web Services
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Amazon Web Services
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Amazon Web Services
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...Amazon Web Services
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsAmazon Web Services
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareAmazon Web Services
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSAmazon Web Services
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAmazon Web Services
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareAmazon Web Services
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWSAmazon Web Services
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckAmazon Web Services
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without serversAmazon Web Services
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...Amazon Web Services
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceAmazon Web Services
 

Plus de Amazon Web Services (20)

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS Fargate
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWS
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot
 
Open banking as a service
Open banking as a serviceOpen banking as a service
Open banking as a service
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
 
Computer Vision con AWS
Computer Vision con AWSComputer Vision con AWS
Computer Vision con AWS
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatare
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e web
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
 
Fundraising Essentials
Fundraising EssentialsFundraising Essentials
Fundraising Essentials
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container Service
 

雲上打造資料湖 (Data Lake):智能化駕馭商機 (Level 300)

  • 1. © 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Architecting Data Lake in Cloud Driving Insights and Adding Intelligence Ivan Cheng (鄭志帆) Solutions Architect | AWS Stanley Huang (黃士展) Sr. Manager | Trend Micro
  • 2. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Answer new questions and move faster Raymond, Head of ProductLim, Head of Finance Which regions are the new product selling well? What is the demand forecast by product category? What is the social sentiment about our products? How do quality issues impact cost of production? Can I look at supplier performance over time? How can we reduce our inventory holding costs?
  • 3. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. To answer new questions quickly, we look to a modern data architecture design Massive upfront costs Overprovisioned capacity Long implementation times Pay as you go, for what you use Decoupled pipelines and engines Experimentation platform Ingest/ Collect Consume/ visualize Store Process/ analyze 1 4 0 9 5
  • 4. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Ingest ServingData sources Modern data architecture Insights to enhance business applications, new digital services Transactions ERP Data analysts DATA PIPELINES Ingest/ Collect Consume / visualize Store Process / analyze 1 4 0 9 5
  • 5. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Ingest ServingData sources Modern data architecture Insights to enhance business applications, new digital services Transactions ERP DATA PIPELINES Data Lake expdp Data Data analysts Data Warehouse Amazon Redshift Direct Query Amazon Athena She asks for the SMALLEST amount of data to answer her questions. If it isn’t good enough, she asks for another small slice to be loaded to the DATA LAKE
  • 6. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Characteristics of a Data Lake Future Proof Flexible Access Dive in Anywhere Collect Anything
  • 7. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Ingest ServingData sources Modern data architecture Insights to enhance business applications, new digital services DATA PIPELINES Data Lake He first looks to the DATA LAKE, and imports only the category data he needs He imports JUST ENOUGH data to see if the market is responding to products. Business users Transactions ERP Social media Data Stream Capture Amazon Kinesis Events Amazon QuickSight Data Warehouse Amazon Redshift Stream Data Amazon ElasticSearch
  • 8. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Common data pipeline configuration Raw Data Amazon S3 Highly decoupled configurations scale better, are more fault tolerant, and cost optimized ETL (Hadoop) Amazon EMR Triggered Code AWS Lambda Staged Data (Data Lake) Amazon S3 ETL & Catalog Management AWS Glue Data Warehouse Amazon Redshift Triggered Code AWS Lambda
  • 9. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Ingest ServingData sources Modern data architecture Insights to enhance business applications, new digital services Transactions ERP Data analysts Business users DATA PIPELINES EVENT PIPELINES Data Event Insights Data Lake Social media
  • 10. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Direct relationship with consumers Krzysztof, Data ScientistBala, Head of Marketing What are our customer segments, based on usage? Can predict user preference? How are people using the product? What kinds of people are in/decreasing usage?
  • 11. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Ingest ServingData sources Modern data architecture Insights to enhance business applications, new digital services Transactions Data scientists Business users Connected devices DATA PIPELINES EVENT PIPELINES Data Event Insights Data Lake Sandbox ML / Analytics / DLWeb logs / clickstream
  • 12. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Ingest ServingData sources Modern data architecture Innovate for new revenues - personalization and forecasting Transactions ERP Data analysts Data scientists Business users Connected devices DATA PIPELINES EVENT PIPELINES Data Event Insights Data Lake ML / Analytics Social media Web logs / clickstream
  • 13. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Ingest ServingData sources Modern data architecture Real-time engagement and interactive customer experiences Transactions ERP Data analysts Data scientists Business users Engagement platformsConnected devices Automation / events DATA PIPELINES EVENT PIPELINES Data Event Action Insights Data Lake ML / Analytics Predict / Recommend AI Services Social media Web logs / clickstream
  • 14. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Stanley Huang 黃士展 Sr. Manager, Web Service Engineering (WSE), Consumer, R&D, Trend Micro Data Lake Journey in Trend Micro Consumer Products
  • 15. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Founded RD HQ Revenue Customers 1988, United States Taipei, Taiwan $1.2B USD 500,000+ businesses, Millions of consumers 5800+ Employees in 50+ Countries 100% of the top 10 automotive companies. 45 of the top 50 global corporations. 100% of the top 10 telecom companies. 100% of the top 10 banks. 100% of the top 10 oil companies. A world safe for exchanging digital information
  • 16. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Deep Security
  • 17. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 18. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Data Lake for Consumer Products
  • 19. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Home Network Security
  • 20. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Business Requirement Real-time Dashboard Business Metrics Feature Comparison
  • 21. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Architecture 1.0 Analyze & ConsumeCatalog & TransformIngest & StoreData Source Amazon Redshift AmazonKinesis Streams Lambda function Lambda function Lambda Amazon EMR Amazon S3 Transformed Data Amazon S3 Raw Data application backend Tableau (Data Visualization) Lambda Amazon Pinpoint mobile app AmazonKinesis Streams
  • 22. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Timely Updated Business Metrics Dashboard Business Metrics Original image source: https://www.interaction-design.org/literature/article/google-s-heart-framework-for-measuring-ux
  • 23. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Feature Comparison
  • 24. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Frequent Release Cycles of the Products High Availability Data Quality Auto-recovery
  • 25. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Build-Measure-Learn Feedback Loop Now2017, Sep.
  • 26. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Architecture 2.0 Analyze & ConsumeCatalog & TransformIngest & StoreData Source Amazon Redshift AmazonKinesis Streams Lambda function Lambda function Lambda Amazon EMR Amazon S3 Transformed Data Amazon S3 Raw Data application backend Amazon SQS Task queue Lambda function Tableau (Data Visualization) Amazon SNS Notification Lambda Amazon Pinpoint mobile app AmazonKinesis Streams
  • 27. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. The Culture of Data Driven Exploratory Data Analysis Self-service ETL Machine Learning Platform
  • 28. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Architecture 3.0 Analyze & ConsumeCatalog & TransformIngest & StoreData Source Amazon Redshift Amazon Pinpoint AmazonKinesis Streams Lambda function Lambda function Lambda Amazon EMR Amazon S3 Transformed Data Amazon S3 Raw Data mobile app Amazon EMR application backend Amazon S3 Enriched Data Redshift Spectrum Amazon Athena Glue Catalog Glue Catalog Amazon SQS Task queue Lambda function Tableau (Data Visualization) Downstream Service Amazon SNS Notification Lambda AmazonKinesis Streams Amazon SageMaker
  • 29. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Feature Engineering for Machine Learning
  • 30. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. • Focus on Customer’s Value • Secure and anonymize data • Real-time Dashboard • Work with PM and product team at early stage • Design data pipelines to fit the dashboard • Frequent Release Cycles • Store data in raw format • Ensure data set could be rebuilt by event replaying • Data-Driven Culture • Open data to stakeholder who has domain knowledge • Prepare technical documents for data and tools Takeaway
  • 31. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Summary
  • 32. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Central Storage Secure, Cost Effective Storage in S3 S3 Kinesis Direct Connect Snowball DMS Data Ingestion Get your data into S3 quickly and securely Athena Quicksight EMR Redshift Processing & Analytics Use predictive and prescriptive analytics to gain better understanding Glue ETL Protect & Secure Use entitlements to ensure data is secure and users identities are verified Security Token Service Cloudwatch Cloudtrail KMS Catalog & Search Access & Search Metadata DynamoDB Amazon ESGlue Catalog Access & User Interface Give your users easy & secure access API Gateway IAM Cognito
  • 33. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Ready to build better business from your ideas? Short list projects that directly impact customer engagement and adoption Build simple data pipelines that allow you to test new ideas, and fill your data lake Ask our solution architects and professional services teams to help you build
  • 34. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Please complete the session survey Thank you!