1
Choice of ‘ElasticSearch’ for online e-commerce big-data analysis based on
high performance and high availability
Bosoon, Kim
CTO (Builton Co., Ltd.)
February 22, 2018
http://www.builton.co.kr/en
2
BuiltOn
• The scale of e-commerce worldwide grows day by day.
• In e-commerce, data analysis is essential for companies deciding what to do
and how to do it.
• We analyze various aspects of retailers, sellers and consumers in the
e-commerce industry.
• Many companies in South Korea, including global companies, use
BuiltOn’s data to analyze e-commerce big data.
• We also collaborate with global data analysis partners.
3
What is
e-commerce analysis?
What does BuiltOn analyze?
4
Necessity of e-commerce analysis
• Why is my product not selling?
• What are our competitors’ selling strategies?
• What do consumers who bought our products or services think?
• What are the best-selling products?
• What is the most effective way to advertise to boost sales?
• Who is selling our products?
• How much of our product is sold?
• Where are our products sold?
• And there are still many more questions in e-commerce.
People who work in the e-commerce environment are curious.
5
The e-commerce big-data analysis process diagram
Same flow as typical big-data analytics.
1. E-commerce big-data warehouse configuration
2. Data collection, data refining and data quality control
3. Configure aggregate data marts
4. Visualization
5. Derivation of KPIs (key performance indicators)
6
Analysis based on digital-shelf.
• Collect the search results for categories
and keywords at the target online retailer.
• Analyze the digital shelf share of
manufacturers and brands.
• See the market penetration rate of
my products and competitors’ products.
• Also see the share of advertising by
manufacturer, brand and product.
• The search results show that consumers
are more likely to choose products that
are shown at the top.
Brand analysis in TV category digital shelf for target retailer
Total (132): A Electronics 43.18% (57), B Electronics 40.91% (54), C Electronics 2.27% (3), D 2.27% (3), E 2.27% (3), Etc 7.58% (10)
Normal (120): A Electronics 47.50% (57), B Electronics 45.00% (54), D 2.50% (3), Etc 5.00% (6)
Advertisement (12): C Electronics 25.00% (3), E 25.00% (3), F 16.67% (2), G 16.67% (2), H 16.67% (2)
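Digital shelf share, as used on this slide, is a plain proportion: the number of listings a brand holds divided by all listings collected for a category or keyword. The sketch below reproduces the percentages of the "Normal (120)" listings from the slide. The function name `shelf_share` and the input shape (one brand name per listing) are illustrative assumptions, not BuiltOn's implementation.

```python
from collections import Counter


def shelf_share(brands):
    """Return {brand: (count, percent)} for a list of brand names,
    one entry per product listing found in the search results."""
    counts = Counter(brands)
    total = sum(counts.values())
    return {
        brand: (n, round(100.0 * n / total, 2))
        for brand, n in counts.most_common()
    }


# Listing counts taken from the slide's "Normal (120)" panel.
normal_listings = (
    ["A Electronics"] * 57 + ["B Electronics"] * 54 + ["D"] * 3 + ["Etc"] * 6
)
shares = shelf_share(normal_listings)
for brand, (n, pct) in shares.items():
    print(f"{brand}: {pct:.2f}% ({n})")
```

Running the same function separately on advertised and non-advertised listings gives the split shown in the chart.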
7
Screenshot example: Top 5 shelf share.
8
The analysis based on price.
• Analyze product prices by seller and by
online retailer as a time series.
• For the same product, consumers are more
likely to purchase the lowest-priced offer.
• If the prices of goods sold abroad are much
lower, consumers are unwilling to buy them
locally.
• The lower the product price, the less
profit the seller makes.
Minimum Advertised Price (MAP) violations by resellers.
[Chart: CHANEL SUBLIMAGE LA CR. TS, price by retailer (Retailer A to Retailer E); price axis from 420,000 to 580,000]
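A MAP check like the one charted here comes down to comparing each observed price with the agreed minimum. The following is a minimal sketch under stated assumptions: the `PriceObservation` type, the function name, the MAP value of 500,000 and all observations are hypothetical and chosen only to illustrate the comparison.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PriceObservation:
    retailer: str
    date: str   # ISO date, e.g. "2018-01-15"
    price: int  # advertised price in the local currency


def map_violations(observations, map_price):
    """Return observations advertised below the minimum advertised price,
    sorted by date and then retailer."""
    below = [o for o in observations if o.price < map_price]
    return sorted(below, key=lambda o: (o.date, o.retailer))


# Hypothetical observations; the values are made up for illustration.
observations = [
    PriceObservation("Retailer A", "2018-01-15", 520_000),
    PriceObservation("Retailer B", "2018-01-15", 470_000),
    PriceObservation("Retailer C", "2018-01-16", 500_000),
    PriceObservation("Retailer B", "2018-01-16", 465_000),
]
violations = map_violations(observations, map_price=500_000)
for v in violations:
    print(v.date, v.retailer, v.price)
```

A price equal to the MAP is treated as compliant here; whether that matches a given reseller agreement is a policy choice.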
9
Screenshot example: Official and unofficial price trend
10
The analysis based on customer review
• Analyze customers’ reviews of the product.
• Analyze customer reactions (positive and
negative) to product characteristics through
comments.
• Identify problems with your products and
competitors’ products.
• Discover the sales trend of your products
by totaling the number of purchases on
e-commerce websites.
Product review trend
[Chart: review satisfaction rate for Instant rice 210g x 1]
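The slide does not define how the satisfaction rate is computed. One plausible reading, assumed here, is the percentage of reviews classified as positive in each period. The sketch below groups already-labeled reviews by month; the function name, the input shape and the sample reviews are hypothetical.

```python
from collections import defaultdict


def satisfaction_rate_by_month(reviews):
    """reviews: iterable of (iso_date, is_positive) pairs.
    Returns {"YYYY-MM": percent of positive reviews}, in month order."""
    totals = defaultdict(int)
    positives = defaultdict(int)
    for iso_date, is_positive in reviews:
        month = iso_date[:7]  # "2018-01-03" -> "2018-01"
        totals[month] += 1
        if is_positive:
            positives[month] += 1
    return {m: round(100.0 * positives[m] / totals[m], 1) for m in sorted(totals)}


# Hypothetical labeled reviews for one product.
reviews = [
    ("2018-01-03", True),
    ("2018-01-10", False),
    ("2018-01-21", True),
    ("2018-01-25", True),
    ("2018-02-02", True),
    ("2018-02-14", False),
]
rates = satisfaction_rate_by_month(reviews)
print(rates)
```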
11
Screenshot example: Consumer reviews analytics
12
Analysis based on consumer behavior
• Provide the real-time inflow status of online
product pages.
• Track consumer behavior for each product.
‒ # of purchase button clicks
‒ # of cart button clicks
‒ Sales success rate
• Provide tracking reports that consist of
analysis by platform, keyword and ad.
[Chart: 100% stacked chart of platform share (PC, Mobile, App) as a time series; axis from 0 to 100]
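A 100% stacked chart plots, for each time bucket, every platform's count as a percentage of that bucket's total. A minimal sketch of that normalization follows; the function name, the hourly buckets and the visit counts are hypothetical.

```python
def platform_share(counts_by_period):
    """counts_by_period: {period: {platform: visits}}.
    Returns {period: {platform: percent}}; each non-empty period sums to 100."""
    shares = {}
    for period, counts in counts_by_period.items():
        total = sum(counts.values())
        shares[period] = {
            platform: (100.0 * n / total if total else 0.0)
            for platform, n in counts.items()
        }
    return shares


# Hypothetical page visits per platform, bucketed by hour.
visits = {
    "09:00": {"PC": 50, "Mobile": 30, "App": 20},
    "10:00": {"PC": 10, "Mobile": 60, "App": 30},
}
platform_shares = platform_share(visits)
for period, row in platform_shares.items():
    print(period, row)
```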
13
Screenshot example: Real-time product page status
14
Retrospective from
2012 to 2016
A bit embarrassed …
15
Starting Architecture
Diagram components (node groups are labeled Nodes (X)):
• RDBMS (with Replication)
• Data Collection Engine
• Business Server: Web Service, Data-mart, Visualization
• Network gateway
• Network controller
• Batch process
• Text search engine based on RDBMS
Retailer Information
• Product title
• Price and discount ratio
• Card promotion
• Digital shelves
• Reviews
• Seller
• Etc…
16
Reasons for the starting architecture
• Familiar development environment.
‒ C/C++
‒ Lua script engine.
‒ RDBMS with fixed columns, such as MySQL, SQL Server, PostgreSQL…
• Run a separate data collection engine instance for each user.
• Cloud platforms such as Amazon Web Services.
‒ Cloud platforms are very expensive.
‒ BuiltOn manages its own hardware infrastructure to provide an efficient architecture service for
partners.
• Self-developed visualization.
17
Developed almost all of the architecture components in-house
• Full-text search engine.
‒ A search engine is required to find the products that you want in
big data.
• Monitoring system.
‒ CPU, memory, disk, network traffic, etc.
• Data replication into customers’ storage.
‒ Interpreting and replicating the event log of the RDBMS.
‒ Customers want to replicate refined data to their own data centers.
18
AS THE COMPANY GROWS,
WE FACE NEW
PROBLEMS.
19
As the company grows…
• The limits of the RDBMS were exposed.
‒ The system slowed down.
‒ Customer-specific customization was difficult to accommodate.
‒ Columns were added that other customers did not need.
‒ Too much time was wasted adding columns.
‒ Indexing time increased.
‒ Replication synchronization issues were frequent.
‒ Full-text search took a long time.
‒ The RDBMS cluster did not get much faster even when nodes were added.
• Storage scale-up was too expensive.
‒ Initially, HDD
‒ Next, SSD
‒ In the end, high-performance NVMe SSD
‒ It’s too expensive
There have been many technical issues.
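The column problem listed above can be made concrete with a small sketch: when every customer's custom fields go into one shared table, the table needs the union of all columns, and most rows leave some of them empty. The records and field names below are hypothetical, and the two helper functions are only an illustration of the effect, not part of the original system.

```python
def shared_table_columns(records):
    """Columns a single shared table would need to hold every record,
    in order of first appearance."""
    columns = []
    for record in records:
        for key in record:
            if key not in columns:
                columns.append(key)
    return columns


def count_null_cells(records):
    """Cells that would be NULL if every record were stored in that table."""
    columns = shared_table_columns(records)
    return sum(1 for r in records for c in columns if c not in r)


# Hypothetical records: each customer asks for different extra fields.
records = [
    {"product_title": "TV 55 inch", "price": 1200000},
    {"product_title": "Instant rice 210g", "price": 1500, "card_promotion": "5% off"},
    {"product_title": "Face cream", "price": 450000, "seller": "Reseller X", "review_count": 12},
]
columns = shared_table_columns(records)
null_cells = count_null_cells(records)
print(columns, null_cells)
```

In a document store each record simply carries its own fields, so no shared column list has to be altered when one customer asks for something new.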
[Chart: storage cost and maintenance cost versus storage performance]
20
As the company grows…
• Too much time spent developing
visualization.
• Difficulty analyzing O/S logs.
• Long downtime after hardware failures.
• Repeated development work to solve issues.
There have been many technical issues.
21
FREQUENT DOWNTIME
22
Why & What happened?
• Excessive desire for development and
testing.
• Enormous stored data.
• The belief that hardware scale-up will
solve everything.
• Lack of understanding of the latest
analytics trends.
23
It’s the economy, stupid
James Carville
24
What should be changed?
• Performance has to be much faster than it is
now.
‒ Without expensive NVMe SSD.
• Schema-free storage for flexible data management.
• Minimal downtime when hardware is
replaced.
• A storage engine that supports full-text search
without a separate search engine.
• Automatic archiving of old data to low-cost storage.
Excessive desire for development and testing is a waste of time and money.
25
WHAT DO WE HAVE TO
CHOOSE?
26
Our own evaluation of existing storage engines
• RDBMS Cluster
‒ As the number of nodes increased, more storage capacity became available, but performance was
not satisfactory.
• Couchbase NoSQL database
‒ Random access was good, but sequential access was bad. The system died as the data
grew. This may have changed since.
• HDFS
‒ Reliable, high-capacity storage is good. But everything else must be built by the
developer.
27
Suddenly, the worst situation happens.
• One analytic report needed 3 minutes of aggregation and processing
before it could be shown.
• Because many of the input parameters are changed by the user,
pre-calculation is not possible.
• The customer asked us to deliver the output as soon as they clicked on it.
• It was an unreasonable and excessive demand and could not be met
in our environment.
One day…
28
ElasticSearch
• We could not use an unstable or unreliable storage engine.
• We came across ElasticSearch while trying to solve these problems.
• We moved all the data from the RDBMS to ElasticSearch and were able to deliver the
reports within the time the customer required.
First encounter.
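A report whose parameters change on every click can be expressed as a single Elasticsearch search request that filters and aggregates at query time, so nothing needs to be pre-calculated. The sketch below only builds such a request body using the standard query DSL (`bool` filter, `term`, `range`, `terms` and `avg` aggregations). The field names `category`, `price` and `brand`, the function name and the parameter values are hypothetical; the actual report definition is not given in the slides.

```python
import json


def build_report_query(category, min_price, max_price, group_by="brand", top_n=10):
    """Build a search request body that filters and aggregates in one pass.
    All field names (category, price, brand) are hypothetical."""
    return {
        "size": 0,  # return only aggregation results, no individual hits
        "query": {
            "bool": {
                "filter": [
                    {"term": {"category": category}},
                    {"range": {"price": {"gte": min_price, "lte": max_price}}},
                ]
            }
        },
        "aggs": {
            "by_group": {
                "terms": {"field": group_by, "size": top_n},
                "aggs": {"avg_price": {"avg": {"field": "price"}}},
            }
        },
    }


# User-selected parameters arrive at request time, so the body is built per click.
body = build_report_query("tv", 400000, 600000)
print(json.dumps(body, indent=2))
```

The body would be sent to the `_search` endpoint of the relevant index; each change of user input simply produces a new body.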
29
RDBMS
Based on high-performance NVMe SSD
420,000 IOPS
1 node
Response time: 3m (180 seconds)
ElasticSearch
Based on normal SSD
96,000 IOPS
2 nodes
Response time: 3s (3 seconds)
Response time = 60x faster
30
Amazing performance
With ElasticSearch
BuiltOn
31
New architecture
design in 2017
Based on ElasticSearch
32
New Architecture
Diagram components (node groups are labeled Nodes (X)):
• Job Worker (Node.js)
• Central Scheduler (Node.js)
• RDBMS data-mart
• Visualization based on Business Intelligence
• Network gateway
• Network controller
• ETL & ELT
• Elasticsearch with X-pack: Master Nodes (3), Ingest Nodes (X), Data Nodes - Hot (X), Data Nodes - Warm (X)
• Server: Metricbeat, X-pack, Elasticsearch
• Refinement: Instances (X)
Retailer Information
• Product title, price, card promotion
• Digital shelves
• Shopper reviews
33
What have we changed?
• Replaced the storage engine, moving from RDBMS to
ElasticSearch.
• Perform full-text search directly in ElasticSearch.
• Changed the system monitoring to Metricbeat.
• Use Hot-Warm nodes instead of backing up old data
separately.
‒ Old data is kept on low-cost hardware such as HDD.
• No longer operate RDBMS data replication.
‒ We trust ElasticSearch’s sharding and replication.
• If capacity runs short, just add a new node.
‒ ElasticSearch is fast and easy to scale out.
We’ve changed everything that can be replaced by ElasticSearch.
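One common way to implement a Hot-Warm layout in Elasticsearch is shard allocation filtering: data nodes are tagged with a custom attribute, and an index setting of the form `index.routing.allocation.require.<attribute>` tells the cluster where that index's shards may live. The sketch below assumes time-based indices, a custom node attribute named `box_type` with values `hot` and `warm`, and a 30-day hot period. All three are assumptions for illustration; the slides do not state how BuiltOn configured its nodes.

```python
from datetime import date, timedelta


def indices_to_move_warm(index_dates, today, hot_days=30):
    """index_dates: {index_name: date the index was created}.
    Returns the names of indices older than hot_days, sorted."""
    cutoff = today - timedelta(days=hot_days)
    return sorted(name for name, created in index_dates.items() if created < cutoff)


def warm_settings(node_attribute="box_type"):
    """Index settings body that requires shards to live on warm nodes.
    Assumes data nodes were started with node.attr.<node_attribute> set
    to "hot" or "warm"; the attribute name is a convention, not a built-in."""
    return {f"index.routing.allocation.require.{node_attribute}": "warm"}


# Hypothetical daily indices.
index_dates = {
    "products-2018.01.01": date(2018, 1, 1),
    "products-2018.02.20": date(2018, 2, 20),
}
to_move = indices_to_move_warm(index_dates, today=date(2018, 2, 22))
print(to_move, warm_settings())
```

Applying the settings body to each selected index through the update index settings API causes the cluster to relocate those shards to warm nodes, which can run on cheaper disks.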
34
Changed architecture comparison
Item | Old - RDBMS | New - ElasticSearch
Data type | Based on columns | Document
Schema-free support | N/A | Yes
Real-time analysis response time | Slow | Very fast
Downtime | Long | Almost none
Storage extension policy | Scale-up | Scale-out
Storage cost | Expensive | Cheap
SSD type | Server-side high-performance NVMe | Server-side normal SSD
CPU | Xeon E5-2620 v4 2.10GHz x2 | Xeon E5-2620 v4 2.10GHz
Memory | 512GB per node | 64GB per node
Data distribution | N/A | Shard
Backup | Replication | Replication
Full-text search | In-house development | Basic support
Archiving | Individual backup to HDD | Hot-Warm
System monitoring | In-house development | Metricbeat
Visualization | In-house development | Kibana, Tableau, etc.
35
Before
RDBMS
Expensive CPU /
6 nodes based on server-side NVMe SSD /
512GB memory per node /
Replication-based backup policies /
Sometimes slow response time
Daily data throughput: 30GB
After
ElasticSearch
Cheap CPU /
17 nodes based on normal server-side SSD /
64GB memory per node /
Multi-shard based cluster /
Very fast response time
Daily data throughput: 500GB
36
Technical Support
• Rapid, advanced technical support.
‒ We restarted some nodes.
‒ The problem was that the primary shard data was not redistributed.
‒ In the worst case, data loss could have occurred.
‒ We asked for technical support and were able to solve the problem quickly.
‒ We found that the problem was caused by the index recovery setting being turned off.
‒ We still use technical support whenever we have questions.
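The slide does not name the setting involved. One Elasticsearch setting that produces this symptom is `cluster.routing.allocation.enable`: it is often set to `none` before a restart, and shards stay unassigned until it is restored. Whether this was the setting in BuiltOn's case is an assumption. The sketch below builds the cluster settings body that resets it to its default and filters rows from the cat shards API (JSON output) for unassigned shards; the sample rows are hypothetical.

```python
import json


def reenable_allocation_body():
    """Cluster settings body that resets shard allocation to its default.
    A null value removes the persistent override."""
    return {"persistent": {"cluster.routing.allocation.enable": None}}


def unassigned_shards(cat_shards_rows):
    """cat_shards_rows: list of dicts with 'index', 'shard', 'prirep' and
    'state' keys, as returned by the cat shards API with JSON output."""
    return [row for row in cat_shards_rows if row["state"] == "UNASSIGNED"]


# Hypothetical cat shards output after a node restart.
rows = [
    {"index": "products", "shard": "0", "prirep": "p", "state": "STARTED"},
    {"index": "products", "shard": "1", "prirep": "p", "state": "UNASSIGNED"},
    {"index": "products", "shard": "1", "prirep": "r", "state": "UNASSIGNED"},
]
stuck = unassigned_shards(rows)
print(json.dumps(reenable_allocation_body()))
print(len(stuck), "unassigned shard copies")
```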
X-PACK
37
Future work
Based on ElasticSearch
38
Future work
• Virtualization of ElasticSearch with Docker.
• Infographic using Canvas.
• Buzz analysis using Nori.
• Network monitoring with Packetbeat.
• Monitoring e-commerce big-data property information using Kibana.
• Logstash will be applied to ETL and ELT.
We can do more with ElasticSearch.
39
Don’t be greedy!



Just use
ElasticSearch!



Thank you.

An introduction to Elasticsearch's advanced relevance ranking toolbox
 
Welcome to a new state of find
Welcome to a new state of findWelcome to a new state of find
Welcome to a new state of find
 
Building great website search experiences
Building great website search experiencesBuilding great website search experiences
Building great website search experiences
 
Keynote: Harnessing the power of Elasticsearch for simplified search
Keynote: Harnessing the power of Elasticsearch for simplified searchKeynote: Harnessing the power of Elasticsearch for simplified search
Keynote: Harnessing the power of Elasticsearch for simplified search
 
Cómo transformar los datos en análisis con los que tomar decisiones
Cómo transformar los datos en análisis con los que tomar decisionesCómo transformar los datos en análisis con los que tomar decisiones
Cómo transformar los datos en análisis con los que tomar decisiones
 
Explore relève les défis Big Data avec Elastic Cloud
Explore relève les défis Big Data avec Elastic Cloud Explore relève les défis Big Data avec Elastic Cloud
Explore relève les défis Big Data avec Elastic Cloud
 
Comment transformer vos données en informations exploitables
Comment transformer vos données en informations exploitablesComment transformer vos données en informations exploitables
Comment transformer vos données en informations exploitables
 
Transforming data into actionable insights
Transforming data into actionable insightsTransforming data into actionable insights
Transforming data into actionable insights
 
Opening Keynote: Why Elastic?
Opening Keynote: Why Elastic?Opening Keynote: Why Elastic?
Opening Keynote: Why Elastic?
 
Empowering agencies using Elastic as a Service inside Government
Empowering agencies using Elastic as a Service inside GovernmentEmpowering agencies using Elastic as a Service inside Government
Empowering agencies using Elastic as a Service inside Government
 
The opportunities and challenges of data for public good
The opportunities and challenges of data for public goodThe opportunities and challenges of data for public good
The opportunities and challenges of data for public good
 
Enterprise search and unstructured data with CGI and Elastic
Enterprise search and unstructured data with CGI and ElasticEnterprise search and unstructured data with CGI and Elastic
Enterprise search and unstructured data with CGI and Elastic
 

Dernier

Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesBoston Institute of Analytics
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilV3cube
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 

Dernier (20)

Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 

Customer Story: Integrated logging platform for game services using Elastic Stack

  • 1. Choice of ‘ElasticSearch’ for online e-commerce big-data analysis based on high performance and high availability. Bosoon, Kim, CTO (Builton Co., Ltd.). February 22, 2018. http://www.builton.co.kr/en
  • 2. BuiltOn
      • The scale of e-commerce worldwide grows day by day.
      • In e-commerce, data analysis is essential for companies deciding what to do and how to do it.
      • We analyze various aspects of the retailers, sellers and consumers in the e-commerce industry.
      • Many companies in South Korea, including global companies, are using BuiltOn’s data to analyze e-commerce big data.
      • We also collaborate with global data analysis partners.
  • 3. What is e-commerce analysis? What does BuiltOn analyze?
  • 4. Necessity of e-commerce analysis. People who work in the e-commerce environment are curious.
      • Why is my product not selling?
      • What strategies do competitors use to sell their products?
      • What do consumers who bought our products or services think?
      • What are the best-selling products?
      • What is the most effective way to advertise to boost sales?
      • Who is selling our products?
      • How much of our product is sold?
      • Where are our products sold?
      • In addition, there are still many more questions in e-commerce.
  • 5. The e-commerce big-data analysis process diagram. Same flow as typical big-data analytics.
      1. E-commerce big-data warehouse configuration
      2. Data collection, data refining and data quality control
      3. Configure aggregate data marts
      4. Visualization
      5. Derivation of KPIs (key performance indicators)
  • 6. Analysis based on digital shelf.
      • Collects the search results for categories and keywords at the target online retailer.
      • Analyzes the digital shelf share of manufacturers and brands.
      • Shows the market penetration rate of my products and competitors' products.
      • Also shows the share of advertising by manufacturer, brand and product.
      • The search results show that consumers are more likely to choose products that are exposed at the top.
      Figure: Brand analysis in the TV category digital shelf for the target retailer.
      Total (132): A Electronics 43.18% (57), B Electronics 40.91% (54), Etc 7.58% (10), C 2.27% (3), D 2.27% (3), E 2.27% (3)
      Normal (120): A Electronics 47.50% (57), B Electronics 45.00% (54), Etc 5.00% (6), D 2.50% (3)
      Advertisement (12): C Electronics 25.00% (3), E 25.00% (3), F 16.67% (2), G 16.67% (2), H 16.67% (2)
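The shelf-share percentages on this slide are each brand's count of listings divided by the number of search results in that panel. A minimal Python sketch of that arithmetic follows, using the "Normal" panel counts from the slide; the function name `shelf_share` and the list-of-brand-names input format are assumptions made for illustration, not BuiltOn's implementation.

```python
from collections import Counter


def shelf_share(brands):
    """Return {brand: (count, percent)} for one brand name per search result."""
    counts = Counter(brands)
    total = sum(counts.values())
    # most_common() orders brands from the largest share to the smallest.
    return {b: (n, round(100 * n / total, 2)) for b, n in counts.most_common()}


# "Normal" (non-advertised) results from the slide: 120 listings in total.
normal_results = (
    ["A Electronics"] * 57 + ["B Electronics"] * 54 + ["D"] * 3 + ["Etc"] * 6
)
share = shelf_share(normal_results)
for brand, (n, pct) in share.items():
    print(f"{brand}: {pct:.2f}% ({n})")
```

Running the same function on the advertised results alone gives the "Advertisement" panel, which is how the share of advertising can be reported separately from the organic shelf.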
  • 7. Shoot example: Top 5 shelf share.
  • 8. The analysis based on price.
      • Analyzes product prices by seller and online retailer as a time series.
      • For the same product, consumers are more likely to purchase the lowest-priced offer.
      • If goods sold abroad are priced much lower, consumers are unwilling to buy them locally.
      • The lower the commodity price, the less profit the seller makes.
      Figure: Minimum Advertised Price (MAP) violations by resellers. Price of CHANEL SUBLIMAGE LA CR. TS at Retailers A to E (price axis from 420,000 to 580,000).
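A MAP violation check reduces to comparing each observed offer against the brand's minimum advertised price. The sketch below illustrates the idea in Python; the `Offer` record, the sample prices, the dates and the MAP value of 520,000 are all hypothetical and are not taken from the chart.

```python
from dataclasses import dataclass


@dataclass
class Offer:
    retailer: str
    date: str   # ISO date of the observation (hypothetical)
    price: int  # advertised price in local currency (hypothetical)


def map_violations(offers, map_price):
    """Return the offers advertised below map_price, cheapest first."""
    return sorted((o for o in offers if o.price < map_price), key=lambda o: o.price)


offers = [
    Offer("Retailer A", "2018-02-01", 560_000),
    Offer("Retailer B", "2018-02-01", 498_000),
    Offer("Retailer C", "2018-02-01", 530_000),
    Offer("Retailer D", "2018-02-01", 455_000),
]
violations = map_violations(offers, map_price=520_000)
for o in violations:
    print(f"{o.date} {o.retailer}: {o.price:,}")
```

Applying the check to every collection date yields the per-retailer time series that the chart plots.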
  • 9. Shoot example: Official and unofficial price trend
  • 10. The analysis based on customer review
      • Analyzes customers' reviews of the product.
      • Analyzes customer reactions (positive and negative) to product characteristics through comments.
      • Identifies problems with your products and competitors' products.
      • Discovers the sales trend of your products by totaling the number of purchases on e-commerce websites.
      Figure: Product review trend for "Instant rice 210g x 1" (reviews and satisfaction rate).
  • 11. Shoot example: Consumer reviews analytics
  • 12. Analysis based on consumer behavior
      • Provides the real-time inflow status of online product pages.
      • Tracks consumer behavior for each product.
        ‒ # of purchase button clicks
        ‒ # of cart button clicks
        ‒ Sales success rate
      • Provides a tracking report that consists of analysis platforms, keywords and ads.
      Figure: 100% stacked chart of platform share (PC, Mobile, App) as a time series.
  • 13. Shoot example: Real-time product page status
  • 14. Retrospective from 2012 to 2016. A bit embarrassed …
  • 15. Starting Architecture (based on RDBMS)
      Diagram components, with tiers drawn as "Nodes (X)": network gateway, network controller, data collection engine, batch process, RDBMS (with replication), text search engine, business server, web service, data mart, visualization.
      Retailer information collected:
      • Product title
      • Price and discount ratio
      • Card promotion
      • Digital shelves
      • Reviews
      • Seller
      • Etc.
  • 16. Reason for starting architecture configuration
      • Familiar development environment.
        ‒ C/C++
        ‒ Lua script engine.
        ‒ RDBMS based on columns, such as MySQL, SQL Server, PostgreSQL…
      • Executes a separate data collection engine instance for each user.
      • Cloud platforms such as Amazon Web Services.
        ‒ Cloud platform cost is very high.
        ‒ BuiltOn manages its own hardware infrastructure to provide an efficient architecture service for partners.
      • Self-developed visualization.
  • 17. Developed almost all of the architecture components
      • Full-text search engine.
        ‒ A search engine is required to find the products you want in big data.
      • Monitoring system.
        ‒ CPU, memory, disk, network traffic, etc.
      • Data replication into customers' storage.
        ‒ Interpreting and replicating the RDBMS event log.
        ‒ Customers want to replicate refined data to their own data center.
  • 18. AS THE COMPANY GROWS, WE FACE NEW PROBLEMS.
  • 19. As the company grows… There have been many technical issues.
      • The limits of the RDBMS were exposed.
        ‒ System slows down.
        ‒ Difficult to reflect customer customization.
        ‒ Added columns that other customers do not need.
        ‒ Too much time wasted adding columns.
        ‒ Increased indexing time.
        ‒ Frequent replication synchronization issues.
        ‒ Full-text search takes a long time.
        ‒ The RDBMS cluster does not get much faster even when nodes are added.
      • Storage scale-up cost is too expensive.
        ‒ Initially, HDD
        ‒ Next, SSD
        ‒ High-performance NVMe SSD in the end
        ‒ It’s too expensive
      Figure: storage cost and maintenance cost versus storage performance.
  • 20. As the company grows… There have been many technical issues.
      • Spending too much time developing visualization.
      • Difficulties with O/S log analysis.
      • Long downtime after hardware failures.
      • Recurrent development to solve issues.
  • 22. Why & What happened?
      • Excessive desire for development and testing.
      • Enormous stored data.
      • The belief that hardware scale-up will solve everything.
      • Lack of understanding of the latest analytical trends.
  • 23. “It’s the economy, stupid.” James Carville
  • 24. What should be changed? Excessive desire for development and testing is a waste of time and money.
      • At the very least, performance has to be much faster than now.
        ‒ Without expensive NVMe SSD.
      • Schema-free for flexible data management.
      • Minimize downtime due to hardware replacement.
      • A storage engine that supports full-text search without a separate search engine.
      • Automatic archiving of old data to low-cost storage.
  • 25. WHAT DO WE HAVE TO CHOOSE?
  • 26. Our own evaluation of existing storage engines
      • RDBMS cluster
        ‒ As the number of nodes increased, more storage capacity became available, but performance was not satisfactory.
      • CouchBase NoSQL database
        ‒ Random access is good, but sequential access is bad. The system died as the data grew. This may have changed since then.
      • HDFS
        ‒ Reliable, high-capacity storage is good, but everything else must be built by the developer.
  • 27. Suddenly, the worst situation happens. One day…
      • There was an analytic report that took 3 minutes to aggregate and process.
      • Because many of the input parameters are changed by the user, pre-calculation is not possible.
      • The customer asked us to deliver the output as soon as they clicked.
      • It was an unreasonable and excessive demand and could not be met in our environment.
  • 28. ElasticSearch: first meeting.
      • An unstable and unreliable storage engine could not be used.
      • We met ElasticSearch while trying to solve these troubles.
      • We moved all the data from the RDBMS to ElasticSearch and were able to deliver the reports within the time the customer required.
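The report described above could not be pre-calculated because users change its input parameters, so the aggregation has to run at query time. The slides do not show the actual queries; the following Python sketch only illustrates what such an on-demand request might look like in the Elasticsearch query DSL. The field names (`category`, `collected_at`, `brand`, `price`) and the function `build_report_query` are assumptions for illustration, and `brand` is assumed to be mapped as a keyword field.

```python
import json


def build_report_query(category, start, end, top_n=10):
    """Build a search body that filters by user parameters and aggregates on the fly."""
    return {
        "size": 0,  # return aggregation buckets only, no individual documents
        "query": {
            "bool": {
                "filter": [
                    {"term": {"category": category}},
                    {"range": {"collected_at": {"gte": start, "lte": end}}},
                ]
            }
        },
        "aggs": {
            "by_brand": {
                "terms": {"field": "brand", "size": top_n},
                # Sub-aggregation computed per brand bucket.
                "aggs": {"avg_price": {"avg": {"field": "price"}}},
            }
        },
    }


body = build_report_query("tv", "2018-01-01", "2018-01-31", top_n=5)
print(json.dumps(body, indent=2))
```

The body would be sent to an index's `_search` endpoint. Because the filters sit in `bool.filter`, they do not affect scoring and can be cached by Elasticsearch.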
  • 29. Response time comparison: ElasticSearch is 60x faster.
      • RDBMS: based on high-performance NVMe SSD, 420,000 IOPS, 1 node, 3 m (180 seconds) response time.
      • ElasticSearch: based on normal SSD, 96,000 IOPS, 2 nodes, 3 s (3 seconds) response time.
  • 31. New architecture design in 2017, based on ElasticSearch
  • 32. New Architecture
      Diagram components, with tiers drawn as "Nodes (X)" or "Instances (X)":
      • Retailer information: product title, price, card promotion; digital shelves; shopper reviews
      • Network gateway and network controller
      • Central scheduler (Node.js) and job worker (Node.js)
      • Refinement; ETL & ELT
      • Elasticsearch with X-Pack: master nodes (3), ingest nodes (X), hot data nodes (X), warm data nodes (X)
      • Servers monitored with Metricbeat and X-Pack
      • RDBMS data mart
      • Visualization based on business intelligence
  • 33. What have we changed? We’ve changed everything that can be replaced by ElasticSearch.
      • Replaced the storage engine: from RDBMS to ElasticSearch.
      • Perform full-text search directly in ElasticSearch.
      • Changed system monitoring to Metricbeat.
      • Use Hot-Warm nodes instead of backing up old data separately.
        ‒ Old data is kept on low-cost hardware such as HDD.
      • No longer operate RDBMS data replication.
        ‒ We trust ElasticSearch's shards and replication.
      • If capacity runs short, just add a new node.
        ‒ ElasticSearch is fast and easy to scale out.
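In a Hot-Warm setup, old indices are moved to cheaper nodes by changing an index-level shard allocation filter rather than by taking a separate backup. The sketch below illustrates one way to pick which indices to move. It assumes daily indices named like `products-YYYY.MM.DD`, nodes tagged with a custom attribute `box_type` (`hot` or `warm`), and a 30-day threshold; none of these details come from the slides.

```python
from datetime import date, datetime

# Index setting that requires shards to live on nodes tagged box_type=warm.
# "box_type" is a custom node attribute name chosen for this sketch.
WARM_SETTINGS = {"index.routing.allocation.require.box_type": "warm"}


def indices_to_move_warm(index_names, today, max_hot_days=30, prefix="products-"):
    """Return (settings path, body) pairs for indices older than max_hot_days."""
    moves = []
    for name in index_names:
        if not name.startswith(prefix):
            continue  # ignore indices that do not follow the daily naming scheme
        day = datetime.strptime(name[len(prefix):], "%Y.%m.%d").date()
        if (today - day).days > max_hot_days:
            moves.append((f"/{name}/_settings", WARM_SETTINGS))
    return moves


moves = indices_to_move_warm(
    ["products-2018.01.01", "products-2018.02.20", ".kibana"],
    today=date(2018, 2, 22),
)
for path, settings in moves:
    print("PUT", path, settings)
```

Each pair corresponds to an update of the index settings. Once the setting changes, Elasticsearch relocates that index's shards to the matching nodes on its own.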
  • 34. Changed architecture comparison
      Item | Old - RDBMS | New - ElasticSearch
      Data type | Based on columns | Document
      Schema-free support | N/A | Yes
      Real-time analysis response time | Slow | Very fast
      Downtime | Long | Almost none
      Storage extension policy | Scale-up | Scale-out
      Storage cost | Expensive | Cheap
      SSD type | Server-side high-performance NVMe | Server-side normal SSD
      CPU | Xeon E5-2620 v4 2.10GHz / x2 | Xeon E5-2620 v4 2.10GHz
      Memory | 512GB per node | 64GB per node
      Data distribution | N/A | Shard
      Backup | Replication | Replication
      Full-text search | In-house development | Basic support
      Archiving | Individual backup into HDD | Hot-Warm
      System monitoring | In-house development | Metricbeat
      Visualization | In-house development | Kibana, Tableau, etc.
  • 35. Before and after
      • Before (RDBMS): expensive CPU / 6 nodes based on server-side NVMe SSD / 512GB memory per node / replication-based backup policies / sometimes slow response time. Daily data throughput: 30GB.
      • After (ElasticSearch): cheap CPU / 17 nodes based on normal server-side SSD / 64GB memory per node / multi-shard based cluster / very fast response time. Daily data throughput: 500GB.
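The before and after figures can be put side by side with a little arithmetic. The Python sketch below assumes, from the order in which the numbers appear on the slide, that 30GB is the daily throughput before the migration and 500GB the throughput after it. The variable names are chosen for illustration only.

```python
# Figures from the slide; the 30GB/500GB pairing is an assumption (see lead-in).
before = {"nodes": 6, "memory_gb_per_node": 512, "daily_throughput_gb": 30}
after = {"nodes": 17, "memory_gb_per_node": 64, "daily_throughput_gb": 500}


def summarize(cluster):
    """Return total cluster memory and daily throughput per node."""
    return {
        "total_memory_gb": cluster["nodes"] * cluster["memory_gb_per_node"],
        "throughput_gb_per_node": round(
            cluster["daily_throughput_gb"] / cluster["nodes"], 1
        ),
    }


summary_before = summarize(before)
summary_after = summarize(after)
throughput_gain = round(
    after["daily_throughput_gb"] / before["daily_throughput_gb"], 1
)

print("before:", summary_before)
print("after: ", summary_after)
print(f"daily throughput gain: {throughput_gain}x")
```

Under that assumption, the new cluster handles roughly 16.7 times the daily volume with about a third of the total memory (1,088GB versus 3,072GB), which is the scale-out argument made in the comparison table.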
  • 36. Technical Support (X-PACK)
      • Rapid, advanced technical support.
        ‒ We restarted some nodes.
        ‒ The problem was that the primary shard data was not redistributed.
        ‒ In the worst case, data loss could have occurred.
        ‒ We asked for technical support and were able to solve the problem quickly.
        ‒ We found that the cause was that the index recovery setting had been turned off.
        ‒ We still use technical support whenever we have questions.
  • 37. Future work, based on ElasticSearch
  • 38. Future work. We can do more with ElasticSearch.
      • Virtualization of ElasticSearch with Docker.
      • Infographics using Canvas.
      • Buzz analysis using Nori.
      • Network monitoring with Packetbeat.
      • Monitoring e-commerce big-data property information using Kibana.
      • Logstash will be applied to ETL and ELT.
  • 39. Don’t be greedy! Just use ElasticSearch! Thank you.