SlideShare une entreprise Scribd logo
1  sur  46
Clustering data at scale
Dan Filimon, Politehnica University Bucharest
Ted Dunning, MapR Technologies
Saturday, June 1, 13
whoami
• Soon-to-be graduate of Politehnica Univerity of
Bucharest
• Apache Mahout committer
• This work is my senior project
• Contact me at
• dfilimon@apache.org
• dangeorge.filimon@gmail.com
Saturday, June 1, 13
Agenda
• Data
• Clustering
• k-means
• Improvements
• Large scale
• k-means as a map-reduce
• streaming k-means
• MapReduce & Storm
• Results
Full version at http://goo.gl/n3n8S
Saturday, June 1, 13
Data
• Real-valued vectors (or anything that can be
encoded as such)
• Think of rows in a database table
• Can be: documents, web pages, images, videos,
users, DNA
Saturday, June 1, 13
The problem
Group n d-dimensional datapoints into k disjoint
sets to minimize
ckX
c1
0
@
X
xij2Xi
dist(xij, ci)
1
A
Xi is the ith
cluster
ci is the centroid of the ith
cluster
xij is the jth
point from the ith
cluster
dist(x, y) = ||x y||2
Saturday, June 1, 13
k-means
points to cluster
Saturday, June 1, 13
k-means
initial centroids
Saturday, June 1, 13
k-means
first iteration assignment
Saturday, June 1, 13
k-means
adjusted centroids
Saturday, June 1, 13
k-means
second iteration assignment
Saturday, June 1, 13
k-means
adjusted centroids
Saturday, June 1, 13
Details
• Quality
• How to initialize the centroids
• When to stop iterating
• How to deal with outliers
• Speed
• Complexity of cluster assignment
Saturday, June 1, 13
Quality?
• Clustering is in the eye of the beholder
• Total clustering cost and:
• compact
• well-separated
• Dunn Index, Davies-Bouldin Index, etc.
Saturday, June 1, 13
Centroid initialization
• Important for quick
convergence and quality
• Randomly select k points as
centroids
• Clustering fails if two
centroids are in the same
cluster
• k-means++ addresses this
2 seeds here
1 seed here
Saturday, June 1, 13
Outliers
• Real data is messy
• Outliers can affect
k-means centroids
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
● ●
●
●
●
●●
●
●
●
●
●
●
●
● ●
●
●
● ●
●
● ●
●
● ●
●
●
●●
●
●
●
●
●
●
●
●
● ●
●
●
●
●
●
●●
●
● ●
●
●
●
●
●
●
● ●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
● ●
●
●●
●
●
●
●
●
−5 0 5
−6−4−20246
x
y
●
●
Effect of outliers
better centroid
Saturday, June 1, 13
Closest cluster: reducing k
• Avoid computing the distance from a point to every
cluster
• Random Projections
• projection search
• locality-sensitive hashing search
Saturday, June 1, 13
Random Projections
1.5-2 -1.5 -1 -0.5 0.5 1
3
-3
-2
-1
1
2
X Axis
YAxis
• Unit length vectors with normally distributed
components
Saturday, June 1, 13
Closest cluster: reducing d
• Principal Component Analysis
• compute SVD
• Random Projections
• multiply data by projection matrix
Saturday, June 1, 13
k-means as a MapReduce
• Can’t split k-means loop directly
• Must express a single k-means step as a MapReduce
Saturday, June 1, 13
Now for something
completely different
• Attempt to build a “sketch” of the data in one pass
• O(k log n) intermediate clusters
• Can fit into memory
• Ball k-means on the sketch for k final clusters
Saturday, June 1, 13
streaming k-means
first point
Saturday, June 1, 13
streaming k-means
becomes centroid
Saturday, June 1, 13
streaming k-means
second point
far away
Saturday, June 1, 13
streaming k-means
becomes centroid
far away
Saturday, June 1, 13
streaming k-means
third point
close
Saturday, June 1, 13
streaming k-means
centroid is updated
Saturday, June 1, 13
streaming k-means
close
but becomes a
new centroid
Saturday, June 1, 13
streaming k-means
Saturday, June 1, 13
streaming k-means
• for each point p, with weight w
• find the closest centroid to p, call it c and let d be the
distance between p and c
• if an event with probability proportional to
d * w / distanceCutoff occurs
• create a new cluster with p as its centroid
• else, merge p into c
• if there are too many clusters, increase distanceCutoff
and cluster recursively
Saturday, June 1, 13
The big picture
MapReduce
• Can cluster all the points with just 1 MapReduce:
• m mappers run streaming k-means
• 1 reducer runs ball k-means to get k clusters
Saturday, June 1, 13
The big picture
Storm
• Storm streaming topology
• streaming k-means bolt
• Release the sketch when notified (e.g. tick tuples)
• Trident
• streaming k-means partition aggregator
• ball k-means aggregator
Saturday, June 1, 13
Results
Saturday, June 1, 13
Quality
• Compared the quality on various small-medium UCI data
sets
• iris, seeds, movement, control, power
• Computed the following quality measures:
• Dunn Index (higher is better)
• Davies-Bouldin Index (lower is better)
• Adjusted Rand Index (higher is better)
• Total cost (lower is better)
Saturday, June 1, 13
iris randplot-2
0.5 1.0 1.5 2.0 2.5 3.0 3.5
0.51.01.52.02.53.03.5
0
51
0
62
0
0
0
0
38
Confusion matrix run 2
Adjusted Rand Index 1
Ball/Streaming k−means centroids
Mahoutk−meanscentroids
Saturday, June 1, 13
iris randplot-4
0.5 1.0 1.5 2.0 2.5 3.0 3.5
0.51.01.52.02.53.03.5
50
1
0
0
0
38
3
0
59
Confusion matrix run 4
Adjusted Rand Index 0.546899
Ball/Streaming k−means centroids
Mahoutk−meanscentroids
Saturday, June 1, 13
seeds compareplot
bskm km
4567
4.3116448 4.22126213333333
Clustering techniques compared all
Clustering type / overall mean cluster distance
Meanclusterdistance
Saturday, June 1, 13
movement compareplot
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
0 20 40 60
1.01.52.02.51.403378
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
1.50878
●
●
km distances
bskm distances
km mean
bskm mean
km vs bskm
Cluster index/Run
Averageclusterdistance
Saturday, June 1, 13
control randplot-3
1 2 3 4 5 6
123456
0
132
0
0
0
0
0
39
0
0
0
0
0
0
46
39
34
0
197
1
0
0
0
0
0
0
18
0
4
59
0
31
0
0
0
0
Confusion matrix run 3
Adjusted Rand Index 0.724136
Ball/Streaming k−means centroids
Mahoutk−meanscentroids
Saturday, June 1, 13
power allplot
●
●
●
●
bskm km
102050100200500
142.101769766667 149.3591672
Clustering techniques compared all
Clustering type / overall mean cluster distance
Meanclusterdistance
Saturday, June 1, 13
power compareplot
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
● ●
●
●
●
●
● ●
●
● ● ●
●
●
0 5 10 15 20 25 30
0200400600800149.3592
● ●
●
●
● ●
●
● ●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
142.1018
●
●
km distances
bskm distances
km mean
bskm mean
km vs bskm
Cluster index/Run
Averageclusterdistance
Saturday, June 1, 13
Overall (avg. 5 runs)
Dataset Clustering Avg. Dunn Avg. DB Avg. Cost Avg.ARI
iris
km 9.161 0.265 124.146
0.905iris
bskm 6.454 0.336 117.859
0.905
seeds
km 7.432 0.453 909.875
0.980seeds
bskm 6.886 0.505 916.511
0.980
movement
km 0.457 1.843 336.456
0.650movement
bskm 0.436 2.003 347.078
0.650
control
km 0.553 1.700 1014313
0.630control
bskm 0.753 1.434 1004917
0.630
power
km 0.107 1.380 73339083
0.605power
bskm 1.953 1.080 54422758
0.605
Saturday, June 1, 13
Speed (Threaded)
Saturday, June 1, 13
Speed (MapReduce)
0
17.5
35
52.5
70
20 clusters 1 iteration
KMeans StreamingKMeans
iteration times comparable
~2.4x faster
Saturday, June 1, 13
Speed (MapReduce)
0
75
150
225
300
1000 clusters 1 iteration
KMeans StreamingKMeans
benefits from
approximate
nearest-neighbor
search
~8x faster
Saturday, June 1, 13
Code
• Now available in Apache Mahout trunk
• Prototype for Storm
http://github.com/dfilimon/streaming-storm
Clustering algorithms
BallKMeans
StreamingKMeans
Fast nearest-neighbor search ProjectionSearch
Quality metrics ClusteringUtils
MapReduce classes
StreamingKMeansMapper
StreamingKMeansReducer
Storm classes StreamingKMeansBolt
Saturday, June 1, 13
Thank you!
Questions?
dfilimon@apache.org
dangeorge.filimon@gmail.com
Saturday, June 1, 13

Contenu connexe

Similaire à Clustering large-scale data Buzzwords 2013

대용량 데이터 분석을 위한 병렬 Clustering 알고리즘 최적화
대용량 데이터 분석을 위한 병렬 Clustering 알고리즘 최적화대용량 데이터 분석을 위한 병렬 Clustering 알고리즘 최적화
대용량 데이터 분석을 위한 병렬 Clustering 알고리즘 최적화NAVER Engineering
 
A Lightweight Infrastructure for Graph Analytics
A Lightweight Infrastructure for Graph AnalyticsA Lightweight Infrastructure for Graph Analytics
A Lightweight Infrastructure for Graph AnalyticsDonald Nguyen
 
Pregel: A System For Large Scale Graph Processing
Pregel: A System For Large Scale Graph ProcessingPregel: A System For Large Scale Graph Processing
Pregel: A System For Large Scale Graph ProcessingRiyad Parvez
 
[ML]-Unsupervised-learning_Unit2.ppt.pdf
[ML]-Unsupervised-learning_Unit2.ppt.pdf[ML]-Unsupervised-learning_Unit2.ppt.pdf
[ML]-Unsupervised-learning_Unit2.ppt.pdf4NM20IS025BHUSHANNAY
 
Pattern recognition binoy k means clustering
Pattern recognition binoy  k means clusteringPattern recognition binoy  k means clustering
Pattern recognition binoy k means clustering108kaushik
 
DMTM Lecture 13 Representative based clustering
DMTM Lecture 13 Representative based clusteringDMTM Lecture 13 Representative based clustering
DMTM Lecture 13 Representative based clusteringPier Luca Lanzi
 
clustering_hierarchical ckustering notes.pdf
clustering_hierarchical ckustering notes.pdfclustering_hierarchical ckustering notes.pdf
clustering_hierarchical ckustering notes.pdfp_manimozhi
 
log6kntt4i4dgwfwbpxw-signature-75c4ed0a4b22d2fef90396cdcdae85b38911f9dce0924a...
log6kntt4i4dgwfwbpxw-signature-75c4ed0a4b22d2fef90396cdcdae85b38911f9dce0924a...log6kntt4i4dgwfwbpxw-signature-75c4ed0a4b22d2fef90396cdcdae85b38911f9dce0924a...
log6kntt4i4dgwfwbpxw-signature-75c4ed0a4b22d2fef90396cdcdae85b38911f9dce0924a...ABINASHPADHY6
 
Prototyping Dynamic Robots
Prototyping Dynamic RobotsPrototyping Dynamic Robots
Prototyping Dynamic RobotsNick Morozovsky
 
Advanced database and data mining & clustering concepts
Advanced database and data mining & clustering conceptsAdvanced database and data mining & clustering concepts
Advanced database and data mining & clustering conceptsNithyananthSengottai
 

Similaire à Clustering large-scale data Buzzwords 2013 (15)

대용량 데이터 분석을 위한 병렬 Clustering 알고리즘 최적화
대용량 데이터 분석을 위한 병렬 Clustering 알고리즘 최적화대용량 데이터 분석을 위한 병렬 Clustering 알고리즘 최적화
대용량 데이터 분석을 위한 병렬 Clustering 알고리즘 최적화
 
A Lightweight Infrastructure for Graph Analytics
A Lightweight Infrastructure for Graph AnalyticsA Lightweight Infrastructure for Graph Analytics
A Lightweight Infrastructure for Graph Analytics
 
Pregel: A System For Large Scale Graph Processing
Pregel: A System For Large Scale Graph ProcessingPregel: A System For Large Scale Graph Processing
Pregel: A System For Large Scale Graph Processing
 
Machine Learning Tools and Particle Swarm Optimization for Content-Based Sear...
Machine Learning Tools and Particle Swarm Optimization for Content-Based Sear...Machine Learning Tools and Particle Swarm Optimization for Content-Based Sear...
Machine Learning Tools and Particle Swarm Optimization for Content-Based Sear...
 
[ML]-Unsupervised-learning_Unit2.ppt.pdf
[ML]-Unsupervised-learning_Unit2.ppt.pdf[ML]-Unsupervised-learning_Unit2.ppt.pdf
[ML]-Unsupervised-learning_Unit2.ppt.pdf
 
Pattern recognition binoy k means clustering
Pattern recognition binoy  k means clusteringPattern recognition binoy  k means clustering
Pattern recognition binoy k means clustering
 
Clustering - ACM 2013 02-25
Clustering - ACM 2013 02-25Clustering - ACM 2013 02-25
Clustering - ACM 2013 02-25
 
DMTM Lecture 13 Representative based clustering
DMTM Lecture 13 Representative based clusteringDMTM Lecture 13 Representative based clustering
DMTM Lecture 13 Representative based clustering
 
clustering_hierarchical ckustering notes.pdf
clustering_hierarchical ckustering notes.pdfclustering_hierarchical ckustering notes.pdf
clustering_hierarchical ckustering notes.pdf
 
Clustering: A Scikit Learn Tutorial
Clustering: A Scikit Learn TutorialClustering: A Scikit Learn Tutorial
Clustering: A Scikit Learn Tutorial
 
0629venmoplus
0629venmoplus0629venmoplus
0629venmoplus
 
log6kntt4i4dgwfwbpxw-signature-75c4ed0a4b22d2fef90396cdcdae85b38911f9dce0924a...
log6kntt4i4dgwfwbpxw-signature-75c4ed0a4b22d2fef90396cdcdae85b38911f9dce0924a...log6kntt4i4dgwfwbpxw-signature-75c4ed0a4b22d2fef90396cdcdae85b38911f9dce0924a...
log6kntt4i4dgwfwbpxw-signature-75c4ed0a4b22d2fef90396cdcdae85b38911f9dce0924a...
 
Outlier detection handling
Outlier detection handlingOutlier detection handling
Outlier detection handling
 
Prototyping Dynamic Robots
Prototyping Dynamic RobotsPrototyping Dynamic Robots
Prototyping Dynamic Robots
 
Advanced database and data mining & clustering concepts
Advanced database and data mining & clustering conceptsAdvanced database and data mining & clustering concepts
Advanced database and data mining & clustering concepts
 

Dernier

Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024The Digital Insurer
 
Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024The Digital Insurer
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyKhushali Kathiriya
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...apidays
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsRoshan Dwivedi
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 

Dernier (20)

Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 

Clustering large-scale data Buzzwords 2013