Kai Wähner (@KaiWaehner) is a Technology Evangelist and Community Director at TIBCO Software - a leading provider of integration and analytics middleware. Kai is an experience guy in broad variety of topics like Big Data, Advanced Analytics & Machine Learning, he loves to write articles and blog about new technologies and make talks. The talk is about 3 different projects where Kai's team built analytic models with technologies R, Apache Spark or H2O.ai which were deployed to real time processing. The use cases include predictive maintenance in manufacturing but also fraud detection in banking and context-specific pricing in insurance. For one of the cases, Kai gonna show detailed steps will be, how it was built and deployed using supervised/unsupervised ML.
Talk was done together with my colleague Ankitaa Bhowmick.
5. Machine Learning
…. allows computers to find hidden insights without being
explicitly programmed where to look.
6. Real World Examples of Machine Learning
Spam Detection
Search Results +
Product Recommendation
Picture Detection
(Friends, Locations, Products)
Machine Learning is already present in daily life…
Now, every enterprise is beginning to leverage it!
The Next Disruption:
Google Beats Go Champion
19. 2. Find Leading
Indicators
3. Backtest
Rules / Models
4. Push
Rules / Models
to Event Server
1. Study
Anomalies
Sensor Analytics – Patterns in Historical Data
23. Scenario: Predictive Scrapping of Parts in an Assembly Line
Goal: Scrap parts as early as possible automatically to reduce costs in a manufacturing process.
Question: When to scrap a part in Station 1 instead of doing re-work or sending it to Station 2?
Station 1 Station 2
Cost Before
9€
7€ 13€
Total Cost
29€
(or more)
Scrap? Scrap?
26. Fast Data Architecture for Predictive Maintenance
Operational Analytics
Operations
Live UI
CSV Batch
JSON Real Time
XML Real Time
Streaming AnalyticsAction
Aggregate
Rules
Analytics
Correlate
Live Datamart
Continuous query
processing
Alerts
Manual action,
escalation
HISTORICAL ANALYSIS Data
Scientists
Flume
HDFS
Spotfire
R / TERR
HDFS
Hadoop (Cloudera)
StreamBase
TIBCO Fast Data Platform
H2O
Oracle RDBMS
Avro Parquet … PMML
Internal Data