SlideShare une entreprise Scribd logo
1  sur  32
Fighting Financial Crime
with Artificial Intelligence
Tim Seears
Area Practice Director, Data Science – Asia Pacific & Korea
2
So far we have delivered
150+ successful projects for
100+ clients worldwide.
500+ employees
(2017)
Vendor-neutral with an
open source focus.
Full spectrum consulting, data
engineering, data science &
support.
Apache
Hadoop
and cloud
ecosystem
integration.
Founded in 2010
industry thought
leader.
Fixed fee offerings
for data science and
engineering.
Who Is Think Big?
1st Big Data provider 100%
focused around open source.
3
Help customers be financially confident
and achieve their ambitions by making
daily banking and important financial
decisions easy.
19,000
employees
(2017)
1,800 corporate and
institutional customers
Covering Personal Banking, Business
Banking, Corporates & Institutions and
Wealth Management.
2.7 million
personal
customers
236,000 small and
medium-sized
business customers
Making banking
easier for over 145
years.
Who is the customer?
Leading Nordic universal bank with strong
local roots and bridges to the rest of the
world
4
PayPal WalmartJohn Deere Lowe’s Wells FargoJP Morgan
This is Happening in the Enterprise today
5
“
’’
Over the next decade, AI won’t replace
managers, but managers who use AI
will replace those who don’t.
HBR July 2017
7
Data Driven Approach to Fight Fraud
Fast evolving fraud sophistication, AI
Ambitions for Fraud Project
Become client’s advanced
analytics blueprint
Data driven approach to real
time scoring of transactions
Reduce false-positives &
Enhance fraud detection rate
ONLY ~40%
of fraud cases are detected
Low Detection Rate
99.5%
of cases are not
fraud related
Many false positives
Challenges for Fraud Detection
Tens of Millions
€ lost each month
High Fraud Loss
© 2017 Teradata
8
Fraud Types – Customer Initiated
Nigeria/
Investment Scam
CEO
Fraud
Customer
Initiated
© 2017 Teradata
Beneficiary Account
Change
Fake
Invoice
Rental Scam/
Goods Not
Received
9
ID Theft
SPEAR
Phishing
Vishing/Support Scam
Malware
Phishing/Smishing
Fraud Types – Fraudster Initiated
Fraudster
Initiated
© 2017 Teradata
10
Modeling
Challenges
© 2017 Teradata
• Class imbalance
(100,000:1 non-fraud vs. fraud)
• Assigning fraud labels from
historic data
• Fraud is ambiguous
• Not all features available in
real-time
• Most machine learning sees
transactions atomically
11
Advanced Platform for Fraud: Data-Driven Approach
How It Works:
• Understanding the domain
• Gathering and preparing the data
• Automatically generating the rules and recognizing fraudulent patterns by
training models on historical data
• Automatically maintaining the engine by retraining the model
Pros:
• Automatic/data-driven/objective
inference of the rules
• Ability to detect patterns in a high
dimensional data input
• Fast detection of new/changing
fraudulent patterns
Cons:
• Might be unintuitive and hardly
interpretable
• Data preparation and feature aggregation
is time consuming
© 2017 Teradata
12
Phase One
© 2017 Teradata
13
Sep Oct Nov Dec Jan Feb Mar Apr May Jun Jul
Project kick-off
30th September
Teamwork to Deliver Value in Each Project Iteration
2016 2017
Kick-off: Data
Scientist track
Go-Live plan set
for the project
19th December
Kick-off:
Engineering track
Models successfully in
Shadow Production
4th March
First round trip
test transaction
First production
virtual machine
Production Hardening
for HA and Security
Full productionisation of
the Fraud Engine
Cross-
functional
Collaboration
Cont. Deep
Learning
modeling
© 2017 Teradata
14
>
Banking
Anti-Fraud
Solution
© 2017 Teradata
Banking Anti-Fraud Solution
By leveraging the power of a
thoughtful and strong data
and analytics strategy, we
unleash high impact business
outcomes.
• Multiple models running in production
at the same time
• Mix of traditional and advance deep
learning methods
• AnalyticsOps: Deploying machine
learning models in production
Model Management
Framework
• Organisation and silos of data
• Real-time data integration
• Security and Procedures: following
existing bank procedures
Data Modelling, Pipeline
and Ingestion
• Hard to operationalize insights
• Availability of analytic capabilities/ skills
and data
• Interpreting the results of machine
learning models
Machine Learning and
Artificial Intelligence
15
Advanced Platform for Fraud: Data-Driven Approach
Manual
Evaluation
eBanking
Mobile
Pay
Business
Online
Global Payment
Interface
Fraud?
Create
Verification
Fraud
Payment
Weblog Data
Basic Customer
Data
Historic
Transactions
Customer Product
Data
Aggregated
Customer Data
Fraud?
Advanced Analytics
Platform for Fraud
Central Fraud Engine
Process Payment to
Beneficiary
Yes/Maybe
No
Yes !
Return Payment
to Customer
Update
Fraud Data
© 2017 Teradata
No
16
Model Management
Real Time Scoring
System
Production
Operations
Source
Systems
Data
Managers
Framework to Enhance AnalyticsOps Capabilities
Log
Transactions
Add Historic
Data
Model
Performance
Promote
Models
© 2017 Teradata
Data Scientists
Development & Deployment
Management
Data Science Lab
Data Analysis &
Model Development
Data
Management
Developers &
Release Managers
17
Key Requirement: Model Interpretation
• We have deployed LIME (Locally Interpretable Model Explanation) for
customers
– Improves trust
– Compliance with EU’s General Data Protection Regulation (GDPR)
© 2017 Teradata
17.6% Fraud Probability
Customer Amount
Debit Amount
Avg. # Trans Cred Acc
# Prev to This Dest
# Xfer Accounts
X% score due to:
+ transfer amount
+ destination country
+ last year monthly spend
What features are most important to
this decision?
18
Machine Learning Results
(Live System: 60 transactions/sec.)
Ensemble of boosted decision trees and
logistic regression.
From online validation of the model:
● 25-30% false positive reduction, with
over 35% increase in detection rate
● Opportunity to expand model with
additional features, retrain on recent data
and add additional models to
the ensemble.
● Models can be expanded to additional
channels
Rule Engine on
validation set
© 2017 Teradata
19
Deep Learning
© 2017 Teradata
20
Current models can
only catch ~70% of all
fraud cases
Deep Learning Opportunity
Traditional ML models view
transactions atomically
Often missed fraud
transactions are part of
a series
Capturing correlation
across many features
© 2017 Teradata
21
Three Deep Learning Architectures to Deliver Value
• Designed for spatial
correlated features, but
by transforming
transactions into a
2D image, we can
learn temporal
correlated features.
• Deeper ConvNet allows
learning more complex
& general features.
Goal: Learn kernels from
temporal & static features
to gain insight into the
characteristics of fraud.
• Learn temporal
information and classify
if the sequence of
transactions
contains fraud.
• Shares knowledge
across learning time.
Goal: Learn transaction
patterns within a window.
Two solutions can be
tested: flag fraud or predict
next transaction and define
an error.
• Learn how to generate
normal transactions,
potentially large volumes
of non-fraud data.
• AE provide a low level
representation of the
data.
Goal: Build a model that
learns how to generate
non-fraud data. To detect
fraud, define a
reconstruction error rate for
the fraud cases
Auto-Encoders
LSTM
ConvNet
© 2017 Teradata
22
How Can We Create an Image From Bank Transactions?
t0 X_0, X_1, ... X_n
dt
t1 X_0, X_1, ... X_n
t2 X_0, X_1, ... X_n
ts X_0, X_1, ... X_n
...
Top k Features Correlation
...
X_0
X_41 X_5
X_30
X_29X_31X_10
X_37
X_3
X_1
X_42 X_40
X_32
X_15X_35X_2
X_16
X_31
X_2
X_3 X_15
X_4
X_1X_28X_40
X_31
X_49
X_n
X_26 X_9
X_40
X_35X_28X_2
X_17
X_1
...
X_0
X_41 X_5
X_30
X_29X_31X_10
X_37
X_3
X_1
X_42 X_40
X_32
X_15X_35X_2
X_16
X_31
X_2
X_3 X_15
X_4
X_1X_28X_40
X_31
X_49
X_n
X_26 X_9
X_40
X_35X_28X_2
X_17
X_1
...
...
...
...
...
X_0
X_41 X_5
X_30
X_29X_31X_10
X_37
X_3
X_1
X_42 X_40
X_32
X_15X_35X_2
X_16
X_31
X_2
X_3 X_15
X_4
X_1X_28X_40
X_31
X_49
X_n
X_26 X_9
X_40
X_35X_28X_2
X_17
X_1
N
dt
t0
t1
ts
Input Output
Raw Features
Add correlated features in a
clock-wise manner
© 2017 Teradata
Image size is:
[10 x 3, 50 x 3, 1]
23
Convolutional Layers for Trans2D
Kernels
of 3x3
...X_0 X_1 X_2 X_n
...X_0 X_1 X_2 X_n...
...
...
...
...X_0 X_1 X_2 X_n
Features
Time
Strides of 3
Strides
of 1
First Convolutional Layer Architecture
© 2017 Teradata
24
2D Transaction Image Example
© 2017 Teradata
Non-fraud Transaction Image
Non-fraud
Fraud Transaction Image
X-axis: features, Y-axis: time
Fraud
Non-fraud
25
Network Architecture for CNNs
Fraud
Normal
50
30
25
15
13
8CNN
Fraud
Normal50
30
25
15
25
15
25
15
25
15
© 2017 Teradata
26
Inside the ResNet model
64 Filters
Activations After
the CNN
Residual Blocks
FraudNon-fraud
© 2017 Teradata
27
Deep Learning
First Results
on the fraud verification dataset
Comparison of the three deep learning
models andthe traditional machine
learning ensemble model.
© 2017 Teradata
• Ensemble model (AUC 0.89)
• ConvNets (AUC 0.95)
• LSTM (AUC 0.90)
• ResNet (AUC 0.94)
28
Lessons Learned: Take-Aways From client project
Deep
learning
adoption
from pictures
to financial
transactions
Enhancement of
data quality &
cluster
capabilities with
data ingestion
Building
Analytics
Ops
capabilities
to support
business
units
Leveraging
experience
from Fraud
advanced
analytics to
deliver extra
use cases
© 2017 Teradata
29
Success: From PowerPoint to production in 8 sprints
Team effort: Thorough collaboration across IMD, GFU and Think Big
Synergy: Successfully spearheaded innovation in all involved systems
Inspiration: Bank advanced analytics blueprint sets a generic scene for combatting new
challenges in advanced analytics
Agile influence: Using an agile approach we were able to quickly deliver within the
challenging timeframe.
© 2017 Teradata
Big Data Team – Lessons Learned
3030 © 2017 Teradata
31
Appendix
© 2017 Teradata
32
Advanced Analytics Fraud Platform
• Develop a scalable and expandable platform which follows the Bank’s blueprint of
digitalization
• 100 % data-driven approach to find patterns in the data and complement the
existing fraud engine
• Use Hadoop to handle the large data volumes for training models on transaction
data
• Implement a real-time solution that can score live transactions such as
e-banking, credit card and mobile payments.
• Reduce amount of false-positives by at least 20-40 %
© 2017 Teradata
33
Ambitions for the Advance Analytics project
• The goal is to enhance fraud detection and
reduce false positives across products.
• Fraud is only the first of many possible
advanced analytics use cases.
• The project leads to the creation of the Bank’s
advanced analytics blueprint.
• Bank’s ambition is to become one of the bank
leading in advanced analytics capabilities.
Challenges in Fraud Detection at client
• Low detection rate: Existing human-written
rule engine catches ~40% of fraud cases.
• Many false positives: 99.5% of all cases
investigated are not fraud related.
• High fraud loss: Tens of millions of € total
fraud per month.
• Mobile payments are quickly growing in
number.
• Fraud evolving rapidly, with increased
sophistication: Bank must modernize its anti-
fraud arsenal.
© 2017 Teradata

Contenu connexe

Tendances

Digital Transformation in Retail Banking
Digital Transformation in Retail BankingDigital Transformation in Retail Banking
Digital Transformation in Retail BankingFerran Garcia Pagans
 
Machine Learning and AI in Risk Management
Machine Learning and AI in Risk ManagementMachine Learning and AI in Risk Management
Machine Learning and AI in Risk ManagementQuantUniversity
 
Fighting financial fraud at Danske Bank with artificial intelligence
Fighting financial fraud at Danske Bank with artificial intelligenceFighting financial fraud at Danske Bank with artificial intelligence
Fighting financial fraud at Danske Bank with artificial intelligenceRon Bodkin
 
Success Factors for Digital Transformation in Banking
Success Factors for Digital Transformation in BankingSuccess Factors for Digital Transformation in Banking
Success Factors for Digital Transformation in BankingTata Consultancy Services
 
Fraud Detection presentation
Fraud Detection presentationFraud Detection presentation
Fraud Detection presentationHernan Huwyler
 
BBS-248 Artificial Intelligence (AI) for Financial Services
BBS-248 Artificial Intelligence (AI) for Financial ServicesBBS-248 Artificial Intelligence (AI) for Financial Services
BBS-248 Artificial Intelligence (AI) for Financial ServicesOzgur Karakaya
 
Fintech Overview and Growth Drivers
Fintech Overview and Growth DriversFintech Overview and Growth Drivers
Fintech Overview and Growth DriversRatnakar Pandey
 
FinTech for Financial Inclusion
FinTech for Financial InclusionFinTech for Financial Inclusion
FinTech for Financial InclusionPaperjam_redaction
 
Artificial Intelligence (AI) for Financial Services
Artificial Intelligence (AI) for Financial Services Artificial Intelligence (AI) for Financial Services
Artificial Intelligence (AI) for Financial Services NVIDIA
 
An Entry Point to Impactful Open Banking Architecture
An Entry Point to Impactful Open Banking ArchitectureAn Entry Point to Impactful Open Banking Architecture
An Entry Point to Impactful Open Banking ArchitectureWSO2
 
AI and the Financial Service Segment
AI and the Financial Service SegmentAI and the Financial Service Segment
AI and the Financial Service SegmentGraeme Wood
 
The Path to Open Banking
The Path to Open BankingThe Path to Open Banking
The Path to Open BankingMuleSoft
 
Idiro Analytics - Analytics & Big Data
Idiro Analytics - Analytics & Big DataIdiro Analytics - Analytics & Big Data
Idiro Analytics - Analytics & Big DataIdiro Analytics
 
BaaS-platforms and open APIs in fintech l bank-as-a-service.com
BaaS-platforms and open APIs in fintech l bank-as-a-service.comBaaS-platforms and open APIs in fintech l bank-as-a-service.com
BaaS-platforms and open APIs in fintech l bank-as-a-service.comVladislav Solodkiy
 
Business Intelligence - A Management Perspective
Business Intelligence - A Management PerspectiveBusiness Intelligence - A Management Perspective
Business Intelligence - A Management Perspectivevinaya.hs
 
Intelligent Banking: AI cases in Retail and Commercial Banking
Intelligent Banking: AI cases in Retail and Commercial BankingIntelligent Banking: AI cases in Retail and Commercial Banking
Intelligent Banking: AI cases in Retail and Commercial BankingDmitry Petukhov
 

Tendances (20)

Digital Transformation in Retail Banking
Digital Transformation in Retail BankingDigital Transformation in Retail Banking
Digital Transformation in Retail Banking
 
Machine Learning and AI in Risk Management
Machine Learning and AI in Risk ManagementMachine Learning and AI in Risk Management
Machine Learning and AI in Risk Management
 
Fighting financial fraud at Danske Bank with artificial intelligence
Fighting financial fraud at Danske Bank with artificial intelligenceFighting financial fraud at Danske Bank with artificial intelligence
Fighting financial fraud at Danske Bank with artificial intelligence
 
Success Factors for Digital Transformation in Banking
Success Factors for Digital Transformation in BankingSuccess Factors for Digital Transformation in Banking
Success Factors for Digital Transformation in Banking
 
Trends in AML Compliance
Trends in AML ComplianceTrends in AML Compliance
Trends in AML Compliance
 
Fraud Detection presentation
Fraud Detection presentationFraud Detection presentation
Fraud Detection presentation
 
BBS-248 Artificial Intelligence (AI) for Financial Services
BBS-248 Artificial Intelligence (AI) for Financial ServicesBBS-248 Artificial Intelligence (AI) for Financial Services
BBS-248 Artificial Intelligence (AI) for Financial Services
 
Fintech Overview and Growth Drivers
Fintech Overview and Growth DriversFintech Overview and Growth Drivers
Fintech Overview and Growth Drivers
 
FinTech for Financial Inclusion
FinTech for Financial InclusionFinTech for Financial Inclusion
FinTech for Financial Inclusion
 
Artificial Intelligence (AI) for Financial Services
Artificial Intelligence (AI) for Financial Services Artificial Intelligence (AI) for Financial Services
Artificial Intelligence (AI) for Financial Services
 
An Entry Point to Impactful Open Banking Architecture
An Entry Point to Impactful Open Banking ArchitectureAn Entry Point to Impactful Open Banking Architecture
An Entry Point to Impactful Open Banking Architecture
 
Big Data Application Architectures - Fraud Detection
Big Data Application Architectures - Fraud DetectionBig Data Application Architectures - Fraud Detection
Big Data Application Architectures - Fraud Detection
 
AI and the Financial Service Segment
AI and the Financial Service SegmentAI and the Financial Service Segment
AI and the Financial Service Segment
 
The Path to Open Banking
The Path to Open BankingThe Path to Open Banking
The Path to Open Banking
 
Idiro Analytics - Analytics & Big Data
Idiro Analytics - Analytics & Big DataIdiro Analytics - Analytics & Big Data
Idiro Analytics - Analytics & Big Data
 
BaaS-platforms and open APIs in fintech l bank-as-a-service.com
BaaS-platforms and open APIs in fintech l bank-as-a-service.comBaaS-platforms and open APIs in fintech l bank-as-a-service.com
BaaS-platforms and open APIs in fintech l bank-as-a-service.com
 
Apply (Big) Data Analytics & Predictive Analytics to Business Application
Apply (Big) Data Analytics & Predictive Analytics to Business ApplicationApply (Big) Data Analytics & Predictive Analytics to Business Application
Apply (Big) Data Analytics & Predictive Analytics to Business Application
 
Business Intelligence - A Management Perspective
Business Intelligence - A Management PerspectiveBusiness Intelligence - A Management Perspective
Business Intelligence - A Management Perspective
 
Artificial intelligence in financial sector
Artificial intelligence in financial sectorArtificial intelligence in financial sector
Artificial intelligence in financial sector
 
Intelligent Banking: AI cases in Retail and Commercial Banking
Intelligent Banking: AI cases in Retail and Commercial BankingIntelligent Banking: AI cases in Retail and Commercial Banking
Intelligent Banking: AI cases in Retail and Commercial Banking
 

Similaire à Fighting Financial Crime with Artificial Intelligence

CTO Radshow Hamburg17 - Keynote - The CxO responsibilities in Big Data and AI...
CTO Radshow Hamburg17 - Keynote - The CxO responsibilities in Big Data and AI...CTO Radshow Hamburg17 - Keynote - The CxO responsibilities in Big Data and AI...
CTO Radshow Hamburg17 - Keynote - The CxO responsibilities in Big Data and AI...Santiago Cabrera-Naranjo
 
SAP Forum Ankara 2017 - "Verinin Merkezine Seyahat"
SAP Forum Ankara 2017 - "Verinin Merkezine Seyahat"SAP Forum Ankara 2017 - "Verinin Merkezine Seyahat"
SAP Forum Ankara 2017 - "Verinin Merkezine Seyahat"MDS ap
 
Building for the future of AI and Machine Learning at scale
Building for the future of AI and Machine Learning at scaleBuilding for the future of AI and Machine Learning at scale
Building for the future of AI and Machine Learning at scaleMichael Nealey
 
Data & Analytic Innovations: 5 lessons from our customers
Data & Analytic Innovations: 5 lessons from our customersData & Analytic Innovations: 5 lessons from our customers
Data & Analytic Innovations: 5 lessons from our customersNick Smith
 
TDWI 17 Munich - Are enterprises ready for the 4th industrial revolution? - S...
TDWI 17 Munich - Are enterprises ready for the 4th industrial revolution? - S...TDWI 17 Munich - Are enterprises ready for the 4th industrial revolution? - S...
TDWI 17 Munich - Are enterprises ready for the 4th industrial revolution? - S...Santiago Cabrera-Naranjo
 
Graph+AI for Fin. Services
Graph+AI for Fin. ServicesGraph+AI for Fin. Services
Graph+AI for Fin. ServicesTigerGraph
 
Emerging opportunities in the age of data
Emerging opportunities in the age of dataEmerging opportunities in the age of data
Emerging opportunities in the age of dataEjaz Siddiqui
 
20181129 keynote augmented intelligence and artificial intelligence
20181129 keynote augmented intelligence and artificial intelligence20181129 keynote augmented intelligence and artificial intelligence
20181129 keynote augmented intelligence and artificial intelligenceSantiago Cabrera-Naranjo
 
Webinar - Fighting Bank Fraud with Real-time Graph Database
Webinar - Fighting Bank Fraud with Real-time Graph Database Webinar - Fighting Bank Fraud with Real-time Graph Database
Webinar - Fighting Bank Fraud with Real-time Graph Database DataStax
 
Self-Service Data Science for Leveraging ML & AI on All of Your Data
Self-Service Data Science for Leveraging ML & AI on All of Your DataSelf-Service Data Science for Leveraging ML & AI on All of Your Data
Self-Service Data Science for Leveraging ML & AI on All of Your DataMapR Technologies
 
The Journey to Big Data Analytics
The Journey to Big Data AnalyticsThe Journey to Big Data Analytics
The Journey to Big Data AnalyticsDr.Stefan Radtke
 
Big Data LDN 2017: The New Dominant Companies Are Running on Data
Big Data LDN 2017: The New Dominant Companies Are Running on DataBig Data LDN 2017: The New Dominant Companies Are Running on Data
Big Data LDN 2017: The New Dominant Companies Are Running on DataMatt Stubbs
 
Big Data LDN 2017: The New Dominant Companies Are Running on Data
Big Data LDN 2017: The New Dominant Companies Are Running on DataBig Data LDN 2017: The New Dominant Companies Are Running on Data
Big Data LDN 2017: The New Dominant Companies Are Running on DataMatt Stubbs
 
The new dominant companies are running on data
The new dominant companies are running on data The new dominant companies are running on data
The new dominant companies are running on data SnapLogic
 
Why Everything You Know About bigdata Is A Lie
Why Everything You Know About bigdata Is A LieWhy Everything You Know About bigdata Is A Lie
Why Everything You Know About bigdata Is A LieSunil Ranka
 
The Big Data Revolution: The Next Generation of Finance
The Big Data Revolution: The Next Generation of Finance The Big Data Revolution: The Next Generation of Finance
The Big Data Revolution: The Next Generation of Finance accenture
 
Accountex UK keynote_2016_May 9 2016
Accountex UK keynote_2016_May 9 2016Accountex UK keynote_2016_May 9 2016
Accountex UK keynote_2016_May 9 2016David Patton
 
Data Mining Services in various types
Data Mining Services in various typesData Mining Services in various types
Data Mining Services in various typesloginworks software
 

Similaire à Fighting Financial Crime with Artificial Intelligence (20)

Opportunities derived by AI
Opportunities derived by AIOpportunities derived by AI
Opportunities derived by AI
 
CTO Radshow Hamburg17 - Keynote - The CxO responsibilities in Big Data and AI...
CTO Radshow Hamburg17 - Keynote - The CxO responsibilities in Big Data and AI...CTO Radshow Hamburg17 - Keynote - The CxO responsibilities in Big Data and AI...
CTO Radshow Hamburg17 - Keynote - The CxO responsibilities in Big Data and AI...
 
SAP Forum Ankara 2017 - "Verinin Merkezine Seyahat"
SAP Forum Ankara 2017 - "Verinin Merkezine Seyahat"SAP Forum Ankara 2017 - "Verinin Merkezine Seyahat"
SAP Forum Ankara 2017 - "Verinin Merkezine Seyahat"
 
Building for the future of AI and Machine Learning at scale
Building for the future of AI and Machine Learning at scaleBuilding for the future of AI and Machine Learning at scale
Building for the future of AI and Machine Learning at scale
 
Data & Analytic Innovations: 5 lessons from our customers
Data & Analytic Innovations: 5 lessons from our customersData & Analytic Innovations: 5 lessons from our customers
Data & Analytic Innovations: 5 lessons from our customers
 
TDWI 17 Munich - Are enterprises ready for the 4th industrial revolution? - S...
TDWI 17 Munich - Are enterprises ready for the 4th industrial revolution? - S...TDWI 17 Munich - Are enterprises ready for the 4th industrial revolution? - S...
TDWI 17 Munich - Are enterprises ready for the 4th industrial revolution? - S...
 
Graph+AI for Fin. Services
Graph+AI for Fin. ServicesGraph+AI for Fin. Services
Graph+AI for Fin. Services
 
Emerging opportunities in the age of data
Emerging opportunities in the age of dataEmerging opportunities in the age of data
Emerging opportunities in the age of data
 
20181129 keynote augmented intelligence and artificial intelligence
20181129 keynote augmented intelligence and artificial intelligence20181129 keynote augmented intelligence and artificial intelligence
20181129 keynote augmented intelligence and artificial intelligence
 
Webinar - Fighting Bank Fraud with Real-time Graph Database
Webinar - Fighting Bank Fraud with Real-time Graph Database Webinar - Fighting Bank Fraud with Real-time Graph Database
Webinar - Fighting Bank Fraud with Real-time Graph Database
 
Self-Service Data Science for Leveraging ML & AI on All of Your Data
Self-Service Data Science for Leveraging ML & AI on All of Your DataSelf-Service Data Science for Leveraging ML & AI on All of Your Data
Self-Service Data Science for Leveraging ML & AI on All of Your Data
 
The Journey to Big Data Analytics
The Journey to Big Data AnalyticsThe Journey to Big Data Analytics
The Journey to Big Data Analytics
 
Taming Big Data With Modern Software Architecture
Taming Big Data  With Modern Software ArchitectureTaming Big Data  With Modern Software Architecture
Taming Big Data With Modern Software Architecture
 
Big Data LDN 2017: The New Dominant Companies Are Running on Data
Big Data LDN 2017: The New Dominant Companies Are Running on DataBig Data LDN 2017: The New Dominant Companies Are Running on Data
Big Data LDN 2017: The New Dominant Companies Are Running on Data
 
Big Data LDN 2017: The New Dominant Companies Are Running on Data
Big Data LDN 2017: The New Dominant Companies Are Running on DataBig Data LDN 2017: The New Dominant Companies Are Running on Data
Big Data LDN 2017: The New Dominant Companies Are Running on Data
 
The new dominant companies are running on data
The new dominant companies are running on data The new dominant companies are running on data
The new dominant companies are running on data
 
Why Everything You Know About bigdata Is A Lie
Why Everything You Know About bigdata Is A LieWhy Everything You Know About bigdata Is A Lie
Why Everything You Know About bigdata Is A Lie
 
The Big Data Revolution: The Next Generation of Finance
The Big Data Revolution: The Next Generation of Finance The Big Data Revolution: The Next Generation of Finance
The Big Data Revolution: The Next Generation of Finance
 
Accountex UK keynote_2016_May 9 2016
Accountex UK keynote_2016_May 9 2016Accountex UK keynote_2016_May 9 2016
Accountex UK keynote_2016_May 9 2016
 
Data Mining Services in various types
Data Mining Services in various typesData Mining Services in various types
Data Mining Services in various types
 

Plus de DataWorks Summit

Floating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache RatisFloating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache RatisDataWorks Summit
 
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFiTracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFiDataWorks Summit
 
HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...DataWorks Summit
 
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...DataWorks Summit
 
Managing the Dewey Decimal System
Managing the Dewey Decimal SystemManaging the Dewey Decimal System
Managing the Dewey Decimal SystemDataWorks Summit
 
Practical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist ExamplePractical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist ExampleDataWorks Summit
 
HBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at UberHBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at UberDataWorks Summit
 
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and PhoenixScaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and PhoenixDataWorks Summit
 
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFiBuilding the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFiDataWorks Summit
 
Supporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability ImprovementsSupporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability ImprovementsDataWorks Summit
 
Security Framework for Multitenant Architecture
Security Framework for Multitenant ArchitectureSecurity Framework for Multitenant Architecture
Security Framework for Multitenant ArchitectureDataWorks Summit
 
Presto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything EnginePresto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything EngineDataWorks Summit
 
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...DataWorks Summit
 
Extending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google CloudExtending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google CloudDataWorks Summit
 
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFiEvent-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFiDataWorks Summit
 
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache RangerSecuring Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache RangerDataWorks Summit
 
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...DataWorks Summit
 
Computer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near YouComputer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near YouDataWorks Summit
 
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache SparkBig Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache SparkDataWorks Summit
 

Plus de DataWorks Summit (20)

Data Science Crash Course
Data Science Crash CourseData Science Crash Course
Data Science Crash Course
 
Floating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache RatisFloating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache Ratis
 
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFiTracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
 
HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...
 
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
 
Managing the Dewey Decimal System
Managing the Dewey Decimal SystemManaging the Dewey Decimal System
Managing the Dewey Decimal System
 
Practical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist ExamplePractical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist Example
 
HBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at UberHBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at Uber
 
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and PhoenixScaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
 
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFiBuilding the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
 
Supporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability ImprovementsSupporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability Improvements
 
Security Framework for Multitenant Architecture
Security Framework for Multitenant ArchitectureSecurity Framework for Multitenant Architecture
Security Framework for Multitenant Architecture
 
Presto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything EnginePresto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything Engine
 
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
 
Extending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google CloudExtending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google Cloud
 
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFiEvent-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
 
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache RangerSecuring Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
 
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
 
Computer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near YouComputer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near You
 
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache SparkBig Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
 

Dernier

Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Victor Rentea
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...apidays
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024The Digital Insurer
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Angeliki Cooney
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...apidays
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesrafiqahmad00786416
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Orbitshub
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontologyjohnbeverley2021
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Bhuvaneswari Subramani
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamUiPathCommunity
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Zilliz
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...apidays
 

Dernier (20)

Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 

Fighting Financial Crime with Artificial Intelligence

  • 1. Fighting Financial Crime with Artificial Intelligence Tim Seears Area Practice Director, Data Science – Asia Pacific & Korea
  • 2. 2 So far we have delivered 150+ successful projects for 100+ clients worldwide. 500+ employees (2017) Vendor-neutral with an open source focus. Full spectrum consulting, data engineering, data science & support. Apache Hadoop and cloud ecosystem integration. Founded in 2010 industry thought leader. Fixed fee offerings for data science and engineering. Who Is Think Big? 1st Big Data provider 100% focused around open source.
  • 3. 3 Help customers be financially confident and achieve their ambitions by making daily banking and important financial decisions easy. 19,000 employees (2017) 1,800 corporate and institutional customers Covering Personal Banking, Business Banking, Corporates & Institutions and Wealth Management. 2.7 million personal customers 236,000 small and medium-sized business customers Making banking easier for over 145 years. Who is the customer? Leading Nordic universal bank with strong local roots and bridges to the rest of the world
  • 4. 4 PayPal WalmartJohn Deere Lowe’s Wells FargoJP Morgan This is Happening in the Enterprise today
  • 5. 5 “ ’’ Over the next decade, AI won’t replace managers, but managers who use AI will replace those who don’t. HBR July 2017
  • 6. 7 Data Driven Approach to Fight Fraud Fast evolving fraud sophistication, AI Ambitions for Fraud Project Become client’s advanced analytics blueprint Data driven approach to real time scoring of transactions Reduce false-positives & Enhance fraud detection rate ONLY ~40% of fraud cases are detected Low Detection Rate 99.5% of cases are not fraud related Many false positives Challenges for Fraud Detection Tens of Millions € lost each month High Fraud Loss © 2017 Teradata
  • 7. 8 Fraud Types – Customer Initiated Nigeria/ Investment Scam CEO Fraud Customer Initiated © 2017 Teradata Beneficiary Account Change Fake Invoice Rental Scam/ Goods Not Received
  • 8. 9 ID Theft SPEAR Phishing Vishing/Support Scam Malware Phishing/Smishing Fraud Types – Fraudster Initiated Fraudster Initiated © 2017 Teradata
  • 9. 10 Modeling Challenges © 2017 Teradata • Class imbalance (100,000:1 non-fraud vs. fraud) • Assigning fraud labels from historic data • Fraud is ambiguous • Not all features available in real-time • Most machine learning sees transactions atomically
  • 10. 11 Advanced Platform for Fraud: Data-Driven Approach How It Works: • Understanding the domain • Gathering and preparing the data • Automatically generating the rules and recognizing fraudulent patterns by training models on historical data • Automatically maintaining the engine by retraining the model Pros: • Automatic/data-driven/objective inference of the rules • Ability to detect patterns in a high dimensional data input • Fast detection of new/changing fraudulent patterns Cons: • Might be unintuitive and hardly interpretable • Data preparation and feature aggregation is time consuming © 2017 Teradata
  • 12. 13 Sep Oct Nov Dec Jan Feb Mar Apr May Jun Jul Project kick-off 30th September Teamwork to Deliver Value in Each Project Iteration 2016 2017 Kick-off: Data Scientist track Go-Live plan set for the project 19th December Kick-off: Engineering track Models successfully in Shadow Production 4th March First round trip test transaction First production virtual machine Production Hardening for HA and Security Full productionisation of the Fraud Engine Cross- functional Collaboration Cont. Deep Learning modeling © 2017 Teradata
  • 13. 14 > Banking Anti-Fraud Solution © 2017 Teradata Banking Anti-Fraud Solution By leveraging the power of a thoughtful and strong data and analytics strategy, we unleash high impact business outcomes. • Multiple models running in production at the same time • Mix of traditional and advance deep learning methods • AnalyticsOps: Deploying machine learning models in production Model Management Framework • Organisation and silos of data • Real-time data integration • Security and Procedures: following existing bank procedures Data Modelling, Pipeline and Ingestion • Hard to operationalize insights • Availability of analytic capabilities/ skills and data • Interpreting the results of machine learning models Machine Learning and Artificial Intelligence
  • 14. 15 Advanced Platform for Fraud: Data-Driven Approach Manual Evaluation eBanking Mobile Pay Business Online Global Payment Interface Fraud? Create Verification Fraud Payment Weblog Data Basic Customer Data Historic Transactions Customer Product Data Aggregated Customer Data Fraud? Advanced Analytics Platform for Fraud Central Fraud Engine Process Payment to Beneficiary Yes/Maybe No Yes ! Return Payment to Customer Update Fraud Data © 2017 Teradata No
  • 15. 16 Model Management Real Time Scoring System Production Operations Source Systems Data Managers Framework to Enhance AnalyticsOps Capabilities Log Transactions Add Historic Data Model Performance Promote Models © 2017 Teradata Data Scientists Development & Deployment Management Data Science Lab Data Analysis & Model Development Data Management Developers & Release Managers
  • 16. 17 Key Requirement: Model Interpretation • We have deployed LIME (Locally Interpretable Model Explanation) for customers – Improves trust – Compliance with EU’s General Data Protection Regulation (GDPR) © 2017 Teradata 17.6% Fraud Probability Customer Amount Debit Amount Avg. # Trans Cred Acc # Prev to This Dest # Xfer Accounts X% score due to: + transfer amount + destination country + last year monthly spend What features are most important to this decision?
  • 17. 18 Machine Learning Results (Live System: 60 transactions/sec.) Ensemble of boosted decision trees and logistic regression. From online validation of the model: ● 25-30% false positive reduction, with over 35% increase in detection rate ● Opportunity to expand model with additional features, retrain on recent data and add additional models to the ensemble. ● Models can be expanded to additional channels Rule Engine on validation set © 2017 Teradata
  • 19. 20 Current models can only catch ~70% of all fraud cases Deep Learning Opportunity Traditional ML models view transactions atomically Often missed fraud transactions are part of a series Capturing correlation across many features © 2017 Teradata
  • 20. 21 Three Deep Learning Architectures to Deliver Value • Designed for spatial correlated features, but by transforming transactions into a 2D image, we can learn temporal correlated features. • Deeper ConvNet allows learning more complex & general features. Goal: Learn kernels from temporal & static features to gain insight into the characteristics of fraud. • Learn temporal information and classify if the sequence of transactions contains fraud. • Shares knowledge across learning time. Goal: Learn transaction patterns within a window. Two solutions can be tested: flag fraud or predict next transaction and define an error. • Learn how to generate normal transactions, potentially large volumes of non-fraud data. • AE provide a low level representation of the data. Goal: Build a model that learns how to generate non-fraud data. To detect fraud, define a reconstruction error rate for the fraud cases Auto-Encoders LSTM ConvNet © 2017 Teradata
  • 21. 22 How Can We Create an Image From Bank Transactions? t0 X_0, X_1, ... X_n dt t1 X_0, X_1, ... X_n t2 X_0, X_1, ... X_n ts X_0, X_1, ... X_n ... Top k Features Correlation ... X_0 X_41 X_5 X_30 X_29X_31X_10 X_37 X_3 X_1 X_42 X_40 X_32 X_15X_35X_2 X_16 X_31 X_2 X_3 X_15 X_4 X_1X_28X_40 X_31 X_49 X_n X_26 X_9 X_40 X_35X_28X_2 X_17 X_1 ... X_0 X_41 X_5 X_30 X_29X_31X_10 X_37 X_3 X_1 X_42 X_40 X_32 X_15X_35X_2 X_16 X_31 X_2 X_3 X_15 X_4 X_1X_28X_40 X_31 X_49 X_n X_26 X_9 X_40 X_35X_28X_2 X_17 X_1 ... ... ... ... ... X_0 X_41 X_5 X_30 X_29X_31X_10 X_37 X_3 X_1 X_42 X_40 X_32 X_15X_35X_2 X_16 X_31 X_2 X_3 X_15 X_4 X_1X_28X_40 X_31 X_49 X_n X_26 X_9 X_40 X_35X_28X_2 X_17 X_1 N dt t0 t1 ts Input Output Raw Features Add correlated features in a clock-wise manner © 2017 Teradata Image size is: [10 x 3, 50 x 3, 1]
  • 22. 23 Convolutional Layers for Trans2D Kernels of 3x3 ...X_0 X_1 X_2 X_n ...X_0 X_1 X_2 X_n... ... ... ... ...X_0 X_1 X_2 X_n Features Time Strides of 3 Strides of 1 First Convolutional Layer Architecture © 2017 Teradata
  • 23. 24 2D Transaction Image Example © 2017 Teradata Non-fraud Transaction Image Non-fraud Fraud Transaction Image X-axis: features, Y-axis: time Fraud Non-fraud
  • 24. 25 Network Architecture for CNNs Fraud Normal 50 30 25 15 13 8CNN Fraud Normal50 30 25 15 25 15 25 15 25 15 © 2017 Teradata
  • 25. 26 Inside the ResNet model 64 Filters Activations After the CNN Residual Blocks FraudNon-fraud © 2017 Teradata
  • 26. 27 Deep Learning First Results on the fraud verification dataset Comparison of the three deep learning models andthe traditional machine learning ensemble model. © 2017 Teradata • Ensemble model (AUC 0.89) • ConvNets (AUC 0.95) • LSTM (AUC 0.90) • ResNet (AUC 0.94)
  • 27. 28 Lessons Learned: Take-Aways From client project Deep learning adoption from pictures to financial transactions Enhancement of data quality & cluster capabilities with data ingestion Building Analytics Ops capabilities to support business units Leveraging experience from Fraud advanced analytics to deliver extra use cases © 2017 Teradata
  • 28. 29 Success: From PowerPoint to production in 8 sprints Team effort: Thorough collaboration across IMD, GFU and Think Big Synergy: Successfully spearheaded innovation in all involved systems Inspiration: Bank advanced analytics blueprint sets a generic scene for combatting new challenges in advanced analytics Agile influence: Using an agile approach we were able to quickly deliver within the challenging timeframe. © 2017 Teradata Big Data Team – Lessons Learned
  • 29. 3030 © 2017 Teradata
  • 31. 32 Advanced Analytics Fraud Platform • Develop a scalable and expandable platform which follows the Bank’s blueprint of digitalization • 100 % data-driven approach to find patterns in the data and complement the existing fraud engine • Use Hadoop to handle the large data volumes for training models on transaction data • Implement a real-time solution that can score live transactions such as e-banking, credit card and mobile payments. • Reduce amount of false-positives by at least 20-40 % © 2017 Teradata
  • 32. 33 Ambitions for the Advance Analytics project • The goal is to enhance fraud detection and reduce false positives across products. • Fraud is only the first of many possible advanced analytics use cases. • The project leads to the creation of the Bank’s advanced analytics blueprint. • Bank’s ambition is to become one of the bank leading in advanced analytics capabilities. Challenges in Fraud Detection at client • Low detection rate: Existing human-written rule engine catches ~40% of fraud cases. • Many false positives: 99.5% of all cases investigated are not fraud related. • High fraud loss: Tens of millions of € total fraud per month. • Mobile payments are quickly growing in number. • Fraud evolving rapidly, with increased sophistication: Bank must modernize its anti- fraud arsenal. © 2017 Teradata