3. What we’re up against
3
50+
Schemes
(and counting)
99.9999%‘Good’ messages
6+Months
per case
Needle in a haystack
Hybrid analytics
No training data
Semi-supervised learning
Adversarial learning
Online feedback
9. Sample natural language annotators
Understand vocabulary
– Jargon
– Code words
– Multi-lingual
Understand grammar
– Who are we talking about?
– Past, present or future?
– Compound sentences
Understand context
– Email: Re:, Fwd:, attachments
– SMS & IM have their own grammar
11. User analysis iteration
Email NLP
Features
User graph
Transactions
time series
Graph Features
Time Series
Features
NLP Features
Agent Feedback
Train/TestClassifier
12. Really
• Makes the world a better place • Needle in a very large haystack
– Actually needs a petabyte-scale platform
• Multi-modal: no single trick works
– Hybrid analytics
• No labeled data
– Semi-supervised learning
– Cold start problem
• Sparse & high-dimensional
– Graph based features & change over time
• Adversarial
– Feedback & online learning
Technically
Summary: why hunting criminals is cool
12
12
13. THANK YOU!
Get the notebooks: github.com/Atigeo/Atigeo/hunting_criminals_demo
Try it yourself: “xPatterns Connect” on AWS Marketplace
Ask us about it: @davidtalby , @melcutz