SlideShare une entreprise Scribd logo
1  sur  25
Action Items for the
Financial Industry:
Addressing fraud and privacy
issues in Hadoop
May 1, 2014
Our Speakers
Jeremy Stieglitz
VP Product Management
Dataguise
Dale Kim
Director, Industry Solutions
MapR Technologies
5 Action Items for the Financial Services Industry – April 2014 2
Executive Summary
 Big Data has become priority #1 for
finance and large enterprise in 2014:
– Faster time to insights
– Increasingly touching $$$
– Real-time, Automation and On Demand
 Key challenges on Big Data
– Leveraging sensitive assets
– Automating protection techniques
– Maintaining high performance
– Ensuring scalability
5 Action Items for the Financial Services Industry – April 2014 3
Agenda
 Market overview: Hadoop in Financial Services
 Solution priorities
 5 action items
– Strategize – develop concrete goals
– Automate, automate, automate
– Foresee high rates of innovation
– Expand your community
– Explore the technology landscape
5 Action Items for the Financial Services Industry – April 2014 4
5
Market
Overview
Hadoop in Financial Services
5 Action Items for the Financial Services Industry – April 2014 6
Hadoop addresses:
• Scale
• Infrastructure costs
• Anomaly detection
• Data archiving for
compliance
• Aggregated risk
• Data protection
• Etc.
Risk
Market Uncertainty
Regulations
Costs
Fraud
Security Attacks
Big Data
Business Challenge: Data Growth
 100% growth and 80% unstructured data by 2015
…finding and classifying sensitive data will get
harder
7
Exabytes
5 Action Items for the Financial Services Industry – April 2014
Compliance Universe
5 Action Items for the Financial Services Industry – April 2014
• Sarbanes-Oxley
• Frank-Dodd
• PCI-DSS
• HIPAA
• State Data Breach Laws (CA SB 1386)
• Data Privacy (International)
• EU Data Protection Directive
• Singapore Personal Data Protection Act
• Canadian Personal Information Protection
and Electronic Documents Act (PIPEDA)
• Germany’s Federal Data Protection Act
(BDSG)
• Great Britain Data Protection Act
• Data Privacy (USA)
• Ohio 1347.15
• California SB 1386
• Massachusetts Data Privacy Law
• Financial
• Basel III
• Gramm-Leach-Bliley Act (GLBA)
• J-SOX
• Technology Risk Management
Guidelines
• Health/Pharmaceutical
• NAIC Model Audit Rule (MAR)
• 21 CFR Part 11 (FDA)
• Energy
• North American Electric Reliability
Corp (NERC)
• Federal Energy Regulatory
Commission Regulations (FERC)
• Service Providers
• Statement on Auditing Standards
(SAS 70)
• Education
• Family Educational Rights and Privacy
Act (FERPA)
• Federal Information Security Management
Act (FISMA)
63 countries, 1200+ laws
5 Action Items for the Financial Services Industry – April 2014 9
9
Solution
Priorities
Solution Priorities
Performance
Compute Memory
Network I/ODisk
Software
If you skimp on
one, you impact
performance
5 Action Items for the Financial Services Industry – April 2014
Solution Priorities
Scale
Plan for growth,
beyond your simple
projections:
• Longer time
windows of data
• New data sources
• New use cases
5 Action Items for the Financial Services Industry – April 2014
Solution Priorities
Reliability Plan ahead with all
the right people --
devops, sys admins,
application
developers, etc.
5 Action Items for the Financial Services Industry – April 2014
Hadoop Security Framework
Perimeter
Guarding access to the
cluster itself
Technical Concepts:
Authentication
Network isolation
Data
Protecting data in the
cluster from
unauthorized visibility
Technical Concepts:
Encryption, Tokenization,
Data masking
 The 4 approaches to address security within Hadoop (Perimeter,
Data, Access, Visibility)
 MapR provides true multi-tenant Hadoop, along with authentication/
authorization controls
 Dataguise discovers & protects at the data layer and provides visibility
for audit reporting and data lineage
Perimeter
Guarding access to the
cluster itself
Technical Concepts:
Authentication
Network isolation
Access
Defining what users
and applications can do
with data
Technical Concepts:
Permissions
Authorization
Visibility
Reporting on where
data came from and
how it’s being used
Technical Concepts:
Auditing
Lineage
5 Action Items for the Financial Services Industry – April 2014
5 Actions You Should Take
5 Action Items for the Financial Services Industry – April 2014 14
#1: Strategize – Develop Concrete Goals
 Are you prepared for production?
– Expect stringent SLAs
– Plan for performance, scale, reliability,
and data security
 Additional use cases?
– Do not limit your possibilities
– Hadoop deployments typically handle
multiple business problems
 The ‘Aha’ vs. ‘Gotcha’ moment
5 Action Items for the Financial Services Industry – April 2014 15
#2: Automate, Automate, Automate
– Keep up with data growth
– Simplify critical tasks
– Reduce the risk of error with manual effort
Your Data will grow 6000% in six years. Your headcount will grow 1.5x
5 Action Items for the Financial Services Industry – April 2014
Payment Risk Management
at Major Credit Card Brand
 Multiple business apps: fraud
detection, risk analytics, cross sell
 High degree of automation
 “Silver”, “Gold” and “Platinum”
Hadoop domains with increasing
lockdown of customer data
17
“Our analytics business
draws on the
purchasing data of its
90 million credit card
holders across 127
countries.”
5 Action Items for the Financial Services Industry – April 2014
Hadoop
Payment Risk Management
at Major Credit Card Brand
Customer uses SFTP and
NFS to load data into
Hadoop. All data loaded
in the clear.
Dataguise masking runs as
MapReduce JAR (automatically, with
no programming required).
HDFS “Gold” Cluster
Any incremental updates to HDFS are
automatically protected. Credit card
firm uses access control to determine
access to private data
Dataguise masking
guarantees consistency
between sensitive
elements, ensuring credit
card group can run same
statistical distribution and
analytics without
exposure risk
u v
w
Omniture FilesCredit Card
Transactions (txt)
NFS copy
SFTP
HDFS Cluster “Silver”
x
Source Data
5 Action Items for the Financial Services Industry – April 2014
#3: Foresee High Rates of Innovation
 Hadoop benefits from many new innovations:
{Falcon, Kite, Storm, Mesos, Drill, Impala, Spark,
Shark, etc.}
 Design non-blocking technical choices
(especially in areas of high velocity).
– Many files formats you can use (txt, Snappy, Avro,
SequenceFile, RC, ORC, etc.)
– Make sure your security assumptions aren’t tied to
specific formats
5 Action Items for the Financial Services Industry – April 2014 19
#4: Expand Your Community
 Security is a “common good,” share ideas with:
– Different divisions at your firm
– Professionals in your network
– Competitors
 Who else are Hadoop/security experts?
– E-commerce
– Telecommunications
– Web 2.0
– Government
5 Action Items for the Financial Services Industry – April 2014
 Security analytics and fraud
detection
 Build statistical models to detect
fraud, and mine data to evaluate
suspicious activities
 Huge volumes of collected data to
identify fraud patterns required
massive scalability
Fraud Detection and Security Analytics
at Zions Bank
“We initially got into
centralizing all of our data
from an information
security perspective. We
then saw that we could
use this same environment
to help with fraud
detection.”
5 Action Items for the Financial Services Industry – April 2014
#5: Explore the Technology Landscape
• Deciding on the right technology is hard
• Undoing a bad decision is harder
• Talk to vendors, analysts, community
• Attend meet-ups, conferences
• Validate advice you get with proof points
5 Action Items for the Financial Services Industry – April 2014
MapR Distribution for Hadoop
BIG
DATA
BEST PRODUCT BUSINESS
IMPACT
Hadoop
Top Ranked
Production
Success
5 Action Items for the Financial Services Industry – April 2014
Dataguise: Market Leader in
Big Data Protective Intelligence (BDPI)
Only solution with Hadoop data
discovery
Best in class– data protection with
simplicity, scalability, and
automation
Business friendly to operators and
business analysts
24
5 Action Items for the Financial Services Industry – April 2014
Thank You
Jeremy Stieglitz
VP Product Management
jeremy@dataguise.com
Dale Kim
Director, Industry Solutions
dalekim@mapr.com
5 Action Items for the Financial Services Industry – April 2014 25

Contenu connexe

Tendances

Real-time Data is Changing the Face of the Insurance Industry
Real-time Data is Changing the Face of the Insurance IndustryReal-time Data is Changing the Face of the Insurance Industry
Real-time Data is Changing the Face of the Insurance Industry
DataWorks Summit
 
Data Governance Overview - Doreen Christian
Data Governance Overview - Doreen ChristianData Governance Overview - Doreen Christian
Data Governance Overview - Doreen Christian
Doreen Christian
 
Big data and social media, BAE Systems Detica
Big data and social media, BAE Systems DeticaBig data and social media, BAE Systems Detica
Big data and social media, BAE Systems Detica
Internet World
 

Tendances (20)

Mapping Business Processes to Compliance Procedures
Mapping Business Processes to Compliance ProceduresMapping Business Processes to Compliance Procedures
Mapping Business Processes to Compliance Procedures
 
Managing Multiple Compliance Priorities - GDPR, CCPA, HIPAA, APEC, ISO 27001,...
Managing Multiple Compliance Priorities - GDPR, CCPA, HIPAA, APEC, ISO 27001,...Managing Multiple Compliance Priorities - GDPR, CCPA, HIPAA, APEC, ISO 27001,...
Managing Multiple Compliance Priorities - GDPR, CCPA, HIPAA, APEC, ISO 27001,...
 
Threat Ready Data: Protect Data from the Inside and the Outside
Threat Ready Data: Protect Data from the Inside and the OutsideThreat Ready Data: Protect Data from the Inside and the Outside
Threat Ready Data: Protect Data from the Inside and the Outside
 
Succeeding with Analytics: Mastering People, Process, and Technology
Succeeding with Analytics: Mastering People, Process, and TechnologySucceeding with Analytics: Mastering People, Process, and Technology
Succeeding with Analytics: Mastering People, Process, and Technology
 
NoSQL? How about "NoDBMS"?
NoSQL? How about "NoDBMS"?NoSQL? How about "NoDBMS"?
NoSQL? How about "NoDBMS"?
 
Unlocking Greater Insights with Integrated Data Quality for Collibra
Unlocking Greater Insights with Integrated Data Quality for CollibraUnlocking Greater Insights with Integrated Data Quality for Collibra
Unlocking Greater Insights with Integrated Data Quality for Collibra
 
Real-time Data is Changing the Face of the Insurance Industry
Real-time Data is Changing the Face of the Insurance IndustryReal-time Data is Changing the Face of the Insurance Industry
Real-time Data is Changing the Face of the Insurance Industry
 
RFT for Business Intelligence and Data Strategy
RFT for Business Intelligence and Data StrategyRFT for Business Intelligence and Data Strategy
RFT for Business Intelligence and Data Strategy
 
Are Your Data Ready for GDPR? (with MAPR and Talend)
Are Your Data Ready for GDPR? (with MAPR and Talend)Are Your Data Ready for GDPR? (with MAPR and Talend)
Are Your Data Ready for GDPR? (with MAPR and Talend)
 
Delivering Analytics at Scale with a Governed Data Lake
Delivering Analytics at Scale with a Governed Data LakeDelivering Analytics at Scale with a Governed Data Lake
Delivering Analytics at Scale with a Governed Data Lake
 
Big datacamp june14_alex_liu
Big datacamp june14_alex_liuBig datacamp june14_alex_liu
Big datacamp june14_alex_liu
 
ADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
ADV Slides: When and How Data Lakes Fit into a Modern Data ArchitectureADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
ADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
 
Your Worst GDPR Nightmare - Unstructured Data
Your Worst GDPR Nightmare - Unstructured DataYour Worst GDPR Nightmare - Unstructured Data
Your Worst GDPR Nightmare - Unstructured Data
 
Practical steps to GDPR compliance
Practical steps to GDPR compliance Practical steps to GDPR compliance
Practical steps to GDPR compliance
 
Data Governance Overview - Doreen Christian
Data Governance Overview - Doreen ChristianData Governance Overview - Doreen Christian
Data Governance Overview - Doreen Christian
 
Big data and social media, BAE Systems Detica
Big data and social media, BAE Systems DeticaBig data and social media, BAE Systems Detica
Big data and social media, BAE Systems Detica
 
The Importance of DataOps in a Multi-Cloud World
The Importance of DataOps in a Multi-Cloud WorldThe Importance of DataOps in a Multi-Cloud World
The Importance of DataOps in a Multi-Cloud World
 
Unlock Data-driven Insights in Databricks Using Location Intelligence
Unlock Data-driven Insights in Databricks Using Location IntelligenceUnlock Data-driven Insights in Databricks Using Location Intelligence
Unlock Data-driven Insights in Databricks Using Location Intelligence
 
01 big dataoverview
01 big dataoverview01 big dataoverview
01 big dataoverview
 
In the Absence of Fact - Stephen Harris
In the Absence of Fact - Stephen HarrisIn the Absence of Fact - Stephen Harris
In the Absence of Fact - Stephen Harris
 

En vedette

Riak intro to..
Riak intro to..Riak intro to..
Riak intro to..
Adron Hall
 

En vedette (14)

Security Analytics and Big Data: What You Need to Know
Security Analytics and Big Data: What You Need to KnowSecurity Analytics and Big Data: What You Need to Know
Security Analytics and Big Data: What You Need to Know
 
Riak intro to..
Riak intro to..Riak intro to..
Riak intro to..
 
NETWORK SECURITY MONITORING WITH BIG DATA ANALYTICS - Nguyễn Minh Đức
NETWORK SECURITY  MONITORING WITH BIG  DATA ANALYTICS - Nguyễn Minh ĐứcNETWORK SECURITY  MONITORING WITH BIG  DATA ANALYTICS - Nguyễn Minh Đức
NETWORK SECURITY MONITORING WITH BIG DATA ANALYTICS - Nguyễn Minh Đức
 
Handling the Extremes: Scaling and Streaming in Finance
Handling the Extremes: Scaling and Streaming in FinanceHandling the Extremes: Scaling and Streaming in Finance
Handling the Extremes: Scaling and Streaming in Finance
 
Network Security‬ and Big ‪‎Data Analytics‬
Network Security‬ and Big ‪‎Data Analytics‬Network Security‬ and Big ‪‎Data Analytics‬
Network Security‬ and Big ‪‎Data Analytics‬
 
Information Security It's All About Compliance
Information Security   It's All About ComplianceInformation Security   It's All About Compliance
Information Security It's All About Compliance
 
Anomaly detection in deep learning (Updated) English
Anomaly detection in deep learning (Updated) EnglishAnomaly detection in deep learning (Updated) English
Anomaly detection in deep learning (Updated) English
 
Big Data and Security - Where are we now? (2015)
Big Data and Security - Where are we now? (2015)Big Data and Security - Where are we now? (2015)
Big Data and Security - Where are we now? (2015)
 
Big Data Hadoop Briefing Hosted by Cisco, WWT and MapR: MapR Overview Present...
Big Data Hadoop Briefing Hosted by Cisco, WWT and MapR: MapR Overview Present...Big Data Hadoop Briefing Hosted by Cisco, WWT and MapR: MapR Overview Present...
Big Data Hadoop Briefing Hosted by Cisco, WWT and MapR: MapR Overview Present...
 
Big data security
Big data securityBig data security
Big data security
 
Big Data Visualization
Big Data VisualizationBig Data Visualization
Big Data Visualization
 
Basics of Motor Insurance ppt.
Basics of Motor Insurance ppt.Basics of Motor Insurance ppt.
Basics of Motor Insurance ppt.
 
Big data ppt
Big data pptBig data ppt
Big data ppt
 
What is Big Data?
What is Big Data?What is Big Data?
What is Big Data?
 

Similaire à Dataguise & MapR: Action Items for the Financial Industry

Understanding Big Data so you can act with confidence
Understanding Big Data so you can act with confidenceUnderstanding Big Data so you can act with confidence
Understanding Big Data so you can act with confidence
IBM Software India
 
Presentation1 (1).pptx
Presentation1 (1).pptxPresentation1 (1).pptx
Presentation1 (1).pptx
Dat Trinh
 
Data-Ed Online Presents: Data Warehouse Strategies
Data-Ed Online Presents: Data Warehouse StrategiesData-Ed Online Presents: Data Warehouse Strategies
Data-Ed Online Presents: Data Warehouse Strategies
DATAVERSITY
 
Big Data LDN 2018: DATA MANAGEMENT AUTOMATION AND THE INFORMATION SUPPLY CHAI...
Big Data LDN 2018: DATA MANAGEMENT AUTOMATION AND THE INFORMATION SUPPLY CHAI...Big Data LDN 2018: DATA MANAGEMENT AUTOMATION AND THE INFORMATION SUPPLY CHAI...
Big Data LDN 2018: DATA MANAGEMENT AUTOMATION AND THE INFORMATION SUPPLY CHAI...
Matt Stubbs
 

Similaire à Dataguise & MapR: Action Items for the Financial Industry (20)

data analytics lecture2.pptx
data analytics lecture2.pptxdata analytics lecture2.pptx
data analytics lecture2.pptx
 
Understanding Big Data so you can act with confidence
Understanding Big Data so you can act with confidenceUnderstanding Big Data so you can act with confidence
Understanding Big Data so you can act with confidence
 
Presentation1 (1).pptx
Presentation1 (1).pptxPresentation1 (1).pptx
Presentation1 (1).pptx
 
Big data analytics presented at meetup big data for decision makers
Big data analytics presented at meetup big data for decision makersBig data analytics presented at meetup big data for decision makers
Big data analytics presented at meetup big data for decision makers
 
Operationalizing Data Analytics
Operationalizing Data AnalyticsOperationalizing Data Analytics
Operationalizing Data Analytics
 
Data-Ed: Data Warehousing Strategies
Data-Ed: Data Warehousing StrategiesData-Ed: Data Warehousing Strategies
Data-Ed: Data Warehousing Strategies
 
Data-Ed Online Presents: Data Warehouse Strategies
Data-Ed Online Presents: Data Warehouse StrategiesData-Ed Online Presents: Data Warehouse Strategies
Data-Ed Online Presents: Data Warehouse Strategies
 
Understanding New Technology and Security Risks as you respond to COVID-19
Understanding New Technology and Security Risks as you respond to COVID-19Understanding New Technology and Security Risks as you respond to COVID-19
Understanding New Technology and Security Risks as you respond to COVID-19
 
Big Data LDN 2018: DATA MANAGEMENT AUTOMATION AND THE INFORMATION SUPPLY CHAI...
Big Data LDN 2018: DATA MANAGEMENT AUTOMATION AND THE INFORMATION SUPPLY CHAI...Big Data LDN 2018: DATA MANAGEMENT AUTOMATION AND THE INFORMATION SUPPLY CHAI...
Big Data LDN 2018: DATA MANAGEMENT AUTOMATION AND THE INFORMATION SUPPLY CHAI...
 
Big data
Big dataBig data
Big data
 
Data Governance
Data GovernanceData Governance
Data Governance
 
Big data beyond the hype may 2014
Big data beyond the hype may 2014Big data beyond the hype may 2014
Big data beyond the hype may 2014
 
Big data - The next best thing
Big data - The next best thingBig data - The next best thing
Big data - The next best thing
 
Big Data & Analytics Day
Big Data & Analytics Day Big Data & Analytics Day
Big Data & Analytics Day
 
Modern Data Management
Modern Data ManagementModern Data Management
Modern Data Management
 
Why Everything You Know About bigdata Is A Lie
Why Everything You Know About bigdata Is A LieWhy Everything You Know About bigdata Is A Lie
Why Everything You Know About bigdata Is A Lie
 
Finance and Accounting BPM
Finance and Accounting BPMFinance and Accounting BPM
Finance and Accounting BPM
 
Modern Data Challenges require Modern Graph Technology
Modern Data Challenges require Modern Graph TechnologyModern Data Challenges require Modern Graph Technology
Modern Data Challenges require Modern Graph Technology
 
Using a Data Lake at the core of a Life Assurance business
Using a Data Lake at the core of a Life Assurance businessUsing a Data Lake at the core of a Life Assurance business
Using a Data Lake at the core of a Life Assurance business
 
06 summary
06 summary06 summary
06 summary
 

Plus de MapR Technologies

Plus de MapR Technologies (20)

Converging your data landscape
Converging your data landscapeConverging your data landscape
Converging your data landscape
 
ML Workshop 2: Machine Learning Model Comparison & Evaluation
ML Workshop 2: Machine Learning Model Comparison & EvaluationML Workshop 2: Machine Learning Model Comparison & Evaluation
ML Workshop 2: Machine Learning Model Comparison & Evaluation
 
Self-Service Data Science for Leveraging ML & AI on All of Your Data
Self-Service Data Science for Leveraging ML & AI on All of Your DataSelf-Service Data Science for Leveraging ML & AI on All of Your Data
Self-Service Data Science for Leveraging ML & AI on All of Your Data
 
Enabling Real-Time Business with Change Data Capture
Enabling Real-Time Business with Change Data CaptureEnabling Real-Time Business with Change Data Capture
Enabling Real-Time Business with Change Data Capture
 
Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...
Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...
Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...
 
ML Workshop 1: A New Architecture for Machine Learning Logistics
ML Workshop 1: A New Architecture for Machine Learning LogisticsML Workshop 1: A New Architecture for Machine Learning Logistics
ML Workshop 1: A New Architecture for Machine Learning Logistics
 
Machine Learning Success: The Key to Easier Model Management
Machine Learning Success: The Key to Easier Model ManagementMachine Learning Success: The Key to Easier Model Management
Machine Learning Success: The Key to Easier Model Management
 
Data Warehouse Modernization: Accelerating Time-To-Action
Data Warehouse Modernization: Accelerating Time-To-Action Data Warehouse Modernization: Accelerating Time-To-Action
Data Warehouse Modernization: Accelerating Time-To-Action
 
Live Tutorial – Streaming Real-Time Events Using Apache APIs
Live Tutorial – Streaming Real-Time Events Using Apache APIsLive Tutorial – Streaming Real-Time Events Using Apache APIs
Live Tutorial – Streaming Real-Time Events Using Apache APIs
 
Bringing Structure, Scalability, and Services to Cloud-Scale Storage
Bringing Structure, Scalability, and Services to Cloud-Scale StorageBringing Structure, Scalability, and Services to Cloud-Scale Storage
Bringing Structure, Scalability, and Services to Cloud-Scale Storage
 
Live Machine Learning Tutorial: Churn Prediction
Live Machine Learning Tutorial: Churn PredictionLive Machine Learning Tutorial: Churn Prediction
Live Machine Learning Tutorial: Churn Prediction
 
An Introduction to the MapR Converged Data Platform
An Introduction to the MapR Converged Data PlatformAn Introduction to the MapR Converged Data Platform
An Introduction to the MapR Converged Data Platform
 
How to Leverage the Cloud for Business Solutions | Strata Data Conference Lon...
How to Leverage the Cloud for Business Solutions | Strata Data Conference Lon...How to Leverage the Cloud for Business Solutions | Strata Data Conference Lon...
How to Leverage the Cloud for Business Solutions | Strata Data Conference Lon...
 
Best Practices for Data Convergence in Healthcare
Best Practices for Data Convergence in HealthcareBest Practices for Data Convergence in Healthcare
Best Practices for Data Convergence in Healthcare
 
Geo-Distributed Big Data and Analytics
Geo-Distributed Big Data and AnalyticsGeo-Distributed Big Data and Analytics
Geo-Distributed Big Data and Analytics
 
MapR Product Update - Spring 2017
MapR Product Update - Spring 2017MapR Product Update - Spring 2017
MapR Product Update - Spring 2017
 
3 Benefits of Multi-Temperature Data Management for Data Analytics
3 Benefits of Multi-Temperature Data Management for Data Analytics3 Benefits of Multi-Temperature Data Management for Data Analytics
3 Benefits of Multi-Temperature Data Management for Data Analytics
 
Cisco & MapR bring 3 Superpowers to SAP HANA Deployments
Cisco & MapR bring 3 Superpowers to SAP HANA DeploymentsCisco & MapR bring 3 Superpowers to SAP HANA Deployments
Cisco & MapR bring 3 Superpowers to SAP HANA Deployments
 
MapR and Cisco Make IT Better
MapR and Cisco Make IT BetterMapR and Cisco Make IT Better
MapR and Cisco Make IT Better
 
Evolving from RDBMS to NoSQL + SQL
Evolving from RDBMS to NoSQL + SQLEvolving from RDBMS to NoSQL + SQL
Evolving from RDBMS to NoSQL + SQL
 

Dernier

Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

Dernier (20)

Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 

Dataguise & MapR: Action Items for the Financial Industry

  • 1. Action Items for the Financial Industry: Addressing fraud and privacy issues in Hadoop May 1, 2014
  • 2. Our Speakers Jeremy Stieglitz VP Product Management Dataguise Dale Kim Director, Industry Solutions MapR Technologies 5 Action Items for the Financial Services Industry – April 2014 2
  • 3. Executive Summary  Big Data has become priority #1 for finance and large enterprise in 2014: – Faster time to insights – Increasingly touching $$$ – Real-time, Automation and On Demand  Key challenges on Big Data – Leveraging sensitive assets – Automating protection techniques – Maintaining high performance – Ensuring scalability 5 Action Items for the Financial Services Industry – April 2014 3
  • 4. Agenda  Market overview: Hadoop in Financial Services  Solution priorities  5 action items – Strategize – develop concrete goals – Automate, automate, automate – Foresee high rates of innovation – Expand your community – Explore the technology landscape 5 Action Items for the Financial Services Industry – April 2014 4
  • 6. Hadoop in Financial Services 5 Action Items for the Financial Services Industry – April 2014 6 Hadoop addresses: • Scale • Infrastructure costs • Anomaly detection • Data archiving for compliance • Aggregated risk • Data protection • Etc. Risk Market Uncertainty Regulations Costs Fraud Security Attacks Big Data
  • 7. Business Challenge: Data Growth  100% growth and 80% unstructured data by 2015 …finding and classifying sensitive data will get harder 7 Exabytes 5 Action Items for the Financial Services Industry – April 2014
  • 8. Compliance Universe 5 Action Items for the Financial Services Industry – April 2014 • Sarbanes-Oxley • Frank-Dodd • PCI-DSS • HIPAA • State Data Breach Laws (CA SB 1386) • Data Privacy (International) • EU Data Protection Directive • Singapore Personal Data Protection Act • Canadian Personal Information Protection and Electronic Documents Act (PIPEDA) • Germany’s Federal Data Protection Act (BDSG) • Great Britain Data Protection Act • Data Privacy (USA) • Ohio 1347.15 • California SB 1386 • Massachusetts Data Privacy Law • Financial • Basel III • Gramm-Leach-Bliley Act (GLBA) • J-SOX • Technology Risk Management Guidelines • Health/Pharmaceutical • NAIC Model Audit Rule (MAR) • 21 CFR Part 11 (FDA) • Energy • North American Electric Reliability Corp (NERC) • Federal Energy Regulatory Commission Regulations (FERC) • Service Providers • Statement on Auditing Standards (SAS 70) • Education • Family Educational Rights and Privacy Act (FERPA) • Federal Information Security Management Act (FISMA) 63 countries, 1200+ laws
  • 9. 5 Action Items for the Financial Services Industry – April 2014 9 9 Solution Priorities
  • 10. Solution Priorities Performance Compute Memory Network I/ODisk Software If you skimp on one, you impact performance 5 Action Items for the Financial Services Industry – April 2014
  • 11. Solution Priorities Scale Plan for growth, beyond your simple projections: • Longer time windows of data • New data sources • New use cases 5 Action Items for the Financial Services Industry – April 2014
  • 12. Solution Priorities Reliability Plan ahead with all the right people -- devops, sys admins, application developers, etc. 5 Action Items for the Financial Services Industry – April 2014
  • 13. Hadoop Security Framework Perimeter Guarding access to the cluster itself Technical Concepts: Authentication Network isolation Data Protecting data in the cluster from unauthorized visibility Technical Concepts: Encryption, Tokenization, Data masking  The 4 approaches to address security within Hadoop (Perimeter, Data, Access, Visibility)  MapR provides true multi-tenant Hadoop, along with authentication/ authorization controls  Dataguise discovers & protects at the data layer and provides visibility for audit reporting and data lineage Perimeter Guarding access to the cluster itself Technical Concepts: Authentication Network isolation Access Defining what users and applications can do with data Technical Concepts: Permissions Authorization Visibility Reporting on where data came from and how it’s being used Technical Concepts: Auditing Lineage 5 Action Items for the Financial Services Industry – April 2014
  • 14. 5 Actions You Should Take 5 Action Items for the Financial Services Industry – April 2014 14
  • 15. #1: Strategize – Develop Concrete Goals  Are you prepared for production? – Expect stringent SLAs – Plan for performance, scale, reliability, and data security  Additional use cases? – Do not limit your possibilities – Hadoop deployments typically handle multiple business problems  The ‘Aha’ vs. ‘Gotcha’ moment 5 Action Items for the Financial Services Industry – April 2014 15
  • 16. #2: Automate, Automate, Automate – Keep up with data growth – Simplify critical tasks – Reduce the risk of error with manual effort Your Data will grow 6000% in six years. Your headcount will grow 1.5x 5 Action Items for the Financial Services Industry – April 2014
  • 17. Payment Risk Management at Major Credit Card Brand  Multiple business apps: fraud detection, risk analytics, cross sell  High degree of automation  “Silver”, “Gold” and “Platinum” Hadoop domains with increasing lockdown of customer data 17 “Our analytics business draws on the purchasing data of its 90 million credit card holders across 127 countries.” 5 Action Items for the Financial Services Industry – April 2014
  • 18. Hadoop Payment Risk Management at Major Credit Card Brand Customer uses SFTP and NFS to load data into Hadoop. All data loaded in the clear. Dataguise masking runs as MapReduce JAR (automatically, with no programming required). HDFS “Gold” Cluster Any incremental updates to HDFS are automatically protected. Credit card firm uses access control to determine access to private data Dataguise masking guarantees consistency between sensitive elements, ensuring credit card group can run same statistical distribution and analytics without exposure risk u v w Omniture FilesCredit Card Transactions (txt) NFS copy SFTP HDFS Cluster “Silver” x Source Data 5 Action Items for the Financial Services Industry – April 2014
  • 19. #3: Foresee High Rates of Innovation  Hadoop benefits from many new innovations: {Falcon, Kite, Storm, Mesos, Drill, Impala, Spark, Shark, etc.}  Design non-blocking technical choices (especially in areas of high velocity). – Many files formats you can use (txt, Snappy, Avro, SequenceFile, RC, ORC, etc.) – Make sure your security assumptions aren’t tied to specific formats 5 Action Items for the Financial Services Industry – April 2014 19
  • 20. #4: Expand Your Community  Security is a “common good,” share ideas with: – Different divisions at your firm – Professionals in your network – Competitors  Who else are Hadoop/security experts? – E-commerce – Telecommunications – Web 2.0 – Government 5 Action Items for the Financial Services Industry – April 2014
  • 21.  Security analytics and fraud detection  Build statistical models to detect fraud, and mine data to evaluate suspicious activities  Huge volumes of collected data to identify fraud patterns required massive scalability Fraud Detection and Security Analytics at Zions Bank “We initially got into centralizing all of our data from an information security perspective. We then saw that we could use this same environment to help with fraud detection.” 5 Action Items for the Financial Services Industry – April 2014
  • 22. #5: Explore the Technology Landscape • Deciding on the right technology is hard • Undoing a bad decision is harder • Talk to vendors, analysts, community • Attend meet-ups, conferences • Validate advice you get with proof points 5 Action Items for the Financial Services Industry – April 2014
  • 23. MapR Distribution for Hadoop BIG DATA BEST PRODUCT BUSINESS IMPACT Hadoop Top Ranked Production Success 5 Action Items for the Financial Services Industry – April 2014
  • 24. Dataguise: Market Leader in Big Data Protective Intelligence (BDPI) Only solution with Hadoop data discovery Best in class– data protection with simplicity, scalability, and automation Business friendly to operators and business analysts 24 5 Action Items for the Financial Services Industry – April 2014
  • 25. Thank You Jeremy Stieglitz VP Product Management jeremy@dataguise.com Dale Kim Director, Industry Solutions dalekim@mapr.com 5 Action Items for the Financial Services Industry – April 2014 25