4. Nitin Bandugula
Product Marketing Manager
MapR Technologies
Kevin Petrie
Senior Director
Attunity
George Corugedo
Chief Technology Officer & Co-Founder
RedPoint Global Inc.
27. 27 RedPoint Global Inc. 2015 Confidential
Overview of RedPoint Global
Launched 2006
Founded and staffed by industry veterans
Headquarters: Wellesley, Massachusetts
Offices in US, UK, Australia, Philippines
Global customer base
Serves most major industries
Magic Quadrant: Data Quality
Magic Quadrant: Multichannel Campaign Management
Magic Quadrant: Integrated Marketing Management
28. Extensive experience with a diverse customer base
29. Cloudera Stack
30. Andrew Brust, GigaOm Research
31. There Is a Lot of Hype Out There
32. Don’t believe the Marketing Hype
33. Data Hub for MDM
[Diagram labels]
Data Hub (1 to n) on YARN
Production RDBMS Databases
Data Ingestion
Data Quality Processing
Persistent Entity Resolution, Linkage and Keying
Specialized Analytic Databases & Caches
Any analytics
Any reporting
Analytics: Predictive Analytics, Clustering, Profiling
Interaction Systems: Marketing Automation, Real Time Personalization, Omni-Channel Optimization, Digital and Traditional Channels
34. How About MDM on a Data Lake?
Challenges to Data Lake Approach
• Severe shortage of MapReduce-skilled resources
• Inconsistent skills lead to inconsistent results from code-based solutions
• Nascent technologies require multiple point solutions
• Technologies are not enterprise grade
• Some functionality may not be possible within these frameworks
Benefits of a Hadoop Data Lake
• Data is ingested in its raw state regardless of format, structure or lack of structure
• Raw data can be used and reused for differing purposes across the enterprise
• Beyond inexpensive storage, Hadoop is an extremely powerful, scalable and segmentable computational platform
• Master Data can be fed across the enterprise, and deep analytics on clean data is immediately enabled
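The first two benefits (raw ingestion and reuse of raw data for different purposes) are often described as "schema on read". The following is a minimal sketch of that idea in plain Python, not Hadoop code. The in-memory `raw_zone` list, the source tags `crm_json` and `orders_csv`, and the field names are all hypothetical stand-ins for files landed in a lake.

```python
import csv
import io
import json

# Stand-in for the lake's raw zone: payloads are stored exactly as received.
raw_zone = []


def ingest(payload, source):
    """Land a payload untouched; no parsing or validation happens at write time."""
    raw_zone.append({"source": source, "payload": payload})


def read_customers():
    """One possible reader: structure is applied only when the data is read."""
    for item in raw_zone:
        if item["source"] == "crm_json":          # hypothetical source tag
            record = json.loads(item["payload"])
            yield record["name"], record["email"]
        elif item["source"] == "orders_csv":      # hypothetical source tag
            for row in csv.DictReader(io.StringIO(item["payload"])):
                yield row["customer"], row["email"]


ingest('{"name": "Ada", "email": "ada@example.com"}', "crm_json")
ingest("customer,email\nBob,bob@example.com\n", "orders_csv")
customers = list(read_customers())
```

A second reader with a different purpose (for example, order totals) could be written against the same `raw_zone` without re-ingesting anything, which is the reuse the slide refers to.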
35. Key Functions for Master Data Management
ETL & ELT
• Profiling, reads/writes, transformations
• Single project for all jobs
Data Quality
• Cleanse data
• Parsing, correction
• Geo-spatial analysis
Integration & Matching
• Grouping
• Fuzzy match
Master Key Management
• Create keys
• Track changes
• Maintain matches over time
Web Services Integration
• Consume and publish
• HTTP/HTTPS protocols
• XML/JSON/SOAP formats
Process Automation & Operations
• Job scheduling, monitoring, notifications
• Central point of control
• Meta Data Management
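The matching and key-management functions listed on this slide (cleanse, fuzzy match, create keys, maintain matches over time) can be sketched roughly as follows. This is an illustration only, not RedPoint's method: it uses Python's standard `difflib.SequenceMatcher` as the similarity measure, and the `MasterKeyRegistry` class, its `assign` method, and the 0.85 threshold are hypothetical names and values chosen for the example.

```python
from difflib import SequenceMatcher
from itertools import count


def normalize(name):
    """Cleanse step: lowercase, drop punctuation, collapse whitespace."""
    kept = "".join(ch for ch in name.lower() if ch.isalnum() or ch.isspace())
    return " ".join(kept.split())


def similarity(a, b):
    """Fuzzy-match score in [0, 1] based on difflib's ratio."""
    return SequenceMatcher(None, normalize(a), normalize(b)).ratio()


class MasterKeyRegistry:
    """Hypothetical registry that keeps one persistent key per entity."""

    def __init__(self, threshold=0.85):
        self.threshold = threshold   # illustrative cutoff, not a tuned value
        self._next_id = count(1)
        self.members = {}            # master key -> list of names seen so far

    def assign(self, name):
        """Return the key of the best-matching entity, or create a new key."""
        best_key, best_score = None, 0.0
        for key, names in self.members.items():
            score = max(similarity(name, known) for known in names)
            if score > best_score:
                best_key, best_score = key, score
        if best_key is not None and best_score >= self.threshold:
            self.members[best_key].append(name)   # maintain the match over time
            return best_key
        new_key = next(self._next_id)             # create a new master key
        self.members[new_key] = [name]
        return new_key


registry = MasterKeyRegistry()
k1 = registry.assign("Acme Corporation")
k2 = registry.assign("ACME Corporation.")   # identical after cleansing
k3 = registry.assign("Globex Industries")   # unrelated name, gets a new key
```

Comparing every incoming name against every known name is quadratic, which is why production systems typically group (block) candidates first, as the "Grouping" bullet suggests.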
36. Overview - What is Hadoop/Hadoop 2.0
Hadoop 1.0
• All operations based on MapReduce
• Intrinsic inconsistency of code-based solutions
• Highly skilled and expensive resources needed
• Third-party applications constrained by the need to generate code
Hadoop 2.0
• Introduction of YARN: “a general-purpose, distributed, application management framework that supersedes the classic Apache Hadoop MapReduce framework for processing data in Hadoop clusters.”
• Mature applications can now operate directly on Hadoop
• Reduced skill requirements and increased consistency
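To make the Hadoop 1.0 constraint concrete, here is a minimal pure-Python simulation of the map, shuffle, and reduce phases that every job had to be expressed in. It does not use any Hadoop API; the function names `map_phase`, `shuffle`, `reduce_phase`, and `run_job` are made up for this sketch, and word counting is just the customary example.

```python
from collections import defaultdict


def map_phase(record):
    """Map: emit one (key, value) pair per word in the input record."""
    for word in record.lower().split():
        yield word, 1


def shuffle(pairs):
    """Shuffle: group all emitted values by key, as the framework would."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Reduce: collapse the values for one key into a single result."""
    return key, sum(values)


def run_job(records):
    """Run the three phases in sequence on an in-memory list of records."""
    pairs = (pair for record in records for pair in map_phase(record))
    return dict(reduce_phase(k, v) for k, v in shuffle(pairs).items())


counts = run_job(["clean data builds trust", "trust clean data"])
```

Even a simple count has to be decomposed into key/value emission and per-key aggregation, which hints at why the slide ties code-based MapReduce solutions to skill requirements and inconsistent results.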
37. RedPoint Data Management on Hadoop
[Diagram labels: Partitioning (AM / Tasks); Execution (AM / Tasks); Data I/O; Key / Split Analysis; Parallel Section; YARN; MapReduce]
41. Data Lake Architecture for MDM
42. Recommendations for Data Quality
• There is a gap between current use and the mainstream
• Don’t believe the hype; there’s plenty of it
• Data Quality creates trust in information, which enables confident and nimble decision-making
• Look for broad enterprise apps that have solved the parallel scalability problem
• Consider a Data Hub approach to Data Quality for maximum flexibility and scalable performance
43. George Corugedo
Chief Technology Officer
George.corugedo@redpoint.net
781.725.0252
Download our white paper
From Yawn to Yarn: Why You Should be Excited about Hadoop
Redpoint.net/dbtawebinar