1 
© Talend 2014 
Talend: Solutions 
Overview
2 
About the Presenter 
Rajan Kanitkar 
• Senior Solutions Engineer 
• Rajan Kanitkar is a Pre-Sales Consultant with Talend. He 
has been active in the broader Data Integration space for 
the past 15 years and has experience with several 
leading-edge software companies in this space. His areas of 
specialty at Talend include Data Integration (DI), Big 
Data (BD), Data Quality (DQ), and Master Data 
Management (MDM). 
• Contact: rkanitkar@talend.com 
3 
Talend Big Data Platform 
Hadoop, MapReduce, NoSQL capabilities … 
4 
The Big Data Ecosystem 
• Hadoop: the core project 
• HDFS: the Hadoop Distributed File System 
• MapReduce: the software framework for distributed 
processing of large data sets 
• Hive: a data warehouse infrastructure that provides 
data summarization and a querying language 
• Pig: a high-level data-flow language and execution 
framework for parallel computation 
• HBase: the Hadoop database. Use it when 
you need random, real-time read/write access to 
your Big Data 
• And many more: Sqoop, HCatalog, 
ZooKeeper, Oozie, Cassandra, MongoDB, Flume, 
Impala, Stinger, Neo4J, etc. 
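To make the MapReduce bullet above concrete, here is a minimal in-memory sketch of the map, shuffle, and reduce steps, using word counting as the example. It is an illustration only: it runs in a single Python process rather than on a Hadoop cluster, and the function names (`map_phase`, `shuffle`, `reduce_phase`, `word_count`) are hypothetical, not part of Hadoop or Talend.

```python
from collections import defaultdict


def map_phase(records, mapper):
    """Apply the mapper to every record and collect the emitted (key, value) pairs."""
    pairs = []
    for record in records:
        pairs.extend(mapper(record))
    return pairs


def shuffle(pairs):
    """Group values by key, which a MapReduce framework does between map and reduce."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(grouped, reducer):
    """Call the reducer once per key with all values collected for that key."""
    return {key: reducer(key, values) for key, values in grouped.items()}


def word_mapper(line):
    """Emit (word, 1) for every whitespace-separated word, lower-cased."""
    return [(word.lower(), 1) for word in line.split()]


def sum_reducer(key, values):
    """Sum the counts emitted for one word."""
    return sum(values)


def word_count(lines):
    """Chain the three steps for the word-count example."""
    return reduce_phase(shuffle(map_phase(lines, word_mapper)), sum_reducer)


if __name__ == "__main__":
    print(word_count(["big data", "Big Data platform"]))
```

On a real cluster the map and reduce calls run in parallel on different nodes, and the shuffle moves data across the network; the sketch keeps only the data flow.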
5 
Talend’s Solution 
6 
Key differentiator of Our Next Gen Architecture… 
[Diagram: Code Generator with four targets]
• JAVA: ETL, day-to-day integration, run everywhere
• SQL: ELT, DW appliance (Teradata, Netezza…)
• MapReduce + Pig + HiveQL + Sqoop + …: Hadoop, highly scalable Hadoop grid
• CAMEL: message transformation, high frequency
• No black-box engine
• Enables light-weight distributed, customizable and parallelizable run time
• Standards-based
7 
Trying to get from this…
8 
Talend Big Data – “pure Hadoop” 
Design visually in MapReduce and optimize before 
deploying on Hadoop 
to this…
9 
Native Map/Reduce Jobs 
• Create classic ETL patterns using native Map/Reduce 
- Only data management solution on the market to generate native 
Map/Reduce code 
• Reduce the need for big 
data coding skills 
• Zero pre-installation on 
the Hadoop cluster 
• Hadoop is the “engine” 
for data processing
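As an illustration of what a "classic ETL pattern" looks like when expressed as map and reduce steps, here is a hedged sketch of a reduce-side inner join (a lookup of customer names for order records). This is not the code Talend generates; it is a single-process Python analogue, and the record layout (`id`, `name`, `customer_id`, `amount`) and function names are assumptions made for the example.

```python
from collections import defaultdict


def tag_customers(customers):
    """Map step for the lookup side: emit (join key, tagged record)."""
    for customer in customers:
        yield customer["id"], ("customer", customer)


def tag_orders(orders):
    """Map step for the main flow: emit (join key, tagged record)."""
    for order in orders:
        yield order["customer_id"], ("order", order)


def reduce_side_join(customers, orders):
    """Inner-join orders to customers by grouping both inputs on the join key."""
    # Shuffle: collect every tagged record under its join key.
    grouped = defaultdict(list)
    for key, tagged in list(tag_customers(customers)) + list(tag_orders(orders)):
        grouped[key].append(tagged)

    # Reduce: for each key, pair every customer record with every order record.
    joined = []
    for key in sorted(grouped):
        names = [rec["name"] for tag, rec in grouped[key] if tag == "customer"]
        amounts = [rec["amount"] for tag, rec in grouped[key] if tag == "order"]
        for name in names:
            for amount in amounts:
                joined.append({"customer_id": key, "name": name, "amount": amount})
    return joined


if __name__ == "__main__":
    customers = [{"id": 1, "name": "Acme"}, {"id": 2, "name": "Globex"}]
    orders = [
        {"customer_id": 1, "amount": 10},
        {"customer_id": 1, "amount": 5},
        {"customer_id": 3, "amount": 7},
    ]
    for row in reduce_side_join(customers, orders):
        print(row)
```

Keys that appear on only one side (customer 2 and order key 3 in the sample data) produce no output rows, which is the inner-join behaviour.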
10 
MapReduce 2.0, YARN, Storm, Spark 
• YARN: Ensures predictable performance & QoS for all apps 
• Enables apps to run “IN” Hadoop rather than “ON” 
• In Labs: Streaming with Apache Storm 
• In Labs: mini-Batch and In-Memory with Apache Spark 
Applications Run Natively IN Hadoop 
• BATCH (MapReduce) 
• INTERACTIVE (Tez) 
• STREAMING (Storm, Spark) 
• GRAPH (Giraph) 
• NoSQL (MongoDB) 
• EVENTS (Falcon) 
• ONLINE (HBase) 
• OTHER (Search) 
YARN (Cluster Resource Management) 
HDFS2 (Redundant, Reliable Storage) 
Source: Hortonworks
11 
Talend: Ingest – Transform – Deliver 
[Diagram: Talend capabilities around the Hadoop 2 stack] 
• INGEST (Ingestion): SQOOP, FLUME, HDFS API, HBase API 
• TRANSFORM (Data Refinement): MAP, PROFILE, PARSE, CLEANSE, CDC, STANDARDIZE, MACHINE LEARNING, MATCH 
• DELIVER (as an API): Karaf, ActiveMQ 
• Other labels: iPaaS, MDM, HA, Govern, Security, Meta, Storm, Kafka, CXF, Camel, 800+, HIVE 
• Hadoop stack: BATCH (MapReduce), INTERACTIVE (Tez), STREAMING (Storm, Spark), GRAPH (Giraph), NoSQL (MongoDB), EVENTS (Falcon), ONLINE (HBase), OTHER (Search), on YARN (Cluster Resource Management) and HDFS2 (Redundant, Reliable Storage) 
12 
Talend Big Data Sandbox & 
Talend Big Data Jumpstart 
Delivering instant value from all your data
13 
BIG DATA CHALLENGES 
The Big Data Customer Discussion 
14 
Top Big Data Challenges 
Talend Directly 
Addresses these 
Challenges 
Source: Gartner - Survey Analysis: Big Data Adoption in 2013 Shows Substance 
Behind the Hype - 12 September 2013 - G00255160
15 
Talend’s Solution 
16 
TALEND BIG DATA SANDBOX 
30-day customer trial 
17 
Cookbook Step-by-Step Directions 
• Completely Self-contained Demo Sandbox 
• Key Scenarios: 
- Twitter Analysis 
- Clickstream Analysis 
- Web Log analysis 
- ETL Offload 
• Scenario Summaries 
- Social Media insights 
- Channel optimization 
- Customer insights 
- Data Warehouse Cost Reduction 
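The Web Log analysis scenario is not detailed on the slide, so the following is only a hedged sketch of the kind of processing such a scenario typically involves: parsing Apache access-log lines and counting successful hits per path. It assumes the Common Log Format; the names `LOG_PATTERN`, `parse_line`, and `hits_per_path` and the sample lines are hypothetical and are not taken from the Sandbox.

```python
import re
from collections import Counter

# Common Log Format: host ident user [time] "method path protocol" status size
LOG_PATTERN = re.compile(
    r'(?P<host>\S+) \S+ \S+ \[(?P<time>[^\]]+)\] '
    r'"(?P<method>\S+) (?P<path>\S+) (?P<protocol>[^"]+)" '
    r'(?P<status>\d{3}) (?P<size>\S+)'
)


def parse_line(line):
    """Return the named fields of one log line, or None if it does not match."""
    match = LOG_PATTERN.match(line)
    return match.groupdict() if match else None


def hits_per_path(lines):
    """Count requests per path, keeping only 2xx responses."""
    counts = Counter()
    for line in lines:
        record = parse_line(line)
        if record is not None and record["status"].startswith("2"):
            counts[record["path"]] += 1
    return counts


if __name__ == "__main__":
    sample = [
        '127.0.0.1 - - [10/Oct/2014:13:55:36 -0700] "GET /index.html HTTP/1.0" 200 2326',
        '127.0.0.1 - - [10/Oct/2014:13:55:40 -0700] "GET /missing HTTP/1.0" 404 209',
        '10.0.0.7 - - [10/Oct/2014:13:56:01 -0700] "GET /index.html HTTP/1.0" 200 2326',
        'not a log line',
    ]
    print(hits_per_path(sample))
```

Malformed lines are skipped rather than raising an error, which is a common choice for log data but may not suit every pipeline.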
18 
Ready for Launch 
• Announcements 
- Public announcement: Tuesday 15th July 
- Newsletter was sent 9th July 
• Customer Nurture campaign 
- Scenario reminders, videos & Links 
- Reminder to Talend AE 
• Two Routes for 5.5 
- Sandbox Download publicly available – 15th July 
- Jumpstart and AE ‘access’ – 15th July 
• Links for the 15th (Sandbox download) 
- Public: http://www.talend.com/talend-big-data-sandbox 
- Account Exec: send download link for customer to fill in: 
https://info.talend.com/prodevaltpbdsandbox
19 
TALEND BIG DATA JUMPSTART 
A ‘guided tour’ of the Sandbox 
20 
Why the ‘Jumpstart’? 
Practical 
Guided Tour 
• Led by a Talend Solutions Engineer 
• Learn about the Talend Studio 
• See how to execute Hadoop processes 
- Map/Reduce with YARN 
- Pig 
- HDFS 
• See NoSQL Examples 
- Hive 
- HBase 
- MongoDB 
- Cassandra 
21 
Key benefits 
• NO Configuration/Development 
• INSTANT results now, for the Future 
• Valuable prototypes for FREE 
• Working on the top THREE Hadoop Distributions 
22 
3 Simple Messages 
• Sandbox is customer-led; Jumpstart is sales-led 
• Jumpstart is the best way to ‘get Talend’ 
- Google: Talend Jumpstart 
• Work to get the best conversation & involve pre-sales 
23 
Sandbox 
- Talend Jumpstart Sandbox - virtual image installed with: 
• Apache Hadoop distributions provided by Hortonworks, Cloudera & MapR 
• Pre-configured Talend Platform for Big Data 5.5* 
• Four scenarios for you to try: 
– Clickstream data 
– Twitter sentiment 
– Apache weblogs 
– ETL Offload 
• Demonstrations of several NoSQL databases 
*Includes Talend Studio (graphical IDE), team working, 
management, data quality and advanced big data features. 
www.talend.com/products/platform-for-big-data
24 
SHOW ME 
Talend Demo 

Talend Big Data Capabilities - 2014