Big Data launch keynote Singapore Patrick Buddenbaum
1. Open Platform for Next-Gen Analytics
Patrick Buddenbaum
Director, Enterprise Segment
Datacenter and Connected System Group
2. Legal Information
Todayās presentations contain forward-looking statements. All statements made that are not historical facts are subject to a number of
risks and uncertainties, and actual results may differ materially. Please refer to our most recent Earnings Release and our most recent
Form 10-Q or 10-K filing for more information on the risk factors that could cause actual results to differ.
If we use any non-GAAP financial measures during the presentations, you will find on our website, intc.com, the required reconciliation
to the most directly comparable GAAP financial measure.
INFORMATION IN THIS DOCUMENT IS PROVIDED āAS ISā. NO LICENSE, EXPRESS OR IMPLIED, BY ESTOPPEL OR
OTHERWISE, TO ANY INTELLECTUAL PROPERTY RIGHTS IS GRANTED BY THIS DOCUMENT. INTEL ASSUMES NO LIABILITY
WHATSOEVER AND INTEL DISCLAIMS ANY EXPRESS OR IMPLIED WARRANTY, RELATING TO THIS INFORMATION
INCLUDING LIABILITY OR WARRANTIES RELATING TO FITNESS FOR A PARTICULAR PURPOSE, MERCHANTABILITY, OR
INFRINGEMENT OF ANY PATENT, COPYRIGHT OR OTHER INTELLECTUAL PROPERTY RIGHT.
Performance tests and ratings are measured using specific computer systems and/or components and reflect the approximate
performance of Intel products as measured by those tests. Any difference in system hardware or software design or configuration may
affect actual performance. Buyers should consult other sources of information to evaluate the performance of systems or components
they are considering purchasing. For more information on performance tests and on the performance of Intel products, reference
www.intel.com/software/products.
Software and workloads used in performance tests may have been optimized for performance only on Intel
microprocessors. Performance tests, such as SYSmark and MobileMark, are measured using specific computer systems, components,
software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other
information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance of that
product when combined with other products.
Intel product plans in this presentation do not constitute Intel plan of record product roadmaps. Please contact your Intel representative
to obtain Intel's current plan of record product roadmaps.
3. Making Sense of One Petabyte
50x 13y 11s
To read To view To generate
in Library of Congress as HD Video in 2012
Sources: IDC 2012, The Digital Universe in 2020: Big Data, Bigger Digital Shadows, and Biggest Growth in the Far East
http://blogs.loc.gov/digitalpreservation/2011/07/transferring-libraries-of-congress-of-data/
4. Analysis of Data can Transform Society
Enhance understanding, drive
innovation, and accelerate medical cures
Create new business models and
transform organizational processes
Improve public safety and increase
energy efficiency with smart grids
5. Virtuous Cycle of Data-Driven User Experience
Richer
user experiences
Richer data to
analyze
CLIENTS
Richer data
CLOUD from devices
INTELLIGENT
SYSTEMS
6. Democratize Data Analysis from Edge to Cloud
Unlock value in silicon
Support open platforms
Intelligent Systems
Framework
7. Intel at the Intersection of Big Data Forces
HPC Cloud Open Source
IntelĀ®
TrueScale
Infiniband
Enabling exascale computing Helping enterprises build Contributing code and
on massive data sets open interoperable clouds fostering ecosystem
* Other names and brands may be claimed as the property of others.
8. History of Intel and Apache Hadoop*
Product
Optimization
Tuning
Benchmarking
Release 2.0
Research Telco Smart City
(2012)
Release 1.0
HiBench Healthcare Retail (2011)
Web
Open Cirrus*
2009 2013
* Other names and brands may be claimed as the property of others.
9. Announcing Availability of
IntelĀ® Distribution for Apache Hadoop* software
Hardware-enhanced performance & security
Enables partner innovation in analytics
Strengthens Apache Hadoop* ecosystem
* Other names and brands may be claimed as the property of others.
10. IntelĀ® Distribution for Apache Hadoop* software
ā¢ Up to 20x faster decryption with AES-NI*
ā¢ Granular access controls for Hbase
ā¢ Optimized with SSD and Cache Acceleration
ā¢ Up to 8.5X faster queries in Hive
ā¢ Hardware-enhanced compression with AVX & SSE4.2
ā¢ Automated tuning with IntelĀ® Active Tuner
*Based on internal testing
11. Intel Distribution for Apache Hadoop* software
IntelĀ® Manager for Apache Hadoop software
Deployment, Configuration, Monitoring, Alerts, and Security
Data Exchange
Oozie Pig Mahout R connectors Hive
Sqoop
Workflow Scripting Machine Learning Statistics SQL Query
Columnar Store
HBase
Coordination
Zookeeper
YARN (MRv2)
Distributed Processing Framework
Log Collector
Flume
HDFS
Hadoop Distributed File System
Intel unique
Intel enhancements contributed back to open source
Open source components included without change * Other names and brands may be claimed as the property of others.
12. Sold with World-Class Intel Support
Annual Subscription with Technical Support
Support Coverage Options: 24x7 or 8x5
Via Solution Vendors and Service Providers
13. Continued Innovation
Pipeline of innovation from Intel Labs
ā¢ Machine Learning, Graph Lab & Graph Builder
ā¢ Data-Intensive Algorithms & Computer Architecture
Roadmap of open source from Intel Software
ā¢ Project Rhino: Hardening Apache Hadoop
ā¢ Project Panthera: Standard SQL on Apache Hadoop
* Other names and brands may be claimed as the property of others.
14. Backed by Broad Portfolio of Datacenter Products
Software
Cache
Acceleration
Software
Server Storage & Memory Network
15. Antoine Hue
Regional Sales Manager
APJC Data Center
* Other names and brands may be claimed as the property of others.
16. >4 Hours to 7 Minutes
Intel Platform Benefits for Sorting 1TB Data
>4 Hours IntelĀ®
XeonĀ®
E5-2690
processor
~50% IntelĀ® SSD
improved 520 IntelĀ® Deploy Intel
Series 10GbE Distribution
Adapters for Apache
~80% Hadoop*
IntelĀ® Xeon 5690 improved ~50% ~40%
improved improved
7200 HDD
1GbE Adapters
~7 mins
Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, such as SYSmark and MobileMark, are measured using specific computer systems, components, software, operations and functions. Any
change to any of those factors may cause the results to vary. You should consult other information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance of that product when combined with other products.
Source: Intel Internal testing
For more information go to : intel.com/performance
`
17. Proven in the Enterprise
Using the IntelĀ® Distribution to gain tremendous results
IT
* Other names and brands may be claimed as the property of others.
22. The Promise of Big Data Requires Industrialized Services
23. Big Data Customers Need
ā¢ Trusted, mission critical, high-powered
computing solutions
ā¢ Robust security options
ā¢ Enterprise-grade global storage capabilities
BIG
ā¢ Highly available compute power
ā¢ Cloud-based economic model
DATA
ā¢ Expert consulting services to aide in
transformation of data assets
26. Summary
ā¢ Intel announced IntelĀ® Distribution for Apache Hadoop* software
ā¢ Delivers performance, security and ease of deployment
ā¢ Backed by broad portfolio of Intel data center products
ā¢ Contributes to open source and supports Apache Hadoop
ā¢ Enabling ecosystem of partners to innovate on analytics solutions