SlideShare une entreprise Scribd logo
1  sur  38
ON
ON
Neo4j on IBM POWER8
Philip Rathle
VP of Products
Neo Technology
Keshav Ranganathan
Senior Offering Manager, Data & Analytics Solutions
IBM POWER Systems
ONNeo4j on IBM POWER Systems
Key Takeaways:
• Why Graphs & Why Now?
• Unique Characteristics of Graph Data &
Architecture Implications
• IBM Power Systems Overview
• Why deploy Neo4j on IBM Power Systems
• Q&A
2
ON
ON
Neo4j on IBM Power Systems
Solves Massive-Scale,
Previously Unsolvable Problems
A paradigm shift accelerating time to
insight and real-time decision making…
Bringing big data insights into action
ON
A Walk Back in Time…
4
ON
Data Management in 1990
Paper Forms
Tiny RAM Spinning Platters
(Low Capacity / Sequential IO)5
ONTraditional DBMS Technology
6
ON
Data Management in 2016
Dynamic Real-World Systems
Abundant RAM
Flash & CAPI
(High-Capacity Storage &
Ultra-Fast Random I/O)
7
ON
8
ON
9
ON
Sep 2015May 2015Jan 2015Sep 2014May 2014Jan 2014Sep 2013May 2013
100
PopularityChanges
500
600
700
200
300
400
Jan 2013
© DB-Engines.com 2015
Popular Movement
• Wide column stores
• RDF stores
• Document stores
• Search engines
• Native XML DBMS
• Key-value stores
• Object oriented DBMS
• Multivalue DBMS
• Times Series DBMS
Relational database
Graph Database
10
ON
“Graph analysis is possibly the single most effective competitive
differentiator for organizations pursuing data-driven operations
and decisions after the design of data capture.”
By the end of 2018, 70% of leading organizations will have one or
more pilot or proof-of-concept efforts underway utilizing graph
databases.
Analyst Perspective
“Forrester estimates that over 25% of enterprises will be using
graph databases by 2017”
IT Market Clock for Database Management Systems, 2014
https://www.gartner.com/doc/2852717/it-market-clock-database-management
TechRadar™: Enterprise DBMS, Q1 2014
http://www.forrester.com/TechRadar+Enterprise+DBMS+Q1+2014/fulltext/-/E-RES106801
Making Big Data Normal with Graph Analysis for the Masses, 2015
http://www.gartner.com/document/3100219
11
ON
100 Best in Show 2015
Magic Quadrant for
Operational DBMS 2015
Neo4j: World’s Leading Graph Database
Technology of the Year
2015, 2014
100 Companies that
Matter the Most in Data
2015
Neo4j named most popular
Graph Database, 2015
Neo4j declared
“Champion”, 2015 & 2016
“Most Popular and Widely
Deployed Database”
Winner of NoSQL: Graph
Database Technologies
DB-Engines Rankings
Source: http://db-engines.com/en/ranking/graph+dbms
12
ONDatabase Technology Architectures
A Portfolio View
Graph DB
Connected DataDiscrete Data
Relational DBMSOther NoSQL
13
ON
Queries can take non-sequential,
arbitrary paths through data
Real-time queries need speed and
consistent response times
Queries must run reliably
with consistent results
Q
A single query can
touch a lot of data
15
Relationship Queries Strain
Traditional Architectures
ON
UNIFIED, IN-MEMORY MAP
Lightning-fast
queries due to
replicated in-memory
architecture +
index-free adjacency
MACHINE 1 MACHINE 2 MACHINE 3
Slow queries
due to
index lookups +
network hops
Neo4j on IBM POWER8
Using Other NoSQL to Join Data
Q R
Q R
16
Data Relationship Queries
ONTraversal Speeds
• Realistic retail dataset from Amazon
• Social recommendation (Java procedure) equivalent to:
MATCH (you)-[:BOUGHT]->(something)<-[:BOUGHT]-(other)-[:BOUGHT]->(reco)
WHERE id(you)={id}
RETURN reco
Threads Hops/second
1 3-4M
10 17-29M
20 34-50M
30 36-60M
17
ONWrite Scale
• Import highly connected
Friendster dataset
• 1.8 billion relationships
takes around 20 minutes
• That is 1M
writes/second!
18
ON
Good News for Real-Time, In-Memory Graph Queries:
Big RAM is Eating Big Data
19
ONValue from Data Relationships: Top Use Cases
Internal Applications
Fraud Detection
Master Data Management
Network and
IT Operations
Customer-Facing Applications
Real-Time Recommendations
Graph-Based Search
Identity and
Access Management
20
ONSolving Massive-Scale Challenges: Recommendations
21
People, Places, Things +
Interests +
Transactions +
Activity
Each requires a new & higher
level of scaling
ONSolving Massive-Scale Challenges: Fraud Detection
Estimated cost in 2014 $16.31B 1
Fraud and the costs to prevent fraud
are up 94% year over year 2
62% of companies subject to
payment fraud 3
Nearly 1 out of 4 declined
transactions are false positives 4
22 1 The Nilson Report, 2015; 2, 4 2015 LexisNexis True Cost of Fraud Survey; 3 2015 AFP Payments Fraud and Control Survey
ON
Pros
Simple
Stops rookies
Traditional Fraud Detection: Discrete Data Analysis
Revolving
Debt
INVESTIGATE
INVESTIGATE
Number of accounts
Cons
False positives
False negatives
23
ONWhat’s Needed: Connected Analysis
Revolving
Debt
Number of accounts
ADVANTAGES
Detect fraud rings
Fewer false negatives
24
Stop fraud attempts in flight:
Carry out graph “walks” in real-time for transactions &
key events, revealing suspicious “loops”
Speed manual fraud analysis:
Empower fraud analysts with graph-based analysis, to
more quickly and accurately identify complex fraud
ON
Gartner’s Layered Fraud Prevention Approach (4)
(4) http://www.gartner.com/newsroom/id/1695014
Traditional Fraud Prevention: Batch Analysis
Analysis
of users
and their
endpoints
Analysis of
navigation
behavior and
suspect
patterns
Analysis of
anomaly
behavior by
channel
Analysis of
anomaly
behavior
correlated
across channels
Analysis of
relationships
to detect
organized crime
and collusion
Layer 1
Endpoint-
Centric
Navigation-
Centric
Account-
Centric
Cross-
Channel
Entity
Linking
Layer 2 Layer 3 Layer 4 Layer 5
DISCRETE DATA ANALYSIS CONNECTED ANALYSIS
25
ON
ON
IBM Power Systems
Portfolio Overview and
Value for Neo4j
ON
4X
Threads per core*
4X
Mem. Bandwidth*
4X
More cache* @
Lower Latency
SMT=Simultaneous Multi-Threading
OLTP = On-Line Transaction Processing
These design decisions result in best performance for data centric workloads like:
Database, NoSQL, Big Data Analytics, OLTP
POWER8: Designed for data to deliver breakthrough
performance
POWER8
SMT8
x86
Hyperthread
Parallel Processing
POWER8
pipe
Data flow
x86 pipe POWER8
x86 POWER8 +
OpenPOWER
x86
27
ON
250 Worldwide members
of
30 Hardware and
technology providers
100+ Collaborative
2,500+
Linux ISVs developing
on POWER
100,000+
Open source packages
innovations under way
The POWER of an open ecosystem
28
ON
This is what a revolution looks like
29
ON
Power Systems Portfolio – Enterprise and Scale-
out offerings
Offering OS capability Positioning in the Linux portfolio
Scale-up
E880
E870
E850
Equally run AIX, IBM i and
Linux with IFLs
Enterprise systems
Leadership Performance and Reliability
Utilization Guarantee (PowerVM – 70%/80%)
Flexible, dynamic Capacity on Demand & Enterprise Pools
Scale-Out
S824
S822
S814
Equally Run AIX, IBM i
and Linux
Scale out Systems
Utilization Guarantee (PowerVM – 65)
High performance, availability and resiliency
L line
S824L
S822L
S812L
Linux Only
Scale out Linux Systems
Price/Performance Leadership vs. X86
PowerVM, KVM
LC line
S812LC
S822LC BD
S822LC HPC
S821LC
Linux Only
Cluster-optimized Linux Systems
Lowest cost Power System
KVM
New
30
ON
- Design and cost
optimized for
deployments of
multiples (cloud and
cluster)
- Broad number of
optimal solutions
- Co-Designed with the
OpenPOWER Ecosystem Supported by Canonical
IBM Support
Community / 3rd
Party Support
running
The LC Line
The L Line
PurePower
Enterprise
& IFLs
- Enterprise level RAS
for single system
deployments
- Solutions for Big
Data & Analytics
- Converged
infrastructure
offering
- Rapid time to value
and simplicity of
management
- Enterprise level
robustness and IFL
capability
- Solution editions for
in memory
databases
- (HANA, DB2 BLU)
- Hosted cloud and
hybrid cloud
solutions
- Rapid deployments
and POCs
The IBM Power Systems Linux Portfolio
Pipeline of innovation
• Broad Linux portfolio deliver
all your Linux deployment
needs
• Expanding LC portfolio with
two servers for data centric
applications and 2nd
generation HPC server
POWER8 is designed for the Big Data era and delivers
price-performance leadership to the Linux Market!
31
ONPOWER8 CAPI
Coherent Accelerator Processor Interface
Custom
Hardware
Application
POWER8
CAPP
Coherence Bus
PSL
FPGA or ASIC
Customizable Hardware
Application Accelerator
• Specific system SW, middleware, or user application
• Written to durable interface provided by PSL
POWER8
PCIe Gen3
Transport for encapsulated messages
Processor Service Layer (PSL)
• Present robust, durable interfaces to applications
• Offload complexity / content from CAPP
Virtual Addressing
• Accelerator can work with same memory addresses that
the processors use
• Pointers de-referenced same as the host application
• Removes OS & device driver overhead
Hardware Managed Cache Coherence
• Enables the accelerator to participate in “Locks” as a
normal thread Lowers Latency over IO communication
model
32
http://opencapi.org/
ONWhy CAPI is Better than Traditional PCIe
CAP
P
PCI
e
Power Processor
FPGA
AFU
IBM Supplied POWER
Service Layer
Typical I/O Model Flow
Flow with a Coherent Model
Shared Mem.
Notify Accelerator
Acceleration
Shared Memory
Completion
DD Call
Copy or Pin
Source Data
MMIO Notify
Accelerator
Acceleration
Poll / Int
Completion
Copy or Unpin
Result Data
Ret. From DD
Completion
Advantages of Coherent Attachment Over I/O Attachment
• Virtual Addressing & Data Caching
– Shared Memory
– Lower latency for highly referenced data
• Easier, More Natural Programming Model
– Traditional thread level programming
– Long latency of I/O typically requires
restructuring of application
• Enables Applications Not Possible on I/O
– Pointer chasing, etc…
33
Total ~13µs for data prep
Total 0.36µs
ON
IBM Data Engine for NoSQL is an integrated platform for large and fast growing NoSQL data stores. It
builds on the CAPI capability of POWER8 systems and provides super-fast access to large flash
storage capacity. It delivers high speed access to both RAM and flash storage which can result in
significantly lower cost, and higher workload density for NoSQL deployments than a standard RAM-
based system. The solution offers superior performance and price-performance to scale out x86
server deployments that are either limited in available memory per server or have flash memory with
limited data access latency.
IBM Data Engine for NoSQL
Cost Savings for In-Memory NoSQL Data Stores
Up to 57TB of extended memory with one POWER8 server + CAPI attach FLASH
Power S822L /
S812L
Flash System 900
Power S822L / S812L / S822 LC
NEW
External Flash Configuration Integrated Flash Configuration
Up to 8TB of super-fast storage tier on one POWER8 server
34
ONCAPI Unlocks the Next Level of Performance for Flash
Identical hardware with 3 different
paths to data
FlashSystem
Conventional
I/O (FC) CAPI - E
0
50,000
100,000
150,000
200,000
250,000
300,000
350,000
400,000
450,000
Conventional CAPI - I CAPI - E
IOPS per Hardware Thread
0
20
40
60
80
100
120
140
160
180
200
Conventional CAPI - I CAPI - E
Latency (microseconds)
IBM POWER S822L
>3x better IOPS
per HW thread
Lower latency
35
CAPI – I : Integrated CAPI Flash Card
CAPI – E: CAPI attached External Flash
CAPI - I
ON
POWER8 with CAPI enabled acceleration running Neo4j delivers
1.82X the performance versus Intel Broadwell servers with NVMe
711
390
0
100
200
300
400
500
600
700
800
POWER8 x86
Representativemixedworkload
Throughput
IBM Power S822LC (20c/160t) x86 Broadwell Server
(24c/48t)
82%
More
Throughput
• Accelerate Graph Databases with CAPI on POWER8
• Real-World mixed graph transaction workload running Neo4j
on IBM Power S822LC server delivers 1.82X the throughput
versus Intel Xeon E5-2650 v4 server
– POWER8 (20 cores / 128 GB): 711 Ops/sec
– Intel Xeon E5 2650 v4 processor (24 cores / 128
GB): 390 Ops/sec
• Based on IBM internal testing of single system and OS image running mixed graph transaction s based on 200 GB data model internal IBM and Neo4j workload. Conducted under laboratory condition, individual result can vary
based on workload size, use of storage subsystems & other conditions. Data as of October 19, 2016
• IBM Power System S822LC; 20 cores (2 x 10c chips) / 160 threads, POWER8; 128 GB memory (16 x 8GB), 1.6 TB CAPI NVMe adapter , Neo4j 3.0.4, Ubuntu 16.04. Competitive stack: HP Proliant DL380 Gen9; 24 cores (2 x 12c chips) /
48 threads; Intel E5-2650 v4; 128 GB memory,(16 x 8GB), 1.6 TB NVMe adapter, Neo4j 3.0.4, Ubuntu 15.10.
36
ON
POWER8 with CAPI enabled acceleration running Neo4j delivers
1.61X the price-performance versus Intel Xeon E5-2650 v4 with NVMe
IBM Power
S822LC
(20-core, 128GB)
HP
DL380 Gen9
(24-core, 128GB)
Server price*
-3-year warranty
$19,123 $16,911
Mixed graph
transaction Workload
(total operations per second)
711 390
1.61X
Price-Performance
1.82X
Performance
per Server
• Based on IBM internal testing of single system and OS image running mixed graph transaction s based on 200 GB data model internal IBM and Neo4j workload. Conducted under laboratory condition, individual result can vary
based on workload size, use of storage subsystems & other conditions. Data as of October 19, 2016
• IBM Power System S822LC; 20 cores (2 x 10c chips) / 160 threads, POWER8; 128 GB memory (16 x 8GB), 1.6 TB CAPI NVMe adapter , Neo4j 3.0.4, Ubuntu 16.04. Competitive stack: HP Proliant DL380 Gen9; 24 cores (2 x 12c chips) /
48 threads; Intel E5-2650 v4; 128 GB memory,(16 x 8GB), 1.6 TB NVMe adapter, Neo4j 3.0.4, Ubuntu 15.10.
* Pricing is based bundled pricing for S822LC with Integrated CAPI Flash card (IBM ordering system) and HP Web price https://h22174.www2.hp.com/SimplifiedConfig/Index
37
ON
ON
Neo4j on IBM Power Systems
Solves Massive-Scale,
Previously Unsolvable Problems
A paradigm shift accelerating time to
insight and real-time decision making…
Bringing big data insights into action
ONWhere Do I Go Next?
If you think that you have a graph problem
Let’s qualify your use case
• neo4j.com/contact-us
• Info@neotechnology.com
• Your local IBM representative
• OpenDB@us.ibm.com
39
Learn more…
• About graphs & Neo4j @ http://neo4j.com
• Use cases / Case studies / Webinars / Training / Boot
camp for your organization or team
• About IBM Power Systems @ http://www-
03.ibm.com/systems/power
• About IBM Data Engine for NoSQL @ http://www-
03.ibm.com/systems/power/solutions/bigdata-
analytics/data-engine-nosql/

Contenu connexe

Tendances

Achieve Sub-Second Analytics on Apache Kafka with Confluent and Imply
Achieve Sub-Second Analytics on Apache Kafka with Confluent and ImplyAchieve Sub-Second Analytics on Apache Kafka with Confluent and Imply
Achieve Sub-Second Analytics on Apache Kafka with Confluent and Imply
confluent
 
a Real-time Processing System based on Spark streaming int he field of Teleco...
a Real-time Processing System based on Spark streaming int he field of Teleco...a Real-time Processing System based on Spark streaming int he field of Teleco...
a Real-time Processing System based on Spark streaming int he field of Teleco...
DataWorks Summit
 

Tendances (20)

Using a Fast Operational Database to Build Real-time Streaming Aggregations
Using a Fast Operational Database to Build Real-time Streaming AggregationsUsing a Fast Operational Database to Build Real-time Streaming Aggregations
Using a Fast Operational Database to Build Real-time Streaming Aggregations
 
Instrumenting your Instruments
Instrumenting your Instruments Instrumenting your Instruments
Instrumenting your Instruments
 
Big Data Solutions in Azure - David Giard
Big Data Solutions in Azure - David GiardBig Data Solutions in Azure - David Giard
Big Data Solutions in Azure - David Giard
 
Realizing the promise of portable data processing with Apache Beam
Realizing the promise of portable data processing with Apache BeamRealizing the promise of portable data processing with Apache Beam
Realizing the promise of portable data processing with Apache Beam
 
Transform Your Mainframe Data for the Cloud with Precisely and Apache Kafka
Transform Your Mainframe Data for the Cloud with Precisely and Apache KafkaTransform Your Mainframe Data for the Cloud with Precisely and Apache Kafka
Transform Your Mainframe Data for the Cloud with Precisely and Apache Kafka
 
Fast Data – the New Big Data
Fast Data – the New Big DataFast Data – the New Big Data
Fast Data – the New Big Data
 
Real time data ingestion and Hybrid Cloud
Real time data ingestion and Hybrid CloudReal time data ingestion and Hybrid Cloud
Real time data ingestion and Hybrid Cloud
 
Apache Spark vs Apache Spark: An On-Prem Comparison of Databricks and Open-So...
Apache Spark vs Apache Spark: An On-Prem Comparison of Databricks and Open-So...Apache Spark vs Apache Spark: An On-Prem Comparison of Databricks and Open-So...
Apache Spark vs Apache Spark: An On-Prem Comparison of Databricks and Open-So...
 
Databus - LinkedIn's Change Data Capture Pipeline
Databus - LinkedIn's Change Data Capture PipelineDatabus - LinkedIn's Change Data Capture Pipeline
Databus - LinkedIn's Change Data Capture Pipeline
 
Introduction to Apache NiFi dws19 DWS - DC 2019
Introduction to Apache NiFi   dws19 DWS - DC 2019Introduction to Apache NiFi   dws19 DWS - DC 2019
Introduction to Apache NiFi dws19 DWS - DC 2019
 
user Behavior Analysis with Session Windows and Apache Kafka's Streams API
user Behavior Analysis with Session Windows and Apache Kafka's Streams APIuser Behavior Analysis with Session Windows and Apache Kafka's Streams API
user Behavior Analysis with Session Windows and Apache Kafka's Streams API
 
Evolving from Messaging to Event Streaming
Evolving from Messaging to Event StreamingEvolving from Messaging to Event Streaming
Evolving from Messaging to Event Streaming
 
Druid Overview by Rachel Pedreschi
Druid Overview by Rachel PedreschiDruid Overview by Rachel Pedreschi
Druid Overview by Rachel Pedreschi
 
Achieve Sub-Second Analytics on Apache Kafka with Confluent and Imply
Achieve Sub-Second Analytics on Apache Kafka with Confluent and ImplyAchieve Sub-Second Analytics on Apache Kafka with Confluent and Imply
Achieve Sub-Second Analytics on Apache Kafka with Confluent and Imply
 
Big Data Kappa | Mark Senerth, The Walt Disney Company - DMED, Data Tech
Big Data Kappa | Mark Senerth, The Walt Disney Company - DMED, Data TechBig Data Kappa | Mark Senerth, The Walt Disney Company - DMED, Data Tech
Big Data Kappa | Mark Senerth, The Walt Disney Company - DMED, Data Tech
 
Self-Service Data Ingestion Using NiFi, StreamSets & Kafka
Self-Service Data Ingestion Using NiFi, StreamSets & KafkaSelf-Service Data Ingestion Using NiFi, StreamSets & Kafka
Self-Service Data Ingestion Using NiFi, StreamSets & Kafka
 
Mainframe Modernization with Precisely and Microsoft Azure
Mainframe Modernization with Precisely and Microsoft AzureMainframe Modernization with Precisely and Microsoft Azure
Mainframe Modernization with Precisely and Microsoft Azure
 
Apache Druid 101
Apache Druid 101Apache Druid 101
Apache Druid 101
 
a Real-time Processing System based on Spark streaming int he field of Teleco...
a Real-time Processing System based on Spark streaming int he field of Teleco...a Real-time Processing System based on Spark streaming int he field of Teleco...
a Real-time Processing System based on Spark streaming int he field of Teleco...
 
Spark meetup - Zoomdata Streaming
Spark meetup  - Zoomdata StreamingSpark meetup  - Zoomdata Streaming
Spark meetup - Zoomdata Streaming
 

En vedette

Using Graph theory to understand Intent & Concepts - Neo4j User Group (Januar...
Using Graph theory to understand Intent & Concepts - Neo4j User Group (Januar...Using Graph theory to understand Intent & Concepts - Neo4j User Group (Januar...
Using Graph theory to understand Intent & Concepts - Neo4j User Group (Januar...
TUMRA | Big Data Science - Gain a competitive advantage through Big Data & Data Science
 
OPEN_POWER8_SESSION_20150316
OPEN_POWER8_SESSION_20150316OPEN_POWER8_SESSION_20150316
OPEN_POWER8_SESSION_20150316
기한 김
 

En vedette (20)

Introducing Neo4j 3.1: New Security and Clustering Architecture
Introducing Neo4j 3.1: New Security and Clustering Architecture Introducing Neo4j 3.1: New Security and Clustering Architecture
Introducing Neo4j 3.1: New Security and Clustering Architecture
 
Intro to Graphs and Neo4j
Intro to Graphs and Neo4jIntro to Graphs and Neo4j
Intro to Graphs and Neo4j
 
Why Traditional Web Security Technologies no Longer Suffice to Keep You Safe
Why Traditional Web Security Technologies no Longer Suffice to Keep You SafeWhy Traditional Web Security Technologies no Longer Suffice to Keep You Safe
Why Traditional Web Security Technologies no Longer Suffice to Keep You Safe
 
Mitsubishi 20161109 v5
Mitsubishi 20161109 v5Mitsubishi 20161109 v5
Mitsubishi 20161109 v5
 
Cognitive startups 20160708 v7
Cognitive startups 20160708 v7Cognitive startups 20160708 v7
Cognitive startups 20160708 v7
 
Neo4j Import Webinar
Neo4j Import WebinarNeo4j Import Webinar
Neo4j Import Webinar
 
MongoDB Evenings DC: MongoDB - The New Default Database for Giant Ideas
MongoDB Evenings DC: MongoDB - The New Default Database for Giant IdeasMongoDB Evenings DC: MongoDB - The New Default Database for Giant Ideas
MongoDB Evenings DC: MongoDB - The New Default Database for Giant Ideas
 
Using Graph theory to understand Intent & Concepts - Neo4j User Group (Januar...
Using Graph theory to understand Intent & Concepts - Neo4j User Group (Januar...Using Graph theory to understand Intent & Concepts - Neo4j User Group (Januar...
Using Graph theory to understand Intent & Concepts - Neo4j User Group (Januar...
 
Webinar: High Performance MongoDB Applications with IBM POWER8
Webinar: High Performance MongoDB Applications with IBM POWER8Webinar: High Performance MongoDB Applications with IBM POWER8
Webinar: High Performance MongoDB Applications with IBM POWER8
 
How the IBM Platform LSF Architecture Accelerates Technical Computing
How the IBM Platform LSF Architecture Accelerates Technical ComputingHow the IBM Platform LSF Architecture Accelerates Technical Computing
How the IBM Platform LSF Architecture Accelerates Technical Computing
 
IBM Power Systems Open Innovation
IBM Power Systems Open InnovationIBM Power Systems Open Innovation
IBM Power Systems Open Innovation
 
New Generation of IBM Power Systems Delivering value with Red Hat Enterprise ...
New Generation of IBM Power Systems Delivering value with Red Hat Enterprise ...New Generation of IBM Power Systems Delivering value with Red Hat Enterprise ...
New Generation of IBM Power Systems Delivering value with Red Hat Enterprise ...
 
Using neo4j for enterprise metadata requirements
Using neo4j for enterprise metadata requirementsUsing neo4j for enterprise metadata requirements
Using neo4j for enterprise metadata requirements
 
IBM Power & SUSE at SAPPHIRE 2016
IBM Power & SUSE at SAPPHIRE 2016IBM Power & SUSE at SAPPHIRE 2016
IBM Power & SUSE at SAPPHIRE 2016
 
Neo4j Introduction - Game of Thrones
Neo4j Introduction  - Game of ThronesNeo4j Introduction  - Game of Thrones
Neo4j Introduction - Game of Thrones
 
IBM Power Systems Update 1Q17
IBM Power Systems Update 1Q17IBM Power Systems Update 1Q17
IBM Power Systems Update 1Q17
 
IBM Flash System® Family webinar 9 de noviembre de 2016
IBM Flash System® Family   webinar 9 de noviembre de 2016 IBM Flash System® Family   webinar 9 de noviembre de 2016
IBM Flash System® Family webinar 9 de noviembre de 2016
 
ETL into Neo4j
ETL into Neo4jETL into Neo4j
ETL into Neo4j
 
OPEN_POWER8_SESSION_20150316
OPEN_POWER8_SESSION_20150316OPEN_POWER8_SESSION_20150316
OPEN_POWER8_SESSION_20150316
 
Introducing Neo4j
Introducing Neo4jIntroducing Neo4j
Introducing Neo4j
 

Similaire à Webinar: Large Scale Graph Processing with IBM Power Systems & Neo4j

Ibm symp14 referentin_barbara koch_power_8 launch bk
Ibm symp14 referentin_barbara koch_power_8 launch bkIbm symp14 referentin_barbara koch_power_8 launch bk
Ibm symp14 referentin_barbara koch_power_8 launch bk
IBM Switzerland
 
Informix & IWA : Operational analytics performance
Informix & IWA : Operational analytics performanceInformix & IWA : Operational analytics performance
Informix & IWA : Operational analytics performance
Keshav Murthy
 

Similaire à Webinar: Large Scale Graph Processing with IBM Power Systems & Neo4j (20)

Ibm symp14 referentin_barbara koch_power_8 launch bk
Ibm symp14 referentin_barbara koch_power_8 launch bkIbm symp14 referentin_barbara koch_power_8 launch bk
Ibm symp14 referentin_barbara koch_power_8 launch bk
 
IBM Data Centric Systems & OpenPOWER
IBM Data Centric Systems & OpenPOWERIBM Data Centric Systems & OpenPOWER
IBM Data Centric Systems & OpenPOWER
 
OpenPOWER Acceleration of HPCC Systems
OpenPOWER Acceleration of HPCC SystemsOpenPOWER Acceleration of HPCC Systems
OpenPOWER Acceleration of HPCC Systems
 
Stream Data Processing at Big Data Landscape by Oleksandr Fedirko
Stream Data Processing at Big Data Landscape by Oleksandr Fedirko Stream Data Processing at Big Data Landscape by Oleksandr Fedirko
Stream Data Processing at Big Data Landscape by Oleksandr Fedirko
 
Demystify OpenPOWER
Demystify OpenPOWERDemystify OpenPOWER
Demystify OpenPOWER
 
Scaling Redis Cluster Deployments for Genome Analysis (featuring LSU) - Terry...
Scaling Redis Cluster Deployments for Genome Analysis (featuring LSU) - Terry...Scaling Redis Cluster Deployments for Genome Analysis (featuring LSU) - Terry...
Scaling Redis Cluster Deployments for Genome Analysis (featuring LSU) - Terry...
 
RedisConf17 - Redis Enterprise on IBM Power Systems
RedisConf17 - Redis Enterprise on IBM Power SystemsRedisConf17 - Redis Enterprise on IBM Power Systems
RedisConf17 - Redis Enterprise on IBM Power Systems
 
Graph Hardware Architecture - Enterprise graphs deserve great hardware!
Graph Hardware Architecture - Enterprise graphs deserve great hardware!Graph Hardware Architecture - Enterprise graphs deserve great hardware!
Graph Hardware Architecture - Enterprise graphs deserve great hardware!
 
Informix & IWA : Operational analytics performance
Informix & IWA : Operational analytics performanceInformix & IWA : Operational analytics performance
Informix & IWA : Operational analytics performance
 
IBM Power Systems: Designed for Data
IBM Power Systems: Designed for DataIBM Power Systems: Designed for Data
IBM Power Systems: Designed for Data
 
Intelligent Integration OOW2017 - Jeff Pollock
Intelligent Integration OOW2017 - Jeff PollockIntelligent Integration OOW2017 - Jeff Pollock
Intelligent Integration OOW2017 - Jeff Pollock
 
OpenPOWER Foundation Overview
OpenPOWER Foundation OverviewOpenPOWER Foundation Overview
OpenPOWER Foundation Overview
 
What to Expect for Big Data and Apache Spark in 2017
What to Expect for Big Data and Apache Spark in 2017 What to Expect for Big Data and Apache Spark in 2017
What to Expect for Big Data and Apache Spark in 2017
 
2016 August POWER Up Your Insights - IBM System Summit Mumbai
2016 August POWER Up Your Insights - IBM System Summit Mumbai2016 August POWER Up Your Insights - IBM System Summit Mumbai
2016 August POWER Up Your Insights - IBM System Summit Mumbai
 
XDF 2019 Xilinx Accelerated Database and Data Analytics Ecosystem
XDF 2019 Xilinx Accelerated Database and Data Analytics EcosystemXDF 2019 Xilinx Accelerated Database and Data Analytics Ecosystem
XDF 2019 Xilinx Accelerated Database and Data Analytics Ecosystem
 
OpenCAPI next generation accelerator
OpenCAPI next generation accelerator OpenCAPI next generation accelerator
OpenCAPI next generation accelerator
 
How Kafka and Modern Databases Benefit Apps and Analytics
How Kafka and Modern Databases Benefit Apps and AnalyticsHow Kafka and Modern Databases Benefit Apps and Analytics
How Kafka and Modern Databases Benefit Apps and Analytics
 
Building a High Performance Analytics Platform
Building a High Performance Analytics PlatformBuilding a High Performance Analytics Platform
Building a High Performance Analytics Platform
 
Big Data Analytics on the Cloud Oracle Applications AWS Redshift & Tableau
Big Data Analytics on the Cloud Oracle Applications AWS Redshift & TableauBig Data Analytics on the Cloud Oracle Applications AWS Redshift & Tableau
Big Data Analytics on the Cloud Oracle Applications AWS Redshift & Tableau
 
Calum McCrea, Software Engineer at Kx Systems, "Kx: How Wall Street Tech can ...
Calum McCrea, Software Engineer at Kx Systems, "Kx: How Wall Street Tech can ...Calum McCrea, Software Engineer at Kx Systems, "Kx: How Wall Street Tech can ...
Calum McCrea, Software Engineer at Kx Systems, "Kx: How Wall Street Tech can ...
 

Plus de Neo4j

Plus de Neo4j (20)

Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
QIAGEN: Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
QIAGEN: Biomedical Knowledge Graphs for Data Scientists and BioinformaticiansQIAGEN: Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
QIAGEN: Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
 
EY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityEY_Graph Database Powered Sustainability
EY_Graph Database Powered Sustainability
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdf
 
ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...
ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...
ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...
 
BBVA - GraphSummit Madrid - Caso de éxito en BBVA: Optimizando con grafos
BBVA - GraphSummit Madrid - Caso de éxito en BBVA: Optimizando con grafosBBVA - GraphSummit Madrid - Caso de éxito en BBVA: Optimizando con grafos
BBVA - GraphSummit Madrid - Caso de éxito en BBVA: Optimizando con grafos
 
Graph Everywhere - Josep Taruella - Por qué Graph Data Science en tus modelos...
Graph Everywhere - Josep Taruella - Por qué Graph Data Science en tus modelos...Graph Everywhere - Josep Taruella - Por qué Graph Data Science en tus modelos...
Graph Everywhere - Josep Taruella - Por qué Graph Data Science en tus modelos...
 
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4j
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4jGraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4j
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4j
 
Neo4j_Exploring the Impact of Graph Technology on Financial Services.pdf
Neo4j_Exploring the Impact of Graph Technology on Financial Services.pdfNeo4j_Exploring the Impact of Graph Technology on Financial Services.pdf
Neo4j_Exploring the Impact of Graph Technology on Financial Services.pdf
 
Rabobank_Exploring the Impact of Graph Technology on Financial Services.pdf
Rabobank_Exploring the Impact of Graph Technology on Financial Services.pdfRabobank_Exploring the Impact of Graph Technology on Financial Services.pdf
Rabobank_Exploring the Impact of Graph Technology on Financial Services.pdf
 
Webinar - IA generativa e grafi Neo4j: RAG time!
Webinar - IA generativa e grafi Neo4j: RAG time!Webinar - IA generativa e grafi Neo4j: RAG time!
Webinar - IA generativa e grafi Neo4j: RAG time!
 
IA Generativa y Grafos de Neo4j: RAG time
IA Generativa y Grafos de Neo4j: RAG timeIA Generativa y Grafos de Neo4j: RAG time
IA Generativa y Grafos de Neo4j: RAG time
 
Neo4j: Data Engineering for RAG (retrieval augmented generation)
Neo4j: Data Engineering for RAG (retrieval augmented generation)Neo4j: Data Engineering for RAG (retrieval augmented generation)
Neo4j: Data Engineering for RAG (retrieval augmented generation)
 
Neo4j Graph Summit 2024 Workshop - EMEA - Breda_and_Munchen.pdf
Neo4j Graph Summit 2024 Workshop - EMEA - Breda_and_Munchen.pdfNeo4j Graph Summit 2024 Workshop - EMEA - Breda_and_Munchen.pdf
Neo4j Graph Summit 2024 Workshop - EMEA - Breda_and_Munchen.pdf
 
Enabling GenAI Breakthroughs with Knowledge Graphs
Enabling GenAI Breakthroughs with Knowledge GraphsEnabling GenAI Breakthroughs with Knowledge Graphs
Enabling GenAI Breakthroughs with Knowledge Graphs
 
Neo4j_Anurag Tandon_Product Vision and Roadmap.Benelux.pptx.pdf
Neo4j_Anurag Tandon_Product Vision and Roadmap.Benelux.pptx.pdfNeo4j_Anurag Tandon_Product Vision and Roadmap.Benelux.pptx.pdf
Neo4j_Anurag Tandon_Product Vision and Roadmap.Benelux.pptx.pdf
 
Neo4j Jesus Barrasa The Art of the Possible with Graph
Neo4j Jesus Barrasa The Art of the Possible with GraphNeo4j Jesus Barrasa The Art of the Possible with Graph
Neo4j Jesus Barrasa The Art of the Possible with Graph
 

Dernier

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 

Dernier (20)

How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 

Webinar: Large Scale Graph Processing with IBM Power Systems & Neo4j

  • 1. ON ON Neo4j on IBM POWER8 Philip Rathle VP of Products Neo Technology Keshav Ranganathan Senior Offering Manager, Data & Analytics Solutions IBM POWER Systems
  • 2. ONNeo4j on IBM POWER Systems Key Takeaways: • Why Graphs & Why Now? • Unique Characteristics of Graph Data & Architecture Implications • IBM Power Systems Overview • Why deploy Neo4j on IBM Power Systems • Q&A 2
  • 3. ON ON Neo4j on IBM Power Systems Solves Massive-Scale, Previously Unsolvable Problems A paradigm shift accelerating time to insight and real-time decision making… Bringing big data insights into action
  • 4. ON A Walk Back in Time… 4
  • 5. ON Data Management in 1990 Paper Forms Tiny RAM Spinning Platters (Low Capacity / Sequential IO)5
  • 7. ON Data Management in 2016 Dynamic Real-World Systems Abundant RAM Flash & CAPI (High-Capacity Storage & Ultra-Fast Random I/O) 7
  • 10. ON Sep 2015May 2015Jan 2015Sep 2014May 2014Jan 2014Sep 2013May 2013 100 PopularityChanges 500 600 700 200 300 400 Jan 2013 © DB-Engines.com 2015 Popular Movement • Wide column stores • RDF stores • Document stores • Search engines • Native XML DBMS • Key-value stores • Object oriented DBMS • Multivalue DBMS • Times Series DBMS Relational database Graph Database 10
  • 11. ON “Graph analysis is possibly the single most effective competitive differentiator for organizations pursuing data-driven operations and decisions after the design of data capture.” By the end of 2018, 70% of leading organizations will have one or more pilot or proof-of-concept efforts underway utilizing graph databases. Analyst Perspective “Forrester estimates that over 25% of enterprises will be using graph databases by 2017” IT Market Clock for Database Management Systems, 2014 https://www.gartner.com/doc/2852717/it-market-clock-database-management TechRadar™: Enterprise DBMS, Q1 2014 http://www.forrester.com/TechRadar+Enterprise+DBMS+Q1+2014/fulltext/-/E-RES106801 Making Big Data Normal with Graph Analysis for the Masses, 2015 http://www.gartner.com/document/3100219 11
  • 12. ON 100 Best in Show 2015 Magic Quadrant for Operational DBMS 2015 Neo4j: World’s Leading Graph Database Technology of the Year 2015, 2014 100 Companies that Matter the Most in Data 2015 Neo4j named most popular Graph Database, 2015 Neo4j declared “Champion”, 2015 & 2016 “Most Popular and Widely Deployed Database” Winner of NoSQL: Graph Database Technologies DB-Engines Rankings Source: http://db-engines.com/en/ranking/graph+dbms 12
  • 13. ONDatabase Technology Architectures A Portfolio View Graph DB Connected DataDiscrete Data Relational DBMSOther NoSQL 13
  • 14. ON Queries can take non-sequential, arbitrary paths through data Real-time queries need speed and consistent response times Queries must run reliably with consistent results Q A single query can touch a lot of data 15 Relationship Queries Strain Traditional Architectures
  • 15. ON UNIFIED, IN-MEMORY MAP Lightning-fast queries due to replicated in-memory architecture + index-free adjacency MACHINE 1 MACHINE 2 MACHINE 3 Slow queries due to index lookups + network hops Neo4j on IBM POWER8 Using Other NoSQL to Join Data Q R Q R 16 Data Relationship Queries
  • 16. ONTraversal Speeds • Realistic retail dataset from Amazon • Social recommendation (Java procedure) equivalent to: MATCH (you)-[:BOUGHT]->(something)<-[:BOUGHT]-(other)-[:BOUGHT]->(reco) WHERE id(you)={id} RETURN reco Threads Hops/second 1 3-4M 10 17-29M 20 34-50M 30 36-60M 17
  • 17. ONWrite Scale • Import highly connected Friendster dataset • 1.8 billion relationships takes around 20 minutes • That is 1M writes/second! 18
  • 18. ON Good News for Real-Time, In-Memory Graph Queries: Big RAM is Eating Big Data 19
  • 19. ONValue from Data Relationships: Top Use Cases Internal Applications Fraud Detection Master Data Management Network and IT Operations Customer-Facing Applications Real-Time Recommendations Graph-Based Search Identity and Access Management 20
  • 20. ONSolving Massive-Scale Challenges: Recommendations 21 People, Places, Things + Interests + Transactions + Activity Each requires a new & higher level of scaling
  • 21. ONSolving Massive-Scale Challenges: Fraud Detection Estimated cost in 2014 $16.31B 1 Fraud and the costs to prevent fraud are up 94% year over year 2 62% of companies subject to payment fraud 3 Nearly 1 out of 4 declined transactions are false positives 4 22 1 The Nilson Report, 2015; 2, 4 2015 LexisNexis True Cost of Fraud Survey; 3 2015 AFP Payments Fraud and Control Survey
  • 22. ON Pros Simple Stops rookies Traditional Fraud Detection: Discrete Data Analysis Revolving Debt INVESTIGATE INVESTIGATE Number of accounts Cons False positives False negatives 23
  • 23. ONWhat’s Needed: Connected Analysis Revolving Debt Number of accounts ADVANTAGES Detect fraud rings Fewer false negatives 24 Stop fraud attempts in flight: Carry out graph “walks” in real-time for transactions & key events, revealing suspicious “loops” Speed manual fraud analysis: Empower fraud analysts with graph-based analysis, to more quickly and accurately identify complex fraud
  • 24. ON Gartner’s Layered Fraud Prevention Approach (4) (4) http://www.gartner.com/newsroom/id/1695014 Traditional Fraud Prevention: Batch Analysis Analysis of users and their endpoints Analysis of navigation behavior and suspect patterns Analysis of anomaly behavior by channel Analysis of anomaly behavior correlated across channels Analysis of relationships to detect organized crime and collusion Layer 1 Endpoint- Centric Navigation- Centric Account- Centric Cross- Channel Entity Linking Layer 2 Layer 3 Layer 4 Layer 5 DISCRETE DATA ANALYSIS CONNECTED ANALYSIS 25
  • 25. ON ON IBM Power Systems Portfolio Overview and Value for Neo4j
  • 26. ON 4X Threads per core* 4X Mem. Bandwidth* 4X More cache* @ Lower Latency SMT=Simultaneous Multi-Threading OLTP = On-Line Transaction Processing These design decisions result in best performance for data centric workloads like: Database, NoSQL, Big Data Analytics, OLTP POWER8: Designed for data to deliver breakthrough performance POWER8 SMT8 x86 Hyperthread Parallel Processing POWER8 pipe Data flow x86 pipe POWER8 x86 POWER8 + OpenPOWER x86 27
  • 27. ON 250 Worldwide members of 30 Hardware and technology providers 100+ Collaborative 2,500+ Linux ISVs developing on POWER 100,000+ Open source packages innovations under way The POWER of an open ecosystem 28
  • 28. ON This is what a revolution looks like 29
  • 29. ON Power Systems Portfolio – Enterprise and Scale- out offerings Offering OS capability Positioning in the Linux portfolio Scale-up E880 E870 E850 Equally run AIX, IBM i and Linux with IFLs Enterprise systems Leadership Performance and Reliability Utilization Guarantee (PowerVM – 70%/80%) Flexible, dynamic Capacity on Demand & Enterprise Pools Scale-Out S824 S822 S814 Equally Run AIX, IBM i and Linux Scale out Systems Utilization Guarantee (PowerVM – 65) High performance, availability and resiliency L line S824L S822L S812L Linux Only Scale out Linux Systems Price/Performance Leadership vs. X86 PowerVM, KVM LC line S812LC S822LC BD S822LC HPC S821LC Linux Only Cluster-optimized Linux Systems Lowest cost Power System KVM New 30
  • 30. ON - Design and cost optimized for deployments of multiples (cloud and cluster) - Broad number of optimal solutions - Co-Designed with the OpenPOWER Ecosystem Supported by Canonical IBM Support Community / 3rd Party Support running The LC Line The L Line PurePower Enterprise & IFLs - Enterprise level RAS for single system deployments - Solutions for Big Data & Analytics - Converged infrastructure offering - Rapid time to value and simplicity of management - Enterprise level robustness and IFL capability - Solution editions for in memory databases - (HANA, DB2 BLU) - Hosted cloud and hybrid cloud solutions - Rapid deployments and POCs The IBM Power Systems Linux Portfolio Pipeline of innovation • Broad Linux portfolio deliver all your Linux deployment needs • Expanding LC portfolio with two servers for data centric applications and 2nd generation HPC server POWER8 is designed for the Big Data era and delivers price-performance leadership to the Linux Market! 31
  • 31. ONPOWER8 CAPI Coherent Accelerator Processor Interface Custom Hardware Application POWER8 CAPP Coherence Bus PSL FPGA or ASIC Customizable Hardware Application Accelerator • Specific system SW, middleware, or user application • Written to durable interface provided by PSL POWER8 PCIe Gen3 Transport for encapsulated messages Processor Service Layer (PSL) • Present robust, durable interfaces to applications • Offload complexity / content from CAPP Virtual Addressing • Accelerator can work with same memory addresses that the processors use • Pointers de-referenced same as the host application • Removes OS & device driver overhead Hardware Managed Cache Coherence • Enables the accelerator to participate in “Locks” as a normal thread Lowers Latency over IO communication model 32 http://opencapi.org/
  • 32. ONWhy CAPI is Better than Traditional PCIe CAP P PCI e Power Processor FPGA AFU IBM Supplied POWER Service Layer Typical I/O Model Flow Flow with a Coherent Model Shared Mem. Notify Accelerator Acceleration Shared Memory Completion DD Call Copy or Pin Source Data MMIO Notify Accelerator Acceleration Poll / Int Completion Copy or Unpin Result Data Ret. From DD Completion Advantages of Coherent Attachment Over I/O Attachment • Virtual Addressing & Data Caching – Shared Memory – Lower latency for highly referenced data • Easier, More Natural Programming Model – Traditional thread level programming – Long latency of I/O typically requires restructuring of application • Enables Applications Not Possible on I/O – Pointer chasing, etc… 33 Total ~13µs for data prep Total 0.36µs
  • 33. ON IBM Data Engine for NoSQL is an integrated platform for large and fast growing NoSQL data stores. It builds on the CAPI capability of POWER8 systems and provides super-fast access to large flash storage capacity. It delivers high speed access to both RAM and flash storage which can result in significantly lower cost, and higher workload density for NoSQL deployments than a standard RAM- based system. The solution offers superior performance and price-performance to scale out x86 server deployments that are either limited in available memory per server or have flash memory with limited data access latency. IBM Data Engine for NoSQL Cost Savings for In-Memory NoSQL Data Stores Up to 57TB of extended memory with one POWER8 server + CAPI attach FLASH Power S822L / S812L Flash System 900 Power S822L / S812L / S822 LC NEW External Flash Configuration Integrated Flash Configuration Up to 8TB of super-fast storage tier on one POWER8 server 34
  • 34. ONCAPI Unlocks the Next Level of Performance for Flash Identical hardware with 3 different paths to data FlashSystem Conventional I/O (FC) CAPI - E 0 50,000 100,000 150,000 200,000 250,000 300,000 350,000 400,000 450,000 Conventional CAPI - I CAPI - E IOPS per Hardware Thread 0 20 40 60 80 100 120 140 160 180 200 Conventional CAPI - I CAPI - E Latency (microseconds) IBM POWER S822L >3x better IOPS per HW thread Lower latency 35 CAPI – I : Integrated CAPI Flash Card CAPI – E: CAPI attached External Flash CAPI - I
  • 35. ON POWER8 with CAPI enabled acceleration running Neo4j delivers 1.82X the performance versus Intel Broadwell servers with NVMe 711 390 0 100 200 300 400 500 600 700 800 POWER8 x86 Representativemixedworkload Throughput IBM Power S822LC (20c/160t) x86 Broadwell Server (24c/48t) 82% More Throughput • Accelerate Graph Databases with CAPI on POWER8 • Real-World mixed graph transaction workload running Neo4j on IBM Power S822LC server delivers 1.82X the throughput versus Intel Xeon E5-2650 v4 server – POWER8 (20 cores / 128 GB): 711 Ops/sec – Intel Xeon E5 2650 v4 processor (24 cores / 128 GB): 390 Ops/sec • Based on IBM internal testing of single system and OS image running mixed graph transaction s based on 200 GB data model internal IBM and Neo4j workload. Conducted under laboratory condition, individual result can vary based on workload size, use of storage subsystems & other conditions. Data as of October 19, 2016 • IBM Power System S822LC; 20 cores (2 x 10c chips) / 160 threads, POWER8; 128 GB memory (16 x 8GB), 1.6 TB CAPI NVMe adapter , Neo4j 3.0.4, Ubuntu 16.04. Competitive stack: HP Proliant DL380 Gen9; 24 cores (2 x 12c chips) / 48 threads; Intel E5-2650 v4; 128 GB memory,(16 x 8GB), 1.6 TB NVMe adapter, Neo4j 3.0.4, Ubuntu 15.10. 36
  • 36. ON POWER8 with CAPI enabled acceleration running Neo4j delivers 1.61X the price-performance versus Intel Xeon E5-2650 v4 with NVMe IBM Power S822LC (20-core, 128GB) HP DL380 Gen9 (24-core, 128GB) Server price* -3-year warranty $19,123 $16,911 Mixed graph transaction Workload (total operations per second) 711 390 1.61X Price-Performance 1.82X Performance per Server • Based on IBM internal testing of single system and OS image running mixed graph transaction s based on 200 GB data model internal IBM and Neo4j workload. Conducted under laboratory condition, individual result can vary based on workload size, use of storage subsystems & other conditions. Data as of October 19, 2016 • IBM Power System S822LC; 20 cores (2 x 10c chips) / 160 threads, POWER8; 128 GB memory (16 x 8GB), 1.6 TB CAPI NVMe adapter , Neo4j 3.0.4, Ubuntu 16.04. Competitive stack: HP Proliant DL380 Gen9; 24 cores (2 x 12c chips) / 48 threads; Intel E5-2650 v4; 128 GB memory,(16 x 8GB), 1.6 TB NVMe adapter, Neo4j 3.0.4, Ubuntu 15.10. * Pricing is based bundled pricing for S822LC with Integrated CAPI Flash card (IBM ordering system) and HP Web price https://h22174.www2.hp.com/SimplifiedConfig/Index 37
  • 37. ON ON Neo4j on IBM Power Systems Solves Massive-Scale, Previously Unsolvable Problems A paradigm shift accelerating time to insight and real-time decision making… Bringing big data insights into action
  • 38. ONWhere Do I Go Next? If you think that you have a graph problem Let’s qualify your use case • neo4j.com/contact-us • Info@neotechnology.com • Your local IBM representative • OpenDB@us.ibm.com 39 Learn more… • About graphs & Neo4j @ http://neo4j.com • Use cases / Case studies / Webinars / Training / Boot camp for your organization or team • About IBM Power Systems @ http://www- 03.ibm.com/systems/power • About IBM Data Engine for NoSQL @ http://www- 03.ibm.com/systems/power/solutions/bigdata- analytics/data-engine-nosql/

Notes de l'éditeur

  1. Agenda: Context & Value Proposition Graph Database Uses, Architectures & Considerations IBM POWER Overview Neo4j and IBM POWER Systems Differentiation Q&A
  2. But also your problem… whatever size it may be. R&D is ongoing in the lab. Current expectations are improvements of 2-4x over Intel with OpenPower, and CAPI Flash.
  3. And deriving value from data-relationships is exactly what some of the most successful companies in the world have done. Google created perhaps the most valuable advertising system of all time on top of their search-enginge, which is based on relationships between webpages. Linkedin created perhaps the most valuable HR-tool ever based on relationships amongst professional And this is also what pay-pal did, creating a peer-to-peer transaction service, based on relationships.
  4. Coherent Accelerator Processor Interface  IBM Data Engine for NoSQL
  5. If you zoom into the previous picture, this is what you see… an individual, at the center of whose nervous system is a brain, that is an incredible network The brain in fact is a network comprised of around 86 billion neurons and over 100 trillion synapses.   Our intelligence lies in the connections: the connectome. And when we process information, currents move along these channels.
  6. Fixa till den där effekten
  7. Continuing on – We have receive very solid validation from these industry watchers that the market we are pursuing represents a huge opportunity and being anointed as the leader in this market that is likely to grow at this rate is very exciting.
  8. Neo4j Perspective = Graph First + Community First We have received recognition for a broad cross section of the industry. On one hand our technology continues to be seen as innovative and impactful in the database management and particularly the NoSQL space. While on the other hand, more and more analyst firms are recognizing us as the most impactful player in the segments they are focused on which is the enterprise segment.
  9. Macro/Portfolio View
  10. Key virtuous cycle that occurs in data, and it’s a common architecture pattern One of the things I’d like you to take away today is a perspective on the use of big data, and relationships in data Similar to the brain, making decisions We’re so often used to making calculations in batch Neo4j on POWER8 enables disruptive innovation by bringing the power of graph analysis into your customer and end-user facing business applications
  11. You might ask: why do you need a special database?
  12. And in SQL, it’s a 20-table join 1M+ pointer-chasing operations per second per hardware thread
  13. Neo4j best in class on traversal speed and scaling reads.
  14. Super important macro-level trend: Jure Leskovec from Stanford: analysis from GRADES 2016 Over the past 10 years there is indication that in data analytics size of RAM has been growing 50% every year while size of the data only by 20%. “Maybe your data increases faster. Maybe you think data is bigger and increasing faster. But facts should trump opinions” – Szilard Pafka Let’s look at the memory. 100G commodity, soon to be high-end laptop territory. 10T off the shelf (Jure’s group bought a 12TB machine) 1P not yet available in a commodity single machine, but 50% per year growth mean’s it’s not far away (~7 years)
  15. Real-time Recommendations: Power real-time recommendations that incorporate current session and historical data, even from multiple data sources.
  16. Fraud Detection: Detect and stop fraud as it happens with real-time analysis of data relationships.
  17. Just like Recommendations: People, Places, Things + Interests + Transactions + Activity
  18. POWER8 compared to Haswell EX Sources: Haswell EX: http://ark.intel.com/products/84685/Intel-Xeon-Processor-E7-8890-v3-45M-Cache-2_50-GHz POWER8: http://www-01.ibm.com/common/ssi/cgi-bin/ssialias?subtype=BR&infotype=PM&appname=STGE_PO_PO_USEN&htmlfid=POB03046USEN Key Message: Our leading IO and Memory capabilities power the need to fast data access and movement across a wide range of BDA applications. POWER8 is the first microprocessor designed for Big Data and Analytics. In IBM we have the advantage of having a hardware and software organization that can work together to optimize the entire solution stack. When systems are designed for big data, there are a couple of key attributes that are important to create a balanced system design. First having the processing capability, second having the memory space, the workspace, and the third is having the bandwidth, the ability to move the information in and out of the system at the rapid speeds required. We’re delivering 4 times more threads per core vs. commodity infrastructure. We can easily support a growing number of users who need reports, or to perform ad hoc analytics. This is because the processor can run more concurrent queries in parallel faster, across multiple cores with more threads per core. We’re delivering 3 times more memory bandwidth. Increased memory bandwidth to access up to 1 TB of memory for data operations and enlarged cache in every processor. This delivers the levels of performance your teams need to make decisions in real time. We’re delivering faster IO to ingest, move and access large volumes of data so that analytics results are available faster. Power Systems provide the capabilities needed to handle the varying analytics initiatives your business requires. Broad range of data and analytics – from operational to computational to business analytics, as well as cognitive solutions leveraging IBM Watson technology, Power Systems are optimized for performance and can scale to support demanding and growing workloads. These solutions help you capitalize on the currency of data by finding business insights faster and more efficiently.
  19. 28
  20. 31
  21. 36
  22. Scale up and/or out depending on your application Performance and Scale as YOU Need
  23. Customer CTA Slide