SlideShare une entreprise Scribd logo
1  sur  45
Télécharger pour lire hors ligne
Neo Technology, Inc Confidential
La BI,
l'informatique décisionnelle
et les graphes
Philip Rathle, Sr Dir Products
philip@neotechnology.com
http://twitter.com/prathle
Neo Technology, Inc Confidential
Les Graphes
et la Pensée
Neo Technology, Inc Confidential
The Brain
Structure: Neurons Connected by Synapses.
Processing: Signals Relayed Between Neurons through Synapses
Neo Technology, Inc Confidential
Human Thinking
Structured Unstructured (Creative)
Both forms involve processing connections
Neo Technology, Inc Confidential
Les Applications Graphes
dans le Commerce
Neo Technology, Inc Confidential
Early Adopters of Graph Tech
Neo Technology, Inc Confidential
Evolution of Web Search
Survival of the Fittest
Pre-1999
WWW Indexing
Discrete Data
1999 - 2012
Google Invents
PageRank
Connected Data
(Simple)
2012-?
Google Knowledge Graph,
Facebook Graph Search
Connected Data
(Rich)
Neo Technology, Inc Confidential
Evolution of Online Recruiting
2010-11
Resume Searching &
Scoring
Aggregated Data
Survival of the Fittest
2011-12
Social Job Search
Connected Data
Neo Technology, Inc Confidential
Consumer Web Giants Depends on Five Graphs
Gartner’s “5 Graphs”
Social Graph
Ref: http://www.gartner.com/id=2081316
Interest Graph
Payment Graph
Intent Graph
Mobile Graph
Neo Technology, Inc Confidential
Graph Buzz!
Neo Technology, Inc Confidential
Core Industries
& Use Cases:
Web / ISV
Finance &
Insurance
Communi-
cations
Logistics
Life
Sciences
Media &
Publishing
Education,
Not-for-
Profit
Government,
Aerospace,
Gaming, Other
Network
Management
MDM
Social
Geo
Authorization &
Access Control
Content
Management
Recommend-
ations
Fraud
Detection,
Other
Accenture
Select Commercial Customers (Community Users Not Included)
Neo4j Adoption Snapshot*
Neo Technology, Inc Confidential
Ecosystème de la
Technologie Graph
Neo Technology, Inc Confidential
Data Storage & Processing
• Graph Databases
• Graph Compute Engines
Programming:
• Graph-Centric APIs & Languages
• Graph Algorithms
Tools:
• Visualization Tools & Libraries
• Other
Key Graph Analytic Technologies
Neo Technology, Inc Confidential
Typical Graph BI Environment
Application
Other
Databases
ETL
Neo4j
Cluster
Data Storage &
Business Rules Execution
Reporting
Graph-
Dashboards
&
Ad-hoc
Analysis
Graph
Visualization
End User Ad-hoc visual navigation &
discovery
Bulk Analytic
Infrastructure
(e.g. Graph Compute
Engine)
ETL
Graph Mining &
Aggregation
Data Scientist
Ad-Hoc
Analysis
What is a
Graph Database
A graph database is an online (“real-time”)
database management system with CRUD
methods that expose a graph data model
• Two important properties:
• Native graph processing, including
index-free adjacency1 to facilitate traversals
• Native graph storage engine, i.e.
written from the ground up to be
optimized for managing graph data
1] See Rodriguez, M.A., Neubauer, P., ,“The Graph Traversal Pattern,” 2010 (http://arxiv.org/abs/1004.1001)
Overview of Popular
Graph Data Models
• Property Graph
• Description: A “directed, labeled, attributed, multi-
graph”1 which exposes three building blocks: nodes, typed
relationships and key-value properties on both nodes and
relationships
• Vendors: Neo4j, OrientDB, InfiniteGraph, Dex
• RDF Triples
• Description: URI-centered subject-predicate-object
triples as pioneered by the semantic web movement2
• Vendors: AllegroGraph, Sesame
• HyperGraph
• Description: A generalized graph where a relationship
can connect an arbitrary amount of nodes (compared to
the more common binary graph models)3
• Vendors: HyperGraphDB,TrinityDB
1] Rodriguez, M.A., Neubauer, P., “Constructions from Dots and Lines,” 2010, http://arxiv.org/abs/1006.2361
2] W3C,“The Resource Description Framework (RDF),” 2004, http://www.w3.org/RDF/
3] Wikipedia, http://en.wikipedia.org/wiki/Hypergraph
Graph Compute Engine
Processing platforms that enable graph global
computational algorithms to be run against
large data sets
Graph Mining
Engine
(Working Storage)
In-Memory Processing
System(s)
of Record
Graph Compute
Engine
Data extraction,
transformation,
and load
Neo Technology, Inc Confidential
Graph Global Queries
What is the max/min/avg. number of connections per node?
(aka “Degree Distribution”)
Neo Technology, Inc Confidential
Quoi faire avec un Graph Database?
Example: Facebook Graph Search
Neo Technology, Inc Confidential
For the Facebook Graph Question:
What sushi restaurants in NYC do my friends like?
Neo Technology, Inc Confidential
What the Graph Looks Like:
What sushi restaurants in NYC do my friends like?
Neo Technology, Inc Confidential
What the Cypher Query Looks Like:
What sushi restaurants in NYC do my friends like?
START me=node:person(name = 'Philip'),
location=node:location(location='New York'),
cuisine=node:cuisine(cuisine='Sushi')
MATCH (me)-[:IS_FRIEND_OF]->(friend)-[:LIKES]->(restaurant)
-[:LOCATED_IN]->(location),(restaurant)-[:SERVES]->(cuisine)
RETURN restaurant
Neo Technology, Inc Confidential
What the Search Looks Like:
What sushi restaurants in NYC do my friends like?
Neo Technology, Inc Confidential
What Other Graph Searches Look Like
What drugs will bind to protein X and not interact with drugY?
Neo Technology, Inc Confidential
Graph Dashboards
Social Network Analysis
Fraud Detection & Money Laundering
Service Assurance
& Network Failure Analysis
Neo Technology, Inc Confidential
Industry Example:
5 Graphs of
Communications
Neo Technology, Inc Confidential
#1:The Network Graph
Graphs in Communications
Cell Signal Analysis
Router
Service
DEPENDS_O
N
Switch Switch
Router
Fiber Link
Fiber Link
Fiber Link
Oceanfloor
Cable
DEPENDS_ON
DEPENDS_ON
DEPEN
DS_O
N
DEPENDS_ON
DEPENDS_ON
DEPENDS_ON
DEPENDS_ON
DEPENDS_ON
DEPENDS_ON
LINKED
LINKED
LIN
KED
DEPENDS_ON
“What if” Downtime Analysis
(Service-to-Infrastructure Mapping)
Network Inventory &
Cost Accounting
Neo Technology, Inc Confidential
#2:The Social Graph
Graphs in Communications
Mobile apps,
Collaboration,
Social Recommendations,
and more...
Neo Technology, Inc Confidential
#3:The Call Graph
Graphs in Communications
Plan & Feature Recommendations,Assess Churn Risk
Neo Technology, Inc Confidential
#4: Master Data Graph
Graphs in Communications
Organizational Hierarchy
Management
Resource Authorization
Ref: http://www.slideshare.net/verheughe/how-nosql-paid-off-for-telenor
Neo Technology, Inc Confidential
#5:The Help Desk Graph
Graphs in Communications
Online Recommendations
for Case Avoidance
Neo Technology, Inc Confidential
Entitlements & Identity
Management
Network Asset
Management
Network Cell Analysis
Geo Routing
(Public Transport)
BioInformatics
Emergent Graph in Other Industries
(Actual Neo4j Graphs)
Insurance Risk Analysis
Neo Technology, Inc Confidential
Web Browsing Portfolio Analytics
Mobile Social ApplicationGene Sequencing
Emergent Graph in Other Industries
(Actual Neo4j Graphs)
Neo Technology, Inc Confidential
Cas d’études
selectionés
Neo Technology, Inc Confidential
Background
• World’s largest provider of IT infrastructure, software
& services
• HP’s Unified Correlation Analyzer (UCA) application is a
key application inside HP’s OSS Assurance portfolio
• Carrier-class resource & service management, problem
determination, root cause & service impact analysis
• Helps communications operators manage large,
complex and fast changing networks
Business problem
• Use network topology information to identify root
problems causes on the network
• Simplify alarm handling by human operators
• Automate handling of certain types of alarms Help
operators respond rapidly to network issues
• Filter/group/eliminate redundant Network
Management System alarms by event correlation
Solution & Benefits
• Accelerated product development time
• Extremely fast querying of network topology
• Graph representation a perfect domain fit
• 24x7 carrier-grade reliability with Neo4j HA clustering
• Met objective in under 6 months
Industry: Web/ISV, Communications
Use case: Network Management
Global (U.S., France)
Neo Technology, Inc Confidential
Background
•One of the world’s largest logistics carriers
•Projected to outgrow capacity of old system
•New parcel routing system
•Single source of truth for entire network
•B2C & B2B parcel tracking
•Real-time routing: up to 5M parcels per day
Business problem
•24x7 availability, year round
•Peak loads of 2500+ parcels per second
•Complex and diverse software stack
•Need predictable performance & linear
scalability
•Daily changes to logistics network: route from
any point, to any point
Solution & Benefits
•Neo4j provides the ideal domain fit:
•a logistics network is a graph
•Extreme availability & performance with Neo4j
clustering
•Hugely simplified queries, vs. relational for
complex routing
•Flexible data model can reflect real-world data
variance much better than relational
•“Whiteboard friendly” model easy to understand
Industry: Logistics
Use case: Parcel Routing
Neo Technology, Inc Confidential
Industry: Online Job Search
Use case: Social / Recommendations
• Online jobs and career community, providing
anonymized inside information to job seekers
Business problem
• Wanted to leverage known fact that most jobs are
found through personal & professional connections
• Needed to rely on an existing source of social
network data. Facebook was the ideal choice.
• End users needed to get instant gratification
• Aiming to have the best job search service, in a very
competitive market
Solution & Benefits
• First-to-market with a product that let users find jobs
through their network of Facebook friends
• Job recommendations served real-time from Neo4j
• Individual Facebook graphs imported real-time into Neo4j
• Glassdoor now stores > 50% of the entire Facebook
social graph
• Neo4j cluster has grown seamlessly, with new instances
being brought online as graph size and load have increased
Person
Company
KNOW
S
Person
Person
KNOWS
Company
KNOWS
WORKS_AT
WORKS_AT
Neo Technology Confidential
Background
Sausalito, CA
Neo Technology, Inc Confidential
Industry: Communications
Use case: Recommendations
•Cisco.com serves customer and business
customers with Support Services
•Needed real-time recommendations, to
encourage use of online knowledge base
•Cisco had been successfully using Neo4j for its
internal master data management solution.
•Identified a strong fit for online
recommendations
Solution & Benefits
•Cases, solutions, articles, etc. continuously scraped
for cross-reference links, and represented in Neo4j
•Real-time reading recommendations via Neo4j
•Neo4j Enterprise with HA cluster
•The result: customers obtain help faster, with
decreased reliance on customer support
Neo Technology Confidential
Background
Business problem
•Call center volumes needed to be lowered by
improving the efficacy of online self service
•Leverage large amounts of knowledge stored in
service cases, solutions, articles, forums, etc.
•Problem resolution times, as well as support
costs, needed to be lowered
Support
Case
Support
Case
Knowledge
Base
Article
Solution
Knowledge
Base
Article
Knowledge
Base
Article
Message
San Jose, CA
Cisco.com
Neo Technology, Inc Confidential
Interactive Television Programming
Industry: Communications
Use case: Social gaming
Background
• Europe’s largest communications company
• Provider of mobile & land telephone lines to
consumers and businesses, as well as internet
services, television, and other services
Solution & Benefits
• Interactive, social offering gives fans a way to
experience the game more closely
• Increased customer stickiness for Deutsche Telekom
• A completely new channel for reaching customers
with information, promotions, and ads
• Clear competitive advantage
Frankfurt, Germany
Business problem
• The Fanorakel application allows fans to have an
interactive experience while watching sports
• Fans can vote for referee decisions and interact with
other fans watching the game
• Highly connected dataset with real-time updates
• Queries need to be served real-time on rapidly
changing data
• One technical challenge is to handle the very high
spikes of activity during popular games
Neo Technology, Inc Confidential
Reasons for Choosing a Graph
Database
1. Order-of-magnitude improvements in query
performance for complex, connected data
2. Drastically accelerated application
development cycles
3. Maintainability and extensibility of the
data model
4. Maturity and reliability of the product
Neo Technology, Inc Confidential
Questions ?
Neo Technology, Inc Confidential
Merci !
Pour	
  aller	
  plus	
  loin	
  :
Cédric	
  Fauvet	
  –	
  Votre	
  contact	
  en	
  France
E-­‐mail	
  :	
  Cedric.Fauvet@neotechnology.com
Twi+er	
  :	
  @Neo4jFr
Communauté	
  Francophone	
  :	
  meetup.com/graphdb-­‐france

Contenu connexe

Tendances

Neo4j graphs in the real world - graph days d.c. - april 14, 2015
Neo4j   graphs in the real world - graph days d.c. - april 14, 2015Neo4j   graphs in the real world - graph days d.c. - april 14, 2015
Neo4j graphs in the real world - graph days d.c. - april 14, 2015
Neo4j
 
Knowledge Graphs - The Power of Graph-Based Search
Knowledge Graphs - The Power of Graph-Based SearchKnowledge Graphs - The Power of Graph-Based Search
Knowledge Graphs - The Power of Graph-Based Search
Neo4j
 

Tendances (20)

GraphTalks - Einführung
GraphTalks - EinführungGraphTalks - Einführung
GraphTalks - Einführung
 
Neo4j the Anti Crime Database
Neo4j the Anti Crime DatabaseNeo4j the Anti Crime Database
Neo4j the Anti Crime Database
 
Making connections matter: 2 use cases on graphs & analytics solutions
Making connections matter: 2 use cases on graphs & analytics solutionsMaking connections matter: 2 use cases on graphs & analytics solutions
Making connections matter: 2 use cases on graphs & analytics solutions
 
Spring Data Neo4j Intro SpringOne 2011
Spring Data Neo4j Intro SpringOne 2011Spring Data Neo4j Intro SpringOne 2011
Spring Data Neo4j Intro SpringOne 2011
 
Graphs for Enterprise Architects
Graphs for Enterprise ArchitectsGraphs for Enterprise Architects
Graphs for Enterprise Architects
 
From Taxonomies and Schemas to Knowledge Graphs: Parts 1 & 2
From Taxonomies and Schemas to Knowledge Graphs: Parts 1 & 2From Taxonomies and Schemas to Knowledge Graphs: Parts 1 & 2
From Taxonomies and Schemas to Knowledge Graphs: Parts 1 & 2
 
Ethics & (Explainable) AI – Semantic AI & the Role of the Knowledge Scientist
Ethics & (Explainable) AI – Semantic AI & the Role of the Knowledge ScientistEthics & (Explainable) AI – Semantic AI & the Role of the Knowledge Scientist
Ethics & (Explainable) AI – Semantic AI & the Role of the Knowledge Scientist
 
Neo4j graphs in the real world - graph days d.c. - april 14, 2015
Neo4j   graphs in the real world - graph days d.c. - april 14, 2015Neo4j   graphs in the real world - graph days d.c. - april 14, 2015
Neo4j graphs in the real world - graph days d.c. - april 14, 2015
 
Detecting eCommerce Fraud with Neo4j and Linkurious
Detecting eCommerce Fraud with Neo4j and LinkuriousDetecting eCommerce Fraud with Neo4j and Linkurious
Detecting eCommerce Fraud with Neo4j and Linkurious
 
Intro to Neo4j Webinar
Intro to Neo4j WebinarIntro to Neo4j Webinar
Intro to Neo4j Webinar
 
Graph technology meetup slides
Graph technology meetup slidesGraph technology meetup slides
Graph technology meetup slides
 
How do You Graph
How do You GraphHow do You Graph
How do You Graph
 
From Knowledge Graphs to AI-powered SEO: Using taxonomies, schemas and knowle...
From Knowledge Graphs to AI-powered SEO: Using taxonomies, schemas and knowle...From Knowledge Graphs to AI-powered SEO: Using taxonomies, schemas and knowle...
From Knowledge Graphs to AI-powered SEO: Using taxonomies, schemas and knowle...
 
Intro to Neo4j
Intro to Neo4jIntro to Neo4j
Intro to Neo4j
 
How Graph Databases used in Police Department?
How Graph Databases used in Police Department?How Graph Databases used in Police Department?
How Graph Databases used in Police Department?
 
Connecting the Dots—How a Graph Database Enables Discovery
Connecting the Dots—How a Graph Database Enables DiscoveryConnecting the Dots—How a Graph Database Enables Discovery
Connecting the Dots—How a Graph Database Enables Discovery
 
GDPR: Leverage the Power of Graphs
GDPR: Leverage the Power of GraphsGDPR: Leverage the Power of Graphs
GDPR: Leverage the Power of Graphs
 
Graphdatenbank Neo4j: Konzept, Positionierung, Status Region DACH - Bruno Un...
 Graphdatenbank Neo4j: Konzept, Positionierung, Status Region DACH - Bruno Un... Graphdatenbank Neo4j: Konzept, Positionierung, Status Region DACH - Bruno Un...
Graphdatenbank Neo4j: Konzept, Positionierung, Status Region DACH - Bruno Un...
 
moOn: A Multidimensional Graph Approach to Human Resources Analytics, aizoOn
moOn: A Multidimensional Graph Approach to Human Resources Analytics, aizoOnmoOn: A Multidimensional Graph Approach to Human Resources Analytics, aizoOn
moOn: A Multidimensional Graph Approach to Human Resources Analytics, aizoOn
 
Knowledge Graphs - The Power of Graph-Based Search
Knowledge Graphs - The Power of Graph-Based SearchKnowledge Graphs - The Power of Graph-Based Search
Knowledge Graphs - The Power of Graph-Based Search
 

En vedette

En vedette (20)

20150624 Belgian GraphDB meetup at Ordina
20150624 Belgian GraphDB meetup at Ordina20150624 Belgian GraphDB meetup at Ordina
20150624 Belgian GraphDB meetup at Ordina
 
NoSQL Now! Introduction to Graph Databases
NoSQL Now! Introduction to Graph DatabasesNoSQL Now! Introduction to Graph Databases
NoSQL Now! Introduction to Graph Databases
 
A case-for-graph-db
A case-for-graph-dbA case-for-graph-db
A case-for-graph-db
 
Graph Databases for SQL Server Professionals - SQLSaturday #350 Winnipeg
Graph Databases for SQL Server Professionals - SQLSaturday #350 WinnipegGraph Databases for SQL Server Professionals - SQLSaturday #350 Winnipeg
Graph Databases for SQL Server Professionals - SQLSaturday #350 Winnipeg
 
Introduction à Neo4j - La base de données de graphes - 2016
Introduction à Neo4j - La base de données de graphes - 2016Introduction à Neo4j - La base de données de graphes - 2016
Introduction à Neo4j - La base de données de graphes - 2016
 
Neo4J : Introduction to Graph Database
Neo4J : Introduction to Graph DatabaseNeo4J : Introduction to Graph Database
Neo4J : Introduction to Graph Database
 
Intro to Neo4j - Nicole White
Intro to Neo4j - Nicole WhiteIntro to Neo4j - Nicole White
Intro to Neo4j - Nicole White
 
Neo4j -- or why graph dbs kick ass
Neo4j -- or why graph dbs kick assNeo4j -- or why graph dbs kick ass
Neo4j -- or why graph dbs kick ass
 
Intro to Cypher
Intro to CypherIntro to Cypher
Intro to Cypher
 
Neo4j
Neo4jNeo4j
Neo4j
 
Intro to Neo4j or why insurances should love graphs
Intro to Neo4j or why insurances should love graphsIntro to Neo4j or why insurances should love graphs
Intro to Neo4j or why insurances should love graphs
 
Neo4j - Cas d'usages pour votre métier
Neo4j - Cas d'usages pour votre métierNeo4j - Cas d'usages pour votre métier
Neo4j - Cas d'usages pour votre métier
 
Immersion Musicale avec Neo4j
Immersion Musicale avec Neo4jImmersion Musicale avec Neo4j
Immersion Musicale avec Neo4j
 
OrientDB vs Neo4j - and an introduction to NoSQL databases
OrientDB vs Neo4j - and an introduction to NoSQL databasesOrientDB vs Neo4j - and an introduction to NoSQL databases
OrientDB vs Neo4j - and an introduction to NoSQL databases
 
RDBMS to Graphs
RDBMS to GraphsRDBMS to Graphs
RDBMS to Graphs
 
Lobbying & réseaux d'influences .ppt
Lobbying & réseaux d'influences .pptLobbying & réseaux d'influences .ppt
Lobbying & réseaux d'influences .ppt
 
The Graph Traversal Programming Pattern
The Graph Traversal Programming PatternThe Graph Traversal Programming Pattern
The Graph Traversal Programming Pattern
 
Recommandations avec Neo4j et le GraphAware Recommendation Engine
Recommandations avec Neo4j et le GraphAware Recommendation EngineRecommandations avec Neo4j et le GraphAware Recommendation Engine
Recommandations avec Neo4j et le GraphAware Recommendation Engine
 
Intro to Neo4j presentation
Intro to Neo4j presentationIntro to Neo4j presentation
Intro to Neo4j presentation
 
Introduction to Graph databases and Neo4j (by Stefan Armbruster)
Introduction to Graph databases and Neo4j (by Stefan Armbruster)Introduction to Graph databases and Neo4j (by Stefan Armbruster)
Introduction to Graph databases and Neo4j (by Stefan Armbruster)
 

Similaire à La bi, l'informatique décisionnelle et les graphes

Dublinked tech workshop_15_dec2011
Dublinked tech workshop_15_dec2011Dublinked tech workshop_15_dec2011
Dublinked tech workshop_15_dec2011
Dublinked .
 
Neo4j GraphTalks - Einführung in Graphdatenbanken
Neo4j GraphTalks - Einführung in GraphdatenbankenNeo4j GraphTalks - Einführung in Graphdatenbanken
Neo4j GraphTalks - Einführung in Graphdatenbanken
Neo4j
 

Similaire à La bi, l'informatique décisionnelle et les graphes (20)

Neo4j GraphDay Seattle- Sept19- Connected data imperative
Neo4j GraphDay Seattle- Sept19- Connected data imperativeNeo4j GraphDay Seattle- Sept19- Connected data imperative
Neo4j GraphDay Seattle- Sept19- Connected data imperative
 
Geschäftliches Potential für System-Integratoren und Berater - Graphdatenban...
Geschäftliches Potential für System-Integratoren und Berater -  Graphdatenban...Geschäftliches Potential für System-Integratoren und Berater -  Graphdatenban...
Geschäftliches Potential für System-Integratoren und Berater - Graphdatenban...
 
K anonymity for crowdsourcing database
K anonymity for crowdsourcing databaseK anonymity for crowdsourcing database
K anonymity for crowdsourcing database
 
New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Ch...
New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Ch...New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Ch...
New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Ch...
 
GraphTour 2020 - Neo4j: What's New?
GraphTour 2020 - Neo4j: What's New?GraphTour 2020 - Neo4j: What's New?
GraphTour 2020 - Neo4j: What's New?
 
Neo4j – The Fastest Path to Scalable Real-Time Analytics
Neo4j – The Fastest Path to Scalable Real-Time AnalyticsNeo4j – The Fastest Path to Scalable Real-Time Analytics
Neo4j – The Fastest Path to Scalable Real-Time Analytics
 
Neo4j Graph Data Science - Webinar
Neo4j Graph Data Science - WebinarNeo4j Graph Data Science - Webinar
Neo4j Graph Data Science - Webinar
 
Government GraphSummit: Optimizing the Supply Chain
Government GraphSummit: Optimizing the Supply ChainGovernment GraphSummit: Optimizing the Supply Chain
Government GraphSummit: Optimizing the Supply Chain
 
Neo4j GraphTalks Zürich - Einführung
Neo4j GraphTalks Zürich - EinführungNeo4j GraphTalks Zürich - Einführung
Neo4j GraphTalks Zürich - Einführung
 
Introducing Neo4j
Introducing Neo4jIntroducing Neo4j
Introducing Neo4j
 
Keynote: Graphs in Government_Lance Walter, CMO
Keynote:  Graphs in Government_Lance Walter, CMOKeynote:  Graphs in Government_Lance Walter, CMO
Keynote: Graphs in Government_Lance Walter, CMO
 
Neo4j in Depth
Neo4j in DepthNeo4j in Depth
Neo4j in Depth
 
Dublinked tech workshop_15_dec2011
Dublinked tech workshop_15_dec2011Dublinked tech workshop_15_dec2011
Dublinked tech workshop_15_dec2011
 
Neo4j GraphTalks - Einführung in Graphdatenbanken
Neo4j GraphTalks - Einführung in GraphdatenbankenNeo4j GraphTalks - Einführung in Graphdatenbanken
Neo4j GraphTalks - Einführung in Graphdatenbanken
 
Introduction: Relational to Graphs
Introduction: Relational to GraphsIntroduction: Relational to Graphs
Introduction: Relational to Graphs
 
Relationships Matter: Using Connected Data for Better Machine Learning
Relationships Matter: Using Connected Data for Better Machine LearningRelationships Matter: Using Connected Data for Better Machine Learning
Relationships Matter: Using Connected Data for Better Machine Learning
 
Neo4j GraphTalk Oslo - Introduction to Graphs
Neo4j GraphTalk Oslo - Introduction to GraphsNeo4j GraphTalk Oslo - Introduction to Graphs
Neo4j GraphTalk Oslo - Introduction to Graphs
 
Semantics and Machine Learning
Semantics and Machine LearningSemantics and Machine Learning
Semantics and Machine Learning
 
How Graph Technology is Changing AI
How Graph Technology is Changing AIHow Graph Technology is Changing AI
How Graph Technology is Changing AI
 
The Data Platform for Today's Intelligent Applications.pdf
The Data Platform for Today's Intelligent Applications.pdfThe Data Platform for Today's Intelligent Applications.pdf
The Data Platform for Today's Intelligent Applications.pdf
 

Dernier

IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
Enterprise Knowledge
 

Dernier (20)

From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Evaluating the top large language models.pdf
Evaluating the top large language models.pdfEvaluating the top large language models.pdf
Evaluating the top large language models.pdf
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 

La bi, l'informatique décisionnelle et les graphes

  • 1. Neo Technology, Inc Confidential La BI, l'informatique décisionnelle et les graphes Philip Rathle, Sr Dir Products philip@neotechnology.com http://twitter.com/prathle
  • 2. Neo Technology, Inc Confidential Les Graphes et la Pensée
  • 3. Neo Technology, Inc Confidential The Brain Structure: Neurons Connected by Synapses. Processing: Signals Relayed Between Neurons through Synapses
  • 4. Neo Technology, Inc Confidential Human Thinking Structured Unstructured (Creative) Both forms involve processing connections
  • 5. Neo Technology, Inc Confidential Les Applications Graphes dans le Commerce
  • 6. Neo Technology, Inc Confidential Early Adopters of Graph Tech
  • 7. Neo Technology, Inc Confidential Evolution of Web Search Survival of the Fittest Pre-1999 WWW Indexing Discrete Data 1999 - 2012 Google Invents PageRank Connected Data (Simple) 2012-? Google Knowledge Graph, Facebook Graph Search Connected Data (Rich)
  • 8. Neo Technology, Inc Confidential Evolution of Online Recruiting 2010-11 Resume Searching & Scoring Aggregated Data Survival of the Fittest 2011-12 Social Job Search Connected Data
  • 9. Neo Technology, Inc Confidential Consumer Web Giants Depends on Five Graphs Gartner’s “5 Graphs” Social Graph Ref: http://www.gartner.com/id=2081316 Interest Graph Payment Graph Intent Graph Mobile Graph
  • 10. Neo Technology, Inc Confidential Graph Buzz!
  • 11. Neo Technology, Inc Confidential Core Industries & Use Cases: Web / ISV Finance & Insurance Communi- cations Logistics Life Sciences Media & Publishing Education, Not-for- Profit Government, Aerospace, Gaming, Other Network Management MDM Social Geo Authorization & Access Control Content Management Recommend- ations Fraud Detection, Other Accenture Select Commercial Customers (Community Users Not Included) Neo4j Adoption Snapshot*
  • 12. Neo Technology, Inc Confidential Ecosystème de la Technologie Graph
  • 13. Neo Technology, Inc Confidential Data Storage & Processing • Graph Databases • Graph Compute Engines Programming: • Graph-Centric APIs & Languages • Graph Algorithms Tools: • Visualization Tools & Libraries • Other Key Graph Analytic Technologies
  • 14. Neo Technology, Inc Confidential Typical Graph BI Environment Application Other Databases ETL Neo4j Cluster Data Storage & Business Rules Execution Reporting Graph- Dashboards & Ad-hoc Analysis Graph Visualization End User Ad-hoc visual navigation & discovery Bulk Analytic Infrastructure (e.g. Graph Compute Engine) ETL Graph Mining & Aggregation Data Scientist Ad-Hoc Analysis
  • 15. What is a Graph Database A graph database is an online (“real-time”) database management system with CRUD methods that expose a graph data model • Two important properties: • Native graph processing, including index-free adjacency1 to facilitate traversals • Native graph storage engine, i.e. written from the ground up to be optimized for managing graph data 1] See Rodriguez, M.A., Neubauer, P., ,“The Graph Traversal Pattern,” 2010 (http://arxiv.org/abs/1004.1001)
  • 16. Overview of Popular Graph Data Models • Property Graph • Description: A “directed, labeled, attributed, multi- graph”1 which exposes three building blocks: nodes, typed relationships and key-value properties on both nodes and relationships • Vendors: Neo4j, OrientDB, InfiniteGraph, Dex • RDF Triples • Description: URI-centered subject-predicate-object triples as pioneered by the semantic web movement2 • Vendors: AllegroGraph, Sesame • HyperGraph • Description: A generalized graph where a relationship can connect an arbitrary amount of nodes (compared to the more common binary graph models)3 • Vendors: HyperGraphDB,TrinityDB 1] Rodriguez, M.A., Neubauer, P., “Constructions from Dots and Lines,” 2010, http://arxiv.org/abs/1006.2361 2] W3C,“The Resource Description Framework (RDF),” 2004, http://www.w3.org/RDF/ 3] Wikipedia, http://en.wikipedia.org/wiki/Hypergraph
  • 17. Graph Compute Engine Processing platforms that enable graph global computational algorithms to be run against large data sets Graph Mining Engine (Working Storage) In-Memory Processing System(s) of Record Graph Compute Engine Data extraction, transformation, and load
  • 18. Neo Technology, Inc Confidential Graph Global Queries What is the max/min/avg. number of connections per node? (aka “Degree Distribution”)
  • 19. Neo Technology, Inc Confidential Quoi faire avec un Graph Database? Example: Facebook Graph Search
  • 20. Neo Technology, Inc Confidential For the Facebook Graph Question: What sushi restaurants in NYC do my friends like?
  • 21. Neo Technology, Inc Confidential What the Graph Looks Like: What sushi restaurants in NYC do my friends like?
  • 22. Neo Technology, Inc Confidential What the Cypher Query Looks Like: What sushi restaurants in NYC do my friends like? START me=node:person(name = 'Philip'), location=node:location(location='New York'), cuisine=node:cuisine(cuisine='Sushi') MATCH (me)-[:IS_FRIEND_OF]->(friend)-[:LIKES]->(restaurant) -[:LOCATED_IN]->(location),(restaurant)-[:SERVES]->(cuisine) RETURN restaurant
  • 23. Neo Technology, Inc Confidential What the Search Looks Like: What sushi restaurants in NYC do my friends like?
  • 24. Neo Technology, Inc Confidential What Other Graph Searches Look Like What drugs will bind to protein X and not interact with drugY?
  • 25. Neo Technology, Inc Confidential Graph Dashboards
  • 27. Fraud Detection & Money Laundering
  • 28. Service Assurance & Network Failure Analysis
  • 29. Neo Technology, Inc Confidential Industry Example: 5 Graphs of Communications
  • 30. Neo Technology, Inc Confidential #1:The Network Graph Graphs in Communications Cell Signal Analysis Router Service DEPENDS_O N Switch Switch Router Fiber Link Fiber Link Fiber Link Oceanfloor Cable DEPENDS_ON DEPENDS_ON DEPEN DS_O N DEPENDS_ON DEPENDS_ON DEPENDS_ON DEPENDS_ON DEPENDS_ON DEPENDS_ON LINKED LINKED LIN KED DEPENDS_ON “What if” Downtime Analysis (Service-to-Infrastructure Mapping) Network Inventory & Cost Accounting
  • 31. Neo Technology, Inc Confidential #2:The Social Graph Graphs in Communications Mobile apps, Collaboration, Social Recommendations, and more...
  • 32. Neo Technology, Inc Confidential #3:The Call Graph Graphs in Communications Plan & Feature Recommendations,Assess Churn Risk
  • 33. Neo Technology, Inc Confidential #4: Master Data Graph Graphs in Communications Organizational Hierarchy Management Resource Authorization Ref: http://www.slideshare.net/verheughe/how-nosql-paid-off-for-telenor
  • 34. Neo Technology, Inc Confidential #5:The Help Desk Graph Graphs in Communications Online Recommendations for Case Avoidance
  • 35. Neo Technology, Inc Confidential Entitlements & Identity Management Network Asset Management Network Cell Analysis Geo Routing (Public Transport) BioInformatics Emergent Graph in Other Industries (Actual Neo4j Graphs) Insurance Risk Analysis
  • 36. Neo Technology, Inc Confidential Web Browsing Portfolio Analytics Mobile Social ApplicationGene Sequencing Emergent Graph in Other Industries (Actual Neo4j Graphs)
  • 37. Neo Technology, Inc Confidential Cas d’études selectionés
  • 38. Neo Technology, Inc Confidential Background • World’s largest provider of IT infrastructure, software & services • HP’s Unified Correlation Analyzer (UCA) application is a key application inside HP’s OSS Assurance portfolio • Carrier-class resource & service management, problem determination, root cause & service impact analysis • Helps communications operators manage large, complex and fast changing networks Business problem • Use network topology information to identify root problems causes on the network • Simplify alarm handling by human operators • Automate handling of certain types of alarms Help operators respond rapidly to network issues • Filter/group/eliminate redundant Network Management System alarms by event correlation Solution & Benefits • Accelerated product development time • Extremely fast querying of network topology • Graph representation a perfect domain fit • 24x7 carrier-grade reliability with Neo4j HA clustering • Met objective in under 6 months Industry: Web/ISV, Communications Use case: Network Management Global (U.S., France)
  • 39. Neo Technology, Inc Confidential Background •One of the world’s largest logistics carriers •Projected to outgrow capacity of old system •New parcel routing system •Single source of truth for entire network •B2C & B2B parcel tracking •Real-time routing: up to 5M parcels per day Business problem •24x7 availability, year round •Peak loads of 2500+ parcels per second •Complex and diverse software stack •Need predictable performance & linear scalability •Daily changes to logistics network: route from any point, to any point Solution & Benefits •Neo4j provides the ideal domain fit: •a logistics network is a graph •Extreme availability & performance with Neo4j clustering •Hugely simplified queries, vs. relational for complex routing •Flexible data model can reflect real-world data variance much better than relational •“Whiteboard friendly” model easy to understand Industry: Logistics Use case: Parcel Routing
  • 40. Neo Technology, Inc Confidential Industry: Online Job Search Use case: Social / Recommendations • Online jobs and career community, providing anonymized inside information to job seekers Business problem • Wanted to leverage known fact that most jobs are found through personal & professional connections • Needed to rely on an existing source of social network data. Facebook was the ideal choice. • End users needed to get instant gratification • Aiming to have the best job search service, in a very competitive market Solution & Benefits • First-to-market with a product that let users find jobs through their network of Facebook friends • Job recommendations served real-time from Neo4j • Individual Facebook graphs imported real-time into Neo4j • Glassdoor now stores > 50% of the entire Facebook social graph • Neo4j cluster has grown seamlessly, with new instances being brought online as graph size and load have increased Person Company KNOW S Person Person KNOWS Company KNOWS WORKS_AT WORKS_AT Neo Technology Confidential Background Sausalito, CA
  • 41. Neo Technology, Inc Confidential Industry: Communications Use case: Recommendations •Cisco.com serves customer and business customers with Support Services •Needed real-time recommendations, to encourage use of online knowledge base •Cisco had been successfully using Neo4j for its internal master data management solution. •Identified a strong fit for online recommendations Solution & Benefits •Cases, solutions, articles, etc. continuously scraped for cross-reference links, and represented in Neo4j •Real-time reading recommendations via Neo4j •Neo4j Enterprise with HA cluster •The result: customers obtain help faster, with decreased reliance on customer support Neo Technology Confidential Background Business problem •Call center volumes needed to be lowered by improving the efficacy of online self service •Leverage large amounts of knowledge stored in service cases, solutions, articles, forums, etc. •Problem resolution times, as well as support costs, needed to be lowered Support Case Support Case Knowledge Base Article Solution Knowledge Base Article Knowledge Base Article Message San Jose, CA Cisco.com
  • 42. Neo Technology, Inc Confidential Interactive Television Programming Industry: Communications Use case: Social gaming Background • Europe’s largest communications company • Provider of mobile & land telephone lines to consumers and businesses, as well as internet services, television, and other services Solution & Benefits • Interactive, social offering gives fans a way to experience the game more closely • Increased customer stickiness for Deutsche Telekom • A completely new channel for reaching customers with information, promotions, and ads • Clear competitive advantage Frankfurt, Germany Business problem • The Fanorakel application allows fans to have an interactive experience while watching sports • Fans can vote for referee decisions and interact with other fans watching the game • Highly connected dataset with real-time updates • Queries need to be served real-time on rapidly changing data • One technical challenge is to handle the very high spikes of activity during popular games
  • 43. Neo Technology, Inc Confidential Reasons for Choosing a Graph Database 1. Order-of-magnitude improvements in query performance for complex, connected data 2. Drastically accelerated application development cycles 3. Maintainability and extensibility of the data model 4. Maturity and reliability of the product
  • 44. Neo Technology, Inc Confidential Questions ?
  • 45. Neo Technology, Inc Confidential Merci ! Pour  aller  plus  loin  : Cédric  Fauvet  –  Votre  contact  en  France E-­‐mail  :  Cedric.Fauvet@neotechnology.com Twi+er  :  @Neo4jFr Communauté  Francophone  :  meetup.com/graphdb-­‐france