SlideShare une entreprise Scribd logo
1  sur  26
Télécharger pour lire hors ligne
25 Settembre 2017
With a datascience perspective
Aview of graph data usage by Cerved
Stefano Gatti – Head of Innovation and data sources
Nunzio Pellegrino – Senior Data Scientist – Innovation team
Cerved and its graphs in a nutshell
3
Cerved, in a nutshell
The Italian data-driven company
CREDIT INFORMATION
Protection against credit risk
MARKETING SOLUTIONS
New business opportunities
CREDIT MANAGEMENT
Manage and collect performing and
non-performing loans.
	
Over 1000 a minute	
ü  Documents	
	
Over 40 million	
ü  Lines of code
	
Over 30,000	
ü  Customers
	
Over 50 different	
ü  Data sources
Over 10million a day	
ü  Api call
	
Over 1,900	
ü  People
	
377 million Eur (2016)	
ü  Revenue
4
Web	Data	
Open	Data	
Proprietary	data	
Official	data	
Chamber	of	
Commerce	
official	data	
C
o
m
p
l
e
x
i
t
y	
Our big data
5
Cerved, in a tech view
Data	
Algorithms	
Solu0ons	
Towards algorithmic economy …
6
Cerved Graph Story
2011-12 - we started from an IT problem: reengineering of beneficial owner algorithm
7
Cerved Graph Story
2014-15 - we went through a more algorithmic problem: corporate linkages algorithm
8
Cerved Graph Story
2015-16 - we go with a “full stack” solution
9
Cerved Graph thoughts
We strongly believe in …
The	power	of	linking	data		
The	power	of	analyzing	data	with	network	analysis	
The	power	of	visualizing	data	in	a	different	way	
To	understand	a	li5le	be5er	the		
increasing	complexity	of	modern	world	…	
also	from	an	economic	point	of	view
Why a Graph Database?
11
What is a Graph?
12
Key Concepts
Graph database
NoSQL database
Managing highly connected data
and complex queries
Flexible data model
13
Key Concepts
Graph database
Declarative or imperative
language
Horizontal Scaling
Graph native storage and
process
14
Where graphdb can be useful?
“Hands-On Machine Learning with Scikit-Learn and TensorFlow” by Aurélien Géron
15
Maybe in the future…
“Hands-On Machine Learning with Scikit-Learn and TensorFlow” by Aurélien Géron
16
Frame the Problem
Data Model
Simple1
Expressive2
3 Additive
17
RDBMS
RDBMS vs Graph Data Model
Graph
18
Store & Get Data
Native graph storage
Store Data Fast Write Performance
Easy Data Integration:
CSV
Jdbc
REST Api
19
Store & Get Data
Native graph storage
Store Data Fast Write Performance
Easy Data Integration:
Get Data
Native graph processing à Index free adjacency
CSV
Jdbc
REST Api
Cypher, Declarative Language
Driver:
Python
py2neo (unofficial)
R (unofficial)
Java
APOC
20
Explore Data
Transform implicit to explicitCypher (access points, pattern)
21
Explore Data
Transform implicit to explicitCypher (access points, pattern)
22
Explore Data
Transform implicit to explicitCypher (access points, pattern)
23
Prepare Data
Feature Creation with parallel Graph algorithms
Centralities
•  Page Rank
•  Betweenness Centrality
•  Closeness Centrality
Graph Partitioning
•  Label Propagation
•  Connected Components
•  Strongly Connected
Components
Path Finding
•  Minimum Weight
Spanning Tree
•  All Pairs- and Single
Source Shortest Path
24
Prepare Data
Feature Creation with parallel Graph algorithms
Centralities
•  Page Rank
•  Betweenness Centrality
•  Closeness Centrality
Graph Partitioning
•  Label Propagation
•  Connected Components
•  Strongly Connected
Components
Path Finding
•  Minimum Weight
Spanning Tree
•  All Pairs- and Single
Source Shortest Path
Graph Size (GB) nodes (M) rels (M)
PageRank
(s)
ConCom
(s)
LabelPropag
(s)
StrongConCom
(s)
Pokec 7.3 2 31 10 24 12 12
DBPedia 15 11 117 46 91 51 65
Graphs500-23 7.9 5 129 19 29 18 25
Twitter-2010 49 42 1468 349 353 405 339
soc-LifeJournal1 6.3 5 69 30 34 25 23
Friendster 62 66 1806 611 619 296 483
Performance
25
Present&Launch your solution
Real time
Recommendation
Fraud Detection
Social Network Analysis
Search & Link Analysis
Knowledge Graph Natural Language
Process
Nunzio Pellegrino
S e n i o r D a t a S c i e n t i s t –
Innovation Team
nunzio.pellegrino@cerved.com
Stefano Gatti
Head of Innovation & Data
Sources
stefano.gatti@cerved.com

Contenu connexe

Tendances

GraphTour - The Workshop - Device Tracking in Practice: From Idea to Production
GraphTour - The Workshop - Device Tracking in Practice: From Idea to ProductionGraphTour - The Workshop - Device Tracking in Practice: From Idea to Production
GraphTour - The Workshop - Device Tracking in Practice: From Idea to ProductionNeo4j
 
07 verheul texcavator
07 verheul texcavator07 verheul texcavator
07 verheul texcavatoringeangevaare
 
Demystifying Big Data with Scala and Akka
Demystifying Big Data with Scala and AkkaDemystifying Big Data with Scala and Akka
Demystifying Big Data with Scala and AkkaKnoldus Inc.
 
Turning Data into Insights and Intelligence
Turning Data into Insights and IntelligenceTurning Data into Insights and Intelligence
Turning Data into Insights and IntelligenceSabine Kurjo McNeill
 
2018 12-10 apidays.io eric horesnyi streamdata.io event-driven ap is
2018 12-10 apidays.io eric horesnyi streamdata.io event-driven ap is2018 12-10 apidays.io eric horesnyi streamdata.io event-driven ap is
2018 12-10 apidays.io eric horesnyi streamdata.io event-driven ap isEric Horesnyi
 
Geschäftliches Potential für System-Integratoren und Berater - Graphdatenban...
Geschäftliches Potential für System-Integratoren und Berater -  Graphdatenban...Geschäftliches Potential für System-Integratoren und Berater -  Graphdatenban...
Geschäftliches Potential für System-Integratoren und Berater - Graphdatenban...Neo4j
 
Integration and Exploration of Financial Data using Semantics and Ontologies
Integration and Exploration of Financial Data using Semantics and OntologiesIntegration and Exploration of Financial Data using Semantics and Ontologies
Integration and Exploration of Financial Data using Semantics and OntologiesRoberto García
 
Foundations of Data Teams
Foundations of Data TeamsFoundations of Data Teams
Foundations of Data TeamsDatabricks
 
Vector Similarity Search & Indexing Methods
Vector Similarity Search & Indexing MethodsVector Similarity Search & Indexing Methods
Vector Similarity Search & Indexing MethodsKate Shao
 
Autograph - Natural Signatures for Graph Modelling, Simon Brueckheimer, Ciena
Autograph - Natural Signatures for Graph Modelling, Simon Brueckheimer, CienaAutograph - Natural Signatures for Graph Modelling, Simon Brueckheimer, Ciena
Autograph - Natural Signatures for Graph Modelling, Simon Brueckheimer, CienaNeo4j
 
Tracxn - Geo Monthly Report - Austria Tech - Mar 2022
Tracxn - Geo Monthly Report - Austria Tech - Mar 2022 Tracxn - Geo Monthly Report - Austria Tech - Mar 2022
Tracxn - Geo Monthly Report - Austria Tech - Mar 2022 Tracxn
 
Coordinating external data importer services using AWS step functions
Coordinating external data importer services using AWS step functionsCoordinating external data importer services using AWS step functions
Coordinating external data importer services using AWS step functionsMarcos Rebelo
 
Session 4.3 semantic annotation for enhancing collaborative ideation
Session 4.3   semantic annotation for enhancing collaborative ideationSession 4.3   semantic annotation for enhancing collaborative ideation
Session 4.3 semantic annotation for enhancing collaborative ideationsemanticsconference
 
AI Solutions with Macnica.ai - AI Expo 2018 Tokyo Japan
AI Solutions with Macnica.ai - AI Expo 2018 Tokyo JapanAI Solutions with Macnica.ai - AI Expo 2018 Tokyo Japan
AI Solutions with Macnica.ai - AI Expo 2018 Tokyo JapanAvkash Chauhan
 
Amsterdam developing public code for every city and everyone, Boris Van Hoyte...
Amsterdam developing public code for every city and everyone, Boris Van Hoyte...Amsterdam developing public code for every city and everyone, Boris Van Hoyte...
Amsterdam developing public code for every city and everyone, Boris Van Hoyte...OW2
 
Linking authorities through Wikidata
Linking authorities through WikidataLinking authorities through Wikidata
Linking authorities through WikidataJoachim Neubert
 
MarkLogic - Open Analytics Meetup
MarkLogic - Open Analytics MeetupMarkLogic - Open Analytics Meetup
MarkLogic - Open Analytics MeetupOpen Analytics
 
Tracxn - Monthly Report - Austria Tech - Oct 2021
Tracxn - Monthly Report - Austria Tech - Oct 2021Tracxn - Monthly Report - Austria Tech - Oct 2021
Tracxn - Monthly Report - Austria Tech - Oct 2021Tracxn
 

Tendances (20)

GraphTour - The Workshop - Device Tracking in Practice: From Idea to Production
GraphTour - The Workshop - Device Tracking in Practice: From Idea to ProductionGraphTour - The Workshop - Device Tracking in Practice: From Idea to Production
GraphTour - The Workshop - Device Tracking in Practice: From Idea to Production
 
07 verheul texcavator
07 verheul texcavator07 verheul texcavator
07 verheul texcavator
 
Demystifying Big Data with Scala and Akka
Demystifying Big Data with Scala and AkkaDemystifying Big Data with Scala and Akka
Demystifying Big Data with Scala and Akka
 
Turning Data into Insights and Intelligence
Turning Data into Insights and IntelligenceTurning Data into Insights and Intelligence
Turning Data into Insights and Intelligence
 
2018 12-10 apidays.io eric horesnyi streamdata.io event-driven ap is
2018 12-10 apidays.io eric horesnyi streamdata.io event-driven ap is2018 12-10 apidays.io eric horesnyi streamdata.io event-driven ap is
2018 12-10 apidays.io eric horesnyi streamdata.io event-driven ap is
 
Geschäftliches Potential für System-Integratoren und Berater - Graphdatenban...
Geschäftliches Potential für System-Integratoren und Berater -  Graphdatenban...Geschäftliches Potential für System-Integratoren und Berater -  Graphdatenban...
Geschäftliches Potential für System-Integratoren und Berater - Graphdatenban...
 
Integration and Exploration of Financial Data using Semantics and Ontologies
Integration and Exploration of Financial Data using Semantics and OntologiesIntegration and Exploration of Financial Data using Semantics and Ontologies
Integration and Exploration of Financial Data using Semantics and Ontologies
 
Foundations of Data Teams
Foundations of Data TeamsFoundations of Data Teams
Foundations of Data Teams
 
Vector Similarity Search & Indexing Methods
Vector Similarity Search & Indexing MethodsVector Similarity Search & Indexing Methods
Vector Similarity Search & Indexing Methods
 
Autograph - Natural Signatures for Graph Modelling, Simon Brueckheimer, Ciena
Autograph - Natural Signatures for Graph Modelling, Simon Brueckheimer, CienaAutograph - Natural Signatures for Graph Modelling, Simon Brueckheimer, Ciena
Autograph - Natural Signatures for Graph Modelling, Simon Brueckheimer, Ciena
 
Sitech
SitechSitech
Sitech
 
Tracxn - Geo Monthly Report - Austria Tech - Mar 2022
Tracxn - Geo Monthly Report - Austria Tech - Mar 2022 Tracxn - Geo Monthly Report - Austria Tech - Mar 2022
Tracxn - Geo Monthly Report - Austria Tech - Mar 2022
 
2019 GDRR: Blockchain Data Analytics - Cryptocurrency and blockchain analysis...
2019 GDRR: Blockchain Data Analytics - Cryptocurrency and blockchain analysis...2019 GDRR: Blockchain Data Analytics - Cryptocurrency and blockchain analysis...
2019 GDRR: Blockchain Data Analytics - Cryptocurrency and blockchain analysis...
 
Coordinating external data importer services using AWS step functions
Coordinating external data importer services using AWS step functionsCoordinating external data importer services using AWS step functions
Coordinating external data importer services using AWS step functions
 
Session 4.3 semantic annotation for enhancing collaborative ideation
Session 4.3   semantic annotation for enhancing collaborative ideationSession 4.3   semantic annotation for enhancing collaborative ideation
Session 4.3 semantic annotation for enhancing collaborative ideation
 
AI Solutions with Macnica.ai - AI Expo 2018 Tokyo Japan
AI Solutions with Macnica.ai - AI Expo 2018 Tokyo JapanAI Solutions with Macnica.ai - AI Expo 2018 Tokyo Japan
AI Solutions with Macnica.ai - AI Expo 2018 Tokyo Japan
 
Amsterdam developing public code for every city and everyone, Boris Van Hoyte...
Amsterdam developing public code for every city and everyone, Boris Van Hoyte...Amsterdam developing public code for every city and everyone, Boris Van Hoyte...
Amsterdam developing public code for every city and everyone, Boris Van Hoyte...
 
Linking authorities through Wikidata
Linking authorities through WikidataLinking authorities through Wikidata
Linking authorities through Wikidata
 
MarkLogic - Open Analytics Meetup
MarkLogic - Open Analytics MeetupMarkLogic - Open Analytics Meetup
MarkLogic - Open Analytics Meetup
 
Tracxn - Monthly Report - Austria Tech - Oct 2021
Tracxn - Monthly Report - Austria Tech - Oct 2021Tracxn - Monthly Report - Austria Tech - Oct 2021
Tracxn - Monthly Report - Austria Tech - Oct 2021
 

Similaire à Cerved Datascience Milan

A view of graph data usage by Cerved
A view of graph data usage by CervedA view of graph data usage by Cerved
A view of graph data usage by CervedData Science Milan
 
SAP Forum Ankara 2017 - "Verinin Merkezine Seyahat"
SAP Forum Ankara 2017 - "Verinin Merkezine Seyahat"SAP Forum Ankara 2017 - "Verinin Merkezine Seyahat"
SAP Forum Ankara 2017 - "Verinin Merkezine Seyahat"MDS ap
 
Graphs for Enterprise Architects
Graphs for Enterprise ArchitectsGraphs for Enterprise Architects
Graphs for Enterprise ArchitectsNeo4j
 
AWS Partner Webcast - Analyze Big Data for Consumer Applications with Looker ...
AWS Partner Webcast - Analyze Big Data for Consumer Applications with Looker ...AWS Partner Webcast - Analyze Big Data for Consumer Applications with Looker ...
AWS Partner Webcast - Analyze Big Data for Consumer Applications with Looker ...Amazon Web Services
 
Integrating Semantic Web in the Real World: A Journey between Two Cities
Integrating Semantic Web in the Real World: A Journey between Two Cities Integrating Semantic Web in the Real World: A Journey between Two Cities
Integrating Semantic Web in the Real World: A Journey between Two Cities Juan Sequeda
 
Refactoring your EDW with Mobile Analytics Products
Refactoring your EDW with Mobile Analytics ProductsRefactoring your EDW with Mobile Analytics Products
Refactoring your EDW with Mobile Analytics ProductsLuke Han
 
Architecting for change: LinkedIn's new data ecosystem
Architecting for change: LinkedIn's new data ecosystemArchitecting for change: LinkedIn's new data ecosystem
Architecting for change: LinkedIn's new data ecosystemYael Garten
 
Strata 2016 - Architecting for Change: LinkedIn's new data ecosystem
Strata 2016 - Architecting for Change: LinkedIn's new data ecosystemStrata 2016 - Architecting for Change: LinkedIn's new data ecosystem
Strata 2016 - Architecting for Change: LinkedIn's new data ecosystemShirshanka Das
 
Strata 2017 (San Jose): Building a healthy data ecosystem around Kafka and Ha...
Strata 2017 (San Jose): Building a healthy data ecosystem around Kafka and Ha...Strata 2017 (San Jose): Building a healthy data ecosystem around Kafka and Ha...
Strata 2017 (San Jose): Building a healthy data ecosystem around Kafka and Ha...Shirshanka Das
 
Building a healthy data ecosystem around Kafka and Hadoop: Lessons learned at...
Building a healthy data ecosystem around Kafka and Hadoop: Lessons learned at...Building a healthy data ecosystem around Kafka and Hadoop: Lessons learned at...
Building a healthy data ecosystem around Kafka and Hadoop: Lessons learned at...Yael Garten
 
RedisGraph A Low Latency Graph DB: Pieter Cailliau
RedisGraph A Low Latency Graph DB: Pieter CailliauRedisGraph A Low Latency Graph DB: Pieter Cailliau
RedisGraph A Low Latency Graph DB: Pieter CailliauRedis Labs
 
GraphTalks Rome - The Italian Business Graph
GraphTalks Rome - The Italian Business GraphGraphTalks Rome - The Italian Business Graph
GraphTalks Rome - The Italian Business GraphNeo4j
 
SC4 Workshop 1: Simon Scerri: Existing tools and technologies
SC4 Workshop 1: Simon Scerri: Existing tools and technologiesSC4 Workshop 1: Simon Scerri: Existing tools and technologies
SC4 Workshop 1: Simon Scerri: Existing tools and technologiesBigData_Europe
 
Produktdatenmanagement mit Neo4j
Produktdatenmanagement mit Neo4jProduktdatenmanagement mit Neo4j
Produktdatenmanagement mit Neo4jNeo4j
 
La bi, l'informatique décisionnelle et les graphes
La bi, l'informatique décisionnelle et les graphesLa bi, l'informatique décisionnelle et les graphes
La bi, l'informatique décisionnelle et les graphesCédric Fauvet
 
In-Memory Computing Webcast. Market Predictions 2017
In-Memory Computing Webcast. Market Predictions 2017In-Memory Computing Webcast. Market Predictions 2017
In-Memory Computing Webcast. Market Predictions 2017SingleStore
 
Scaling up your Analytics & Insights
Scaling up your Analytics & InsightsScaling up your Analytics & Insights
Scaling up your Analytics & InsightsLoQutus
 
Nodes2020 | Graph of enterprise_metadata | NEO4J Conference
Nodes2020 | Graph of enterprise_metadata | NEO4J ConferenceNodes2020 | Graph of enterprise_metadata | NEO4J Conference
Nodes2020 | Graph of enterprise_metadata | NEO4J ConferenceDeepak Chandramouli
 

Similaire à Cerved Datascience Milan (20)

A view of graph data usage by Cerved
A view of graph data usage by CervedA view of graph data usage by Cerved
A view of graph data usage by Cerved
 
SAP Forum Ankara 2017 - "Verinin Merkezine Seyahat"
SAP Forum Ankara 2017 - "Verinin Merkezine Seyahat"SAP Forum Ankara 2017 - "Verinin Merkezine Seyahat"
SAP Forum Ankara 2017 - "Verinin Merkezine Seyahat"
 
Graphs for Enterprise Architects
Graphs for Enterprise ArchitectsGraphs for Enterprise Architects
Graphs for Enterprise Architects
 
AWS Partner Webcast - Analyze Big Data for Consumer Applications with Looker ...
AWS Partner Webcast - Analyze Big Data for Consumer Applications with Looker ...AWS Partner Webcast - Analyze Big Data for Consumer Applications with Looker ...
AWS Partner Webcast - Analyze Big Data for Consumer Applications with Looker ...
 
Integrating Semantic Web in the Real World: A Journey between Two Cities
Integrating Semantic Web in the Real World: A Journey between Two Cities Integrating Semantic Web in the Real World: A Journey between Two Cities
Integrating Semantic Web in the Real World: A Journey between Two Cities
 
Talend introduction v1
Talend introduction v1Talend introduction v1
Talend introduction v1
 
Refactoring your EDW with Mobile Analytics Products
Refactoring your EDW with Mobile Analytics ProductsRefactoring your EDW with Mobile Analytics Products
Refactoring your EDW with Mobile Analytics Products
 
Architecting for change: LinkedIn's new data ecosystem
Architecting for change: LinkedIn's new data ecosystemArchitecting for change: LinkedIn's new data ecosystem
Architecting for change: LinkedIn's new data ecosystem
 
Strata 2016 - Architecting for Change: LinkedIn's new data ecosystem
Strata 2016 - Architecting for Change: LinkedIn's new data ecosystemStrata 2016 - Architecting for Change: LinkedIn's new data ecosystem
Strata 2016 - Architecting for Change: LinkedIn's new data ecosystem
 
Strata 2017 (San Jose): Building a healthy data ecosystem around Kafka and Ha...
Strata 2017 (San Jose): Building a healthy data ecosystem around Kafka and Ha...Strata 2017 (San Jose): Building a healthy data ecosystem around Kafka and Ha...
Strata 2017 (San Jose): Building a healthy data ecosystem around Kafka and Ha...
 
Building a healthy data ecosystem around Kafka and Hadoop: Lessons learned at...
Building a healthy data ecosystem around Kafka and Hadoop: Lessons learned at...Building a healthy data ecosystem around Kafka and Hadoop: Lessons learned at...
Building a healthy data ecosystem around Kafka and Hadoop: Lessons learned at...
 
RedisGraph A Low Latency Graph DB: Pieter Cailliau
RedisGraph A Low Latency Graph DB: Pieter CailliauRedisGraph A Low Latency Graph DB: Pieter Cailliau
RedisGraph A Low Latency Graph DB: Pieter Cailliau
 
GraphTalks Rome - The Italian Business Graph
GraphTalks Rome - The Italian Business GraphGraphTalks Rome - The Italian Business Graph
GraphTalks Rome - The Italian Business Graph
 
SC4 Workshop 1: Simon Scerri: Existing tools and technologies
SC4 Workshop 1: Simon Scerri: Existing tools and technologiesSC4 Workshop 1: Simon Scerri: Existing tools and technologies
SC4 Workshop 1: Simon Scerri: Existing tools and technologies
 
Produktdatenmanagement mit Neo4j
Produktdatenmanagement mit Neo4jProduktdatenmanagement mit Neo4j
Produktdatenmanagement mit Neo4j
 
La bi, l'informatique décisionnelle et les graphes
La bi, l'informatique décisionnelle et les graphesLa bi, l'informatique décisionnelle et les graphes
La bi, l'informatique décisionnelle et les graphes
 
In-Memory Computing Webcast. Market Predictions 2017
In-Memory Computing Webcast. Market Predictions 2017In-Memory Computing Webcast. Market Predictions 2017
In-Memory Computing Webcast. Market Predictions 2017
 
Scaling up your Analytics & Insights
Scaling up your Analytics & InsightsScaling up your Analytics & Insights
Scaling up your Analytics & Insights
 
Nodes2020 | Graph of enterprise_metadata | NEO4J Conference
Nodes2020 | Graph of enterprise_metadata | NEO4J ConferenceNodes2020 | Graph of enterprise_metadata | NEO4J Conference
Nodes2020 | Graph of enterprise_metadata | NEO4J Conference
 
It takes a village (to raise a ML model)
It takes a village (to raise a ML model)It takes a village (to raise a ML model)
It takes a village (to raise a ML model)
 

Dernier

Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...amitlee9823
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...amitlee9823
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxolyaivanovalion
 
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangaloreamitlee9823
 
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Standamitlee9823
 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxolyaivanovalion
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxolyaivanovalion
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% SecurePooja Nehwal
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightDelhi Call girls
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...amitlee9823
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxolyaivanovalion
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxolyaivanovalion
 
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...amitlee9823
 
ALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptxALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptxolyaivanovalion
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysismanisha194592
 

Dernier (20)

(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
 
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
 
Sampling (random) method and Non random.ppt
Sampling (random) method and Non random.pptSampling (random) method and Non random.ppt
Sampling (random) method and Non random.ppt
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
 
Predicting Loan Approval: A Data Science Project
Predicting Loan Approval: A Data Science ProjectPredicting Loan Approval: A Data Science Project
Predicting Loan Approval: A Data Science Project
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptx
 
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
 
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFx
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptx
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
 
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
 
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts ServiceCall Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptx
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFx
 
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
 
ALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptxALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptx
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 

Cerved Datascience Milan

  • 1. 25 Settembre 2017 With a datascience perspective Aview of graph data usage by Cerved Stefano Gatti – Head of Innovation and data sources Nunzio Pellegrino – Senior Data Scientist – Innovation team
  • 2. Cerved and its graphs in a nutshell
  • 3. 3 Cerved, in a nutshell The Italian data-driven company CREDIT INFORMATION Protection against credit risk MARKETING SOLUTIONS New business opportunities CREDIT MANAGEMENT Manage and collect performing and non-performing loans. Over 1000 a minute ü  Documents Over 40 million ü  Lines of code Over 30,000 ü  Customers Over 50 different ü  Data sources Over 10million a day ü  Api call Over 1,900 ü  People 377 million Eur (2016) ü  Revenue
  • 5. 5 Cerved, in a tech view Data Algorithms Solu0ons Towards algorithmic economy …
  • 6. 6 Cerved Graph Story 2011-12 - we started from an IT problem: reengineering of beneficial owner algorithm
  • 7. 7 Cerved Graph Story 2014-15 - we went through a more algorithmic problem: corporate linkages algorithm
  • 8. 8 Cerved Graph Story 2015-16 - we go with a “full stack” solution
  • 9. 9 Cerved Graph thoughts We strongly believe in … The power of linking data The power of analyzing data with network analysis The power of visualizing data in a different way To understand a li5le be5er the increasing complexity of modern world … also from an economic point of view
  • 10. Why a Graph Database?
  • 11. 11 What is a Graph?
  • 12. 12 Key Concepts Graph database NoSQL database Managing highly connected data and complex queries Flexible data model
  • 13. 13 Key Concepts Graph database Declarative or imperative language Horizontal Scaling Graph native storage and process
  • 14. 14 Where graphdb can be useful? “Hands-On Machine Learning with Scikit-Learn and TensorFlow” by Aurélien Géron
  • 15. 15 Maybe in the future… “Hands-On Machine Learning with Scikit-Learn and TensorFlow” by Aurélien Géron
  • 16. 16 Frame the Problem Data Model Simple1 Expressive2 3 Additive
  • 17. 17 RDBMS RDBMS vs Graph Data Model Graph
  • 18. 18 Store & Get Data Native graph storage Store Data Fast Write Performance Easy Data Integration: CSV Jdbc REST Api
  • 19. 19 Store & Get Data Native graph storage Store Data Fast Write Performance Easy Data Integration: Get Data Native graph processing à Index free adjacency CSV Jdbc REST Api Cypher, Declarative Language Driver: Python py2neo (unofficial) R (unofficial) Java APOC
  • 20. 20 Explore Data Transform implicit to explicitCypher (access points, pattern)
  • 21. 21 Explore Data Transform implicit to explicitCypher (access points, pattern)
  • 22. 22 Explore Data Transform implicit to explicitCypher (access points, pattern)
  • 23. 23 Prepare Data Feature Creation with parallel Graph algorithms Centralities •  Page Rank •  Betweenness Centrality •  Closeness Centrality Graph Partitioning •  Label Propagation •  Connected Components •  Strongly Connected Components Path Finding •  Minimum Weight Spanning Tree •  All Pairs- and Single Source Shortest Path
  • 24. 24 Prepare Data Feature Creation with parallel Graph algorithms Centralities •  Page Rank •  Betweenness Centrality •  Closeness Centrality Graph Partitioning •  Label Propagation •  Connected Components •  Strongly Connected Components Path Finding •  Minimum Weight Spanning Tree •  All Pairs- and Single Source Shortest Path Graph Size (GB) nodes (M) rels (M) PageRank (s) ConCom (s) LabelPropag (s) StrongConCom (s) Pokec 7.3 2 31 10 24 12 12 DBPedia 15 11 117 46 91 51 65 Graphs500-23 7.9 5 129 19 29 18 25 Twitter-2010 49 42 1468 349 353 405 339 soc-LifeJournal1 6.3 5 69 30 34 25 23 Friendster 62 66 1806 611 619 296 483 Performance
  • 25. 25 Present&Launch your solution Real time Recommendation Fraud Detection Social Network Analysis Search & Link Analysis Knowledge Graph Natural Language Process
  • 26. Nunzio Pellegrino S e n i o r D a t a S c i e n t i s t – Innovation Team nunzio.pellegrino@cerved.com Stefano Gatti Head of Innovation & Data Sources stefano.gatti@cerved.com