Presented at the First openCypher Implementers Meeting in Walldorf, Germany, February 2017 @ http://www.opencypher.org/blog/2017/03/31/first-ocim-blog/
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
Extended Property Graphs and Cypher on Gradoop
1. Extended Property Graphs and Cypher on Gradoop
1st openCypher Implementers Meeting
8 February 2017
Walldorf, Germany
Martin Junghanns
University of Leipzig – Database Research Group
2. Extended Property Graphs and Cypher on Gradoop – 1st openCypher Implementers Meeting – Martin Junghanns 2
Gradoop Extended Property Graphs ConclusionCypher on Gradoop
2
Gradoop
„An open-source graph dataflow system for
declarative analytics of heterogeneous graph data.“
3. Extended Property Graphs and Cypher on Gradoop – 1st openCypher Implementers Meeting – Martin Junghanns 3
Gradoop Extended Property Graphs ConclusionCypher on Gradoop
3
Gradoop
Distributed Graph Storage (Apache HDFS)
Apache Flink Operator Implementation
Distributed Operator Execution (Apache Flink)
Extended Property Graph Model (EPGM)
Graph Dataflow Operators
I/O
4. Extended Property Graphs and Cypher on Gradoop – 1st openCypher Implementers Meeting – Martin Junghanns 4
Gradoop Extended Property Graphs ConclusionCypher on Gradoop
4
Extended Property Graphs
• Vertices and directed Edges
• Logical Graphs
• Identifiers
• Type Labels
• Properties
1 2
3
4
5
1 2 3
4
5
2
1
Hobbit
name : Samwise
Orc
name : Azog
Clan
name : Tribes of Moria
founded : 1981
Orc
name : Bolg
Hobbit
name : Frodo
yob : 2968
leaderOf
since : 2790
memberOf
since : 2013
hates
since : 2301
hates
knows
since : 2990
|Area|title:Mordor
|Area|title:Shire
8. Extended Property Graphs and Cypher on Gradoop – 1st openCypher Implementers Meeting – Martin Junghanns 8
Gradoop Extended Property Graphs ConclusionCypher on Gradoop
8
Cypher on Gradoop
„Which two clan leaders hate each other and one
of them knows Frodo over one to ten hops?“
9. Extended Property Graphs and Cypher on Gradoop – 1st openCypher Implementers Meeting – Martin Junghanns 9
Gradoop Extended Property Graphs ConclusionCypher on Gradoop
9
Cypher on Gradoop
15. Extended Property Graphs and Cypher on Gradoop – 1st openCypher Implementers Meeting – Martin Junghanns 15
Gradoop Extended Property Graphs ConclusionCypher on Gradoop
15
Cypher on Gradoop
16. Extended Property Graphs and Cypher on Gradoop – 1st openCypher Implementers Meeting – Martin Junghanns 16
Gradoop Extended Property Graphs ConclusionCypher on Gradoop
16
Cypher on Gradoop
17. Extended Property Graphs and Cypher on Gradoop – 1st openCypher Implementers Meeting – Martin Junghanns 17
Gradoop Extended Property Graphs ConclusionCypher on Gradoop
17
Cypher on Gradoop
18. Extended Property Graphs and Cypher on Gradoop – 1st openCypher Implementers Meeting – Martin Junghanns 18
Gradoop Extended Property Graphs ConclusionCypher on Gradoop
18
Conclusion
• Implement Cypher Technology Compatibility KIT (TCK) integration tests
• Benchmarking
• Implement and evaluate LDBC benchmarking queries
• Optimizations
• DP-Planner
• Improve cost model (more statistics, Flink optimizer hints)
• Reuse of intermediate results
• Consider graph partitioning
• Support more Cypher features
• e.g. Aggregation and Functions
• Introduce new Cypher features
• e.g. regular path queries
19. Extended Property Graphs and Cypher on Gradoop – 1st openCypher Implementers Meeting – Martin Junghanns 19
Gradoop Extended Property Graphs ConclusionCypher on Gradoop
19
Conclusion
• Gradoop on Apache Flink
• Extended Property Graph abstraction on Apache Flink
• Schema flexible: Type Labels and Properties
• Logical Graphs / Graphs Collection
• Graph Transformations for Graphs and Graph collections
• Cypher on Gradoop
• Covering many Cypher features (variable length paths, predicates)
• Query execution engine incl. Greedy cost-based optimizer
• Physical operators mapped to Flink transformations
20. www.gradoop.com
[1] Junghanns, M.; Petermann, A.; K.; Rahm, E., „Distributed Grouping of Property Graphs with Gradoop“,
Proc. BTW Conf. , 2017.
[2] Petermann, A.; Junghanns, M.; Kemper, S.; Gomez, K.; Teichmann, N.; Rahm, E., „Graph Mining for Complex Data Analytics “,
Proc. ICDM Conf. (Demo), 2016.
[3] Junghanns, M.; Petermann, A.; Teichmann, N.; Gomez, K.; Rahm, E., „Analyzing Extended Property Graphs with Apache Flink“,
Int. Workshop on Network Data Analytics (NDA), SIGMOD, 2016.
[4] Petermann, A.; Junghanns, M., „Scalable Business Intelligence with Graph Collections“,
it – Special Issue on Big Data Analytics, 2016.
[5] Petermann, A.; Junghanns, M.; Müller, M.; Rahm, E., „Graph-based Data Integration and Business Intelligence with BIIIG“,
Proc. VLDB Conf. (Demo), 2014.