This presentation introduces Titan, Faunus, and scalable graph computing in general. We present a case study of how Pearson builds an education social network on top of Titan, Faunus, and Cassandra to support learning in the 21st century.
Titan is an open source distributed graph database build on top of Cassandra that can power real-time applications with thousands of concurrent users over graphs with billions of edges. Faunus is an open source global graph processing engine build on top of Hadoop and compatible with Cassandra that can analyze graphs, compute graph statistics, and execute global traversals. Titan and Faunus are components of the Aurelius Graph Cluster which enables scalable graph computation and powers applications in social networking, recommendation engines, advertisement optimization, knowledge representation, health care, education, and security.
23. Transaction Description Avg (ms) Stdev (ms)
Student retrieves all content for a
single course in their course list
279.32 81.83
Student follows another student 193.72 22.77
Student is recommended people
to follow
241.33 256.48
Student reads their stream and
shares an item with followers
284.07 68.20
Student retrieves their profile 53.740 22.61
Student reads the most recent
comments for their courses
211.07 45.56
26. label id +
direction
primary key
edge id
Δ
vertex id
signature
properties
other
properties
Edge Representation
Column
Value
=IGJL?MM?> M?LC;FCT?> I<D?=NM
P;LC;<F? FIHA ?H=I>CHA
28. Aurelius Graph Cluster
Stores a massive-scale
property graph allowing real-
time traversals and updates
Batch processing of large
graphs with Hadoop
Runs global graph algorithms
on large, compressed,
in-memory graphs
Map/Reduce
Load & Compress
Analysis results
back into Titan
Bulk Load
TITAN FAUNUS FULGORA
Apache 2
aureliusgraphs@googlegroups.com
titan.thinkaurelius.com
faunus.thinkaurelius.com