2. TOPOLOGY I
• "When a truth is necessary, the reason for it can be
found by analysis, that is, by resolving it into simpler
ideas and truths until the primary ones are reached."
- Leibniz
3. TOPOLOGY II
• Topology is the mathematical study of topological
spaces.
• Topology is interested in shapes,
• More specifically: the concept of 'connectedness'
4. TOPOLOGY III
• A topologist is someone who does not see the
difference between a coffee mug and a donut.
5. HISTORY I
• “Nothing at all takes place in the universe in which
some rule of maximum or minimum does not
appear.” - Euler
• Seven Bridges of Koningsbrucke: devise a walk
through the city that would cross each bridge
once and only once.
7. HISTORY III
• Euler's big insights:
• Doesn’t matter where you start walking, only matters which bridges
you cross.
• A similar solution should be found, regardless where you start your
walk.
• only the connectedness of bridges matter,
• a solution should also apply to all other bridges that are connected
in a similar fashion, no matter the distances between them.
8. HISTORY IV
• We now call these graph walks ‘Eulerian walks’ in
Euler’s honor.
• Euler's first proven graph theory theorem:
• 'Euler walks' are possible if exactly zero or two nodes
have an odd number of edges.
9. TDA I
• TDA marries 300-year old maths with
modern data analysis.
• Captures the shape of data
• Is invariant
• Compresses large datasets
• Functions well in the presence of noise / missing variables
10. TDA II
• Capturing the shape of data
•Traditional techniques like clustering or dimensionality reduction have
trouble capturing this shape.
11. TDA III
• Invariance.
• Euler showed that only connectedness matters.The size, position, or
pose of an object doesn't change that object.
12. TDA IV
• Compression.
• Compressed representations use
the order in data.
• Only order can be compressed.
• Random noise or slight variations
are ignored.
• Lossy compression retains the most
important features.
• "Now where there are no parts, there neither extension, nor shape, nor divisibility is possible.
And these monads are the true atoms of nature and, in a word, the elements of things." - Leibniz
13. MAPPER I
• Mapper was created by Ayasdi Co-founder
Gurjeet Singh during his PhD under Gunnar
Carlsson.
• Based on the idea of partial clustering of the data
guided by a set of functions defined on the data.
15. MAPPER III
• Map the data with overlapping intervals.
• Cluster the points inside the intervals
• When clusters share data points draw an edge
• Color nodes by function
19. FUNCTIONS
• Raw features or point-cloud axis / coordinates
• Statistics: Mean, Max, Skewness, etc.
• Mathematics: L2-norm, FourierTransform, etc.
• Machine Learning: t-SNE, PCA, out-of-fold preds
• Deep Learning: Layer activations, embeddings
20. CLUSTER ALGO’S
• DBSCAN / HDBSCAN:
• Handles noise well.
• No need to set number of clusters.
• K-Means:
• Creates visually nice simplicial complexes/graphs
21. SOME GENERAL USE CASES
• ComputerVision
• Model and feature inspection
• Computational Biology / Healthcare
• Persistent Homology
29. ACCURATE INTERPRETABLE
MODELS
• Create: global linear model
• Function: L2-norm
• Color: Heatmap by ground truth and animate to out-of-fold model predictions
• Identify: Low accuracy sub graphs
• Select: Features that are most important for sub graphs
• Create: Local linear models on sub graphs
• Stack: DecisionTree
• Compare: Divide-and-Conquer and LIME
• DEMO