Modern Radiology for
  Distributed Systems



  Dietrich Featherston
  @d2fn
Thursday, October 11, 12
This is a talk about
                           monitoring




But not just any kind of
                           monitoring

                           Non-invasive monitoring



non-invasive monitoring

                           measures taken to describe the
                           state of a system with minimal
                           changes to the system being
                           monitored




[Chart: Insight vs. Invasiveness, with Radiographic Imagery plotted]

preventative care

                           measures taken to prevent
                           diseases or injuries rather than
                           curing them or treating their
                           symptoms




Non-invasive monitoring
                           techniques focus primarily
                           on host-based metrics

                           Why is this a problem?



Because applications are
                                 distributed




[Plot: information emitted vs. network size]

                           Information emitted about nodes in the network: n
                           Information emitted about edges in the network: n²
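
The gap between the two curves can be written out. This is a sketch under the assumption (not stated on the slide) that any node may talk to any other node and that each direction of traffic counts as its own edge:

```latex
\[
\text{nodes} = n,
\qquad
\text{directed edges} = n(n-1) = O(n^{2})
\]
\[
n = 100 \;\Rightarrow\; 100 \text{ nodes},\quad 100 \cdot 99 = 9900 \text{ possible edges}
\]
```

Host-based metrics describe only the first quantity; the second grows much faster as the network gets larger.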
We analyze cell structure
                           because we can’t envision
                              the whole organism

                           We react to disease and
                           injury because we lack
                              preventative care


We lack preventative care for
                      applications because our
                      non-invasive monitoring
                      techniques are growing less
                      and less meaningful



Radiology is useful in
                           illuminating non-invasive
                           monitoring of distributed
                           systems



Context is
                           everything



How do we use
                             context?


!!!   Context




                  Your Big
                 Dumb Data
Human
                                         brain
                           Diagnoses
                                           +
                                       med school




     Radiographic
       Imagery
E.T.     Signal
                                  Processing




              VLA Output
Application Topology
                                         Signal Processing
                                         Expert Brain

                           Application
                            Behavior




                              Network
                               Data
dimensions (11)       measurements (8)
     epoch seconds         egress packets
     epoch minutes         egress octets
     epoch hours           ingress packets
     node id               ingress octets
     source ip             retransmits
     source port           errors
     dest ip               app-rtt
     dest port             handshake-rtt
     interface
     country
     network/asn


Case Study #1

                           GC-Death of a distributed
                           JVM application




Case Study #2

                Symptoms:
                - Latent Riak handoff
                - Cluster throughput bottoming out



busy_dist_port




+zdbbl 8192
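
For readers who have not met these names: `busy_dist_port` is the Erlang system monitor event raised when a process is suspended because it sends to a busy distribution port, and `+zdbbl` is the Erlang VM flag that sets the distribution buffer busy limit in kilobytes. A sketch of how the setting might appear in a node's `vm.args` file; the file placement and the comments are assumptions, not taken from the talk:

```
## vm.args (exact location depends on the install; assumed here)
## Raise the distribution buffer busy limit to 8192 KB.
+zdbbl 8192
```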




Case Study #3

                           Bringing a dead Riak node
                           back online




Case Study #4

                           Retransmits 10% of total
                           network throughput




var put: HttpPut = null
try {
  // ... put data
}
catch {
  case e: Exception =>
    // ... handle exception
}
finally {
  // runs after successful puts too, not only after failures
  if (put != null) {
    put.abort()
  }
}


Source: http://hc.apache.org/httpcomponents-client-ga/httpclient/apidocs/org/apache/http/client/methods/HttpRequestBase.html#abort()




  abort
  public void abort()
  Description copied from interface: HttpUriRequest
  Aborts execution of the request.




                                                               THANKS
public void abort() {
    ClientConnectionRequest localRequest;
    ConnectionReleaseTrigger localTrigger;

    this.abortLock.lock();
    try {
        if (this.aborted) {
            return;
        }
        this.aborted = true;

        localRequest = connRequest;
        localTrigger = releaseTrigger;
    } finally {
        this.abortLock.unlock();
    }

    // Trigger the callbacks outside of the lock, to prevent
    // deadlocks in the scenario where the callbacks have
    // their own locks that may be used while calling
    // setReleaseTrigger or setConnectionRequest.
    if (localRequest != null) {
        localRequest.abortRequest();
    }
    if (localTrigger != null) {
        try {
            localTrigger.abortConnection();
        } catch (IOException ex) {
            // ignore
        }
    }
}
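
One plausible reading of this case study, which the slides do not spell out: because `put.abort()` sits in a `finally` block, it also runs after successful requests, so each connection is torn down instead of being reused. The toy model below illustrates that effect only. `Conn`, `Pool`, and `connectionsOpened` are hypothetical names invented for this sketch and are not HttpClient types.

```java
import java.util.ArrayDeque;
import java.util.Deque;

public class AbortVersusRelease {
    /** Toy stand-in for a pooled connection (hypothetical, not an HttpClient class). */
    static final class Conn {
        boolean open = true;
    }

    /** Toy pool that counts how many fresh connections it had to open. */
    static final class Pool {
        private final Deque<Conn> idle = new ArrayDeque<>();
        int opened = 0;

        Conn lease() {
            Conn c = idle.poll();
            if (c == null) {          // nothing reusable, open a new connection
                c = new Conn();
                opened++;
            }
            return c;
        }

        void release(Conn c) {        // hand a healthy connection back for reuse
            if (c.open) {
                idle.push(c);
            }
        }

        void abort(Conn c) {          // tear the connection down; it is never reused
            c.open = false;
        }
    }

    /** Runs the given number of successful requests and reports connections opened. */
    static int connectionsOpened(int requests, boolean abortAlways) {
        Pool pool = new Pool();
        for (int i = 0; i < requests; i++) {
            Conn c = pool.lease();
            boolean failed = false;   // every request succeeds in this toy run
            try {
                // ... send request
            } finally {
                if (abortAlways || failed) {
                    pool.abort(c);
                } else {
                    pool.release(c);
                }
            }
        }
        return pool.opened;
    }

    public static void main(String[] args) {
        System.out.println("abort in finally:   " + connectionsOpened(1000, true));
        System.out.println("release on success: " + connectionsOpened(1000, false));
    }
}
```

In this model, aborting unconditionally opens one connection per request, while releasing on success keeps reusing a single connection. Whether that churn explains the retransmits in the case study is an inference, not something the deck states.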

augmented intelligence
                           precedes
                           artificial intelligence




     1895  Wilhelm Röntgen discovers X-rays.
           First medical use of X-rays in human imaging
           takes place one month later.

     1905  First English text on chest radiography

     1920  Society of Radiographers formed
Recognition of radiology as
                           a formal medical discipline
                           was a cultural problem, not
                              a technology problem



                                    http://www.bshr.org.uk/page13.html
If you want to talk to me about the
        query language used to ask questions
        of the network data we collect at
        Boundary, talk to me afterward or hit
        me up on Twitter.


                     @d2fn
                     github.com/dietrichf

Find 45 minutes of total traffic seen on meters 1, 2, 226, & 301
starting 18 hours ago, broken down by peer ip. Retain top 10 by
the ratio of retransmits to packets.

    get volume_1s_meter_ip [
      meter in {1, 2, 226, 301};
      epochMillis from -18h for 45m;
    ]
    categorize
      sum(ingress) as ingress,
      sum(egress) as egress,
      sum(ingressPackets + egressPackets) as packets,
      sum(retransmits) as retransmits,
      mean(appRttUsec/1000) as appRttMs
    by epochMillis, ip
    retain
      top 10
      on retransmits/packets
      per epochMillis
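
To make the aggregation in this query concrete, here is a rough Java sketch of its group, sum, and top-N steps. It does not reproduce Boundary's query engine. The `Sample` and `Row` classes and the `topByRetransmitRatio` method are hypothetical, and only the packet and retransmit columns are modeled (the meter filter, time window, octet sums, and RTT mean are left out).

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

public class RetransmitTopN {
    /** One raw sample; field names mirror the query, the class is hypothetical. */
    static final class Sample {
        final long epochMillis;
        final String ip;
        final long ingressPackets, egressPackets, retransmits;

        Sample(long epochMillis, String ip, long ingressPackets,
               long egressPackets, long retransmits) {
            this.epochMillis = epochMillis;
            this.ip = ip;
            this.ingressPackets = ingressPackets;
            this.egressPackets = egressPackets;
            this.retransmits = retransmits;
        }
    }

    /** Accumulated sums for one (epochMillis, ip) pair. */
    static final class Row {
        final long epochMillis;
        final String ip;
        long packets;
        long retransmits;

        Row(long epochMillis, String ip) {
            this.epochMillis = epochMillis;
            this.ip = ip;
        }

        double ratio() {
            return packets == 0 ? 0.0 : (double) retransmits / packets;
        }
    }

    static Map<Long, List<Row>> topByRetransmitRatio(List<Sample> samples, int n) {
        // "categorize ... by epochMillis, ip": one accumulator per (timestamp, peer)
        Map<Long, Map<String, Row>> grouped = new TreeMap<>();
        for (Sample s : samples) {
            Row row = grouped
                .computeIfAbsent(s.epochMillis, t -> new LinkedHashMap<>())
                .computeIfAbsent(s.ip, ip -> new Row(s.epochMillis, ip));
            row.packets += s.ingressPackets + s.egressPackets;
            row.retransmits += s.retransmits;
        }
        // "retain top n on retransmits/packets per epochMillis"
        Map<Long, List<Row>> result = new TreeMap<>();
        for (Map.Entry<Long, Map<String, Row>> e : grouped.entrySet()) {
            List<Row> rows = new ArrayList<>(e.getValue().values());
            rows.sort(Comparator.comparingDouble(Row::ratio).reversed());
            result.put(e.getKey(), new ArrayList<>(rows.subList(0, Math.min(n, rows.size()))));
        }
        return result;
    }

    public static void main(String[] args) {
        List<Sample> samples = List.of(
            new Sample(1000L, "10.0.0.1", 60, 40, 10),
            new Sample(1000L, "10.0.0.2", 50, 50, 1));
        for (Row r : topByRetransmitRatio(samples, 10).get(1000L)) {
            System.out.println(r.ip + " " + r.ratio());
        }
    }
}
```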
