SlideShare une entreprise Scribd logo
1  sur  32
Télécharger pour lire hors ligne
How I Hacked My Energy Monitor:
Lessons in Data Analytics Pipelines
Sho Fola Soboyejo
MeasureCamp San Francisco
July 2017
About Me
Books in 6 Months
Cincinnati Machine Learning @shoreason
www.shoreason.com
github.com/shoreason
The Genesis
• Sunday Chat
• Sustainable Energy
• Sentient Home
Getting to The Data
Energy Company Options:
• No API
• Data Download (Not Realtime)
• Tight Coupling
Lesson
No ready packaged full solution = Opportunity
Building the ‘Novel’ on What Exists – Go find your bricks
Photo Credit: Justin Hamilton
Where I Ended
• MQTT
• Kafka
• ELK Stack
• Cassandra
• 1 Linux box
• Programming language choice –  Python
• Sensor Peripherals (Efergy Electricity
Monitor and R820 Tuner)
Getting to the Source
• Updates Every 10
Secs
• Understand the
impact of each
event
• Getting to data is
necessary … don’t
be afraid to ask for
expert help
Elite Classic
photo credit: Jude
My Bricks
Efergy Monitor
Intercepting Data
R820 Tuner RTL-SDR (USB Dongle)
• Software Defined Radio
• A computer based radio scanner
• Enables easy signal processing of TV and
Radio Signals
Decoding the Data
• C program to
decode
baseband data
• New tools and
skillsets for data
transformation
• Polyglot
• Frequency -
4331.51Mhz
• Bandwidth -
200khz
• Resampled to
96kHz
• Tuner gain -
49.6dB
Decoding the Data
My Bricks
Efergy Monitor R820 Tuner C - Adapter
Moving Data Efficiently
1st Class Citizen Data
• Data that is critical to value stream needs
to be shared
• Enterprise Options:
• ETL (Extract-Transform-Load)
• APIs
• Data Stream
Finding a Dealer or Broker
Requirements Met by Both
• Support for data streams (pub/sub)
• Decouple consumers from producers
• Offer measure of reliability (QoS 1, 2 &3)
• Address messaging needs
• Forward compatibility (accommodate new
consumers)
The Smart Broker for Dumb Clients
A TCP protocol made to support lightweight
messages (2 byte overhead per message)
• Easy to parse for computers (length prefixed
length)
• Broker keeps track of client state (Last Will
Message)
• Guarantees message delivery through retries
• Security (usr/pwd over TLS)
• Dynamic topics
• Arduino or Raspberry Pi
The Dumb Broker for Smart Clients
A distributed commit log with support for high
throughput and message durability even at scale
• Distributed Log aggregator
• Client keeps track of what messages it has
processed
• Meant for server side environments (server to
server)
• Static topics (topic - static, key - dynamic)
• Short window of persistence
Lesson
• Look at your options
• Avoid paralysis of analysis
• Start with what you know … you can
always optimize later
• Guard against premature optimization
My Bricks
Efergy Monitor R820 Tuner C - Adapter
Python Adapter
Drawing Insights with Visualization
• A way to interact with data
• Monitor data
• Starting point for decision making
• ELK Stack
Logstash Elastic Kibana
• What time of the day do I use the most power?
• Trend by day of the week
• Energy spikes? (Where, When and What)
Are there Anomalies?
• First define Normal
• Normal
• Something behaves in a consistent way with
respect to itself over time
• Something behaves in a consistent way with
respect to things around it
• Anomaly
• Change with respect to self as a function of time
• Relative diff compared to peers within a
population
Using Unsupervised Learning
• Very small sample of +ve scenarios (above
threshold) : 0 -20
• Large number of -ve samples
• Where future anomalies may look nothing
like past anomalies or samples in training
set
• Gaussian (normal) distribution
Why Not Use Supervised
• You have some labels, why not use
supervised?
• Supervised better when there’s a large
number of both -ve and +ve labels
• Supervised Algorithm needs enough view
into what an anomaly could possibly be
• Future anomalies might look nothing like
past anomalies
• Anomaly detection using Machine Learning
• ML model based on Bayesian Stats
• Forgetting to unplug the iron or stove left on
Feature Selection
• Transforming features to be more
Gaussian
• Features that might take unusual large or
small values in the event of an anomaly
My Bricks
Efergy Monitor R820 Tuner C - Adapter
Python Adapter Logstash Elastic Kibana
Being Persistent
• Losing data sucks (My Presentation)
• Historical analysis
• Exposing data to other clients
Snapshot
Lessons
• Pay attention and be inspired by side
conversations
• Don’t be overwhelmed by the idea … take
one step, the first step
Thank You
@shoreason
www.shoreason.com
github.com/shoreason
https://www.linkedin.com/in/shofola

Contenu connexe

Tendances

Batch Indexing & Near Real Time, keeping things fast
Batch Indexing & Near Real Time, keeping things fastBatch Indexing & Near Real Time, keeping things fast
Batch Indexing & Near Real Time, keeping things fastMarc Sturlese
 
IBM Extreme Blue FTP Discovery Week 2 Presentation
IBM Extreme Blue FTP Discovery Week 2 PresentationIBM Extreme Blue FTP Discovery Week 2 Presentation
IBM Extreme Blue FTP Discovery Week 2 PresentationUniversity of St Andrews
 
Building High Performance Apps with In-Memory Data
Building High Performance Apps with In-Memory DataBuilding High Performance Apps with In-Memory Data
Building High Performance Apps with In-Memory DataAmazon Web Services
 
Hands-on Lab Building High Performance Apps with In-Memory Data
Hands-on Lab Building High Performance Apps with In-Memory DataHands-on Lab Building High Performance Apps with In-Memory Data
Hands-on Lab Building High Performance Apps with In-Memory DataAmazon Web Services
 
Distributed Tracing
Distributed TracingDistributed Tracing
Distributed Tracingsoasme
 
[NetPonto] NoSQL em Windows Azure Table Storage
[NetPonto] NoSQL em Windows Azure Table Storage[NetPonto] NoSQL em Windows Azure Table Storage
[NetPonto] NoSQL em Windows Azure Table StorageVitor Tomaz
 
Machine Learning Deep Dive
Machine Learning Deep DiveMachine Learning Deep Dive
Machine Learning Deep DiveElasticsearch
 
Monitoring & alerting presentation sabin&mustafa
Monitoring & alerting presentation sabin&mustafaMonitoring & alerting presentation sabin&mustafa
Monitoring & alerting presentation sabin&mustafaLama K Banna
 
Artik cloud deview 2016
Artik cloud   deview 2016Artik cloud   deview 2016
Artik cloud deview 2016NAVER D2
 
How Much Can You Connect? | Bhavesh Raheja, Disney + Hotstar
How Much Can You Connect? | Bhavesh Raheja, Disney + HotstarHow Much Can You Connect? | Bhavesh Raheja, Disney + Hotstar
How Much Can You Connect? | Bhavesh Raheja, Disney + HotstarHostedbyConfluent
 
Stream processing using Apache Storm - Big Data Meetup Athens 2016
Stream processing using Apache Storm - Big Data Meetup Athens 2016Stream processing using Apache Storm - Big Data Meetup Athens 2016
Stream processing using Apache Storm - Big Data Meetup Athens 2016Adrianos Dadis
 
Elastic v5.0.0 Update uptoalpha3 v0.2 - 김종민
Elastic v5.0.0 Update uptoalpha3 v0.2 - 김종민Elastic v5.0.0 Update uptoalpha3 v0.2 - 김종민
Elastic v5.0.0 Update uptoalpha3 v0.2 - 김종민NAVER D2
 
MQTT - REST Bridge using the Smart Object API
MQTT - REST Bridge using the Smart Object APIMQTT - REST Bridge using the Smart Object API
MQTT - REST Bridge using the Smart Object APIMichael Koster
 

Tendances (14)

Batch Indexing & Near Real Time, keeping things fast
Batch Indexing & Near Real Time, keeping things fastBatch Indexing & Near Real Time, keeping things fast
Batch Indexing & Near Real Time, keeping things fast
 
IBM Extreme Blue FTP Discovery Week 2 Presentation
IBM Extreme Blue FTP Discovery Week 2 PresentationIBM Extreme Blue FTP Discovery Week 2 Presentation
IBM Extreme Blue FTP Discovery Week 2 Presentation
 
Building High Performance Apps with In-Memory Data
Building High Performance Apps with In-Memory DataBuilding High Performance Apps with In-Memory Data
Building High Performance Apps with In-Memory Data
 
Hands-on Lab Building High Performance Apps with In-Memory Data
Hands-on Lab Building High Performance Apps with In-Memory DataHands-on Lab Building High Performance Apps with In-Memory Data
Hands-on Lab Building High Performance Apps with In-Memory Data
 
Distributed Tracing
Distributed TracingDistributed Tracing
Distributed Tracing
 
Distributed Tracing
Distributed TracingDistributed Tracing
Distributed Tracing
 
[NetPonto] NoSQL em Windows Azure Table Storage
[NetPonto] NoSQL em Windows Azure Table Storage[NetPonto] NoSQL em Windows Azure Table Storage
[NetPonto] NoSQL em Windows Azure Table Storage
 
Machine Learning Deep Dive
Machine Learning Deep DiveMachine Learning Deep Dive
Machine Learning Deep Dive
 
Monitoring & alerting presentation sabin&mustafa
Monitoring & alerting presentation sabin&mustafaMonitoring & alerting presentation sabin&mustafa
Monitoring & alerting presentation sabin&mustafa
 
Artik cloud deview 2016
Artik cloud   deview 2016Artik cloud   deview 2016
Artik cloud deview 2016
 
How Much Can You Connect? | Bhavesh Raheja, Disney + Hotstar
How Much Can You Connect? | Bhavesh Raheja, Disney + HotstarHow Much Can You Connect? | Bhavesh Raheja, Disney + Hotstar
How Much Can You Connect? | Bhavesh Raheja, Disney + Hotstar
 
Stream processing using Apache Storm - Big Data Meetup Athens 2016
Stream processing using Apache Storm - Big Data Meetup Athens 2016Stream processing using Apache Storm - Big Data Meetup Athens 2016
Stream processing using Apache Storm - Big Data Meetup Athens 2016
 
Elastic v5.0.0 Update uptoalpha3 v0.2 - 김종민
Elastic v5.0.0 Update uptoalpha3 v0.2 - 김종민Elastic v5.0.0 Update uptoalpha3 v0.2 - 김종민
Elastic v5.0.0 Update uptoalpha3 v0.2 - 김종민
 
MQTT - REST Bridge using the Smart Object API
MQTT - REST Bridge using the Smart Object APIMQTT - REST Bridge using the Smart Object API
MQTT - REST Bridge using the Smart Object API
 

Similaire à Data management and pipelines Measure Camp - San Francisco

Building Big Data Streaming Architectures
Building Big Data Streaming ArchitecturesBuilding Big Data Streaming Architectures
Building Big Data Streaming ArchitecturesDavid Martínez Rego
 
Fixing twitter
Fixing twitterFixing twitter
Fixing twitterRoger Xia
 
Fixing Twitter Improving The Performance And Scalability Of The Worlds Most ...
Fixing Twitter  Improving The Performance And Scalability Of The Worlds Most ...Fixing Twitter  Improving The Performance And Scalability Of The Worlds Most ...
Fixing Twitter Improving The Performance And Scalability Of The Worlds Most ...smallerror
 
Fixing Twitter Improving The Performance And Scalability Of The Worlds Most ...
Fixing Twitter  Improving The Performance And Scalability Of The Worlds Most ...Fixing Twitter  Improving The Performance And Scalability Of The Worlds Most ...
Fixing Twitter Improving The Performance And Scalability Of The Worlds Most ...xlight
 
Data pipelines and anomaly detection
Data pipelines and anomaly detectionData pipelines and anomaly detection
Data pipelines and anomaly detectionSho Fola Soboyejo
 
John adams talk cloudy
John adams   talk cloudyJohn adams   talk cloudy
John adams talk cloudyJohn Adams
 
DataEngConf SF16 - Scalable and Reliable Logging at Pinterest
DataEngConf SF16 - Scalable and Reliable Logging at PinterestDataEngConf SF16 - Scalable and Reliable Logging at Pinterest
DataEngConf SF16 - Scalable and Reliable Logging at PinterestHakka Labs
 
Scalable and Reliable Logging at Pinterest
Scalable and Reliable Logging at PinterestScalable and Reliable Logging at Pinterest
Scalable and Reliable Logging at PinterestKrishna Gade
 
Huawei Advanced Data Science With Spark Streaming
Huawei Advanced Data Science With Spark StreamingHuawei Advanced Data Science With Spark Streaming
Huawei Advanced Data Science With Spark StreamingJen Aman
 
Drinking from the Firehose - Real-time Metrics
Drinking from the Firehose - Real-time MetricsDrinking from the Firehose - Real-time Metrics
Drinking from the Firehose - Real-time MetricsSamantha Quiñones
 
Mining Big Data Streams with APACHE SAMOA
Mining Big Data Streams with APACHE SAMOAMining Big Data Streams with APACHE SAMOA
Mining Big Data Streams with APACHE SAMOAAlbert Bifet
 
Cloud Security Monitoring and Spark Analytics
Cloud Security Monitoring and Spark AnalyticsCloud Security Monitoring and Spark Analytics
Cloud Security Monitoring and Spark Analyticsamesar0
 
Real Time Big Data Processing on AWS
Real Time Big Data Processing on AWSReal Time Big Data Processing on AWS
Real Time Big Data Processing on AWSCaserta
 
Azure Messaging Crossroads
Azure Messaging CrossroadsAzure Messaging Crossroads
Azure Messaging CrossroadsSean Feldman
 
A Production Quality Sketching Library for the Analysis of Big Data
A Production Quality Sketching Library for the Analysis of Big DataA Production Quality Sketching Library for the Analysis of Big Data
A Production Quality Sketching Library for the Analysis of Big DataDatabricks
 
Data Pipelines with Spark & DataStax Enterprise
Data Pipelines with Spark & DataStax EnterpriseData Pipelines with Spark & DataStax Enterprise
Data Pipelines with Spark & DataStax EnterpriseDataStax
 

Similaire à Data management and pipelines Measure Camp - San Francisco (20)

Building Big Data Streaming Architectures
Building Big Data Streaming ArchitecturesBuilding Big Data Streaming Architectures
Building Big Data Streaming Architectures
 
Fixing twitter
Fixing twitterFixing twitter
Fixing twitter
 
Fixing_Twitter
Fixing_TwitterFixing_Twitter
Fixing_Twitter
 
Fixing Twitter Improving The Performance And Scalability Of The Worlds Most ...
Fixing Twitter  Improving The Performance And Scalability Of The Worlds Most ...Fixing Twitter  Improving The Performance And Scalability Of The Worlds Most ...
Fixing Twitter Improving The Performance And Scalability Of The Worlds Most ...
 
Fixing Twitter Improving The Performance And Scalability Of The Worlds Most ...
Fixing Twitter  Improving The Performance And Scalability Of The Worlds Most ...Fixing Twitter  Improving The Performance And Scalability Of The Worlds Most ...
Fixing Twitter Improving The Performance And Scalability Of The Worlds Most ...
 
Data pipelines and anomaly detection
Data pipelines and anomaly detectionData pipelines and anomaly detection
Data pipelines and anomaly detection
 
John adams talk cloudy
John adams   talk cloudyJohn adams   talk cloudy
John adams talk cloudy
 
DataEngConf SF16 - Scalable and Reliable Logging at Pinterest
DataEngConf SF16 - Scalable and Reliable Logging at PinterestDataEngConf SF16 - Scalable and Reliable Logging at Pinterest
DataEngConf SF16 - Scalable and Reliable Logging at Pinterest
 
Scalable and Reliable Logging at Pinterest
Scalable and Reliable Logging at PinterestScalable and Reliable Logging at Pinterest
Scalable and Reliable Logging at Pinterest
 
Shikha fdp 62_14july2017
Shikha fdp 62_14july2017Shikha fdp 62_14july2017
Shikha fdp 62_14july2017
 
Debunking Common Myths in Stream Processing
Debunking Common Myths in Stream ProcessingDebunking Common Myths in Stream Processing
Debunking Common Myths in Stream Processing
 
Huawei Advanced Data Science With Spark Streaming
Huawei Advanced Data Science With Spark StreamingHuawei Advanced Data Science With Spark Streaming
Huawei Advanced Data Science With Spark Streaming
 
Drinking from the Firehose - Real-time Metrics
Drinking from the Firehose - Real-time MetricsDrinking from the Firehose - Real-time Metrics
Drinking from the Firehose - Real-time Metrics
 
Ceilosca
CeiloscaCeilosca
Ceilosca
 
Mining Big Data Streams with APACHE SAMOA
Mining Big Data Streams with APACHE SAMOAMining Big Data Streams with APACHE SAMOA
Mining Big Data Streams with APACHE SAMOA
 
Cloud Security Monitoring and Spark Analytics
Cloud Security Monitoring and Spark AnalyticsCloud Security Monitoring and Spark Analytics
Cloud Security Monitoring and Spark Analytics
 
Real Time Big Data Processing on AWS
Real Time Big Data Processing on AWSReal Time Big Data Processing on AWS
Real Time Big Data Processing on AWS
 
Azure Messaging Crossroads
Azure Messaging CrossroadsAzure Messaging Crossroads
Azure Messaging Crossroads
 
A Production Quality Sketching Library for the Analysis of Big Data
A Production Quality Sketching Library for the Analysis of Big DataA Production Quality Sketching Library for the Analysis of Big Data
A Production Quality Sketching Library for the Analysis of Big Data
 
Data Pipelines with Spark & DataStax Enterprise
Data Pipelines with Spark & DataStax EnterpriseData Pipelines with Spark & DataStax Enterprise
Data Pipelines with Spark & DataStax Enterprise
 

Dernier

CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxolyaivanovalion
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptxAnupama Kate
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% SecurePooja Nehwal
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxolyaivanovalion
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxolyaivanovalion
 
Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSAishani27
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAroojKhan71
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxolyaivanovalion
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998YohFuh
 
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxBPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxMohammedJunaid861692
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxJohnnyPlasten
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz1
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxStephen266013
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysismanisha194592
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Delhi Call girls
 
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiVIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiSuhani Kapoor
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 

Dernier (20)

CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptx
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptx
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptx
 
Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICS
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
 
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptx
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998
 
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxBPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptx
 
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signals
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docx
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
 
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiVIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 

Data management and pipelines Measure Camp - San Francisco

  • 1. How I Hacked My Energy Monitor: Lessons in Data Analytics Pipelines Sho Fola Soboyejo MeasureCamp San Francisco July 2017
  • 2. About Me Books in 6 Months Cincinnati Machine Learning @shoreason www.shoreason.com github.com/shoreason
  • 3. The Genesis • Sunday Chat • Sustainable Energy • Sentient Home
  • 4. Getting to The Data Energy Company Options: • No API • Data Download (Not Realtime) • Tight Coupling
  • 5. Lesson No ready packaged full solution = Opportunity Building the ‘Novel’ on What Exists – Go find your bricks Photo Credit: Justin Hamilton
  • 6. Where I Ended • MQTT • Kafka • ELK Stack • Cassandra • 1 Linux box • Programming language choice –  Python • Sensor Peripherals (Efergy Electricity Monitor and R820 Tuner)
  • 7. Getting to the Source • Updates Every 10 Secs • Understand the impact of each event • Getting to data is necessary … don’t be afraid to ask for expert help Elite Classic photo credit: Jude
  • 9. Intercepting Data R820 Tuner RTL-SDR (USB Dongle) • Software Defined Radio • A computer based radio scanner • Enables easy signal processing of TV and Radio Signals
  • 10. Decoding the Data • C program to decode baseband data • New tools and skillsets for data transformation • Polyglot
  • 11. • Frequency - 4331.51Mhz • Bandwidth - 200khz • Resampled to 96kHz • Tuner gain - 49.6dB Decoding the Data
  • 12. My Bricks Efergy Monitor R820 Tuner C - Adapter
  • 14. 1st Class Citizen Data • Data that is critical to value stream needs to be shared • Enterprise Options: • ETL (Extract-Transform-Load) • APIs • Data Stream
  • 15. Finding a Dealer or Broker
  • 16. Requirements Met by Both • Support for data streams (pub/sub) • Decouple consumers from producers • Offer measure of reliability (QoS 1, 2 &3) • Address messaging needs • Forward compatibility (accommodate new consumers)
  • 17. The Smart Broker for Dumb Clients A TCP protocol made to support lightweight messages (2 byte overhead per message) • Easy to parse for computers (length prefixed length) • Broker keeps track of client state (Last Will Message) • Guarantees message delivery through retries • Security (usr/pwd over TLS) • Dynamic topics • Arduino or Raspberry Pi
  • 18. The Dumb Broker for Smart Clients A distributed commit log with support for high throughput and message durability even at scale • Distributed Log aggregator • Client keeps track of what messages it has processed • Meant for server side environments (server to server) • Static topics (topic - static, key - dynamic) • Short window of persistence
  • 19. Lesson • Look at your options • Avoid paralysis of analysis • Start with what you know … you can always optimize later • Guard against premature optimization
  • 20. My Bricks Efergy Monitor R820 Tuner C - Adapter Python Adapter
  • 21. Drawing Insights with Visualization • A way to interact with data • Monitor data • Starting point for decision making • ELK Stack Logstash Elastic Kibana
  • 22. • What time of the day do I use the most power? • Trend by day of the week • Energy spikes? (Where, When and What)
  • 23. Are there Anomalies? • First define Normal • Normal • Something behaves in a consistent way with respect to itself over time • Something behaves in a consistent way with respect to things around it • Anomaly • Change with respect to self as a function of time • Relative diff compared to peers within a population
  • 24. Using Unsupervised Learning • Very small sample of +ve scenarios (above threshold) : 0 -20 • Large number of -ve samples • Where future anomalies may look nothing like past anomalies or samples in training set • Gaussian (normal) distribution
  • 25. Why Not Use Supervised • You have some labels, why not use supervised? • Supervised better when there’s a large number of both -ve and +ve labels • Supervised Algorithm needs enough view into what an anomaly could possibly be • Future anomalies might look nothing like past anomalies
  • 26. • Anomaly detection using Machine Learning • ML model based on Bayesian Stats • Forgetting to unplug the iron or stove left on
  • 27. Feature Selection • Transforming features to be more Gaussian • Features that might take unusual large or small values in the event of an anomaly
  • 28. My Bricks Efergy Monitor R820 Tuner C - Adapter Python Adapter Logstash Elastic Kibana
  • 29. Being Persistent • Losing data sucks (My Presentation) • Historical analysis • Exposing data to other clients
  • 31. Lessons • Pay attention and be inspired by side conversations • Don’t be overwhelmed by the idea … take one step, the first step