Sensing the world with
Data of Things
By: Sriskandarajah Suhothayan (Suho)
Technical Lead at WSO2
@suhothayan
suho@wso2.com
STRUCTURE DATA 2016
MARCH 9 - 10 • SAN FRANCISCO
Any customer can have a car
painted any colour that he wants
so long as it is black
~ Henry Ford ~
Me Me Me !!!
Your customers want to have a
personalized experience.
We are in the time of ME!
What to do ?
Is IoT New ?
• source: http://community.arm.com/groups/internet-of-things/blog/2014/06
Internet of Things
source : http://na1.www.gartner.com/imagesrv/newsroom/images/HC_ET_2014.jpg;wadf79d1c8397a49a2
IoT Ecosystem
WSO2 IoT Server M3 : https://goo.gl/nhbxnG
http://wso2.com/iot
Concepts of IoT Analytics
● Type of Data
● Distributed Nature
● Event-Drivenness
● Possible Type of Analytics
● Scalability
● Edge Analytics
● Uncertainty
Data Types of Things
● Time based data
○ Continuous monitoring & reporting
○ Time series processing (e.g. Energy
consumption over time)
○ Specialised DBs - OpenTSDB
● Location based data
○ Things are all over the place & they move
○ Tracked via GPS / iBeacons
○ Geospatial processing (e.g. traffic planning, better route suggestions for vehicles)
○ Geospatial optimised processing engines - GeoTrellis
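A basic building block for geospatial processing of GPS tracks is the distance between two fixes. The sketch below uses the standard haversine formula; the function names and the mean Earth radius constant are illustrative assumptions, not part of the original deck.

```python
import math

EARTH_RADIUS_KM = 6371.0  # mean Earth radius; an approximation


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in km between two (latitude, longitude) points in degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    d_phi = phi2 - phi1
    d_lambda = math.radians(lon2 - lon1)
    a = (math.sin(d_phi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(d_lambda / 2) ** 2)
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))


def path_length_km(track):
    """Total length of a track given as a list of (lat, lon) fixes."""
    return sum(haversine_km(*p, *q) for p, q in zip(track, track[1:]))


one_degree = haversine_km(0.0, 0.0, 0.0, 1.0)
two_degrees = path_length_km([(0.0, 0.0), (0.0, 1.0), (0.0, 2.0)])
```

A dedicated geospatial engine would add indexing and map matching on top of this kind of primitive.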
IoT is Distributed
● Constant changes
○ Components are added and removed
○ Data flows are modified or repurposed
● Data collection needs to support
○ Weak 3G networks to ad-hoc peer-to-peer networks
○ Message Queuing Telemetry Transport (MQTT)
○ Constrained Application Protocol (CoAP)
○ ZigBee or Bluetooth low energy (BLE)
● Dynamic scaling
○ Hybrid cloud
IoT Analytics are Event-Driven
● Sensors report data as Event Streams
● Analysis on flowing (or perishable) data
● Realtime Analytics
○ Detect temporal and logical patterns
○ Identify KPIs and Thresholds
○ Send out alerts immediately
○ E.g. Alert when a temperature sensor hits a limit, notify on the car dashboard of low tire pressure
○ Systems : Apache Storm, Google Cloud Dataflow & WSO2 CEP
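The threshold alert in the example above can be sketched in a few lines. This is a minimal illustration in plain Python, not how Storm, Dataflow or WSO2 CEP implement it; the `Reading` type, the field names and the alert text are assumptions made for the example.

```python
from typing import Iterable, Iterator, NamedTuple


class Reading(NamedTuple):
    sensor_id: str
    timestamp: float  # seconds since some reference point
    value: float      # e.g. temperature in degrees Celsius


def threshold_alerts(events: Iterable[Reading], limit: float) -> Iterator[str]:
    """Yield an alert when a sensor goes from normal to over the limit.

    Alerting only on the transition keeps a sensor that stays hot from
    flooding the output with repeated alerts.
    """
    over_limit = {}
    for event in events:
        is_over = event.value > limit
        if is_over and not over_limit.get(event.sensor_id, False):
            yield f"ALERT {event.sensor_id}: {event.value} > {limit} at t={event.timestamp}"
        over_limit[event.sensor_id] = is_over


stream = [
    Reading("room-1", 0.0, 21.5),
    Reading("room-1", 1.0, 30.2),  # crosses the limit: alert
    Reading("room-1", 2.0, 31.0),  # still over: no new alert
    Reading("room-1", 3.0, 22.0),  # back to normal
    Reading("room-1", 4.0, 35.0),  # crosses again: alert
]
alerts = list(threshold_alerts(stream, limit=30.0))
```

Because the function is a generator, each alert is produced as soon as the triggering event arrives rather than after the whole stream has been read.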
History Repeats
● Present vs usual behavior
● Understand the history
● Batch Analytics
○ Perform periodic summarisation/analytics
○ E.g. Average temperature in a room last month, total
power usage of the factory last year
○ Systems : Apache Hadoop, Apache Spark + Storage
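The "average temperature in a room last month" example is a group-and-summarise job. A minimal single-machine sketch follows, assuming readings are stored as (room, ISO timestamp, temperature) rows; that row layout and the function name are hypothetical. Hadoop or Spark would run the same grouping across a cluster.

```python
from collections import defaultdict
from datetime import datetime
from statistics import mean


def monthly_average(readings):
    """Average temperature per (room, "YYYY-MM") from stored readings."""
    buckets = defaultdict(list)
    for room, iso_timestamp, temperature in readings:
        month = datetime.fromisoformat(iso_timestamp).strftime("%Y-%m")
        buckets[(room, month)].append(temperature)
    return {key: mean(values) for key, values in buckets.items()}


stored = [
    ("lab", "2016-02-01T09:00:00", 20.0),
    ("lab", "2016-02-15T09:00:00", 22.0),
    ("lab", "2016-03-01T09:00:00", 25.0),
]
summary = monthly_average(stored)
```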
Deep Investigations
● Ad-Hoc Queries
● Interactive Analytics
○ Provides searchability
○ E.g. Identify fraud rings from simple fraud alerts
○ Systems : Apache Drill, indexed storage systems such as Couchbase, Apache Lucene
Thinking Ahead
● When you don’t know the equations
● Focusing conditions & preventing issues
● Predictive Analytics
○ Incremental Learning
○ E.g. Proactive maintenance, fraud detection and health
warnings
○ Systems : Apache Mahout, Apache Spark MLlib,
Microsoft Azure Machine Learning, WSO2 ML, Skytree
Technology we’ve chosen
WSO2 Data Analytics Server: Realtime, Batch, Interactive, Predictive
Plenty of Data
Scalable Data Processing
source : http://www.websitemagazine.com/content/blogs/posts/archive/2014/09/25/customer-service-in-2039.aspx
Scalable Realtime Deployment
More info : https://docs.wso2.com/display/CEP410/Creating+a+Storm+Based+Distributed+Execution+Plan
Scalable Deployment
Realtime, Batch, Interactive & Predictive
Is Every Event Significant?
● Publishing all events is not good!
○ Hardware may not be scalable
○ Network getting flooded
● What we usually need
○ Aggregation over time
○ Trends that exceed thresholds
○ Event matching a rare condition
● Results in
○ Local optimisation
○ Quick detection of issues
○ Instant notification
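The idea of aggregating on the device and publishing only what matters can be sketched as follows. This is an illustrative plain-Python version, not Siddhi; the window size, the threshold and the function name are assumptions, and readings are assumed to arrive in timestamp order.

```python
from typing import Iterable, Iterator, List, Tuple


def edge_summaries(
    readings: Iterable[Tuple[float, float]],
    window_seconds: float,
    threshold: float,
) -> Iterator[Tuple[float, float]]:
    """Aggregate (timestamp, value) readings into tumbling windows on the device.

    Yields (window_start, average) only for windows whose average exceeds
    the threshold; everything else is dropped at the edge.
    """
    window_start = None
    values: List[float] = []
    for timestamp, value in readings:
        start = (timestamp // window_seconds) * window_seconds
        if window_start is not None and start != window_start:
            average = sum(values) / len(values)
            if average > threshold:
                yield window_start, average
            values = []
        window_start = start
        values.append(value)
    if values:  # flush the final, possibly partial, window
        average = sum(values) / len(values)
        if average > threshold:
            yield window_start, average


raw = [(0, 20.0), (20, 22.0), (40, 21.0),    # window starting at 0: average 21.0
       (60, 30.0), (80, 34.0), (100, 32.0),  # window starting at 60: average 32.0
       (120, 19.0)]                          # window starting at 120: average 19.0
published = list(edge_summaries(raw, window_seconds=60, threshold=25.0))
```

Here seven raw readings shrink to a single published summary, which is the kind of reduction that keeps a constrained network from being flooded.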
Edge Analytics
Analytics on the Edge with WSO2 Siddhi
Push
Outliers ...
● E.g. Anomaly detection, Fraud Analytics
● Alerts for known and unknown frauds and Deep Search Analytics
https://goo.gl/TWV5C1
Outliers
● We used: Linear Regression, Markov Models & Credit Scoring
Uncertainty in Data of Things
Data can
● Be duplicated
● Arrive out of order
● Not arrive at all
● Contain wrong readings
Duplicate & Out-of-Order Events …
● Due to redundant sensors & network latency
● Make temporal data processing difficult
○ Time Windows
○ Temporal ordering
● E.g. Fraud detection
-- Flag a card when a purchase under 10 is followed by one over 10000 on the same card within a day
define stream Purchase (price double, cardNo long, place string);

from every (a1 = Purchase[price < 10]) ->
     a2 = Purchase[price > 10000 and a1.cardNo == a2.cardNo]
     within 1 day
select a1.cardNo as cardNo, a2.price as price, a2.place as place
insert into PotentialFraud;
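One common way to keep duplicates away from a temporal query like this is to filter on an event ID before the query sees the stream. The sketch below assumes each event carries a unique ID, which the slide does not state, and remembers only a bounded number of recent IDs; the function name and the default limit are illustrative.

```python
from collections import OrderedDict
from typing import Hashable, Iterable, Iterator, Tuple


def deduplicate(
    events: Iterable[Tuple[Hashable, dict]],
    max_remembered: int = 10_000,
) -> Iterator[Tuple[Hashable, dict]]:
    """Drop events whose ID is among the last max_remembered distinct IDs seen."""
    seen = OrderedDict()
    for event_id, payload in events:
        if event_id in seen:
            continue  # duplicate, e.g. from a redundant sensor or a retry
        seen[event_id] = None
        if len(seen) > max_remembered:
            seen.popitem(last=False)  # forget the oldest ID to bound memory
        yield event_id, payload


incoming = [
    ("e1", {"price": 5.0}),
    ("e2", {"price": 12000.0}),
    ("e1", {"price": 5.0}),   # redelivered copy of e1
    ("e3", {"price": 7.5}),
]
unique_ids = [event_id for event_id, _ in deduplicate(incoming)]
```

The bounded memory is a trade-off: a duplicate that arrives after its ID has been forgotten will pass through.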
Events Arriving Out of Order
E.g. Realtime Soccer Analytics (DEBS 2013) https://goo.gl/c2gPrQ
● Identify ball kicks, ball possession, shot on goal & offside
● Solutions : K-Slack Based Algorithms
https://www2.informatik.uni-erlangen.de/publication/download/IPDPS2013.pdf
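The core of a K-slack approach is a buffer that holds each event until the newest timestamp seen is at least K ahead of it, then releases events in timestamp order. The following is a simplified fixed-K sketch, not the adaptive algorithm from the linked paper; the function name and the policy of dropping very late events are assumptions.

```python
import heapq
from typing import Iterable, Iterator, List, Tuple


def k_slack_reorder(
    events: Iterable[Tuple[float, str]],
    k: float,
) -> Iterator[Tuple[float, str]]:
    """Re-emit (timestamp, payload) events in timestamp order using a fixed slack k.

    Events older than one already released are dropped here; a real system
    might route them to a side output instead.
    """
    buffer: List[Tuple[float, str]] = []
    newest = float("-inf")
    last_emitted = float("-inf")
    for timestamp, payload in events:
        if timestamp < last_emitted:
            continue  # too late: a newer event has already been released
        heapq.heappush(buffer, (timestamp, payload))
        newest = max(newest, timestamp)
        while buffer and buffer[0][0] <= newest - k:
            released = heapq.heappop(buffer)
            last_emitted = released[0]
            yield released
    while buffer:  # end of stream: flush what is left, still in order
        yield heapq.heappop(buffer)


arrivals = [(1, "a"), (3, "c"), (2, "b"), (6, "f"), (4, "d"), (5, "e"), (9, "i")]
ordered = list(k_slack_reorder(arrivals, k=2))
```

A larger K tolerates more disorder at the cost of added latency, since every event waits until the stream has advanced K time units past it.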
Missing Data
● Due to network outages
● E.g. Smart Meters (DEBS 2014)
○ Smart home electricity data: 2000 sensors, 40 houses, 4 billion events in four months
○ Processed 400K events/sec
● Solutions:
○ Approximate using complementary sensor readings
■ Electricity Monitoring
● Frequent Load readings
● Occasional Work readings
○ Fault-tolerant data streams : Google MillWheel
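As a rough illustration of approximating from a complementary sensor: if Load is instantaneous power in watts and Work is a cumulative energy counter in kWh (an assumption about units, not stated on the slide), two Work readings bracket the average Load over any gap in between. The function names below are hypothetical.

```python
def average_load_watts(work_start_kwh, work_end_kwh, start_s, end_s):
    """Average load over [start_s, end_s] implied by two cumulative work readings."""
    if end_s <= start_s:
        raise ValueError("end_s must be after start_s")
    energy_wh = (work_end_kwh - work_start_kwh) * 1000.0
    hours = (end_s - start_s) / 3600.0
    return energy_wh / hours


def fill_missing_loads(load_readings, expected_timestamps, fallback_watts):
    """Return one (timestamp, load) pair per expected timestamp, using the fallback for gaps."""
    known = dict(load_readings)
    return [(t, known.get(t, fallback_watts)) for t in expected_timestamps]


# Work counter went from 12.0 kWh to 12.5 kWh over 30 minutes.
estimate = average_load_watts(12.0, 12.5, start_s=0, end_s=1800)
# Load readings at t=600 and t=1200 were lost in a network outage.
filled = fill_missing_loads([(0, 980.0), (1800, 1010.0)], [0, 600, 1200, 1800], estimate)
```

The estimate is only an average over the gap, so short spikes inside the outage are not recovered.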
Wrong Sensor Readings
● From GPS
● E.g. TfL Traffic Analysis
○ Using Transport for London open data feeds
○ http://goo.gl/04tX6k, http://goo.gl/9xNiCm
○ Scales to 500,000 events/sec and more
● From iBeacons at shops, ships and airports
● Solution: Kalman Filter
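A scalar Kalman filter shows how noisy readings can be smoothed. This is a minimal sketch assuming the true value changes slowly (a random-walk model); the variance parameters are illustrative and would need tuning for a real sensor, and GPS filtering in practice usually tracks position and velocity in more than one dimension.

```python
from typing import Iterable, List


def kalman_1d(
    measurements: Iterable[float],
    process_variance: float,
    measurement_variance: float,
    initial_estimate: float = 0.0,
    initial_variance: float = 1.0,
) -> List[float]:
    """Scalar Kalman filter for a value modelled as a slow random walk."""
    estimate = initial_estimate
    variance = initial_variance
    smoothed = []
    for z in measurements:
        # Predict: the state is assumed constant, so only uncertainty grows.
        variance += process_variance
        # Update: blend prediction and measurement by their relative certainty.
        gain = variance / (variance + measurement_variance)
        estimate += gain * (z - estimate)
        variance *= (1.0 - gain)
        smoothed.append(estimate)
    return smoothed


readings = [10.0, 10.2, 9.9, 25.0, 10.1, 10.0]  # 25.0 is a wrong reading
smoothed = kalman_1d(readings, process_variance=0.01,
                     measurement_variance=4.0, initial_estimate=10.0)
```

With a high measurement variance the filter trusts its own prediction more, so the single wrong reading moves the estimate only part of the way towards 25.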
Visualisation
● Per-device & Summarization View
● Ability to group by categories
● Solutions: Composable Dashboard with sampling &
indexing
Communicate with Mobile & 3rd Party Apps
● Expose analytics results as APIs
○ Mobile apps, third parties
● Provides
○ Security, Billing, Throttling, Quotas & SLA
● Solution
○ Write data to database
○ Expose them via secured APIs (E.g. WSO2 API Manager)
Reference Architecture for IoT Analytics
IoT Analytics
http://wso2.com/analytics
http://wso2.com/iot
Thank You
Any Questions ?
Contact us !
