EXPLORING THE HUBNESS-RELATED PROPERTIES OF OCEANOGRAPHIC SENSOR DATA
Nenad Tomašev, Dunja Mladenić
PRESENTATION OUTLINE


Hubness and why it matters

 Oceanographic data: overview

 Bad hubs in the measurements

Visualizing the problematic sensors
WHY IT MATTERS

 Hubness is the skewness (asymmetry) in the distribution of
  k-occurrences: some points (hubs) appear in the k-nearest-neighbor
  lists of other points very often

 This often happens in high dimensional data

 It is, however, a phenomenon only of importance for nearest-
  neighbor methods

 So, why should we care, in general?
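
The definition above can be made concrete with a minimal sketch. It counts how often each point appears in the k-nearest-neighbor lists of the other points and measures the asymmetry of those counts. The function names, the brute-force neighbor search, and the use of the standardized third moment as the skewness measure are illustrative assumptions, not taken from the slides.

```python
from statistics import mean, pstdev


def manhattan(a, b):
    """Sum of absolute coordinate differences between two equal-length vectors."""
    return sum(abs(x - y) for x, y in zip(a, b))


def k_occurrences(points, k, dist=manhattan):
    """Return N_k for every point: how often it occurs in other points' k-NN lists."""
    n = len(points)
    counts = [0] * n
    for i in range(n):
        # Brute-force search: rank all other points by distance to point i.
        others = sorted((j for j in range(n) if j != i),
                        key=lambda j: dist(points[i], points[j]))
        for j in others[:k]:
            counts[j] += 1
    return counts


def skewness(values):
    """Standardized third moment; a positive value indicates a long right tail (hubs)."""
    mu, sigma = mean(values), pstdev(values)
    if sigma == 0:
        return 0.0
    return mean([((v - mu) / sigma) ** 3 for v in values])
```

The counts always sum to n times k, so a strongly positive skewness means that a few points absorb most of the occurrences while many points are rarely or never neighbors.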
WHY IT MATTERS

 Sensor data = streams, time series

 The state of the art for time series data: 1-NN classifier
  coupled with an appropriate metric for comparing the time
  series

 In other words: nearest neighbor methods are not only
  occasionally used for time series classification, they are
  considered the state of the art!

 So, hubness matters.
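
As a rough illustration of the 1-NN approach mentioned above, the sketch below labels a query series with the label of its closest training series. The Manhattan distance is used here only because it appears later in the experimental setup; the function names and the toy data in the usage note are hypothetical.

```python
def manhattan(a, b):
    """Sum of absolute pointwise differences between two equal-length series."""
    return sum(abs(x - y) for x, y in zip(a, b))


def one_nn_predict(train_series, train_labels, query, dist=manhattan):
    """Return the label of the training series closest to the query."""
    best = min(range(len(train_series)),
               key=lambda i: dist(train_series[i], query))
    return train_labels[best]
```

For example, with training series `[[0, 0, 0], [5, 5, 5]]` labeled `["Pacific", "Atlantic"]`, the query `[4, 5, 6]` is assigned `"Atlantic"`, since its distance to the second series is 2 and to the first is 15.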
RELATED WORK

 Radovanović, Nanopoulos, Ivanović: Time-Series Classification
  in Many Intrinsic Dimensions, SDM 2010

 Due to the correlation between subsequent values, not all
  time series are inherently very high dimensional

 Some, however, are. These time series have been shown to
  exhibit hubness, including bad hubness.

 It was shown that in such cases, bad-hubness-based weighting
  is helpful (the hw-kNN algorithm)
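
A minimal sketch of bad-hubness-based vote weighting follows. It assumes that a "bad" occurrence is one where a point appears in the k-NN list of a point with a different label, and that each vote is weighted by exp(-h_b), where h_b is the standardized bad-occurrence count. These details and all function names are assumptions made for illustration; the slides do not spell out the hw-kNN formula.

```python
import math
from statistics import mean, pstdev


def manhattan(a, b):
    return sum(abs(x - y) for x, y in zip(a, b))


def bad_occurrences(points, labels, k, dist=manhattan):
    """Count, per point, its occurrences in k-NN lists of differently labeled points."""
    n = len(points)
    bad = [0] * n
    for i in range(n):
        others = sorted((j for j in range(n) if j != i),
                        key=lambda j: dist(points[i], points[j]))
        for j in others[:k]:
            if labels[j] != labels[i]:
                bad[j] += 1
    return bad


def hubness_weights(bad):
    """Assumed weighting: exp(-h_b), with h_b the standardized bad-occurrence count."""
    mu, sigma = mean(bad), pstdev(bad)
    if sigma == 0:
        return [1.0] * len(bad)
    return [math.exp(-(b - mu) / sigma) for b in bad]


def weighted_knn_predict(points, labels, weights, query, k, dist=manhattan):
    """k-NN vote in which each neighbor's vote counts with its hubness weight."""
    nearest = sorted(range(len(points)),
                     key=lambda i: dist(points[i], query))[:k]
    votes = {}
    for i in nearest:
        votes[labels[i]] = votes.get(labels[i], 0.0) + weights[i]
    return max(votes, key=votes.get)
```

Points that frequently appear as neighbors of differently labeled points receive weights below those of points that never do, so their influence on the vote shrinks without removing them from the data.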
ANALYSIS GOALS



 Explore the k-nearest neighbor structure of the oceanographic
  sensor data

 Explore the bad hubness in the data

 Visualize the results
TEST CASE: OCEANOGRAPHIC DATA

 Integrated Ocean Observing System data
  (http://www.ioos.gov/)

 Nodes spread across the Pacific, the Atlantic and the Great Lakes

 Several sensors at each node, measuring various quantities

 air temperature, barometric pressure, wind, water level
  observation, water level prediction, salinity, water
  temperature and conductivity
TEST CASE: OCEANOGRAPHIC DATA

 20 days' worth of measurements

 10.11.2010 to 30.11.2010

 Sampled every 6 minutes (10 measurements an hour)

 4801 measurements total for each sensor

 Missing values: replaced by the average of the closest known
  values
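
The preprocessing on this slide can be sketched as follows. The sketch reads "the average of the closest known values" as the mean of the nearest known value on each side of a gap, falling back to the single available side at the ends of the series. That reading is an assumption, as is representing missing values by `None`. The sample count treats the 20-day window as inclusive of both endpoints, which is one way to arrive at 4801.

```python
def expected_samples(days=20, minutes_between_samples=6):
    """Samples in a window sampled at a fixed rate, counting both endpoints."""
    per_hour = 60 // minutes_between_samples
    return days * 24 * per_hour + 1


def fill_missing(series):
    """Replace None entries with the mean of the nearest known values on each side."""
    if all(v is None for v in series):
        raise ValueError("series has no known values")
    filled = list(series)
    for i, value in enumerate(series):
        if value is not None:
            continue
        # Nearest known value to the left and to the right, using original values only.
        left = next((series[j] for j in range(i - 1, -1, -1)
                     if series[j] is not None), None)
        right = next((series[j] for j in range(i + 1, len(series))
                      if series[j] is not None), None)
        neighbours = [v for v in (left, right) if v is not None]
        filled[i] = sum(neighbours) / len(neighbours)
    return filled
```

With this reading, a run of consecutive missing values is filled with one constant (the mean of the two values that bracket the run) rather than with a linear ramp.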
THE EXPERIMENTAL SETUP



 Tested under two different metrics
   Manhattan, Variance of between-series differences
   Future work: perform the experiments with DTW (Dynamic Time
    Warping)


 Defined “Pacific”, “Atlantic” and “Lakes” as location-based
  labels = 3 categories
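
The two metrics named in this setup can be sketched as below. The slides do not define "variance of between-series differences" precisely; the sketch assumes it is the population variance of the pointwise differences between two aligned series of equal length.

```python
from statistics import pvariance


def manhattan(a, b):
    """Sum of absolute pointwise differences between two aligned series."""
    return sum(abs(x - y) for x, y in zip(a, b))


def difference_variance(a, b):
    """Population variance of the pointwise differences (assumed definition)."""
    return pvariance([x - y for x, y in zip(a, b)])
```

Under this definition the second measure ignores a constant offset between two series: `[1, 2, 3]` and `[2, 3, 4]` have Manhattan distance 3 but difference variance 0. That property could matter for sensors that track the same pattern at different baseline levels.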
SKEWNESS, BAD HUBNESS
CLASS-TO-CLASS HUBNESS MATRIX, K=3, WIND MEASUREMENTS

                  1 (Atlantic)   2 (Pacific)   3 (Lakes)
   1 (Atlantic)      0.772          0.186        0.042
   2 (Pacific)       0.013          0.987        0.000
   3 (Lakes)         0.027          0.014        0.959

    Atlantic = 1. Pacific = 2. Lakes = 3
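
A matrix of this kind could be computed along the following lines. The slide does not state which axis corresponds to the neighbor and which to the query point, so the convention here is an assumption: the row is the class of the point that occurs as a neighbor, the column is the class of the point in whose k-NN list it occurs, and each row is normalized to sum to 1 (as the rows on the slide do).

```python
def manhattan(a, b):
    return sum(abs(x - y) for x, y in zip(a, b))


def class_hubness_matrix(points, labels, classes, k, dist=manhattan):
    """Row-normalized counts of k-occurrences, grouped by neighbor class and query class."""
    index = {c: pos for pos, c in enumerate(classes)}
    counts = [[0] * len(classes) for _ in classes]
    n = len(points)
    for i in range(n):
        others = sorted((j for j in range(n) if j != i),
                        key=lambda j: dist(points[i], points[j]))
        for j in others[:k]:
            # Point j occurs as a neighbor of point i.
            counts[index[labels[j]]][index[labels[i]]] += 1
    matrix = []
    for row in counts:
        total = sum(row)
        matrix.append([c / total if total else 0.0 for c in row])
    return matrix
```

Off-diagonal mass in a row then indicates how often points of that class act as neighbors of points from another class, which is the "bad" part of their hubness.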
WOULD THE HUBNESS-AWARE METHODS HELP?
WIND MEASUREMENTS: SENSOR HUBNESS MAP (2 slides)
WATER TEMPERATURE: SENSOR HUBNESS MAP (2 slides)
BAROMETRIC PRESSURE: SENSOR HUBNESS MAP
AIR TEMPERATURE: THE BERMUDA TRIANGLE
CONCLUSIONS:

 Bad hubness may be useful to detect potentially erroneous
  measurement devices

 Some types of measurement streams apparently do exhibit
  hubness, so hubness is a phenomenon of interest when dealing
  with sensor data

 Hubness-aware methods could potentially be helpful when
  working with sensor data
ACKNOWLEDGEMENTS

This work was supported by the ICT
 Programme of the EC PlanetData (ICTNoE-
 257641).
THANK YOU FOR YOUR ATTENTION

The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...apidays
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 

Dernier (20)

Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 

Exploring The Hubness-Related Properties of Oceanographic Sensor Data

  • 1. EXPLORING THE HUBNESS-RELATED PROPERTIES OF OCEANOGRAPHIC SENSOR DATA. Nenad Tomašev, Dunja Mladenić
  • 2. PRESENTATION OUTLINE. Hubness and why it matters; Oceanographic data: overview; Bad hubs in the measurements; Visualizing the problematic sensors
  • 3. WHY IT MATTERS. Hubness is the skewness (asymmetry) in the distribution of k-occurrences: some points (hubs) become neighbors very, very often. This often happens in high-dimensional data. It is, however, a phenomenon of importance only for nearest-neighbor methods. So, why should we care, in general?
  • 4. WHY IT MATTERS. Sensor data = streams, time series. The state of the art for time series data: a 1-NN classifier coupled with an appropriate metric for comparing the time series. In other words: nearest-neighbor methods are not only occasionally used for time series classification, they are considered the state of the art! So, hubness matters.
  • 5. RELATED WORK. Radovanovic, Nanopoulos, Ivanovic: Time series classification in many intrinsic dimensions, SDM 2010. Due to the correlation between subsequent values, not all time series are inherently very high-dimensional. Some, however, are. These time series have been shown to exhibit hubness, including bad hubness. It was shown that in such cases, bad-hubness-based weighting is helpful (the hw-kNN algorithm).
  • 6. ANALYSIS GOALS. Explore the k-nearest neighbor structure of the oceanographic sensor data. Explore the bad hubness in the data. Visualize the results.
  • 7. TEST CASE: OCEANOGRAPHIC DATA. Integrated Ocean Observing System data (http://www.ioos.gov/). Nodes spread across the Pacific, the Atlantic and the Great Lakes. Several sensors at each node, measuring various quantities: air temperature, barometric pressure, wind, water level observation, water level prediction, salinity, water temperature and conductivity.
  • 8. TEST CASE: OCEANOGRAPHIC DATA. 20 days' worth of measurements: 10.11.2010 to 30.11.2010. Sampled every 6 minutes (10 measurements an hour). 4801 measurements total for each sensor. Missing values: replaced by the average of the closest known values.
  • 9. THE EXPERIMENTAL SETUP. Tested under two different metrics: Manhattan distance and the variance of between-series differences. Future work: perform the experiments with DTW (Dynamic Time Warping). Defined "Pacific", "Atlantic" and "Lakes" as location-based labels = 3 categories.
  • 11. CLASS TO CLASS HUBNESS MATRIX, K=3, WIND MEASUREMENTS (Atlantic = 1, Pacific = 2, Lakes = 3)
               1      2      3
        1    0.772  0.186  0.042
        2    0.013  0.987  0.000
        3    0.027  0.014  0.959
  • 12. WOULD THE HUBNESS-AWARE METHODS HELP?
  • 18. AIR TEMPERATURE: THE BERMUDA TRIANGLE
  • 19. CONCLUSIONS: Bad hubness may be useful for detecting potentially erroneous measurement devices. Some types of measurement streams apparently do exhibit hubness, so hubness is a phenomenon of interest when dealing with sensor data. Hubness-aware methods could potentially be helpful when working with sensor data.
  • 20. ACKNOWLEDGEMENTS. This work was supported by the ICT Programme of the EC under PlanetData (ICT-NoE-257641).
  • 21. THANK YOU FOR YOUR ATTENTION
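
The quantities named in the slides (k-occurrences and their skewness, bad hubness, and the class-to-class hubness matrix) can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the authors' implementation: every function name is hypothetical, the toy series are invented and are not IOOS measurements, and the matrix orientation (rows = class of the query series, columns = class of its neighbor, each row normalized to sum to 1) is an assumption, chosen only because the rows of the matrix on slide 11 sum to 1.

```python
def manhattan(a, b):
    """Manhattan (L1) distance between two equal-length series."""
    return sum(abs(x - y) for x, y in zip(a, b))


def knn_lists(series, k, dist=manhattan):
    """For each series, indices of its k nearest other series.
    Ties are broken by lowest index (sorted() is stable)."""
    n = len(series)
    return [
        sorted((j for j in range(n) if j != i),
               key=lambda j: dist(series[i], series[j]))[:k]
        for i in range(n)
    ]


def k_occurrences(neighbors):
    """N_k(x): how often each series appears in the k-NN lists of others."""
    counts = [0] * len(neighbors)
    for lst in neighbors:
        for j in lst:
            counts[j] += 1
    return counts


def bad_k_occurrences(neighbors, labels):
    """Occurrences as a neighbor of a series that carries a different label."""
    bad = [0] * len(neighbors)
    for i, lst in enumerate(neighbors):
        for j in lst:
            if labels[j] != labels[i]:
                bad[j] += 1
    return bad


def skewness(counts):
    """Third standardized moment; large positive values indicate hubness."""
    n = len(counts)
    mean = sum(counts) / n
    var = sum((c - mean) ** 2 for c in counts) / n
    if var == 0:
        return 0.0
    return (sum((c - mean) ** 3 for c in counts) / n) / var ** 1.5


def class_to_class_hubness(neighbors, labels, classes):
    """Assumed orientation: row = query class, column = neighbor class,
    each non-empty row normalized to sum to 1."""
    index = {c: r for r, c in enumerate(classes)}
    m = [[0.0] * len(classes) for _ in classes]
    for i, lst in enumerate(neighbors):
        for j in lst:
            m[index[labels[i]]][index[labels[j]]] += 1
    for row in m:
        total = sum(row)
        if total:
            row[:] = [v / total for v in row]
    return m


# Toy data (invented for illustration): six one-value "series", two labels.
toy_series = [[0.0], [1.0], [3.0], [10.0], [11.0], [13.0]]
toy_labels = ["Atlantic", "Atlantic", "Pacific", "Pacific", "Pacific", "Pacific"]
toy_neighbors = knn_lists(toy_series, k=1)
toy_counts = k_occurrences(toy_neighbors)
toy_bad = bad_k_occurrences(toy_neighbors, toy_labels)
toy_matrix = class_to_class_hubness(toy_neighbors, toy_labels,
                                    ["Atlantic", "Pacific"])
```

In this toy set, the third series is labeled "Pacific" but sits closest to an "Atlantic" series, so that Atlantic series picks up one bad occurrence. On real sensor data, a series with a high bad-occurrence count would be a candidate for the kind of inspection the conclusions describe.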