SlideShare une entreprise Scribd logo
1  sur  20
NoizCrowd:
A Crowd-Based Data Gathering and
Management System for Noise Level Data
Mariusz Wisniewski, Gianluca
Demartini, Apostolos Malatras, and
Philippe Cudré-Mauroux
University of Fribourg, Switzerland
Motivation - Big Data
• Large dataset are necessary to enable analytics
and support decision making
– Meteorological station / car traffic
• Set up a large-scale sensing infrastructure is
costly and time-consuming
• Create a large amount of valuable data
– Crowdsourcing
– Data generation models
– Smartphones as sensors
– Big Data analytics
Gianluca Demartini 2
NoizCrowd
• A crowd-sensing approach to big data generation
using commodity sensors
• Crowd-source noise level in a geo region
• Noise propagation models to generate data
• Array data management techniques to scale
• Results accessible via a visual interface
• Support decisions (e.g., where to live)
Gianluca Demartini 3
Outline
• Related approaches
• NoizCrowd Architecture Overview
– Data Gathering
– Storage
– Modeling
– Export and Visualization
• Data Models
• Performance Evaluation
Gianluca Demartini 4
Related Work
• Participatory Sensing vs Sensor Networks
– Low cost / High cost
– Mobile phones / Sensors
– Distributed / Centralized management
– Privacy, data quality
• Applications: Environment, vehicle routing
Gianluca Demartini 5
Related Work
• Noise Mapping Apps
– NoiseTube: opensource, widespread usage
– NoiseMap: control over data
– SoundSense: machine learning to classify sounds
• NoizCrowd
– Data in RDF linkable to other datasets
(linkeddata.org)
– Scalable storage: generate data by interpolation
Gianluca Demartini 6
NoizCrowd Architecture
Gianluca Demartini 7
Data Gathering
• By means of Crowd-sourcing
– GPS: location
– Microphone: noise level
– Internet connection: send data to server
• Microphone Calibration
– Sound level meter
– Sharing conversion table for smartphone models
Gianluca Demartini 8
Data Storage
• App sends median and peak dB values over
few seconds
• Spatio-temporal data: non-relational storage
system (SciDB)
– Durable storage
– Retrieve data to build models
– Export data for visualization
• Multi-dimensional array (space and time)
• Distributed storage
Gianluca Demartini 9
Noise Modeling
• Data from crowd is noisy and skewed/sparse
• Raw data is not shown to the end users
• Models to deal with
– Overlapping data
– Missing data
Gianluca Demartini 10
Data Export and Visualization
• From SciDB data is
– converted to RDF
– stored in dipLODocus[RDF]
– Available via SPARQL
• Visualization
– Overlay noise level on a map
– Additional chart for time evolution
Gianluca Demartini 11
Gianluca Demartini 12
Data Models
• Spatial Interpolation
– In the same time interval, data from different
locations
– Need to be computational simple (large volume)
– Bi-dimensional range queries in space (SciDB)
– K-nearest neighbor interpolation
– Computed in parallel
Gianluca Demartini
Data Models
• Temporal interpolation
– Short ranges (minutes) like spatial interp. in 3D
– Long ranges, look for patterns and infer
• E.g., every Monday at 11am we have 50dB and we miss
a Monday measurement
• E.g., same measurement (50dB) in same area 2h ago
and now
Gianluca Demartini 14
Noise Propagation Models
• We adopt an existing model that takes into
account:
– Sound power
– Distance from source
– Directivity
– Atmospheric absorption
– Excess attenuation (we use meteo conditions)
• Difficult to measure with smartphone
• Constant in a given region (and use GPS info)
Gianluca Demartini 15
Materialization of Models
• Data from models
– Is computationally expensive to generate
– May be a lot since we can cover any region
• We do late materialization
– At query time
– Only for the specific request
– Cached and indexed for future requests
– Incremental updates of views, if possible
Gianluca Demartini 16
Performance Evaluation (1)
• 30 outdoor deployments
– 2,3,4 smartphones
– Multiple noise sources
– Urban setting, flat area of 50x50 meters
• Professional-grade noise level meter as gold
standard measurement
• 85% of interpolated data +-6dB error
• 63% of interpolated data +-4dB error
Gianluca Demartini 17
Performance Evaluation (2)
• Sound propagation and source location
• 3 smartphones, 100dB source
Gianluca Demartini 18
Performance Evaluation (3)
• Sound level of source error
– 16% with 3 measurements
– 10% with 4 measurements
– 9% with 5 measurements
• Source location
– 3m error on average
Gianluca Demartini 19
NoizCrowd - Conclusions
• Large scale data is key for decision making
• Crowd-source noise level data using mobiles
– Scale-out using an array backend
– Generate missing data and visualize
• Next steps
– Android app
– Data recording as background feature
– Additional materialization strategies
http://exascale.info
Gianluca Demartini 20

Contenu connexe

Tendances

Wi-Fi based indoor positioning
Wi-Fi based indoor positioningWi-Fi based indoor positioning
Wi-Fi based indoor positioningSherwin Rodrigues
 
SPAR 2015 - Civil Maps Presentation by Sravan Puttagunta
SPAR 2015 - Civil Maps Presentation by Sravan PuttaguntaSPAR 2015 - Civil Maps Presentation by Sravan Puttagunta
SPAR 2015 - Civil Maps Presentation by Sravan PuttaguntaSravan Puttagunta
 
Sensor-driven indoor localization with android #bcs2
Sensor-driven indoor localization with android #bcs2Sensor-driven indoor localization with android #bcs2
Sensor-driven indoor localization with android #bcs2Stephan Linzner
 
Indoor Positioning Systems
Indoor Positioning SystemsIndoor Positioning Systems
Indoor Positioning SystemsProjectENhANCE
 
Summary of the paper「PrivacyMic: Utilizing Inaudible Frequencies for Privacy ...
Summary of the paper「PrivacyMic: Utilizing Inaudible Frequencies for Privacy ...Summary of the paper「PrivacyMic: Utilizing Inaudible Frequencies for Privacy ...
Summary of the paper「PrivacyMic: Utilizing Inaudible Frequencies for Privacy ...YutaFunada
 
Precision (Indoor) Real Time Location Systems
Precision (Indoor) Real Time Location SystemsPrecision (Indoor) Real Time Location Systems
Precision (Indoor) Real Time Location SystemsPeter Batty
 
Lecture 6 geolocation
Lecture 6 geolocationLecture 6 geolocation
Lecture 6 geolocationmoduledesign
 
GET2016 - UrbanSense Platform. Porto Living Lab
GET2016 - UrbanSense Platform. Porto Living LabGET2016 - UrbanSense Platform. Porto Living Lab
GET2016 - UrbanSense Platform. Porto Living LabCecilia Rocha
 
IoT Applications based on LoRaWan
IoT Applications based on LoRaWanIoT Applications based on LoRaWan
IoT Applications based on LoRaWanDaniel Koller
 

Tendances (13)

Wi-Fi based indoor positioning
Wi-Fi based indoor positioningWi-Fi based indoor positioning
Wi-Fi based indoor positioning
 
3D remote sensing of mines
3D remote sensing of mines 3D remote sensing of mines
3D remote sensing of mines
 
SPAR 2015 - Civil Maps Presentation by Sravan Puttagunta
SPAR 2015 - Civil Maps Presentation by Sravan PuttaguntaSPAR 2015 - Civil Maps Presentation by Sravan Puttagunta
SPAR 2015 - Civil Maps Presentation by Sravan Puttagunta
 
Sensor-driven indoor localization with android #bcs2
Sensor-driven indoor localization with android #bcs2Sensor-driven indoor localization with android #bcs2
Sensor-driven indoor localization with android #bcs2
 
Indoor Positioning Systems
Indoor Positioning SystemsIndoor Positioning Systems
Indoor Positioning Systems
 
Indoor Tracking System
Indoor Tracking SystemIndoor Tracking System
Indoor Tracking System
 
Geolocalisation
GeolocalisationGeolocalisation
Geolocalisation
 
Summary of the paper「PrivacyMic: Utilizing Inaudible Frequencies for Privacy ...
Summary of the paper「PrivacyMic: Utilizing Inaudible Frequencies for Privacy ...Summary of the paper「PrivacyMic: Utilizing Inaudible Frequencies for Privacy ...
Summary of the paper「PrivacyMic: Utilizing Inaudible Frequencies for Privacy ...
 
Precision (Indoor) Real Time Location Systems
Precision (Indoor) Real Time Location SystemsPrecision (Indoor) Real Time Location Systems
Precision (Indoor) Real Time Location Systems
 
Lecture 6 geolocation
Lecture 6 geolocationLecture 6 geolocation
Lecture 6 geolocation
 
Indoor navigation system
Indoor navigation systemIndoor navigation system
Indoor navigation system
 
GET2016 - UrbanSense Platform. Porto Living Lab
GET2016 - UrbanSense Platform. Porto Living LabGET2016 - UrbanSense Platform. Porto Living Lab
GET2016 - UrbanSense Platform. Porto Living Lab
 
IoT Applications based on LoRaWan
IoT Applications based on LoRaWanIoT Applications based on LoRaWan
IoT Applications based on LoRaWan
 

Similaire à NoizCrowd: A Crowd-Based Data Gathering and Management System for Noise Level Data

Automation of National Noise Model
Automation of National Noise Model Automation of National Noise Model
Automation of National Noise Model Safe Software
 
Arpan pal roboticsensing_sw2015
Arpan pal roboticsensing_sw2015Arpan pal roboticsensing_sw2015
Arpan pal roboticsensing_sw2015Arpan Pal
 
Advancements In Visualization Of Remotely Sensed 3D Data
Advancements In Visualization Of Remotely Sensed 3D DataAdvancements In Visualization Of Remotely Sensed 3D Data
Advancements In Visualization Of Remotely Sensed 3D DataMerrick & Company
 
20181128 satellogic @ barcelona ai
20181128 satellogic @ barcelona ai20181128 satellogic @ barcelona ai
20181128 satellogic @ barcelona aiAlbert Pujol Torras
 
(Lidar) Pan Australia Topo Mapping Q1 2018
(Lidar) Pan Australia Topo Mapping Q1 2018(Lidar) Pan Australia Topo Mapping Q1 2018
(Lidar) Pan Australia Topo Mapping Q1 2018Brett Johnson
 
Smart Urban Planning Support through Web Data Science on Open and Enterprise ...
Smart Urban Planning Support through Web Data Science on Open and Enterprise ...Smart Urban Planning Support through Web Data Science on Open and Enterprise ...
Smart Urban Planning Support through Web Data Science on Open and Enterprise ...Gloria Re Calegari
 
Urban senseoverview201507
Urban senseoverview201507Urban senseoverview201507
Urban senseoverview201507Ana Aguiar
 
Singapore 3D Map for a Smart Nation
Singapore 3D Map for a Smart NationSingapore 3D Map for a Smart Nation
Singapore 3D Map for a Smart NationMonica Moran
 
PathS: Enhancing Geographical Maps with Environmental Sensed Data
PathS: Enhancing Geographical Maps with Environmental Sensed DataPathS: Enhancing Geographical Maps with Environmental Sensed Data
PathS: Enhancing Geographical Maps with Environmental Sensed DataUniversity of Geneva
 
Will camera technology become an ITS sensor
Will camera technology become an ITS sensorWill camera technology become an ITS sensor
Will camera technology become an ITS sensorAllied Vision
 
Geo-Navigation, an Augmented Reality Perspective
Geo-Navigation, an Augmented Reality PerspectiveGeo-Navigation, an Augmented Reality Perspective
Geo-Navigation, an Augmented Reality PerspectiveAntonio Camara
 
Ambient Intelligence: Definitions and Application Areas
Ambient Intelligence: Definitions and Application AreasAmbient Intelligence: Definitions and Application Areas
Ambient Intelligence: Definitions and Application AreasFulvio Corno
 
understanding the planet using satellites and deep learning
understanding the planet using satellites and deep learningunderstanding the planet using satellites and deep learning
understanding the planet using satellites and deep learningAlbert Pujol Torras
 
Challenges on wireless Heterogeneous Networks for Mobile Cloud Computing in a...
Challenges on wireless Heterogeneous Networks for Mobile Cloud Computing in a...Challenges on wireless Heterogeneous Networks for Mobile Cloud Computing in a...
Challenges on wireless Heterogeneous Networks for Mobile Cloud Computing in a...Daniela Mazza
 
Crowd sourced intelligence built into search over hadoop
Crowd sourced intelligence built into search over hadoopCrowd sourced intelligence built into search over hadoop
Crowd sourced intelligence built into search over hadooplucenerevolution
 

Similaire à NoizCrowd: A Crowd-Based Data Gathering and Management System for Noise Level Data (20)

Automation of National Noise Model
Automation of National Noise Model Automation of National Noise Model
Automation of National Noise Model
 
Arpan pal roboticsensing_sw2015
Arpan pal roboticsensing_sw2015Arpan pal roboticsensing_sw2015
Arpan pal roboticsensing_sw2015
 
Advancements In Visualization Of Remotely Sensed 3D Data
Advancements In Visualization Of Remotely Sensed 3D DataAdvancements In Visualization Of Remotely Sensed 3D Data
Advancements In Visualization Of Remotely Sensed 3D Data
 
20181128 satellogic @ barcelona ai
20181128 satellogic @ barcelona ai20181128 satellogic @ barcelona ai
20181128 satellogic @ barcelona ai
 
(Lidar) Pan Australia Topo Mapping Q1 2018
(Lidar) Pan Australia Topo Mapping Q1 2018(Lidar) Pan Australia Topo Mapping Q1 2018
(Lidar) Pan Australia Topo Mapping Q1 2018
 
Smart Urban Planning Support through Web Data Science on Open and Enterprise ...
Smart Urban Planning Support through Web Data Science on Open and Enterprise ...Smart Urban Planning Support through Web Data Science on Open and Enterprise ...
Smart Urban Planning Support through Web Data Science on Open and Enterprise ...
 
Urban senseoverview201507
Urban senseoverview201507Urban senseoverview201507
Urban senseoverview201507
 
Singapore 3D Map for a Smart Nation
Singapore 3D Map for a Smart NationSingapore 3D Map for a Smart Nation
Singapore 3D Map for a Smart Nation
 
PathS: Enhancing Geographical Maps with Environmental Sensed Data
PathS: Enhancing Geographical Maps with Environmental Sensed DataPathS: Enhancing Geographical Maps with Environmental Sensed Data
PathS: Enhancing Geographical Maps with Environmental Sensed Data
 
Intro
IntroIntro
Intro
 
Will camera technology become an ITS sensor
Will camera technology become an ITS sensorWill camera technology become an ITS sensor
Will camera technology become an ITS sensor
 
Geo-Navigation, an Augmented Reality Perspective
Geo-Navigation, an Augmented Reality PerspectiveGeo-Navigation, an Augmented Reality Perspective
Geo-Navigation, an Augmented Reality Perspective
 
Ambient Intelligence: Definitions and Application Areas
Ambient Intelligence: Definitions and Application AreasAmbient Intelligence: Definitions and Application Areas
Ambient Intelligence: Definitions and Application Areas
 
matdid473708.pdf
matdid473708.pdfmatdid473708.pdf
matdid473708.pdf
 
understanding the planet using satellites and deep learning
understanding the planet using satellites and deep learningunderstanding the planet using satellites and deep learning
understanding the planet using satellites and deep learning
 
PPT-DC.pptx
PPT-DC.pptxPPT-DC.pptx
PPT-DC.pptx
 
Challenges on wireless Heterogeneous Networks for Mobile Cloud Computing in a...
Challenges on wireless Heterogeneous Networks for Mobile Cloud Computing in a...Challenges on wireless Heterogeneous Networks for Mobile Cloud Computing in a...
Challenges on wireless Heterogeneous Networks for Mobile Cloud Computing in a...
 
On Crowd-sensing back-end
On Crowd-sensing back-endOn Crowd-sensing back-end
On Crowd-sensing back-end
 
DGterzo
DGterzoDGterzo
DGterzo
 
Crowd sourced intelligence built into search over hadoop
Crowd sourced intelligence built into search over hadoopCrowd sourced intelligence built into search over hadoop
Crowd sourced intelligence built into search over hadoop
 

Plus de eXascale Infolab

Beyond Triplets: Hyper-Relational Knowledge Graph Embedding for Link Prediction
Beyond Triplets: Hyper-Relational Knowledge Graph Embedding for Link PredictionBeyond Triplets: Hyper-Relational Knowledge Graph Embedding for Link Prediction
Beyond Triplets: Hyper-Relational Knowledge Graph Embedding for Link PredictioneXascale Infolab
 
It Takes Two: Instrumenting the Interaction between In-Memory Databases and S...
It Takes Two: Instrumenting the Interaction between In-Memory Databases and S...It Takes Two: Instrumenting the Interaction between In-Memory Databases and S...
It Takes Two: Instrumenting the Interaction between In-Memory Databases and S...eXascale Infolab
 
Representation Learning on Complex Graphs
Representation Learning on Complex GraphsRepresentation Learning on Complex Graphs
Representation Learning on Complex GraphseXascale Infolab
 
A force directed approach for offline gps trajectory map
A force directed approach for offline gps trajectory mapA force directed approach for offline gps trajectory map
A force directed approach for offline gps trajectory mapeXascale Infolab
 
HistoSketch: Fast Similarity-Preserving Sketching of Streaming Histograms wit...
HistoSketch: Fast Similarity-Preserving Sketching of Streaming Histograms wit...HistoSketch: Fast Similarity-Preserving Sketching of Streaming Histograms wit...
HistoSketch: Fast Similarity-Preserving Sketching of Streaming Histograms wit...eXascale Infolab
 
SwissLink: High-Precision, Context-Free Entity Linking Exploiting Unambiguous...
SwissLink: High-Precision, Context-Free Entity Linking Exploiting Unambiguous...SwissLink: High-Precision, Context-Free Entity Linking Exploiting Unambiguous...
SwissLink: High-Precision, Context-Free Entity Linking Exploiting Unambiguous...eXascale Infolab
 
Dependency-Driven Analytics: A Compass for Uncharted Data Oceans
Dependency-Driven Analytics: A Compass for Uncharted Data OceansDependency-Driven Analytics: A Compass for Uncharted Data Oceans
Dependency-Driven Analytics: A Compass for Uncharted Data OceanseXascale Infolab
 
SANAPHOR: Ontology-based Coreference Resolution
SANAPHOR: Ontology-based Coreference ResolutionSANAPHOR: Ontology-based Coreference Resolution
SANAPHOR: Ontology-based Coreference ResolutioneXascale Infolab
 
Efficient, Scalable, and Provenance-Aware Management of Linked Data
Efficient, Scalable, and Provenance-Aware Management of Linked DataEfficient, Scalable, and Provenance-Aware Management of Linked Data
Efficient, Scalable, and Provenance-Aware Management of Linked DataeXascale Infolab
 
Entity-Centric Data Management
Entity-Centric Data ManagementEntity-Centric Data Management
Entity-Centric Data ManagementeXascale Infolab
 
LDOW2015 - Uduvudu: a Graph-Aware and Adaptive UI Engine for Linked Data
LDOW2015 - Uduvudu: a Graph-Aware and Adaptive UI Engine for Linked DataLDOW2015 - Uduvudu: a Graph-Aware and Adaptive UI Engine for Linked Data
LDOW2015 - Uduvudu: a Graph-Aware and Adaptive UI Engine for Linked DataeXascale Infolab
 
Executing Provenance-Enabled Queries over Web Data
Executing Provenance-Enabled Queries over Web DataExecuting Provenance-Enabled Queries over Web Data
Executing Provenance-Enabled Queries over Web DataeXascale Infolab
 
The Dynamics of Micro-Task Crowdsourcing
The Dynamics of Micro-Task CrowdsourcingThe Dynamics of Micro-Task Crowdsourcing
The Dynamics of Micro-Task CrowdsourcingeXascale Infolab
 
Fixing the Domain and Range of Properties in Linked Data by Context Disambigu...
Fixing the Domain and Range of Properties in Linked Data by Context Disambigu...Fixing the Domain and Range of Properties in Linked Data by Context Disambigu...
Fixing the Domain and Range of Properties in Linked Data by Context Disambigu...eXascale Infolab
 
CIKM14: Fixing grammatical errors by preposition ranking
CIKM14: Fixing grammatical errors by preposition rankingCIKM14: Fixing grammatical errors by preposition ranking
CIKM14: Fixing grammatical errors by preposition rankingeXascale Infolab
 
An Introduction to Big Data
An Introduction to Big DataAn Introduction to Big Data
An Introduction to Big DataeXascale Infolab
 

Plus de eXascale Infolab (20)

Beyond Triplets: Hyper-Relational Knowledge Graph Embedding for Link Prediction
Beyond Triplets: Hyper-Relational Knowledge Graph Embedding for Link PredictionBeyond Triplets: Hyper-Relational Knowledge Graph Embedding for Link Prediction
Beyond Triplets: Hyper-Relational Knowledge Graph Embedding for Link Prediction
 
It Takes Two: Instrumenting the Interaction between In-Memory Databases and S...
It Takes Two: Instrumenting the Interaction between In-Memory Databases and S...It Takes Two: Instrumenting the Interaction between In-Memory Databases and S...
It Takes Two: Instrumenting the Interaction between In-Memory Databases and S...
 
Representation Learning on Complex Graphs
Representation Learning on Complex GraphsRepresentation Learning on Complex Graphs
Representation Learning on Complex Graphs
 
A force directed approach for offline gps trajectory map
A force directed approach for offline gps trajectory mapA force directed approach for offline gps trajectory map
A force directed approach for offline gps trajectory map
 
Cikm 2018
Cikm 2018Cikm 2018
Cikm 2018
 
HistoSketch: Fast Similarity-Preserving Sketching of Streaming Histograms wit...
HistoSketch: Fast Similarity-Preserving Sketching of Streaming Histograms wit...HistoSketch: Fast Similarity-Preserving Sketching of Streaming Histograms wit...
HistoSketch: Fast Similarity-Preserving Sketching of Streaming Histograms wit...
 
SwissLink: High-Precision, Context-Free Entity Linking Exploiting Unambiguous...
SwissLink: High-Precision, Context-Free Entity Linking Exploiting Unambiguous...SwissLink: High-Precision, Context-Free Entity Linking Exploiting Unambiguous...
SwissLink: High-Precision, Context-Free Entity Linking Exploiting Unambiguous...
 
Dependency-Driven Analytics: A Compass for Uncharted Data Oceans
Dependency-Driven Analytics: A Compass for Uncharted Data OceansDependency-Driven Analytics: A Compass for Uncharted Data Oceans
Dependency-Driven Analytics: A Compass for Uncharted Data Oceans
 
Crowd scheduling www2016
Crowd scheduling www2016Crowd scheduling www2016
Crowd scheduling www2016
 
SANAPHOR: Ontology-based Coreference Resolution
SANAPHOR: Ontology-based Coreference ResolutionSANAPHOR: Ontology-based Coreference Resolution
SANAPHOR: Ontology-based Coreference Resolution
 
Efficient, Scalable, and Provenance-Aware Management of Linked Data
Efficient, Scalable, and Provenance-Aware Management of Linked DataEfficient, Scalable, and Provenance-Aware Management of Linked Data
Efficient, Scalable, and Provenance-Aware Management of Linked Data
 
Entity-Centric Data Management
Entity-Centric Data ManagementEntity-Centric Data Management
Entity-Centric Data Management
 
SSSW 2015 Sense Making
SSSW 2015 Sense MakingSSSW 2015 Sense Making
SSSW 2015 Sense Making
 
LDOW2015 - Uduvudu: a Graph-Aware and Adaptive UI Engine for Linked Data
LDOW2015 - Uduvudu: a Graph-Aware and Adaptive UI Engine for Linked DataLDOW2015 - Uduvudu: a Graph-Aware and Adaptive UI Engine for Linked Data
LDOW2015 - Uduvudu: a Graph-Aware and Adaptive UI Engine for Linked Data
 
Executing Provenance-Enabled Queries over Web Data
Executing Provenance-Enabled Queries over Web DataExecuting Provenance-Enabled Queries over Web Data
Executing Provenance-Enabled Queries over Web Data
 
The Dynamics of Micro-Task Crowdsourcing
The Dynamics of Micro-Task CrowdsourcingThe Dynamics of Micro-Task Crowdsourcing
The Dynamics of Micro-Task Crowdsourcing
 
Fixing the Domain and Range of Properties in Linked Data by Context Disambigu...
Fixing the Domain and Range of Properties in Linked Data by Context Disambigu...Fixing the Domain and Range of Properties in Linked Data by Context Disambigu...
Fixing the Domain and Range of Properties in Linked Data by Context Disambigu...
 
CIKM14: Fixing grammatical errors by preposition ranking
CIKM14: Fixing grammatical errors by preposition rankingCIKM14: Fixing grammatical errors by preposition ranking
CIKM14: Fixing grammatical errors by preposition ranking
 
OLTP-Bench
OLTP-BenchOLTP-Bench
OLTP-Bench
 
An Introduction to Big Data
An Introduction to Big DataAn Introduction to Big Data
An Introduction to Big Data
 

Dernier

New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DayH2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DaySri Ambati
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 

Dernier (20)

New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DayH2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 

NoizCrowd: A Crowd-Based Data Gathering and Management System for Noise Level Data

  • 1. NoizCrowd: A Crowd-Based Data Gathering and Management System for Noise Level Data Mariusz Wisniewski, Gianluca Demartini, Apostolos Malatras, and Philippe Cudré-Mauroux University of Fribourg, Switzerland
  • 2. Motivation - Big Data • Large dataset are necessary to enable analytics and support decision making – Meteorological station / car traffic • Set up a large-scale sensing infrastructure is costly and time-consuming • Create a large amount of valuable data – Crowdsourcing – Data generation models – Smartphones as sensors – Big Data analytics Gianluca Demartini 2
  • 3. NoizCrowd • A crowd-sensing approach to big data generation using commodity sensors • Crowd-source noise level in a geo region • Noise propagation models to generate data • Array data management techniques to scale • Results accessible via a visual interface • Support decisions (e.g., where to live) Gianluca Demartini 3
  • 4. Outline • Related approaches • NoizCrowd Architecture Overview – Data Gathering – Storage – Modeling – Export and Visualization • Data Models • Performance Evaluation Gianluca Demartini 4
  • 5. Related Work • Participatory Sensing vs Sensor Networks – Low cost / High cost – Mobile phones / Sensors – Distributed / Centralized management – Privacy, data quality • Applications: Environment, vehicle routing Gianluca Demartini 5
  • 6. Related Work • Noise Mapping Apps – NoiseTube: opensource, widespread usage – NoiseMap: control over data – SoundSense: machine learning to classify sounds • NoizCrowd – Data in RDF linkable to other datasets (linkeddata.org) – Scalable storage: generate data by interpolation Gianluca Demartini 6
  • 8. Data Gathering • By means of Crowd-sourcing – GPS: location – Microphone: noise level – Internet connection: send data to server • Microphone Calibration – Sound level meter – Sharing conversion table for smartphone models Gianluca Demartini 8
  • 9. Data Storage • App sends median and peak dB values over few seconds • Spatio-temporal data: non-relational storage system (SciDB) – Durable storage – Retrieve data to build models – Export data for visualization • Multi-dimensional array (space and time) • Distributed storage Gianluca Demartini 9
  • 10. Noise Modeling • Data from crowd is noisy and skewed/sparse • Raw data is not shown to the end users • Models to deal with – Overlapping data – Missing data Gianluca Demartini 10
  • 11. Data Export and Visualization • From SciDB data is – converted to RDF – stored in dipLODocus[RDF] – Available via SPARQL • Visualization – Overlay noise level on a map – Additional chart for time evolution Gianluca Demartini 11
  • 13. Data Models • Spatial Interpolation – In the same time interval, data from different locations – Need to be computational simple (large volume) – Bi-dimensional range queries in space (SciDB) – K-nearest neighbor interpolation – Computed in parallel Gianluca Demartini
  • 14. Data Models • Temporal interpolation – Short ranges (minutes) like spatial interp. in 3D – Long ranges, look for patterns and infer • E.g., every Monday at 11am we have 50dB and we miss a Monday measurement • E.g., same measurement (50dB) in same area 2h ago and now Gianluca Demartini 14
  • 15. Noise Propagation Models • We adopt an existing model that takes into account: – Sound power – Distance from source – Directivity – Atmospheric absorption – Excess attenuation (we use meteo conditions) • Difficult to measure with smartphone • Constant in a given region (and use GPS info) Gianluca Demartini 15
  • 16. Materialization of Models • Data from models – Is computationally expensive to generate – May be a lot since we can cover any region • We do late materialization – At query time – Only for the specific request – Cached and indexed for future requests – Incremental updates of views, if possible Gianluca Demartini 16
  • 17. Performance Evaluation (1) • 30 outdoor deployments – 2,3,4 smartphones – Multiple noise sources – Urban setting, flat area of 50x50 meters • Professional-grade noise level meter as gold standard measurement • 85% of interpolated data +-6dB error • 63% of interpolated data +-4dB error Gianluca Demartini 17
  • 18. Performance Evaluation (2) • Sound propagation and source location • 3 smartphones, 100dB source Gianluca Demartini 18
  • 19. Performance Evaluation (3) • Sound level of source error – 16% with 3 measurements – 10% with 4 measurements – 9% with 5 measurements • Source location – 3m error on average Gianluca Demartini 19
  • 20. NoizCrowd - Conclusions • Large scale data is key for decision making • Crowd-source noise level data using mobiles – Scale-out using an array backend – Generate missing data and visualize • Next steps – Android app – Data recording as background feature – Additional materialization strategies http://exascale.info Gianluca Demartini 20