SlideShare une entreprise Scribd logo
Carsten Keßler a,b and René de Groot a
a Institute for Geoinformatics, University of Münster | b soon: Hunter College, CUNY
http://carsten.io | @carstenkessler
Trust as a Proxy Measure for the
Quality of VGI in the Case of OSM
The Idea
‣ Develop a measure to assess the degree to which a data
consumer can trust the quality of a feature
The Idea
‣ Develop a measure to assess the degree to which a data
consumer can trust the quality of a feature
‣ Trust measure is based on a feature’s editing history
The Idea
‣ Develop a measure to assess the degree to which a data
consumer can trust the quality of a feature
‣ Trust measure is based on a feature’s editing history
‣ Benefits
‣ Works at feature level
‣ Filter features by quality
‣ Spot problematic features
Does this work?
Can we reliably assess the quality of a feature in
OpenStreetMap based on its editing history?
Does this work?
Can we reliably assess the quality of a feature in
OpenStreetMap based on its editing history?
amenity = university
name = Institute for Geoinformatics
v1
Does this work?
Can we reliably assess the quality of a feature in
OpenStreetMap based on its editing history?
amenity = university
name = Institute for Geoinformatics
amenity = university
building = yes
name = Institute for Geoinformatics
v1 v2
Does this work?
Can we reliably assess the quality of a feature in
OpenStreetMap based on its editing history?
amenity = university
name = Institute for Geoinformatics
amenity = university
building = yes
name = Institute for Geoinformatics
addr:city = Münster
addr:country = DE
addr:housenumber = 253
addr:street = Weseler Straße
building = yes
wheelchair = limited
v1 v2 v3 …
OSM Heatmap Kudos: Johannes Trame
OSM Provenance Ontology
http://carsten.io/osm/osm-provenance.rdf
prv:Tag
includesEdit
Changeset prv:CreationGuideline
Edit
prv:createdBy
prv:precededBy
prv:usedData
NodeState
WayState
prv:DataCreation User
prv:performedBy
changesGeometry
addsTag
removesTag
changesValueOfKey
rdfs:Literal
prv:DataItem
prv:HumanActor
subClassOfhasTag
FeatureState
Does this work?
‣ Get a first idea whether this is a viable approach
‣ Compare results of
‣ a simple trust measure and
‣ observed feature quality
‣ Is there a correlation between the two?
Study area:
Münster’s
old town
Feature Selection
Feature Selection
‣ Re-mapping the whole district was not feasible
Feature Selection
‣ Re-mapping the whole district was not feasible
‣ Up to 100 features were manageable
Feature Selection
‣ Re-mapping the whole district was not feasible
‣ Up to 100 features were manageable
‣ Selection based on minimum number of versions
Feature Selection
‣ Re-mapping the whole district was not feasible
‣ Up to 100 features were manageable
‣ Selection based on minimum number of versions
‣ 74 features with 6+ versions
74 features
selected
Trust measure
Trust measure
‣ Positive factors:
‣ Versions
‣ Users
‣ Indirect confirmations =
edits in the direct vicinity
(50m)
Trust measure
‣ Positive factors:
‣ Versions
‣ Users
‣ Indirect confirmations =
edits in the direct vicinity
(50m)
‣ Negative factors:
‣ Tag corrections
‣ Rollbacks
Trust measure (contd.)
‣ Classification for each factor: 5 equal classes
‣ Combined into one classification
‣ Equal weights
Trust
measure
Field Survey
‣ Thematic accuracy
4 classes:
1. Main tag wrong
2. Other tags wrong
3. Thematic ambiguities
4. Thematically correct
Field Survey
‣ Thematic accuracy
4 classes:
1. Main tag wrong
2. Other tags wrong
3. Thematic ambiguities
4. Thematically correct
‣ Results:
‣ 6 features (~8%)
‣ 2 features (~3%)
‣ 9 features (~12%)
‣ 57 features (~77%)
Field Survey (contd.)
‣ Topological consistency
Field Survey (contd.)
‣ Topological consistency
‣ Is the feature correctly
positioned relative to the
surrounding features?
Field Survey (contd.)
‣ Topological consistency
‣ Is the feature correctly
positioned relative to the
surrounding features?
‣ Results:
‣ 73 out of 74 features (~99%)
Field Survey (contd.)
‣ Topological consistency
‣ Is the feature correctly
positioned relative to the
surrounding features?
‣ Results:
‣ 73 out of 74 features (~99%)
‣ Information completeness
‣ TF-IDF measure to identify
relevant tags per main tag
Field Survey (contd.)
‣ Topological consistency
‣ Is the feature correctly
positioned relative to the
surrounding features?
‣ Results:
‣ 73 out of 74 features (~99%)
‣ Information completeness
‣ TF-IDF measure to identify
relevant tags per main tag
‣ ~37% tags missing (avg.)
Observed
quality:
combined
results
Trust
measure
mean quality class: ~4.2
mean trust class: ~2.8
Do we get the trend right?
Do we get the trend right?
‣ Removed outliers
‣ Kendall’s τ: 0.52
‣ Moderate, but significant
positive correlation
Conclusions
Conclusions
‣ Initial study
Conclusions
‣ Initial study
‣ A feature’s history can determine its trustworthiness
Conclusions
‣ Initial study
‣ A feature’s history can determine its trustworthiness
‣ Trust values correlate with observed quality
Conclusions
‣ Initial study
‣ A feature’s history can determine its trustworthiness
‣ Trust values correlate with observed quality
‣ Even with a very simple model
Conclusions
‣ Initial study
‣ A feature’s history can determine its trustworthiness
‣ Trust values correlate with observed quality
‣ Even with a very simple model
‣ Outliers cannot be explained yet
Tons of Future Work
Tons of Future Work
‣ Extend and refine the trust model:
Classification, weighting, positive vs negative aspects, …
Tons of Future Work
‣ Extend and refine the trust model:
Classification, weighting, positive vs negative aspects, …
‣ Social aspects: Who has edited a feature?
Tons of Future Work
‣ Extend and refine the trust model:
Classification, weighting, positive vs negative aspects, …
‣ Social aspects: Who has edited a feature?
‣ Repeat study without spatial focus
Tons of Future Work
‣ Extend and refine the trust model:
Classification, weighting, positive vs negative aspects, …
‣ Social aspects: Who has edited a feature?
‣ Repeat study without spatial focus
‣ How to scale the data collection?
Tons of Future Work
‣ Extend and refine the trust model:
Classification, weighting, positive vs negative aspects, …
‣ Social aspects: Who has edited a feature?
‣ Repeat study without spatial focus
‣ How to scale the data collection?
‣ Learn the trust model from the data
Thankyou!
All data used in this research © OpenStreetMap contributors.
carsten.kessler@uni-muenster.de | http://carsten.io | @carstenkessler
Carsten Keßler | René de Groot

Contenu connexe

Similaire à Trust as a Proxy Measure for the Quality of VGI in the Case of OSM

351315535-Module-1-Intro-to-Data-Science-pptx.pptx
351315535-Module-1-Intro-to-Data-Science-pptx.pptx351315535-Module-1-Intro-to-Data-Science-pptx.pptx
351315535-Module-1-Intro-to-Data-Science-pptx.pptxXanGwaps
 
GDG Cloud Community Day 2022 - Managing data quality in Machine Learning
GDG Cloud Community Day 2022 -  Managing data quality in Machine LearningGDG Cloud Community Day 2022 -  Managing data quality in Machine Learning
GDG Cloud Community Day 2022 - Managing data quality in Machine LearningSARADINDU SENGUPTA
 
Kaggle Gold Medal Case Study
Kaggle Gold Medal Case StudyKaggle Gold Medal Case Study
Kaggle Gold Medal Case StudyAlon Bochman, CFA
 
Using Contextual Information to Understand Searching and Browsing Behavior
Using Contextual Information to Understand Searching and Browsing BehaviorUsing Contextual Information to Understand Searching and Browsing Behavior
Using Contextual Information to Understand Searching and Browsing BehaviorJulia Kiseleva
 
1440 track 2 boire_using our laptop
1440 track 2 boire_using our laptop1440 track 2 boire_using our laptop
1440 track 2 boire_using our laptopRising Media, Inc.
 
Predictive Model and Record Description with Segmented Sensitivity Analysis (...
Predictive Model and Record Description with Segmented Sensitivity Analysis (...Predictive Model and Record Description with Segmented Sensitivity Analysis (...
Predictive Model and Record Description with Segmented Sensitivity Analysis (...Greg Makowski
 
Stance classification - Presentation QMUL by Carolina Scarton, USFD
Stance classification - Presentation QMUL by Carolina Scarton, USFDStance classification - Presentation QMUL by Carolina Scarton, USFD
Stance classification - Presentation QMUL by Carolina Scarton, USFDWeverify
 
Setting up an A/B-testing framework
Setting up an A/B-testing frameworkSetting up an A/B-testing framework
Setting up an A/B-testing frameworkAgnes van Belle
 
Artificial intelligence and IoT
Artificial intelligence and IoTArtificial intelligence and IoT
Artificial intelligence and IoTVeselin Pizurica
 
Monte Carlo Simulation for Agile Development
Monte Carlo Simulation for Agile DevelopmentMonte Carlo Simulation for Agile Development
Monte Carlo Simulation for Agile DevelopmentGlen Alleman
 
RDF Data Quality Assessment - connecting the pieces
RDF Data Quality Assessment - connecting the piecesRDF Data Quality Assessment - connecting the pieces
RDF Data Quality Assessment - connecting the piecesConnected Data World
 
udacity-dandsyllabus
udacity-dandsyllabusudacity-dandsyllabus
udacity-dandsyllabusBora Yüret
 
Connecting Scenario Approaches with Scenario Tools
Connecting Scenario Approaches with Scenario ToolsConnecting Scenario Approaches with Scenario Tools
Connecting Scenario Approaches with Scenario ToolsRPO America
 
Air Quality Modelling Tools (Aberdeen Pilot Project) Dr. Alan Hills, SEPA
Air Quality Modelling Tools (Aberdeen Pilot Project) Dr. Alan Hills, SEPAAir Quality Modelling Tools (Aberdeen Pilot Project) Dr. Alan Hills, SEPA
Air Quality Modelling Tools (Aberdeen Pilot Project) Dr. Alan Hills, SEPASTEP_scotland
 
Professor Steve Roberts; The Bayesian Crowd: scalable information combinati...
Professor Steve Roberts; The Bayesian Crowd: scalable information combinati...Professor Steve Roberts; The Bayesian Crowd: scalable information combinati...
Professor Steve Roberts; The Bayesian Crowd: scalable information combinati...Bayes Nets meetup London
 
Professor Steve Roberts; The Bayesian Crowd: scalable information combinati...
Professor Steve Roberts; The Bayesian Crowd: scalable information combinati...Professor Steve Roberts; The Bayesian Crowd: scalable information combinati...
Professor Steve Roberts; The Bayesian Crowd: scalable information combinati...Ian Morgan
 

Similaire à Trust as a Proxy Measure for the Quality of VGI in the Case of OSM (20)

351315535-Module-1-Intro-to-Data-Science-pptx.pptx
351315535-Module-1-Intro-to-Data-Science-pptx.pptx351315535-Module-1-Intro-to-Data-Science-pptx.pptx
351315535-Module-1-Intro-to-Data-Science-pptx.pptx
 
GDG Cloud Community Day 2022 - Managing data quality in Machine Learning
GDG Cloud Community Day 2022 -  Managing data quality in Machine LearningGDG Cloud Community Day 2022 -  Managing data quality in Machine Learning
GDG Cloud Community Day 2022 - Managing data quality in Machine Learning
 
Kaggle Gold Medal Case Study
Kaggle Gold Medal Case StudyKaggle Gold Medal Case Study
Kaggle Gold Medal Case Study
 
Using Contextual Information to Understand Searching and Browsing Behavior
Using Contextual Information to Understand Searching and Browsing BehaviorUsing Contextual Information to Understand Searching and Browsing Behavior
Using Contextual Information to Understand Searching and Browsing Behavior
 
1440 track 2 boire_using our laptop
1440 track 2 boire_using our laptop1440 track 2 boire_using our laptop
1440 track 2 boire_using our laptop
 
Weka bike rental
Weka bike rentalWeka bike rental
Weka bike rental
 
RISK EVALUATION-1
RISK EVALUATION-1RISK EVALUATION-1
RISK EVALUATION-1
 
Predictive Model and Record Description with Segmented Sensitivity Analysis (...
Predictive Model and Record Description with Segmented Sensitivity Analysis (...Predictive Model and Record Description with Segmented Sensitivity Analysis (...
Predictive Model and Record Description with Segmented Sensitivity Analysis (...
 
Stance classification - Presentation QMUL by Carolina Scarton, USFD
Stance classification - Presentation QMUL by Carolina Scarton, USFDStance classification - Presentation QMUL by Carolina Scarton, USFD
Stance classification - Presentation QMUL by Carolina Scarton, USFD
 
Setting up an A/B-testing framework
Setting up an A/B-testing frameworkSetting up an A/B-testing framework
Setting up an A/B-testing framework
 
Artificial intelligence and IoT
Artificial intelligence and IoTArtificial intelligence and IoT
Artificial intelligence and IoT
 
Monte Carlo Simulation for Agile Development
Monte Carlo Simulation for Agile DevelopmentMonte Carlo Simulation for Agile Development
Monte Carlo Simulation for Agile Development
 
RDF Data Quality Assessment - connecting the pieces
RDF Data Quality Assessment - connecting the piecesRDF Data Quality Assessment - connecting the pieces
RDF Data Quality Assessment - connecting the pieces
 
Cist2014 slides
Cist2014 slidesCist2014 slides
Cist2014 slides
 
BOM DMAIC TEMPLATE
BOM DMAIC TEMPLATEBOM DMAIC TEMPLATE
BOM DMAIC TEMPLATE
 
udacity-dandsyllabus
udacity-dandsyllabusudacity-dandsyllabus
udacity-dandsyllabus
 
Connecting Scenario Approaches with Scenario Tools
Connecting Scenario Approaches with Scenario ToolsConnecting Scenario Approaches with Scenario Tools
Connecting Scenario Approaches with Scenario Tools
 
Air Quality Modelling Tools (Aberdeen Pilot Project) Dr. Alan Hills, SEPA
Air Quality Modelling Tools (Aberdeen Pilot Project) Dr. Alan Hills, SEPAAir Quality Modelling Tools (Aberdeen Pilot Project) Dr. Alan Hills, SEPA
Air Quality Modelling Tools (Aberdeen Pilot Project) Dr. Alan Hills, SEPA
 
Professor Steve Roberts; The Bayesian Crowd: scalable information combinati...
Professor Steve Roberts; The Bayesian Crowd: scalable information combinati...Professor Steve Roberts; The Bayesian Crowd: scalable information combinati...
Professor Steve Roberts; The Bayesian Crowd: scalable information combinati...
 
Professor Steve Roberts; The Bayesian Crowd: scalable information combinati...
Professor Steve Roberts; The Bayesian Crowd: scalable information combinati...Professor Steve Roberts; The Bayesian Crowd: scalable information combinati...
Professor Steve Roberts; The Bayesian Crowd: scalable information combinati...
 

Plus de Carsten Keßler

Geoprivacy Talk @ UJI Castellòn
Geoprivacy Talk @ UJI CastellònGeoprivacy Talk @ UJI Castellòn
Geoprivacy Talk @ UJI CastellònCarsten Keßler
 
Privacy-preserving contagious disease tracking
Privacy-preserving contagious disease trackingPrivacy-preserving contagious disease tracking
Privacy-preserving contagious disease trackingCarsten Keßler
 
Central Places in Wikipedia
Central Places in WikipediaCentral Places in Wikipedia
Central Places in WikipediaCarsten Keßler
 
Using the Web as a Data Source: Challenges for Linked Science
Using the Web as a Data Source: Challenges for Linked ScienceUsing the Web as a Data Source: Challenges for Linked Science
Using the Web as a Data Source: Challenges for Linked ScienceCarsten Keßler
 
The EnviroCar Platform: A Decentralized Approach to Monitoring Urban Traffic...
The EnviroCar Platform: A Decentralized Approach  to Monitoring Urban Traffic...The EnviroCar Platform: A Decentralized Approach  to Monitoring Urban Traffic...
The EnviroCar Platform: A Decentralized Approach to Monitoring Urban Traffic...Carsten Keßler
 
Research in the Age of the Context Machine
Research in the Age of the Context MachineResearch in the Age of the Context Machine
Research in the Age of the Context MachineCarsten Keßler
 
Encoding and querying historic map content
Encoding and querying historic map contentEncoding and querying historic map content
Encoding and querying historic map contentCarsten Keßler
 
GIS for the Masses: Volunteered Geographic Information
GIS for the Masses: Volunteered Geographic InformationGIS for the Masses: Volunteered Geographic Information
GIS for the Masses: Volunteered Geographic InformationCarsten Keßler
 
Linked Data and Time – Modeling Researcher Life Lines by Events
Linked Data and Time – Modeling Researcher Life Lines by EventsLinked Data and Time – Modeling Researcher Life Lines by Events
Linked Data and Time – Modeling Researcher Life Lines by EventsCarsten Keßler
 
LODUM talk at ifgi's Spatial @ WWU series
LODUM talk at ifgi's Spatial @ WWU seriesLODUM talk at ifgi's Spatial @ WWU series
LODUM talk at ifgi's Spatial @ WWU seriesCarsten Keßler
 

Plus de Carsten Keßler (11)

Geoprivacy Talk @ UJI Castellòn
Geoprivacy Talk @ UJI CastellònGeoprivacy Talk @ UJI Castellòn
Geoprivacy Talk @ UJI Castellòn
 
Privacy-preserving contagious disease tracking
Privacy-preserving contagious disease trackingPrivacy-preserving contagious disease tracking
Privacy-preserving contagious disease tracking
 
Central Places in Wikipedia
Central Places in WikipediaCentral Places in Wikipedia
Central Places in Wikipedia
 
Using the Web as a Data Source: Challenges for Linked Science
Using the Web as a Data Source: Challenges for Linked ScienceUsing the Web as a Data Source: Challenges for Linked Science
Using the Web as a Data Source: Challenges for Linked Science
 
The EnviroCar Platform: A Decentralized Approach to Monitoring Urban Traffic...
The EnviroCar Platform: A Decentralized Approach  to Monitoring Urban Traffic...The EnviroCar Platform: A Decentralized Approach  to Monitoring Urban Traffic...
The EnviroCar Platform: A Decentralized Approach to Monitoring Urban Traffic...
 
Research in the Age of the Context Machine
Research in the Age of the Context MachineResearch in the Age of the Context Machine
Research in the Age of the Context Machine
 
Encoding and querying historic map content
Encoding and querying historic map contentEncoding and querying historic map content
Encoding and querying historic map content
 
GIS for the Masses: Volunteered Geographic Information
GIS for the Masses: Volunteered Geographic InformationGIS for the Masses: Volunteered Geographic Information
GIS for the Masses: Volunteered Geographic Information
 
Linked Data and Time – Modeling Researcher Life Lines by Events
Linked Data and Time – Modeling Researcher Life Lines by EventsLinked Data and Time – Modeling Researcher Life Lines by Events
Linked Data and Time – Modeling Researcher Life Lines by Events
 
LODUM @ SWIB11
LODUM @ SWIB11LODUM @ SWIB11
LODUM @ SWIB11
 
LODUM talk at ifgi's Spatial @ WWU series
LODUM talk at ifgi's Spatial @ WWU seriesLODUM talk at ifgi's Spatial @ WWU series
LODUM talk at ifgi's Spatial @ WWU series
 

Dernier

How Red Hat Uses FDO in Device Lifecycle _ Costin and Vitaliy at Red Hat.pdf
How Red Hat Uses FDO in Device Lifecycle _ Costin and Vitaliy at Red Hat.pdfHow Red Hat Uses FDO in Device Lifecycle _ Costin and Vitaliy at Red Hat.pdf
How Red Hat Uses FDO in Device Lifecycle _ Costin and Vitaliy at Red Hat.pdfFIDO Alliance
 
What's New in Teams Calling, Meetings and Devices April 2024
What's New in Teams Calling, Meetings and Devices April 2024What's New in Teams Calling, Meetings and Devices April 2024
What's New in Teams Calling, Meetings and Devices April 2024Stephanie Beckett
 
PLAI - Acceleration Program for Generative A.I. Startups
PLAI - Acceleration Program for Generative A.I. StartupsPLAI - Acceleration Program for Generative A.I. Startups
PLAI - Acceleration Program for Generative A.I. StartupsStefano
 
Linux Foundation Edge _ Overview of FDO Software Components _ Randy at Intel.pdf
Linux Foundation Edge _ Overview of FDO Software Components _ Randy at Intel.pdfLinux Foundation Edge _ Overview of FDO Software Components _ Randy at Intel.pdf
Linux Foundation Edge _ Overview of FDO Software Components _ Randy at Intel.pdfFIDO Alliance
 
FDO for Camera, Sensor and Networking Device – Commercial Solutions from VinC...
FDO for Camera, Sensor and Networking Device – Commercial Solutions from VinC...FDO for Camera, Sensor and Networking Device – Commercial Solutions from VinC...
FDO for Camera, Sensor and Networking Device – Commercial Solutions from VinC...FIDO Alliance
 
AI revolution and Salesforce, Jiří Karpíšek
AI revolution and Salesforce, Jiří KarpíšekAI revolution and Salesforce, Jiří Karpíšek
AI revolution and Salesforce, Jiří KarpíšekCzechDreamin
 
SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...
SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...
SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...CzechDreamin
 
Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...
Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...
Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...CzechDreamin
 
Buy Epson EcoTank L3210 Colour Printer Online.pdf
Buy Epson EcoTank L3210 Colour Printer Online.pdfBuy Epson EcoTank L3210 Colour Printer Online.pdf
Buy Epson EcoTank L3210 Colour Printer Online.pdfEasyPrinterHelp
 
10 Differences between Sales Cloud and CPQ, Blanka Doktorová
10 Differences between Sales Cloud and CPQ, Blanka Doktorová10 Differences between Sales Cloud and CPQ, Blanka Doktorová
10 Differences between Sales Cloud and CPQ, Blanka DoktorováCzechDreamin
 
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo DiehlFuture Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo DiehlPeter Udo Diehl
 
Introduction to Open Source RAG and RAG Evaluation
Introduction to Open Source RAG and RAG EvaluationIntroduction to Open Source RAG and RAG Evaluation
Introduction to Open Source RAG and RAG EvaluationZilliz
 
Where to Learn More About FDO _ Richard at FIDO Alliance.pdf
Where to Learn More About FDO _ Richard at FIDO Alliance.pdfWhere to Learn More About FDO _ Richard at FIDO Alliance.pdf
Where to Learn More About FDO _ Richard at FIDO Alliance.pdfFIDO Alliance
 
Structuring Teams and Portfolios for Success
Structuring Teams and Portfolios for SuccessStructuring Teams and Portfolios for Success
Structuring Teams and Portfolios for SuccessUXDXConf
 
Top 10 Symfony Development Companies 2024
Top 10 Symfony Development Companies 2024Top 10 Symfony Development Companies 2024
Top 10 Symfony Development Companies 2024TopCSSGallery
 
Custom Approval Process: A New Perspective, Pavel Hrbacek & Anindya Halder
Custom Approval Process: A New Perspective, Pavel Hrbacek & Anindya HalderCustom Approval Process: A New Perspective, Pavel Hrbacek & Anindya Halder
Custom Approval Process: A New Perspective, Pavel Hrbacek & Anindya HalderCzechDreamin
 
Agentic RAG What it is its types applications and implementation.pdf
Agentic RAG What it is its types applications and implementation.pdfAgentic RAG What it is its types applications and implementation.pdf
Agentic RAG What it is its types applications and implementation.pdfChristopherTHyatt
 
The Metaverse: Are We There Yet?
The  Metaverse:    Are   We  There  Yet?The  Metaverse:    Are   We  There  Yet?
The Metaverse: Are We There Yet?Mark Billinghurst
 
The Value of Certifying Products for FDO _ Paul at FIDO Alliance.pdf
The Value of Certifying Products for FDO _ Paul at FIDO Alliance.pdfThe Value of Certifying Products for FDO _ Paul at FIDO Alliance.pdf
The Value of Certifying Products for FDO _ Paul at FIDO Alliance.pdfFIDO Alliance
 
IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptx
IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptxIOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptx
IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptxAbida Shariff
 

Dernier (20)

How Red Hat Uses FDO in Device Lifecycle _ Costin and Vitaliy at Red Hat.pdf
How Red Hat Uses FDO in Device Lifecycle _ Costin and Vitaliy at Red Hat.pdfHow Red Hat Uses FDO in Device Lifecycle _ Costin and Vitaliy at Red Hat.pdf
How Red Hat Uses FDO in Device Lifecycle _ Costin and Vitaliy at Red Hat.pdf
 
What's New in Teams Calling, Meetings and Devices April 2024
What's New in Teams Calling, Meetings and Devices April 2024What's New in Teams Calling, Meetings and Devices April 2024
What's New in Teams Calling, Meetings and Devices April 2024
 
PLAI - Acceleration Program for Generative A.I. Startups
PLAI - Acceleration Program for Generative A.I. StartupsPLAI - Acceleration Program for Generative A.I. Startups
PLAI - Acceleration Program for Generative A.I. Startups
 
Linux Foundation Edge _ Overview of FDO Software Components _ Randy at Intel.pdf
Linux Foundation Edge _ Overview of FDO Software Components _ Randy at Intel.pdfLinux Foundation Edge _ Overview of FDO Software Components _ Randy at Intel.pdf
Linux Foundation Edge _ Overview of FDO Software Components _ Randy at Intel.pdf
 
FDO for Camera, Sensor and Networking Device – Commercial Solutions from VinC...
FDO for Camera, Sensor and Networking Device – Commercial Solutions from VinC...FDO for Camera, Sensor and Networking Device – Commercial Solutions from VinC...
FDO for Camera, Sensor and Networking Device – Commercial Solutions from VinC...
 
AI revolution and Salesforce, Jiří Karpíšek
AI revolution and Salesforce, Jiří KarpíšekAI revolution and Salesforce, Jiří Karpíšek
AI revolution and Salesforce, Jiří Karpíšek
 
SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...
SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...
SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...
 
Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...
Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...
Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...
 
Buy Epson EcoTank L3210 Colour Printer Online.pdf
Buy Epson EcoTank L3210 Colour Printer Online.pdfBuy Epson EcoTank L3210 Colour Printer Online.pdf
Buy Epson EcoTank L3210 Colour Printer Online.pdf
 
10 Differences between Sales Cloud and CPQ, Blanka Doktorová
10 Differences between Sales Cloud and CPQ, Blanka Doktorová10 Differences between Sales Cloud and CPQ, Blanka Doktorová
10 Differences between Sales Cloud and CPQ, Blanka Doktorová
 
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo DiehlFuture Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
 
Introduction to Open Source RAG and RAG Evaluation
Introduction to Open Source RAG and RAG EvaluationIntroduction to Open Source RAG and RAG Evaluation
Introduction to Open Source RAG and RAG Evaluation
 
Where to Learn More About FDO _ Richard at FIDO Alliance.pdf
Where to Learn More About FDO _ Richard at FIDO Alliance.pdfWhere to Learn More About FDO _ Richard at FIDO Alliance.pdf
Where to Learn More About FDO _ Richard at FIDO Alliance.pdf
 
Structuring Teams and Portfolios for Success
Structuring Teams and Portfolios for SuccessStructuring Teams and Portfolios for Success
Structuring Teams and Portfolios for Success
 
Top 10 Symfony Development Companies 2024
Top 10 Symfony Development Companies 2024Top 10 Symfony Development Companies 2024
Top 10 Symfony Development Companies 2024
 
Custom Approval Process: A New Perspective, Pavel Hrbacek & Anindya Halder
Custom Approval Process: A New Perspective, Pavel Hrbacek & Anindya HalderCustom Approval Process: A New Perspective, Pavel Hrbacek & Anindya Halder
Custom Approval Process: A New Perspective, Pavel Hrbacek & Anindya Halder
 
Agentic RAG What it is its types applications and implementation.pdf
Agentic RAG What it is its types applications and implementation.pdfAgentic RAG What it is its types applications and implementation.pdf
Agentic RAG What it is its types applications and implementation.pdf
 
The Metaverse: Are We There Yet?
The  Metaverse:    Are   We  There  Yet?The  Metaverse:    Are   We  There  Yet?
The Metaverse: Are We There Yet?
 
The Value of Certifying Products for FDO _ Paul at FIDO Alliance.pdf
The Value of Certifying Products for FDO _ Paul at FIDO Alliance.pdfThe Value of Certifying Products for FDO _ Paul at FIDO Alliance.pdf
The Value of Certifying Products for FDO _ Paul at FIDO Alliance.pdf
 
IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptx
IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptxIOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptx
IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptx
 

Trust as a Proxy Measure for the Quality of VGI in the Case of OSM

  • 1. Carsten Keßler a,b and René de Groot a a Institute for Geoinformatics, University of Münster | b soon: Hunter College, CUNY http://carsten.io | @carstenkessler Trust as a Proxy Measure for the Quality of VGI in the Case of OSM
  • 2. The Idea ‣ Develop a measure to assess the degree to which a data consumer can trust the quality of a feature
  • 3. The Idea ‣ Develop a measure to assess the degree to which a data consumer can trust the quality of a feature ‣ Trust measure is based on a feature’s editing history
  • 4. The Idea ‣ Develop a measure to assess the degree to which a data consumer can trust the quality of a feature ‣ Trust measure is based on a feature’s editing history ‣ Benefits ‣ Works at feature level ‣ Filter features by quality ‣ Spot problematic features
  • 5. Does this work? Can we reliably assess the quality of a feature in OpenStreetMap based on its editing history?
  • 6. Does this work? Can we reliably assess the quality of a feature in OpenStreetMap based on its editing history? amenity = university name = Institute for Geoinformatics v1
  • 7. Does this work? Can we reliably assess the quality of a feature in OpenStreetMap based on its editing history? amenity = university name = Institute for Geoinformatics amenity = university building = yes name = Institute for Geoinformatics v1 v2
  • 8. Does this work? Can we reliably assess the quality of a feature in OpenStreetMap based on its editing history? amenity = university name = Institute for Geoinformatics amenity = university building = yes name = Institute for Geoinformatics addr:city = Münster addr:country = DE addr:housenumber = 253 addr:street = Weseler Straße building = yes wheelchair = limited v1 v2 v3 …
  • 9. OSM Heatmap Kudos: Johannes Trame
  • 10. OSM Provenance Ontology http://carsten.io/osm/osm-provenance.rdf prv:Tag includesEdit Changeset prv:CreationGuideline Edit prv:createdBy prv:precededBy prv:usedData NodeState WayState prv:DataCreation User prv:performedBy changesGeometry addsTag removesTag changesValueOfKey rdfs:Literal prv:DataItem prv:HumanActor subClassOfhasTag FeatureState
  • 11. Does this work? ‣ Get a first idea whether this is a viable approach ‣ Compare results of ‣ a simple trust measure and ‣ observed feature quality ‣ Is there a correlation between the two?
  • 14. Feature Selection ‣ Re-mapping the whole district was not feasible
  • 15. Feature Selection ‣ Re-mapping the whole district was not feasible ‣ Up to 100 features were manageable
  • 16. Feature Selection ‣ Re-mapping the whole district was not feasible ‣ Up to 100 features were manageable ‣ Selection based on minimum number of versions
  • 17. Feature Selection ‣ Re-mapping the whole district was not feasible ‣ Up to 100 features were manageable ‣ Selection based on minimum number of versions ‣ 74 features with 6+ versions
  • 20. Trust measure ‣ Positive factors: ‣ Versions ‣ Users ‣ Indirect confirmations = edits in the direct vicinity (50m)
  • 21. Trust measure ‣ Positive factors: ‣ Versions ‣ Users ‣ Indirect confirmations = edits in the direct vicinity (50m) ‣ Negative factors: ‣ Tag corrections ‣ Rollbacks
  • 22. Trust measure (contd.) ‣ Classification for each factor: 5 equal classes ‣ Combined into one classification ‣ Equal weights
  • 24. Field Survey ‣ Thematic accuracy 4 classes: 1. Main tag wrong 2. Other tags wrong 3. Thematic ambiguities 4. Thematically correct
  • 25. Field Survey ‣ Thematic accuracy 4 classes: 1. Main tag wrong 2. Other tags wrong 3. Thematic ambiguities 4. Thematically correct ‣ Results: ‣ 6 features (~8%) ‣ 2 features (~3%) ‣ 9 features (~12%) ‣ 57 features (~77%)
  • 26. Field Survey (contd.) ‣ Topological consistency
  • 27. Field Survey (contd.) ‣ Topological consistency ‣ Is the feature correctly positioned relative to the surrounding features?
  • 28. Field Survey (contd.) ‣ Topological consistency ‣ Is the feature correctly positioned relative to the surrounding features? ‣ Results: ‣ 73 out of 74 features (~99%)
  • 29. Field Survey (contd.) ‣ Topological consistency ‣ Is the feature correctly positioned relative to the surrounding features? ‣ Results: ‣ 73 out of 74 features (~99%) ‣ Information completeness ‣ TF-IDF measure to identify relevant tags per main tag
  • 30. Field Survey (contd.) ‣ Topological consistency ‣ Is the feature correctly positioned relative to the surrounding features? ‣ Results: ‣ 73 out of 74 features (~99%) ‣ Information completeness ‣ TF-IDF measure to identify relevant tags per main tag ‣ ~37% tags missing (avg.)
  • 33.
  • 34. mean quality class: ~4.2 mean trust class: ~2.8
  • 35. Do we get the trend right?
  • 36. Do we get the trend right? ‣ Removed outliers ‣ Kendall’s τ: 0.52 ‣ Moderate, but significant positive correlation
  • 39. Conclusions ‣ Initial study ‣ A feature’s history can determine its trustworthiness
  • 40. Conclusions ‣ Initial study ‣ A feature’s history can determine its trustworthiness ‣ Trust values correlate with observed quality
  • 41. Conclusions ‣ Initial study ‣ A feature’s history can determine its trustworthiness ‣ Trust values correlate with observed quality ‣ Even with a very simple model
  • 42. Conclusions ‣ Initial study ‣ A feature’s history can determine its trustworthiness ‣ Trust values correlate with observed quality ‣ Even with a very simple model ‣ Outliers cannot be explained yet
  • 44. Tons of Future Work ‣ Extend and refine the trust model: Classification, weighting, positive vs negative aspects, …
  • 45. Tons of Future Work ‣ Extend and refine the trust model: Classification, weighting, positive vs negative aspects, … ‣ Social aspects: Who has edited a feature?
  • 46. Tons of Future Work ‣ Extend and refine the trust model: Classification, weighting, positive vs negative aspects, … ‣ Social aspects: Who has edited a feature? ‣ Repeat study without spatial focus
  • 47. Tons of Future Work ‣ Extend and refine the trust model: Classification, weighting, positive vs negative aspects, … ‣ Social aspects: Who has edited a feature? ‣ Repeat study without spatial focus ‣ How to scale the data collection?
  • 48. Tons of Future Work ‣ Extend and refine the trust model: Classification, weighting, positive vs negative aspects, … ‣ Social aspects: Who has edited a feature? ‣ Repeat study without spatial focus ‣ How to scale the data collection? ‣ Learn the trust model from the data
  • 49. Thankyou! All data used in this research © OpenStreetMap contributors. carsten.kessler@uni-muenster.de | http://carsten.io | @carstenkessler Carsten Keßler | René de Groot