Discovering Spatial Co-location
Patterns: A Summary of Results
Shashi Shekhar, Yan Huang
Dept of Computer Sciences, University of Minnesota, USA
AG 2414
Spatial Analysis
Seminar by-
Adrian C Prelipcean
Ipsit Dash
Outline
• Introduction
• Focus of the Research
• Background of the Problem
• Approaches to Modeling the Co-location
Problem
• Co-location Miner Algorithm
• Conclusions
Data Data Data!!!!
• Data are being collected continuously for innumerable
phenomena.
• Business applications
• Scientific applications
• National security purposes
It is impossible to
analyze each strand of
data collected.
Calls for:
• Data mining
• Automation
• Hypothesis generation
• Better linking of phenomena
Data Mining
• Extraction of interesting (non-trivial, implicit, previously unknown and potentially useful) patterns or
knowledge from huge amounts of data, backed by complex computer-based
decision systems (AI, business intelligence, machine learning)
• Spatial data mining differs from classical data mining as used with relational databases (RDBMS).
• Here the attributes of the neighbours of an object of interest may
influence the object and therefore have to be considered as well. The
explicit location and extension of spatial objects define implicit relations of
spatial neighbourhood (such as topological, distance and direction relations), which are used by
spatial data mining algorithms.
• Methods used in general data mining: predictive (classification, regression)
and descriptive (clustering, association)
• Methods used in determining spatial patterns:
– Location prediction models (to identify the habitat of endangered species)
– Spatial clusters (crime hot-spots, cancer clusters)
– Spatial associations: co-locations (predator-prey species, symbiosis, dental health and fluoride)
– Spatial outliers: discontinuities (bad traffic sensors on highways)
Spatial Associations : Co-locations
• Classical association methodology: given a set
of transactions, find rules that predict the occurrence of
an item based on the occurrences of other items in the
transaction. Implication means co-occurrence, not causality!
• In spatial data, however, the transactions are not disjoint,
so the classical approach is not well suited here.
• Association vs Co-location
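To make the classical association setting concrete, here is a minimal sketch of how support and confidence are computed over disjoint transactions. The item names and the helper functions `support` and `confidence` are illustrative assumptions, not taken from the paper.

```python
def support(transactions, itemset):
    """Fraction of transactions that contain every item in itemset."""
    itemset = frozenset(itemset)
    hits = sum(1 for t in transactions if itemset <= t)
    return hits / len(transactions)


def confidence(transactions, antecedent, consequent):
    """Estimated P(consequent | antecedent): co-occurrence, not causality."""
    ante = support(transactions, antecedent)
    both = support(transactions, set(antecedent) | set(consequent))
    return both / ante if ante else 0.0


# Hypothetical market-basket data: each transaction is a disjoint unit.
transactions = [
    frozenset({"bread", "milk"}),
    frozenset({"bread", "butter"}),
    frozenset({"bread", "milk", "butter"}),
    frozenset({"milk"}),
]

rule_support = support(transactions, {"bread", "milk"})
rule_confidence = confidence(transactions, {"bread"}, {"milk"})
```

The sketch relies on a given list of transactions. Spatial data has no such natural partition, which is the gap the co-location models try to fill.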
Focus of the Research
• To extract information from geospatial data and identify frequent co-occurrence
among boolean spatial features such as drought, El Niño and
a substantial drop in vegetation in ecological datasets.
• Approaches to discovering co-location rules fall into two classes:
spatial statistics and association rules.
• Spatial statistics: uses measures of spatial correlation to characterize
the relationships between spatial features (chi-square tests, correlation
coefficients, regression models, etc.)
• Association rules: assume that a finite set of disjoint transactions is given as input to the
algorithm, find the most frequent itemsets in that set and deduce
relationships among those items (the Apriori algorithm, and independent approaches
that rely on a suitable reference spatial feature to mine all association rules over
nearby spatial features).
Background of the Problem
• Given
1. A set T of K boolean spatial feature types, T = {f1, f2, …, fK}
2. A set of N instances P = {p1, …, pN}, where each pi is a vector <instance-id, spatial feature type,
location>
3. A neighbor relation R over locations in S
4. A minimum prevalence threshold and a minimum conditional probability threshold
• Objectives
1. Completeness: the algorithm finds all spatial co-location rules that satisfy the threshold value(s)
2. Correctness: every spatial co-location rule found by the algorithm satisfies the threshold
value(s)
3. The I/O cost and CPU cost of generating the co-location rules should be acceptable
• Find
– Co-location rules with high prevalence and high conditional probability
• Constraints
– R is symmetric and reflexive
– The prevalence measure is monotonic
– Conditional probability measures are specified by the event-centric model
– The data set is sparse: the number of instances of any spatial feature is << cardinality(P)
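The inputs listed above can be represented roughly as follows. This is only a sketch: the `Instance` class, the `make_neighbor_relation` helper and the choice of a Euclidean distance threshold for R are assumptions made for illustration. The problem statement only requires R to be symmetric and reflexive.

```python
from dataclasses import dataclass
from math import hypot


@dataclass(frozen=True)
class Instance:
    """One element of P: <instance-id, spatial feature type, location>."""
    instance_id: int
    feature: str
    x: float
    y: float


def make_neighbor_relation(max_dist):
    """Build R as 'within max_dist' (assumed); symmetric and reflexive by construction."""
    def related(p, q):
        return hypot(p.x - q.x, p.y - q.y) <= max_dist
    return related


# Hypothetical instances of two boolean feature types.
P = [
    Instance(1, "A", 0.0, 0.0),
    Instance(2, "B", 0.0, 1.0),
    Instance(3, "B", 9.0, 9.0),
]
R = make_neighbor_relation(max_dist=1.5)
```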
Approaches to Modeling the
Co-location Rules Problem
• The reference feature centric model
– Is relevant to application domains focusing on a specific boolean
spatial feature
• The window centric model
– Is relevant to applications like mining, surveying and geology,
which focus on land-parcels
– One goal is to predict sets of spatial features likely to be
discovered in a land parcel given that some other features have
been found there
• The event-centric model
– Is relevant to applications that have interest in finding subsets of
spatial features likely to occur in a neighborhood around
instances of given subsets of event types
Reference feature centric model
• Let the reference feature be A
• The set of spatial predicates includes one
predicate, close_to(a,b), which is true if and
only if b is a's neighbour
Reference feature centric model
Association rule example:
is_type(i,A)∧∃ j is_type(j,B)∧close_to(j,i)
→∃ k is_type(k,C)∧close_to(k,i)
with 100% probability
Window centric model
Number of windows: 16
Number of windows containing A: 15
Number of windows containing A and B: 7
Association rule:
an instance of type A in a window → an
instance of type B in a window with
7/15=46.67% probability
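The window-centric calculation can be sketched as below. The slide's figure is not available here, so the window contents are a hypothetical arrangement chosen only to match the counts stated above (16 windows, 15 containing A, 7 containing both A and B). The function name `window_rule_probability` is also an assumption.

```python
def window_rule_probability(windows, antecedent, consequent):
    """Among windows containing the antecedent, the fraction also containing the consequent."""
    with_antecedent = [w for w in windows if antecedent in w]
    if not with_antecedent:
        return 0.0
    with_both = [w for w in with_antecedent if consequent in w]
    return len(with_both) / len(with_antecedent)


# Hypothetical windows: each is the set of feature types observed in it.
windows = [{"A", "B"}] * 7 + [{"A"}] * 8 + [set()]

probability = window_rule_probability(windows, "A", "B")
percentage = round(probability * 100, 2)
```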
Event centric model
Instances of type A: 4
Instances of type A with at least one
instance of type B in their neighborhood: 1
The conditional probability of the co-location
rule
spatial feature A at location l →
spatial feature type B in the 9-neighbor
neighborhood of l is 1/4 = 25%
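A small sketch of the event-centric calculation follows. It assumes instances sit on an integer grid and that the 9-neighbor neighborhood of a cell is the cell itself plus its 8 surrounding cells. The coordinates are hypothetical and chosen only to reproduce the counts above (4 instances of A, 1 of them with a B nearby).

```python
def in_9_neighborhood(p, q):
    """True if grid cell q is p itself or one of its 8 surrounding cells."""
    return abs(p[0] - q[0]) <= 1 and abs(p[1] - q[1]) <= 1


def event_centric_probability(a_cells, b_cells):
    """Fraction of A instances with at least one B instance in their neighborhood."""
    if not a_cells:
        return 0.0
    hits = sum(
        1 for a in a_cells
        if any(in_9_neighborhood(a, b) for b in b_cells)
    )
    return hits / len(a_cells)


# Hypothetical grid positions (row, column).
a_cells = [(0, 0), (0, 5), (5, 0), (5, 5)]
b_cells = [(1, 1), (3, 3)]

probability = event_centric_probability(a_cells, b_cells)
```

Unlike the window-centric model, the neighborhoods here are anchored on the instances themselves, so no fixed partition of space is needed.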
Concepts
Co-location Miner Algorithm
• Input
1. K boolean spatial feature types and their instances
2. A symmetric and reflexive neighbor relation R
3. A user specified minimum threshold prevalence measure (min_prevalence)
4. A user specified minimum conditional probability (min_cond_prob)
• Output
– Co-location rule sets with participation index > min_prevalence and conditional
probability > min_cond_prob
• Method
1. Initialize the prevalent size-1 co-location set, whose table instances are P
2. Generate size-2 co-location rules
3. For size of co-locations in (2, 3, …, K-1) do
4. Generate candidate prevalent co-locations using the generalized apriori_gen
algorithm
5. Generate table instances and prune based on neighborhood
6. Prune based on the prevalence of co-locations
7. Generate co-location rules
8. end;
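As a rough illustration of the prevalence-based pruning step, the sketch below computes the participation index for size-2 co-locations only. It is not the full Co-location Miner. It assumes a Euclidean distance threshold for R and uses a brute-force pairwise check instead of the apriori_gen-style candidate generation. The participation index is taken as the minimum, over the features in the co-location, of the fraction of that feature's instances that appear in the table instance. The function name and the data are hypothetical.

```python
from itertools import combinations
from math import hypot


def size2_colocations(instances, max_dist, min_prevalence):
    """instances: list of (instance_id, feature, x, y) tuples.

    Returns {(f, g): participation_index} for prevalent feature pairs.
    """
    by_feature = {}
    for inst in instances:
        by_feature.setdefault(inst[1], []).append(inst)

    prevalent = {}
    for f, g in combinations(sorted(by_feature), 2):
        # Table instance: every neighboring (f-instance, g-instance) pair.
        rows = [
            (p[0], q[0])
            for p in by_feature[f]
            for q in by_feature[g]
            if hypot(p[2] - q[2], p[3] - q[3]) <= max_dist
        ]
        # Participation ratio: share of a feature's instances found in some row.
        ratio_f = len({r[0] for r in rows}) / len(by_feature[f])
        ratio_g = len({r[1] for r in rows}) / len(by_feature[g])
        index = min(ratio_f, ratio_g)
        if index > min_prevalence:  # prune non-prevalent pairs
            prevalent[(f, g)] = index
    return prevalent


# Hypothetical data: A and B tend to occur together, C is isolated.
instances = [
    (1, "A", 0, 0), (2, "A", 10, 0), (3, "A", 20, 0),
    (4, "B", 0, 1), (5, "B", 10, 1),
    (6, "C", 50, 50),
]
result = size2_colocations(instances, max_dist=1.5, min_prevalence=0.5)
```

Here two of the three A instances and both B instances take part in a neighboring pair, so the pair (A, B) has participation index min(2/3, 1) = 2/3 and survives the threshold, while pairs involving C are pruned.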
Co-location Miner Algorithm
Conclusions
• The paper gives a clear account of the co-location
problem and how it differs from the
classical association problem.
• The Co-location Miner algorithm is presented
with a comprehensive explanation
and detailed analysis.
• The future work outlined by the authors
promises developments in the field of plane
sweeping algorithms.
Spatial Co-location Patterns: articles
• S. Shekhar and Y. Huang, Discovering Spatial Co-location Patterns: A
Summary of Results, in Proc. of 7th Intl Symposium on Spatial and
Temporal Databases (SSTD), Springer-Verlag, Lecture Notes in Computer
Science, LNCS 2121, p. 236 ff, July 2001
• S. Shekhar and Y. Huang, Multi-resolution Co-location Miner: a New
Algorithm to Find Co-location Patterns from Spatial Datasets, SIAM
SDM02 Workshop on Mining Scientific Datasets, April 2002
• Y. Huang, H. Xiong, S. Shekhar, and J. Pei, Mining Confident Co-location
Rules without A Support Threshold, in Proc. of 18th ACM Symposium on
Applied Computing (ACM SAC), March 2003
• Y. Huang, S. Shekhar, and H. Xiong, Discovering Colocation Patterns from
Spatial Datasets: A General Approach, submitted to IEEE Transactions on
Knowledge and Data Engineering (TKDE), 2004

Spatial Data Mining : Seminar

  • 1. Discovering Spatial Co-location Patterns : A Summary of Results Shashi Shekhar, Yan Huang Dept of Computer Sciences, University of Minnesota, USA AG 2414 Spatial Analysis Seminar by Adrian C Prelipcean and Ipsit Dash
  • 2. Outline • Introduction • Focus of the Research • Background of the Problem • Approaches of Modeling the Co-location problem • Co-location Miner Algorithm • Conclusions
  • 3. Data Data Data!!!! • Data are being collected continuously for innumerable phenomena. • Business applications • Scientific applications • National security purposes • It is impossible to analyze each strand of data collected, which calls for data mining: automation, hypothesis generation, and better linking of phenomena.
  • 4. Data Mining • Extraction of interesting (non-trivial, implicit, previously unknown and potentially useful) patterns or knowledge from huge amounts of data, backed by complex computer-based decision systems (A.I., business intelligence, machine learning) • Spatial data mining differs from classical data mining as used with RDBMS. • Here, attributes of the neighbours of an object of interest may influence the object and therefore have to be considered as well. The explicit location and extension of spatial objects define implicit relations of spatial neighbourhood (such as topological, distance and direction relations), which are used by spatial data mining algorithms. • Methods used in general data mining: predictive (classification, regression) and descriptive (clustering, association) • Methods used in determining spatial patterns: – Location prediction models (to identify the habitat of endangered species) – Spatial clusters (crime hot-spots, cancer clusters) – Spatial associations: co-locations (predator-prey species, symbiosis, dental health and fluoride) – Spatial outliers: discontinuities (bad traffic sensors on highways)
  • 5. Spatial Associations : Co-locations • Classical association methodology: given a set of transactions, find rules that predict the occurrence of an item based on the occurrences of other items in the transaction. Implication means co-occurrence, not causality! • In spatial data, the transactions are not disjoint, so the classical approach is not well suited here. • Association vs Co-location
  • 6. Focus of the Research • To extract information from geospatial data and identify frequent co-occurrence among boolean spatial features such as drought, El Niño, and substantial drops in vegetation in ecological datasets. • Approaches to discovering co-location rules fall into 2 classes: spatial statistics and association rules. • Spatial statistics: uses spatial correlation measures to characterize relations between spatial features (chi-square tests, correlation coefficients, regression models, etc.) • Association rules: assumes that a finite set of disjoint sets is given as input to the algorithm, which finds the most frequent items in the set and deduces relationships among those items (e.g. the Apriori algorithm, and independent approaches that rely on a suitable reference spatial feature to mine all association rules over nearby spatial features)
  • 7. Background of the Problem • Given 1. A set T of K boolean spatial feature types, T = {f1, f2, …, fK} 2. A set of N instances P = {p1, …, pN}, where each pi is a vector <instance-id, spatial feature type, location> with its location in a spatial framework S 3. A neighbor relation R over locations in S 4. A minimum prevalence threshold and a minimum conditional probability threshold • Objectives 1. Completeness: the algorithm finds all spatial co-location rules that satisfy the threshold value(s) 2. Correctness: any spatial co-location rule found by the algorithm respects the threshold value(s) 3. The I/O and CPU cost of generating the co-location rules should be acceptable • Find – Co-location rules with high prevalence and high conditional probability • Constraints – R is symmetric and reflexive – The prevalence measure is monotonic – Conditional probability measures are specified by the event centric model – Sparse data set: the number of instances of any spatial feature is << the cardinality of P
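To make the inputs on this slide concrete, here is a minimal Python sketch of an instance vector and a neighbor relation R. The `Instance` class, the `is_neighbor` function, the Euclidean-distance definition of R and the `max_dist` value are all illustrative assumptions, not something the slides or the paper prescribe; any symmetric and reflexive relation would do.

```python
from dataclasses import dataclass
from math import hypot
from typing import Tuple


@dataclass(frozen=True)
class Instance:
    """One element p_i of P: <instance-id, spatial feature type, location>."""
    instance_id: int
    feature_type: str
    location: Tuple[float, float]


def is_neighbor(p: Instance, q: Instance, max_dist: float = 1.5) -> bool:
    """Assumed neighbor relation R: Euclidean distance within max_dist.

    A distance threshold is symmetric (d(p, q) == d(q, p)) and
    reflexive (d(p, p) == 0), as the constraints require.
    """
    (x1, y1), (x2, y2) = p.location, q.location
    return hypot(x1 - x2, y1 - y2) <= max_dist


# Hypothetical data set P with K = 3 feature types.
P = [
    Instance(1, "A", (0.0, 0.0)),
    Instance(2, "B", (1.0, 0.0)),
    Instance(3, "C", (5.0, 5.0)),
]
```

With this data, instances 1 and 2 are neighbors of each other, while instance 3 is a neighbor only of itself.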
  • 8. Approaches of Modeling the Co-location Rules Problem • The reference feature centric model – Is relevant to application domains focusing on a specific boolean spatial feature • The window centric model – Is relevant to applications like mining, surveying and geology, which focus on land-parcels – One goal is to predict sets of spatial features likely to be discovered in a land parcel given that some other features have been found there • The event-centric model – Is relevant to applications that have interest in finding subsets of spatial features likely to occur in a neighborhood around instances of given subsets of event types
  • 9. Reference feature centric model • Let the reference feature be A • The set of spatial predicates include one predicate: close_to(a,b) which is true if and only if b is a’s neighbour
  • 10. Reference feature centric model Association rule example: is_type(i,A)∧∃ j is_type(j,B)∧close_to(j,i) →∃ k is_type(k,C)∧close_to(k,i) with 100% probability
  • 11. Window centric model Number of windows: 16 Number of windows containing A: 15 Number of windows containing A and B: 7 Association rule: an instance of type A in a window → an instance of type B in a window with 7/15=46.67% probability
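The window centric probability above is a plain ratio of window counts, which can be sketched in a few lines of Python. The function name `window_rule_probability` and the representation of each window as a set of feature types are assumptions made for illustration; the window contents are invented so that the counts match the slide (16 windows, 15 containing A, 7 containing both A and B).

```python
def window_rule_probability(windows, antecedent, consequent):
    """Fraction of windows containing `antecedent` that also contain `consequent`."""
    with_antecedent = [w for w in windows if antecedent in w]
    if not with_antecedent:
        return 0.0
    with_both = [w for w in with_antecedent if consequent in w]
    return len(with_both) / len(with_antecedent)


# Hypothetical windows: 7 with A and B, 8 with A only, 1 empty (16 in total).
windows = [{"A", "B"}] * 7 + [{"A"}] * 8 + [set()]
prob = window_rule_probability(windows, "A", "B")
print(f"{prob:.2%}")  # 46.67%
```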
  • 12. Event centric model Instances of type A: 4 Instances of type A that have some instance of type B in their neighborhood: 1 Conditional probability of the co-location rule "spatial feature A at location l → spatial feature type B in the 9-neighbor neighborhood": 1/4 = 25%
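The event centric calculation can be sketched the same way: count the instances of A that have at least one instance of B in their neighborhood and divide by the number of instances of A. In the sketch below, the grid coordinates, the function names and the reading of "9-neighbor neighborhood" as a cell plus its 8 surrounding cells are assumptions; the layout is invented so that it reproduces the 1-in-4 result from the slide.

```python
def in_9_neighborhood(cell, other):
    """True if `other` is the cell itself or one of its 8 surrounding grid cells."""
    return abs(cell[0] - other[0]) <= 1 and abs(cell[1] - other[1]) <= 1


def event_centric_probability(instances, antecedent, consequent):
    """Fraction of `antecedent` instances with a `consequent` instance nearby."""
    a_cells = [cell for ftype, cell in instances if ftype == antecedent]
    b_cells = [cell for ftype, cell in instances if ftype == consequent]
    if not a_cells:
        return 0.0
    hits = sum(
        1 for a in a_cells if any(in_9_neighborhood(a, b) for b in b_cells)
    )
    return hits / len(a_cells)


# Hypothetical layout: four A instances, one B next to the A at (0, 0).
instances = [
    ("A", (0, 0)),
    ("A", (0, 3)),
    ("A", (3, 0)),
    ("A", (3, 3)),
    ("B", (1, 1)),
]
prob = event_centric_probability(instances, "A", "B")
print(prob)  # 0.25
```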
  • 14. Co-location Miner Algorithm • Input 1. K boolean spatial feature types and their instances 2. A symmetric and reflexive neighbor relation R 3. A user-specified minimum prevalence threshold (min_prevalence) 4. A user-specified minimum conditional probability (min_cond_prob) • Output – Co-location rule sets with participation index > min_prevalence and conditional probability > min_cond_prob • Method 1. Initialize the prevalent size-1 co-location set along with its table instances (= P) 2. Generate size-2 co-location rules 3. For size of co-locations in (2, 3, …, K-1) do 4. Generate candidate prevalent co-locations using the generalized apriori_gen algorithm 5. Generate table instances and prune based on neighborhood 6. Prune based on prevalence of co-locations 7. Generate co-location rules 8. end;
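As a rough illustration of the prevalence pruning step, the Python sketch below handles only size-2 co-locations: it builds their table instances from neighboring pairs and keeps those whose participation index exceeds min_prevalence. It is not the authors' implementation. It assumes the usual definition of the participation index (the minimum, over the features in the co-location, of the fraction of that feature's instances appearing in some table instance), a Euclidean distance threshold as the neighbor relation, and hypothetical names and data throughout. Candidate generation for larger sizes and rule generation are omitted.

```python
from itertools import combinations
from math import hypot


def neighbors(p, q, max_dist):
    """Assumed neighbor relation R: Euclidean distance within max_dist."""
    return hypot(p[2] - q[2], p[3] - q[3]) <= max_dist


def size2_colocations(instances, max_dist, min_prevalence):
    """Return {(f1, f2): participation index} for prevalent size-2 co-locations.

    `instances` is a list of (instance_id, feature_type, x, y) tuples.
    """
    # Number of instances per feature type (size-1 table instances).
    counts = {}
    for _, ftype, _, _ in instances:
        counts[ftype] = counts.get(ftype, 0) + 1

    # For each feature pair, record which instance ids take part in a
    # neighboring pair of different types (the size-2 table instances).
    participants = {}
    for p, q in combinations(instances, 2):
        if p[1] == q[1] or not neighbors(p, q, max_dist):
            continue
        if p[1] > q[1]:
            p, q = q, p  # canonical order so (A, B) and (B, A) share a key
        table = participants.setdefault((p[1], q[1]), {p[1]: set(), q[1]: set()})
        table[p[1]].add(p[0])
        table[q[1]].add(q[0])

    # Participation index = minimum participation ratio over the features.
    prevalent = {}
    for pair, table in participants.items():
        index = min(len(table[f]) / counts[f] for f in pair)
        if index > min_prevalence:
            prevalent[pair] = index
    return prevalent


# Hypothetical data: three A, two B, one C.
data = [
    (1, "A", 0.0, 0.0),
    (2, "A", 10.0, 10.0),
    (3, "A", 0.5, 0.5),
    (4, "B", 1.0, 0.0),
    (5, "B", 0.0, 1.0),
    (6, "C", 10.0, 11.0),
]
result = size2_colocations(data, max_dist=1.5, min_prevalence=0.5)
```

Here {A, B} has participation ratios 2/3 for A and 2/2 for B, so its participation index is 2/3 and it is kept; {A, C} has ratios 1/3 and 1/1, so its index of 1/3 falls below the threshold and it is pruned.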
  • 17. Conclusions • The paper gives a clear picture of the co-location problem and how it differs from the classical association problem. • The Co-location Miner algorithm is presented accessibly, with a comprehensive explanation and detailed analysis. • The authors' forward-looking approach promises further development in the field of plane-sweep algorithms.
  • 18. Spatial Co-location Patterns: articles • S. Shekhar and Y. Huang, Discovering Spatial Co-location Patterns: A Summary of Results, in Proc. of 7th Intl Symposium on Spatial and Temporal Databases (SSTD), Springer-Verlag, Lecture Notes in Computer Science, LNCS 2121, p. 236 ff, July 2001 • S. Shekhar and Y. Huang, Multi-resolution Co-location Miner: a New Algorithm to Find Co-location Patterns from Spatial Datasets, SIAM SDM02 Workshop on Mining Scientific Datasets, April 2002 • Y. Huang, H. Xiong, S. Shekhar, and J. Pei, Mining Confident Co-location Rules without A Support Threshold, in Proc. of 18th ACM Symposium on Applied Computing (ACM SAC), March 2003 • Y. Huang, S. Shekhar, and H. Xiong, Discovering Colocation Patterns from Spatial Datasets: A General Approach, submitted to IEEE Transactions on Knowledge and Data Engineering (TKDE), 2004