SlideShare une entreprise Scribd logo
1  sur  18
Discovering Spatial Co-location
Patterns : A summary of Results
Shashi Sekhar, Yan Huang
Dept of Computer Sciences, University of Minnesota, USA
AG 2414
Spatial Analysis
Seminar by-
Adrian C Prelipcean
Ipsit Dash
Outline
• Introduction
• Focus of the Research
• Background of the Problem
• Approaches of Modeling the Co-location
problem
• Co-location Miner Algorithm
• Conclusions
Data Data Data!!!!
• Data are being collected continuously for innumerable
phenomenon.
• Business applications
• Scientific Applications
• National Security Purposes
It is impossible to
analyze each strand of
data collected.
Calls for
Data
Mining
Automation
Hypothesis
Generation
Better Linking of
phenomenon
Data Mining
• Extraction of interesting (non-trivial, implicit, previously unknown and potentially useful) patterns or
knowledge from huge amount of data backed by complex computer based
decision systems ( A.I.,Business Intelligence, Machine learning)
• Spatial Data Mining is different from Classical Data Mining used with reference to RDBMS.
• Here attributes of the neighbours of some object of interest may have an
influence on the object and therefore have to be considered as well. The
explicit location and extension of spatial objects define implicit relations of
spatial neighbourhood (such as topological, distance and direction relations) which are used by
spatial data mining algorithms.
• Methods used in General Data Mining- Predictive (Classification, Regression)
and Descriptive (Clustering, Association)
• Methods used in determining Spatial Patterns-
 Location prediction model (to identify habitat of endangered species)
 Spatial clusters (crime hot-spots, cancer clusters)
 Spatial associations: co-locations (predator-prey species, symbiosis dental health and fluoride)
 Spatial outlier : discontinuities (bad traffic sensors on highways)
Spatial Associations : Co-locations
• Classical Association methodology : Given a set
of transactions, find rules that will predict the occurrence of
an item based on the occurrences of other items in the
transaction. Implication means co-occurrence, not causality!
• Now in case of Spatial Data, the transactions are not disjoint.
So classical approach is not ideal to be used here.
• Association vs Co-location
Focus of the Research
• To extract information from geospatial data and identify frequent co-
occurrence among Boolean spatial features like Draught, El Nino,
Substantial drop in vegetation etc. on Ecological Datasets.
• Approaches to discover co-location rules can be classified into 2 classes-
Spatial Statistics, Association Rules
• Spatial Statistics- Uses spatial correlation measures to characterize
different relations between spatial features ( chi-sq tests, Correlation
coefficients, regression models etc)
• Association Rules- Assumes that finite set of disjoint sets are given as input to the
Algorithm and they can find the most frequent items from the set and deduce
relationships on those items ( apriori algorithm and Independent Approaches-
Based on suitability of Reference Spatial Feature to mine all association rules of
nearby spatial features. )
Background of the Problem
• Given
1. A set of items T of K boolean spatial feature types T={f1,f2,…,fK)
2. A set of N instances P={p1…pN} each p, is a vector <instance-id, spatial feature type,
location>
3. A neighbor relation R over locations in S
4. Min prevalence threshold value, min conditional probability threshold
• Objectives
1. Completeness: it finds all spatial colocation rules that satisfy the threshold value(s)
2. Correctness: any spatial co-location rule found by the algorithm respects the threshold
value(s)
3. IO cost and CPU cost to generate the colocation rules should be acceptable
• Find
– Co-location rules with high prevalence and high conditional probability
• Constraints
– R is symmetric and reflexive
– Monotonic prevalence measure
– Conditional probability measures are specified by the event centric model
– Sparse data set, the number of instance of any spatial features is << cardinality (P)
Approaches of Modeling the
Co-location Rules Problem
• The reference feature centric model
– Is relevant to application domains focusing on a specific boolean
spatial feature
• The window centric model
– Is relevant to applications like mining, surveying and geology,
which focus on land-parcels
– One goal is to predict sets of spatial features likely to be
discovered in a land parcel given that some other features have
been found there
• The event-centric model
– Is relevant to applications that have interest in finding subsets of
spatial features likely to occur in a neighborhood around
instances of given subsets of event types
Reference feature centric model
• Let the reference feature be A
• The set of spatial predicates include one
predicate: close_to(a,b) which is true if and
only if b is a’s neighbour
Reference feature centric model
Association rule example:
is_type(i,A)∧∃ j is_type(j,B)∧close_to(j,i)
→∃ k is_type(k,C)∧close_to(k,i)
with 100% probability
Window centric model
Number of windows: 16
Number of windows containing A: 15
Number of windows containing A and B: 7
Association rule:
an instance of type A in a window → an
instance of type B in a window with
7/15=46.67% probability
Event centric model
Instances of type A: 4
Instances of type A that have some
instances of type B: 1
Conditional probability for the co-
location rule is:
spatial feature A at location l →
spatial feature type B in 9-neighbor
neighborhood is 25%
Concepts
Co-location Miner Algorithm
• Input
1. K boolean spatial instance and their instances
2. A symmetric and reflexive neighbor relation R
3. A user specified minimum threshold prevalence measure (min_prevalence)
4. A user specified minimum conditional probability (min_cond_prob)
• Output
– Co-location rule sets with participation index> min_prevalence and conditional
probability>min_cond_prob
• Method
1. Prevalent size 1 co-location set along with their table instances=P
2. Generate size 2 co-location rules
3. For size of co-locations in (2,3,…,K-1) do
4. Generate candidate prevalent co-locations using the generalized apriori_gen
algorithm
5. Generate table instances and prune based on neighborhood
6. Prune based on prevalence of co-locations
7. Generate co-location rules
8. end;
Co-location Miner Algorithm
Conclusions
• This paper gives a clear idea about co-
localization problem and its difference from
classical association problem.
• Co-location Miner algorithm was dealt with
easily with a comprehensive understanding
and detailed analysis.
• Future centric approach of the authors
promises development in field of Plane
Sweeping Algorithms.
Spatial Co-location Patterns: articles
• – S. Shekhar and Y. Huang, Discovering Spatial Co-location Patterns: A
Summary of Results, In Proc. of 7th Intl Symposium on Spatial and
Temporal Databases (SSTD), Springer-Verlag, Lecture Notes in Computer
Science, LNCS 2121, p.236 ff, July 2001
• – S. Shekhar and Y. Huang, Multi-resolution Co-location Miner: a New
Algorithm to Find Co-location Patterns from Spatial Datasets, SIAM
SDM02 Workshop on Mining Scientific Datasets, April 2002
• – Y. Huang, H. Xiong, S. Shekhar, and J. Pei, Mining Confident Co-location
Rules without A Support Threshold, in Proc. of 18th ACM Symposium on
Applied Computing (ACM SAC), March 2003
• – Y. Huang, S. Shekhar, and H. Xiong, Discovering Colocation Patterns from
Spatial Datasets: A General Approach, submitted to IEEE Transactions on
Knowledge and Data Engineering (TKDE), 2004

Contenu connexe

Tendances

TYBSC IT PGIS Unit IV Spacial Data Analysis
TYBSC IT PGIS Unit IV  Spacial Data AnalysisTYBSC IT PGIS Unit IV  Spacial Data Analysis
TYBSC IT PGIS Unit IV Spacial Data AnalysisArti Parab Academics
 
Data mining
Data mining Data mining
Data mining AthiraR23
 
Python in geospatial analysis
Python in geospatial analysisPython in geospatial analysis
Python in geospatial analysisSakthivel R
 
Image classification
Image classificationImage classification
Image classificationAli A Jalil
 
Machine Learning in Big Data
Machine Learning in Big DataMachine Learning in Big Data
Machine Learning in Big DataDataWorks Summit
 
Introduction To Data Mining
Introduction To Data Mining   Introduction To Data Mining
Introduction To Data Mining Phi Jack
 
Feature Engineering in Machine Learning
Feature Engineering in Machine LearningFeature Engineering in Machine Learning
Feature Engineering in Machine LearningKnoldus Inc.
 
Remote Sensing: Georeferencing
Remote Sensing: GeoreferencingRemote Sensing: Georeferencing
Remote Sensing: GeoreferencingKamlesh Kumar
 
Data Mining: Concepts and Techniques (3rd ed.) — Chapter 5
Data Mining:  Concepts and Techniques (3rd ed.)— Chapter 5 Data Mining:  Concepts and Techniques (3rd ed.)— Chapter 5
Data Mining: Concepts and Techniques (3rd ed.) — Chapter 5 Salah Amean
 
08. Mining Type Of Complex Data
08. Mining Type Of Complex Data08. Mining Type Of Complex Data
08. Mining Type Of Complex DataAchmad Solichin
 
Introduction to Data mining
Introduction to Data miningIntroduction to Data mining
Introduction to Data miningHadi Fadlallah
 
Presentation - Msc Thesis - Machine Learning Techniques for Short-Term Electr...
Presentation - Msc Thesis - Machine Learning Techniques for Short-Term Electr...Presentation - Msc Thesis - Machine Learning Techniques for Short-Term Electr...
Presentation - Msc Thesis - Machine Learning Techniques for Short-Term Electr...Praxitelis Nikolaos Kouroupetroglou
 
Geographic Information Retrieval Systems.
Geographic Information Retrieval Systems.Geographic Information Retrieval Systems.
Geographic Information Retrieval Systems.Dinesh Manajipet
 
Data Mining: Data cube computation and data generalization
Data Mining: Data cube computation and data generalizationData Mining: Data cube computation and data generalization
Data Mining: Data cube computation and data generalizationDataminingTools Inc
 
Remote sensing for change detection (presentation) - Prepared by A F M Fakhru...
Remote sensing for change detection (presentation) - Prepared by A F M Fakhru...Remote sensing for change detection (presentation) - Prepared by A F M Fakhru...
Remote sensing for change detection (presentation) - Prepared by A F M Fakhru...A F M Fakhrul Azam Shaikat
 
Data Mining: Graph mining and social network analysis
Data Mining: Graph mining and social network analysisData Mining: Graph mining and social network analysis
Data Mining: Graph mining and social network analysisDataminingTools Inc
 
5.1 mining data streams
5.1 mining data streams5.1 mining data streams
5.1 mining data streamsKrish_ver2
 

Tendances (20)

TYBSC IT PGIS Unit IV Spacial Data Analysis
TYBSC IT PGIS Unit IV  Spacial Data AnalysisTYBSC IT PGIS Unit IV  Spacial Data Analysis
TYBSC IT PGIS Unit IV Spacial Data Analysis
 
Data mining
Data mining Data mining
Data mining
 
Python in geospatial analysis
Python in geospatial analysisPython in geospatial analysis
Python in geospatial analysis
 
Image classification
Image classificationImage classification
Image classification
 
Machine Learning in Big Data
Machine Learning in Big DataMachine Learning in Big Data
Machine Learning in Big Data
 
Web mapping
Web mappingWeb mapping
Web mapping
 
Introduction To Data Mining
Introduction To Data Mining   Introduction To Data Mining
Introduction To Data Mining
 
Feature Engineering in Machine Learning
Feature Engineering in Machine LearningFeature Engineering in Machine Learning
Feature Engineering in Machine Learning
 
Remote Sensing: Georeferencing
Remote Sensing: GeoreferencingRemote Sensing: Georeferencing
Remote Sensing: Georeferencing
 
Data Mining: Concepts and Techniques (3rd ed.) — Chapter 5
Data Mining:  Concepts and Techniques (3rd ed.)— Chapter 5 Data Mining:  Concepts and Techniques (3rd ed.)— Chapter 5
Data Mining: Concepts and Techniques (3rd ed.) — Chapter 5
 
GIS file types
GIS file typesGIS file types
GIS file types
 
08. Mining Type Of Complex Data
08. Mining Type Of Complex Data08. Mining Type Of Complex Data
08. Mining Type Of Complex Data
 
Deep Q-Learning
Deep Q-LearningDeep Q-Learning
Deep Q-Learning
 
Introduction to Data mining
Introduction to Data miningIntroduction to Data mining
Introduction to Data mining
 
Presentation - Msc Thesis - Machine Learning Techniques for Short-Term Electr...
Presentation - Msc Thesis - Machine Learning Techniques for Short-Term Electr...Presentation - Msc Thesis - Machine Learning Techniques for Short-Term Electr...
Presentation - Msc Thesis - Machine Learning Techniques for Short-Term Electr...
 
Geographic Information Retrieval Systems.
Geographic Information Retrieval Systems.Geographic Information Retrieval Systems.
Geographic Information Retrieval Systems.
 
Data Mining: Data cube computation and data generalization
Data Mining: Data cube computation and data generalizationData Mining: Data cube computation and data generalization
Data Mining: Data cube computation and data generalization
 
Remote sensing for change detection (presentation) - Prepared by A F M Fakhru...
Remote sensing for change detection (presentation) - Prepared by A F M Fakhru...Remote sensing for change detection (presentation) - Prepared by A F M Fakhru...
Remote sensing for change detection (presentation) - Prepared by A F M Fakhru...
 
Data Mining: Graph mining and social network analysis
Data Mining: Graph mining and social network analysisData Mining: Graph mining and social network analysis
Data Mining: Graph mining and social network analysis
 
5.1 mining data streams
5.1 mining data streams5.1 mining data streams
5.1 mining data streams
 

En vedette

Spatial data mining
Spatial data miningSpatial data mining
Spatial data miningMITS Gwalior
 
ppt spatial data
ppt spatial datappt spatial data
ppt spatial dataRahul Kumar
 
Spatial vs non spatial
Spatial vs non spatialSpatial vs non spatial
Spatial vs non spatialSumant Diwakar
 
The Do's and Don'ts of Data Mining
The Do's and Don'ts of Data MiningThe Do's and Don'ts of Data Mining
The Do's and Don'ts of Data MiningSalford Systems
 
Web mining slides
Web mining slidesWeb mining slides
Web mining slidesmahavir_a
 
Classification with Naive Bayes
Classification with Naive BayesClassification with Naive Bayes
Classification with Naive BayesJosh Patterson
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine LearningRahul Jain
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine LearningLior Rokach
 
Crop production ppt
Crop production pptCrop production ppt
Crop production pptvaggyaggy
 
Introduction to Big Data/Machine Learning
Introduction to Big Data/Machine LearningIntroduction to Big Data/Machine Learning
Introduction to Big Data/Machine LearningLars Marius Garshol
 

En vedette (18)

Spatial data mining
Spatial data miningSpatial data mining
Spatial data mining
 
ppt spatial data
ppt spatial datappt spatial data
ppt spatial data
 
Temporal data mining
Temporal data miningTemporal data mining
Temporal data mining
 
Spatial databases
Spatial databasesSpatial databases
Spatial databases
 
Spatial vs non spatial
Spatial vs non spatialSpatial vs non spatial
Spatial vs non spatial
 
The Do's and Don'ts of Data Mining
The Do's and Don'ts of Data MiningThe Do's and Don'ts of Data Mining
The Do's and Don'ts of Data Mining
 
WEB MINING.
WEB MINING.WEB MINING.
WEB MINING.
 
Textmining Introduction
Textmining IntroductionTextmining Introduction
Textmining Introduction
 
Web mining slides
Web mining slidesWeb mining slides
Web mining slides
 
Classification with Naive Bayes
Classification with Naive BayesClassification with Naive Bayes
Classification with Naive Bayes
 
GIS Data Types
GIS Data TypesGIS Data Types
GIS Data Types
 
Machine Learning for Dummies
Machine Learning for DummiesMachine Learning for Dummies
Machine Learning for Dummies
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine Learning
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine Learning
 
Crop production ppt
Crop production pptCrop production ppt
Crop production ppt
 
Data mining
Data miningData mining
Data mining
 
Introduction to Big Data/Machine Learning
Introduction to Big Data/Machine LearningIntroduction to Big Data/Machine Learning
Introduction to Big Data/Machine Learning
 
SlideShare 101
SlideShare 101SlideShare 101
SlideShare 101
 

Similaire à Spatial Data Mining : Seminar

Spatial co location pattern mining
Spatial co location pattern miningSpatial co location pattern mining
Spatial co location pattern miningSeung Kwan Kim
 
Updating Ecological Niche Modeling Methodologies
Updating Ecological Niche Modeling MethodologiesUpdating Ecological Niche Modeling Methodologies
Updating Ecological Niche Modeling MethodologiesTown Peterson
 
D1T3 enm workflows updated
D1T3 enm workflows updatedD1T3 enm workflows updated
D1T3 enm workflows updatedTown Peterson
 
Moving object detection in complex scene
Moving object detection in complex sceneMoving object detection in complex scene
Moving object detection in complex sceneKumar Mayank
 
Materials Informatics Overview
Materials Informatics OverviewMaterials Informatics Overview
Materials Informatics OverviewTony Fast
 
Rank based similarity search reducing the dimensional dependence
Rank based similarity search reducing the dimensional dependenceRank based similarity search reducing the dimensional dependence
Rank based similarity search reducing the dimensional dependenceredpel dot com
 
UHDMML.pps
UHDMML.ppsUHDMML.pps
UHDMML.ppsbutest
 
report2.doc
report2.docreport2.doc
report2.docbutest
 
Graph Models for Deep Learning
Graph Models for Deep LearningGraph Models for Deep Learning
Graph Models for Deep LearningExperfy
 
Spatial data analysis 1
Spatial data analysis 1Spatial data analysis 1
Spatial data analysis 1Johan Blomme
 
Seminar on gis analysis functions
Seminar on gis analysis functionsSeminar on gis analysis functions
Seminar on gis analysis functionsPramoda Raj
 
COLOCATION MINING IN UNCERTAIN DATA SETS: A PROBABILISTIC APPROACH
COLOCATION MINING IN UNCERTAIN DATA SETS: A PROBABILISTIC APPROACHCOLOCATION MINING IN UNCERTAIN DATA SETS: A PROBABILISTIC APPROACH
COLOCATION MINING IN UNCERTAIN DATA SETS: A PROBABILISTIC APPROACHIJCI JOURNAL
 
Feature selection and microarray data
Feature selection and microarray dataFeature selection and microarray data
Feature selection and microarray dataGianluca Bontempi
 
Handling Missing Attributes using Matrix Factorization 
Handling Missing Attributes using Matrix Factorization Handling Missing Attributes using Matrix Factorization 
Handling Missing Attributes using Matrix Factorization CS, NcState
 
Aitf 2014 pem_introduction_presentation_feb28_ram_version2
Aitf 2014 pem_introduction_presentation_feb28_ram_version2Aitf 2014 pem_introduction_presentation_feb28_ram_version2
Aitf 2014 pem_introduction_presentation_feb28_ram_version2Bob MacMillan
 
Machine Learning Techniques for the Smart Grid – Modeling of Solar Energy usi...
Machine Learning Techniques for the Smart Grid – Modeling of Solar Energy usi...Machine Learning Techniques for the Smart Grid – Modeling of Solar Energy usi...
Machine Learning Techniques for the Smart Grid – Modeling of Solar Energy usi...Wilfried Elmenreich
 

Similaire à Spatial Data Mining : Seminar (20)

Spatial co location pattern mining
Spatial co location pattern miningSpatial co location pattern mining
Spatial co location pattern mining
 
Updating Ecological Niche Modeling Methodologies
Updating Ecological Niche Modeling MethodologiesUpdating Ecological Niche Modeling Methodologies
Updating Ecological Niche Modeling Methodologies
 
D1T3 enm workflows updated
D1T3 enm workflows updatedD1T3 enm workflows updated
D1T3 enm workflows updated
 
DM_clustering.ppt
DM_clustering.pptDM_clustering.ppt
DM_clustering.ppt
 
Moving object detection in complex scene
Moving object detection in complex sceneMoving object detection in complex scene
Moving object detection in complex scene
 
Materials Informatics Overview
Materials Informatics OverviewMaterials Informatics Overview
Materials Informatics Overview
 
Rank based similarity search reducing the dimensional dependence
Rank based similarity search reducing the dimensional dependenceRank based similarity search reducing the dimensional dependence
Rank based similarity search reducing the dimensional dependence
 
UHDMML.pps
UHDMML.ppsUHDMML.pps
UHDMML.pps
 
report2.doc
report2.docreport2.doc
report2.doc
 
Graph Models for Deep Learning
Graph Models for Deep LearningGraph Models for Deep Learning
Graph Models for Deep Learning
 
Spatial data analysis 1
Spatial data analysis 1Spatial data analysis 1
Spatial data analysis 1
 
Climate Extremes Workshop - Networks and Extremes: Review and Further Studies...
Climate Extremes Workshop - Networks and Extremes: Review and Further Studies...Climate Extremes Workshop - Networks and Extremes: Review and Further Studies...
Climate Extremes Workshop - Networks and Extremes: Review and Further Studies...
 
Seminar on gis analysis functions
Seminar on gis analysis functionsSeminar on gis analysis functions
Seminar on gis analysis functions
 
COLOCATION MINING IN UNCERTAIN DATA SETS: A PROBABILISTIC APPROACH
COLOCATION MINING IN UNCERTAIN DATA SETS: A PROBABILISTIC APPROACHCOLOCATION MINING IN UNCERTAIN DATA SETS: A PROBABILISTIC APPROACH
COLOCATION MINING IN UNCERTAIN DATA SETS: A PROBABILISTIC APPROACH
 
Fahroo - Optimization and Discrete Mathematics - Spring Review 2013
Fahroo - Optimization and Discrete Mathematics - Spring Review 2013Fahroo - Optimization and Discrete Mathematics - Spring Review 2013
Fahroo - Optimization and Discrete Mathematics - Spring Review 2013
 
UNIT_V_Cluster Analysis.pptx
UNIT_V_Cluster Analysis.pptxUNIT_V_Cluster Analysis.pptx
UNIT_V_Cluster Analysis.pptx
 
Feature selection and microarray data
Feature selection and microarray dataFeature selection and microarray data
Feature selection and microarray data
 
Handling Missing Attributes using Matrix Factorization 
Handling Missing Attributes using Matrix Factorization Handling Missing Attributes using Matrix Factorization 
Handling Missing Attributes using Matrix Factorization 
 
Aitf 2014 pem_introduction_presentation_feb28_ram_version2
Aitf 2014 pem_introduction_presentation_feb28_ram_version2Aitf 2014 pem_introduction_presentation_feb28_ram_version2
Aitf 2014 pem_introduction_presentation_feb28_ram_version2
 
Machine Learning Techniques for the Smart Grid – Modeling of Solar Energy usi...
Machine Learning Techniques for the Smart Grid – Modeling of Solar Energy usi...Machine Learning Techniques for the Smart Grid – Modeling of Solar Energy usi...
Machine Learning Techniques for the Smart Grid – Modeling of Solar Energy usi...
 

Plus de Ipsit Dash

Land Reforms : An overview
Land Reforms : An overviewLand Reforms : An overview
Land Reforms : An overviewIpsit Dash
 
Water Sector Debate
Water Sector DebateWater Sector Debate
Water Sector DebateIpsit Dash
 
Tirupur Water Supply and Sanitation
Tirupur Water Supply and SanitationTirupur Water Supply and Sanitation
Tirupur Water Supply and SanitationIpsit Dash
 
Change Detection Dubai
Change Detection DubaiChange Detection Dubai
Change Detection DubaiIpsit Dash
 
Average- An android project
Average- An android projectAverage- An android project
Average- An android projectIpsit Dash
 
Implementation of INS-GPS
Implementation of INS-GPSImplementation of INS-GPS
Implementation of INS-GPSIpsit Dash
 

Plus de Ipsit Dash (7)

Land Reforms : An overview
Land Reforms : An overviewLand Reforms : An overview
Land Reforms : An overview
 
Water Sector Debate
Water Sector DebateWater Sector Debate
Water Sector Debate
 
Tirupur Water Supply and Sanitation
Tirupur Water Supply and SanitationTirupur Water Supply and Sanitation
Tirupur Water Supply and Sanitation
 
Change Detection Dubai
Change Detection DubaiChange Detection Dubai
Change Detection Dubai
 
Average- An android project
Average- An android projectAverage- An android project
Average- An android project
 
Dark diamond
Dark diamondDark diamond
Dark diamond
 
Implementation of INS-GPS
Implementation of INS-GPSImplementation of INS-GPS
Implementation of INS-GPS
 

Dernier

Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxVishalSingh1417
 
Third Battle of Panipat detailed notes.pptx
Third Battle of Panipat detailed notes.pptxThird Battle of Panipat detailed notes.pptx
Third Battle of Panipat detailed notes.pptxAmita Gupta
 
Food safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfFood safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfSherif Taha
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfPoh-Sun Goh
 
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfUGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfNirmal Dwivedi
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhikauryashika82
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...Poonam Aher Patil
 
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...pradhanghanshyam7136
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...christianmathematics
 
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptxMaritesTamaniVerdade
 
Understanding Accommodations and Modifications
Understanding  Accommodations and ModificationsUnderstanding  Accommodations and Modifications
Understanding Accommodations and ModificationsMJDuyan
 
Magic bus Group work1and 2 (Team 3).pptx
Magic bus Group work1and 2 (Team 3).pptxMagic bus Group work1and 2 (Team 3).pptx
Magic bus Group work1and 2 (Team 3).pptxdhanalakshmis0310
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxDenish Jangid
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin ClassesCeline George
 
Unit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxUnit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxVishalSingh1417
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introductionMaksud Ahmed
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxheathfieldcps1
 
SOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning PresentationSOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning Presentationcamerronhm
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdfQucHHunhnh
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxVishalSingh1417
 

Dernier (20)

Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptx
 
Third Battle of Panipat detailed notes.pptx
Third Battle of Panipat detailed notes.pptxThird Battle of Panipat detailed notes.pptx
Third Battle of Panipat detailed notes.pptx
 
Food safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfFood safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdf
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdf
 
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfUGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...
 
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
 
Understanding Accommodations and Modifications
Understanding  Accommodations and ModificationsUnderstanding  Accommodations and Modifications
Understanding Accommodations and Modifications
 
Magic bus Group work1and 2 (Team 3).pptx
Magic bus Group work1and 2 (Team 3).pptxMagic bus Group work1and 2 (Team 3).pptx
Magic bus Group work1and 2 (Team 3).pptx
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
 
Unit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxUnit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptx
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
 
SOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning PresentationSOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning Presentation
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 

Spatial Data Mining : Seminar

  • 1. Discovering Spatial Co-location Patterns : A summary of Results Shashi Sekhar, Yan Huang Dept of Computer Sciences, University of Minnesota, USA AG 2414 Spatial Analysis Seminar by- Adrian C Prelipcean Ipsit Dash
  • 2. Outline • Introduction • Focus of the Research • Background of the Problem • Approaches of Modeling the Co-location problem • Co-location Miner Algorithm • Conclusions
  • 3. Data Data Data!!!! • Data are being collected continuously for innumerable phenomenon. • Business applications • Scientific Applications • National Security Purposes It is impossible to analyze each strand of data collected. Calls for Data Mining Automation Hypothesis Generation Better Linking of phenomenon
  • 4. Data Mining • Extraction of interesting (non-trivial, implicit, previously unknown and potentially useful) patterns or knowledge from huge amount of data backed by complex computer based decision systems ( A.I.,Business Intelligence, Machine learning) • Spatial Data Mining is different from Classical Data Mining used with reference to RDBMS. • Here attributes of the neighbours of some object of interest may have an influence on the object and therefore have to be considered as well. The explicit location and extension of spatial objects define implicit relations of spatial neighbourhood (such as topological, distance and direction relations) which are used by spatial data mining algorithms. • Methods used in General Data Mining- Predictive (Classification, Regression) and Descriptive (Clustering, Association) • Methods used in determining Spatial Patterns-  Location prediction model (to identify habitat of endangered species)  Spatial clusters (crime hot-spots, cancer clusters)  Spatial associations: co-locations (predator-prey species, symbiosis dental health and fluoride)  Spatial outlier : discontinuities (bad traffic sensors on highways)
  • 5. Spatial Associations : Co-locations • Classical Association methodology : Given a set of transactions, find rules that will predict the occurrence of an item based on the occurrences of other items in the transaction. Implication means co-occurrence, not causality! • Now in case of Spatial Data, the transactions are not disjoint. So classical approach is not ideal to be used here. • Association vs Co-location
  • 6. Focus of the Research • To extract information from geospatial data and identify frequent co- occurrence among Boolean spatial features like Draught, El Nino, Substantial drop in vegetation etc. on Ecological Datasets. • Approaches to discover co-location rules can be classified into 2 classes- Spatial Statistics, Association Rules • Spatial Statistics- Uses spatial correlation measures to characterize different relations between spatial features ( chi-sq tests, Correlation coefficients, regression models etc) • Association Rules- Assumes that finite set of disjoint sets are given as input to the Algorithm and they can find the most frequent items from the set and deduce relationships on those items ( apriori algorithm and Independent Approaches- Based on suitability of Reference Spatial Feature to mine all association rules of nearby spatial features. )
  • 7. Background of the Problem • Given 1. A set of items T of K boolean spatial feature types T={f1,f2,…,fK) 2. A set of N instances P={p1…pN} each p, is a vector <instance-id, spatial feature type, location> 3. A neighbor relation R over locations in S 4. Min prevalence threshold value, min conditional probability threshold • Objectives 1. Completeness: it finds all spatial colocation rules that satisfy the threshold value(s) 2. Correctness: any spatial co-location rule found by the algorithm respects the threshold value(s) 3. IO cost and CPU cost to generate the colocation rules should be acceptable • Find – Co-location rules with high prevalence and high conditional probability • Constraints – R is symmetric and reflexive – Monotonic prevalence measure – Conditional probability measures are specified by the event centric model – Sparse data set, the number of instance of any spatial features is << cardinality (P)
  • 8. Approaches of Modeling the Co-location Rules Problem • The reference feature centric model – Is relevant to application domains focusing on a specific boolean spatial feature • The window centric model – Is relevant to applications like mining, surveying and geology, which focus on land-parcels – One goal is to predict sets of spatial features likely to be discovered in a land parcel given that some other features have been found there • The event-centric model – Is relevant to applications that have interest in finding subsets of spatial features likely to occur in a neighborhood around instances of given subsets of event types
  • 9. Reference feature centric model • Let the reference feature be A • The set of spatial predicates include one predicate: close_to(a,b) which is true if and only if b is a’s neighbour
  • 10. Reference feature centric model Association rule example: is_type(i,A)∧∃ j is_type(j,B)∧close_to(j,i) →∃ k is_type(k,C)∧close_to(k,i) with 100% probability
  • 11. Window centric model Number of windows: 16 Number of windows containing A: 15 Number of windows containing A and B: 7 Association rule: an instance of type A in a window → an instance of type B in a window with 7/15=46.67% probability
  • 12. Event centric model Instances of type A: 4 Instances of type A that have some instances of type B: 1 Conditional probability for the co- location rule is: spatial feature A at location l → spatial feature type B in 9-neighbor neighborhood is 25%
  • 14. Co-location Miner Algorithm • Input 1. K boolean spatial instance and their instances 2. A symmetric and reflexive neighbor relation R 3. A user specified minimum threshold prevalence measure (min_prevalence) 4. A user specified minimum conditional probability (min_cond_prob) • Output – Co-location rule sets with participation index> min_prevalence and conditional probability>min_cond_prob • Method 1. Prevalent size 1 co-location set along with their table instances=P 2. Generate size 2 co-location rules 3. For size of co-locations in (2,3,…,K-1) do 4. Generate candidate prevalent co-locations using the generalized apriori_gen algorithm 5. Generate table instances and prune based on neighborhood 6. Prune based on prevalence of co-locations 7. Generate co-location rules 8. end;
  • 16.
  • 17. Conclusions • This paper gives a clear idea about co- localization problem and its difference from classical association problem. • Co-location Miner algorithm was dealt with easily with a comprehensive understanding and detailed analysis. • Future centric approach of the authors promises development in field of Plane Sweeping Algorithms.
  • 18. Spatial Co-location Patterns: articles • – S. Shekhar and Y. Huang, Discovering Spatial Co-location Patterns: A Summary of Results, In Proc. of 7th Intl Symposium on Spatial and Temporal Databases (SSTD), Springer-Verlag, Lecture Notes in Computer Science, LNCS 2121, p.236 ff, July 2001 • – S. Shekhar and Y. Huang, Multi-resolution Co-location Miner: a New Algorithm to Find Co-location Patterns from Spatial Datasets, SIAM SDM02 Workshop on Mining Scientific Datasets, April 2002 • – Y. Huang, H. Xiong, S. Shekhar, and J. Pei, Mining Confident Co-location Rules without A Support Threshold, in Proc. of 18th ACM Symposium on Applied Computing (ACM SAC), March 2003 • – Y. Huang, S. Shekhar, and H. Xiong, Discovering Colocation Patterns from Spatial Datasets: A General Approach, submitted to IEEE Transactions on Knowledge and Data Engineering (TKDE), 2004