SlideShare une entreprise Scribd logo
1  sur  21
Télécharger pour lire hors ligne
Absolute and Relative Clustering
Toshihiro Kamishima and Shotaro Akaho
National Institute of Advanced Industrial Science and Technology (AIST), Japan
4th MultiClust Workshop on Multiple Clusterings, Multi-view Data,
and Multi-source Knowledge-driven Clustering
In conjunction with the KDD 2013 @ Chicago, U.S.A., Aug. 11, 2013
1START
clustering a data set under the supervision
that indicates clusters desired by a user
Supervised Clustering
Overview
2
properties of real tasks that should be considered
when formalizing these tasks as mathematical problems
Absolute and Relative Clustering
These properties are useful for determining these design issues:
formats of input examples & the goal of learning
the types of supervision
information provided by features
An Intuitive Definition of
Absolute and Relative Clustering
3
Real Tasks and Math Problems
4
tasks in the real world
what we want to perform
problems in the math world
solved in computers
Absolute and relative clustering are properties
of real tasks, not of mathematical problems
ex. document clustering
a set of documents document vectors
of bags of words
x1, x2, . . . , xN
algorithm
document clusters
C1, C2, . . . , CK
criterion
formalize
Absolute and Relative Clustering
5
A B A B
In user’s target task, consider the determination whether two objects
grouped together separatedOR
X
BA
OR
BA
absolute clustering relative clustering
X
BA
OR
BA
If the determination is
NOT
influenced
CAN
influenced
Reference Matching
6
The reference matching task is an example of absolute clustering
The goal of reference matching is to group reference strings into clusters
of multiple real references to objects consisting of the same entity
the appearances of strings are different
Knowledge Discovery and Data Mining KDD& grouped
the order of words are permuted
Author → Title → Journal → Year
Author → Year → Title → Journal
grouped
These strings refer the same entity in the real worldEx
Reference Matching
7
The strings 1 and 2 in a document set currently refer the same entity
The entity referred by the strings 1 and 2 never changes
The determination whether a pair of strings are clustered together
is NOT influenced by the other strings in a document set
The reference matching task is absolute clustering
string 1 string 2 string 3 string 4
refer the same entity
string 5
string 1 string 2 string 3 string 4
refer the same entity
string 5 new str
Noun Coreference
8
The noun coreference task is an example of relative clustering
The goal of noun coreference is to group noun phrases in a document
into clusters of phrases corresponding to the same entity or concept
Ex
If one determines these phrases represent the same person in a
news article, they are clustered together
Mr. Abe , who is the prime minister of Japan , visited Kyoto.
And he met the mayor of the Kyoto city.
Noun Coreference
9
Currently, the phrase “a parent turtle” in the sentence A and
the phrase “this turtle” in the sentence C
are separated in different clusters.
A: There is a parent turtle .
B: On this turtle ,
there is a child turtle .
C: On this turtle ,
there is a grandchild turtle .
The sentence B is deleted
Noun Coreference
10
The phrase “a parent turtle” in the sentence A and
the phrase “this turtle” in the sentence C are clustered together
A: There is a parent turtle .
C: On this turtle ,
there is a grandchild turtle .
The determination whether a pair of phrases are clustered together
is influenced by the other phrases
The noun coreference task is relative clustering
A Formal Definition of
Absolute and Relative Clustering
11
Clustering Function
12
: a universal object set, a domain of all possible objectsX
: an object setX = {x1, x2, . . . , xN } ⇢ X
CX = {c1, c2, . . . , cK}: a partition, and c1, c2, . . . , cK : clusters
({xi, xj}, CX ),
(
1 if xi and xj are in the same cluster
0 otherwise
⇡(X): maps a given object set, , into a partition,X CX
Clustering Function
a set of entities
real task math problem
appropriate partition that
fits for the goal of a real task CX
Xformal representation
correspond to
True Clustering Function ⇡⇤
(X)
Formal Definition
If a true clustering function, π*(X), for the target task satisfies the
following condition, the task is absolute clustering; otherwise, it is
relative clustering
Absolute and Relative Clustering
13
Intuitive Definition
If the determination whether two objects are grouped together or
separated is not influenced by the other objects, it is an absolute
clustering task; otherwise, it is a relative clustering task
({xi, xj}, ⇡⇤
(X)) = ({xi, xj}, ⇡⇤
(X0
)),
8
xi, xj 2 X  X0
, xi6=xj, 8
X, X0
✓ X
Property of Absolute Clustering
14
Existence of an Absolute Partition
An absolute partition	 exists iff a true clustering function
corresponds to an absolute clustering task
All assignments of objects are consistent with this an absolute partition
even if clustered object sets are changed
({xi, xj}, ⇡⇤
(X)) = ({xi, xj}, C),
8
xi, xj 2 X, xi6=xj, 8
X ✓ X
absolute
partition
universal
object set
x1 x2x3 x4x5 x6
C
X
X1
X2
C = ⇡⇤
(X)
Property of Absolute Clustering
15
Transitivity across Different Object Sets
For absolute clustering task, the following transitivity is satisfied,
because there is an absolute partition:
For	 and	 , x1 and x2 are in the same
cluster, and x1 and x3 are also in the same cluster.
In this case, when two object sets are merged, x2 and x3 fall in the
same cluster
x1, x2 2 X1 x1, x3 2 X2
X1 X2
same same
sameX1 [ X2
x1x2 x3
Three Types of
Supervised Clustering Problems
16
There Types of
Supervised Clustering Problems
17
Transductive Clustering : A single object set with supervision
information is given, and the goal of learning is to obtain a partition
of the set
Applicable to both absolute and relative clustering tasks
Semi-Supervised Clustering : A clustering function is learned
from a single object set with supervision information
Fit for performing absolute clustering tasks
Fully Supervised Clustering : A clustering function is learned
from multiple object sets with supervision information
Relative clustering tasks must be formulated as this type of
problems
Math Problems of Supervised Clustering
format of input examples & goal of the algorithm
Transductive Clustering
18
A single object set with supervision information is given, and the goal
of learning is to obtain a partition of the set
input example
X Y learning
algorithm
ˆCX
partition of Xsupervision
information
object set
The distinction between absolute and relative clustering becomes
apparent when the contents of an object set change
There is no need to differentiate between absolute and relative
clustering, because an object set is invariant
Semi-Supervised Clustering
19
input example
X Y learning
algorithm
clustering
function
supervision
information
object set
Xt
ˆCXt
ˆ⇡(X)
partition of Xt
test object set
A clustering function is learned from a single object set with
supervision information, and the function is used to cluster a test
object set
To formulate absolute clustering tasks,
transitivity property can be efficiently exploited
To learn a clustering function for an absolute clustering task, the task
should be formulated as a semi-supervised clustering problem
To formulate a relative clustering task,
the supervision information, Yi, is valid only for the object set, Xi
To learn a clustering function for a relative clustering task, the task
must be formulated as a fully supervised clustering problem
A clustering function is learned from multiple object sets with
supervision information, and the function is used to cluster a test
object set
Fully Supervised Clustering
20
input examples
learning
algorithm
clustering
function
Xt
ˆCXt
ˆ⇡(X)
partition of Xt
test object set
X1
X2
XN
Y1
Y2
YN
Conclusions
21
We propose a notion of absolute and relative clustering
The determination whether a pair of objects are clustered together
or not is influenced by the other objects, then it is a absolute
clustering; otherwise, it is relative clustering
Two properties of absolute clustering task
Existence of an absolute partition
Transitivity across different object sets
Three types of supervised clustering problems
Transductive clustering, Semi-supervised clustering, and Fully
supervised clustering
Absolute clustering tasks should be formulated as a semi-
supervised problem, and relative clustering taks must be
formulated as a fully supervised problem.

Contenu connexe

Tendances

Gravitational Based Hierarchical Clustering Algorithm
Gravitational Based Hierarchical Clustering AlgorithmGravitational Based Hierarchical Clustering Algorithm
Gravitational Based Hierarchical Clustering AlgorithmIDES Editor
 
A mathematical model for integrating product of two functions
A mathematical model for integrating product of two functionsA mathematical model for integrating product of two functions
A mathematical model for integrating product of two functionsAlexander Decker
 
Kernel Methods and Relational Learning in Computational Biology
Kernel Methods and Relational Learning in Computational BiologyKernel Methods and Relational Learning in Computational Biology
Kernel Methods and Relational Learning in Computational BiologyMichiel Stock
 
STUDY OF Ε-SMOOTH SUPPORT VECTOR REGRESSION AND COMPARISON WITH Ε- SUPPORT ...
STUDY OF Ε-SMOOTH SUPPORT VECTOR  REGRESSION AND COMPARISON WITH Ε- SUPPORT  ...STUDY OF Ε-SMOOTH SUPPORT VECTOR  REGRESSION AND COMPARISON WITH Ε- SUPPORT  ...
STUDY OF Ε-SMOOTH SUPPORT VECTOR REGRESSION AND COMPARISON WITH Ε- SUPPORT ...ijscai
 
FiniteElementNotes
FiniteElementNotesFiniteElementNotes
FiniteElementNotesMartin Jones
 
3rd Grade cc Reference Guide for Investigations
3rd Grade cc Reference Guide for Investigations3rd Grade cc Reference Guide for Investigations
3rd Grade cc Reference Guide for InvestigationsIsaac_Schools_5
 
Reproducibility and differential analysis with selfish
Reproducibility and differential analysis with selfishReproducibility and differential analysis with selfish
Reproducibility and differential analysis with selfishtuxette
 
[DL輪読会]Generative Models of Visually Grounded Imagination
[DL輪読会]Generative Models of Visually Grounded Imagination[DL輪読会]Generative Models of Visually Grounded Imagination
[DL輪読会]Generative Models of Visually Grounded ImaginationDeep Learning JP
 
Differential analyses of structures in HiC data
Differential analyses of structures in HiC dataDifferential analyses of structures in HiC data
Differential analyses of structures in HiC datatuxette
 
(研究会輪読) Facial Landmark Detection by Deep Multi-task Learning
(研究会輪読) Facial Landmark Detection by Deep Multi-task Learning(研究会輪読) Facial Landmark Detection by Deep Multi-task Learning
(研究会輪読) Facial Landmark Detection by Deep Multi-task LearningMasahiro Suzuki
 
Deep VI with_beta_likelihood
Deep VI with_beta_likelihoodDeep VI with_beta_likelihood
Deep VI with_beta_likelihoodNatan Katz
 
Graph Neural Network for Phenotype Prediction
Graph Neural Network for Phenotype PredictionGraph Neural Network for Phenotype Prediction
Graph Neural Network for Phenotype Predictiontuxette
 
Stability criterion of periodic oscillations in a (15)
Stability criterion of periodic oscillations in a (15)Stability criterion of periodic oscillations in a (15)
Stability criterion of periodic oscillations in a (15)Alexander Decker
 
My invited talk at the 2018 Annual Meeting of SIAM (Society of Industrial and...
My invited talk at the 2018 Annual Meeting of SIAM (Society of Industrial and...My invited talk at the 2018 Annual Meeting of SIAM (Society of Industrial and...
My invited talk at the 2018 Annual Meeting of SIAM (Society of Industrial and...Anirbit Mukherjee
 
(DL輪読)Variational Dropout Sparsifies Deep Neural Networks
(DL輪読)Variational Dropout Sparsifies Deep Neural Networks(DL輪読)Variational Dropout Sparsifies Deep Neural Networks
(DL輪読)Variational Dropout Sparsifies Deep Neural NetworksMasahiro Suzuki
 
a - ANTI FUZZY NEW IDEAL OF PUALGEBRA
a - ANTI FUZZY NEW IDEAL OF PUALGEBRAa - ANTI FUZZY NEW IDEAL OF PUALGEBRA
a - ANTI FUZZY NEW IDEAL OF PUALGEBRAijfls
 

Tendances (20)

Gravitational Based Hierarchical Clustering Algorithm
Gravitational Based Hierarchical Clustering AlgorithmGravitational Based Hierarchical Clustering Algorithm
Gravitational Based Hierarchical Clustering Algorithm
 
A mathematical model for integrating product of two functions
A mathematical model for integrating product of two functionsA mathematical model for integrating product of two functions
A mathematical model for integrating product of two functions
 
Kernel Methods and Relational Learning in Computational Biology
Kernel Methods and Relational Learning in Computational BiologyKernel Methods and Relational Learning in Computational Biology
Kernel Methods and Relational Learning in Computational Biology
 
STUDY OF Ε-SMOOTH SUPPORT VECTOR REGRESSION AND COMPARISON WITH Ε- SUPPORT ...
STUDY OF Ε-SMOOTH SUPPORT VECTOR  REGRESSION AND COMPARISON WITH Ε- SUPPORT  ...STUDY OF Ε-SMOOTH SUPPORT VECTOR  REGRESSION AND COMPARISON WITH Ε- SUPPORT  ...
STUDY OF Ε-SMOOTH SUPPORT VECTOR REGRESSION AND COMPARISON WITH Ε- SUPPORT ...
 
FiniteElementNotes
FiniteElementNotesFiniteElementNotes
FiniteElementNotes
 
3rd Grade cc Reference Guide for Investigations
3rd Grade cc Reference Guide for Investigations3rd Grade cc Reference Guide for Investigations
3rd Grade cc Reference Guide for Investigations
 
Reproducibility and differential analysis with selfish
Reproducibility and differential analysis with selfishReproducibility and differential analysis with selfish
Reproducibility and differential analysis with selfish
 
[DL輪読会]Generative Models of Visually Grounded Imagination
[DL輪読会]Generative Models of Visually Grounded Imagination[DL輪読会]Generative Models of Visually Grounded Imagination
[DL輪読会]Generative Models of Visually Grounded Imagination
 
Differential analyses of structures in HiC data
Differential analyses of structures in HiC dataDifferential analyses of structures in HiC data
Differential analyses of structures in HiC data
 
(研究会輪読) Facial Landmark Detection by Deep Multi-task Learning
(研究会輪読) Facial Landmark Detection by Deep Multi-task Learning(研究会輪読) Facial Landmark Detection by Deep Multi-task Learning
(研究会輪読) Facial Landmark Detection by Deep Multi-task Learning
 
Deep VI with_beta_likelihood
Deep VI with_beta_likelihoodDeep VI with_beta_likelihood
Deep VI with_beta_likelihood
 
Graph Neural Network for Phenotype Prediction
Graph Neural Network for Phenotype PredictionGraph Neural Network for Phenotype Prediction
Graph Neural Network for Phenotype Prediction
 
Joco pavone
Joco pavoneJoco pavone
Joco pavone
 
Stability criterion of periodic oscillations in a (15)
Stability criterion of periodic oscillations in a (15)Stability criterion of periodic oscillations in a (15)
Stability criterion of periodic oscillations in a (15)
 
Multitask learning for GGM
Multitask learning for GGMMultitask learning for GGM
Multitask learning for GGM
 
Nested loop
Nested loopNested loop
Nested loop
 
My invited talk at the 2018 Annual Meeting of SIAM (Society of Industrial and...
My invited talk at the 2018 Annual Meeting of SIAM (Society of Industrial and...My invited talk at the 2018 Annual Meeting of SIAM (Society of Industrial and...
My invited talk at the 2018 Annual Meeting of SIAM (Society of Industrial and...
 
(DL輪読)Variational Dropout Sparsifies Deep Neural Networks
(DL輪読)Variational Dropout Sparsifies Deep Neural Networks(DL輪読)Variational Dropout Sparsifies Deep Neural Networks
(DL輪読)Variational Dropout Sparsifies Deep Neural Networks
 
Bistablecamnets
BistablecamnetsBistablecamnets
Bistablecamnets
 
a - ANTI FUZZY NEW IDEAL OF PUALGEBRA
a - ANTI FUZZY NEW IDEAL OF PUALGEBRAa - ANTI FUZZY NEW IDEAL OF PUALGEBRA
a - ANTI FUZZY NEW IDEAL OF PUALGEBRA
 

En vedette

The Independence of Fairness-aware Classifiers
The Independence of Fairness-aware ClassifiersThe Independence of Fairness-aware Classifiers
The Independence of Fairness-aware ClassifiersToshihiro Kamishima
 
Efficiency Improvement of Neutrality-Enhanced Recommendation
Efficiency Improvement of Neutrality-Enhanced RecommendationEfficiency Improvement of Neutrality-Enhanced Recommendation
Efficiency Improvement of Neutrality-Enhanced RecommendationToshihiro Kamishima
 
Clustering on database systems rkm
Clustering on database systems rkmClustering on database systems rkm
Clustering on database systems rkmVahid Mirjalili
 
Real Time Kernels
Real Time KernelsReal Time Kernels
Real Time KernelsArnav Soni
 
Rt databases vs general purpose tsp
Rt databases vs general purpose  tspRt databases vs general purpose  tsp
Rt databases vs general purpose tspPradeep Kumar TS
 
Chapter 12 transactions and concurrency control
Chapter 12 transactions and concurrency controlChapter 12 transactions and concurrency control
Chapter 12 transactions and concurrency controlAbDul ThaYyal
 
Real time operating-systems
Real time operating-systemsReal time operating-systems
Real time operating-systemskasi963
 
Pythonによる機械学習実験の管理
Pythonによる機械学習実験の管理Pythonによる機械学習実験の管理
Pythonによる機械学習実験の管理Toshihiro Kamishima
 
Model-based Approaches for Independence-Enhanced Recommendation
Model-based Approaches for Independence-Enhanced RecommendationModel-based Approaches for Independence-Enhanced Recommendation
Model-based Approaches for Independence-Enhanced RecommendationToshihiro Kamishima
 
PyMCがあれば,ベイズ推定でもう泣いたりなんかしない
PyMCがあれば,ベイズ推定でもう泣いたりなんかしないPyMCがあれば,ベイズ推定でもう泣いたりなんかしない
PyMCがあれば,ベイズ推定でもう泣いたりなんかしないToshihiro Kamishima
 
Real time Operating System
Real time Operating SystemReal time Operating System
Real time Operating SystemTech_MX
 
Transaction concurrency control
Transaction concurrency controlTransaction concurrency control
Transaction concurrency controlAnand Grewal
 

En vedette (16)

The Independence of Fairness-aware Classifiers
The Independence of Fairness-aware ClassifiersThe Independence of Fairness-aware Classifiers
The Independence of Fairness-aware Classifiers
 
Efficiency Improvement of Neutrality-Enhanced Recommendation
Efficiency Improvement of Neutrality-Enhanced RecommendationEfficiency Improvement of Neutrality-Enhanced Recommendation
Efficiency Improvement of Neutrality-Enhanced Recommendation
 
Clustering on database systems rkm
Clustering on database systems rkmClustering on database systems rkm
Clustering on database systems rkm
 
Qnx os
Qnx os Qnx os
Qnx os
 
Real Time Kernels
Real Time KernelsReal Time Kernels
Real Time Kernels
 
Rt databases vs general purpose tsp
Rt databases vs general purpose  tspRt databases vs general purpose  tsp
Rt databases vs general purpose tsp
 
Qnx os
Qnx osQnx os
Qnx os
 
Chapter 12 transactions and concurrency control
Chapter 12 transactions and concurrency controlChapter 12 transactions and concurrency control
Chapter 12 transactions and concurrency control
 
Real time operating-systems
Real time operating-systemsReal time operating-systems
Real time operating-systems
 
Pythonによる機械学習実験の管理
Pythonによる機械学習実験の管理Pythonによる機械学習実験の管理
Pythonによる機械学習実験の管理
 
Model-based Approaches for Independence-Enhanced Recommendation
Model-based Approaches for Independence-Enhanced RecommendationModel-based Approaches for Independence-Enhanced Recommendation
Model-based Approaches for Independence-Enhanced Recommendation
 
PyMCがあれば,ベイズ推定でもう泣いたりなんかしない
PyMCがあれば,ベイズ推定でもう泣いたりなんかしないPyMCがあれば,ベイズ推定でもう泣いたりなんかしない
PyMCがあれば,ベイズ推定でもう泣いたりなんかしない
 
QNX OS
QNX OSQNX OS
QNX OS
 
Real time Operating System
Real time Operating SystemReal time Operating System
Real time Operating System
 
Transaction concurrency control
Transaction concurrency controlTransaction concurrency control
Transaction concurrency control
 
RTOS Basic Concepts
RTOS Basic ConceptsRTOS Basic Concepts
RTOS Basic Concepts
 

Similaire à Absolute and Relative Clustering

An Introduction to Radical Minimalism: Merge & Agree
An Introduction to Radical Minimalism: Merge & AgreeAn Introduction to Radical Minimalism: Merge & Agree
An Introduction to Radical Minimalism: Merge & AgreeDiego Krivochen
 
20070702 Text Categorization
20070702 Text Categorization20070702 Text Categorization
20070702 Text Categorizationmidi
 
It Is Possible to Do Object-Oriented Programming in Java
It Is Possible to Do Object-Oriented Programming in JavaIt Is Possible to Do Object-Oriented Programming in Java
It Is Possible to Do Object-Oriented Programming in JavaKevlin Henney
 
Dimensionality reduction by matrix factorization using concept lattice in dat...
Dimensionality reduction by matrix factorization using concept lattice in dat...Dimensionality reduction by matrix factorization using concept lattice in dat...
Dimensionality reduction by matrix factorization using concept lattice in dat...eSAT Journals
 
CORCON2014: Does programming really need data structures?
CORCON2014: Does programming really need data structures?CORCON2014: Does programming really need data structures?
CORCON2014: Does programming really need data structures?Marco Benini
 
A Tree Kernel based approach for clone detection
A Tree Kernel based approach for clone detectionA Tree Kernel based approach for clone detection
A Tree Kernel based approach for clone detectionValerio Maggio
 
A tree kernel based approach for clone detection
A tree kernel based approach for clone detectionA tree kernel based approach for clone detection
A tree kernel based approach for clone detectionICSM 2010
 
Discovering Novel Information with sentence Level clustering From Multi-docu...
Discovering Novel Information with sentence Level clustering  From Multi-docu...Discovering Novel Information with sentence Level clustering  From Multi-docu...
Discovering Novel Information with sentence Level clustering From Multi-docu...irjes
 
SOFIE - A Unified Approach To Ontology-Based Information Extraction Using Rea...
SOFIE - A Unified Approach To Ontology-Based Information Extraction Using Rea...SOFIE - A Unified Approach To Ontology-Based Information Extraction Using Rea...
SOFIE - A Unified Approach To Ontology-Based Information Extraction Using Rea...Tobias Wunner
 
8.clustering algorithm.k means.em algorithm
8.clustering algorithm.k means.em algorithm8.clustering algorithm.k means.em algorithm
8.clustering algorithm.k means.em algorithmLaura Petrosanu
 

Similaire à Absolute and Relative Clustering (20)

An Introduction to Radical Minimalism: Merge & Agree
An Introduction to Radical Minimalism: Merge & AgreeAn Introduction to Radical Minimalism: Merge & Agree
An Introduction to Radical Minimalism: Merge & Agree
 
20070702 Text Categorization
20070702 Text Categorization20070702 Text Categorization
20070702 Text Categorization
 
AI Final report 1.pdf
AI Final report 1.pdfAI Final report 1.pdf
AI Final report 1.pdf
 
It Is Possible to Do Object-Oriented Programming in Java
It Is Possible to Do Object-Oriented Programming in JavaIt Is Possible to Do Object-Oriented Programming in Java
It Is Possible to Do Object-Oriented Programming in Java
 
Dimensionality reduction by matrix factorization using concept lattice in dat...
Dimensionality reduction by matrix factorization using concept lattice in dat...Dimensionality reduction by matrix factorization using concept lattice in dat...
Dimensionality reduction by matrix factorization using concept lattice in dat...
 
Euclideus_Language
Euclideus_LanguageEuclideus_Language
Euclideus_Language
 
paper
paperpaper
paper
 
Lattice2 tree
Lattice2 treeLattice2 tree
Lattice2 tree
 
CORCON2014: Does programming really need data structures?
CORCON2014: Does programming really need data structures?CORCON2014: Does programming really need data structures?
CORCON2014: Does programming really need data structures?
 
Side 2019 #12
Side 2019 #12Side 2019 #12
Side 2019 #12
 
A Tree Kernel based approach for clone detection
A Tree Kernel based approach for clone detectionA Tree Kernel based approach for clone detection
A Tree Kernel based approach for clone detection
 
A tree kernel based approach for clone detection
A tree kernel based approach for clone detectionA tree kernel based approach for clone detection
A tree kernel based approach for clone detection
 
cluster analysis
cluster analysiscluster analysis
cluster analysis
 
FP 4 OOP FTW!
FP 4 OOP FTW!FP 4 OOP FTW!
FP 4 OOP FTW!
 
Discovering Novel Information with sentence Level clustering From Multi-docu...
Discovering Novel Information with sentence Level clustering  From Multi-docu...Discovering Novel Information with sentence Level clustering  From Multi-docu...
Discovering Novel Information with sentence Level clustering From Multi-docu...
 
project
projectproject
project
 
Fuzzy Logic_HKR
Fuzzy Logic_HKRFuzzy Logic_HKR
Fuzzy Logic_HKR
 
SOFIE - A Unified Approach To Ontology-Based Information Extraction Using Rea...
SOFIE - A Unified Approach To Ontology-Based Information Extraction Using Rea...SOFIE - A Unified Approach To Ontology-Based Information Extraction Using Rea...
SOFIE - A Unified Approach To Ontology-Based Information Extraction Using Rea...
 
8.clustering algorithm.k means.em algorithm
8.clustering algorithm.k means.em algorithm8.clustering algorithm.k means.em algorithm
8.clustering algorithm.k means.em algorithm
 
Sets, functions and groups
Sets, functions and groupsSets, functions and groups
Sets, functions and groups
 

Plus de Toshihiro Kamishima

RecSys2018論文読み会 資料
RecSys2018論文読み会 資料RecSys2018論文読み会 資料
RecSys2018論文読み会 資料Toshihiro Kamishima
 
機械学習研究でのPythonの利用
機械学習研究でのPythonの利用機械学習研究でのPythonの利用
機械学習研究でのPythonの利用Toshihiro Kamishima
 
Considerations on Recommendation Independence for a Find-Good-Items Task
Considerations on Recommendation Independence for a Find-Good-Items TaskConsiderations on Recommendation Independence for a Find-Good-Items Task
Considerations on Recommendation Independence for a Find-Good-Items TaskToshihiro Kamishima
 
科学技術計算関連Pythonパッケージの概要
科学技術計算関連Pythonパッケージの概要科学技術計算関連Pythonパッケージの概要
科学技術計算関連Pythonパッケージの概要Toshihiro Kamishima
 
Future Directions of Fairness-Aware Data Mining: Recommendation, Causality, a...
Future Directions of Fairness-Aware Data Mining: Recommendation, Causality, a...Future Directions of Fairness-Aware Data Mining: Recommendation, Causality, a...
Future Directions of Fairness-Aware Data Mining: Recommendation, Causality, a...Toshihiro Kamishima
 
Correcting Popularity Bias by Enhancing Recommendation Neutrality
Correcting Popularity Bias by Enhancing Recommendation NeutralityCorrecting Popularity Bias by Enhancing Recommendation Neutrality
Correcting Popularity Bias by Enhancing Recommendation NeutralityToshihiro Kamishima
 
Consideration on Fairness-aware Data Mining
Consideration on Fairness-aware Data MiningConsideration on Fairness-aware Data Mining
Consideration on Fairness-aware Data MiningToshihiro Kamishima
 
Fairness-aware Classifier with Prejudice Remover Regularizer
Fairness-aware Classifier with Prejudice Remover RegularizerFairness-aware Classifier with Prejudice Remover Regularizer
Fairness-aware Classifier with Prejudice Remover RegularizerToshihiro Kamishima
 
Enhancement of the Neutrality in Recommendation
Enhancement of the Neutrality in RecommendationEnhancement of the Neutrality in Recommendation
Enhancement of the Neutrality in RecommendationToshihiro Kamishima
 
OpenOpt の線形計画で圧縮センシング
OpenOpt の線形計画で圧縮センシングOpenOpt の線形計画で圧縮センシング
OpenOpt の線形計画で圧縮センシングToshihiro Kamishima
 
Fairness-aware Learning through Regularization Approach
Fairness-aware Learning through Regularization ApproachFairness-aware Learning through Regularization Approach
Fairness-aware Learning through Regularization ApproachToshihiro Kamishima
 

Plus de Toshihiro Kamishima (16)

RecSys2018論文読み会 資料
RecSys2018論文読み会 資料RecSys2018論文読み会 資料
RecSys2018論文読み会 資料
 
WSDM2018読み会 資料
WSDM2018読み会 資料WSDM2018読み会 資料
WSDM2018読み会 資料
 
Recommendation Independence
Recommendation IndependenceRecommendation Independence
Recommendation Independence
 
機械学習研究でのPythonの利用
機械学習研究でのPythonの利用機械学習研究でのPythonの利用
機械学習研究でのPythonの利用
 
Considerations on Recommendation Independence for a Find-Good-Items Task
Considerations on Recommendation Independence for a Find-Good-Items TaskConsiderations on Recommendation Independence for a Find-Good-Items Task
Considerations on Recommendation Independence for a Find-Good-Items Task
 
KDD2016勉強会 資料
KDD2016勉強会 資料KDD2016勉強会 資料
KDD2016勉強会 資料
 
科学技術計算関連Pythonパッケージの概要
科学技術計算関連Pythonパッケージの概要科学技術計算関連Pythonパッケージの概要
科学技術計算関連Pythonパッケージの概要
 
WSDM2016勉強会 資料
WSDM2016勉強会 資料WSDM2016勉強会 資料
WSDM2016勉強会 資料
 
ICML2015読み会 資料
ICML2015読み会 資料ICML2015読み会 資料
ICML2015読み会 資料
 
Future Directions of Fairness-Aware Data Mining: Recommendation, Causality, a...
Future Directions of Fairness-Aware Data Mining: Recommendation, Causality, a...Future Directions of Fairness-Aware Data Mining: Recommendation, Causality, a...
Future Directions of Fairness-Aware Data Mining: Recommendation, Causality, a...
 
Correcting Popularity Bias by Enhancing Recommendation Neutrality
Correcting Popularity Bias by Enhancing Recommendation NeutralityCorrecting Popularity Bias by Enhancing Recommendation Neutrality
Correcting Popularity Bias by Enhancing Recommendation Neutrality
 
Consideration on Fairness-aware Data Mining
Consideration on Fairness-aware Data MiningConsideration on Fairness-aware Data Mining
Consideration on Fairness-aware Data Mining
 
Fairness-aware Classifier with Prejudice Remover Regularizer
Fairness-aware Classifier with Prejudice Remover RegularizerFairness-aware Classifier with Prejudice Remover Regularizer
Fairness-aware Classifier with Prejudice Remover Regularizer
 
Enhancement of the Neutrality in Recommendation
Enhancement of the Neutrality in RecommendationEnhancement of the Neutrality in Recommendation
Enhancement of the Neutrality in Recommendation
 
OpenOpt の線形計画で圧縮センシング
OpenOpt の線形計画で圧縮センシングOpenOpt の線形計画で圧縮センシング
OpenOpt の線形計画で圧縮センシング
 
Fairness-aware Learning through Regularization Approach
Fairness-aware Learning through Regularization ApproachFairness-aware Learning through Regularization Approach
Fairness-aware Learning through Regularization Approach
 

Dernier

Defining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryDefining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryJeremy Anderson
 
Heart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectHeart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectBoston Institute of Analytics
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdfHuman37
 
9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home Service9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home ServiceSapana Sha
 
Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 217djon017
 
Advanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsAdvanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsVICTOR MAESTRE RAMIREZ
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort servicejennyeacort
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]📊 Markus Baersch
 
Call Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceCall Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceSapana Sha
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024thyngster
 
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)jennyeacort
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...dajasot375
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...Florian Roscheck
 
2006_GasProcessing_HB (1).pdf HYDROCARBON PROCESSING
2006_GasProcessing_HB (1).pdf HYDROCARBON PROCESSING2006_GasProcessing_HB (1).pdf HYDROCARBON PROCESSING
2006_GasProcessing_HB (1).pdf HYDROCARBON PROCESSINGmarianagonzalez07
 
Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Colleen Farrelly
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfJohn Sterrett
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Jack DiGiovanna
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsappssapnasaifi408
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPTBoston Institute of Analytics
 

Dernier (20)

Defining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryDefining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data Story
 
Heart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectHeart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis Project
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf
 
9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home Service9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home Service
 
Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2
 
Advanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsAdvanced Machine Learning for Business Professionals
Advanced Machine Learning for Business Professionals
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]
 
Call Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceCall Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts Service
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
 
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
 
2006_GasProcessing_HB (1).pdf HYDROCARBON PROCESSING
2006_GasProcessing_HB (1).pdf HYDROCARBON PROCESSING2006_GasProcessing_HB (1).pdf HYDROCARBON PROCESSING
2006_GasProcessing_HB (1).pdf HYDROCARBON PROCESSING
 
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
 
Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdf
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
 

Absolute and Relative Clustering

  • 1. Absolute and Relative Clustering Toshihiro Kamishima and Shotaro Akaho National Institute of Advanced Industrial Science and Technology (AIST), Japan 4th MultiClust Workshop on Multiple Clusterings, Multi-view Data, and Multi-source Knowledge-driven Clustering In conjunction with the KDD 2013 @ Chicago, U.S.A., Aug. 11, 2013 1START
  • 2. clustering a data set under the supervision that indicates clusters desired by a user Supervised Clustering Overview 2 properties of real tasks that should be considered when formalizing these tasks as mathematical problems Absolute and Relative Clustering These properties are useful for determining these design issues: formats of input examples & the goal of learning the types of supervision information provided by features
  • 3. An Intuitive Definition of Absolute and Relative Clustering 3
  • 4. Real Tasks and Math Problems 4 tasks in the real world what we want to perform problems in the math world solved in computers Absolute and relative clustering are properties of real tasks, not of mathematical problems ex. document clustering a set of documents document vectors of bags of words x1, x2, . . . , xN algorithm document clusters C1, C2, . . . , CK criterion formalize
  • 5. Absolute and Relative Clustering 5 A B A B In user’s target task, consider the determination whether two objects grouped together separatedOR X BA OR BA absolute clustering relative clustering X BA OR BA If the determination is NOT influenced CAN influenced
  • 6. Reference Matching 6 The reference matching task is an example of absolute clustering The goal of reference matching is to group reference strings into clusters of multiple real references to objects consisting of the same entity the appearances of strings are different Knowledge Discovery and Data Mining KDD& grouped the order of words are permuted Author → Title → Journal → Year Author → Year → Title → Journal grouped These strings refer the same entity in the real worldEx
  • 7. Reference Matching 7 The strings 1 and 2 in a document set currently refer the same entity The entity referred by the strings 1 and 2 never changes The determination whether a pair of strings are clustered together is NOT influenced by the other strings in a document set The reference matching task is absolute clustering string 1 string 2 string 3 string 4 refer the same entity string 5 string 1 string 2 string 3 string 4 refer the same entity string 5 new str
  • 8. Noun Coreference 8 The noun coreference task is an example of relative clustering The goal of noun coreference is to group noun phrases in a document into clusters of phrases corresponding to the same entity or concept Ex If one determines these phrases represent the same person in a news article, they are clustered together Mr. Abe , who is the prime minister of Japan , visited Kyoto. And he met the mayor of the Kyoto city.
  • 9. Noun Coreference 9 Currently, the phrase “a parent turtle” in the sentence A and the phrase “this turtle” in the sentence C are separated in different clusters. A: There is a parent turtle . B: On this turtle , there is a child turtle . C: On this turtle , there is a grandchild turtle . The sentence B is deleted
  • 10. Noun Coreference 10 The phrase “a parent turtle” in the sentence A and the phrase “this turtle” in the sentence C are clustered together A: There is a parent turtle . C: On this turtle , there is a grandchild turtle . The determination whether a pair of phrases are clustered together is influenced by the other phrases The noun coreference task is relative clustering
  • 11. A Formal Definition of Absolute and Relative Clustering 11
  • 12. Clustering Function 12 : a universal object set, a domain of all possible objectsX : an object setX = {x1, x2, . . . , xN } ⇢ X CX = {c1, c2, . . . , cK}: a partition, and c1, c2, . . . , cK : clusters ({xi, xj}, CX ), ( 1 if xi and xj are in the same cluster 0 otherwise ⇡(X): maps a given object set, , into a partition,X CX Clustering Function a set of entities real task math problem appropriate partition that fits for the goal of a real task CX Xformal representation correspond to True Clustering Function ⇡⇤ (X)
  • 13. Formal Definition If a true clustering function, π*(X), for the target task satisfies the following condition, the task is absolute clustering; otherwise, it is relative clustering Absolute and Relative Clustering 13 Intuitive Definition If the determination whether two objects are grouped together or separated is not influenced by the other objects, it is an absolute clustering task; otherwise, it is a relative clustering task ({xi, xj}, ⇡⇤ (X)) = ({xi, xj}, ⇡⇤ (X0 )), 8 xi, xj 2 X X0 , xi6=xj, 8 X, X0 ✓ X
  • 14. Property of Absolute Clustering 14 Existence of an Absolute Partition An absolute partition exists iff a true clustering function corresponds to an absolute clustering task All assignments of objects are consistent with this an absolute partition even if clustered object sets are changed ({xi, xj}, ⇡⇤ (X)) = ({xi, xj}, C), 8 xi, xj 2 X, xi6=xj, 8 X ✓ X absolute partition universal object set x1 x2x3 x4x5 x6 C X X1 X2 C = ⇡⇤ (X)
  • 15. Property of Absolute Clustering 15 Transitivity across Different Object Sets For absolute clustering task, the following transitivity is satisfied, because there is an absolute partition: For and , x1 and x2 are in the same cluster, and x1 and x3 are also in the same cluster. In this case, when two object sets are merged, x2 and x3 fall in the same cluster x1, x2 2 X1 x1, x3 2 X2 X1 X2 same same sameX1 [ X2 x1x2 x3
  • 16. Three Types of Supervised Clustering Problems 16
  • 17. There Types of Supervised Clustering Problems 17 Transductive Clustering : A single object set with supervision information is given, and the goal of learning is to obtain a partition of the set Applicable to both absolute and relative clustering tasks Semi-Supervised Clustering : A clustering function is learned from a single object set with supervision information Fit for performing absolute clustering tasks Fully Supervised Clustering : A clustering function is learned from multiple object sets with supervision information Relative clustering tasks must be formulated as this type of problems Math Problems of Supervised Clustering format of input examples & goal of the algorithm
  • 18. Transductive Clustering 18 A single object set with supervision information is given, and the goal of learning is to obtain a partition of the set input example X Y learning algorithm ˆCX partition of Xsupervision information object set The distinction between absolute and relative clustering becomes apparent when the contents of an object set change There is no need to differentiate between absolute and relative clustering, because an object set is invariant
  • 19. Semi-Supervised Clustering 19 input example X Y learning algorithm clustering function supervision information object set Xt ˆCXt ˆ⇡(X) partition of Xt test object set A clustering function is learned from a single object set with supervision information, and the function is used to cluster a test object set To formulate absolute clustering tasks, transitivity property can be efficiently exploited To learn a clustering function for an absolute clustering task, the task should be formulated as a semi-supervised clustering problem
  • 20. To formulate a relative clustering task, the supervision information, Yi, is valid only for the object set, Xi To learn a clustering function for a relative clustering task, the task must be formulated as a fully supervised clustering problem A clustering function is learned from multiple object sets with supervision information, and the function is used to cluster a test object set Fully Supervised Clustering 20 input examples learning algorithm clustering function Xt ˆCXt ˆ⇡(X) partition of Xt test object set X1 X2 XN Y1 Y2 YN
  • 21. Conclusions 21 We propose a notion of absolute and relative clustering The determination whether a pair of objects are clustered together or not is influenced by the other objects, then it is a absolute clustering; otherwise, it is relative clustering Two properties of absolute clustering task Existence of an absolute partition Transitivity across different object sets Three types of supervised clustering problems Transductive clustering, Semi-supervised clustering, and Fully supervised clustering Absolute clustering tasks should be formulated as a semi- supervised problem, and relative clustering taks must be formulated as a fully supervised problem.