SlideShare a Scribd company logo
1 of 78
Content based image retrieval in large image databases
    Lukasz Miroslaw, Ph.D. , Wojciech Tarnawski, Ph.D.
    Institute of Informatics

    Wroclaw University of Technology, Poland
Contents

 Introduction. Past Projects.

 Content-based image retrieval.

 Collaboration opportunities.
Division of Artificial Inteligence
Competences:
   Optimization Techniques: Swarm Optimization, Tabu Search,
    Simulated Annealing, Evolutionary Algorithms.
   Machine Learning.
   Data Mining.
   Computer Vision.


 Applications: stock market, CBIR, biology.
How complex is the universe?




               Fig. Credit: Wikipedia. Hubble Ultra Deep Field image of a region
               of the observable universe (equivalent sky area size shown in
               bottom left corner), near the constellation Fornax. Each spot is a
               galaxy, consisting of billions of stars. The light from the smallest,
               most redshifted galaxies is thought to have originated roughly 13
               billion years
Why Do We Need Metaheuristics?

Definition: Metaheuristic designates a computational
 method that optimizes a problem by iteratively trying
 to improve a candidate solution with regard to a
 given measure of quality (Wikipedia).

•Stochastic methods: simulated annealing,
 evolutionary algorithms, ant colony optimization.

•Deterministic methods: gradient methods, tabu
 search, dynamic programming, etc.
Evolutionary Algorithm




            Fig. Scheme of EA.
Is Selection of the best the best ?
        Prof. Roman Galar, Dr Artur Chorazyczewski, Prof. Iwona Karcz-Duleba
Content Based Image Retrieval
Project funded by Polish-Singapur Programme.

Duration: 3 years until March 2011.

Objective: Content-based Image Retrieval in large
 databases.

Problem: What does it mean “similar”? How to
 define a similarity measures between two images?
Brain’s visual circuits do error
   correction on the fly
   Prediction Coding.

   Hierarchical Layers detect objects in the bottom-up
     manner.

                                                                                                            Final Objects




                                                         Prediction error                                   More Complex Shapes / Textures / Detailed Geometry




                                                                                                            Horizontal / Vertical Lines / Basic Geometrical Shapes




        Credit: Physorg.org


Egner T.,Monti J.M., Summerfield C., Expectation and Surprise Determine Neural Population Responses in the
Ventral Visual Stream, J Neuroscience 2010, 30(49):16601-16608.
Multi-Scale Approach :
Anisotropic diffusion
Here, a number of images are generated in the
process of convolving the original image with the
Gaussian kernel with the t-variance (scale):


The set of derivative images I(x,y,t) is
equivalent to concurrent solutions of the heat
transport problem or diffusion on the plane.



 Since, the convolution operation smoothes
 region boundaries we decided to use edge-
 preserving isotropic diffusion:
Mean-Shift Segmentation
The ”mean-shift”-based image segmentation algorithm(3) is a non-parametric
clustering in 5D image space (3D color space + 2D planar space). The
method does not require to know the number and the shape of clusters.




 (3) D. Comaniciu and P. Meer, “Mean shift: A robust approach toward
 feature space analysis” IEEE Trans. Pattern Anal. Mach. Intell., vol. 24,
 no. 5, pp. 603–619, 2002.
The Framework
The aim of the proposed segmentation is to partition the image into non-overlapping most –
meaningful image-regions to receive a very generalized (coarse) view of natural images.
                                                         1) Input image
                                                         2) Isotropic diffusion step
                                                         3) Mean-shift based
                                                            segmentation
                                                         4) Accumulating the
                                                            clustering results over all
                                                            scales
                                                         5) Mode based clustering of
                                                            values collected in
                                                            accumulator space
                                                         6) „Visualization” of the clusters
                                                            detected in previous step -
                                                            mapping input pixels into set
                                                            of cluster-labels
                                                         7) Region merging taking
                                                            into account region size
                                                            and planar co-occurency
                                                         8) Output image
Gestalt Theory in Segmentation
 „Visualization” of clusters detected in the   The most-meaningful (merged)
 accumulator space : Grouping laws             regions: Collaboration laws
Image Retrieval System


                             Image database


   GEN-SEG image
   segmentation
                                         Image-to-image
  Calculation of MPEG-7-based
 features for segmented regions          similarity
                                         calculation



  Calculation of MPEG-7-based
 features for segmented regions

   GEN-SEG image
   segmentation
                                                          List of the most similar
                                         Query Image      images
Descriptors Used
The Moving Picture Experts Group (MPEG) defined few visual descriptors and
the appropriate distance measures (MPEG-7 standard):

Scalable Color Descriptor – color histogram in HSV color space that is
encoded by a Haar transform.

Color Layout Descriptor – represents the spatial distribution of the color of
visual signals.

Edge Histogram – represents the spatial distribution of five types of edges,
namely four directional edges and one non-directional

Texture Browsing – 5D descriptor related to perceptual characterization of
texture in terms of regularity , coarseness and directionality

Region Shape – describes the shape of an object in image. The descriptor is
robust to noise that may be introduced in the process of segmentation
Image Retrieval Concept
We have defined the measure of the similarity between images by composition
of distance metrics defined in MPEG-7 standard. The procedure consists of the
following steps:

1.Calculate region-to-region distances between the query image and images in
the database for chosen set of MPEG-7 descriptors.

2.Sort the list of distances for all regions from query-image.

   Democracy: each region belongs to a certain image in the database and
   votes for it.

3.Accumulate the votes and pick the best candidates.




                     Preliminary results: the method outperforms grid-based
                     segmentation IR (min. F-score = 0.34 on MGV Database)
Image-based Genome-scale RNAi
Profiling

 Objective: Identification of genes in cell division
  process by esiRNA gene silencing method.

 •Initial Object-identification based on template
   matching.

 •Post-processing of candidate solutions: Evolutionary
   Algorithm (soft-selection, gaussian mutation).
Genome-wide High-Content Screening
                                                                                         Image-based analysis of > 1300 genes’ phenotypes
                                                                                         Identification of > 300 novel genes’ functions.

                                                                                                                  Fig. GUI Interface of esiImage.




Fig. Detection of mitotic cells in Phase-Contrast Microscopy. HeLa Cell Line.



esiImage: Java-based toolkit for automatic detection of
phenotypes. Author: Lukasz Miroslaw (2006)



                                                                                Credit: Dr Artur Chorazyczewski
Genome-wide High-Content Screening




                                  Fig. Mitotic Index for CDC16 and control.


 Kittler R, Pelletier L, Heninger AK, Slabicki M, Theis M, Miroslaw L, Poser I, Lawo S, Grabner
 H, Kozak K, Wagner J, Surendranath V, Richter C, Bowen W, Jackson AL, Habermann B, Hyman
 AA, Buchholz F. Genome-scale RNAi profiling of cell division in human tissue culture cells.
 Nature Cell Biol. 2007 Dec;9(12):1401-12.
Rapid Cervical Cancer Diagnosis



                                                                                 Fig. Image partition.


                                                                                 80 Statistical Geometrical Features.

                                                                                 Learning and Testing Phase.
                 Fig. Phase-Contrast Image with endothelial cells of interest.
                                                                                 Sequential Forward-Backward
 Dresden Technical University, Prof. Fuchs Image Processing Group
                                                                                   Selection.

                                                                                 kNN, Linear Fisher Discriminant.
T. Schilling, L. Mirosław, G. Głąb, M. Smereka.
Towards rapid cervical cancer diagnosis: automated                               Post-processing.
detection and classification of pathological cells in
phase contrast images 2007. Int. J. Gynec Cancer
17(1):118-26.
Rapid Cervical Cancer Diagnosis
Rapid Cervical Cancer Diagnosis
Rapid Cervical Cancer Diagnosis
Rapid Cervical Cancer Diagnosis
Rapid Cervical Cancer Diagnosis
Rapid Cervical Cancer Diagnosis
Rapid Cervical Cancer Diagnosis
Rapid Cervical Cancer Diagnosis
Rapid Cervical Cancer Diagnosis
Rapid Cervical Cancer Diagnosis
Rapid Cervical Cancer Diagnosis
Rapid Cervical Cancer Diagnosis
Rapid Cervical Cancer Diagnosis
Rapid Cervical Cancer Diagnosis
Rapid Cervical Cancer Diagnosis
Rapid Cervical Cancer Diagnosis
Rapid Cervical Cancer Diagnosis
Rapid Cervical Cancer Diagnosis
Project proposals #1


 Let’s look inside PubMed

   Consortium:
     BIOTEC (Prof. Schroeder): www.gopubmed.com
     Wroclaw University of Technology (CBIR)
     ETH Zurich ? (Computer Vision)



 Objective: A semantic search engine for life
  science images.
Molecular Retrieval System

                             Prof. Kim Baldridge Lab
knAIght - AI in KNIME
Computer Vision meets AI




     Wroclaw University of Technology
          Institute of Informatics
         Wyb. Wyspianskiego 27
             Wroclaw, Poland

           info@ai.pwr.wroc.pl
           www.ai.pwr.wroc.pl
System Architecture
PATSI: Photo Annotation through Similar
 Images with Annotation Length Optimization

     Automatic Image Annotation describes previously unseen image
      with a set of keywords from the semantic dictionary.

     Hypothesis: Similar images should share similar annotations.

     Model:
       Image is described by a set of visual features.

        Each visual feature is composed of low-level attributes
         (shape, color, edges, texture).




[1] M. Stanek, G. Paradowski, H. Kwasnicka, PATSI — Photo Annotation through Finding Similar Images with Multivariate Gaussian Models,
Lecture Notes in Computer Science, 2010, Vol. 6375, pp. 284-291.

[2] B. Broda, M. Stanek, G. Paradowski, H. Kwasnicka, ImageCLEF 2008 (the method was ranked 5 among 18 participants in one of the
Model
 Image model: Multivariate Gaussian Distribution




 Jensen-Shannon Divergence between image A and B is de fined as:




 Kullback - Leibler divergence:
Content Based Image Retrieval

 1. Similaris : a web-based system for gathering similarity measures for a
 given image database.

 2. Visible : a web-based system for evaluation of the CBIR system.



                           Author: Bartek Dzienkowski, M.Sc.
Results


          Grzegorz Paradowski, Marlena Ochocinska
Model-free Detection of Near-
 Duplicate Fragments
 Problem definition:

Given two random images (i.e. without any a’priori knowledge
 about their content) identify pairs of visually similar, near-
 duplicate image fragments that are related by afficine
 transformations. The fragments are formed by sets of matched
 keypoint pairs satisfying the same affine transformation.

   Features analysed: Hessian-Laplace, Harris-Affine, MSER, SURFT and SIFT descriptors.
 Calculate and decompose the affine transformation for each pair of matched triangles.
 Build the histogram of parameters for the decomposed affine transformations.
 Detect and post-process the high density areas (peaks) in the histogram to build the
  near-duplicate fragments in both images.




[1] Mariusz Paradowski, Andrzej Sluzek, Matching Planar Objects of Images using Histograms of Decomposed Affine Transforms. Submitted to Pattern
Recognition.
[2] Mariusz Paradowski, Andrzej Sluzek, Detection of Image Fragments Related by Affine Transforms: Matching Triangles and Ellipses, ICISA 2010, Korea.
Results




  Road sign                Beer cans   Box and a bottle




    Different side of a tower          Landscape
The topological method – topological
graph [1]

       Key points are detected and paired




                                            13
The topological method – topological
graph [2]

         Spatial neighbors are found




                                       14
The topological method – topological
graph [3]

      Topological constraints are verified




                                             15
The topological method – topological
graph [4]

        Nodes and edges are removed




                                      16
Results – topological matching




    Two objects                Same text                Deformed book




Same scene, some differences               Different camera position
                                                                        17
QSAR Retrieval
Fuzzy Image Retrieval
     Automatic Image Annotation describes previously unseen image
      with a set of keywords from the semantic dictionary.

     Hypothesis: Similar images should share similar annotations.

     Model:
       Image is described by a set of visual features.

       Each visual feature is composed of low-level attributes
        (shape, color, edges, texture).




[1] PATSI — Photo Annotation through Finding Similar Images with Multivariate Gaussian Models, Lecture Notes in Computer Science,
2010, Vol. 6375, pp. 284-291.
Institute of Informatics
One of the leading Polish IT institutes.

Main activity : artificial intelligence, machine
 learning, computer vision, data mining.

 Budget: 4.5M EUR (national grants, EU funds)

Collaboration
  LMC at ETH Zurich (Image Processing)
  Industry: Microsoft, IBM, Volvo, Google.
  Research: TU Munchen, TU Dresden, xxx?
Team
  Prof. Halina Kwaśnicka                            Michalak Krzysztof, Ph.D.
   Assoc Prof. at WUT, Poland                         Expertise in Machine Learning.
   Deputy Director of Institute of Informatics
   Head of the AI Division. Project Leader.          Paweł Myszkowski
                                                      Assistant Prof. at WUT, Poland
  Prof. Urszula Markowska-Kaczmar                    Expertise in Data Mining and Evolutionary
   Assoc Prof. at WUT.                                Algorithms.
   Expertise in Computational Intelligence, Neural
   Networks.
                                                     Martin Tabakow
  Elzbieta Hudyma                                    Assistant Prof. at WUT, Poland
   Assistant Prof. at WUT, Poland                     Expertise in Machine Learning and Analysis of
   Expertise in Computer Graphics, Machine            biomedical images.
   Learning.
                                                     Bartłomiej Broda
  Mariusz Paradowski                                 Pre-doc at WUT, Poland
   PhD, Post-doc at Nanyang Technological             Expertise in Machine Learning, CBIR systems
   University, Singapur.                              and image annotations.
   Assistant Profesor at WUT, Poland
   Expertise in Machine Learning and Computer        Wojciech Tarnawski
   Vision.                                            PhD, Post-doc in LTNT at ETH Zurich
                                                      Assistant Prof. at WUT, Poland
  Michał Stanek                                      Expertise in Machine Learning and Computer
   Pre-doc at WUT, Poland                             Vision
   Expertise in Machine Learning, CBIR systems and
   Computer Vision.                                  Lukasz Miroslaw
                                                      Assistant Prof. at WUT, Poland
                                                      Expertise in Machine Learning, Computer
Master Students: Grzegorz Terlikowski, Sylwester      Vision, Bioinformatics.
Plamowski, Bartek Dzienkowski, Marlena
Ochocinska, Agnieszka Glebala, Lukasz Jercinski
Transfer Annotator




              Author: Bartek Dzienkowski, M.Sc.
Transfer Annotator




                                        Author: Bartek Dzienkowski, M.Sc.




       Tests performed on MGV 2006 database.
Transfer Annotator




                                         Author: Bartek Dzienkowski, M.Sc.




       Tests performed on ICPR 2004 database.
Transfer Annotator




       Tests performed on IAPR TC 12 database.
Transfer Annotator



                                   Results



  Grzegorz Terlikowski, Marlena Ochocinska, Wojciech Tarnawski and Lukasz Miroslaw
Model-free Detection of Near-
 Duplicate Fragments
 Problem definition:

Given two random images (i.e. without any a’priori knowledge
 about their content) identify pairs of visually similar, near-
 duplicate image fragments that are related by affine
 transformations. The fragments are formed by sets of matched
 keypoint pairs satisfying the same affine transformation.

   Features analysed: Hessian-Laplace, Harris-Affine, MSER, SURFT and SIFT descriptors.
 Calculate and decompose the affine transformation for each pair of matched triangles.
 Build the histogram of parameters for the decomposed affine transformations.
 Detect and post-process the high density areas (peaks) in the histogram to build the
  near-duplicate fragments in both images.




[1] Mariusz Paradowski, Andrzej Sluzek, Matching Planar Objects of Images using Histograms of Decomposed Affine Transforms. Submitted to peer-
reviewed journal.
[2] Mariusz Paradowski, Andrzej Sluzek, Detection of Image Fragments Related by Affine Transforms: Matching Triangles and Ellipses, ICISA 2010, Korea.
Results

 High precision, lower recall.
 Real-time analysis is possible.
 Method was tested on 3 databases:
   in-house data based with 100 out/indoor images containing
    objects acquired in different conditions (4950 image pairs)
   Faces category of Caltech101 (180K image pairs)
  Oxford5K (270K image pairs)
Results




  Road sign                Beer cans   Box and a bottle




    Different side of a tower          Landscape
Results

                     HarAff   HesLap   MSER
Quality measure                               SURF
                      SIFT     SIFT    SIFT



   Precision
    [object]         0.96     0.95     0.95   0.97


Recall [object]      0.82     0.70     0.74   0.62


Precision [area]     0.95     0.93     0.94   0.90


 Recall [area]       0.65     0.51     0.50   0.50
     Tested on XXX
Results
     Coverage
     face area        10                     20                      30                      40         50
        [%]

    Precision
    [object]     0.48                0.71                   0.89                    0.95              0.96

    Recall
    [object]     0.87                0.84                   0.81                    0.78              0.72

                  Tab. Face identification. Tested on 188790 image pairs (Calltech101 face category)
Results


                                                                        Without ROI                                   With ROI


                        Precision [object]                                   0.88                                       0.94

                           Recall [object]                                   0.56                                       0.54

                                                        Tab. Image Retrieval. Tested on 276970 image pairs (Oxford5K)




Michal Stanek, Oskar Maier, Halina Kwasnicka, "Wroclaw University of Technology Participation at ImageCLEF 2010 Photo Annotation Track", Conf. on Multilingual and Multimodal Information Access
Evaluation, 20-23 Sep 2010, Padva, Italy.

 H. Kwaśnicka, M. Paradowski, M. Stanek, M. Spytkowski, A. Śluzek , Intelligent approaches to searching similar images on the basis of visual content, ICI 2010, 10th Int.Conf on Information at Delta Universit
for Science and Technology.
Test it yourself!
           www.ai.pwr.wroc.pl/similaris
Project proposals #1


 Let’s look inside PubMed

   Consortium:
     BIOTEC (Prof. Schroeder): www.gopubmed.com
     Wroclaw University of Technology (CBIR)
     ETH Zurich (Computer Vision)



 Objective: A semantic search engine for life
  science images.
Project proposals #2
 Automated Sign Language Recognition (ASLR)

   Consortium:
     xxx
     Wroclaw University of Technology (Computer Vision, AI, Machine
      Learning)
     ETH Zurich (Computer Vision, Gesture Analysis)

   Objective: search for the publications and


 Objective: To develop a translation system to
  make it possible to communicate with deaf
  people.

 Problems: sign language is not international,
  three types of gestures: finger spelling, word-
  level sign vocabulary and tongue, body and
  mouth position.
Content Based Image Retrieval
The topological method – topological
graph [1]

       Key points are detected and paired




                                            13
The topological method – topological
graph [2]

         Spatial neighbors are found




                                       14
The topological method – topological
graph [3]

      Topological constraints are verified




                                             15
The topological method – topological
graph [4]

        Nodes and edges are removed




                                      16
Results – topological matching




    Two objects                Same text                Deformed book




Same scene, some differences               Different camera position
                                                                        17

More Related Content

What's hot

Content Based Image and Video Retrieval Algorithm
Content Based Image and Video Retrieval AlgorithmContent Based Image and Video Retrieval Algorithm
Content Based Image and Video Retrieval Algorithm
Akshit Bum
 
Content Based Image Retrieval
Content Based Image RetrievalContent Based Image Retrieval
Content Based Image Retrieval
Léo Vetter
 

What's hot (20)

CBIR
CBIRCBIR
CBIR
 
CBIR MIni project2
CBIR MIni project2CBIR MIni project2
CBIR MIni project2
 
CBIR For Medical Imaging...
CBIR For Medical  Imaging...CBIR For Medical  Imaging...
CBIR For Medical Imaging...
 
Color and texture based image retrieval
Color and texture based image retrievalColor and texture based image retrieval
Color and texture based image retrieval
 
Image Indexing and Retrieval
Image Indexing and RetrievalImage Indexing and Retrieval
Image Indexing and Retrieval
 
Content Based Image Retrieval
Content Based Image RetrievalContent Based Image Retrieval
Content Based Image Retrieval
 
Cbir final ppt
Cbir final pptCbir final ppt
Cbir final ppt
 
Content Based Image and Video Retrieval Algorithm
Content Based Image and Video Retrieval AlgorithmContent Based Image and Video Retrieval Algorithm
Content Based Image and Video Retrieval Algorithm
 
Content based image retrieval
Content based image retrievalContent based image retrieval
Content based image retrieval
 
IEEE Projects 2014-2015
IEEE Projects 2014-2015IEEE Projects 2014-2015
IEEE Projects 2014-2015
 
Image search engine
Image search engineImage search engine
Image search engine
 
Content Based Image Retrieval
Content Based Image RetrievalContent Based Image Retrieval
Content Based Image Retrieval
 
CBIR by deep learning
CBIR by deep learningCBIR by deep learning
CBIR by deep learning
 
Gi3411661169
Gi3411661169Gi3411661169
Gi3411661169
 
CBIR
CBIRCBIR
CBIR
 
Content Based Image Retrieval
Content Based Image RetrievalContent Based Image Retrieval
Content Based Image Retrieval
 
Content Based Image Retrieval
Content Based Image RetrievalContent Based Image Retrieval
Content Based Image Retrieval
 
CBIR Processing Approach on Colored and Texture Images using KNN Classifier a...
CBIR Processing Approach on Colored and Texture Images using KNN Classifier a...CBIR Processing Approach on Colored and Texture Images using KNN Classifier a...
CBIR Processing Approach on Colored and Texture Images using KNN Classifier a...
 
Multimedia content based retrieval slideshare.ppt
Multimedia content based retrieval slideshare.pptMultimedia content based retrieval slideshare.ppt
Multimedia content based retrieval slideshare.ppt
 
Ijaems apr-2016-16 Active Learning Method for Interactive Image Retrieval
Ijaems apr-2016-16 Active Learning Method for Interactive Image RetrievalIjaems apr-2016-16 Active Learning Method for Interactive Image Retrieval
Ijaems apr-2016-16 Active Learning Method for Interactive Image Retrieval
 

Similar to Content-based Image Retrieval

Artificial neural network based cancer cell classification
Artificial neural network based cancer cell classificationArtificial neural network based cancer cell classification
Artificial neural network based cancer cell classification
Alexander Decker
 
11.artificial neural network based cancer cell classification
11.artificial neural network based cancer cell classification11.artificial neural network based cancer cell classification
11.artificial neural network based cancer cell classification
Alexander Decker
 

Similar to Content-based Image Retrieval (20)

HIGH RESOLUTION MRI BRAIN IMAGE SEGMENTATION TECHNIQUE USING HOLDER EXPONENT
HIGH RESOLUTION MRI BRAIN IMAGE SEGMENTATION TECHNIQUE USING HOLDER EXPONENTHIGH RESOLUTION MRI BRAIN IMAGE SEGMENTATION TECHNIQUE USING HOLDER EXPONENT
HIGH RESOLUTION MRI BRAIN IMAGE SEGMENTATION TECHNIQUE USING HOLDER EXPONENT
 
High Resolution Mri Brain Image Segmentation Technique Using Holder Exponent
High Resolution Mri Brain Image Segmentation Technique Using Holder Exponent  High Resolution Mri Brain Image Segmentation Technique Using Holder Exponent
High Resolution Mri Brain Image Segmentation Technique Using Holder Exponent
 
A Hybrid Data Analysis And Mesh Refinement Paradigm For Conformal Voxel
A Hybrid Data Analysis And Mesh Refinement Paradigm For Conformal VoxelA Hybrid Data Analysis And Mesh Refinement Paradigm For Conformal Voxel
A Hybrid Data Analysis And Mesh Refinement Paradigm For Conformal Voxel
 
International Journal of Computational Engineering Research(IJCER)
 International Journal of Computational Engineering Research(IJCER)  International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)
 
Texture-Based Computational Models of Tissue in Biomedical Images: Initial Ex...
Texture-Based Computational Models of Tissue in Biomedical Images: Initial Ex...Texture-Based Computational Models of Tissue in Biomedical Images: Initial Ex...
Texture-Based Computational Models of Tissue in Biomedical Images: Initial Ex...
 
Artificial neural network based cancer cell classification
Artificial neural network based cancer cell classificationArtificial neural network based cancer cell classification
Artificial neural network based cancer cell classification
 
11.artificial neural network based cancer cell classification
11.artificial neural network based cancer cell classification11.artificial neural network based cancer cell classification
11.artificial neural network based cancer cell classification
 
A SIMPLE APPROACH FOR RELATIVELY AUTOMATED HIPPOCAMPUS SEGMENTATION FROM SAGI...
A SIMPLE APPROACH FOR RELATIVELY AUTOMATED HIPPOCAMPUS SEGMENTATION FROM SAGI...A SIMPLE APPROACH FOR RELATIVELY AUTOMATED HIPPOCAMPUS SEGMENTATION FROM SAGI...
A SIMPLE APPROACH FOR RELATIVELY AUTOMATED HIPPOCAMPUS SEGMENTATION FROM SAGI...
 
International Journal of Engineering and Science Invention (IJESI)
International Journal of Engineering and Science Invention (IJESI)International Journal of Engineering and Science Invention (IJESI)
International Journal of Engineering and Science Invention (IJESI)
 
Optimized Neural Network for Classification of Multispectral Images
Optimized Neural Network for Classification of Multispectral ImagesOptimized Neural Network for Classification of Multispectral Images
Optimized Neural Network for Classification of Multispectral Images
 
OBJECT DETECTION AND RECOGNITION: A SURVEY
OBJECT DETECTION AND RECOGNITION: A SURVEYOBJECT DETECTION AND RECOGNITION: A SURVEY
OBJECT DETECTION AND RECOGNITION: A SURVEY
 
Whole brain optical imaging
Whole brain optical imagingWhole brain optical imaging
Whole brain optical imaging
 
C1103041623
C1103041623C1103041623
C1103041623
 
JPM1410 Images as Occlusions of Textures: A Framework for Segmentation
JPM1410   Images as Occlusions of Textures: A Framework for SegmentationJPM1410   Images as Occlusions of Textures: A Framework for Segmentation
JPM1410 Images as Occlusions of Textures: A Framework for Segmentation
 
A survey of deep learning approaches to medical applications
A survey of deep learning approaches to medical applicationsA survey of deep learning approaches to medical applications
A survey of deep learning approaches to medical applications
 
S04405107111
S04405107111S04405107111
S04405107111
 
BRAINREGION.pptx
BRAINREGION.pptxBRAINREGION.pptx
BRAINREGION.pptx
 
F0522833
F0522833F0522833
F0522833
 
Human Re-identification with Global and Local Siamese Convolution Neural Network
Human Re-identification with Global and Local Siamese Convolution Neural NetworkHuman Re-identification with Global and Local Siamese Convolution Neural Network
Human Re-identification with Global and Local Siamese Convolution Neural Network
 
Fuzzy c-means
Fuzzy c-meansFuzzy c-means
Fuzzy c-means
 

More from University of Zurich

Evolution-based Reaction Path Following
Evolution-based Reaction Path FollowingEvolution-based Reaction Path Following
Evolution-based Reaction Path Following
University of Zurich
 

More from University of Zurich (7)

Blood Flow Simulations in the Cloud
Blood Flow Simulations in the CloudBlood Flow Simulations in the Cloud
Blood Flow Simulations in the Cloud
 
Multimodal Image Processing in Cytology
Multimodal Image Processing in CytologyMultimodal Image Processing in Cytology
Multimodal Image Processing in Cytology
 
Evolutionary-driven Optimization in Computational Chemistry
Evolutionary-driven Optimization in Computational ChemistryEvolutionary-driven Optimization in Computational Chemistry
Evolutionary-driven Optimization in Computational Chemistry
 
Evolution-based Reaction Path Following
Evolution-based Reaction Path FollowingEvolution-based Reaction Path Following
Evolution-based Reaction Path Following
 
Artificial Intelligence in High Content Screening and Cervical Cancer Diagnosis
Artificial Intelligence in High Content Screening and Cervical Cancer DiagnosisArtificial Intelligence in High Content Screening and Cervical Cancer Diagnosis
Artificial Intelligence in High Content Screening and Cervical Cancer Diagnosis
 
SpeedIT : GPU-based acceleration of sparse linear algebra
SpeedIT : GPU-based acceleration of sparse linear algebraSpeedIT : GPU-based acceleration of sparse linear algebra
SpeedIT : GPU-based acceleration of sparse linear algebra
 
SpeedIT FLOW
SpeedIT FLOWSpeedIT FLOW
SpeedIT FLOW
 

Recently uploaded

Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
ZurliaSoop
 
Spellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please PractiseSpellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please Practise
AnaAcapella
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
heathfieldcps1
 

Recently uploaded (20)

Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdf
 
PROCESS RECORDING FORMAT.docx
PROCESS      RECORDING        FORMAT.docxPROCESS      RECORDING        FORMAT.docx
PROCESS RECORDING FORMAT.docx
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptx
 
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptxSKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...
 
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
 
Unit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxUnit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptx
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptx
 
psychiatric nursing HISTORY COLLECTION .docx
psychiatric  nursing HISTORY  COLLECTION  .docxpsychiatric  nursing HISTORY  COLLECTION  .docx
psychiatric nursing HISTORY COLLECTION .docx
 
Spatium Project Simulation student brief
Spatium Project Simulation student briefSpatium Project Simulation student brief
Spatium Project Simulation student brief
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...
 
SOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning PresentationSOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning Presentation
 
ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
 
Spellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please PractiseSpellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please Practise
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
 

Content-based Image Retrieval

  • 1. Content based image retrieval in large image databases Lukasz Miroslaw, Ph.D. , Wojciech Tarnawski, Ph.D. Institute of Informatics Wroclaw University of Technology, Poland
  • 2. Contents  Introduction. Past Projects.  Content-based image retrieval.  Collaboration opportunities.
  • 3. Division of Artificial Inteligence Competences:  Optimization Techniques: Swarm Optimization, Tabu Search, Simulated Annealing, Evolutionary Algorithms.  Machine Learning.  Data Mining.  Computer Vision.  Applications: stock market, CBIR, biology.
  • 4. How complex is the universe? Fig. Credit: Wikipedia. Hubble Ultra Deep Field image of a region of the observable universe (equivalent sky area size shown in bottom left corner), near the constellation Fornax. Each spot is a galaxy, consisting of billions of stars. The light from the smallest, most redshifted galaxies is thought to have originated roughly 13 billion years
  • 5. Why Do We Need Metaheuristics? Definition: Metaheuristic designates a computational method that optimizes a problem by iteratively trying to improve a candidate solution with regard to a given measure of quality (Wikipedia). •Stochastic methods: simulated annealing, evolutionary algorithms, ant colony optimization. •Deterministic methods: gradient methods, tabu search, dynamic programming, etc.
  • 6. Evolutionary Algorithm Fig. Scheme of EA.
  • 7. Is Selection of the best the best ? Prof. Roman Galar, Dr Artur Chorazyczewski, Prof. Iwona Karcz-Duleba
  • 8. Content Based Image Retrieval Project funded by Polish-Singapur Programme. Duration: 3 years until March 2011. Objective: Content-based Image Retrieval in large databases. Problem: What does it mean “similar”? How to define a similarity measures between two images?
  • 9. Brain’s visual circuits do error correction on the fly Prediction Coding. Hierarchical Layers detect objects in the bottom-up manner. Final Objects Prediction error More Complex Shapes / Textures / Detailed Geometry Horizontal / Vertical Lines / Basic Geometrical Shapes Credit: Physorg.org Egner T.,Monti J.M., Summerfield C., Expectation and Surprise Determine Neural Population Responses in the Ventral Visual Stream, J Neuroscience 2010, 30(49):16601-16608.
  • 10. Multi-Scale Approach : Anisotropic diffusion Here, a number of images are generated in the process of convolving the original image with the Gaussian kernel with the t-variance (scale): The set of derivative images I(x,y,t) is equivalent to concurrent solutions of the heat transport problem or diffusion on the plane. Since, the convolution operation smoothes region boundaries we decided to use edge- preserving isotropic diffusion:
  • 11. Mean-Shift Segmentation The ”mean-shift”-based image segmentation algorithm(3) is a non-parametric clustering in 5D image space (3D color space + 2D planar space). The method does not require to know the number and the shape of clusters. (3) D. Comaniciu and P. Meer, “Mean shift: A robust approach toward feature space analysis” IEEE Trans. Pattern Anal. Mach. Intell., vol. 24, no. 5, pp. 603–619, 2002.
  • 12. The Framework The aim of the proposed segmentation is to partition the image into non-overlapping most – meaningful image-regions to receive a very generalized (coarse) view of natural images. 1) Input image 2) Isotropic diffusion step 3) Mean-shift based segmentation 4) Accumulating the clustering results over all scales 5) Mode based clustering of values collected in accumulator space 6) „Visualization” of the clusters detected in previous step - mapping input pixels into set of cluster-labels 7) Region merging taking into account region size and planar co-occurency 8) Output image
  • 13. Gestalt Theory in Segmentation „Visualization” of clusters detected in the The most-meaningful (merged) accumulator space : Grouping laws regions: Collaboration laws
  • 14. Image Retrieval System Image database GEN-SEG image segmentation Image-to-image Calculation of MPEG-7-based features for segmented regions similarity calculation Calculation of MPEG-7-based features for segmented regions GEN-SEG image segmentation List of the most similar Query Image images
  • 15. Descriptors Used The Moving Picture Experts Group (MPEG) defined few visual descriptors and the appropriate distance measures (MPEG-7 standard): Scalable Color Descriptor – color histogram in HSV color space that is encoded by a Haar transform. Color Layout Descriptor – represents the spatial distribution of the color of visual signals. Edge Histogram – represents the spatial distribution of five types of edges, namely four directional edges and one non-directional Texture Browsing – 5D descriptor related to perceptual characterization of texture in terms of regularity , coarseness and directionality Region Shape – describes the shape of an object in image. The descriptor is robust to noise that may be introduced in the process of segmentation
  • 16. Image Retrieval Concept We have defined the measure of the similarity between images by composition of distance metrics defined in MPEG-7 standard. The procedure consists of the following steps: 1.Calculate region-to-region distances between the query image and images in the database for chosen set of MPEG-7 descriptors. 2.Sort the list of distances for all regions from query-image. Democracy: each region belongs to a certain image in the database and votes for it. 3.Accumulate the votes and pick the best candidates. Preliminary results: the method outperforms grid-based segmentation IR (min. F-score = 0.34 on MGV Database)
  • 17. Image-based Genome-scale RNAi Profiling Objective: Identification of genes in cell division process by esiRNA gene silencing method. •Initial Object-identification based on template matching. •Post-processing of candidate solutions: Evolutionary Algorithm (soft-selection, gaussian mutation).
  • 18. Genome-wide High-Content Screening Image-based analysis of > 1300 genes’ phenotypes Identification of > 300 novel genes’ functions. Fig. GUI Interface of esiImage. Fig. Detection of mitotic cells in Phase-Contrast Microscopy. HeLa Cell Line. esiImage: Java-based toolkit for automatic detection of phenotypes. Author: Lukasz Miroslaw (2006) Credit: Dr Artur Chorazyczewski
  • 19. Genome-wide High-Content Screening Fig. Mitotic Index for CDC16 and control. Kittler R, Pelletier L, Heninger AK, Slabicki M, Theis M, Miroslaw L, Poser I, Lawo S, Grabner H, Kozak K, Wagner J, Surendranath V, Richter C, Bowen W, Jackson AL, Habermann B, Hyman AA, Buchholz F. Genome-scale RNAi profiling of cell division in human tissue culture cells. Nature Cell Biol. 2007 Dec;9(12):1401-12.
  • 20. Rapid Cervical Cancer Diagnosis Fig. Image partition. 80 Statistical Geometrical Features. Learning and Testing Phase. Fig. Phase-Contrast Image with endothelial cells of interest. Sequential Forward-Backward Dresden Technical University, Prof. Fuchs Image Processing Group Selection. kNN, Linear Fisher Discriminant. T. Schilling, L. Mirosław, G. Głąb, M. Smereka. Towards rapid cervical cancer diagnosis: automated Post-processing. detection and classification of pathological cells in phase contrast images 2007. Int. J. Gynec Cancer 17(1):118-26.
  • 39. Project proposals #1 Let’s look inside PubMed  Consortium:  BIOTEC (Prof. Schroeder): www.gopubmed.com  Wroclaw University of Technology (CBIR)  ETH Zurich ? (Computer Vision) Objective: A semantic search engine for life science images.
  • 40. Molecular Retrieval System Prof. Kim Baldridge Lab
  • 41. knAIght - AI in KNIME
  • 42. Computer Vision meets AI Wroclaw University of Technology Institute of Informatics Wyb. Wyspianskiego 27 Wroclaw, Poland info@ai.pwr.wroc.pl www.ai.pwr.wroc.pl
  • 44. PATSI: Photo Annotation through Similar Images with Annotation Length Optimization  Automatic Image Annotation describes previously unseen image with a set of keywords from the semantic dictionary.  Hypothesis: Similar images should share similar annotations.  Model:  Image is described by a set of visual features.  Each visual feature is composed of low-level attributes (shape, color, edges, texture). [1] M. Stanek, G. Paradowski, H. Kwasnicka, PATSI — Photo Annotation through Finding Similar Images with Multivariate Gaussian Models, Lecture Notes in Computer Science, 2010, Vol. 6375, pp. 284-291. [2] B. Broda, M. Stanek, G. Paradowski, H. Kwasnicka, ImageCLEF 2008 (the method was ranked 5 among 18 participants in one of the
  • 45. Model Image model: Multivariate Gaussian Distribution Jensen-Shannon Divergence between image A and B is de fined as: Kullback - Leibler divergence:
  • 46. Content Based Image Retrieval 1. Similaris : a web-based system for gathering similarity measures for a given image database. 2. Visible : a web-based system for evaluation of the CBIR system. Author: Bartek Dzienkowski, M.Sc.
  • 47. Results Grzegorz Paradowski, Marlena Ochocinska
  • 48. Model-free Detection of Near- Duplicate Fragments  Problem definition: Given two random images (i.e. without any a’priori knowledge about their content) identify pairs of visually similar, near- duplicate image fragments that are related by afficine transformations. The fragments are formed by sets of matched keypoint pairs satisfying the same affine transformation. Features analysed: Hessian-Laplace, Harris-Affine, MSER, SURFT and SIFT descriptors.  Calculate and decompose the affine transformation for each pair of matched triangles.  Build the histogram of parameters for the decomposed affine transformations.  Detect and post-process the high density areas (peaks) in the histogram to build the near-duplicate fragments in both images. [1] Mariusz Paradowski, Andrzej Sluzek, Matching Planar Objects of Images using Histograms of Decomposed Affine Transforms. Submitted to Pattern Recognition. [2] Mariusz Paradowski, Andrzej Sluzek, Detection of Image Fragments Related by Affine Transforms: Matching Triangles and Ellipses, ICISA 2010, Korea.
  • 49. Results Road sign Beer cans Box and a bottle Different side of a tower Landscape
  • 50. The topological method – topological graph [1] Key points are detected and paired 13
  • 51. The topological method – topological graph [2] Spatial neighbors are found 14
  • 52. The topological method – topological graph [3] Topological constraints are verified 15
  • 53. The topological method – topological graph [4] Nodes and edges are removed 16
  • 54. Results – topological matching Two objects Same text Deformed book Same scene, some differences Different camera position 17
  • 56. Fuzzy Image Retrieval  Automatic Image Annotation describes previously unseen image with a set of keywords from the semantic dictionary.  Hypothesis: Similar images should share similar annotations.  Model:  Image is described by a set of visual features.  Each visual feature is composed of low-level attributes (shape, color, edges, texture). [1] PATSI — Photo Annotation through Finding Similar Images with Multivariate Gaussian Models, Lecture Notes in Computer Science, 2010, Vol. 6375, pp. 284-291.
  • 57. Institute of Informatics One of the leading Polish IT institutes. Main activity : artificial intelligence, machine learning, computer vision, data mining.  Budget: 4.5M EUR (national grants, EU funds) Collaboration  LMC at ETH Zurich (Image Processing)  Industry: Microsoft, IBM, Volvo, Google.  Research: TU Munchen, TU Dresden, xxx?
  • 58. Team  Prof. Halina Kwaśnicka Michalak Krzysztof, Ph.D. Assoc Prof. at WUT, Poland Expertise in Machine Learning. Deputy Director of Institute of Informatics Head of the AI Division. Project Leader. Paweł Myszkowski Assistant Prof. at WUT, Poland  Prof. Urszula Markowska-Kaczmar Expertise in Data Mining and Evolutionary Assoc Prof. at WUT. Algorithms. Expertise in Computational Intelligence, Neural Networks. Martin Tabakow  Elzbieta Hudyma Assistant Prof. at WUT, Poland Assistant Prof. at WUT, Poland Expertise in Machine Learning and Analysis of Expertise in Computer Graphics, Machine biomedical images. Learning. Bartłomiej Broda  Mariusz Paradowski Pre-doc at WUT, Poland PhD, Post-doc at Nanyang Technological Expertise in Machine Learning, CBIR systems University, Singapur. and image annotations. Assistant Profesor at WUT, Poland Expertise in Machine Learning and Computer Wojciech Tarnawski Vision. PhD, Post-doc in LTNT at ETH Zurich Assistant Prof. at WUT, Poland  Michał Stanek Expertise in Machine Learning and Computer Pre-doc at WUT, Poland Vision Expertise in Machine Learning, CBIR systems and Computer Vision. Lukasz Miroslaw Assistant Prof. at WUT, Poland Expertise in Machine Learning, Computer Master Students: Grzegorz Terlikowski, Sylwester Vision, Bioinformatics. Plamowski, Bartek Dzienkowski, Marlena Ochocinska, Agnieszka Glebala, Lukasz Jercinski
  • 59. Transfer Annotator Author: Bartek Dzienkowski, M.Sc.
  • 60. Transfer Annotator Author: Bartek Dzienkowski, M.Sc. Tests performed on MGV 2006 database.
  • 61. Transfer Annotator Author: Bartek Dzienkowski, M.Sc. Tests performed on ICPR 2004 database.
  • 62. Transfer Annotator Tests performed on IAPR TC 12 database.
  • 63. Transfer Annotator Results Grzegorz Terlikowski, Marlena Ochocinska, Wojciech Tarnawski and Lukasz Miroslaw
  • 64. Model-free Detection of Near- Duplicate Fragments  Problem definition: Given two random images (i.e. without any a’priori knowledge about their content) identify pairs of visually similar, near- duplicate image fragments that are related by affine transformations. The fragments are formed by sets of matched keypoint pairs satisfying the same affine transformation. Features analysed: Hessian-Laplace, Harris-Affine, MSER, SURFT and SIFT descriptors.  Calculate and decompose the affine transformation for each pair of matched triangles.  Build the histogram of parameters for the decomposed affine transformations.  Detect and post-process the high density areas (peaks) in the histogram to build the near-duplicate fragments in both images. [1] Mariusz Paradowski, Andrzej Sluzek, Matching Planar Objects of Images using Histograms of Decomposed Affine Transforms. Submitted to peer- reviewed journal. [2] Mariusz Paradowski, Andrzej Sluzek, Detection of Image Fragments Related by Affine Transforms: Matching Triangles and Ellipses, ICISA 2010, Korea.
  • 65. Results  High precision, lower recall.  Real-time analysis is possible.  Method was tested on 3 databases:  in-house data based with 100 out/indoor images containing objects acquired in different conditions (4950 image pairs)  Faces category of Caltech101 (180K image pairs)  Oxford5K (270K image pairs)
  • 66. Results Road sign Beer cans Box and a bottle Different side of a tower Landscape
  • 67. Results HarAff HesLap MSER Quality measure SURF SIFT SIFT SIFT Precision [object] 0.96 0.95 0.95 0.97 Recall [object] 0.82 0.70 0.74 0.62 Precision [area] 0.95 0.93 0.94 0.90 Recall [area] 0.65 0.51 0.50 0.50 Tested on XXX
  • 68. Results Coverage face area 10 20 30 40 50 [%] Precision [object] 0.48 0.71 0.89 0.95 0.96 Recall [object] 0.87 0.84 0.81 0.78 0.72 Tab. Face identification. Tested on 188790 image pairs (Calltech101 face category)
  • 69. Results Without ROI With ROI Precision [object] 0.88 0.94 Recall [object] 0.56 0.54 Tab. Image Retrieval. Tested on 276970 image pairs (Oxford5K) Michal Stanek, Oskar Maier, Halina Kwasnicka, "Wroclaw University of Technology Participation at ImageCLEF 2010 Photo Annotation Track", Conf. on Multilingual and Multimodal Information Access Evaluation, 20-23 Sep 2010, Padva, Italy. H. Kwaśnicka, M. Paradowski, M. Stanek, M. Spytkowski, A. Śluzek , Intelligent approaches to searching similar images on the basis of visual content, ICI 2010, 10th Int.Conf on Information at Delta Universit for Science and Technology.
  • 70. Test it yourself! www.ai.pwr.wroc.pl/similaris
  • 71. Project proposals #1 Let’s look inside PubMed  Consortium:  BIOTEC (Prof. Schroeder): www.gopubmed.com  Wroclaw University of Technology (CBIR)  ETH Zurich (Computer Vision) Objective: A semantic search engine for life science images.
  • 72. Project proposals #2 Automated Sign Language Recognition (ASLR)  Consortium:  xxx  Wroclaw University of Technology (Computer Vision, AI, Machine Learning)  ETH Zurich (Computer Vision, Gesture Analysis)  Objective: search for the publications and Objective: To develop a translation system to make it possible to communicate with deaf people. Problems: sign language is not international, three types of gestures: finger spelling, word- level sign vocabulary and tongue, body and mouth position.
  • 73. Content Based Image Retrieval
  • 74. The topological method – topological graph [1] Key points are detected and paired 13
  • 75. The topological method – topological graph [2] Spatial neighbors are found 14
  • 76. The topological method – topological graph [3] Topological constraints are verified 15
  • 77. The topological method – topological graph [4] Nodes and edges are removed 16
  • 78. Results – topological matching Two objects Same text Deformed book Same scene, some differences Different camera position 17

Editor's Notes

  1. \n
  2. \n
  3. \n
  4. \n
  5. \n
  6. \n
  7. \n
  8. \n
  9. Gestalt theory first arose in 1890 as a reaction to the prevalent psychological theory of the time - atomism. Atomism examined parts of things with the idea that these parts could then be put back together to make wholes. Atomists believed the nature of things to be absolute and not dependent on context. Gestalt theorists, on the other hand, were intrigued by the way our mind perceives wholes out of incomplete elements [1, 2]. "To the Gestaltists, things are affected by where they are and by what surrounds them...so that things are better described as "more than the sum of their parts."" [1, p. 49]. Gestaltists believed that context was very important in perception. An essay by Christian von Ehrenfels discussed this belief using a musical example. Take a 12 note melody. Play it in one key, say the key of C. Now change to another key, say the key of A flat. There might not be any notes the same in the two songs, yet a person listening to it knows that it is the same tune. It is the relationships between the notes that give us the tune, the whole, not which notes make up the tune.\n\n
  10. \n
  11. \n
  12. \n
  13. \n
  14. \n
  15. \n
  16. \n
  17. \n
  18. \n
  19. \n
  20. \n
  21. \n
  22. \n
  23. \n
  24. \n
  25. \n
  26. \n
  27. \n
  28. \n
  29. \n
  30. \n
  31. \n
  32. \n
  33. \n
  34. \n
  35. \n
  36. \n
  37. \n
  38. \n
  39. Screen shots.\n
  40. \n
  41. \n
  42. \n
  43. \n
  44. \n
  45. \n
  46. \n
  47. \n
  48. Near-duplicate fragments usually represents the same objects captured with different external conditions, e.g. position, lighting, camera setup.\n
  49. \n
  50. \n
  51. \n
  52. \n
  53. \n
  54. \n
  55. \n
  56. \n
  57. \n
  58. \n
  59. \n
  60. \n
  61. \n
  62. \n
  63. \n
  64. Near-duplicate fragments usually represents the same objects captured with different external conditions, e.g. position, lighting, camera setup.\n
  65. Near-duplicate fragments usually represents the same objects captured with different external conditions, e.g. position, lighting, camera setup.\n\nRecall is then computed as the fraction of correct instances among all instances that actually belong to the relevant subset, while precision is the fraction of correct instances among those that the algorithm believes to belong to the relevant subset.\n
  66. \n
  67. \n
  68. \n
  69. "MAGMA : efficient method for image annotation in low dimensional feature space based on Multivariate Gaussian Models",Bartosz Broda, Halina Kwaśnicka, Mariusz Paradowski, Michał Stanek, Proceedings of the International Multiconference on Computer Science and Information Technology [Dokument elektroniczny], Mrągowo, Poland, October 12-14, 2009 / M. Ganzha, M. Paprzycki (eds). Katowice : Polskie Towarzystwo Informatyczne. Oddział Górnośląski, 2009. s. 131-138.\n\n\n "PATSI — Photo Annotation through Finding Similar Images with Multivariate Gaussian Models", Michał Stanek, Bartosz Broda i Halina Kwaśnicka, Proceedings of International Conference on Computer Vision and Graphics 2010. To appear: Lecture Notes in Computer Science\n\n PATSI -- Photo annotation through Finding Similar Images with annotion length optimization" Oskar Maier, Michal Stanek, Halina Kwasnicka, Publikacja zgloszona na konferencje International Joint Conference Intelligent Information Systems\n
  70. \n
  71. Screen shots.\n
  72. \n
  73. \n
  74. \n
  75. \n
  76. \n
  77. \n
  78. \n