SlideShare une entreprise Scribd logo
1  sur  23
Télécharger pour lire hors ligne
Dynamic Two-Stage Image Retrieval from
     Large Multimodal Databases

                      Avi Arampatzis
                    Konstantinos Zagoris
                   Savvas Chatzichristofis


    Department of Electrical and Computer Engineering
            Democritus University of Thrace
                  Xanthi 67100, Greece
                         ECIR 2011
Dynamic Two-Stage Image Retrieval from Large Multimodal Databases   Avi Arampatzis, Konstantinos Zagoris, Savvas Chatzichristofis   2


                                                         Outline

1. Content-based Image Retrieval (CBIR): global vs local features, scalability

2. A problem of global features: The Red Tomato / Red Pie-Chart Problem

3. Multimodal Information Collections

4. Two-stage Image Retrieval from Multimedia Databases
     related work
     our main contributions

5. Experiments on the ImageCLEF 2010 Wikipedia Test Collection
Dynamic Two-Stage Image Retrieval from Large Multimodal Databases   Avi Arampatzis, Konstantinos Zagoris, Savvas Chatzichristofis   3


                        Content-based Image Retrieval (CBIR)

 In CBIR, images are represented by either
     local features:
      ¦ computed at multiple image points; capable of recognizing objects
      ¦ computationally expensive, e.g. due to high dimensionality
     global features:
      ¦ generalize images with single vectors capturing color, texture, shape, etc.
      ¦ computationally cheap (relatively)
        ¥ thus, more popular

 CBIR with either global or local features
     does not scale up well to large databases efficiency-wise
Dynamic Two-Stage Image Retrieval from Large Multimodal Databases   Avi Arampatzis, Konstantinos Zagoris, Savvas Chatzichristofis   4


                                     CBIR with Global Features

 notoriously noisy for image queries of low generality
  (generality = the fraction of relevant images in a collection)
     In text retrieval: documents matching no query keywords are not retrieved
     In image retrieval: CBIR will rank the whole collection
      ¦ The Red Tomato / Red Pie-Chart Problem [next]
Dynamic Two-Stage Image Retrieval from Large Multimodal Databases   Avi Arampatzis, Konstantinos Zagoris, Savvas Chatzichristofis   5


                    The Red Tomato / Red Pie-Chart Problem




  Image query:                                        Collection items:



 If the query has a low generality
     early ranks may be dominated by spurious results (such as the pie-chart)
     possibly ranked even before red tomatoes on non-white backgrounds
Dynamic Two-Stage Image Retrieval from Large Multimodal Databases   Avi Arampatzis, Konstantinos Zagoris, Savvas Chatzichristofis   7


                         Contemporary Information Collections

 Large

 Multimodal (= provide different manners of retrieval )


A good example is Wikipedia:

 Large:
     3,605,426 articles (English version alone), “millions of images”, etc.

 Multimodal:
     Topics are covered in several languages.
     Topics may include non-textual media (image, sound, video, etc.)
     annotated in a variety of metadata fields (caption, description, etc.)
    Thus, there are several ways to get to the same info/topic.
Dynamic Two-Stage Image Retrieval from Large Multimodal Databases   Avi Arampatzis, Konstantinos Zagoris, Savvas Chatzichristofis   8


                  Image Retrieval from Multimodal Databases
In image retrieval systems, users are assumed to target visual similarity. Thus,

 Image is the primary medium.
     image modalities are most important than others

 Modalities from other media are secondary.
     but they can still help with the problems of CBIR:
      ¦ low generality (Red Tomato/Pie-Chart) problem (esp. for global feats.)
      ¦ speed

Traditionally, fusion has been used, but:

 weighing of media is not trivial; usually requires training data

 not theoretically sound: influence of textual scores may worsen the visual quality
Dynamic Two-Stage Image Retrieval from Large Multimodal Databases   Avi Arampatzis, Konstantinos Zagoris, Savvas Chatzichristofis   9


       Two-stage Image Retrieval from Multimedia Databases
Key idea for improving CBIR effectiveness:
Before CBIR, raise query generality by reducing collection size via filtering.
Assumptions:

 Query is expressed in the primary medium i.e. image,

 accompanied by a query in a secondary medium, e.g. text.

Two stage retrieval:

 Rank the collection by the secondary medium/query, e.g. text.

 Draw a rank threshold K.

 Re-rank only the top-K items with CBIR.

Using a ‘cheaper’ secondary medium, we also improve speed.
Dynamic Two-Stage Image Retrieval from Large Multimodal Databases   Avi Arampatzis, Konstantinos Zagoris, Savvas Chatzichristofis   10


                                         Previous Related Work

Best-results re-ranking by visual content has been seen before, but

 for other purposes, e.g. result clustering, diversity, . . .

 using external info (e.g. sets of images), or training data, . . .

 They all used
     global features
     a static predefined threshold for all queries

Effectiveness results have been mixed,
while some didn’t provide a comparative evaluation.
Dynamic Two-Stage Image Retrieval from Large Multimodal Databases   Avi Arampatzis, Konstantinos Zagoris, Savvas Chatzichristofis   11


                                            Main Contributions

In view of the related literature, our main contributions are:

 dynamic thresholds per query
     no external information
     no training data

 extensive evaluation of thresholding types and levels
Dynamic Two-Stage Image Retrieval from Large Multimodal Databases   Avi Arampatzis, Konstantinos Zagoris, Savvas Chatzichristofis   12


              The ImageCLEF 2010 Wikipedia Test Collection

 237,434 items
     Primary medium: image
      ¦ Heterogeneous: color natural images, graphics, greyscale images, etc.
      ¦ variety of image sizes
      ¦ It’s a large benchmark image database for today’s standards.
     + noisy and incomplete user-supplied annotations
     + wikipedia articles containing the images
     Annotations  articles come in 3 languages: En, Fr, De.

 70 test topics
     Visual part:
      ¦ 1 or more example images
     Textual part:
      ¦ 3 titles fields, one per language (En, Fr, De)
     Topics are assessed by visual similarity.
Dynamic Two-Stage Image Retrieval from Large Multimodal Databases   Avi Arampatzis, Konstantinos Zagoris, Savvas Chatzichristofis   13


                                         Indexing and Retrieval

 text: only English annotations + English query
     Lemur Toolkit V4.11 / Indri V2.11
      ¦ tf.idf retrieval model (works better with our thresholding methods)
      ¦ default settings + Krovetz stemmer

 image:
     Joint Composite Descriptor (JCD)
      ¦ captures color and texture information
      ¦ developed for color natural images
     Spatial Color Distribution (SpCD)
      ¦ captures color and its spatial distribution
      ¦ more suitable for colored graphics (fewer colors, less texture)
     JCD  SpCD are found to be more effective than MPEG-7 descriptors.
Dynamic Two-Stage Image Retrieval from Large Multimodal Databases   Avi Arampatzis, Konstantinos Zagoris, Savvas Chatzichristofis   14


                                   Thresholding and Re-ranking

 Static Thresholding: a fixed pre-selected rank threshold K for all topics
     K  25, 50, 100, 250, 500, 1000.

 Dynamic Thresholding: a variable rank threshold per topic
     Score-Distributional Threshold Optimization (SDTO)
      [Arampatzis et.al., ”Where to Stop Reading a Ranked-list?”, SIGIR 2009]
     Two types:
      ¦ Threshold on Precision: g  0.990, 0.950, 0.800, 0.500, 0.330, 0.100
      ¦ Threshold on prel: θ  0.990, 0.950, 0.800, 0.500, 0.330, 0.100
Dynamic Two-Stage Image Retrieval from Large Multimodal Databases   Avi Arampatzis, Konstantinos Zagoris, Savvas Chatzichristofis   15


                        Initial Experiments: Setting a Baseline
                 item scoring by                                MAP      P@10          P@20          bpref
                 JCD1                                           .0058    .0486         .0479         .0352
                 maxi JCDi                                      .0072    .0614         .0614         .0387
                 maxi JCDi + maxi SpCDi                         .0112    .0871         .0886         .0415
                 tf.idf (text-only)                             .1293    .3614         .3314         .1806


 Image-only runs provide very weak baselines.

 We chose the text-only run as a baseline for the statistical significance testing.

 This makes sense also from an efficiency point of view:

     if using a secondary text modality for image retrieval is more effective than
      current CBIR methods, then there is no reason at all for using
      computationally costly CBIR methods.
Dynamic Two-Stage Image Retrieval from Large Multimodal Databases        Avi Arampatzis, Konstantinos Zagoris, Savvas Chatzichristofis   16


                                                         Results

    threshold        K€         MAP         P@10
                                                 JCD1
                                                         P@20         bpref        MAP
                                                                                        maxi JCDi +
                                                                                             P@10
                                                                                                          maxi SpCDi
                                                                                                            P@20     bpref
    text-only        —          .1293       .3614        .3314        .1806        .1293     .3614          .3314    .1806
           25        25       .1162™      .3957 -       .3457˜       .1641™
                                                                                 .1168   - .3943 -         .3436 ˜
                                                                                                                    .1659™
           50        50        .1144™     .3829 -      .3579˜        .1608™      .1154 -   .3986 -         .3557 -  .1648™
          100        100      .1138 -     .3786 -      .3471 -       .1609™      .1133 -   .3900 -         .3486 - .1623 -
  K
          250        250       .1081™     .3414 -      .3164 -      .1644 -       .1092™   .3771 -        .3564 -  .1664 -
          500        500       .0968™
                                          .3200 -      .3007 -       .1575™       .0999™
                                                                                           .3557 -         .3250 -  .1590™
         1000       1000       .0865™
                                           .2871™       .2729™       .1493™       .0909™
                                                                                           .3329 -         .3064 -  .1511™
         .9900       49       .1364 -     .4214˜       .3550 -       .1902˜       .1385˜    .4371˜œ
                                                                                                           .3743˜   .1921˜
         .9500       68       .1352 -      .4171˜      .3586 -      .1912˜       .1386˜    .4500˜ œ
                                                                                                           .3836˜  .1932˜
         .8000       95       .1318 -     .4000 -      .3536 -      .1892 -      .1365 -    .4443˜œ
                                                                                                          .3871˜   .1924 -
  g
         .5000       151      .1196 -     .3814 -      .3393 -      .1808 -      .1226 -   .4043 -         .3550 - .1813 -
         .3333       237       .1085™
                                          .3500 -      .3000 -      .1707 -       .1121™   .3857 -         .3364 - .1734 -
         .1000       711       .0864™
                                           .2871™       .2621™
                                                                     .1461™
                                                                                  .0909™
                                                                                           .3357 -         .2964 -  .1487™

         .9900       42       .1342 -     .4043 -      .3414 -      .1865 -       .1375˜    .4371˜œ
                                                                                                           .3700˜   .1897˜
         .9500       51       .1371 -      .4214˜      .3586 -       .1903˜       .1417˜œ
                                                                                            .4500˜œ
                                                                                                           .3864˜œ
                                                                                                                    .1924˜œ

         .8000       81       .1384˜      .4229˜       .3614 -       .1921˜      .1427˜ œ
                                                                                           .4629˜ œ
                                                                                                           .3871˜œ
                                                                                                                   .1961˜  œ
  θ
         .5000       91       .1367 -     .4057 -      .3571 -       .1919˜       .1397˜    .4400˜œ
                                                                                                           .3829˜   .1937˜
         .3333       109      .1375 -     .4129 -      .3636 -      .1933˜        .1404˜    .4500˜œ
                                                                                                           .3907˜œ
                                                                                                                    .1949˜œ

         .1000       130      .1314 -     .4100 -      .3629 -      .1866 -      .1370 -    .4371˜         .3843˜   .1922˜
  image-only         —         .0058™
                                           .0486™
                                                        .0479™
                                                                     .0352™
                                                                                  .0112™
                                                                                            .0871™
                                                                                                           .0886™
                                                                                                                    .0415™
Dynamic Two-Stage Image Retrieval from Large Multimodal Databases   Avi Arampatzis, Konstantinos Zagoris, Savvas Chatzichristofis   17


                                                         Results

 Static thresholding improves initial Precision at the cost of MAP and bpref.

 Dynamic thresholding on Precision or prel does not have this drawback.

 The level of static and precision thresholds influences greatly the effectiveness;
  unsuitable choices (e.g. too loose) lead to a degraded performance.

 Prel thresholds are much more robust in this respect.

 Better CBIR at the second stage leads to overall improvements, but the
  thresholding type seems more important: While the two CBIR methods vary
  greatly in performance (the best has almost double the effectiveness of the
  other), static thresholding is not influenced much by this choice. Dynamic
  methods benefit more from improved CBIR.
Dynamic Two-Stage Image Retrieval from Large Multimodal Databases   Avi Arampatzis, Konstantinos Zagoris, Savvas Chatzichristofis   18


                                                         Results
Dynamic Two-Stage Image Retrieval from Large Multimodal Databases   Avi Arampatzis, Konstantinos Zagoris, Savvas Chatzichristofis   20


                                                    Conclusions

 Two-stage retrieval with dynamic thresholding:
     more effective  robust than static thresholding
     practically insensitive to a wide range of choices for the optimization measure
     beats significantly the text-only  several image-only baselines

 Two-stage retrieval, irrespective of thresholding type:
     efficiency benefit:
      ¦ it cuts down greatly on expensive image operations
      ¦ on average, only 2 to 5 in 10,000 images had to be scored
Dynamic Two-Stage Image Retrieval from Large Multimodal Databases   Avi Arampatzis, Konstantinos Zagoris, Savvas Chatzichristofis   21


                                               Further Research

 Compare two-stage retrieval to fusion. [Did this; check ECIR11 poster]
     Both methods work better than text-only  image-only;
      two-stage is slightly better than fusion, but the difference is non-significant.
     Both methods are robust in different ways:
      ¦ Fusion provides less variability across topics but it is sensitive to the
        weighing parameter of the contributing media.
      ¦ Two-stage provides a much lower sensitivity to its thresholding parameter
        but has a higher variability across topics.

 Try two-stage retrieval with other type of image descriptors,
  e.g. the visual codebook approach (TOP-SURF).

 Generalization to multi-stage retrieval, where rankings for the media are
  successively being thresholded and re-ranked with respect to a media hierarchy.
Dynamic Two-Stage Image Retrieval from Large Multimodal Databases   Avi Arampatzis, Konstantinos Zagoris, Savvas Chatzichristofis   22




                                                  avi@ee.duth.gr



                                                       Thank you !
http://www.mmretrieval.net

Contenu connexe

Tendances

An adaptive-model-for-blind-image-restoration-using-bayesian-approach
An adaptive-model-for-blind-image-restoration-using-bayesian-approachAn adaptive-model-for-blind-image-restoration-using-bayesian-approach
An adaptive-model-for-blind-image-restoration-using-bayesian-approachCemal Ardil
 
Steganalysis of LSB Embedded Images Using Gray Level Co-Occurrence Matrix
Steganalysis of LSB Embedded Images Using Gray Level Co-Occurrence MatrixSteganalysis of LSB Embedded Images Using Gray Level Co-Occurrence Matrix
Steganalysis of LSB Embedded Images Using Gray Level Co-Occurrence MatrixCSCJournals
 
Btv thesis defense_v1.02-final
Btv thesis defense_v1.02-finalBtv thesis defense_v1.02-final
Btv thesis defense_v1.02-finalVinh Bui
 
TMS workshop on machine learning in materials science: Intro to deep learning...
TMS workshop on machine learning in materials science: Intro to deep learning...TMS workshop on machine learning in materials science: Intro to deep learning...
TMS workshop on machine learning in materials science: Intro to deep learning...BrianDeCost
 
Robust Ensemble Classifier Combination Based on Noise Removal with One-Class SVM
Robust Ensemble Classifier Combination Based on Noise Removal with One-Class SVMRobust Ensemble Classifier Combination Based on Noise Removal with One-Class SVM
Robust Ensemble Classifier Combination Based on Noise Removal with One-Class SVMFerhat Ozgur Catak
 
Artem Baklanov - Votes Aggregation Techniques in Geo-Wiki Crowdsourcing Game:...
Artem Baklanov - Votes Aggregation Techniques in Geo-Wiki Crowdsourcing Game:...Artem Baklanov - Votes Aggregation Techniques in Geo-Wiki Crowdsourcing Game:...
Artem Baklanov - Votes Aggregation Techniques in Geo-Wiki Crowdsourcing Game:...AIST
 
Optimized Neural Network for Classification of Multispectral Images
Optimized Neural Network for Classification of Multispectral ImagesOptimized Neural Network for Classification of Multispectral Images
Optimized Neural Network for Classification of Multispectral ImagesIDES Editor
 
The Face of Nanomaterials: Insightful Classification Using Deep Learning - An...
The Face of Nanomaterials: Insightful Classification Using Deep Learning - An...The Face of Nanomaterials: Insightful Classification Using Deep Learning - An...
The Face of Nanomaterials: Insightful Classification Using Deep Learning - An...PyData
 
Secure Multi-Party Computation Based Privacy Preserving Extreme Learning Mach...
Secure Multi-Party Computation Based Privacy Preserving Extreme Learning Mach...Secure Multi-Party Computation Based Privacy Preserving Extreme Learning Mach...
Secure Multi-Party Computation Based Privacy Preserving Extreme Learning Mach...Ferhat Ozgur Catak
 
Volume 2-issue-6-1930-1932
Volume 2-issue-6-1930-1932Volume 2-issue-6-1930-1932
Volume 2-issue-6-1930-1932Editor IJARCET
 
184816386 x mining
184816386 x mining184816386 x mining
184816386 x mining496573
 
Bhadale group of companies ai neural networks and algorithms catalogue
Bhadale group of companies ai neural networks and algorithms catalogueBhadale group of companies ai neural networks and algorithms catalogue
Bhadale group of companies ai neural networks and algorithms catalogueVijayananda Mohire
 
Indoor Point Cloud Processing
Indoor Point Cloud ProcessingIndoor Point Cloud Processing
Indoor Point Cloud ProcessingPetteriTeikariPhD
 
Fractional step discriminant pruning
Fractional step discriminant pruningFractional step discriminant pruning
Fractional step discriminant pruningVasileiosMezaris
 
Decision Transformer: Reinforcement Learning via Sequence Modeling
Decision Transformer: Reinforcement Learning via Sequence ModelingDecision Transformer: Reinforcement Learning via Sequence Modeling
Decision Transformer: Reinforcement Learning via Sequence ModelingTomoya Oda
 
Feature Subset Selection for High Dimensional Data Using Clustering Techniques
Feature Subset Selection for High Dimensional Data Using Clustering TechniquesFeature Subset Selection for High Dimensional Data Using Clustering Techniques
Feature Subset Selection for High Dimensional Data Using Clustering TechniquesIRJET Journal
 
CONTENT BASED VIDEO CATEGORIZATION USING RELATIONAL CLUSTERING WITH LOCAL SCA...
CONTENT BASED VIDEO CATEGORIZATION USING RELATIONAL CLUSTERING WITH LOCAL SCA...CONTENT BASED VIDEO CATEGORIZATION USING RELATIONAL CLUSTERING WITH LOCAL SCA...
CONTENT BASED VIDEO CATEGORIZATION USING RELATIONAL CLUSTERING WITH LOCAL SCA...ijcsit
 
Radial Thickness Calculation and Visualization for Volumetric Layers-8397
Radial Thickness Calculation and Visualization for Volumetric Layers-8397Radial Thickness Calculation and Visualization for Volumetric Layers-8397
Radial Thickness Calculation and Visualization for Volumetric Layers-8397Kitware Kitware
 

Tendances (20)

An adaptive-model-for-blind-image-restoration-using-bayesian-approach
An adaptive-model-for-blind-image-restoration-using-bayesian-approachAn adaptive-model-for-blind-image-restoration-using-bayesian-approach
An adaptive-model-for-blind-image-restoration-using-bayesian-approach
 
Steganalysis of LSB Embedded Images Using Gray Level Co-Occurrence Matrix
Steganalysis of LSB Embedded Images Using Gray Level Co-Occurrence MatrixSteganalysis of LSB Embedded Images Using Gray Level Co-Occurrence Matrix
Steganalysis of LSB Embedded Images Using Gray Level Co-Occurrence Matrix
 
Btv thesis defense_v1.02-final
Btv thesis defense_v1.02-finalBtv thesis defense_v1.02-final
Btv thesis defense_v1.02-final
 
TMS workshop on machine learning in materials science: Intro to deep learning...
TMS workshop on machine learning in materials science: Intro to deep learning...TMS workshop on machine learning in materials science: Intro to deep learning...
TMS workshop on machine learning in materials science: Intro to deep learning...
 
Robust Ensemble Classifier Combination Based on Noise Removal with One-Class SVM
Robust Ensemble Classifier Combination Based on Noise Removal with One-Class SVMRobust Ensemble Classifier Combination Based on Noise Removal with One-Class SVM
Robust Ensemble Classifier Combination Based on Noise Removal with One-Class SVM
 
Artem Baklanov - Votes Aggregation Techniques in Geo-Wiki Crowdsourcing Game:...
Artem Baklanov - Votes Aggregation Techniques in Geo-Wiki Crowdsourcing Game:...Artem Baklanov - Votes Aggregation Techniques in Geo-Wiki Crowdsourcing Game:...
Artem Baklanov - Votes Aggregation Techniques in Geo-Wiki Crowdsourcing Game:...
 
Optimized Neural Network for Classification of Multispectral Images
Optimized Neural Network for Classification of Multispectral ImagesOptimized Neural Network for Classification of Multispectral Images
Optimized Neural Network for Classification of Multispectral Images
 
The Face of Nanomaterials: Insightful Classification Using Deep Learning - An...
The Face of Nanomaterials: Insightful Classification Using Deep Learning - An...The Face of Nanomaterials: Insightful Classification Using Deep Learning - An...
The Face of Nanomaterials: Insightful Classification Using Deep Learning - An...
 
Secure Multi-Party Computation Based Privacy Preserving Extreme Learning Mach...
Secure Multi-Party Computation Based Privacy Preserving Extreme Learning Mach...Secure Multi-Party Computation Based Privacy Preserving Extreme Learning Mach...
Secure Multi-Party Computation Based Privacy Preserving Extreme Learning Mach...
 
Volume 2-issue-6-1930-1932
Volume 2-issue-6-1930-1932Volume 2-issue-6-1930-1932
Volume 2-issue-6-1930-1932
 
A0360109
A0360109A0360109
A0360109
 
184816386 x mining
184816386 x mining184816386 x mining
184816386 x mining
 
Bhadale group of companies ai neural networks and algorithms catalogue
Bhadale group of companies ai neural networks and algorithms catalogueBhadale group of companies ai neural networks and algorithms catalogue
Bhadale group of companies ai neural networks and algorithms catalogue
 
Indoor Point Cloud Processing
Indoor Point Cloud ProcessingIndoor Point Cloud Processing
Indoor Point Cloud Processing
 
Fractional step discriminant pruning
Fractional step discriminant pruningFractional step discriminant pruning
Fractional step discriminant pruning
 
Decision Transformer: Reinforcement Learning via Sequence Modeling
Decision Transformer: Reinforcement Learning via Sequence ModelingDecision Transformer: Reinforcement Learning via Sequence Modeling
Decision Transformer: Reinforcement Learning via Sequence Modeling
 
Feature Subset Selection for High Dimensional Data Using Clustering Techniques
Feature Subset Selection for High Dimensional Data Using Clustering TechniquesFeature Subset Selection for High Dimensional Data Using Clustering Techniques
Feature Subset Selection for High Dimensional Data Using Clustering Techniques
 
Tools for Image Retrieval in Large Multimedia Databases
Tools for Image Retrieval in Large Multimedia DatabasesTools for Image Retrieval in Large Multimedia Databases
Tools for Image Retrieval in Large Multimedia Databases
 
CONTENT BASED VIDEO CATEGORIZATION USING RELATIONAL CLUSTERING WITH LOCAL SCA...
CONTENT BASED VIDEO CATEGORIZATION USING RELATIONAL CLUSTERING WITH LOCAL SCA...CONTENT BASED VIDEO CATEGORIZATION USING RELATIONAL CLUSTERING WITH LOCAL SCA...
CONTENT BASED VIDEO CATEGORIZATION USING RELATIONAL CLUSTERING WITH LOCAL SCA...
 
Radial Thickness Calculation and Visualization for Volumetric Layers-8397
Radial Thickness Calculation and Visualization for Volumetric Layers-8397Radial Thickness Calculation and Visualization for Volumetric Layers-8397
Radial Thickness Calculation and Visualization for Volumetric Layers-8397
 

En vedette

Comparative Performance Evaluation of Image Descriptors Over IEEE 802.11b Noi...
Comparative Performance Evaluation of Image Descriptors Over IEEE 802.11b Noi...Comparative Performance Evaluation of Image Descriptors Over IEEE 802.11b Noi...
Comparative Performance Evaluation of Image Descriptors Over IEEE 802.11b Noi...Konstantinos Zagoris
 
Content and Metadata Based Image Document Retrieval (in Greek)
Content and Metadata Based Image Document Retrieval (in Greek)Content and Metadata Based Image Document Retrieval (in Greek)
Content and Metadata Based Image Document Retrieval (in Greek)Konstantinos Zagoris
 
Handwritten and Machine Printed Text Separation in Document Images using the ...
Handwritten and Machine Printed Text Separation in Document Images using the ...Handwritten and Machine Printed Text Separation in Document Images using the ...
Handwritten and Machine Printed Text Separation in Document Images using the ...Konstantinos Zagoris
 
Svm based cbir of breast masses on mammograms
Svm based cbir of breast masses on mammogramsSvm based cbir of breast masses on mammograms
Svm based cbir of breast masses on mammogramsKonstantinos Zagoris
 
Text extraction using document structure features and support vector machines
Text extraction using document structure features and support vector machinesText extraction using document structure features and support vector machines
Text extraction using document structure features and support vector machinesKonstantinos Zagoris
 
Query expansion based on visual content new
Query expansion based on visual content newQuery expansion based on visual content new
Query expansion based on visual content newLazaros Tsochatzidis
 
A Review of Feature Extraction Techniques for CBIR based on SVM
A Review of Feature Extraction Techniques for CBIR based on SVMA Review of Feature Extraction Techniques for CBIR based on SVM
A Review of Feature Extraction Techniques for CBIR based on SVMIJEEE
 
Segmentation - based Historical Handwritten Word Spotting using document-spec...
Segmentation - based Historical Handwritten Word Spotting using document-spec...Segmentation - based Historical Handwritten Word Spotting using document-spec...
Segmentation - based Historical Handwritten Word Spotting using document-spec...Konstantinos Zagoris
 
Cbir final ppt
Cbir final pptCbir final ppt
Cbir final pptrinki nag
 
Literature Review on Content Based Image Retrieval
Literature Review on Content Based Image RetrievalLiterature Review on Content Based Image Retrieval
Literature Review on Content Based Image RetrievalUpekha Vandebona
 
Content based image retrieval using clustering Algorithm(CBIR)
Content based image retrieval using clustering Algorithm(CBIR)Content based image retrieval using clustering Algorithm(CBIR)
Content based image retrieval using clustering Algorithm(CBIR)Raja Sekar
 
Content Based Image Retrieval
Content Based Image RetrievalContent Based Image Retrieval
Content Based Image RetrievalSOURAV KAR
 
Content based image retrieval (cbir) using
Content based image retrieval (cbir) usingContent based image retrieval (cbir) using
Content based image retrieval (cbir) usingijcsity
 
Content Based Image Retrieval
Content Based Image Retrieval Content Based Image Retrieval
Content Based Image Retrieval Swati Chauhan
 
Faro Visual Attention For Implicit Relevance Feedback In A Content Based Imag...
Faro Visual Attention For Implicit Relevance Feedback In A Content Based Imag...Faro Visual Attention For Implicit Relevance Feedback In A Content Based Imag...
Faro Visual Attention For Implicit Relevance Feedback In A Content Based Imag...Kalle
 
Content based image retrieval(cbir)
Content based image retrieval(cbir)Content based image retrieval(cbir)
Content based image retrieval(cbir)paddu123
 

En vedette (18)

Comparative Performance Evaluation of Image Descriptors Over IEEE 802.11b Noi...
Comparative Performance Evaluation of Image Descriptors Over IEEE 802.11b Noi...Comparative Performance Evaluation of Image Descriptors Over IEEE 802.11b Noi...
Comparative Performance Evaluation of Image Descriptors Over IEEE 802.11b Noi...
 
Content and Metadata Based Image Document Retrieval (in Greek)
Content and Metadata Based Image Document Retrieval (in Greek)Content and Metadata Based Image Document Retrieval (in Greek)
Content and Metadata Based Image Document Retrieval (in Greek)
 
Handwritten and Machine Printed Text Separation in Document Images using the ...
Handwritten and Machine Printed Text Separation in Document Images using the ...Handwritten and Machine Printed Text Separation in Document Images using the ...
Handwritten and Machine Printed Text Separation in Document Images using the ...
 
Svm based cbir of breast masses on mammograms
Svm based cbir of breast masses on mammogramsSvm based cbir of breast masses on mammograms
Svm based cbir of breast masses on mammograms
 
Text extraction using document structure features and support vector machines
Text extraction using document structure features and support vector machinesText extraction using document structure features and support vector machines
Text extraction using document structure features and support vector machines
 
Query expansion based on visual content new
Query expansion based on visual content newQuery expansion based on visual content new
Query expansion based on visual content new
 
A Review of Feature Extraction Techniques for CBIR based on SVM
A Review of Feature Extraction Techniques for CBIR based on SVMA Review of Feature Extraction Techniques for CBIR based on SVM
A Review of Feature Extraction Techniques for CBIR based on SVM
 
Ga
GaGa
Ga
 
Segmentation - based Historical Handwritten Word Spotting using document-spec...
Segmentation - based Historical Handwritten Word Spotting using document-spec...Segmentation - based Historical Handwritten Word Spotting using document-spec...
Segmentation - based Historical Handwritten Word Spotting using document-spec...
 
Cbir final ppt
Cbir final pptCbir final ppt
Cbir final ppt
 
Literature Review on Content Based Image Retrieval
Literature Review on Content Based Image RetrievalLiterature Review on Content Based Image Retrieval
Literature Review on Content Based Image Retrieval
 
Content based image retrieval using clustering Algorithm(CBIR)
Content based image retrieval using clustering Algorithm(CBIR)Content based image retrieval using clustering Algorithm(CBIR)
Content based image retrieval using clustering Algorithm(CBIR)
 
Content Based Image Retrieval
Content Based Image RetrievalContent Based Image Retrieval
Content Based Image Retrieval
 
CBIR
CBIRCBIR
CBIR
 
Content based image retrieval (cbir) using
Content based image retrieval (cbir) usingContent based image retrieval (cbir) using
Content based image retrieval (cbir) using
 
Content Based Image Retrieval
Content Based Image Retrieval Content Based Image Retrieval
Content Based Image Retrieval
 
Faro Visual Attention For Implicit Relevance Feedback In A Content Based Imag...
Faro Visual Attention For Implicit Relevance Feedback In A Content Based Imag...Faro Visual Attention For Implicit Relevance Feedback In A Content Based Imag...
Faro Visual Attention For Implicit Relevance Feedback In A Content Based Imag...
 
Content based image retrieval(cbir)
Content based image retrieval(cbir)Content based image retrieval(cbir)
Content based image retrieval(cbir)
 

Similaire à Dynamic Two-Stage Image Retrieval from Large Multimodal Databases

Mobile Visual Search: Object Re-Identification Against Large Repositories
Mobile Visual Search: Object Re-Identification Against Large RepositoriesMobile Visual Search: Object Re-Identification Against Large Repositories
Mobile Visual Search: Object Re-Identification Against Large RepositoriesUnited States Air Force Academy
 
Content Based Image Retrieval
Content Based Image RetrievalContent Based Image Retrieval
Content Based Image Retrievalijtsrd
 
Image Retrieval using Equalized Histogram Image Bins Moments
Image Retrieval using Equalized Histogram Image Bins MomentsImage Retrieval using Equalized Histogram Image Bins Moments
Image Retrieval using Equalized Histogram Image Bins MomentsIDES Editor
 
[DLHacks 実装]Perceptual Adversarial Networks for Image-to-Image Transformation
[DLHacks 実装]Perceptual Adversarial Networks for Image-to-Image Transformation[DLHacks 実装]Perceptual Adversarial Networks for Image-to-Image Transformation
[DLHacks 実装]Perceptual Adversarial Networks for Image-to-Image TransformationDeep Learning JP
 
Image stegnogrpahy(muqeed)
Image stegnogrpahy(muqeed)Image stegnogrpahy(muqeed)
Image stegnogrpahy(muqeed)Muqeed Hussain
 
Intelligent Multimedia Recommendation
Intelligent Multimedia RecommendationIntelligent Multimedia Recommendation
Intelligent Multimedia RecommendationWanjin Yu
 
Review : Multi-Domain Image Completion for Random Missing Input Data [cdm]
Review : Multi-Domain Image Completion for Random Missing Input Data [cdm]Review : Multi-Domain Image Completion for Random Missing Input Data [cdm]
Review : Multi-Domain Image Completion for Random Missing Input Data [cdm]Dongmin Choi
 
rasdaman: from barebone Arrays to DataCubes
rasdaman: from barebone Arrays to DataCubesrasdaman: from barebone Arrays to DataCubes
rasdaman: from barebone Arrays to DataCubesEUDAT
 
Multimedia Databases: Performance Measure Benchmarking Model (PMBM) Framework
Multimedia Databases: Performance Measure Benchmarking Model (PMBM) FrameworkMultimedia Databases: Performance Measure Benchmarking Model (PMBM) Framework
Multimedia Databases: Performance Measure Benchmarking Model (PMBM) Frameworktheijes
 
Similarity-based retrieval of multimedia content
Similarity-based retrieval of multimedia contentSimilarity-based retrieval of multimedia content
Similarity-based retrieval of multimedia contentSymeon Papadopoulos
 
小數據如何實現電腦視覺,微軟AI研究首席剖析關鍵
小數據如何實現電腦視覺,微軟AI研究首席剖析關鍵小數據如何實現電腦視覺,微軟AI研究首席剖析關鍵
小數據如何實現電腦視覺,微軟AI研究首席剖析關鍵CHENHuiMei
 
Research Inventy : International Journal of Engineering and Science is publis...
Research Inventy : International Journal of Engineering and Science is publis...Research Inventy : International Journal of Engineering and Science is publis...
Research Inventy : International Journal of Engineering and Science is publis...researchinventy
 
Research Inventy: International Journal of Engineering and Science
Research Inventy: International Journal of Engineering and ScienceResearch Inventy: International Journal of Engineering and Science
Research Inventy: International Journal of Engineering and Scienceresearchinventy
 
Content-based image retrieval based on corel dataset using deep learning
Content-based image retrieval based on corel dataset using deep learningContent-based image retrieval based on corel dataset using deep learning
Content-based image retrieval based on corel dataset using deep learningIAESIJAI
 
High Performance Cyberinfrastructure Enabling Data-Driven Science in the Biom...
High Performance Cyberinfrastructure Enabling Data-Driven Science in the Biom...High Performance Cyberinfrastructure Enabling Data-Driven Science in the Biom...
High Performance Cyberinfrastructure Enabling Data-Driven Science in the Biom...Larry Smarr
 
What is a Data Commons and Why Should You Care?
What is a Data Commons and Why Should You Care? What is a Data Commons and Why Should You Care?
What is a Data Commons and Why Should You Care? Robert Grossman
 
Computer vision for transportation
Computer vision for transportationComputer vision for transportation
Computer vision for transportationWanjin Yu
 
2019 cvpr paper_overview
2019 cvpr paper_overview2019 cvpr paper_overview
2019 cvpr paper_overviewLEE HOSEONG
 
2019 cvpr paper overview by Ho Seong Lee
2019 cvpr paper overview by Ho Seong Lee2019 cvpr paper overview by Ho Seong Lee
2019 cvpr paper overview by Ho Seong LeeMoazzem Hossain
 

Similaire à Dynamic Two-Stage Image Retrieval from Large Multimodal Databases (20)

Mobile Visual Search: Object Re-Identification Against Large Repositories
Mobile Visual Search: Object Re-Identification Against Large RepositoriesMobile Visual Search: Object Re-Identification Against Large Repositories
Mobile Visual Search: Object Re-Identification Against Large Repositories
 
Content Based Image Retrieval
Content Based Image RetrievalContent Based Image Retrieval
Content Based Image Retrieval
 
Image Retrieval using Equalized Histogram Image Bins Moments
Image Retrieval using Equalized Histogram Image Bins MomentsImage Retrieval using Equalized Histogram Image Bins Moments
Image Retrieval using Equalized Histogram Image Bins Moments
 
[DLHacks 実装]Perceptual Adversarial Networks for Image-to-Image Transformation
[DLHacks 実装]Perceptual Adversarial Networks for Image-to-Image Transformation[DLHacks 実装]Perceptual Adversarial Networks for Image-to-Image Transformation
[DLHacks 実装]Perceptual Adversarial Networks for Image-to-Image Transformation
 
Image stegnogrpahy(muqeed)
Image stegnogrpahy(muqeed)Image stegnogrpahy(muqeed)
Image stegnogrpahy(muqeed)
 
Intelligent Multimedia Recommendation
Intelligent Multimedia RecommendationIntelligent Multimedia Recommendation
Intelligent Multimedia Recommendation
 
Review : Multi-Domain Image Completion for Random Missing Input Data [cdm]
Review : Multi-Domain Image Completion for Random Missing Input Data [cdm]Review : Multi-Domain Image Completion for Random Missing Input Data [cdm]
Review : Multi-Domain Image Completion for Random Missing Input Data [cdm]
 
rasdaman: from barebone Arrays to DataCubes
rasdaman: from barebone Arrays to DataCubesrasdaman: from barebone Arrays to DataCubes
rasdaman: from barebone Arrays to DataCubes
 
Multimedia Databases: Performance Measure Benchmarking Model (PMBM) Framework
Multimedia Databases: Performance Measure Benchmarking Model (PMBM) FrameworkMultimedia Databases: Performance Measure Benchmarking Model (PMBM) Framework
Multimedia Databases: Performance Measure Benchmarking Model (PMBM) Framework
 
Portfolio
PortfolioPortfolio
Portfolio
 
Similarity-based retrieval of multimedia content
Similarity-based retrieval of multimedia contentSimilarity-based retrieval of multimedia content
Similarity-based retrieval of multimedia content
 
小數據如何實現電腦視覺,微軟AI研究首席剖析關鍵
小數據如何實現電腦視覺,微軟AI研究首席剖析關鍵小數據如何實現電腦視覺,微軟AI研究首席剖析關鍵
小數據如何實現電腦視覺,微軟AI研究首席剖析關鍵
 
Research Inventy : International Journal of Engineering and Science is publis...
Research Inventy : International Journal of Engineering and Science is publis...Research Inventy : International Journal of Engineering and Science is publis...
Research Inventy : International Journal of Engineering and Science is publis...
 
Research Inventy: International Journal of Engineering and Science
Research Inventy: International Journal of Engineering and ScienceResearch Inventy: International Journal of Engineering and Science
Research Inventy: International Journal of Engineering and Science
 
Content-based image retrieval based on corel dataset using deep learning
Content-based image retrieval based on corel dataset using deep learningContent-based image retrieval based on corel dataset using deep learning
Content-based image retrieval based on corel dataset using deep learning
 
High Performance Cyberinfrastructure Enabling Data-Driven Science in the Biom...
High Performance Cyberinfrastructure Enabling Data-Driven Science in the Biom...High Performance Cyberinfrastructure Enabling Data-Driven Science in the Biom...
High Performance Cyberinfrastructure Enabling Data-Driven Science in the Biom...
 
What is a Data Commons and Why Should You Care?
What is a Data Commons and Why Should You Care? What is a Data Commons and Why Should You Care?
What is a Data Commons and Why Should You Care?
 
Computer vision for transportation
Computer vision for transportationComputer vision for transportation
Computer vision for transportation
 
2019 cvpr paper_overview
2019 cvpr paper_overview2019 cvpr paper_overview
2019 cvpr paper_overview
 
2019 cvpr paper overview by Ho Seong Lee
2019 cvpr paper overview by Ho Seong Lee2019 cvpr paper overview by Ho Seong Lee
2019 cvpr paper overview by Ho Seong Lee
 

Dernier

Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 

Dernier (20)

Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 

Dynamic Two-Stage Image Retrieval from Large Multimodal Databases

  • 1. Dynamic Two-Stage Image Retrieval from Large Multimodal Databases Avi Arampatzis Konstantinos Zagoris Savvas Chatzichristofis Department of Electrical and Computer Engineering Democritus University of Thrace Xanthi 67100, Greece ECIR 2011
  • 2. Dynamic Two-Stage Image Retrieval from Large Multimodal Databases Avi Arampatzis, Konstantinos Zagoris, Savvas Chatzichristofis 2 Outline 1. Content-based Image Retrieval (CBIR): global vs local features, scalability 2. A problem of global features: The Red Tomato / Red Pie-Chart Problem 3. Multimodal Information Collections 4. Two-stage Image Retrieval from Multimedia Databases related work our main contributions 5. Experiments on the ImageCLEF 2010 Wikipedia Test Collection
  • 3. Dynamic Two-Stage Image Retrieval from Large Multimodal Databases Avi Arampatzis, Konstantinos Zagoris, Savvas Chatzichristofis 3 Content-based Image Retrieval (CBIR) In CBIR, images are represented by either local features: ¦ computed at multiple image points; capable of recognizing objects ¦ computationally expensive, e.g. due to high dimensionality global features: ¦ generalize images with single vectors capturing color, texture, shape, etc. ¦ computationally cheap (relatively) ¥ thus, more popular CBIR with either global or local features does not scale up well to large databases efficiency-wise
  • 4. Dynamic Two-Stage Image Retrieval from Large Multimodal Databases Avi Arampatzis, Konstantinos Zagoris, Savvas Chatzichristofis 4 CBIR with Global Features notoriously noisy for image queries of low generality (generality = the fraction of relevant images in a collection) In text retrieval: documents matching no query keywords are not retrieved In image retrieval: CBIR will rank the whole collection ¦ The Red Tomato / Red Pie-Chart Problem [next]
  • 5. Dynamic Two-Stage Image Retrieval from Large Multimodal Databases Avi Arampatzis, Konstantinos Zagoris, Savvas Chatzichristofis 5 The Red Tomato / Red Pie-Chart Problem Image query: Collection items: If the query has a low generality early ranks may be dominated by spurious results (such as the pie-chart) possibly ranked even before red tomatoes on non-white backgrounds
  • 6.
  • 7. Dynamic Two-Stage Image Retrieval from Large Multimodal Databases Avi Arampatzis, Konstantinos Zagoris, Savvas Chatzichristofis 7 Contemporary Information Collections Large Multimodal (= provide different manners of retrieval ) A good example is Wikipedia: Large: 3,605,426 articles (English version alone), “millions of images”, etc. Multimodal: Topics are covered in several languages. Topics may include non-textual media (image, sound, video, etc.) annotated in a variety of metadata fields (caption, description, etc.) Thus, there are several ways to get to the same info/topic.
  • 8. Dynamic Two-Stage Image Retrieval from Large Multimodal Databases Avi Arampatzis, Konstantinos Zagoris, Savvas Chatzichristofis 8 Image Retrieval from Multimodal Databases In image retrieval systems, users are assumed to target visual similarity. Thus, Image is the primary medium. image modalities are most important than others Modalities from other media are secondary. but they can still help with the problems of CBIR: ¦ low generality (Red Tomato/Pie-Chart) problem (esp. for global feats.) ¦ speed Traditionally, fusion has been used, but: weighing of media is not trivial; usually requires training data not theoretically sound: influence of textual scores may worsen the visual quality
  • 9. Dynamic Two-Stage Image Retrieval from Large Multimodal Databases Avi Arampatzis, Konstantinos Zagoris, Savvas Chatzichristofis 9 Two-stage Image Retrieval from Multimedia Databases Key idea for improving CBIR effectiveness: Before CBIR, raise query generality by reducing collection size via filtering. Assumptions: Query is expressed in the primary medium i.e. image, accompanied by a query in a secondary medium, e.g. text. Two stage retrieval: Rank the collection by the secondary medium/query, e.g. text. Draw a rank threshold K. Re-rank only the top-K items with CBIR. Using a ‘cheaper’ secondary medium, we also improve speed.
  • 10. Dynamic Two-Stage Image Retrieval from Large Multimodal Databases Avi Arampatzis, Konstantinos Zagoris, Savvas Chatzichristofis 10 Previous Related Work Best-results re-ranking by visual content has been seen before, but for other purposes, e.g. result clustering, diversity, . . . using external info (e.g. sets of images), or training data, . . . They all used global features a static predefined threshold for all queries Effectiveness results have been mixed, while some didn’t provide a comparative evaluation.
  • 11. Dynamic Two-Stage Image Retrieval from Large Multimodal Databases Avi Arampatzis, Konstantinos Zagoris, Savvas Chatzichristofis 11 Main Contributions In view of the related literature, our main contributions are: dynamic thresholds per query no external information no training data extensive evaluation of thresholding types and levels
  • 12. Dynamic Two-Stage Image Retrieval from Large Multimodal Databases Avi Arampatzis, Konstantinos Zagoris, Savvas Chatzichristofis 12 The ImageCLEF 2010 Wikipedia Test Collection 237,434 items Primary medium: image ¦ Heterogeneous: color natural images, graphics, greyscale images, etc. ¦ variety of image sizes ¦ It’s a large benchmark image database for today’s standards. + noisy and incomplete user-supplied annotations + wikipedia articles containing the images Annotations articles come in 3 languages: En, Fr, De. 70 test topics Visual part: ¦ 1 or more example images Textual part: ¦ 3 titles fields, one per language (En, Fr, De) Topics are assessed by visual similarity.
  • 13. Dynamic Two-Stage Image Retrieval from Large Multimodal Databases Avi Arampatzis, Konstantinos Zagoris, Savvas Chatzichristofis 13 Indexing and Retrieval text: only English annotations + English query Lemur Toolkit V4.11 / Indri V2.11 ¦ tf.idf retrieval model (works better with our thresholding methods) ¦ default settings + Krovetz stemmer image: Joint Composite Descriptor (JCD) ¦ captures color and texture information ¦ developed for color natural images Spatial Color Distribution (SpCD) ¦ captures color and its spatial distribution ¦ more suitable for colored graphics (fewer colors, less texture) JCD SpCD are found to be more effective than MPEG-7 descriptors.
  • 14. Dynamic Two-Stage Image Retrieval from Large Multimodal Databases Avi Arampatzis, Konstantinos Zagoris, Savvas Chatzichristofis 14 Thresholding and Re-ranking Static Thresholding: a fixed pre-selected rank threshold K for all topics K 25, 50, 100, 250, 500, 1000. Dynamic Thresholding: a variable rank threshold per topic Score-Distributional Threshold Optimization (SDTO) [Arampatzis et.al., ”Where to Stop Reading a Ranked-list?”, SIGIR 2009] Two types: ¦ Threshold on Precision: g 0.990, 0.950, 0.800, 0.500, 0.330, 0.100 ¦ Threshold on prel: θ 0.990, 0.950, 0.800, 0.500, 0.330, 0.100
  • 15. Dynamic Two-Stage Image Retrieval from Large Multimodal Databases Avi Arampatzis, Konstantinos Zagoris, Savvas Chatzichristofis 15 Initial Experiments: Setting a Baseline item scoring by MAP P@10 P@20 bpref JCD1 .0058 .0486 .0479 .0352 maxi JCDi .0072 .0614 .0614 .0387 maxi JCDi + maxi SpCDi .0112 .0871 .0886 .0415 tf.idf (text-only) .1293 .3614 .3314 .1806 Image-only runs provide very weak baselines. We chose the text-only run as a baseline for the statistical significance testing. This makes sense also from an efficiency point of view: if using a secondary text modality for image retrieval is more effective than current CBIR methods, then there is no reason at all for using computationally costly CBIR methods.
  • 16. Dynamic Two-Stage Image Retrieval from Large Multimodal Databases Avi Arampatzis, Konstantinos Zagoris, Savvas Chatzichristofis 16 Results threshold K€ MAP P@10 JCD1 P@20 bpref MAP maxi JCDi + P@10 maxi SpCDi P@20 bpref text-only — .1293 .3614 .3314 .1806 .1293 .3614 .3314 .1806 25 25 .1162™ .3957 - .3457˜ .1641™ .1168 - .3943 - .3436 ˜ .1659™ 50 50 .1144™ .3829 - .3579˜ .1608™ .1154 - .3986 - .3557 - .1648™ 100 100 .1138 - .3786 - .3471 - .1609™ .1133 - .3900 - .3486 - .1623 - K 250 250 .1081™ .3414 - .3164 - .1644 - .1092™ .3771 - .3564 - .1664 - 500 500 .0968™ .3200 - .3007 - .1575™ .0999™ .3557 - .3250 - .1590™ 1000 1000 .0865™ .2871™ .2729™ .1493™ .0909™ .3329 - .3064 - .1511™ .9900 49 .1364 - .4214˜ .3550 - .1902˜ .1385˜ .4371˜œ .3743˜ .1921˜ .9500 68 .1352 - .4171˜ .3586 - .1912˜ .1386˜ .4500˜ œ .3836˜ .1932˜ .8000 95 .1318 - .4000 - .3536 - .1892 - .1365 - .4443˜œ .3871˜ .1924 - g .5000 151 .1196 - .3814 - .3393 - .1808 - .1226 - .4043 - .3550 - .1813 - .3333 237 .1085™ .3500 - .3000 - .1707 - .1121™ .3857 - .3364 - .1734 - .1000 711 .0864™ .2871™ .2621™ .1461™ .0909™ .3357 - .2964 - .1487™ .9900 42 .1342 - .4043 - .3414 - .1865 - .1375˜ .4371˜œ .3700˜ .1897˜ .9500 51 .1371 - .4214˜ .3586 - .1903˜ .1417˜œ .4500˜œ .3864˜œ .1924˜œ .8000 81 .1384˜ .4229˜ .3614 - .1921˜ .1427˜ œ .4629˜ œ .3871˜œ .1961˜ œ θ .5000 91 .1367 - .4057 - .3571 - .1919˜ .1397˜ .4400˜œ .3829˜ .1937˜ .3333 109 .1375 - .4129 - .3636 - .1933˜ .1404˜ .4500˜œ .3907˜œ .1949˜œ .1000 130 .1314 - .4100 - .3629 - .1866 - .1370 - .4371˜ .3843˜ .1922˜ image-only — .0058™ .0486™ .0479™ .0352™ .0112™ .0871™ .0886™ .0415™
  • 17. Dynamic Two-Stage Image Retrieval from Large Multimodal Databases Avi Arampatzis, Konstantinos Zagoris, Savvas Chatzichristofis 17 Results Static thresholding improves initial Precision at the cost of MAP and bpref. Dynamic thresholding on Precision or prel does not have this drawback. The level of static and precision thresholds influences greatly the effectiveness; unsuitable choices (e.g. too loose) lead to a degraded performance. Prel thresholds are much more robust in this respect. Better CBIR at the second stage leads to overall improvements, but the thresholding type seems more important: While the two CBIR methods vary greatly in performance (the best has almost double the effectiveness of the other), static thresholding is not influenced much by this choice. Dynamic methods benefit more from improved CBIR.
  • 18. Dynamic Two-Stage Image Retrieval from Large Multimodal Databases Avi Arampatzis, Konstantinos Zagoris, Savvas Chatzichristofis 18 Results
  • 19.
  • 20. Dynamic Two-Stage Image Retrieval from Large Multimodal Databases Avi Arampatzis, Konstantinos Zagoris, Savvas Chatzichristofis 20 Conclusions Two-stage retrieval with dynamic thresholding: more effective robust than static thresholding practically insensitive to a wide range of choices for the optimization measure beats significantly the text-only several image-only baselines Two-stage retrieval, irrespective of thresholding type: efficiency benefit: ¦ it cuts down greatly on expensive image operations ¦ on average, only 2 to 5 in 10,000 images had to be scored
  • 21. Dynamic Two-Stage Image Retrieval from Large Multimodal Databases Avi Arampatzis, Konstantinos Zagoris, Savvas Chatzichristofis 21 Further Research Compare two-stage retrieval to fusion. [Did this; check ECIR11 poster] Both methods work better than text-only image-only; two-stage is slightly better than fusion, but the difference is non-significant. Both methods are robust in different ways: ¦ Fusion provides less variability across topics but it is sensitive to the weighing parameter of the contributing media. ¦ Two-stage provides a much lower sensitivity to its thresholding parameter but has a higher variability across topics. Try two-stage retrieval with other type of image descriptors, e.g. the visual codebook approach (TOP-SURF). Generalization to multi-stage retrieval, where rankings for the media are successively being thresholded and re-ranked with respect to a media hierarchy.
  • 22. Dynamic Two-Stage Image Retrieval from Large Multimodal Databases Avi Arampatzis, Konstantinos Zagoris, Savvas Chatzichristofis 22 avi@ee.duth.gr Thank you !