SlideShare une entreprise Scribd logo
1  sur  106
Towards Web-Scale
Content-Aware Image Search

             Xian-Sheng Hua
     Lead Researcher, Microsoft Research Asia

               June 23 2010 - TUDelft
Xian-Sheng Hua, MSR Asia




About the Title
• Towards Web-Scale Content-Aware Image Search
  ̵   Towards …
  ̵   Web-Scale …
  ̵   Content-Aware …
  ̵   Image Search …
Xian-Sheng Hua, MSR Asia




Evolution of CBIR

                                      Annotation Based
  Commercial Search                                            ~2006 
      Engines
                                                ~2003     (1) Tagging Based
                                                          (2) Large-Scale CBIR/CBVR
       Conventional CBIR      ~2001
                            Direct-Text Based
                  1990’                           Academia
                                                  Prototypes

        1970~
        1980’
    Manual Labeling Based




                                                                                            3
Xian-Sheng Hua, MSR Asia




Three Schemes

                Example-Based


            Annotation-Based


                 Text-Based
Xian-Sheng Hua, MSR Asia




Limitation of Text-Based Search

                    Surroundings




                       Text



                                     Query
      Social Tags                  Association
Xian-Sheng Hua, MSR Asia




Limitation of Text-Based Search
• High noises
   ̵   Surrounding text: frequently not related to the visual content
   ̵   Social tags: also noisy, and a large portion don’t have tags

• Mostly only covers simple semantics
   ̵   Surrounding text: limited
   ̵   Social tags: People tend to only input simple and personal tags
   ̵   Query association: powerful but coverage is still limited (non-clicked
       images will never be well indexed)


       A complex example:
          Finding Lady Gaga walking on red carpet and wearing in black
Xian-Sheng Hua, MSR Asia




Limitation of Annotation-Based Search
• Low accuracy
  ̵   The accuracy of automatic annotation is still far from satisfactory
  ̵   Difficult to be solved

• Limited coverage
  ̵   The coverage of the semantic concepts is still far from the rich content
      that is contain in an image
  ̵   Difficult to extend

• High computation
  ̵   Learning costs high computation
  ̵   Recognition costs high if the number of concepts is large

                   Still has a long way to go
Xian-Sheng Hua, MSR Asia




Limitation of Large-Scale Example-Based Search
• You need an example first
   ̵   How to do it if I don’t have an initial example?

• Large-scale (semantic) similarity search is still difficult
   ̵   Though large-scale (near) duplicate detection is good




                     An example: TinEye
Xian-Sheng Hua, MSR Asia




Possible Way-Outs?
• Large-scale high-performance annotation?
   ̵   Model-based? Data-driven? Hybrid? …

• Large-scale manual labeling?
   ̵   ESP game? Label Me? Machinery Turk? …

• New query interface?
   ̵   Interactive search?




       Or … To combine text, content and interface?
Xian-Sheng Hua, MSR Asia




Simple Attempts
• Exemplary attempts based on this idea
  ̵ Search by dominant color (Google/Bing)
   ̵ Show similar images (Bing)/Find similar images (Google)
    ̵ Show more sizes (Bing)
Xian-Sheng Hua, MSR Asia




Our Solutions: Two Recent Projects
• Image Search by Color Sketch
• Image Search by Concept Sketch
Search by Color Sketch
      WWW 2010 Demo
Xian-Sheng Hua, MSR Asia




Seems It Is Not New?
• “Query by sketch” has been proposed long time ago
  ̵   But on small datasets and difficult to scale up
  ̵   And seldom work well actually
  ̵   Not use to use – “object sketch”
It is based on a SIGGRAPH 95’s paper:
       C. Jacobs et al. Fast Multiresolution Image Querying. SIGGRAPH 1995
on small scale image set and difficult to scale-up (0.44 second for 10,000 images)
Xian-Sheng Hua, MSR Asia
Xian-Sheng Hua, MSR Asia




Our Approach
• Color sketch …
  ̵   Is not “object sketch”
  ̵   But only rough color distribution
  ̵   Supports large-scale data
  ̵   And uses text at the same time
Xian-Sheng Hua, MSR Asia




Our Approach
• Color sketch …
  ̵   Is not “object sketch”
  ̵   But only rough color distribution
  ̵   Supports large-scale data
  ̵   And uses text at the same time

• Advantages of “Color Sketch”
  ̵   More presentative than dominant color (and text only)
  ̵   More robust than “object sketch”
  ̵   Easy to use compared with “object sketch”
  ̵   Easy to scale up
Xian-Sheng Hua, MSR Asia




Example: Blue flower with green leaf
Xian-Sheng Hua, MSR Asia




Example: Brooke Hogan walking on red carpet
Xian-Sheng Hua, MSR Asia




Color Sketch Is More Powerful
  Blue Flower with Green Leaf   Brooke Hogan walking on red carpet
Xian-Sheng Hua, MSR Asia




Color Sketch Is More Powerful




                                                  32
Xian-Sheng Hua, MSR Asia




  The Techniques
• Color Sketch: Image Representation
   ̵   Capturing both color and spatial information
   ̵   Low computation and storage costs

• Intention Map: Intention Representation
   ̵   Rough: Tolerance to the errors in the colors and positions of the strokes
   ̵   Sparse: Intention estimation from sparse strokes

• Similarity Evaluation
   ̵   Effective and fast
Xian-Sheng Hua, MSR Asia




Color Sketch: Image Representation




Image with grids                         Color Sketch


                   Computation Cost: less than 50ms/image
                   Storage Cost: 80 byte in average/image
Xian-Sheng Hua, MSR Asia




Intention Map:
From User’s Input to Intention Estimation




  Target: user’s input

            (1) Do not require users to input the color map exactly
            (2) Robust to rough and sparse input
Xian-Sheng Hua, MSR Asia




                           >
        Target   Image 1       Image 2




2010/6/24
Xian-Sheng Hua, MSR Asia




Intention Analysis 2: Relation



                           >
        Target   Image 1       Image 2




2010/6/24
Xian-Sheng Hua, MSR Asia




Intention Analysis 3: Intention Propagation




                                                         <
      Propagated                        Image 1                          Image 2

Since the target color sketch is rough, the cells nearby the scribbled cells may have larger
chance to have the similar color. Hence, image 2 should be more similar as the blue color
appearing in the cells near the scribbled cells.


2010/6/24
Xian-Sheng Hua, MSR Asia




Intention Map: An Example




      User’s input

                                                  Intention map for blue
  In scribbled cells, the intention weight is the largest, nearby cells has relatively smaller
  intention weight, but the cells scribbled in green and its nearby cells have negative
  intention weight for blue color.
Xian-Sheng Hua, MSR Asia




Similarity Evaluation
• Image color map

• Intention map

• Similarity
Xian-Sheng Hua, MSR Asia




Experiment: Intention Justification
                                                    C
                                                    C+R
                                                    C+P
                                                    C+R+P




        C: Consistency    R: Relation    P: Propagation
             51 text query, each 3 color sketches
Xian-Sheng Hua, MSR Asia




Sensitivity to Grid Dimension
Xian-Sheng Hua, MSR Asia




Comparison
Xian-Sheng Hua, MSR Asia




Demo
• It has been productized in Bing
Gambia flag
Xian-Sheng Hua, MSR Asia




 Summary – Why it Works
• What are difficult
   ̵   Semantics is difficult
   ̵   Intent prediction is difficult

• What are easier
   ̵   Use color to describe images’ content
   ̵   Use color to describe users’ intent
                   content                color sketch              intent




   Color sketch is an intermediate representation to connect images’ content and users’ intent.
Xian-Sheng Hua, MSR Asia




Limitation of Color Sketch
• Only works well for those semantics that can
  be described by color distribution
• Only works well when people are able to
  transfer their desire into color distribution



      A more complex example:
        A flying butterfly on the top-left of a flower.
Search by Concept Sketch
         SIGIR 2010
Xian-Sheng Hua, MSR Asia




Search By Concept Sketch
• A system for the image search intentions concerning:
   ̵   The presence of the semantic concepts (semantic constraints)
   ̵   the spatial layout of the concepts (spatial constraints)




        butterfly

              flower



 A concept map query             Image search results of our system
Xian-Sheng Hua, MSR Asia




 Framework
A concept map query                                                           A candidate image
                                1
    A: butterfly                    Create Candidate
    B: flower                          Image Set
       butterfly
    A: X=0.5, Y=0.5
    B: X=0.8, flower
              Y=0.8

                                2                                        3
                                    Instant Concept                          Concept Distribution
                                       Modeling                                  Estimation
                                                  Concept Models                      Estimated distributions

                                               butterfly                               butterfly
                   4                              flower
                            Intent                                                      flower
                        Interpretation
        Desired distributions
                                              5
                                                  Spatial distribution
         butterfly                                   comparison

           flower
                                                     Relevance score
Xian-Sheng Hua, MSR Asia




Creating Candidate Image Set
• Goal: get a group of candidate images from Web
     ̵   Ensure the presences of the concepts
     ̵   Reduce computational cost


                   butterfly flower


                   flower butterfly
A: butterfly
                                                             Candidate Images:
B: flower
                                                             Images related to
                   butterfly                                 butterfly & flower

                   flower                                  Training data for
                                                           concept “butterfly”
                                                Training data for
                                                concept “flower”
Xian-Sheng Hua, MSR Asia




Instant Concept Modeling
• A possible solution – manual tag-to-region




            Manual tagging is expensive
Xian-Sheng Hua, MSR Asia




Instant Concept Modeling
• Goal: To learn a simple (but works well on the
  candidate image set) visual detector for each concept

 butterfly




                                                                 [Visual instances]

                                                                                      …

Training data for                                                   Instance-based
   “butterfly”                                                     visual detector of
                                                                       “butterfly”

                    Graph-based representative image selection
Xian-Sheng Hua, MSR Asia




Concept Distribution Estimation
• Goal: Automatic concept localization leveraging Web-
  image resources

                       Web images
                                           A:   A candidate image


                     Instant Concept       B:      Distribution
    A: butterfly
    B: flower           Modeling                   estimation


                                                Estimated distributions

  Advantages: Scalable to Web-scale concepts      butterfly
  by exploiting rich Web-image resources

                                                   flower
Xian-Sheng Hua, MSR Asia




Concept Distribution Estimation
  • Goal: estimate the distribution of a concept in an image


        [Visual instances]

                         …

                                        A visual instance
                                                             A candidate image


                                               …


                                               …

Estimated distributions of representative butterflies
              in the candidate image                        Estimated distribution
Xian-Sheng Hua, MSR Asia




Intention Interpretation
• Goal: represent user’s spatial intention as 2D distributions
• Principles
      ̵      A concept should appear near the specified position
      ̵      A concept should not appear at the positions of the other concepts




                         Desired distributions                             Desired distributions

 butterfly                                              butterfly
                          butterfly                                         butterfly
                                                               flower


                                                                             flower
Xian-Sheng Hua, MSR Asia




Spatial Distribution Comparison
• Goal: compute the similarity of the desired distributions and
  the estimated distributions
   Desired distributions                             Estimated distributions


    butterfly                                                                  …
                                                          butterfly

      flower
                                                                               …
                                                           flower


                           Distribution comparison


                                   Fusion


                              Relevance score
Xian-Sheng Hua, MSR Asia




Demo & Experiments
Xian-Sheng Hua, MSR Asia




 Search Results Comparison
  Task: searching for the images with “snoopy appear at right”



    snoopy
 Text-based image search




   “show similar image”




              snoopy




Image search by concept map
Xian-Sheng Hua, MSR Asia




Search Results
Task: searching for the images with “snoopy appear at top” / “snoopy appear at left”
Xian-Sheng Hua, MSR Asia




Search Results Comparison
Task: searching for the images with “a jeep shown above a piece of grass”




  jeep grass
Xian-Sheng Hua, MSR Asia




Search Results Comparison
Task: searching for the images with “a car parked in front of a house”




  house car
Xian-Sheng Hua, MSR Asia




Search Results Comparison
Task: searching for the images with “a keyboard and a mouse being side by side”




  keyboard mouse
Xian-Sheng Hua, MSR Asia




Search Results Comparison
Task: searching for the images with “sky/Colosseum/grass appear from top to bottom”




  Colosseum grass
Xian-Sheng Hua, MSR Asia




Quantitative Evaluation
Xian-Sheng Hua, MSR Asia




  Advanced Function:
  Influence Scope Adjustment
• Allow users to explicitly specify the occupied region of a
  concept


            house
                    Default
                                           house


     lawn                           lawn




                                           house


                                    lawn




                              Search for the long narrow lawn at the bottom of the image
Xian-Sheng Hua, MSR Asia




Advanced Function:
Influence Scope Adjustment



      Searching for small windmill




      Searching for large windmill
Xian-Sheng Hua, MSR Asia




Advanced Function:
Influence Scope Adjustment
Xian-Sheng Hua, MSR Asia




Advanced Function:
Visual Modeling Adjustment
                               sky


                           house


                               lawn


                 Clear                       Search

                         Advanced Function
                 Visual Assistor




                                                      Previous   Next




  Allow users to indicate or narrow down
  what a desired concept looks like (visual constraints)
Xian-Sheng Hua, MSR Asia




Advanced Function:
Visual Modeling Adjustment


                Searching for
                front-view jeeps

 jeep




               Searching for
               side-view jeeps




        Visual instances
Xian-Sheng Hua, MSR Asia




Advanced Function:
Visual Modeling Adjustment


                  Searching for
                  butterfly with
                  yellow flower

 butterfly


       flower
                  Searching for
                  butterfly with
                  red flower




                Visual instances
Xian-Sheng Hua, MSR Asia




Conclusions
• The approaches of realizing web-scale content-
  aware image search
  ̵ Combing text and content
   ̵ Providing interface to express search intent
    ̵ Indexing and searching efficiently

• Two exemplary approaches introduced
  ̵ Search by color sketch
   ̵ Search by concept sketch
Xian-Sheng Hua, MSR Asia




Thank You

Contenu connexe

Dernier

A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 

Dernier (20)

A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 

En vedette

PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024Neil Kimberley
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)contently
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024Albert Qian
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsKurio // The Social Media Age(ncy)
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Search Engine Journal
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summarySpeakerHub
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next Tessa Mero
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentLily Ray
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best PracticesVit Horky
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project managementMindGenius
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...RachelPearson36
 
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Applitools
 
12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at WorkGetSmarter
 
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...DevGAMM Conference
 
Barbie - Brand Strategy Presentation
Barbie - Brand Strategy PresentationBarbie - Brand Strategy Presentation
Barbie - Brand Strategy PresentationErica Santiago
 

En vedette (20)

PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
 
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
 
12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work
 
ChatGPT webinar slides
ChatGPT webinar slidesChatGPT webinar slides
ChatGPT webinar slides
 
More than Just Lines on a Map: Best Practices for U.S Bike Routes
More than Just Lines on a Map: Best Practices for U.S Bike RoutesMore than Just Lines on a Map: Best Practices for U.S Bike Routes
More than Just Lines on a Map: Best Practices for U.S Bike Routes
 
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
 
Barbie - Brand Strategy Presentation
Barbie - Brand Strategy PresentationBarbie - Brand Strategy Presentation
Barbie - Brand Strategy Presentation
 

Towards Web Scale Image Search

  • 1. Towards Web-Scale Content-Aware Image Search Xian-Sheng Hua Lead Researcher, Microsoft Research Asia June 23 2010 - TUDelft
  • 2. Xian-Sheng Hua, MSR Asia About the Title • Towards Web-Scale Content-Aware Image Search ̵ Towards … ̵ Web-Scale … ̵ Content-Aware … ̵ Image Search …
  • 3. Xian-Sheng Hua, MSR Asia Evolution of CBIR Annotation Based Commercial Search ~2006  Engines ~2003 (1) Tagging Based (2) Large-Scale CBIR/CBVR Conventional CBIR ~2001 Direct-Text Based 1990’ Academia Prototypes 1970~ 1980’ Manual Labeling Based 3
  • 4. Xian-Sheng Hua, MSR Asia Three Schemes Example-Based Annotation-Based Text-Based
  • 5. Xian-Sheng Hua, MSR Asia Limitation of Text-Based Search Surroundings Text Query Social Tags Association
  • 6. Xian-Sheng Hua, MSR Asia Limitation of Text-Based Search • High noises ̵ Surrounding text: frequently not related to the visual content ̵ Social tags: also noisy, and a large portion don’t have tags • Mostly only covers simple semantics ̵ Surrounding text: limited ̵ Social tags: People tend to only input simple and personal tags ̵ Query association: powerful but coverage is still limited (non-clicked images will never be well indexed) A complex example: Finding Lady Gaga walking on red carpet and wearing in black
  • 7.
  • 8. Xian-Sheng Hua, MSR Asia Limitation of Annotation-Based Search • Low accuracy ̵ The accuracy of automatic annotation is still far from satisfactory ̵ Difficult to be solved • Limited coverage ̵ The coverage of the semantic concepts is still far from the rich content that is contain in an image ̵ Difficult to extend • High computation ̵ Learning costs high computation ̵ Recognition costs high if the number of concepts is large Still has a long way to go
  • 9. Xian-Sheng Hua, MSR Asia Limitation of Large-Scale Example-Based Search • You need an example first ̵ How to do it if I don’t have an initial example? • Large-scale (semantic) similarity search is still difficult ̵ Though large-scale (near) duplicate detection is good An example: TinEye
  • 10. Xian-Sheng Hua, MSR Asia Possible Way-Outs? • Large-scale high-performance annotation? ̵ Model-based? Data-driven? Hybrid? … • Large-scale manual labeling? ̵ ESP game? Label Me? Machinery Turk? … • New query interface? ̵ Interactive search? Or … To combine text, content and interface?
  • 11. Xian-Sheng Hua, MSR Asia Simple Attempts • Exemplary attempts based on this idea ̵ Search by dominant color (Google/Bing) ̵ Show similar images (Bing)/Find similar images (Google) ̵ Show more sizes (Bing)
  • 12.
  • 13.
  • 14.
  • 15.
  • 16.
  • 17.
  • 18. Xian-Sheng Hua, MSR Asia Our Solutions: Two Recent Projects • Image Search by Color Sketch • Image Search by Concept Sketch
  • 19. Search by Color Sketch WWW 2010 Demo
  • 20. Xian-Sheng Hua, MSR Asia Seems It Is Not New? • “Query by sketch” has been proposed long time ago ̵ But on small datasets and difficult to scale up ̵ And seldom work well actually ̵ Not use to use – “object sketch”
  • 21.
  • 22. It is based on a SIGGRAPH 95’s paper: C. Jacobs et al. Fast Multiresolution Image Querying. SIGGRAPH 1995 on small scale image set and difficult to scale-up (0.44 second for 10,000 images)
  • 24.
  • 25. Xian-Sheng Hua, MSR Asia Our Approach • Color sketch … ̵ Is not “object sketch” ̵ But only rough color distribution ̵ Supports large-scale data ̵ And uses text at the same time
  • 26.
  • 27.
  • 28. Xian-Sheng Hua, MSR Asia Our Approach • Color sketch … ̵ Is not “object sketch” ̵ But only rough color distribution ̵ Supports large-scale data ̵ And uses text at the same time • Advantages of “Color Sketch” ̵ More presentative than dominant color (and text only) ̵ More robust than “object sketch” ̵ Easy to use compared with “object sketch” ̵ Easy to scale up
  • 29. Xian-Sheng Hua, MSR Asia Example: Blue flower with green leaf
  • 30. Xian-Sheng Hua, MSR Asia Example: Brooke Hogan walking on red carpet
  • 31. Xian-Sheng Hua, MSR Asia Color Sketch Is More Powerful Blue Flower with Green Leaf Brooke Hogan walking on red carpet
  • 32. Xian-Sheng Hua, MSR Asia Color Sketch Is More Powerful 32
  • 33. Xian-Sheng Hua, MSR Asia The Techniques • Color Sketch: Image Representation ̵ Capturing both color and spatial information ̵ Low computation and storage costs • Intention Map: Intention Representation ̵ Rough: Tolerance to the errors in the colors and positions of the strokes ̵ Sparse: Intention estimation from sparse strokes • Similarity Evaluation ̵ Effective and fast
  • 34. Xian-Sheng Hua, MSR Asia Color Sketch: Image Representation Image with grids Color Sketch Computation Cost: less than 50ms/image Storage Cost: 80 byte in average/image
  • 35. Xian-Sheng Hua, MSR Asia Intention Map: From User’s Input to Intention Estimation Target: user’s input (1) Do not require users to input the color map exactly (2) Robust to rough and sparse input
  • 36. Xian-Sheng Hua, MSR Asia > Target Image 1 Image 2 2010/6/24
  • 37. Xian-Sheng Hua, MSR Asia Intention Analysis 2: Relation > Target Image 1 Image 2 2010/6/24
  • 38. Xian-Sheng Hua, MSR Asia Intention Analysis 3: Intention Propagation < Propagated Image 1 Image 2 Since the target color sketch is rough, the cells nearby the scribbled cells may have larger chance to have the similar color. Hence, image 2 should be more similar as the blue color appearing in the cells near the scribbled cells. 2010/6/24
  • 39. Xian-Sheng Hua, MSR Asia Intention Map: An Example User’s input Intention map for blue In scribbled cells, the intention weight is the largest, nearby cells has relatively smaller intention weight, but the cells scribbled in green and its nearby cells have negative intention weight for blue color.
  • 40. Xian-Sheng Hua, MSR Asia Similarity Evaluation • Image color map • Intention map • Similarity
  • 41. Xian-Sheng Hua, MSR Asia Experiment: Intention Justification C C+R C+P C+R+P C: Consistency R: Relation P: Propagation 51 text query, each 3 color sketches
  • 42. Xian-Sheng Hua, MSR Asia Sensitivity to Grid Dimension
  • 43. Xian-Sheng Hua, MSR Asia Comparison
  • 44. Xian-Sheng Hua, MSR Asia Demo • It has been productized in Bing
  • 45.
  • 46.
  • 47.
  • 48.
  • 49.
  • 50.
  • 51.
  • 52.
  • 53.
  • 54.
  • 55.
  • 56.
  • 57.
  • 58.
  • 59.
  • 60.
  • 61.
  • 63.
  • 64.
  • 65.
  • 66.
  • 67.
  • 68.
  • 69.
  • 70.
  • 71.
  • 72.
  • 73.
  • 74.
  • 75.
  • 76.
  • 77.
  • 78.
  • 79. Xian-Sheng Hua, MSR Asia Summary – Why it Works • What are difficult ̵ Semantics is difficult ̵ Intent prediction is difficult • What are easier ̵ Use color to describe images’ content ̵ Use color to describe users’ intent content color sketch intent Color sketch is an intermediate representation to connect images’ content and users’ intent.
  • 80. Xian-Sheng Hua, MSR Asia Limitation of Color Sketch • Only works well for those semantics that can be described by color distribution • Only works well when people are able to transfer their desire into color distribution A more complex example: A flying butterfly on the top-left of a flower.
  • 81. Search by Concept Sketch SIGIR 2010
  • 82. Xian-Sheng Hua, MSR Asia Search By Concept Sketch • A system for the image search intentions concerning: ̵ The presence of the semantic concepts (semantic constraints) ̵ the spatial layout of the concepts (spatial constraints) butterfly flower A concept map query Image search results of our system
  • 83. Xian-Sheng Hua, MSR Asia Framework A concept map query A candidate image 1 A: butterfly Create Candidate B: flower Image Set butterfly A: X=0.5, Y=0.5 B: X=0.8, flower Y=0.8 2 3 Instant Concept Concept Distribution Modeling Estimation Concept Models Estimated distributions butterfly butterfly 4 flower Intent flower Interpretation Desired distributions 5 Spatial distribution butterfly comparison flower Relevance score
  • 84. Xian-Sheng Hua, MSR Asia Creating Candidate Image Set • Goal: get a group of candidate images from Web ̵ Ensure the presences of the concepts ̵ Reduce computational cost butterfly flower flower butterfly A: butterfly Candidate Images: B: flower Images related to butterfly butterfly & flower flower Training data for concept “butterfly” Training data for concept “flower”
  • 85. Xian-Sheng Hua, MSR Asia Instant Concept Modeling • A possible solution – manual tag-to-region Manual tagging is expensive
  • 86. Xian-Sheng Hua, MSR Asia Instant Concept Modeling • Goal: To learn a simple (but works well on the candidate image set) visual detector for each concept butterfly [Visual instances] … Training data for Instance-based “butterfly” visual detector of “butterfly” Graph-based representative image selection
  • 87. Xian-Sheng Hua, MSR Asia Concept Distribution Estimation • Goal: Automatic concept localization leveraging Web- image resources Web images A: A candidate image Instant Concept B: Distribution A: butterfly B: flower Modeling estimation Estimated distributions Advantages: Scalable to Web-scale concepts butterfly by exploiting rich Web-image resources flower
  • 88. Xian-Sheng Hua, MSR Asia Concept Distribution Estimation • Goal: estimate the distribution of a concept in an image [Visual instances] … A visual instance A candidate image … … Estimated distributions of representative butterflies in the candidate image Estimated distribution
  • 89. Xian-Sheng Hua, MSR Asia Intention Interpretation • Goal: represent user’s spatial intention as 2D distributions • Principles ̵ A concept should appear near the specified position ̵ A concept should not appear at the positions of the other concepts Desired distributions Desired distributions butterfly butterfly butterfly butterfly flower flower
  • 90. Xian-Sheng Hua, MSR Asia Spatial Distribution Comparison • Goal: compute the similarity of the desired distributions and the estimated distributions Desired distributions Estimated distributions butterfly … butterfly flower … flower Distribution comparison Fusion Relevance score
  • 91. Xian-Sheng Hua, MSR Asia Demo & Experiments
  • 92. Xian-Sheng Hua, MSR Asia Search Results Comparison Task: searching for the images with “snoopy appear at right” snoopy Text-based image search “show similar image” snoopy Image search by concept map
  • 93. Xian-Sheng Hua, MSR Asia Search Results Task: searching for the images with “snoopy appear at top” / “snoopy appear at left”
  • 94. Xian-Sheng Hua, MSR Asia Search Results Comparison Task: searching for the images with “a jeep shown above a piece of grass” jeep grass
  • 95. Xian-Sheng Hua, MSR Asia Search Results Comparison Task: searching for the images with “a car parked in front of a house” house car
  • 96. Xian-Sheng Hua, MSR Asia Search Results Comparison Task: searching for the images with “a keyboard and a mouse being side by side” keyboard mouse
  • 97. Xian-Sheng Hua, MSR Asia Search Results Comparison Task: searching for the images with “sky/Colosseum/grass appear from top to bottom” Colosseum grass
  • 98. Xian-Sheng Hua, MSR Asia Quantitative Evaluation
  • 99. Xian-Sheng Hua, MSR Asia Advanced Function: Influence Scope Adjustment • Allow users to explicitly specify the occupied region of a concept house Default house lawn lawn house lawn Search for the long narrow lawn at the bottom of the image
  • 100. Xian-Sheng Hua, MSR Asia Advanced Function: Influence Scope Adjustment Searching for small windmill Searching for large windmill
  • 101. Xian-Sheng Hua, MSR Asia Advanced Function: Influence Scope Adjustment
  • 102. Xian-Sheng Hua, MSR Asia Advanced Function: Visual Modeling Adjustment sky house lawn Clear Search Advanced Function Visual Assistor Previous Next Allow users to indicate or narrow down what a desired concept looks like (visual constraints)
  • 103. Xian-Sheng Hua, MSR Asia Advanced Function: Visual Modeling Adjustment Searching for front-view jeeps jeep Searching for side-view jeeps Visual instances
  • 104. Xian-Sheng Hua, MSR Asia Advanced Function: Visual Modeling Adjustment Searching for butterfly with yellow flower butterfly flower Searching for butterfly with red flower Visual instances
  • 105. Xian-Sheng Hua, MSR Asia Conclusions • The approaches of realizing web-scale content- aware image search ̵ Combing text and content ̵ Providing interface to express search intent ̵ Indexing and searching efficiently • Two exemplary approaches introduced ̵ Search by color sketch ̵ Search by concept sketch
  • 106. Xian-Sheng Hua, MSR Asia Thank You