SlideShare une entreprise Scribd logo
1  sur  48
Seam Carving for Content-Aware Image Resizing Shai  Aidan (Mitsubishi Electric Research Labs) Ariel Shamir (The Interdisciplinary Center & MERL) ACM SIGGRAPH 2007
Resize Seam carving & insertion
Abstract Seams are optimal 8-connected paths of pixels cross the image Carving out or inserting seams to achieve content-aware resizing
Outline Introduction Background Seam-carving operator Discrete image resizing Multi-size images Limitations Conclusions and future work
Introduction
Motivation HTML can support dynamic changes of page layout and text.  Why can not an image deform to fit different layout automatically ? iGoogle How about aspect ratio of an image , such as fitting photo into PDA or phone cells ? Solution ? Resize – content independent Crop – remove pixels from the image periphery only
Basic Idea of Seam-Carving Use energy function to define the importanceof pixels Define seam-carving image operator Image reduction Carving out seams - the connected low energy pixels crossing the image  Preserving the image structure Image enlarging Insert seams on low energy area The order of seam insertion ensures a balance between the  original image content and the artificially inserted pixels
Application Discrete image resizing Aspect Ration Change,  Image Retarget,  Image Enlarging,  Content Amplification,  Seam Carving in gradient domain, Object Removal Multi-size images An image can continuously change their size in a content-aware manner Storing the order of seam removal and insertion
background
Image Retarget Seek to change the size of the image while maintaining the important features  Face detector An automatic thumbnail creation [Suh03]   ROI Fisheye-View warp [Liu and Gleicher 05, 06] Visual saliency []  [Suh 03] [Selur 04,  decompose image to foreground obj and background origin
Saliency map [Itti IEEE99] Simulate neuroscience of human visual system Pyramid tech. to compute 3 feature maps, color, intensity and orientation [Suh 03],  an automatic thumbnail creation, based on either a saliency map or the output of a face detector [Chen 03],  adapting most important region of images to mobile devices.
[Liu 03],  suggesting to trade time for space. Given a collection of regions of interest, they construct an optimal path through these regions and display them serially.
[Santella et al. 06] use eye tracking, in addition to composition rules to crop images intelligently.
ROI (Region-Of-Interest) Such a method was proposed by [Liu and Gleicher 05, 06] for image and video retargeting. For image retargeting they find ROI and construct a novel Fisheye-View warp that essentially applies a piecewise linear scaling function in each dimension to the image. This way the ROI is maintained while the rest of the image is warped. The retargeting can be done in interactive rates, once the ROI is found, so the user can control the desired size of the image by moving a slider. In their video retargeting work they use a combination of image and saliency maps to find the ROI. Then they use a combination of cropping, virtual pan and shot cuts to retarget the video frames.
Feature-aware warping The first solution to the general problem of warping an image into an arbitrary shape while preserving user-specified features was recently proposed by [Gal et al. 06].  The feature-aware warping is achieved by a particular formulation of the Laplacian editing technique, suited to accommodate similarity constraints on parts of the domain.  Since local constraints are propagated by the global optimization process, not all the constraints can always be satisfied at once
Seam Perfect seams to combine parts of a set of photo into a single composite picture [Agarwala et al. 04]  Drag-and-Drop Pasting that extends the Poisson Image Editing to computer an optimal boundary (seam) between the source picture and target images [Jia et al. 06] AutoCollage,  a program that automatically creates a collage image from a collection of images.  [Rother et al. 06] Simultaneously solve matting and compositing. They allow the user to scale the size of the foreground object and paste it back on the original background. [Wang , Cohen 06] evaluated several cost functions for seamless image stitching and concluded that minimizing an L1error norm between the gradients of the stitched image and the gradients of the input images performed well in general [Zomet et al. 05]
Sear Optimal Seam Dijkstra’s shortest path algorithm [98] Dynamic programming [Efros 01] Graph cuts [Kwatra 03]
Seam-carving operator
Strategies of Image Reduction Original e1 energy Optimal  global remove the lowest energy pixels Pixel remove the least energy in each row
Strategies of Image Reduction Original e1 energy Column removing columns with minimal energy Crop find a sub-win with the highest energy
Vertical Seam
Horizontal Seam
Optimal Seam Search
Optimal Seam Search S Dynamic Programming G
e1 energy
Image Energy Preservation The average energy of all pixels during resizing
Energy Functions L1 and L2-norm of the gradient,  saliency measure [Itti 99]
Histogram of Gradient  (HoG) ,[object Object],Dividing the image window into cells For each cell accumulating a local 1-D histogram of gradient directions Normalize cells by the measure of local histogram energy over larger blocks The average gradient image Weighted R-HOG descriptor R-HOG descriptor
Energy Functions Histogram of Gradient  (HoG) [Dalal and Triggs 95] max(HoG(I(x,y)) makes sure the seams run parallel to the edge of objects and not cross them
Energy Functions Entropy  Compute the entropy over a 9 x 9 window and add it to e1 eEntropy(x,y) = +  e1 (x,y)
Energy Functions Segmentation and L1 Image segmentation [Christoudias 02]  Apply e1 on the results
No single e function performs well across all images Similar range for resizing e1 or eHoG works well
Discrete image resizing  Aspect Ratio Change, Retargeting with Optimal Seams-Order,  Image Enlarging, Content Amplification,
Aspect Ratio Change Carving-out /insert seams Original Original Original 1D aspect ratio changing
2D aspect ratio changing Optimal Seams-Order Search + DynamicProgramming =  + min
Retargeting with Optimal Seams-Order h-first alternate v-first Transport map Original optimal
Image Enlarging Find first k seams for removal Duplicate them in order to arrive at I(-k) origin I(t): smaller image after t seam-carving I(-k): enlarged image after k seam insertion t I(-1) I(-k) I(t) I(-k) enlarged image insert seams in order of removal
Image Enlarging (>50%) origin Break into several steps Each step does not enlarge the size of image more than a fraction
Content Amplification Amplified Original
Seam Carving in the Gradient Domain Seam + Poisson Reconstruction [Perez 03] Compute e function Work on the gradient domain Remove seams from the x and y derivatives of the original image Use Poisson Reconstruction original retarget retarget in  Gradient Domain
Object Removal Mark the removing target Remove seams until all the marked pixels are gone * Employ seam insertion to maintain the original size
Object Removal Origin
Multi-size images Store the pre-computed  representation that encodes, for each pixel in V/H map The index of the seam that removed it The negative index of the seam that inserted it Blue (first seam)  Red (last seam) origin V(i,j)=t : pixel (i,j) removed by t-th vertical seam H(i,j)=t : pixel (i,j) removed by t-th horizontal seam
Limitations Seam-Carving  does not work well on all images 	Ex:  face Origin Crop Scale Constraint the face Face the flower Bottom up feature detection
Limitations The amount of content Too density, no “less” important area The layout of the image content origin origin
Conclusions Present a content-aware resizing using the seam-carving image operator Seams are the optimal paths on a single image Carve-out seams Insert seams Application of seam-carving operator Aspect ratio change, image retargeting, content amplification, object removal Multi-size images that support continuous resizingin real-time
Future Work Video resizing Combination of scaling and seam-carving Define more robust multi-size image Better solution to combine horizontal and vertical seams in multi-size image
END

Contenu connexe

Tendances

Picture box control
Picture box controlPicture box control
Picture box controlchauhankapil
 
Real-time object detection coz YOLO!
Real-time object detection coz YOLO!Real-time object detection coz YOLO!
Real-time object detection coz YOLO!J On The Beach
 
A Brief History of Object Detection / Tommi Kerola
A Brief History of Object Detection / Tommi KerolaA Brief History of Object Detection / Tommi Kerola
A Brief History of Object Detection / Tommi KerolaPreferred Networks
 
Image segmentation
Image segmentationImage segmentation
Image segmentationDeepak Kumar
 
Constructor and Destructor in c++
Constructor  and Destructor in c++Constructor  and Destructor in c++
Constructor and Destructor in c++aleenaguen
 
C mcq practice test 4
C mcq practice test 4C mcq practice test 4
C mcq practice test 4Aman Kamboj
 
Template Matching - Pattern Recognition
Template Matching - Pattern RecognitionTemplate Matching - Pattern Recognition
Template Matching - Pattern RecognitionMustafa Salam
 
C Programming: Structure and Union
C Programming: Structure and UnionC Programming: Structure and Union
C Programming: Structure and UnionSelvaraj Seerangan
 
Visual Basic menu
Visual Basic menuVisual Basic menu
Visual Basic menukuldeep94
 
You only look once (YOLO) : unified real time object detection
You only look once (YOLO) : unified real time object detectionYou only look once (YOLO) : unified real time object detection
You only look once (YOLO) : unified real time object detectionEntrepreneur / Startup
 
Mitchell's Face Recognition
Mitchell's Face RecognitionMitchell's Face Recognition
Mitchell's Face Recognitionbutest
 
Codetecon #KRK 3 - Object detection with Deep Learning
Codetecon #KRK 3 - Object detection with Deep LearningCodetecon #KRK 3 - Object detection with Deep Learning
Codetecon #KRK 3 - Object detection with Deep LearningMatthew Opala
 
05. Conditional Statements
05. Conditional Statements05. Conditional Statements
05. Conditional StatementsIntro C# Book
 

Tendances (20)

Picture box control
Picture box controlPicture box control
Picture box control
 
Real-time object detection coz YOLO!
Real-time object detection coz YOLO!Real-time object detection coz YOLO!
Real-time object detection coz YOLO!
 
A Brief History of Object Detection / Tommi Kerola
A Brief History of Object Detection / Tommi KerolaA Brief History of Object Detection / Tommi Kerola
A Brief History of Object Detection / Tommi Kerola
 
I/O Streams
I/O StreamsI/O Streams
I/O Streams
 
Image segmentation
Image segmentationImage segmentation
Image segmentation
 
Constructor and Destructor in c++
Constructor  and Destructor in c++Constructor  and Destructor in c++
Constructor and Destructor in c++
 
C mcq practice test 4
C mcq practice test 4C mcq practice test 4
C mcq practice test 4
 
Template Matching - Pattern Recognition
Template Matching - Pattern RecognitionTemplate Matching - Pattern Recognition
Template Matching - Pattern Recognition
 
Analysis modelling
Analysis modellingAnalysis modelling
Analysis modelling
 
Chapter 6 Intermediate Code Generation
Chapter 6   Intermediate Code GenerationChapter 6   Intermediate Code Generation
Chapter 6 Intermediate Code Generation
 
C Programming: Structure and Union
C Programming: Structure and UnionC Programming: Structure and Union
C Programming: Structure and Union
 
Visual Basic menu
Visual Basic menuVisual Basic menu
Visual Basic menu
 
You only look once (YOLO) : unified real time object detection
You only look once (YOLO) : unified real time object detectionYou only look once (YOLO) : unified real time object detection
You only look once (YOLO) : unified real time object detection
 
Mitchell's Face Recognition
Mitchell's Face RecognitionMitchell's Face Recognition
Mitchell's Face Recognition
 
kmaps
 kmaps kmaps
kmaps
 
Codetecon #KRK 3 - Object detection with Deep Learning
Codetecon #KRK 3 - Object detection with Deep LearningCodetecon #KRK 3 - Object detection with Deep Learning
Codetecon #KRK 3 - Object detection with Deep Learning
 
Html lists
Html listsHtml lists
Html lists
 
Input and Interaction
Input and InteractionInput and Interaction
Input and Interaction
 
Seminar State Chart1
Seminar State Chart1Seminar State Chart1
Seminar State Chart1
 
05. Conditional Statements
05. Conditional Statements05. Conditional Statements
05. Conditional Statements
 

Similaire à study Seam Carving For Content Aware Image Resizing

Medial Axis Transformation based Skeletonzation of Image Patterns using Image...
Medial Axis Transformation based Skeletonzation of Image Patterns using Image...Medial Axis Transformation based Skeletonzation of Image Patterns using Image...
Medial Axis Transformation based Skeletonzation of Image Patterns using Image...IOSR Journals
 
A Detailed Analysis on Feature Extraction Techniques of Panoramic Image Stitc...
A Detailed Analysis on Feature Extraction Techniques of Panoramic Image Stitc...A Detailed Analysis on Feature Extraction Techniques of Panoramic Image Stitc...
A Detailed Analysis on Feature Extraction Techniques of Panoramic Image Stitc...IJEACS
 
F0255046056
F0255046056F0255046056
F0255046056theijes
 
Research Paper v2.0
Research Paper v2.0Research Paper v2.0
Research Paper v2.0Kapil Tiwari
 
Design and Implementation of VLSI Architecture for Image Scaling Processor
Design and Implementation of VLSI Architecture for Image  Scaling ProcessorDesign and Implementation of VLSI Architecture for Image  Scaling Processor
Design and Implementation of VLSI Architecture for Image Scaling ProcessorIJMER
 
paper writing
paper writingpaper writing
paper writingMukesh M
 
Performance analysis on color image mosaicing techniques on FPGA
Performance analysis on color image mosaicing techniques on FPGAPerformance analysis on color image mosaicing techniques on FPGA
Performance analysis on color image mosaicing techniques on FPGAIJECEIAES
 
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...ijceronline
 
Image Stitching Algorithm: An Optimization between Correlation-Based and Feat...
Image Stitching Algorithm: An Optimization between Correlation-Based and Feat...Image Stitching Algorithm: An Optimization between Correlation-Based and Feat...
Image Stitching Algorithm: An Optimization between Correlation-Based and Feat...IJCSIS Research Publications
 
IRJET- Saliency based Image Co-Segmentation
IRJET- Saliency based Image Co-SegmentationIRJET- Saliency based Image Co-Segmentation
IRJET- Saliency based Image Co-SegmentationIRJET Journal
 
Face Detection System on Ada boost Algorithm Using Haar Classifiers
Face Detection System on Ada boost Algorithm Using Haar ClassifiersFace Detection System on Ada boost Algorithm Using Haar Classifiers
Face Detection System on Ada boost Algorithm Using Haar ClassifiersIJMER
 
IRJET- Real Time Implementation of Bi-Histogram Equalization Method on Androi...
IRJET- Real Time Implementation of Bi-Histogram Equalization Method on Androi...IRJET- Real Time Implementation of Bi-Histogram Equalization Method on Androi...
IRJET- Real Time Implementation of Bi-Histogram Equalization Method on Androi...IRJET Journal
 
IRJET- Design and Implementation of ATM Security System using Vibration Senso...
IRJET- Design and Implementation of ATM Security System using Vibration Senso...IRJET- Design and Implementation of ATM Security System using Vibration Senso...
IRJET- Design and Implementation of ATM Security System using Vibration Senso...IRJET Journal
 
IRJET- An Approach to FPGA based Implementation of Image Mosaicing using Neur...
IRJET- An Approach to FPGA based Implementation of Image Mosaicing using Neur...IRJET- An Approach to FPGA based Implementation of Image Mosaicing using Neur...
IRJET- An Approach to FPGA based Implementation of Image Mosaicing using Neur...IRJET Journal
 
EXTENDED WAVELET TRANSFORM BASED IMAGE INPAINTING ALGORITHM FOR NATURAL SCENE...
EXTENDED WAVELET TRANSFORM BASED IMAGE INPAINTING ALGORITHM FOR NATURAL SCENE...EXTENDED WAVELET TRANSFORM BASED IMAGE INPAINTING ALGORITHM FOR NATURAL SCENE...
EXTENDED WAVELET TRANSFORM BASED IMAGE INPAINTING ALGORITHM FOR NATURAL SCENE...cscpconf
 

Similaire à study Seam Carving For Content Aware Image Resizing (20)

Medial Axis Transformation based Skeletonzation of Image Patterns using Image...
Medial Axis Transformation based Skeletonzation of Image Patterns using Image...Medial Axis Transformation based Skeletonzation of Image Patterns using Image...
Medial Axis Transformation based Skeletonzation of Image Patterns using Image...
 
Medial axis transformation based skeletonzation of image patterns using image...
Medial axis transformation based skeletonzation of image patterns using image...Medial axis transformation based skeletonzation of image patterns using image...
Medial axis transformation based skeletonzation of image patterns using image...
 
A Detailed Analysis on Feature Extraction Techniques of Panoramic Image Stitc...
A Detailed Analysis on Feature Extraction Techniques of Panoramic Image Stitc...A Detailed Analysis on Feature Extraction Techniques of Panoramic Image Stitc...
A Detailed Analysis on Feature Extraction Techniques of Panoramic Image Stitc...
 
Oc2423022305
Oc2423022305Oc2423022305
Oc2423022305
 
F0255046056
F0255046056F0255046056
F0255046056
 
Research Paper v2.0
Research Paper v2.0Research Paper v2.0
Research Paper v2.0
 
Design and Implementation of VLSI Architecture for Image Scaling Processor
Design and Implementation of VLSI Architecture for Image  Scaling ProcessorDesign and Implementation of VLSI Architecture for Image  Scaling Processor
Design and Implementation of VLSI Architecture for Image Scaling Processor
 
paper writing
paper writingpaper writing
paper writing
 
Performance analysis on color image mosaicing techniques on FPGA
Performance analysis on color image mosaicing techniques on FPGAPerformance analysis on color image mosaicing techniques on FPGA
Performance analysis on color image mosaicing techniques on FPGA
 
A10 avidan
A10 avidanA10 avidan
A10 avidan
 
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
 
Image inpainting
Image inpaintingImage inpainting
Image inpainting
 
Image Stitching Algorithm: An Optimization between Correlation-Based and Feat...
Image Stitching Algorithm: An Optimization between Correlation-Based and Feat...Image Stitching Algorithm: An Optimization between Correlation-Based and Feat...
Image Stitching Algorithm: An Optimization between Correlation-Based and Feat...
 
IRJET- Saliency based Image Co-Segmentation
IRJET- Saliency based Image Co-SegmentationIRJET- Saliency based Image Co-Segmentation
IRJET- Saliency based Image Co-Segmentation
 
Face Detection System on Ada boost Algorithm Using Haar Classifiers
Face Detection System on Ada boost Algorithm Using Haar ClassifiersFace Detection System on Ada boost Algorithm Using Haar Classifiers
Face Detection System on Ada boost Algorithm Using Haar Classifiers
 
IRJET- Real Time Implementation of Bi-Histogram Equalization Method on Androi...
IRJET- Real Time Implementation of Bi-Histogram Equalization Method on Androi...IRJET- Real Time Implementation of Bi-Histogram Equalization Method on Androi...
IRJET- Real Time Implementation of Bi-Histogram Equalization Method on Androi...
 
IRJET- Design and Implementation of ATM Security System using Vibration Senso...
IRJET- Design and Implementation of ATM Security System using Vibration Senso...IRJET- Design and Implementation of ATM Security System using Vibration Senso...
IRJET- Design and Implementation of ATM Security System using Vibration Senso...
 
Log polar coordinates
Log polar coordinatesLog polar coordinates
Log polar coordinates
 
IRJET- An Approach to FPGA based Implementation of Image Mosaicing using Neur...
IRJET- An Approach to FPGA based Implementation of Image Mosaicing using Neur...IRJET- An Approach to FPGA based Implementation of Image Mosaicing using Neur...
IRJET- An Approach to FPGA based Implementation of Image Mosaicing using Neur...
 
EXTENDED WAVELET TRANSFORM BASED IMAGE INPAINTING ALGORITHM FOR NATURAL SCENE...
EXTENDED WAVELET TRANSFORM BASED IMAGE INPAINTING ALGORITHM FOR NATURAL SCENE...EXTENDED WAVELET TRANSFORM BASED IMAGE INPAINTING ALGORITHM FOR NATURAL SCENE...
EXTENDED WAVELET TRANSFORM BASED IMAGE INPAINTING ALGORITHM FOR NATURAL SCENE...
 

Plus de Chiamin Hsu

study Domain Transform for Edge-Aware Image and Video Processing
study Domain Transform for Edge-Aware Image and Video Processingstudy Domain Transform for Edge-Aware Image and Video Processing
study Domain Transform for Edge-Aware Image and Video ProcessingChiamin Hsu
 
study Image and video abstraction by multi scale anisotropic kuwahara
study  Image and video abstraction by multi scale anisotropic kuwaharastudy  Image and video abstraction by multi scale anisotropic kuwahara
study Image and video abstraction by multi scale anisotropic kuwaharaChiamin Hsu
 
study Accelerating Spatially Varying Gaussian Filters
study Accelerating Spatially Varying Gaussian Filtersstudy Accelerating Spatially Varying Gaussian Filters
study Accelerating Spatially Varying Gaussian FiltersChiamin Hsu
 
stduy Edge-Based Image Coarsening
stduy Edge-Based Image Coarseningstduy Edge-Based Image Coarsening
stduy Edge-Based Image CoarseningChiamin Hsu
 
study Streaming Multigrid For Gradient Domain Operations On Large Images
study Streaming Multigrid For Gradient Domain Operations On Large Imagesstudy Streaming Multigrid For Gradient Domain Operations On Large Images
study Streaming Multigrid For Gradient Domain Operations On Large ImagesChiamin Hsu
 
study Shading Based Surface Editing
study Shading Based Surface Editingstudy Shading Based Surface Editing
study Shading Based Surface EditingChiamin Hsu
 
study Diffusion Curves: A Vector Representation for Smooth-Shaded Images
study Diffusion Curves: A Vector Representation for Smooth-Shaded Imagesstudy Diffusion Curves: A Vector Representation for Smooth-Shaded Images
study Diffusion Curves: A Vector Representation for Smooth-Shaded ImagesChiamin Hsu
 
study Image Vectorization using Optimized Gradeint Meshes
study Image Vectorization using Optimized Gradeint Meshesstudy Image Vectorization using Optimized Gradeint Meshes
study Image Vectorization using Optimized Gradeint MeshesChiamin Hsu
 
study Latent Doodle Space
study Latent Doodle Spacestudy Latent Doodle Space
study Latent Doodle SpaceChiamin Hsu
 
study Coded Aperture
study Coded Aperturestudy Coded Aperture
study Coded ApertureChiamin Hsu
 
study Active Refocusing Of Images And Videos
study Active Refocusing Of Images And Videosstudy Active Refocusing Of Images And Videos
study Active Refocusing Of Images And VideosChiamin Hsu
 
study Dappled Photography
study Dappled Photographystudy Dappled Photography
study Dappled PhotographyChiamin Hsu
 

Plus de Chiamin Hsu (12)

study Domain Transform for Edge-Aware Image and Video Processing
study Domain Transform for Edge-Aware Image and Video Processingstudy Domain Transform for Edge-Aware Image and Video Processing
study Domain Transform for Edge-Aware Image and Video Processing
 
study Image and video abstraction by multi scale anisotropic kuwahara
study  Image and video abstraction by multi scale anisotropic kuwaharastudy  Image and video abstraction by multi scale anisotropic kuwahara
study Image and video abstraction by multi scale anisotropic kuwahara
 
study Accelerating Spatially Varying Gaussian Filters
study Accelerating Spatially Varying Gaussian Filtersstudy Accelerating Spatially Varying Gaussian Filters
study Accelerating Spatially Varying Gaussian Filters
 
stduy Edge-Based Image Coarsening
stduy Edge-Based Image Coarseningstduy Edge-Based Image Coarsening
stduy Edge-Based Image Coarsening
 
study Streaming Multigrid For Gradient Domain Operations On Large Images
study Streaming Multigrid For Gradient Domain Operations On Large Imagesstudy Streaming Multigrid For Gradient Domain Operations On Large Images
study Streaming Multigrid For Gradient Domain Operations On Large Images
 
study Shading Based Surface Editing
study Shading Based Surface Editingstudy Shading Based Surface Editing
study Shading Based Surface Editing
 
study Diffusion Curves: A Vector Representation for Smooth-Shaded Images
study Diffusion Curves: A Vector Representation for Smooth-Shaded Imagesstudy Diffusion Curves: A Vector Representation for Smooth-Shaded Images
study Diffusion Curves: A Vector Representation for Smooth-Shaded Images
 
study Image Vectorization using Optimized Gradeint Meshes
study Image Vectorization using Optimized Gradeint Meshesstudy Image Vectorization using Optimized Gradeint Meshes
study Image Vectorization using Optimized Gradeint Meshes
 
study Latent Doodle Space
study Latent Doodle Spacestudy Latent Doodle Space
study Latent Doodle Space
 
study Coded Aperture
study Coded Aperturestudy Coded Aperture
study Coded Aperture
 
study Active Refocusing Of Images And Videos
study Active Refocusing Of Images And Videosstudy Active Refocusing Of Images And Videos
study Active Refocusing Of Images And Videos
 
study Dappled Photography
study Dappled Photographystudy Dappled Photography
study Dappled Photography
 

Dernier

Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 

Dernier (20)

Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 

study Seam Carving For Content Aware Image Resizing

  • 1. Seam Carving for Content-Aware Image Resizing Shai Aidan (Mitsubishi Electric Research Labs) Ariel Shamir (The Interdisciplinary Center & MERL) ACM SIGGRAPH 2007
  • 2. Resize Seam carving & insertion
  • 3. Abstract Seams are optimal 8-connected paths of pixels cross the image Carving out or inserting seams to achieve content-aware resizing
  • 4. Outline Introduction Background Seam-carving operator Discrete image resizing Multi-size images Limitations Conclusions and future work
  • 6. Motivation HTML can support dynamic changes of page layout and text. Why can not an image deform to fit different layout automatically ? iGoogle How about aspect ratio of an image , such as fitting photo into PDA or phone cells ? Solution ? Resize – content independent Crop – remove pixels from the image periphery only
  • 7. Basic Idea of Seam-Carving Use energy function to define the importanceof pixels Define seam-carving image operator Image reduction Carving out seams - the connected low energy pixels crossing the image Preserving the image structure Image enlarging Insert seams on low energy area The order of seam insertion ensures a balance between the original image content and the artificially inserted pixels
  • 8. Application Discrete image resizing Aspect Ration Change, Image Retarget, Image Enlarging, Content Amplification, Seam Carving in gradient domain, Object Removal Multi-size images An image can continuously change their size in a content-aware manner Storing the order of seam removal and insertion
  • 10. Image Retarget Seek to change the size of the image while maintaining the important features Face detector An automatic thumbnail creation [Suh03] ROI Fisheye-View warp [Liu and Gleicher 05, 06] Visual saliency [] [Suh 03] [Selur 04, decompose image to foreground obj and background origin
  • 11. Saliency map [Itti IEEE99] Simulate neuroscience of human visual system Pyramid tech. to compute 3 feature maps, color, intensity and orientation [Suh 03], an automatic thumbnail creation, based on either a saliency map or the output of a face detector [Chen 03], adapting most important region of images to mobile devices.
  • 12. [Liu 03], suggesting to trade time for space. Given a collection of regions of interest, they construct an optimal path through these regions and display them serially.
  • 13. [Santella et al. 06] use eye tracking, in addition to composition rules to crop images intelligently.
  • 14. ROI (Region-Of-Interest) Such a method was proposed by [Liu and Gleicher 05, 06] for image and video retargeting. For image retargeting they find ROI and construct a novel Fisheye-View warp that essentially applies a piecewise linear scaling function in each dimension to the image. This way the ROI is maintained while the rest of the image is warped. The retargeting can be done in interactive rates, once the ROI is found, so the user can control the desired size of the image by moving a slider. In their video retargeting work they use a combination of image and saliency maps to find the ROI. Then they use a combination of cropping, virtual pan and shot cuts to retarget the video frames.
  • 15. Feature-aware warping The first solution to the general problem of warping an image into an arbitrary shape while preserving user-specified features was recently proposed by [Gal et al. 06]. The feature-aware warping is achieved by a particular formulation of the Laplacian editing technique, suited to accommodate similarity constraints on parts of the domain. Since local constraints are propagated by the global optimization process, not all the constraints can always be satisfied at once
  • 16. Seam Perfect seams to combine parts of a set of photo into a single composite picture [Agarwala et al. 04] Drag-and-Drop Pasting that extends the Poisson Image Editing to computer an optimal boundary (seam) between the source picture and target images [Jia et al. 06] AutoCollage, a program that automatically creates a collage image from a collection of images. [Rother et al. 06] Simultaneously solve matting and compositing. They allow the user to scale the size of the foreground object and paste it back on the original background. [Wang , Cohen 06] evaluated several cost functions for seamless image stitching and concluded that minimizing an L1error norm between the gradients of the stitched image and the gradients of the input images performed well in general [Zomet et al. 05]
  • 17. Sear Optimal Seam Dijkstra’s shortest path algorithm [98] Dynamic programming [Efros 01] Graph cuts [Kwatra 03]
  • 19. Strategies of Image Reduction Original e1 energy Optimal global remove the lowest energy pixels Pixel remove the least energy in each row
  • 20. Strategies of Image Reduction Original e1 energy Column removing columns with minimal energy Crop find a sub-win with the highest energy
  • 24. Optimal Seam Search S Dynamic Programming G
  • 26. Image Energy Preservation The average energy of all pixels during resizing
  • 27. Energy Functions L1 and L2-norm of the gradient, saliency measure [Itti 99]
  • 28.
  • 29. Energy Functions Histogram of Gradient (HoG) [Dalal and Triggs 95] max(HoG(I(x,y)) makes sure the seams run parallel to the edge of objects and not cross them
  • 30. Energy Functions Entropy Compute the entropy over a 9 x 9 window and add it to e1 eEntropy(x,y) = + e1 (x,y)
  • 31. Energy Functions Segmentation and L1 Image segmentation [Christoudias 02] Apply e1 on the results
  • 32. No single e function performs well across all images Similar range for resizing e1 or eHoG works well
  • 33. Discrete image resizing Aspect Ratio Change, Retargeting with Optimal Seams-Order, Image Enlarging, Content Amplification,
  • 34. Aspect Ratio Change Carving-out /insert seams Original Original Original 1D aspect ratio changing
  • 35. 2D aspect ratio changing Optimal Seams-Order Search + DynamicProgramming = + min
  • 36. Retargeting with Optimal Seams-Order h-first alternate v-first Transport map Original optimal
  • 37. Image Enlarging Find first k seams for removal Duplicate them in order to arrive at I(-k) origin I(t): smaller image after t seam-carving I(-k): enlarged image after k seam insertion t I(-1) I(-k) I(t) I(-k) enlarged image insert seams in order of removal
  • 38. Image Enlarging (>50%) origin Break into several steps Each step does not enlarge the size of image more than a fraction
  • 40. Seam Carving in the Gradient Domain Seam + Poisson Reconstruction [Perez 03] Compute e function Work on the gradient domain Remove seams from the x and y derivatives of the original image Use Poisson Reconstruction original retarget retarget in Gradient Domain
  • 41. Object Removal Mark the removing target Remove seams until all the marked pixels are gone * Employ seam insertion to maintain the original size
  • 43. Multi-size images Store the pre-computed representation that encodes, for each pixel in V/H map The index of the seam that removed it The negative index of the seam that inserted it Blue (first seam)  Red (last seam) origin V(i,j)=t : pixel (i,j) removed by t-th vertical seam H(i,j)=t : pixel (i,j) removed by t-th horizontal seam
  • 44. Limitations Seam-Carving does not work well on all images Ex: face Origin Crop Scale Constraint the face Face the flower Bottom up feature detection
  • 45. Limitations The amount of content Too density, no “less” important area The layout of the image content origin origin
  • 46. Conclusions Present a content-aware resizing using the seam-carving image operator Seams are the optimal paths on a single image Carve-out seams Insert seams Application of seam-carving operator Aspect ratio change, image retargeting, content amplification, object removal Multi-size images that support continuous resizingin real-time
  • 47. Future Work Video resizing Combination of scaling and seam-carving Define more robust multi-size image Better solution to combine horizontal and vertical seams in multi-size image
  • 48. END

Notes de l'éditeur

  1. Shai Aidan 以色列人MERL(Mitsubishi Electric Research Labs)沒想到以色列人投效到日本研究中心