SlideShare une entreprise Scribd logo
1  sur  8
Télécharger pour lire hors ligne
Jan Zizka (Eds) : CCSIT, SIPP, AISC, PDCTA - 2013
pp. 249–256, 2013. © CS & IT-CSCP 2013 DOI : 10.5121/csit.2013.3628
MULTIPLE REGION OF INTEREST
TRACKING OF NON-RIGID OBJECTS
USING DEMON'S ALGORITHM
Rohan Pillai1
, Abhishikta Yalavali2
Saima Mohan1
, and Amol Patil1
1
iGate, Tower 3, Magarpatta City, Pune, India,
{rohan.pillai,saima.mohan,amol.vpatil}@igate.com,
2
abhishikthaharsha@gmail.com
ABSTRACT
In this paper we propose an algorithm for tracking multiple ROI (region of interest) undergoing
non-rigid transformations. Demon's algorithm based on the idea of Maxwell's demon, has been
applied here to estimate the displacement field for tracking of multiple ROI. This algorithm
works on pixel intensities of the sequence of images thus making it suitable for tracking
objects/regions undergoing non-rigid transformations. We have incorporated a pyramid-based
approach for demon's algorithm computations of displacement field, which leads to significant
reduction in the convergence speed and improvement in the accuracy. This algorithm is applied
for tracking non-rigid objects in laproscopy videos which would aid surgeons in Minimal
Invasive Surgery (MIS).
KEYWORDS
Computer vision, Maxwell's demon, multi-resolution pyramid, non-rigid deformation, optical
flow, ROI tracking
1. INTRODUCTION
Tracking a region of interest is well-known in image processing community. In its long history, a
number of algorithm have been designed that address this problem quite successfully1
. In some
real world situations, objects tend undergo non-rigid transformation and are thus difficult to
track.These scenarios especially are prominent in medical applications where usually the region
under consideration undergoes non rigid transformation mainly due to camera movement. In this
paper we address the problem of tracking multiple region of interest (ROI) whose geometric
appearance are changing gradually over time. We demonstrate the application of this algorithm
for tracking multiple ROI's in a video recorded during endoscopy.
In our previous paper6
, we have successfully demonstrated the use of demon based technique for
tracking an ROI in video sequence. In our current paper, we refine our earlier work by
considering the entire image and using a multi-resolution approach for calculating the
displacement field. This introduction of multi-resolution approach has resulted in improved
250 Computer Science & Information Technology (CS & IT)
computational efficiency and accuracy of the tracking algorithm which will be discussed in detail
in subsequent sections of this paper. The overview of demons algorithm along with its use in
tracking is presented in section 2. The multi-resolution approach is discussed in section 3.
Discussions, results and conclusion appear in section 4, 5 and 6 respectively.
2. DEMON BASED TRACKING
Thirion7
introduced the demons forces for registration of images acquired from different
modalities. The concept behind his experiment was to deform one image to match another. The
work involved calculation of displacement of a physical point present in two images with the
support of gradient based optical flow computation. The expression for displacement field of two
images is as given in equation (1)
Here is the displacement along x and y direction of a point, p, in the
static image S with reference to a movable image M. is the gradient at point p on the
image S. If is the image to be deformed then movable image M is constructed using a
deformation operator as given in equation (2) and (3).
Thirion had re-normalized the optical flow equation by introducing the intensity difference
between the images in the denominator. Due to this, the displacement of each point has to be
calculated iteratively until the squared error is minimized. The objective of image registration
technique is to solve the minimization problem, to find the displacement field equation (??),
where, d is the dissimilarity metric to measure the similarity between two images.
Furthermore, to improve convergence speed and stability of the registration process, He Wang et
al.8
introduced the forces of the movable image.
Here, α is a positive homogenization factor introduced by Cachier et al.4
to bound the
displacement field.
Computer Science & Information Technology (CS & IT) 251
Tracking objects in a video can be accomplished by calculating the interframe displacement of
the objects. Assuming that the scene being recorded is not undergoing any rapid movement or
change, equation (5) gives us the displacement of points in one image with reference to the next
image. The displacement field is Gaussian smoothed for regularization2
. This displacement field
is used to track a Region of Interest (ROI) in a video.
The metric function, d, used in this paper is Sum of Squared Difference (SSD) of the images.
3. PYRAMIDAL IMPLEMENTATION
Bouguet3
demonstrated the use of pyramidal implementation in computing the optical flow
algorithm based on Lucas Kanade method. The pyramid representation used in this paper is
similar to that proposed by Bouguet. The method specified by Burt and Adelson9
is used to
construct recursively the pyramidal representation of the images It-1 and It as: {It-1}m=0,1,…L
where
m is the pyramid Level. The value L is height of the pyramid. This value can be heuristically
programmed by incorporating an estimation technique. Depending upon the expected inter frame
displacement, L can take values 2, 3 or 4. In the image pyramid, image at mth
level is at the
resolution of the (m-1)th
level. The algorithm starts with the Lth
level images and the initial
displacement field initialized to a zero vector.
Fig. 1. Illustration showing hierarchy of pyramid structure of an image
The displacement field is iteratively computed for the images at this level until the termination
criteria is satisfied. The termination criteria is defined by the iteration count or the rate of change
of error. The estimated displacement field at the Lth
level is then rescaled to the images at the
( L - 1 )th
level in the pyramid and used as the initial displacement field for estimation. The
estimation requires that the displacement field be updated with values computed as per equation
(5) so as to decrease the error. This process is repeated until the displacement field for the 0th
level (original) images in the pyramid is obtained.
Figure 2 summarizes the Pyramidal Implementation of Demon's based algorithm as a flowchart.
252 Computer Science & Information Technology (CS & IT)
Fig. 2. Sequential execution of Pyramidal Implementation of Demon's based algorithm
4. DISCUSSION
The ROI marked by a surgeon corresponds to an array of underlying pixels in an image. The
displacement field calculated by the above algorithm represents the distance of a physical point in
a frame being shifted with reference to the corresponding point in the previous frame. This
displacement field can be applied to one or more ROI's which will propagate over the temporal
sequence without any manual intervention. Using this principle multiple ROI's in the image can
be tracked.
The total number of computations performed on an nx x ny image would be Kmnxny, where m
computations are performed on each pixel of an image per iteration and the image converges after
K iterations. Thus the computation time needed is directly proportional to the size of the image.
Computer Science & Information Technology (CS & IT) 253
With the introduction of pyramidal approach it is observed that the total computation time has
reduced significantly. The displacement field calculated at lower resolution images is refined as
we work towards the higher resolution images. Hence, resulting in fewer iterations at subsequent
levels of the pyramid to converge. The computational advantage gained is illustrated in the
performance chart shown in figure 3(a).
For successive iteration the model image has a higher resemblance to the static image. Thus the
error metric, SSD, that measures the dissimilarity between the two images is expected to decrease
with each iteration. However, the SSD metric is image resolution dependent hence can not be
used while comparing convergence between different pyramid levels. To establish a benchmark
when comparing the displacement field produced at each level in the pyramid, we rescale the
displacement field at each iteration to full image size and then use it to compute error with respect
to the original image. Figure 3(b) shows the graph of error convergence with respect to iterations
for single resolution and pyramidal approach. This confirms that the displacement field calculated
at every level in the pyramid facilitates the overall convergence. The two results can be compared
as they are operating on the same base images. In Figure 3(b) a sharp drop in error is observed at
iteration number 5 and 18 which is caused by transition from one level in the pyramid to another.
An Interesting observation about the type of convergence can be noted in figure 3 when plotted
with respect to iteration count, the convergence graph looks like a series of decaying exponential.
However, when the convergence graph is plotted against time, it looks more like a fast decaying
exponential. Pyramidal implementation provides the freedom of making certain parameters like,
smoothing kernel, termination criteria and the positive homogenization factor level dependent.
The performance of the system can be further improved by selecting suitable values for these
parameters.
Pyramidal implementation provides the freedom of making certain parameters like, smoothing
kernel, termination criteria and the positive homogenization factor level dependent. The
performance of the system can be further improved by selecting suitable values for these
parameters.
(a)
254 Computer Science & Information Technology (CS & IT)
(b)
Fig. 3. Performance graph. (a) Plot depicting the time taken by single resolution (dotted line) and the
pyramidal implementation (dashed line) to converge. (b) Graph of error calculated after every iteration for
pyramidal implementation
5. RESULTS
The performance of the algorithm was tested on a stream of synthesized images involving the
motion of two blobs in arbitrary directions. For the numerical experiment, synthetic images of
size 512x512 are generated with objects of interest imposed on a noisy background. To construct
a noise model in our synthetic image, we used Gaussian noise (having zero mean and 0.005
variance) along with salt and pepper noise (0.005 probability) generated by MATLAB. For the
first experiment figure 4, the objects considered are rigid bodies moving in different directions.
Figure 4(a) indicates the initial state of objects with green marker enclosing them and figure 4(b)
illustrates the process of tracking for an intermediate frame in the simulated video sequence.
Fig. 4. Comparison of single resolution approach (green marker) and multi-resolution approach (red
marker) for tracking multiple objects over time. (a) Second frame in the video (b) Result of tracking ROI's
after 42th
frame.
In medical imaging, regions of interest being filmed tend to undergo affine transformation due to
continuous change in the camera position. The applicability of the discussed algorithm to track
such non rigid objects is demonstrated in figure 5. It can be observed that the red line continues to
Computer Science & Information Technology (CS & IT) 255
capture the complete object including the gradual change in appearance of the object. The array
corresponding to the ROI to accommodate for non-rigid transformation. As the size of the
deformable region changes over time the length of the marker array should be made adaptive to
accommodate this.
Figure 6 shows the results for an endoscope video with a resolution of 640x480 as input. The
algorithm works on the green channel of the video. The ROI's shown in blue are marked by the
end user for tracking. The region undergoes non-rigid deformation over time which is tracked by
the algorithm.
Fig. 5. Illustration of experimental setup for tracking an object deforming over time. (a) First frame in the
video where red marker indicates the ROI. (b) After 39th
frame, the red marker continues to enclose the
region which has undergone deformation
Fig. 6. Results of tracking multiple user-defined ROI on the endoscopy stream. The blue marker indicates
the output of the multi-resolution approach and the green marker depicts the single-resolution approach. (a)
Initial frame in the sequence (b) Result of tracking on the 54th
frame in the sequence.
6. CONCLUSIONS & FUTURE WORK
In this paper, we demonstrated the use of pyramidal implementation for estimating the
displacement field based on Thirion's demon. The algorithm successfully tracks multiple ROI's
that undergo non-rigid transformation. Based on the experiments carried out it clearly indicates
the advantage of pyramidal approach in terms of improvement in the convergence speed.
Moreover, the result obtained using pyramidal approach generated robust and artifact free results
as compared to the single resolution implementation.
256 Computer Science & Information Technology (CS & IT)
Though the current approach is well suited for tracking multiple ROI's it is limited to cases of
occlusion.
Future work will include parallelism of the current implementation using GPU to achieve even
faster convergence. Apart from vision based assistance to MIS, the demon based tracking
approach can also be applied for pathological cell detection and tracking. A classical clinical
condition where cells deform rapidly over time and requires to be tracked for continuous analysis.
REFERENCES
[1] S.S. Beauchemin and J.L. Barron. The computation of optical flow. ACM Computing Surveys, 1995.
[2] P. Anandan. A computational framework and an algorithm for the measurement of visual. Technical
report, University of Massachusetts Amherst, 1987.
[3] Jean-Yves Bouguet. Pyramidal implementation of the lucas kanade feature tracker description of the
algorithm. Technical report, Intel Corporation Microprocessor Research Labs, 2000.
[4] Pascal Cachier, Xavier Pennec, and Nicholas Ayache. Fast non rigid matching by gradient descent:
Study and improvements of the "demons" algo- rithm. Technical report, INSTITUT NATIONAL DE
RECHERCHE EN INFORMATIQUE ET EN AUTOMATIQUE, June 1999.
[5] Visvanathan Ramesh, Dorin Comaniciu and Peter Meer. Kernel-based object tracking. IEEE
TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, VOL. 25, NO.
5, 2003.
[6] Abhinav Kumar, B Madhusudan Rao, Rajesh Ghole, Amol Patil, and Nilesh Ghatpande. Demons
based tracking for non-rigid transformed region of interest. In TENCON 2011 - 2011 IEEE Region 10
Conference, IEEEXplore, November 2011.
[7] J.-P. Thirion. Non-rigid matching using demons. In Proc. IEEE Computer Society Conf. Computer
Vision and Pattern Recognition CVPR '96, 1996.
[8] He Wang, Lei Dong, Jennifer O'Daniel, Radhe Mohan, Adam S Garden, K Kian Ang, Deborah A
Kuban, Mark Bonnen, Joe Y Chang, and Rex Cheung. Validation of an accelerated 'demons'
algorithm for deformable image registration in radiation therapy. Physics in Medicine and Biology,
50(12), 2005.
[9] Peter J. Burt and Edward H. Adelson. The laplacian pyramid as a compact image code. IEEE
Transactions on Communications, April 1983.

Contenu connexe

Tendances

Advanced Image Reconstruction Algorithms in MRIfor ISMRMversion finalll
Advanced Image Reconstruction Algorithms in MRIfor ISMRMversion finalllAdvanced Image Reconstruction Algorithms in MRIfor ISMRMversion finalll
Advanced Image Reconstruction Algorithms in MRIfor ISMRMversion finalll
Muddassar Abbasi
 
Edge Representation Learning with Hypergraphs
Edge Representation Learning with HypergraphsEdge Representation Learning with Hypergraphs
Edge Representation Learning with Hypergraphs
MLAI2
 
Iaetsd a modified image fusion approach using guided filter
Iaetsd a modified image fusion approach using guided filterIaetsd a modified image fusion approach using guided filter
Iaetsd a modified image fusion approach using guided filter
Iaetsd Iaetsd
 
Iaetsd an enhanced circular detection technique rpsw using circular hough t...
Iaetsd an enhanced circular detection technique   rpsw using circular hough t...Iaetsd an enhanced circular detection technique   rpsw using circular hough t...
Iaetsd an enhanced circular detection technique rpsw using circular hough t...
Iaetsd Iaetsd
 

Tendances (19)

3D reconstruction
3D reconstruction3D reconstruction
3D reconstruction
 
06466595
0646659506466595
06466595
 
A Closed-form Solution to Photorealistic Image Stylization
A Closed-form Solution to Photorealistic Image StylizationA Closed-form Solution to Photorealistic Image Stylization
A Closed-form Solution to Photorealistic Image Stylization
 
G143741
G143741G143741
G143741
 
Efficient 3D stereo vision stabilization for multi-camera viewpoints
Efficient 3D stereo vision stabilization for multi-camera viewpointsEfficient 3D stereo vision stabilization for multi-camera viewpoints
Efficient 3D stereo vision stabilization for multi-camera viewpoints
 
Block Image Encryption using Wavelet
Block Image Encryption using WaveletBlock Image Encryption using Wavelet
Block Image Encryption using Wavelet
 
Advanced Image Reconstruction Algorithms in MRIfor ISMRMversion finalll
Advanced Image Reconstruction Algorithms in MRIfor ISMRMversion finalllAdvanced Image Reconstruction Algorithms in MRIfor ISMRMversion finalll
Advanced Image Reconstruction Algorithms in MRIfor ISMRMversion finalll
 
Vf sift
Vf siftVf sift
Vf sift
 
Edge Representation Learning with Hypergraphs
Edge Representation Learning with HypergraphsEdge Representation Learning with Hypergraphs
Edge Representation Learning with Hypergraphs
 
Kv3419501953
Kv3419501953Kv3419501953
Kv3419501953
 
Image Registration Methode in Radar Interferometry
Image Registration Methode in Radar InterferometryImage Registration Methode in Radar Interferometry
Image Registration Methode in Radar Interferometry
 
Iaetsd a modified image fusion approach using guided filter
Iaetsd a modified image fusion approach using guided filterIaetsd a modified image fusion approach using guided filter
Iaetsd a modified image fusion approach using guided filter
 
Iaetsd an enhanced circular detection technique rpsw using circular hough t...
Iaetsd an enhanced circular detection technique   rpsw using circular hough t...Iaetsd an enhanced circular detection technique   rpsw using circular hough t...
Iaetsd an enhanced circular detection technique rpsw using circular hough t...
 
Parallel implementation of geodesic distance transform with application in su...
Parallel implementation of geodesic distance transform with application in su...Parallel implementation of geodesic distance transform with application in su...
Parallel implementation of geodesic distance transform with application in su...
 
Fpga implementation of fusion technique for fingerprint application
Fpga implementation of fusion technique for fingerprint applicationFpga implementation of fusion technique for fingerprint application
Fpga implementation of fusion technique for fingerprint application
 
V.KARTHIKEYAN PUBLISHED ARTICLE 1
V.KARTHIKEYAN PUBLISHED ARTICLE 1V.KARTHIKEYAN PUBLISHED ARTICLE 1
V.KARTHIKEYAN PUBLISHED ARTICLE 1
 
Structure from motion
Structure from motionStructure from motion
Structure from motion
 
A Novel Algorithm for Watermarking and Image Encryption
A Novel Algorithm for Watermarking and Image Encryption A Novel Algorithm for Watermarking and Image Encryption
A Novel Algorithm for Watermarking and Image Encryption
 
COMPLEMENTARY VISION BASED DATA FUSION FOR ROBUST POSITIONING AND DIRECTED FL...
COMPLEMENTARY VISION BASED DATA FUSION FOR ROBUST POSITIONING AND DIRECTED FL...COMPLEMENTARY VISION BASED DATA FUSION FOR ROBUST POSITIONING AND DIRECTED FL...
COMPLEMENTARY VISION BASED DATA FUSION FOR ROBUST POSITIONING AND DIRECTED FL...
 

Similaire à MULTIPLE REGION OF INTEREST TRACKING OF NON-RIGID OBJECTS USING DEMON'S ALGORITHM

An automatic algorithm for object recognition and detection based on asift ke...
An automatic algorithm for object recognition and detection based on asift ke...An automatic algorithm for object recognition and detection based on asift ke...
An automatic algorithm for object recognition and detection based on asift ke...
Kunal Kishor Nirala
 
Research Paper v2.0
Research Paper v2.0Research Paper v2.0
Research Paper v2.0
Kapil Tiwari
 
Stereo Correspondence Algorithms for Robotic Applications Under Ideal And Non...
Stereo Correspondence Algorithms for Robotic Applications Under Ideal And Non...Stereo Correspondence Algorithms for Robotic Applications Under Ideal And Non...
Stereo Correspondence Algorithms for Robotic Applications Under Ideal And Non...
CSCJournals
 

Similaire à MULTIPLE REGION OF INTEREST TRACKING OF NON-RIGID OBJECTS USING DEMON'S ALGORITHM (20)

An automatic algorithm for object recognition and detection based on asift ke...
An automatic algorithm for object recognition and detection based on asift ke...An automatic algorithm for object recognition and detection based on asift ke...
An automatic algorithm for object recognition and detection based on asift ke...
 
557 480-486
557 480-486557 480-486
557 480-486
 
Dense Visual Odometry Using Genetic Algorithm
Dense Visual Odometry Using Genetic AlgorithmDense Visual Odometry Using Genetic Algorithm
Dense Visual Odometry Using Genetic Algorithm
 
Research Paper v2.0
Research Paper v2.0Research Paper v2.0
Research Paper v2.0
 
International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)
 
Wavelet-Based Warping Technique for Mobile Devices
Wavelet-Based Warping Technique for Mobile DevicesWavelet-Based Warping Technique for Mobile Devices
Wavelet-Based Warping Technique for Mobile Devices
 
Method for a Simple Encryption of Images Based on the Chaotic Map of Bernoulli
Method for a Simple Encryption of Images Based on the Chaotic Map of BernoulliMethod for a Simple Encryption of Images Based on the Chaotic Map of Bernoulli
Method for a Simple Encryption of Images Based on the Chaotic Map of Bernoulli
 
METHOD FOR A SIMPLE ENCRYPTION OF IMAGES BASED ON THE CHAOTIC MAP OF BERNOULLI
METHOD FOR A SIMPLE ENCRYPTION OF IMAGES BASED ON THE CHAOTIC MAP OF BERNOULLIMETHOD FOR A SIMPLE ENCRYPTION OF IMAGES BASED ON THE CHAOTIC MAP OF BERNOULLI
METHOD FOR A SIMPLE ENCRYPTION OF IMAGES BASED ON THE CHAOTIC MAP OF BERNOULLI
 
METHOD FOR A SIMPLE ENCRYPTION OF IMAGES BASED ON THE CHAOTIC MAP OF BERNOULLI
METHOD FOR A SIMPLE ENCRYPTION OF IMAGES BASED ON THE CHAOTIC MAP OF BERNOULLIMETHOD FOR A SIMPLE ENCRYPTION OF IMAGES BASED ON THE CHAOTIC MAP OF BERNOULLI
METHOD FOR A SIMPLE ENCRYPTION OF IMAGES BASED ON THE CHAOTIC MAP OF BERNOULLI
 
Fpga implementation of fusion technique for fingerprint application
Fpga implementation of fusion technique for fingerprint applicationFpga implementation of fusion technique for fingerprint application
Fpga implementation of fusion technique for fingerprint application
 
Ay33292297
Ay33292297Ay33292297
Ay33292297
 
Ay33292297
Ay33292297Ay33292297
Ay33292297
 
FORGERY (COPY-MOVE) DETECTION IN DIGITAL IMAGES USING BLOCK METHOD
FORGERY (COPY-MOVE) DETECTION IN DIGITAL IMAGES USING BLOCK METHODFORGERY (COPY-MOVE) DETECTION IN DIGITAL IMAGES USING BLOCK METHOD
FORGERY (COPY-MOVE) DETECTION IN DIGITAL IMAGES USING BLOCK METHOD
 
J017377578
J017377578J017377578
J017377578
 
Real-time Moving Object Detection using SURF
Real-time Moving Object Detection using SURFReal-time Moving Object Detection using SURF
Real-time Moving Object Detection using SURF
 
Stereo Correspondence Algorithms for Robotic Applications Under Ideal And Non...
Stereo Correspondence Algorithms for Robotic Applications Under Ideal And Non...Stereo Correspondence Algorithms for Robotic Applications Under Ideal And Non...
Stereo Correspondence Algorithms for Robotic Applications Under Ideal And Non...
 
Effective Pixel Interpolation for Image Super Resolution
Effective Pixel Interpolation for Image Super ResolutionEffective Pixel Interpolation for Image Super Resolution
Effective Pixel Interpolation for Image Super Resolution
 
Effective Pixel Interpolation for Image Super Resolution
Effective Pixel Interpolation for Image Super ResolutionEffective Pixel Interpolation for Image Super Resolution
Effective Pixel Interpolation for Image Super Resolution
 
Comparison of various Image Registration Techniques with the Proposed Hybrid ...
Comparison of various Image Registration Techniques with the Proposed Hybrid ...Comparison of various Image Registration Techniques with the Proposed Hybrid ...
Comparison of various Image Registration Techniques with the Proposed Hybrid ...
 
Based on correlation coefficient in image matching
Based on correlation coefficient in image matchingBased on correlation coefficient in image matching
Based on correlation coefficient in image matching
 

Plus de cscpconf

MOVING FROM WATERFALL TO AGILE PROCESS IN SOFTWARE ENGINEERING CAPSTONE PROJE...
MOVING FROM WATERFALL TO AGILE PROCESS IN SOFTWARE ENGINEERING CAPSTONE PROJE...MOVING FROM WATERFALL TO AGILE PROCESS IN SOFTWARE ENGINEERING CAPSTONE PROJE...
MOVING FROM WATERFALL TO AGILE PROCESS IN SOFTWARE ENGINEERING CAPSTONE PROJE...
cscpconf
 

Plus de cscpconf (20)

ANALYSIS OF LAND SURFACE DEFORMATION GRADIENT BY DINSAR
ANALYSIS OF LAND SURFACE DEFORMATION GRADIENT BY DINSAR ANALYSIS OF LAND SURFACE DEFORMATION GRADIENT BY DINSAR
ANALYSIS OF LAND SURFACE DEFORMATION GRADIENT BY DINSAR
 
4D AUTOMATIC LIP-READING FOR SPEAKER'S FACE IDENTIFCATION
4D AUTOMATIC LIP-READING FOR SPEAKER'S FACE IDENTIFCATION4D AUTOMATIC LIP-READING FOR SPEAKER'S FACE IDENTIFCATION
4D AUTOMATIC LIP-READING FOR SPEAKER'S FACE IDENTIFCATION
 
MOVING FROM WATERFALL TO AGILE PROCESS IN SOFTWARE ENGINEERING CAPSTONE PROJE...
MOVING FROM WATERFALL TO AGILE PROCESS IN SOFTWARE ENGINEERING CAPSTONE PROJE...MOVING FROM WATERFALL TO AGILE PROCESS IN SOFTWARE ENGINEERING CAPSTONE PROJE...
MOVING FROM WATERFALL TO AGILE PROCESS IN SOFTWARE ENGINEERING CAPSTONE PROJE...
 
PROMOTING STUDENT ENGAGEMENT USING SOCIAL MEDIA TECHNOLOGIES
PROMOTING STUDENT ENGAGEMENT USING SOCIAL MEDIA TECHNOLOGIESPROMOTING STUDENT ENGAGEMENT USING SOCIAL MEDIA TECHNOLOGIES
PROMOTING STUDENT ENGAGEMENT USING SOCIAL MEDIA TECHNOLOGIES
 
A SURVEY ON QUESTION ANSWERING SYSTEMS: THE ADVANCES OF FUZZY LOGIC
A SURVEY ON QUESTION ANSWERING SYSTEMS: THE ADVANCES OF FUZZY LOGICA SURVEY ON QUESTION ANSWERING SYSTEMS: THE ADVANCES OF FUZZY LOGIC
A SURVEY ON QUESTION ANSWERING SYSTEMS: THE ADVANCES OF FUZZY LOGIC
 
DYNAMIC PHONE WARPING – A METHOD TO MEASURE THE DISTANCE BETWEEN PRONUNCIATIONS
DYNAMIC PHONE WARPING – A METHOD TO MEASURE THE DISTANCE BETWEEN PRONUNCIATIONS DYNAMIC PHONE WARPING – A METHOD TO MEASURE THE DISTANCE BETWEEN PRONUNCIATIONS
DYNAMIC PHONE WARPING – A METHOD TO MEASURE THE DISTANCE BETWEEN PRONUNCIATIONS
 
INTELLIGENT ELECTRONIC ASSESSMENT FOR SUBJECTIVE EXAMS
INTELLIGENT ELECTRONIC ASSESSMENT FOR SUBJECTIVE EXAMS INTELLIGENT ELECTRONIC ASSESSMENT FOR SUBJECTIVE EXAMS
INTELLIGENT ELECTRONIC ASSESSMENT FOR SUBJECTIVE EXAMS
 
TWO DISCRETE BINARY VERSIONS OF AFRICAN BUFFALO OPTIMIZATION METAHEURISTIC
TWO DISCRETE BINARY VERSIONS OF AFRICAN BUFFALO OPTIMIZATION METAHEURISTICTWO DISCRETE BINARY VERSIONS OF AFRICAN BUFFALO OPTIMIZATION METAHEURISTIC
TWO DISCRETE BINARY VERSIONS OF AFRICAN BUFFALO OPTIMIZATION METAHEURISTIC
 
DETECTION OF ALGORITHMICALLY GENERATED MALICIOUS DOMAIN
DETECTION OF ALGORITHMICALLY GENERATED MALICIOUS DOMAINDETECTION OF ALGORITHMICALLY GENERATED MALICIOUS DOMAIN
DETECTION OF ALGORITHMICALLY GENERATED MALICIOUS DOMAIN
 
GLOBAL MUSIC ASSET ASSURANCE DIGITAL CURRENCY: A DRM SOLUTION FOR STREAMING C...
GLOBAL MUSIC ASSET ASSURANCE DIGITAL CURRENCY: A DRM SOLUTION FOR STREAMING C...GLOBAL MUSIC ASSET ASSURANCE DIGITAL CURRENCY: A DRM SOLUTION FOR STREAMING C...
GLOBAL MUSIC ASSET ASSURANCE DIGITAL CURRENCY: A DRM SOLUTION FOR STREAMING C...
 
IMPORTANCE OF VERB SUFFIX MAPPING IN DISCOURSE TRANSLATION SYSTEM
IMPORTANCE OF VERB SUFFIX MAPPING IN DISCOURSE TRANSLATION SYSTEMIMPORTANCE OF VERB SUFFIX MAPPING IN DISCOURSE TRANSLATION SYSTEM
IMPORTANCE OF VERB SUFFIX MAPPING IN DISCOURSE TRANSLATION SYSTEM
 
EXACT SOLUTIONS OF A FAMILY OF HIGHER-DIMENSIONAL SPACE-TIME FRACTIONAL KDV-T...
EXACT SOLUTIONS OF A FAMILY OF HIGHER-DIMENSIONAL SPACE-TIME FRACTIONAL KDV-T...EXACT SOLUTIONS OF A FAMILY OF HIGHER-DIMENSIONAL SPACE-TIME FRACTIONAL KDV-T...
EXACT SOLUTIONS OF A FAMILY OF HIGHER-DIMENSIONAL SPACE-TIME FRACTIONAL KDV-T...
 
AUTOMATED PENETRATION TESTING: AN OVERVIEW
AUTOMATED PENETRATION TESTING: AN OVERVIEWAUTOMATED PENETRATION TESTING: AN OVERVIEW
AUTOMATED PENETRATION TESTING: AN OVERVIEW
 
CLASSIFICATION OF ALZHEIMER USING fMRI DATA AND BRAIN NETWORK
CLASSIFICATION OF ALZHEIMER USING fMRI DATA AND BRAIN NETWORKCLASSIFICATION OF ALZHEIMER USING fMRI DATA AND BRAIN NETWORK
CLASSIFICATION OF ALZHEIMER USING fMRI DATA AND BRAIN NETWORK
 
VALIDATION METHOD OF FUZZY ASSOCIATION RULES BASED ON FUZZY FORMAL CONCEPT AN...
VALIDATION METHOD OF FUZZY ASSOCIATION RULES BASED ON FUZZY FORMAL CONCEPT AN...VALIDATION METHOD OF FUZZY ASSOCIATION RULES BASED ON FUZZY FORMAL CONCEPT AN...
VALIDATION METHOD OF FUZZY ASSOCIATION RULES BASED ON FUZZY FORMAL CONCEPT AN...
 
PROBABILITY BASED CLUSTER EXPANSION OVERSAMPLING TECHNIQUE FOR IMBALANCED DATA
PROBABILITY BASED CLUSTER EXPANSION OVERSAMPLING TECHNIQUE FOR IMBALANCED DATAPROBABILITY BASED CLUSTER EXPANSION OVERSAMPLING TECHNIQUE FOR IMBALANCED DATA
PROBABILITY BASED CLUSTER EXPANSION OVERSAMPLING TECHNIQUE FOR IMBALANCED DATA
 
CHARACTER AND IMAGE RECOGNITION FOR DATA CATALOGING IN ECOLOGICAL RESEARCH
CHARACTER AND IMAGE RECOGNITION FOR DATA CATALOGING IN ECOLOGICAL RESEARCHCHARACTER AND IMAGE RECOGNITION FOR DATA CATALOGING IN ECOLOGICAL RESEARCH
CHARACTER AND IMAGE RECOGNITION FOR DATA CATALOGING IN ECOLOGICAL RESEARCH
 
SOCIAL MEDIA ANALYTICS FOR SENTIMENT ANALYSIS AND EVENT DETECTION IN SMART CI...
SOCIAL MEDIA ANALYTICS FOR SENTIMENT ANALYSIS AND EVENT DETECTION IN SMART CI...SOCIAL MEDIA ANALYTICS FOR SENTIMENT ANALYSIS AND EVENT DETECTION IN SMART CI...
SOCIAL MEDIA ANALYTICS FOR SENTIMENT ANALYSIS AND EVENT DETECTION IN SMART CI...
 
SOCIAL NETWORK HATE SPEECH DETECTION FOR AMHARIC LANGUAGE
SOCIAL NETWORK HATE SPEECH DETECTION FOR AMHARIC LANGUAGESOCIAL NETWORK HATE SPEECH DETECTION FOR AMHARIC LANGUAGE
SOCIAL NETWORK HATE SPEECH DETECTION FOR AMHARIC LANGUAGE
 
GENERAL REGRESSION NEURAL NETWORK BASED POS TAGGING FOR NEPALI TEXT
GENERAL REGRESSION NEURAL NETWORK BASED POS TAGGING FOR NEPALI TEXTGENERAL REGRESSION NEURAL NETWORK BASED POS TAGGING FOR NEPALI TEXT
GENERAL REGRESSION NEURAL NETWORK BASED POS TAGGING FOR NEPALI TEXT
 

Dernier

1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
QucHHunhnh
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
PECB
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
ciinovamais
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
QucHHunhnh
 
Making and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdfMaking and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdf
Chris Hunter
 

Dernier (20)

microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SD
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
 
psychiatric nursing HISTORY COLLECTION .docx
psychiatric  nursing HISTORY  COLLECTION  .docxpsychiatric  nursing HISTORY  COLLECTION  .docx
psychiatric nursing HISTORY COLLECTION .docx
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdf
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024
 
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
 
Making and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdfMaking and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdf
 

MULTIPLE REGION OF INTEREST TRACKING OF NON-RIGID OBJECTS USING DEMON'S ALGORITHM

  • 1. Jan Zizka (Eds) : CCSIT, SIPP, AISC, PDCTA - 2013 pp. 249–256, 2013. © CS & IT-CSCP 2013 DOI : 10.5121/csit.2013.3628 MULTIPLE REGION OF INTEREST TRACKING OF NON-RIGID OBJECTS USING DEMON'S ALGORITHM Rohan Pillai1 , Abhishikta Yalavali2 Saima Mohan1 , and Amol Patil1 1 iGate, Tower 3, Magarpatta City, Pune, India, {rohan.pillai,saima.mohan,amol.vpatil}@igate.com, 2 abhishikthaharsha@gmail.com ABSTRACT In this paper we propose an algorithm for tracking multiple ROI (region of interest) undergoing non-rigid transformations. Demon's algorithm based on the idea of Maxwell's demon, has been applied here to estimate the displacement field for tracking of multiple ROI. This algorithm works on pixel intensities of the sequence of images thus making it suitable for tracking objects/regions undergoing non-rigid transformations. We have incorporated a pyramid-based approach for demon's algorithm computations of displacement field, which leads to significant reduction in the convergence speed and improvement in the accuracy. This algorithm is applied for tracking non-rigid objects in laproscopy videos which would aid surgeons in Minimal Invasive Surgery (MIS). KEYWORDS Computer vision, Maxwell's demon, multi-resolution pyramid, non-rigid deformation, optical flow, ROI tracking 1. INTRODUCTION Tracking a region of interest is well-known in image processing community. In its long history, a number of algorithm have been designed that address this problem quite successfully1 . In some real world situations, objects tend undergo non-rigid transformation and are thus difficult to track.These scenarios especially are prominent in medical applications where usually the region under consideration undergoes non rigid transformation mainly due to camera movement. In this paper we address the problem of tracking multiple region of interest (ROI) whose geometric appearance are changing gradually over time. We demonstrate the application of this algorithm for tracking multiple ROI's in a video recorded during endoscopy. In our previous paper6 , we have successfully demonstrated the use of demon based technique for tracking an ROI in video sequence. In our current paper, we refine our earlier work by considering the entire image and using a multi-resolution approach for calculating the displacement field. This introduction of multi-resolution approach has resulted in improved
  • 2. 250 Computer Science & Information Technology (CS & IT) computational efficiency and accuracy of the tracking algorithm which will be discussed in detail in subsequent sections of this paper. The overview of demons algorithm along with its use in tracking is presented in section 2. The multi-resolution approach is discussed in section 3. Discussions, results and conclusion appear in section 4, 5 and 6 respectively. 2. DEMON BASED TRACKING Thirion7 introduced the demons forces for registration of images acquired from different modalities. The concept behind his experiment was to deform one image to match another. The work involved calculation of displacement of a physical point present in two images with the support of gradient based optical flow computation. The expression for displacement field of two images is as given in equation (1) Here is the displacement along x and y direction of a point, p, in the static image S with reference to a movable image M. is the gradient at point p on the image S. If is the image to be deformed then movable image M is constructed using a deformation operator as given in equation (2) and (3). Thirion had re-normalized the optical flow equation by introducing the intensity difference between the images in the denominator. Due to this, the displacement of each point has to be calculated iteratively until the squared error is minimized. The objective of image registration technique is to solve the minimization problem, to find the displacement field equation (??), where, d is the dissimilarity metric to measure the similarity between two images. Furthermore, to improve convergence speed and stability of the registration process, He Wang et al.8 introduced the forces of the movable image. Here, α is a positive homogenization factor introduced by Cachier et al.4 to bound the displacement field.
  • 3. Computer Science & Information Technology (CS & IT) 251 Tracking objects in a video can be accomplished by calculating the interframe displacement of the objects. Assuming that the scene being recorded is not undergoing any rapid movement or change, equation (5) gives us the displacement of points in one image with reference to the next image. The displacement field is Gaussian smoothed for regularization2 . This displacement field is used to track a Region of Interest (ROI) in a video. The metric function, d, used in this paper is Sum of Squared Difference (SSD) of the images. 3. PYRAMIDAL IMPLEMENTATION Bouguet3 demonstrated the use of pyramidal implementation in computing the optical flow algorithm based on Lucas Kanade method. The pyramid representation used in this paper is similar to that proposed by Bouguet. The method specified by Burt and Adelson9 is used to construct recursively the pyramidal representation of the images It-1 and It as: {It-1}m=0,1,…L where m is the pyramid Level. The value L is height of the pyramid. This value can be heuristically programmed by incorporating an estimation technique. Depending upon the expected inter frame displacement, L can take values 2, 3 or 4. In the image pyramid, image at mth level is at the resolution of the (m-1)th level. The algorithm starts with the Lth level images and the initial displacement field initialized to a zero vector. Fig. 1. Illustration showing hierarchy of pyramid structure of an image The displacement field is iteratively computed for the images at this level until the termination criteria is satisfied. The termination criteria is defined by the iteration count or the rate of change of error. The estimated displacement field at the Lth level is then rescaled to the images at the ( L - 1 )th level in the pyramid and used as the initial displacement field for estimation. The estimation requires that the displacement field be updated with values computed as per equation (5) so as to decrease the error. This process is repeated until the displacement field for the 0th level (original) images in the pyramid is obtained. Figure 2 summarizes the Pyramidal Implementation of Demon's based algorithm as a flowchart.
  • 4. 252 Computer Science & Information Technology (CS & IT) Fig. 2. Sequential execution of Pyramidal Implementation of Demon's based algorithm 4. DISCUSSION The ROI marked by a surgeon corresponds to an array of underlying pixels in an image. The displacement field calculated by the above algorithm represents the distance of a physical point in a frame being shifted with reference to the corresponding point in the previous frame. This displacement field can be applied to one or more ROI's which will propagate over the temporal sequence without any manual intervention. Using this principle multiple ROI's in the image can be tracked. The total number of computations performed on an nx x ny image would be Kmnxny, where m computations are performed on each pixel of an image per iteration and the image converges after K iterations. Thus the computation time needed is directly proportional to the size of the image.
  • 5. Computer Science & Information Technology (CS & IT) 253 With the introduction of pyramidal approach it is observed that the total computation time has reduced significantly. The displacement field calculated at lower resolution images is refined as we work towards the higher resolution images. Hence, resulting in fewer iterations at subsequent levels of the pyramid to converge. The computational advantage gained is illustrated in the performance chart shown in figure 3(a). For successive iteration the model image has a higher resemblance to the static image. Thus the error metric, SSD, that measures the dissimilarity between the two images is expected to decrease with each iteration. However, the SSD metric is image resolution dependent hence can not be used while comparing convergence between different pyramid levels. To establish a benchmark when comparing the displacement field produced at each level in the pyramid, we rescale the displacement field at each iteration to full image size and then use it to compute error with respect to the original image. Figure 3(b) shows the graph of error convergence with respect to iterations for single resolution and pyramidal approach. This confirms that the displacement field calculated at every level in the pyramid facilitates the overall convergence. The two results can be compared as they are operating on the same base images. In Figure 3(b) a sharp drop in error is observed at iteration number 5 and 18 which is caused by transition from one level in the pyramid to another. An Interesting observation about the type of convergence can be noted in figure 3 when plotted with respect to iteration count, the convergence graph looks like a series of decaying exponential. However, when the convergence graph is plotted against time, it looks more like a fast decaying exponential. Pyramidal implementation provides the freedom of making certain parameters like, smoothing kernel, termination criteria and the positive homogenization factor level dependent. The performance of the system can be further improved by selecting suitable values for these parameters. Pyramidal implementation provides the freedom of making certain parameters like, smoothing kernel, termination criteria and the positive homogenization factor level dependent. The performance of the system can be further improved by selecting suitable values for these parameters. (a)
  • 6. 254 Computer Science & Information Technology (CS & IT) (b) Fig. 3. Performance graph. (a) Plot depicting the time taken by single resolution (dotted line) and the pyramidal implementation (dashed line) to converge. (b) Graph of error calculated after every iteration for pyramidal implementation 5. RESULTS The performance of the algorithm was tested on a stream of synthesized images involving the motion of two blobs in arbitrary directions. For the numerical experiment, synthetic images of size 512x512 are generated with objects of interest imposed on a noisy background. To construct a noise model in our synthetic image, we used Gaussian noise (having zero mean and 0.005 variance) along with salt and pepper noise (0.005 probability) generated by MATLAB. For the first experiment figure 4, the objects considered are rigid bodies moving in different directions. Figure 4(a) indicates the initial state of objects with green marker enclosing them and figure 4(b) illustrates the process of tracking for an intermediate frame in the simulated video sequence. Fig. 4. Comparison of single resolution approach (green marker) and multi-resolution approach (red marker) for tracking multiple objects over time. (a) Second frame in the video (b) Result of tracking ROI's after 42th frame. In medical imaging, regions of interest being filmed tend to undergo affine transformation due to continuous change in the camera position. The applicability of the discussed algorithm to track such non rigid objects is demonstrated in figure 5. It can be observed that the red line continues to
  • 7. Computer Science & Information Technology (CS & IT) 255 capture the complete object including the gradual change in appearance of the object. The array corresponding to the ROI to accommodate for non-rigid transformation. As the size of the deformable region changes over time the length of the marker array should be made adaptive to accommodate this. Figure 6 shows the results for an endoscope video with a resolution of 640x480 as input. The algorithm works on the green channel of the video. The ROI's shown in blue are marked by the end user for tracking. The region undergoes non-rigid deformation over time which is tracked by the algorithm. Fig. 5. Illustration of experimental setup for tracking an object deforming over time. (a) First frame in the video where red marker indicates the ROI. (b) After 39th frame, the red marker continues to enclose the region which has undergone deformation Fig. 6. Results of tracking multiple user-defined ROI on the endoscopy stream. The blue marker indicates the output of the multi-resolution approach and the green marker depicts the single-resolution approach. (a) Initial frame in the sequence (b) Result of tracking on the 54th frame in the sequence. 6. CONCLUSIONS & FUTURE WORK In this paper, we demonstrated the use of pyramidal implementation for estimating the displacement field based on Thirion's demon. The algorithm successfully tracks multiple ROI's that undergo non-rigid transformation. Based on the experiments carried out it clearly indicates the advantage of pyramidal approach in terms of improvement in the convergence speed. Moreover, the result obtained using pyramidal approach generated robust and artifact free results as compared to the single resolution implementation.
  • 8. 256 Computer Science & Information Technology (CS & IT) Though the current approach is well suited for tracking multiple ROI's it is limited to cases of occlusion. Future work will include parallelism of the current implementation using GPU to achieve even faster convergence. Apart from vision based assistance to MIS, the demon based tracking approach can also be applied for pathological cell detection and tracking. A classical clinical condition where cells deform rapidly over time and requires to be tracked for continuous analysis. REFERENCES [1] S.S. Beauchemin and J.L. Barron. The computation of optical flow. ACM Computing Surveys, 1995. [2] P. Anandan. A computational framework and an algorithm for the measurement of visual. Technical report, University of Massachusetts Amherst, 1987. [3] Jean-Yves Bouguet. Pyramidal implementation of the lucas kanade feature tracker description of the algorithm. Technical report, Intel Corporation Microprocessor Research Labs, 2000. [4] Pascal Cachier, Xavier Pennec, and Nicholas Ayache. Fast non rigid matching by gradient descent: Study and improvements of the "demons" algo- rithm. Technical report, INSTITUT NATIONAL DE RECHERCHE EN INFORMATIQUE ET EN AUTOMATIQUE, June 1999. [5] Visvanathan Ramesh, Dorin Comaniciu and Peter Meer. Kernel-based object tracking. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, VOL. 25, NO. 5, 2003. [6] Abhinav Kumar, B Madhusudan Rao, Rajesh Ghole, Amol Patil, and Nilesh Ghatpande. Demons based tracking for non-rigid transformed region of interest. In TENCON 2011 - 2011 IEEE Region 10 Conference, IEEEXplore, November 2011. [7] J.-P. Thirion. Non-rigid matching using demons. In Proc. IEEE Computer Society Conf. Computer Vision and Pattern Recognition CVPR '96, 1996. [8] He Wang, Lei Dong, Jennifer O'Daniel, Radhe Mohan, Adam S Garden, K Kian Ang, Deborah A Kuban, Mark Bonnen, Joe Y Chang, and Rex Cheung. Validation of an accelerated 'demons' algorithm for deformable image registration in radiation therapy. Physics in Medicine and Biology, 50(12), 2005. [9] Peter J. Burt and Edward H. Adelson. The laplacian pyramid as a compact image code. IEEE Transactions on Communications, April 1983.