SlideShare une entreprise Scribd logo
1  sur  46
Research Review
Kuo-Yen Lo
羅国彦
ロ コウエン
2013.4.18
Sato Laboratory, University of Tokyo
Short Curriculum Vitae
• Personal Homepage
http://www.hci.iis.u-tokyo.ac.jp/~kylo/
• Period of past years
University (2006 – 2010)
Internship (2010 – 2011)
Research Assistant (2012 – 2013)
• Two main research topic:
1. 2D-to-3D conversion
2. Photo Aesthetics.
• Wrap up
Short Curriculum Vitae
• Language: English (TOEIC
935), JLPT(N1), Chinese(Native)
• Programming: C/C++, Matlab, Java(android)
Technique: SIFT/SURF/HOG, K-
means, GMM, kNN, SVM, PCA/LDA/ITML, bad-of-
visual word, bilateral filter.
• Have traveled to: USA, Korea.
Want to travel to: China, Thailand, Spain
• Why Japan
-- historical and cultural connection
-- camera companies and electronics maker here
Visual Cues
Low-level
Mathematics
Machine
Learning
Psychology
Computer Vision
Overview
2006
NTU
2007
UPenn
2008
OpenCV
2009
Robotics
Contest
Gender
Recognition
Contest
2012
Research
Assistant
ICPR
2012
ACCV_w
2012
2010
ISMAB
NTU
Graduate
Internship
@ TV corp.
2011
Navy
National Taiwan University
2006
NTU
2007
UPenn
2008
OpenCV
People in Vision
• Yi-Ping Hung(MM12, CVPR11, CHI11, UIST11)
• Yung-Yu Chuang(CVPR12*3)
• C.J. Lin (Libsvm)
• H.T. Lin(ICML12, NIPS12, CVPR11
KDD12 Champion)
• Winston H. Hsu(MM12*6)
33,000
1928 B.C.
World rank 80
Summer School in UPenn
2006
NTU
2007
UPenn
2008
OpenCV
Summer Language Program
University of Pennsylvania
Join the Lab
2006
NTU
2007
UPenn
2008
OpenCV
Biophotonics and
Bioimaging Laboratory
Prof. Ta-Te Lin
OpenGLOpenCV
Borland
C++ Builder
Robotics Contest @ ASABE 2009
2009
Robotics
Contest
Gender
Recognition
Contest
• Problem:
Detecting and Positioning the circular obstacle
• Technique:
Graphical simulation(OpenGL), Sensor
• Material:
Boe-Bot Toolkit, IR sensor, Ultrasonic
sensor, Zigbee wireless communication
Robotics Contest @ ASABE 2009
2009
Robotics
Contest
Gender
Recognition
Contest
Please Visit the following link
for viewing the video:
http://youtu.be/8EjON8Y2OJ0
Gender Recognition Contest
2009
Robotics
Contest
Gender
Recognition
Contest
• Problem:
Recognize the gender with single face image
• Technique:
Viola-Jones Face detector (OpenCV)
Feature-based alignment
False-alarm check
Rotate the image 35 degree
to detect all possible tilt face
Gender Recognition Contest
2009
Robotics
Contest
Gender
Recognition
Contest
• Problem:
Recognize the gender with single face image
• Technique:
Viola-Jones Face detector (OpenCV)
Feature-based alignment
False-alarm check
Use Eye detector
to wrap the face to
untilt view.
Gender Recognition Contest
2009
Robotics
Contest
Gender
Recognition
Contest
• Problem:
Recognize the gender with single face image
• Technique:
Viola-Jones Face detector (OpenCV)
Feature-based alignment
False-alarm check
Utilize Skin color model, Eye
and Mouth detector to filter
the false-positive result from
the V-J face detector.
Gender Recognition Contest
2009
Robotics
Contest
Gender
Recognition
Contest
• Problem:
Recognize the gender with single face image
• Technique:
Viola-Jones Face detector (OpenCV)
Feature-based alignment
False-alarm check
• Performance
1.2 second per 480*320 image
• Result
~85% face detection accuracy
~75% gender recognition accuracy
Win 3rd place among 20 teams (Taiwan and
China). Bonus 60,0000yen.
Fish Recognition
2010
ISMAB
NTU
Graduate
Internship
@ TV corp.
• Problem:
Tuna species recognition for fishery
conservation and management
• Task:
Detection and Classification
Bigeye
YellowfinAlbacore
3 spices are considered2011
Navy
Fish Recognition
• Problem:
Tuna species recognition for fishery
conservation and management
• Task:
Detection and Classification
Fish Image are captured
in certain lighting condition
with measurement plate.
Body part is smooth,
makes it reflect light well.
72%
2010
ISMAB
NTU
Graduate
Internship
@ TV corp.
2011
Navy
Fish Recognition
• Problem:
Tuna species recognition for fishery
conservation and management
• Task:
Detection and Classification
B Y A
B 89 10 5
Y 9 86 3
A 10 9 81
84%
34% 72% 52% 58%
Confusion Matrix
(Head)(Abdomen)(Tail fin) (Tail)
Discriminate part!
2010
ISMAB
NTU
Graduate
Internship
@ TV corp.
2011
Navy
Yeh, Graduation!
2010
ISMAB
NTU
Graduate
Internship
@ TV corp.
2011
Navy
2D-to-3D conversion
2010
ISMAB
NTU
Graduate
Internship
@ TV corp.
2011
Navy
• Problem:
Generating 3D video
from 2D content.
• Inspiration:
3D information is
recovered by depth
cues
Captured View + Depth
2D-to-3D conversion
2010
ISMAB
NTU
Graduate
Internship
@ TV corp.
2011
Navy
Reality Comfort
• Accurate depth map
• Correct depth order
• Real-time processing
• Clear boundary
• Temporal smoothness
• Visual impression
How people perceive depth?
2010
ISMAB
NTU
Graduate
Internship
@ TV corp.
2011
Navy
1. Low-level cue 2. Scene Recognition
2D-to-3D conversion
2010
ISMAB
NTU
Graduate
Internship
@ TV corp.
2011
Navy
Video frame + motion estimation[ICCE 2009]
Approaches
1. Depth map by motion
2. Depth map by saliency
3. Depth map by prior
information fusion
2D-to-3D conversion
2010
ISMAB
NTU
Graduate
Internship
@ TV corp.
2011
Navy Video frame + Saliency map [SDA 2010]
Approaches
1. Depth map by motion
2. Depth map by saliency
3. Depth map by prior
information fusion
Introduction to Bilateral Filter
2010
ISMAB
NTU
Graduate
Internship
@ TV corp.
2011
Navy
Bilateral Filter [Tomasi, ICCV98]:
f(x) h(x)
“Bi” lateral = Spatial term + Range term
Introduction to Bilateral Filter
2010
ISMAB
NTU
Graduate
Internship
@ TV corp.
2011
Navy
Bilateral Filter [Tomasi, ICCV98]:
f(x) h(x)
“Bi” lateral = Spatial term + Range term
Introduction to Bilateral Filter
2010
ISMAB
NTU
Graduate
Internship
@ TV corp.
2011
Navy
Bilateral Filter [Tomasi, ICCV98]:
f(x) h(x)
“Bi” lateral = Spatial term + Range term
Application of Bilateral Filter
2010
ISMAB
NTU
Graduate
Internship
@ TV corp.
2011
Navy
“Bi” lateral = Spatial term + Range term
Smooth Target Edge-Preserving Result
And this one?
2D-to-3D conversion
2010
ISMAB
NTU
Graduate
Internship
@ TV corp.
2011
Navy
Prior fusion [Siggraph 2009]
1. Decide Geometric perspective
2. Integrate Image and Depth map by Bilateral filter
One-year in Navy
2010
ISMAB
NTU
Graduate
Internship
@ TV corp.
2011
Navy
Academia Sinica
2012
Research
Assistant
ICPR
2012
ACCV
2012
Institute of Information Science
(Central Research Academy)
People in Vision
• Chu-Song Chen (CVPR12, CVPR11*2)
• Mark H. Liao (MM12*2, MM11*2)
• Y.-C. Frank Wang (ECCV12, CVPR12)
• Yen-Yu Lin (CVPR13, MM12, TPAMI11)
Prof. Chen
PHOTO AESTHETICS CLASSIFICATION
Predicting the visual appealing quality of photos
good?
jcar@DPChallenge
good?
Mnet @ DPChallenge
Which one is better?
Voted by online photo community
Average: 5.0
88 votes
Average: 7.2
92 votes
Reason?
Reason?
Boundary
Alternating repetition
(Texture)
Contrast Levels of scale
Roughness
Strong centers
Positive space
Local symmetries
The Void
Not-separateness
Good shape
GradientsEchoes
Simplicity and Inner Calm
Deep interlock and ambiguity
Color
Composition
Harmonium
Richness
Application
Image Search & Management Photo evaluation system
Embedded Camera system Media analysis
Photo Aesthetics
2012
Research
Assistant
ICPR
2012
ACCV
2012
• Problem:
Recognition the appealing quality of
photo by computational approaches.
• Technique:
Image analysis, Pattern recognition,
Crowdsourcing, Psychology, Photography
• Application
ICPR 2012
2012
Research
Assistant
ICPR
2012
ACCV
2012
As a Pattern Recognition Problem…
Comparison of feature
1. Edge distribution, Color histogram,
Hue, Saturation.. [Ke, CVPR06]
2. SIFT + BOV [Marchesotti , ICCV11]
3. Composition layout (Edge + HSV),
Color palette, contrast.. [Proposed]
Result
Item Speed on PC Accuracy
CVPR06 0.2s 81%
ICCV11 4s 85%
Proposed 0.16s 84%
[ Photo aesthetics assessment with efficiency ]
Extraction of Color Information
Extract N
Dominant colors
(we set N=5)
K-Nearest Neighbor
(K=20)
List of Palettes
Dictionary
HQ Palettes Dictionary
LQ Palettes Dictionary
Palettes of Photo
Retrieved by Frequency
Retrieved by Kmeans
(Cluster Center)
Proposed
(Weighted Kmeans)
Finding the Dominant Colors
Video Demo
2012
Research
Assistant
ICPR
2012
ACCV
2012
[ Intelligent Photographing Interface with On-Device Aesthetic Quality Assessment ]
Please Visit the following link
for viewing the video:
http://youtu.be/o8mKuTfO6ao
Discussion 1
Device : On-line assistive camera system
• Contextual Information (Viewing angle)
camera < human
• Feedback
from analysis to advice
• Human behavior
What do people take?
How do people take?
• Computation
Server-based v.s. Device
• Market and Needs
Discussion 2
Algorithm: photo aesthetic value assessment
• Definition of photo aesthetics
Expert v.s. Volkswagen
• Labeling process
Individual bias and variance.
Absolute or Relative evaluation
Effect of Labeling order
• Quantify photo aesthetic
Modeling, the Personalization
Thanks for your attention!
10 ratings
5.00/7 average
5 ratings
4.90/7 average

Contenu connexe

En vedette (6)

Structured Data and Semantic SEO
Structured Data and Semantic SEOStructured Data and Semantic SEO
Structured Data and Semantic SEO
 
The Different Worlds of SEO and Content Strategy
The Different Worlds of SEO and Content StrategyThe Different Worlds of SEO and Content Strategy
The Different Worlds of SEO and Content Strategy
 
The Long Play of Content Strategy - Seattle Interactive 2015
The Long Play of Content Strategy - Seattle Interactive 2015The Long Play of Content Strategy - Seattle Interactive 2015
The Long Play of Content Strategy - Seattle Interactive 2015
 
Inside Google Knowledge Graph
Inside Google Knowledge GraphInside Google Knowledge Graph
Inside Google Knowledge Graph
 
Types of Scales in Fishes
Types of Scales in FishesTypes of Scales in Fishes
Types of Scales in Fishes
 
Schema and Open Graph 101 - SMX Munich
Schema and Open Graph 101 - SMX MunichSchema and Open Graph 101 - SMX Munich
Schema and Open Graph 101 - SMX Munich
 

Similaire à Introduction to my Research

A race to your desired future: Pacing yourself to peak performance
A race to your desired future: Pacing yourself to peak performanceA race to your desired future: Pacing yourself to peak performance
A race to your desired future: Pacing yourself to peak performance
Yuliana Tjioe
 
International Journal of Image Processing (IJIP) Volume (3) Issue (6)
International Journal of Image Processing (IJIP) Volume (3) Issue (6)International Journal of Image Processing (IJIP) Volume (3) Issue (6)
International Journal of Image Processing (IJIP) Volume (3) Issue (6)
CSCJournals
 
Felipe puerta (university presentation)
Felipe puerta (university presentation)Felipe puerta (university presentation)
Felipe puerta (university presentation)
Pipe Puerta Velasco
 
Principal's Briefing : Sec 3 Meet-Parents-Session on 1 June 2013
Principal's Briefing : Sec 3 Meet-Parents-Session on 1 June 2013Principal's Briefing : Sec 3 Meet-Parents-Session on 1 June 2013
Principal's Briefing : Sec 3 Meet-Parents-Session on 1 June 2013
Yuliana Tjioe
 
Understanding current practice around the Assessment of Multimedia Artefacts
Understanding current practice around the Assessment of Multimedia ArtefactsUnderstanding current practice around the Assessment of Multimedia Artefacts
Understanding current practice around the Assessment of Multimedia Artefacts
Denise Whitelock
 
Principal's Briefing : Sec 3 Meet-Parents-Session on 1 June 2013
Principal's Briefing : Sec 3 Meet-Parents-Session on 1 June 2013Principal's Briefing : Sec 3 Meet-Parents-Session on 1 June 2013
Principal's Briefing : Sec 3 Meet-Parents-Session on 1 June 2013
Yuliana Tjioe
 
Resume_HaoZhang_Dec07
Resume_HaoZhang_Dec07Resume_HaoZhang_Dec07
Resume_HaoZhang_Dec07
Hao Zhang
 
Examining the Impact of Individual Differences of Informatijon Processing St...
Examining the Impact of Individual Differences of Informatijon Processing St...Examining the Impact of Individual Differences of Informatijon Processing St...
Examining the Impact of Individual Differences of Informatijon Processing St...
Takeshi Sato
 

Similaire à Introduction to my Research (20)

Application of feature point matching to video stabilization
Application of feature point matching to video stabilizationApplication of feature point matching to video stabilization
Application of feature point matching to video stabilization
 
NTU Innovations in Teaching Seminar - students as co creators
NTU Innovations in Teaching Seminar - students as co creatorsNTU Innovations in Teaching Seminar - students as co creators
NTU Innovations in Teaching Seminar - students as co creators
 
Measurement of Course learning Outcome using new NCAAA course report template
Measurement of Course learning Outcome using new NCAAA course report templateMeasurement of Course learning Outcome using new NCAAA course report template
Measurement of Course learning Outcome using new NCAAA course report template
 
Resume 2015/1
Resume 2015/1Resume 2015/1
Resume 2015/1
 
Practical computer vision-- A problem-driven approach towards learning CV/ML/DL
Practical computer vision-- A problem-driven approach towards learning CV/ML/DLPractical computer vision-- A problem-driven approach towards learning CV/ML/DL
Practical computer vision-- A problem-driven approach towards learning CV/ML/DL
 
CV of Dennis Wu
CV  of Dennis WuCV  of Dennis Wu
CV of Dennis Wu
 
A race to your desired future: Pacing yourself to peak performance
A race to your desired future: Pacing yourself to peak performanceA race to your desired future: Pacing yourself to peak performance
A race to your desired future: Pacing yourself to peak performance
 
Expert Guided Crowd Sourced Learning Content
Expert Guided Crowd Sourced Learning ContentExpert Guided Crowd Sourced Learning Content
Expert Guided Crowd Sourced Learning Content
 
International Journal of Image Processing (IJIP) Volume (3) Issue (6)
International Journal of Image Processing (IJIP) Volume (3) Issue (6)International Journal of Image Processing (IJIP) Volume (3) Issue (6)
International Journal of Image Processing (IJIP) Volume (3) Issue (6)
 
Felipe puerta (university presentation)
Felipe puerta (university presentation)Felipe puerta (university presentation)
Felipe puerta (university presentation)
 
Resume
ResumeResume
Resume
 
Introducing Sunmoon University
Introducing Sunmoon UniversityIntroducing Sunmoon University
Introducing Sunmoon University
 
Principal's Briefing : Sec 3 Meet-Parents-Session on 1 June 2013
Principal's Briefing : Sec 3 Meet-Parents-Session on 1 June 2013Principal's Briefing : Sec 3 Meet-Parents-Session on 1 June 2013
Principal's Briefing : Sec 3 Meet-Parents-Session on 1 June 2013
 
Understanding current practice around the Assessment of Multimedia Artefacts
Understanding current practice around the Assessment of Multimedia ArtefactsUnderstanding current practice around the Assessment of Multimedia Artefacts
Understanding current practice around the Assessment of Multimedia Artefacts
 
Principal's Briefing : Sec 3 Meet-Parents-Session on 1 June 2013
Principal's Briefing : Sec 3 Meet-Parents-Session on 1 June 2013Principal's Briefing : Sec 3 Meet-Parents-Session on 1 June 2013
Principal's Briefing : Sec 3 Meet-Parents-Session on 1 June 2013
 
Resume
ResumeResume
Resume
 
Resume_HaoZhang_Dec07
Resume_HaoZhang_Dec07Resume_HaoZhang_Dec07
Resume_HaoZhang_Dec07
 
Perception and Quality of Immersive Media
Perception and Quality of Immersive MediaPerception and Quality of Immersive Media
Perception and Quality of Immersive Media
 
maykinCV2017
maykinCV2017maykinCV2017
maykinCV2017
 
Examining the Impact of Individual Differences of Informatijon Processing St...
Examining the Impact of Individual Differences of Informatijon Processing St...Examining the Impact of Individual Differences of Informatijon Processing St...
Examining the Impact of Individual Differences of Informatijon Processing St...
 

Introduction to my Research

Notes de l'éditeur

  1. 2013 Luckily be here with Sato Lab.
  2. I started learning Japanese at the first year in Univ.NTU, National Taiwan University33,000 students (UT40,000)1.6km2 (~= UT)
  3. After ended my first year in university, I figured out the life here is not what I want.So I planed a travel, to see people on the opposite of the earth,How they act, how they
  4. Since there is a required bachelor thesis project at our department,I choose the lab lead by Prof. Lin, which focus on image processing and inspection technique.Start to play with OpenGL, OpenCV, and the GUI software Boaland C++
  5. 2013 Luckily be here with Sato Lab.2012 ICPR and ACCV works, sponsored by Panasonic Scholarship2012 Join Academia Sinica, literally &quot;Central Research Academy”.~50 CS P.I. (principal investigator) in two institution. Has paper on competitive conf. such as ICCV, CVPR, MM, every year.2011 Navy, duty and vomit on the ship. Political warfare2010 Internship , HsinChu, Taiwan intern in Intelligent Video Processing Research Group • Automatic 2D-to-3D Conversion of video for 3D TV• Survey the standards of stereoscopic video• Depth map generation, depth fusion, saliency detection 2010 graduate2010 Paper Tuna fish species classification using support vector ma-chine”. International Symposium on Machinery and Mechatronics for Agricultural and Biosys-tems Engineering (ISMAB) 2009 UTech Machine Vision Prize, 3rd place • Human Face Sex Recognition (3 out of 24 teams from Taiwan and China)• Part of winning team with prize NTD 200,000.2009Robotics Competition American Society of Agricultural and Biological Engineers Annual International Meeting• Build a multi-robot system equipped with Infra-Red sensors and Ultrasonic sensor• Each robot cooperate with Server through Zigbee wireless communication• Lead a team of 4 member for 6 months preparation (rank 4 out of 8 teams from U.S.)2008 Participating in the Lab, began to learn OpenGL and OpenCV2007 Language School at Univ. Pennsylvania, United State2006 National Taiwan University (rank 80, 2012 QS)
  6. 2013 Luckily be here with Sato Lab.2012 ICPR and ACCV works, sponsored by Panasonic Scholarship2012 Join Academia Sinica, literally &quot;Central Research Academy”.~50 CS P.I. (principal investigator) in two institution. Has paper on competitive conf. such as ICCV, CVPR, MM, every year.2011 Navy, duty and vomit on the ship. Political warfare2010 Internship , HsinChu, Taiwan intern in Intelligent Video Processing Research Group • Automatic 2D-to-3D Conversion of video for 3D TV• Survey the standards of stereoscopic video• Depth map generation, depth fusion, saliency detection 2010 graduate2010 Paper Tuna fish species classification using support vector ma-chine”. International Symposium on Machinery and Mechatronics for Agricultural and Biosys-tems Engineering (ISMAB) 2009 UTech Machine Vision Prize, 3rd place • Human Face Sex Recognition (3 out of 24 teams from Taiwan and China)• Part of winning team with prize NTD 200,000.2009Robotics Competition American Society of Agricultural and Biological Engineers Annual International Meeting• Build a multi-robot system equipped with Infra-Red sensors and Ultrasonic sensor• Each robot cooperate with Server through Zigbee wireless communication• Lead a team of 4 member for 6 months preparation (rank 4 out of 8 teams from U.S.)2008 Participating in the Lab, began to learn OpenGL and OpenCV2007 Language School at Univ. Pennsylvania, United State2006 National Taiwan University (rank 80, 2012 QS)