SlideShare une entreprise Scribd logo
1  sur  21
Real-time Guidance Camera Interface to
Enhance Photo Aesthetic Quality
Yan Xu1, Joshua Ratcliff1, James Scovell2, Gheric Speiginer3, Ronald
Azuma1
1Intel Labs 2Intel Corporation 3Georgia Institute of Technology
Motivation
2http://illuminatedmoments.com/blog/clarification-why-i-hired-a-professional/
Learning Photography Takes Time
3
Real-time Guidance for Novice Users
4
A Camera Interface that can
• Understand the scene
• Know the object of interest
• Give concrete guidance
Research question:
Is the real-time guidance interface an effective way to enhance
photos’ aesthetic quality?
Choosing One Photography Rule in One Photo
Scenario
• Rule-of-thirds
• 1) important compositional elements should be placed along these
lines or their intersection [1]
• 2) the proportion of the object of interest should be roughly one third of
the total image space [2]
• One person portraiture
6
[1] Peterson, B. F. (2003). Learning to see creatively, Amphoto Press.
[2] Smith, J. T. (1797). Remarks on rural scenery. Nathaniel Smith ancient Print. Cited by the Wikipedia
page about rule of thirds: http://en.wikipedia.org/wiki/Rule_of_thirds (retrieved September 10, 2014)
An Example for Rule-of-thirds
7
System: Three components
System Components
9
1. Find the region of interest by face detection and foreground
segmentation
+
System Components
10
2. Calculate how much does the region of interest follow rule-of-thirds
The bitmap mask for calculating the alignment between
subject-of-interest and rule-of-thirds
System Components
11
3. User interface that guides users to move the camera in the space
User Study
Procedure
13
• 40 users take portraiture photos of their friend, using our interface
and a static grid interface
• 24 professional photographers rated the photos
• 48 Mechanical Turk raters rated the photos
Interface for Raters
14
Quantitative Results (1)
15
• Photos taken with real-time guidance UI has better aesthetic quality than static gridline
UI
• Using the two-factor repeated measures ANOVA, we found that expert photographers and
Mechanical Turk workers (MT) rated the photos taken by real-time guidance interface to be
significantly better than those taken by static gridline interface (expert: F = 7.62, p < .05, η2
partial =
.249); MT: F = 20.41, p < .01, η2
partial = .303).
Raters
Real-time Guidance
UI
Static Gridline UI
Expert
photographers
M = 43.95
SD = 22.99
M = 40.60
SD = 22.08
Mechanical Turk
workers
M = 65.98
SD = 21.87
M = 60.92
SD = 22.42
Quantitative Results (2)
16
• Users follow rule-of-thirds better when they use real-time guidance
interface
• Users align the subject to the rule-of-thirds grid better with the RG interface
than the SG interface (average diff = 31.76 (on a 0-250 scale), p < .05, one
tailed paired t-test)
• The proportion of human subject’s width is significantly closer to 1/3 when
using the RG interface compared to the SG interface (average difference =
6%, p < .01, one tailed paired t-test). Users tend to have smaller
subject/image ratios when using the SG interface
Qualitative Findings - Experts
17
1. FACE: Genuine/natural smile, emotion, eye
contact, glass reflection, teeth, skin tone,
2. BODY: Aliveness, Natural poses, hands,
sense of movement and action, head/body
proportion
4. BACKGROUND: leading lines (and other
prominent lines) and vanishing points,
distraction, complement the subject or not.
3. AROUND THE SUBJECT: subject saliency,
distracting background right next to the subject
5. BIG PICTURE: Composition (balance, camera
angle/distance , rule of thirds), Lighting (exposure,
evenness, shadows), Color (white balance,
saturation, Camera angle
50%10%0% 20% 30% 40%
Qualitative Findings – Mechanical Turk Raters
18
1. FACE: Smile, facial expression (naturalness,
confidence, attractiveness), teeth, skin, hair,
mood(more fun, happier, more relaxed), eye
contact
2. BODY: Natural poses, ore flattering body
shape; less distraction or cut off by other
elements; pose, clothes
5. BIG PICTURE: Lighting and shadows, Color
(tone, accuracy, saturation, glare, patches,
vividness, true to reality, harmony), composition,
camera angle and distance, sharpness
4. BACKGROUND: less distraction, realistic or
not, broader view and context, leading lines
3. AROUND THE SUBJECT: subject saliency,
distracting background right next to the
subject, fg/bg harmony
50%10%0% 20% 30% 40%
Conclusion and Future Work
Take-away Message
20
• Real-time guidance interface is effective in terms of improving photo
aesthetic quality and user’s conformation to photography rules
• Understanding photos in both RGB and depth can help us better
evaluate photo quality and provide feedback
New Capabilities of Understanding Photos
21
• Depth
• Segmentation
• Geometry
• Lighting
…
Thank you!
Real-time Guidance Camera Interface to
Enhance Photo Aesthetic Quality
Yan Xu, Joshua Ratcliff, James Scovell, Gheric Speiginer , Ronald Azuma
contact: yan.xu@intel.com

Contenu connexe

Similaire à Real-time Camera Interface for Photo Composition

A Review Paper on Fingerprint Image Enhancement with Different Methods
A Review Paper on Fingerprint Image Enhancement with Different MethodsA Review Paper on Fingerprint Image Enhancement with Different Methods
A Review Paper on Fingerprint Image Enhancement with Different MethodsIJMER
 
BTEC Level 2 Creative Digital Media Production: Photography Booklet
BTEC Level 2 Creative Digital Media Production: Photography BookletBTEC Level 2 Creative Digital Media Production: Photography Booklet
BTEC Level 2 Creative Digital Media Production: Photography BookletKate McCabe
 
An evaluation approach for detection of contours with 4 d images a review
An evaluation approach for detection of contours with 4 d images a reviewAn evaluation approach for detection of contours with 4 d images a review
An evaluation approach for detection of contours with 4 d images a revieweSAT Journals
 
IRJET- Robust Edge Detection using Moore’s Algorithm with Median Filter
IRJET- Robust Edge Detection using Moore’s Algorithm with Median FilterIRJET- Robust Edge Detection using Moore’s Algorithm with Median Filter
IRJET- Robust Edge Detection using Moore’s Algorithm with Median FilterIRJET Journal
 
A Novel Approach for Edge Detection using Modified ACIES Filtering
A Novel Approach for Edge Detection using Modified ACIES FilteringA Novel Approach for Edge Detection using Modified ACIES Filtering
A Novel Approach for Edge Detection using Modified ACIES Filteringidescitation
 
Poster Presentation [University of Dhaka]- Implementation Techniques of Incor...
Poster Presentation [University of Dhaka]- Implementation Techniques of Incor...Poster Presentation [University of Dhaka]- Implementation Techniques of Incor...
Poster Presentation [University of Dhaka]- Implementation Techniques of Incor...Showrav Mazumder
 
Evaluating the Perceptual Impact of Rendering Techniques on Thematic Color Ma...
Evaluating the Perceptual Impact of Rendering Techniques on Thematic Color Ma...Evaluating the Perceptual Impact of Rendering Techniques on Thematic Color Ma...
Evaluating the Perceptual Impact of Rendering Techniques on Thematic Color Ma...Matthias Trapp
 
The Art of Photography Unveiling the Essence of Capturing Timeless Moments.pdf
The Art of Photography Unveiling the Essence of Capturing Timeless Moments.pdfThe Art of Photography Unveiling the Essence of Capturing Timeless Moments.pdf
The Art of Photography Unveiling the Essence of Capturing Timeless Moments.pdfPriyanka Kardam
 
Automatic Detection of Radius of Bone Fracture
Automatic Detection of Radius of Bone FractureAutomatic Detection of Radius of Bone Fracture
Automatic Detection of Radius of Bone FractureIRJET Journal
 
Adversarial Photo Frame: Concealing Sensitive Scene Information in a User-Acc...
Adversarial Photo Frame: Concealing Sensitive Scene Information in a User-Acc...Adversarial Photo Frame: Concealing Sensitive Scene Information in a User-Acc...
Adversarial Photo Frame: Concealing Sensitive Scene Information in a User-Acc...multimediaeval
 
Improvement of Weld Images using MATLAB –A Review
Improvement of Weld Images using MATLAB –A ReviewImprovement of Weld Images using MATLAB –A Review
Improvement of Weld Images using MATLAB –A Reviewinventy
 
Lecture01: Introduction to Photogrammetry
Lecture01: Introduction to PhotogrammetryLecture01: Introduction to Photogrammetry
Lecture01: Introduction to PhotogrammetrySarhat Adam
 
CMC in Mozambique @ CIRS 2012
CMC in Mozambique @ CIRS 2012CMC in Mozambique @ CIRS 2012
CMC in Mozambique @ CIRS 2012Sara Vannini
 
Adaptive approach to retrieve image affected by impulse noise
Adaptive approach to retrieve image affected by impulse noiseAdaptive approach to retrieve image affected by impulse noise
Adaptive approach to retrieve image affected by impulse noiseeSAT Publishing House
 

Similaire à Real-time Camera Interface for Photo Composition (20)

PPT s01-machine vision-s2
PPT s01-machine vision-s2PPT s01-machine vision-s2
PPT s01-machine vision-s2
 
A Review Paper on Fingerprint Image Enhancement with Different Methods
A Review Paper on Fingerprint Image Enhancement with Different MethodsA Review Paper on Fingerprint Image Enhancement with Different Methods
A Review Paper on Fingerprint Image Enhancement with Different Methods
 
BTEC Level 2 Creative Digital Media Production: Photography Booklet
BTEC Level 2 Creative Digital Media Production: Photography BookletBTEC Level 2 Creative Digital Media Production: Photography Booklet
BTEC Level 2 Creative Digital Media Production: Photography Booklet
 
An evaluation approach for detection of contours with 4 d images a review
An evaluation approach for detection of contours with 4 d images a reviewAn evaluation approach for detection of contours with 4 d images a review
An evaluation approach for detection of contours with 4 d images a review
 
IRJET- Robust Edge Detection using Moore’s Algorithm with Median Filter
IRJET- Robust Edge Detection using Moore’s Algorithm with Median FilterIRJET- Robust Edge Detection using Moore’s Algorithm with Median Filter
IRJET- Robust Edge Detection using Moore’s Algorithm with Median Filter
 
final_project
final_projectfinal_project
final_project
 
A Novel Approach for Edge Detection using Modified ACIES Filtering
A Novel Approach for Edge Detection using Modified ACIES FilteringA Novel Approach for Edge Detection using Modified ACIES Filtering
A Novel Approach for Edge Detection using Modified ACIES Filtering
 
Poster Presentation [University of Dhaka]- Implementation Techniques of Incor...
Poster Presentation [University of Dhaka]- Implementation Techniques of Incor...Poster Presentation [University of Dhaka]- Implementation Techniques of Incor...
Poster Presentation [University of Dhaka]- Implementation Techniques of Incor...
 
02 Fall09 Lecture Sept18web
02 Fall09 Lecture Sept18web02 Fall09 Lecture Sept18web
02 Fall09 Lecture Sept18web
 
Photogrammetry1
Photogrammetry1Photogrammetry1
Photogrammetry1
 
Evaluating the Perceptual Impact of Rendering Techniques on Thematic Color Ma...
Evaluating the Perceptual Impact of Rendering Techniques on Thematic Color Ma...Evaluating the Perceptual Impact of Rendering Techniques on Thematic Color Ma...
Evaluating the Perceptual Impact of Rendering Techniques on Thematic Color Ma...
 
The Art of Photography Unveiling the Essence of Capturing Timeless Moments.pdf
The Art of Photography Unveiling the Essence of Capturing Timeless Moments.pdfThe Art of Photography Unveiling the Essence of Capturing Timeless Moments.pdf
The Art of Photography Unveiling the Essence of Capturing Timeless Moments.pdf
 
Automatic Detection of Radius of Bone Fracture
Automatic Detection of Radius of Bone FractureAutomatic Detection of Radius of Bone Fracture
Automatic Detection of Radius of Bone Fracture
 
Adversarial Photo Frame: Concealing Sensitive Scene Information in a User-Acc...
Adversarial Photo Frame: Concealing Sensitive Scene Information in a User-Acc...Adversarial Photo Frame: Concealing Sensitive Scene Information in a User-Acc...
Adversarial Photo Frame: Concealing Sensitive Scene Information in a User-Acc...
 
Title.docx
Title.docxTitle.docx
Title.docx
 
Improvement of Weld Images using MATLAB –A Review
Improvement of Weld Images using MATLAB –A ReviewImprovement of Weld Images using MATLAB –A Review
Improvement of Weld Images using MATLAB –A Review
 
Lecture01: Introduction to Photogrammetry
Lecture01: Introduction to PhotogrammetryLecture01: Introduction to Photogrammetry
Lecture01: Introduction to Photogrammetry
 
Aberration_Errors
Aberration_ErrorsAberration_Errors
Aberration_Errors
 
CMC in Mozambique @ CIRS 2012
CMC in Mozambique @ CIRS 2012CMC in Mozambique @ CIRS 2012
CMC in Mozambique @ CIRS 2012
 
Adaptive approach to retrieve image affected by impulse noise
Adaptive approach to retrieve image affected by impulse noiseAdaptive approach to retrieve image affected by impulse noise
Adaptive approach to retrieve image affected by impulse noise
 

Dernier

Powerful Love Spells in Arkansas, AR (310) 882-6330 Bring Back Lost Lover
Powerful Love Spells in Arkansas, AR (310) 882-6330 Bring Back Lost LoverPowerful Love Spells in Arkansas, AR (310) 882-6330 Bring Back Lost Lover
Powerful Love Spells in Arkansas, AR (310) 882-6330 Bring Back Lost LoverPsychicRuben LoveSpells
 
Leading Mobile App Development Companies in India (2).pdf
Leading Mobile App Development Companies in India (2).pdfLeading Mobile App Development Companies in India (2).pdf
Leading Mobile App Development Companies in India (2).pdfCWS Technology
 
BDSM⚡Call Girls in Sector 71 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 71 Noida Escorts >༒8448380779 Escort ServiceBDSM⚡Call Girls in Sector 71 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 71 Noida Escorts >༒8448380779 Escort ServiceDelhi Call girls
 
FULL ENJOY - 9999218229 Call Girls in {Mahipalpur}| Delhi NCR
FULL ENJOY - 9999218229 Call Girls in {Mahipalpur}| Delhi NCRFULL ENJOY - 9999218229 Call Girls in {Mahipalpur}| Delhi NCR
FULL ENJOY - 9999218229 Call Girls in {Mahipalpur}| Delhi NCRnishacall1
 
9999266834 Call Girls In Noida Sector 52 (Delhi) Call Girl Service
9999266834 Call Girls In Noida Sector 52 (Delhi) Call Girl Service9999266834 Call Girls In Noida Sector 52 (Delhi) Call Girl Service
9999266834 Call Girls In Noida Sector 52 (Delhi) Call Girl Servicenishacall1
 

Dernier (6)

Powerful Love Spells in Arkansas, AR (310) 882-6330 Bring Back Lost Lover
Powerful Love Spells in Arkansas, AR (310) 882-6330 Bring Back Lost LoverPowerful Love Spells in Arkansas, AR (310) 882-6330 Bring Back Lost Lover
Powerful Love Spells in Arkansas, AR (310) 882-6330 Bring Back Lost Lover
 
Leading Mobile App Development Companies in India (2).pdf
Leading Mobile App Development Companies in India (2).pdfLeading Mobile App Development Companies in India (2).pdf
Leading Mobile App Development Companies in India (2).pdf
 
BDSM⚡Call Girls in Sector 71 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 71 Noida Escorts >༒8448380779 Escort ServiceBDSM⚡Call Girls in Sector 71 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 71 Noida Escorts >༒8448380779 Escort Service
 
Obat Penggugur Kandungan Di Apotik Kimia Farma (087776558899)
Obat Penggugur Kandungan Di Apotik Kimia Farma (087776558899)Obat Penggugur Kandungan Di Apotik Kimia Farma (087776558899)
Obat Penggugur Kandungan Di Apotik Kimia Farma (087776558899)
 
FULL ENJOY - 9999218229 Call Girls in {Mahipalpur}| Delhi NCR
FULL ENJOY - 9999218229 Call Girls in {Mahipalpur}| Delhi NCRFULL ENJOY - 9999218229 Call Girls in {Mahipalpur}| Delhi NCR
FULL ENJOY - 9999218229 Call Girls in {Mahipalpur}| Delhi NCR
 
9999266834 Call Girls In Noida Sector 52 (Delhi) Call Girl Service
9999266834 Call Girls In Noida Sector 52 (Delhi) Call Girl Service9999266834 Call Girls In Noida Sector 52 (Delhi) Call Girl Service
9999266834 Call Girls In Noida Sector 52 (Delhi) Call Girl Service
 

Real-time Camera Interface for Photo Composition

  • 1. Real-time Guidance Camera Interface to Enhance Photo Aesthetic Quality Yan Xu1, Joshua Ratcliff1, James Scovell2, Gheric Speiginer3, Ronald Azuma1 1Intel Labs 2Intel Corporation 3Georgia Institute of Technology
  • 4. Real-time Guidance for Novice Users 4 A Camera Interface that can • Understand the scene • Know the object of interest • Give concrete guidance Research question: Is the real-time guidance interface an effective way to enhance photos’ aesthetic quality?
  • 5. Choosing One Photography Rule in One Photo Scenario • Rule-of-thirds • 1) important compositional elements should be placed along these lines or their intersection [1] • 2) the proportion of the object of interest should be roughly one third of the total image space [2] • One person portraiture 6 [1] Peterson, B. F. (2003). Learning to see creatively, Amphoto Press. [2] Smith, J. T. (1797). Remarks on rural scenery. Nathaniel Smith ancient Print. Cited by the Wikipedia page about rule of thirds: http://en.wikipedia.org/wiki/Rule_of_thirds (retrieved September 10, 2014)
  • 6. An Example for Rule-of-thirds 7
  • 8. System Components 9 1. Find the region of interest by face detection and foreground segmentation +
  • 9. System Components 10 2. Calculate how much does the region of interest follow rule-of-thirds The bitmap mask for calculating the alignment between subject-of-interest and rule-of-thirds
  • 10. System Components 11 3. User interface that guides users to move the camera in the space
  • 12. Procedure 13 • 40 users take portraiture photos of their friend, using our interface and a static grid interface • 24 professional photographers rated the photos • 48 Mechanical Turk raters rated the photos
  • 14. Quantitative Results (1) 15 • Photos taken with real-time guidance UI has better aesthetic quality than static gridline UI • Using the two-factor repeated measures ANOVA, we found that expert photographers and Mechanical Turk workers (MT) rated the photos taken by real-time guidance interface to be significantly better than those taken by static gridline interface (expert: F = 7.62, p < .05, η2 partial = .249); MT: F = 20.41, p < .01, η2 partial = .303). Raters Real-time Guidance UI Static Gridline UI Expert photographers M = 43.95 SD = 22.99 M = 40.60 SD = 22.08 Mechanical Turk workers M = 65.98 SD = 21.87 M = 60.92 SD = 22.42
  • 15. Quantitative Results (2) 16 • Users follow rule-of-thirds better when they use real-time guidance interface • Users align the subject to the rule-of-thirds grid better with the RG interface than the SG interface (average diff = 31.76 (on a 0-250 scale), p < .05, one tailed paired t-test) • The proportion of human subject’s width is significantly closer to 1/3 when using the RG interface compared to the SG interface (average difference = 6%, p < .01, one tailed paired t-test). Users tend to have smaller subject/image ratios when using the SG interface
  • 16. Qualitative Findings - Experts 17 1. FACE: Genuine/natural smile, emotion, eye contact, glass reflection, teeth, skin tone, 2. BODY: Aliveness, Natural poses, hands, sense of movement and action, head/body proportion 4. BACKGROUND: leading lines (and other prominent lines) and vanishing points, distraction, complement the subject or not. 3. AROUND THE SUBJECT: subject saliency, distracting background right next to the subject 5. BIG PICTURE: Composition (balance, camera angle/distance , rule of thirds), Lighting (exposure, evenness, shadows), Color (white balance, saturation, Camera angle 50%10%0% 20% 30% 40%
  • 17. Qualitative Findings – Mechanical Turk Raters 18 1. FACE: Smile, facial expression (naturalness, confidence, attractiveness), teeth, skin, hair, mood(more fun, happier, more relaxed), eye contact 2. BODY: Natural poses, ore flattering body shape; less distraction or cut off by other elements; pose, clothes 5. BIG PICTURE: Lighting and shadows, Color (tone, accuracy, saturation, glare, patches, vividness, true to reality, harmony), composition, camera angle and distance, sharpness 4. BACKGROUND: less distraction, realistic or not, broader view and context, leading lines 3. AROUND THE SUBJECT: subject saliency, distracting background right next to the subject, fg/bg harmony 50%10%0% 20% 30% 40%
  • 19. Take-away Message 20 • Real-time guidance interface is effective in terms of improving photo aesthetic quality and user’s conformation to photography rules • Understanding photos in both RGB and depth can help us better evaluate photo quality and provide feedback
  • 20. New Capabilities of Understanding Photos 21 • Depth • Segmentation • Geometry • Lighting …
  • 21. Thank you! Real-time Guidance Camera Interface to Enhance Photo Aesthetic Quality Yan Xu, Joshua Ratcliff, James Scovell, Gheric Speiginer , Ronald Azuma contact: yan.xu@intel.com