Deep Learning Object Detection Basics

•

4 j'aime•3,125 vues

The document discusses different approaches to object detection in images using deep learning. It begins with describing detection as classification, where an image is classified into categories for what objects are present. It then discusses approaches that involve separating detection into a classification head and localization head. The document also covers improvements like R-CNN which uses region proposals to first generate candidate object regions before running classification and bounding box regression on those regions using CNN features. This helps address issues with previous approaches like being too slow when running the CNN over the entire image at multiple locations and scales.

Sciences

Deep Learning based
Object Detection Basics

Detection As Classification
CAT? NO
DOG? NO

Detection As Classification
CAT? YES
DOG? NO

Detection As Classification
CAT? NO
DOG? YES

From Classification To Detection
Classification Head:
● C+1 Scores for C
classes + 1
background
class
Localization Head:
● Class agnostic:
(x,y,w,h)
● Class specific:
(x,y,w,h) X C

From Classification To Detection
● Training
○ Crop random regions from images.
○ Scale to uniform size.
○ A region is labeled according to overlap with ground truth labeling.
○ Optimize using Stochastic Gradient Descent.
○ Handle class imbalance by resampling.
● Detection
○ Use sliding window to go over image.
○ Crop regions.
○ Scale to uniform size.
○ Apply network to all cropped images.
○ Repeat process for different image scales.

How To Handle So Many Detections?
● Problem:
○ Running this algorithm at many locations at many scales result with many detections.
● Solution:
○ Need somehow to suppress weaker detections.

Non-Maximum Suppression (NMS)
● Start with most confident detection D.
● Measure IoU with all other detections.
● Remove detections with IoU>50% with D.
● Repeat with next most confident detection.

From Classification To Detection
● Problem:
○ Previous method was too slow.
○ Network is applied over and over.
● Solution:
○ Sliding window is inherently efficient in the case of CNNs.
● OverFeat: Integrated Recognition, Localization and Detection using
Convolutional Networks (2013)
○ Rob Fergus, Yann LeCun

CNNs Are Still Too Slow
● Problem:
○ Need to test many positions and scales, and use a computationally demanding classifier (CNN)
● Solution:
○ Only look at a tiny subset of possible positions.
● Rich feature hierarchies for accurate object detection and semantic
segmentation (2014)
○ AKA R-CNN
○ Ross Girshick

Region Proposals
● Find “blobby” image regions that are likely to contain objects
● “Class-agnostic” object detector
● Look for “blob-like” regions

R-CNN: Training
1. Train a classification model on a large dataset (ImageNet)
2. Fine-tune model for detection on a smaller dataset (Pascal)
○ Instead of 1000 ImageNet classes, now use 20 classes + background class.
○ Extract region proposals for all images.
○ Use positive / negative regions from detection images.
■ If proposal has >50% IoU with any ground truth → Positive example.
■ Otherwise → Negative example.
■ Batch = 32 positives + 96 negatives.
3. Train final classifiers
○ Extract region proposals for all images.
○ For each region: crop and warp to CNN size, run forward pass, save features to disk.
(Requires ~200GB for Pascal dataset)
○ Train one binary SVM per class to classify region features.
○ Train one linear regression model per class to predict regression offsets.

Looking for brilliant researchers
cv@brodmann17.com

Contenu connexe

Tendances

PR-132: SSD: Single Shot MultiBox DetectorJinwon Lee

A Brief History of Object Detection / Tommi KerolaPreferred Networks

You only look once (YOLO) : unified real time object detectionEntrepreneur / Startup

Action Recognition (Thesis presentation)nikhilus85

Object tracking presentationMrsShwetaBanait1

You Only Look Once: Unified, Real-Time Object DetectionDADAJONJURAKUZIEV

YOLOv4: optimal speed and accuracy of object detection reviewLEE HOSEONG

Object detectionROUSHAN RAJ KUMAR

Deep learning based object detectionchettykulkarni

End-to-End Object Detection with TransformersSeunghyun Hwang

Real Time Object TrackingVanya Valindria

Recent Progress on Object Detection_20170331Jihong Kang

YoloSourav Garai

Real Time Object Dectection using machine learningpratik pratyay

Image segmentation with deep learningAntonio Rueda-Toicen

Faster R-CNN: Towards real-time object detection with region proposal network...Universitat Politècnica de Catalunya

Convolutional Neural Networks (CNN)Gaurav Mittal

You only look onceGin Kyeng Lee

Computer Vision image classificationWael Badawy

Object detection - RCNNs vs RetinanetRishabh Indoria

Tendances (20)

PR-132: SSD: Single Shot MultiBox Detector

A Brief History of Object Detection / Tommi Kerola

You only look once (YOLO) : unified real time object detection

Action Recognition (Thesis presentation)

Object tracking presentation

You Only Look Once: Unified, Real-Time Object Detection

YOLOv4: optimal speed and accuracy of object detection review

Object detection

Deep learning based object detection

End-to-End Object Detection with Transformers

Real Time Object Tracking

Recent Progress on Object Detection_20170331

Yolo

Real Time Object Dectection using machine learning

Image segmentation with deep learning

Faster R-CNN: Towards real-time object detection with region proposal network...

Convolutional Neural Networks (CNN)

You only look once

Computer Vision image classification

Object detection - RCNNs vs Retinanet

Similaire à Deep Learning Object Detection Basics

MLIP - Chapter 5 - Detection, Segmentation, CaptioningCharles Deledalle

物件偵測與辨識技術CHENHuiMei

object detection paper reviewYoonho Na

Fast methods for deep learning based object detectionBrodmann17

Brodmann17 CVPR 2017 review - meetup slides Brodmann17

Cvpr 2017 Summary MeetupAmir Alush

Skin Lesion Detection from Dermoscopic Images using Convolutional Neural Netw...Universitat Politècnica de Catalunya

SSD: Single Shot MultiBox Detector (UPC Reading Group)Universitat Politècnica de Catalunya

D1L5 Visualization (D1L2 Insight@DCU Machine Learning Workshop 2017)Universitat Politècnica de Catalunya

Original SOINNSOINN Inc.

Anomaly Detection and Localization Using GAN and One-Class Classifier홍배 김

150424 Scalable Object Detection using Deep Neural NetworksJunho Cho

Knn Algorithm presentationRishavSharma112

ngboost.pptxHadrian7

D3L4-objects.pdfssusere945ae

Making BIG DATA smallerTony Tran

Anomaly detection using deep one class classifier홍배 김

Deep Learning for Computer Vision: Object Detection (UPC 2016)Universitat Politècnica de Catalunya

Machine Learning Foundations for Professional ManagersAlbert Y. C. Chen

Data Processing Using THEOS Satellite Imagery for Disaster Monitoring (Case S...NopphawanTamkuan

Similaire à Deep Learning Object Detection Basics (20)

MLIP - Chapter 5 - Detection, Segmentation, Captioning

物件偵測與辨識技術

object detection paper review

Fast methods for deep learning based object detection

Brodmann17 CVPR 2017 review - meetup slides

Cvpr 2017 Summary Meetup

Skin Lesion Detection from Dermoscopic Images using Convolutional Neural Netw...

SSD: Single Shot MultiBox Detector (UPC Reading Group)

D1L5 Visualization (D1L2 Insight@DCU Machine Learning Workshop 2017)

Original SOINN

Anomaly Detection and Localization Using GAN and One-Class Classifier

150424 Scalable Object Detection using Deep Neural Networks

Knn Algorithm presentation

ngboost.pptx

D3L4-objects.pdf

Making BIG DATA smaller

Anomaly detection using deep one class classifier

Deep Learning for Computer Vision: Object Detection (UPC 2016)

Machine Learning Foundations for Professional Managers

Data Processing Using THEOS Satellite Imagery for Disaster Monitoring (Case S...

Plus de Brodmann17

5 Practical Steps to a Successful Deep Learning ResearchBrodmann17

Advanced deep learning based object detection methodsBrodmann17

Deep Learning on Everyday DevicesBrodmann17

Brodmann17 I The rise of edge vision intelligence I Adi Pinhas I DLD 2017Brodmann17

DLD meetup 2017, Efficient Deep LearningBrodmann17

Geektime 2017Brodmann17

Plus de Brodmann17 (6)

5 Practical Steps to a Successful Deep Learning Research

Advanced deep learning based object detection methods

Deep Learning on Everyday Devices

Brodmann17 I The rise of edge vision intelligence I Adi Pinhas I DLD 2017

DLD meetup 2017, Efficient Deep Learning

Geektime 2017

Dernier

Explainable AI for distinguishing future climate change scenariosZachary Labe

办理麦克马斯特大学毕业证成绩单|购买加拿大文凭证书zdzoqco

well logging & petrophysical analysis.pptxzaydmeerab121

Replisome-Cohesin Interfacing A Molecular Perspective.pdfAtiaGohar1

Introduction of Human Body & Structure of cell.pptxMedical College

CHROMATOGRAPHY PALLAVI RAWAT.pptxpallavirawat456

FBI Profiling - Forensic Psychology.pptxPayal Shrivastava

Environmental Acoustics- Speech interference level, acoustics calibrator.pptxpriyankatabhane

final waves properties grade 7 - third quarterHanHyoKim

Q4-Mod-1c-Quiz-Projectile-333344444.pptxtuking87

GENERAL PHYSICS 2 REFRACTION OF LIGHT SENIOR HIGH SCHOOL GENPHYS2.pptxRitchAndruAgustin

Science (Communication) and Wikipedia - Potentials and PitfallsDobusch Leonhard

KDIGO-2023-CKD-Guideline-Public-Review-Draft_5-July-2023.pdfGABYFIORELAMALPARTID1

Charateristics of the Angara-A5 spacecraft launched from the Vostochny Cosmod...Christina Parmionova

Immunoblott technique for protein detection.pptAmirRaziq1

Loudspeaker- direct radiating type and horn type.pptxpriyankatabhane

AZOTOBACTER AS BIOFERILIZER.PPTXGovt. N.P.G College of Science Raipur (C.G)

Interferons.pptx.Govt. N.P.G College of Science Raipur (C.G)

Quarter 4_Grade 8_Digestive System Structure and FunctionsCharlene Llagas

DNA isolation molecular biology practical.pptxGiDMOh

Dernier (20)

Explainable AI for distinguishing future climate change scenarios

办理麦克马斯特大学毕业证成绩单|购买加拿大文凭证书

well logging & petrophysical analysis.pptx

Replisome-Cohesin Interfacing A Molecular Perspective.pdf

Introduction of Human Body & Structure of cell.pptx

CHROMATOGRAPHY PALLAVI RAWAT.pptx

FBI Profiling - Forensic Psychology.pptx

Environmental Acoustics- Speech interference level, acoustics calibrator.pptx

final waves properties grade 7 - third quarter

Q4-Mod-1c-Quiz-Projectile-333344444.pptx

GENERAL PHYSICS 2 REFRACTION OF LIGHT SENIOR HIGH SCHOOL GENPHYS2.pptx

Science (Communication) and Wikipedia - Potentials and Pitfalls

KDIGO-2023-CKD-Guideline-Public-Review-Draft_5-July-2023.pdf

Charateristics of the Angara-A5 spacecraft launched from the Vostochny Cosmod...

Immunoblott technique for protein detection.ppt

Loudspeaker- direct radiating type and horn type.pptx

AZOTOBACTER AS BIOFERILIZER.PPTX

Interferons.pptx.

Quarter 4_Grade 8_Digestive System Structure and Functions

DNA isolation molecular biology practical.pptx

Deep Learning Object Detection Basics

1. Deep Learning based Object Detection Basics

2. Detection As Regression?

3. Detection As Regression?

4. Detection As Classification CAT? NO DOG? NO

5. Detection As Classification CAT? YES DOG? NO

6. Detection As Classification CAT? NO DOG? NO

7. Detection As Classification CAT? NO DOG? YES

8. From Classification To Detection Classification Head: ● C+1 Scores for C classes + 1 background class Localization Head: ● Class agnostic: (x,y,w,h) ● Class specific: (x,y,w,h) X C

9. From Classification To Detection ● Training ○ Crop random regions from images. ○ Scale to uniform size. ○ A region is labeled according to overlap with ground truth labeling. ○ Optimize using Stochastic Gradient Descent. ○ Handle class imbalance by resampling. ● Detection ○ Use sliding window to go over image. ○ Crop regions. ○ Scale to uniform size. ○ Apply network to all cropped images. ○ Repeat process for different image scales.

10. How To Handle So Many Detections? ● Problem: ○ Running this algorithm at many locations at many scales result with many detections. ● Solution: ○ Need somehow to suppress weaker detections.

11. Non-Maximum Suppression (NMS) ● Start with most confident detection D. ● Measure IoU with all other detections. ● Remove detections with IoU>50% with D. ● Repeat with next most confident detection.

12. From Classification To Detection ● Problem: ○ Previous method was too slow. ○ Network is applied over and over. ● Solution: ○ Sliding window is inherently efficient in the case of CNNs. ● OverFeat: Integrated Recognition, Localization and Detection using Convolutional Networks (2013) ○ Rob Fergus, Yann LeCun

13. From Classification To Detection

14. From Detection To Classification

15. From Detection To Classification

16. From Detection To Classification

17. CNNs Are Still Too Slow ● Problem: ○ Need to test many positions and scales, and use a computationally demanding classifier (CNN) ● Solution: ○ Only look at a tiny subset of possible positions. ● Rich feature hierarchies for accurate object detection and semantic segmentation (2014) ○ AKA R-CNN ○ Ross Girshick

18. Region Proposals ● Find “blobby” image regions that are likely to contain objects ● “Class-agnostic” object detector ● Look for “blob-like” regions

19. Region Proposals: Selective Search

20. Region Proposals: Many Other Choices

21. Region Proposals: Many Other Choices

22. R-CNN

23. R-CNN

24. R-CNN

25. R-CNN

26. R-CNN

27. R-CNN

28. R-CNN: Training 1. Train a classification model on a large dataset (ImageNet) 2. Fine-tune model for detection on a smaller dataset (Pascal) ○ Instead of 1000 ImageNet classes, now use 20 classes + background class. ○ Extract region proposals for all images. ○ Use positive / negative regions from detection images. ■ If proposal has >50% IoU with any ground truth → Positive example. ■ Otherwise → Negative example. ■ Batch = 32 positives + 96 negatives. 3. Train final classifiers ○ Extract region proposals for all images. ○ For each region: crop and warp to CNN size, run forward pass, save features to disk. (Requires ~200GB for Pascal dataset) ○ Train one binary SVM per class to classify region features. ○ Train one linear regression model per class to predict regression offsets.

29. R-CNN: 2014’s State Of The Art

30. Looking for brilliant researchers cv@brodmann17.com

Deep Learning Object Detection Basics

Recommandé

Recommandé

Contenu connexe

Tendances

Tendances (20)

Similaire à Deep Learning Object Detection Basics

Similaire à Deep Learning Object Detection Basics (20)

Plus de Brodmann17

Plus de Brodmann17 (6)

Dernier

Dernier (20)

Deep Learning Object Detection Basics