[DL輪読会]DeepSketch2Face: A Deep Learning Based Sketching System for 3D Face and Caricature Modeling

•

2 j'aime•891 vues

Deep Learning JP

2017/5/19 Deep Learning JP: http://deeplearning.jp/seminar-2/

Technologie

DeepSketch2Face: A Deep Learning
Based Sketching System for 3D Face
and Caricature Modeling
19 May, 2017
M2 杉原祥太

書誌情報
• 著者： Xiaoguang Han, Chang Gao and Yizhou Yu
• The University of Hong Kong
• Proceedings of SIGGRAPH 2017
• https://www.youtube.com/watch?v=93WZHKYxqtM
2

概要
• 顔のスケッチから3Dモデルを対話的に⽣成するシステム
• CNNと全結合層を組み合わせた
3

背景
• 顔のモデルは個⼈や表情によって多様である．
• 少ない労⼒で⽣成できたら嬉しい
• Contributions
• 3D顔モデルのための画期的なシステムの提案
• CNNとBilinear modelの組み合わせ
• 顔モデル拡張
4

関連研究
• Data-drivenなスケッチによるモデル⽣成
• ⼊れ物，⽊など(Huang et al. 2016), 建物(Nishida et al. 2016)
• インタラクティブ性，ネットワークの新規性，データの⾮公開
• Morphableな顔モデル
• CNN以外からも形状の特徴を得ているので，推定がより正確
• 3D顔カリカチュア
• 2Dスケッチから3D顔カリカチュアは本研究初
• スケッチベースのモデリング
• スケッチの線だけでなく，Deepから3D座標を推測して制約条件に
5

提案システム
• 3つのインタラクションモードがある．
• 1. Initial Sketching Mode
• 描いたスケッチがそのまま3Dモデルへ
• 2D投影と3Dモデルの形状は正確には⼀致しない
• 2. Follow-up Sketching Mode
• 線を修正していく
• 適宜スケッチとモデルを切り替えられる
• 3. Gesture-Based 3D Face Refinement
• ジェスチャーで編集
6

データベースの構築
• 3Dモデル
• Chao et al. 2014 のデータベースを拡張
• 15000個 (150⼈×表情25通り×誇張4段階)
• 2Dスケッチ
• 雛形にあらかじめ線を定義しておき，線をレンダリング
• ⼿書きスケッチを2000枚⽤意した
7

Bilinear Morphable Representation
• Cao et al. 2014のアイデア
• 顔のデータベースを3階テンソル𝑇で表現する
• 𝑇（11500頂点, 600⼈, 表情25通り）
• 𝑇を特異値分解
• 𝑇×# 𝐔×% 𝐔 = 𝐶,
• 𝐶はcore tensor,
• 𝑇 ~ 𝐶*×# 𝐔+×% 𝐔+
• 𝐶*は左上⾓を保存したcore tensor
• 𝑉 = 𝐶×# 𝑢.×% 𝑣.
• 個⼈，表情を表すベクトル𝑢, 𝑣
8

ネットワーク構成
• 上はAlexNetと⼀緒 (ReLU, Softmax)
• 𝑢, 𝑣を別々に計算したいため，FC Layersで異なるネットワークを⽤いる． 𝑢が⼤きい
• 輪郭を捉えるため，Shape-level InputでBilinear modelを使う
• Loss関数： 𝐸 =
1
2
∑ 𝑤5 𝐶5×# 𝑢. ×%−𝑔5
#
5 , 𝑔5はground truth
9

学習の流れ
• Classifier training
• Identity, expressionと⼀致するよう学習
• 𝑢 − 𝑣 regression
• 𝑢, 𝑣をLoss関数が⼩さくなるよう学習
• Final tuning
• 内挿して10000データを増やす．
• 10%をテストデータに
10

実験
• Iterations
• Classifier training : 500,000
• 𝑢 − 𝑣 regression: 800,000
• Final tuning: 500,000
• Learning rate: 0.00001, mini-batch size: 50
• Momentum: 0.9, weight decay: 0.00005
11

結果
• モデル⽣成時間
• Laplacianとの⽐較
• (b)が提案⼿法
13

結果
• 検証
• 38⼈×12問
• どちらがより⾃然でスケッチに
忠実か
14

結果
• ZBrushとの⽐較
• 10分で未経験者でもプロと似たようなモデルが作れる
15

結果
• ⼿法の⽐較
• PixelShapeCNN(提案)
• PixelCNN(CNNのみ)
• ShapeNN(2Dbilinearのみ)
• PixelCNN-Wrinckle(w/o
wrinckle)
• PixelCNNSingle (u,v同⼀の
ネットワーク）
16

Contenu connexe

Similaire à [DL輪読会]DeepSketch2Face: A Deep Learning Based Sketching System for 3D Face and Caricature Modeling

Webシステムプログラミング20170527

義広河野

勉強会用スライド

harmonylab

情報理工学院情報工学系村田研究室.pptx

tm1966

[DL輪読会]Pixel2Mesh: Generating 3D Mesh Models from Single RGB Images

Deep Learning JP

SSII2020 技術動向解説セッション SS1 6/11 (木) 14:00～14:30　メイン会場 (vimeo + sli.do) グラフ構造をもつデータに対する DNN、すなわち Graph Neural Networks (GNNs) の研究はこの２、３年で参加する研究者が急増している。現状、様々なアーキテクチャの GNN が様々なドメインや様々なタスクで個別に提案され、概観を捉えるのも簡単ではない状態になっている。本チュートリアルは、広範に散らばった GNN 研究の現状についての概観と基盤技術を紹介するとともに、時間が許す範囲でコンピュータビジョン領域における応用例の紹介にも取り組みたい。

SSII2020SS: グラフデータでも深層学習〜 Graph Neural Networks 入門〜

SSII

AIがAIを生み出す？

Daiki Tsuchiya

紹介論文 Depth Prediction Without the Sensors: Leveraging Structure for Unsupervised Learning from Monocular Videos 出典: Vincent Casser, Soeren Pirk Reza, Mahjourian, Anelia Angelova : Depth Prediction Without the Sensors: Leveraging Structure for Unsupervised Learning from Monocular Videos, the AAAI Conference on Artificial Intelligence, Vol. 33, pp. 8001-8008 (2019) 概要: カメラ映像による深度予測は、屋内及び屋外のロボットナビゲーションにとって必要なタスクです。本研究では、教師なし学習を用いて映像の深度予測とカメラのエゴモーション（自身の動き）の学習に取り組んでいます。先行研究で確立されたベースラインのモデルに、移動する個々の物体のモデル化と、オンラインでのモデルの調整を行う手法を取り入れています。結果として、物体の動きを多く含むシーンでの予測結果を大幅に向上させています。

Depth Prediction Without the Sensors: Leveraging Structure for Unsupervised L...

harmonylab

論文輪読資料「Multi-view Face Detection Using Deep Convolutional Neural Networks」

Kaoru Nasuno

三次元点群を取り扱うニューラルネットワークのサーベイ

Naoya Chiba

cvpaper.challenge のメタサーベイ発表スライドです。 cvpaper.challengeはコンピュータビジョン分野の今を映し、トレンドを創り出す挑戦です。論文サマリ作成・アイディア考案・議論・実装・論文投稿に取り組み、凡ゆる知識を共有します。2020の目標は「トップ会議30+本投稿」することです。 http://xpaperchallenge.org/cv/

Geotag Data Mining （メタサーベイ）

cvpaper. challenge

Similaire à [DL輪読会]DeepSketch2Face: A Deep Learning Based Sketching System for 3D Face and Caricature Modeling (10)

Webシステムプログラミング20170527

勉強会用スライド

情報理工学院情報工学系村田研究室.pptx

[DL輪読会]Pixel2Mesh: Generating 3D Mesh Models from Single RGB Images

SSII2020SS: グラフデータでも深層学習〜 Graph Neural Networks 入門〜

AIがAIを生み出す？

Depth Prediction Without the Sensors: Leveraging Structure for Unsupervised L...

論文輪読資料「Multi-view Face Detection Using Deep Convolutional Neural Networks」

三次元点群を取り扱うニューラルネットワークのサーベイ

Geotag Data Mining （メタサーベイ）

Plus de Deep Learning JP

【DL輪読会】AdaptDiffuser: Diffusion Models as Adaptive Self-evolving Planners

Deep Learning JP

【DL輪読会】事前学習用データセットについて

Deep Learning JP

【DL輪読会】 "Learning to render novel views from wide-baseline stereo pairs." CVP...

Deep Learning JP

【DL輪読会】Zero-Shot Dual-Lens Super-Resolution

Deep Learning JP

【DL輪読会】BloombergGPT: A Large Language Model for Finance arxiv

Deep Learning JP

【DL輪読会】マルチモーダル LLM

Deep Learning JP

【 DL輪読会】ToolLLM: Facilitating Large Language Models to Master 16000+ Real-wo...

Deep Learning JP

【DL輪読会】AnyLoc: Towards Universal Visual Place Recognition

Deep Learning JP

【DL輪読会】Can Neural Network Memorization Be Localized?

Deep Learning JP

【DL輪読会】Hopfield network　関連研究について

Deep Learning JP

【DL輪読会】SimPer: Simple self-supervised learning of periodic targets( ICLR 2023 )

Deep Learning JP

【DL輪読会】RLCD: Reinforcement Learning from Contrast Distillation for Language M...

Deep Learning JP

【DL輪読会】"Secrets of RLHF in Large Language Models Part I: PPO"

Deep Learning JP

【DL輪読会】"Language Instructed Reinforcement Learning for Human-AI Coordination "

Deep Learning JP

【DL輪読会】Llama 2: Open Foundation and Fine-Tuned Chat Models

Deep Learning JP

【DL輪読会】"Learning Fine-Grained Bimanual Manipulation with Low-Cost Hardware"

Deep Learning JP

【DL輪読会】Parameter is Not All You Need:Starting from Non-Parametric Networks fo...

Deep Learning JP

【DL輪読会】Drag Your GAN: Interactive Point-based Manipulation on the Generative ...

Deep Learning JP

【DL輪読会】Self-Supervised Learning from Images with a Joint-Embedding Predictive...

Deep Learning JP

【DL輪読会】Towards Understanding Ensemble, Knowledge Distillation and Self-Distil...

Deep Learning JP

Plus de Deep Learning JP (20)

【DL輪読会】AdaptDiffuser: Diffusion Models as Adaptive Self-evolving Planners

【DL輪読会】事前学習用データセットについて

【DL輪読会】 "Learning to render novel views from wide-baseline stereo pairs." CVP...

【DL輪読会】Zero-Shot Dual-Lens Super-Resolution

【DL輪読会】BloombergGPT: A Large Language Model for Finance arxiv

【DL輪読会】マルチモーダル LLM

【 DL輪読会】ToolLLM: Facilitating Large Language Models to Master 16000+ Real-wo...

【DL輪読会】AnyLoc: Towards Universal Visual Place Recognition

【DL輪読会】Can Neural Network Memorization Be Localized?

【DL輪読会】Hopfield network　関連研究について

【DL輪読会】SimPer: Simple self-supervised learning of periodic targets( ICLR 2023 )

【DL輪読会】RLCD: Reinforcement Learning from Contrast Distillation for Language M...

【DL輪読会】"Secrets of RLHF in Large Language Models Part I: PPO"

【DL輪読会】"Language Instructed Reinforcement Learning for Human-AI Coordination "

【DL輪読会】Llama 2: Open Foundation and Fine-Tuned Chat Models

【DL輪読会】"Learning Fine-Grained Bimanual Manipulation with Low-Cost Hardware"

【DL輪読会】Parameter is Not All You Need:Starting from Non-Parametric Networks fo...

【DL輪読会】Drag Your GAN: Interactive Point-based Manipulation on the Generative ...

【DL輪読会】Self-Supervised Learning from Images with a Joint-Embedding Predictive...

【DL輪読会】Towards Understanding Ensemble, Knowledge Distillation and Self-Distil...

Dernier

論文紹介: The Surprising Effectiveness of PPO in Cooperative Multi-Agent Games

atsushi061452

業務で生成AIを活用したい人のための生成AI入門講座（社外公開版：キンドリルジャパン社内勉強会：2024年4月発表）

Hiroshi Tomioka

LoRaWAN スマート距離検出デバイスDS20L日本語マニュアル

CRI Japan, Inc.

NewSQLの可用性構成パターン（OCHaCafe Season 8 #4 発表資料）

NTT DATA Technology & Innovation

新人研修　後半 2024/04/26の勉強会で発表されたものです。

iPride Co., Ltd.

Amazon SES を勉強してみるその２2024/04/26の勉強会で発表されたものです。

iPride Co., Ltd.

Observabilityは従来型の監視と何が違うのか（キンドリルジャパン社内勉強会：2022年10月27日発表）

Hiroshi Tomioka

LoRaWANスマート距離検出センサー DS20L カタログ LiDARデバイス

CRI Japan, Inc.

論文紹介：Video-GroundingDINO: Towards Open-Vocabulary Spatio-Temporal Video Groun...

Toru Tamaki

論文紹介：Selective Structured State-Spaces for Long-Form Video Understanding

Toru Tamaki

Amazon SES を勉強してみるその３2024/04/26の勉強会で発表されたものです。

iPride Co., Ltd.

Dernier (11)

論文紹介: The Surprising Effectiveness of PPO in Cooperative Multi-Agent Games

業務で生成AIを活用したい人のための生成AI入門講座（社外公開版：キンドリルジャパン社内勉強会：2024年4月発表）

LoRaWAN スマート距離検出デバイスDS20L日本語マニュアル

NewSQLの可用性構成パターン（OCHaCafe Season 8 #4 発表資料）

新人研修　後半 2024/04/26の勉強会で発表されたものです。

Amazon SES を勉強してみるその２2024/04/26の勉強会で発表されたものです。

Observabilityは従来型の監視と何が違うのか（キンドリルジャパン社内勉強会：2022年10月27日発表）

LoRaWANスマート距離検出センサー DS20L カタログ LiDARデバイス

論文紹介：Video-GroundingDINO: Towards Open-Vocabulary Spatio-Temporal Video Groun...

論文紹介：Selective Structured State-Spaces for Long-Form Video Understanding

Amazon SES を勉強してみるその３2024/04/26の勉強会で発表されたものです。

[DL輪読会]DeepSketch2Face: A Deep Learning Based Sketching System for 3D Face and Caricature Modeling

1. DeepSketch2Face: A Deep Learning Based Sketching System for 3D Face and Caricature Modeling 19 May, 2017 M2 杉原祥太

2. 書誌情報 • 著者： Xiaoguang Han, Chang Gao and Yizhou Yu • The University of Hong Kong • Proceedings of SIGGRAPH 2017 • https://www.youtube.com/watch?v=93WZHKYxqtM 2

3. 概要 • 顔のスケッチから3Dモデルを対話的に⽣成するシステム • CNNと全結合層を組み合わせた 3

4. 背景 • 顔のモデルは個⼈や表情によって多様である． • 少ない労⼒で⽣成できたら嬉しい • Contributions • 3D顔モデルのための画期的なシステムの提案 • CNNとBilinear modelの組み合わせ • 顔モデル拡張 4

5. 関連研究 • Data-drivenなスケッチによるモデル⽣成 • ⼊れ物，⽊など(Huang et al. 2016), 建物(Nishida et al. 2016) • インタラクティブ性，ネットワークの新規性，データの⾮公開 • Morphableな顔モデル • CNN以外からも形状の特徴を得ているので，推定がより正確 • 3D顔カリカチュア • 2Dスケッチから3D顔カリカチュアは本研究初 • スケッチベースのモデリング • スケッチの線だけでなく，Deepから3D座標を推測して制約条件に 5

6. 提案システム • 3つのインタラクションモードがある． • 1. Initial Sketching Mode • 描いたスケッチがそのまま3Dモデルへ • 2D投影と3Dモデルの形状は正確には⼀致しない • 2. Follow-up Sketching Mode • 線を修正していく • 適宜スケッチとモデルを切り替えられる • 3. Gesture-Based 3D Face Refinement • ジェスチャーで編集 6

7. データベースの構築 • 3Dモデル • Chao et al. 2014 のデータベースを拡張 • 15000個 (150⼈×表情25通り×誇張4段階) • 2Dスケッチ • 雛形にあらかじめ線を定義しておき，線をレンダリング • ⼿書きスケッチを2000枚⽤意した 7

8. Bilinear Morphable Representation • Cao et al. 2014のアイデア • 顔のデータベースを3階テンソル𝑇で表現する • 𝑇（11500頂点, 600⼈, 表情25通り） • 𝑇を特異値分解 • 𝑇×# 𝐔×% 𝐔 = 𝐶, • 𝐶はcore tensor, • 𝑇 ~ 𝐶*×# 𝐔+×% 𝐔+ • 𝐶*は左上⾓を保存したcore tensor • 𝑉 = 𝐶×# 𝑢.×% 𝑣. • 個⼈，表情を表すベクトル𝑢, 𝑣 8

9. ネットワーク構成 • 上はAlexNetと⼀緒 (ReLU, Softmax) • 𝑢, 𝑣を別々に計算したいため，FC Layersで異なるネットワークを⽤いる． 𝑢が⼤きい • 輪郭を捉えるため，Shape-level InputでBilinear modelを使う • Loss関数： 𝐸 = 1 2 ∑ 𝑤5 𝐶5×# 𝑢. ×%−𝑔5 # 5 , 𝑔5はground truth 9

10. 学習の流れ • Classifier training • Identity, expressionと⼀致するよう学習 • 𝑢 − 𝑣 regression • 𝑢, 𝑣をLoss関数が⼩さくなるよう学習 • Final tuning • 内挿して10000データを増やす． • 10%をテストデータに 10

11. 実験 • Iterations • Classifier training : 500,000 • 𝑢 − 𝑣 regression: 800,000 • Final tuning: 500,000 • Learning rate: 0.00001, mini-batch size: 50 • Momentum: 0.9, weight decay: 0.00005 11

12. 結果 12

13. 結果 • モデル⽣成時間 • Laplacianとの⽐較 • (b)が提案⼿法 13

14. 結果 • 検証 • 38⼈×12問 • どちらがより⾃然でスケッチに忠実か 14

15. 結果 • ZBrushとの⽐較 • 10分で未経験者でもプロと似たようなモデルが作れる 15

16. 結果 • ⼿法の⽐較 • PixelShapeCNN(提案) • PixelCNN(CNNのみ) • ShapeNN(2Dbilinearのみ) • PixelCNN-Wrinckle(w/o wrinckle) • PixelCNNSingle (u,v同⼀のネットワーク） 16

17. Limitations 17

[DL輪読会]DeepSketch2Face: A Deep Learning Based Sketching System for 3D Face and Caricature Modeling

Recommandé

Recommandé

Contenu connexe

Similaire à [DL輪読会]DeepSketch2Face: A Deep Learning Based Sketching System for 3D Face and Caricature Modeling

Similaire à [DL輪読会]DeepSketch2Face: A Deep Learning Based Sketching System for 3D Face and Caricature Modeling (10)

Plus de Deep Learning JP

Plus de Deep Learning JP (20)

Dernier

Dernier (11)

[DL輪読会]DeepSketch2Face: A Deep Learning Based Sketching System for 3D Face and Caricature Modeling