
Random Thoughts on Paper Implementations [KAIST 2018]

50,326 views

Published on

Post: https://carpedm20.github.io/2018/paper-implementation/

Published in: Technology


  1. Random thoughts on Paper Implementation · Taehoon Kim / carpedm20
  2. Taehoon Kim / carpedm20 /
  3. Paper Implementation
  4. Released on 2015.11.08
  5. Sukhbaatar, Sainbayar, Jason Weston, and Rob Fergus. "End-to-end memory networks." Advances in Neural Information Processing Systems. 2015.
  6. Sukhbaatar, Sainbayar, Jason Weston, and Rob Fergus. "End-to-end memory networks." Advances in Neural Information Processing Systems. 2015.
  7. Sukhbaatar, Sainbayar, Jason Weston, and Rob Fergus. "End-to-end memory networks." Advances in Neural Information Processing Systems. 2015.
  8. https://github.com/facebook/MemNN/tree/master/MemN2N-lang-model
  9. https://github.com/carpedm20/MemN2N-tensorflow
  10. Lua Torch →
  11. Lua Torch → Simple Translation
  12. Lua Torch → TensorFlow 0.5.0 Translation: https://github.com/carpedm20/MemN2N-tensorflow/blob/master/model.py
  13. 20+ implementations in 900 days: https://carpedm20.github.io/#personal-projects
  14. 13,000+
  15. Received many questions about implementation
  16. 4 random thoughts
  17. 1. What to implement 2. What to read 3. TensorFlow or PyTorch? 4. Grants and Competitions
  18. 1. What to implement
  19. Computer Vision · Natural Language Processing · Reinforcement Learning · Semi/Un-supervised Learning · …
  20. Computer Vision · Natural Language Processing · Reinforcement Learning · Semi/Un-supervised Learning · …
  21. GAN: Generative Adversarial Network
  22. Too many GANs… They are relatively easy to implement, but many variants don't differ much.
  23. Reasoning · Question Answering · Controllable Text Generation · Unsupervised Embedding · Exploration Without Reward · Neural Programmers · Neural Architecture Search · Hierarchical Representation Learning · Video Generation · Program Induction · Self-Play Learning · Adversarial Attack · Multi-agent RL · Model-based RL · Meta Learning · Learned Data Augmentation · Speech Synthesis · Music Generation
  24. Meta (Reinforcement) Learning · Transfer Learning
  25. https://blog.openai.com/evolved-policy-gradients/ · Adaptive learning from prior experience with Evolution Strategies (ES)
  26. https://blog.openai.com/evolved-policy-gradients/ · Easiest way: learning from a pretrained network. But how can we do better?
  27. https://blog.openai.com/evolved-policy-gradients/ · Neural network: f(x; θ) · Loss: (f(x; θ) − y)²
  28. https://blog.openai.com/evolved-policy-gradients/ · Neural network: f(x; θ) · Learned loss: L(x, y; φ)
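Slides 27 and 28 contrast a hand-designed squared-error loss with a loss that is itself a parameterized function L(x, y; φ). A minimal sketch of that distinction, assuming a one-parameter toy model and a quadratic form for L (both illustrative, not the EPG paper's actual architecture):

```python
def f(x, theta):
    # toy "network": a single parameter, f(x; theta) = theta * x
    return theta * x

def fixed_loss(x, y, theta):
    # hand-designed loss (slide 27): squared error (f(x; theta) - y)^2
    return (f(x, theta) - y) ** 2

def learned_loss(x, y, theta, phi):
    # learned loss (slide 28): L(x, y; phi) has its own parameters phi
    # that define the loss surface; here a quadratic form over
    # (prediction, target) as a stand-in for a small network
    pred = f(x, theta)
    w_pp, w_pt, w_tt = phi
    return w_pp * pred ** 2 + w_pt * pred * y + w_tt * y ** 2

# with phi = (1, -2, 1) the learned loss exactly recovers squared error
print(fixed_loss(2.0, 3.0, 0.5))                      # 4.0
print(learned_loss(2.0, 3.0, 0.5, (1.0, -2.0, 1.0)))  # 4.0
```

The point of learning φ is that other settings of φ can shape gradients for θ in ways a hand-designed loss cannot.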
  29. https://blog.openai.com/evolved-policy-gradients/ · θ, φ
  30. https://blog.openai.com/evolved-policy-gradients/
  31. https://blog.openai.com/evolved-policy-gradients/ · Evolved Policy Gradient vs. Policy Gradient
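The "Evolved" part of Evolved Policy Gradients refers to training the loss parameters φ with Evolution Strategies rather than backpropagation. A generic ES update step on a toy objective gives the flavor; the hyperparameters and the quadratic objective here are illustrative assumptions, not the paper's setup:

```python
import numpy as np

def es_step(phi, objective, sigma=0.1, lr=0.05, n_samples=50, rng=None):
    """One Evolution Strategies update: perturb phi with Gaussian noise,
    score each perturbation, and move phi along the score-weighted noise."""
    rng = rng if rng is not None else np.random.default_rng(0)
    eps = rng.standard_normal((n_samples, phi.size))
    scores = np.array([objective(phi + sigma * e) for e in eps])
    scores = scores - scores.mean()  # baseline subtraction for variance reduction
    grad_est = (eps * scores[:, None]).sum(axis=0) / (n_samples * sigma)
    return phi + lr * grad_est       # ascend the estimated gradient

# toy objective: maximize -(phi - 3)^2, optimum at phi = 3
objective = lambda p: -np.sum((p - 3.0) ** 2)

phi = np.zeros(1)
rng = np.random.default_rng(0)
for _ in range(300):
    phi = es_step(phi, objective, rng=rng)
print(phi)  # settles close to [3.]
```

Note that ES never differentiates the objective, which is why it can optimize φ through an entire inner RL training run.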
  32. Ganin, Yaroslav, et al. "Synthesizing Programs for Images using Reinforced Adversarial Learning." arXiv preprint arXiv:1804.01118 (2018). · Distributed RL + GAN
  33. Ganin, Yaroslav, et al. "Synthesizing Programs for Images using Reinforced Adversarial Learning." arXiv preprint arXiv:1804.01118 (2018).
  34. Ganin, Yaroslav, et al. "Synthesizing Programs for Images using Reinforced Adversarial Learning." arXiv preprint arXiv:1804.01118 (2018).
  35. https://blog.openai.com/ingredients-for-robotics-research/
  36. https://blog.openai.com/robots-that-learn/
  37. https://blog.openai.com/competitive-self-play/
  38. 2. What to Read
  39. "How to implement a paper FASTER?"
  40. Read others' code a lot
  41. Recommendations
      • NLP: End-to-End Memory Network
        https://github.com/carpedm20/MemN2N-tensorflow
      • Vision: (Fast) Style Transfer
        https://github.com/lengstrom/fast-style-transfer
      • RL: Asynchronous Methods for Deep Reinforcement Learning
        https://github.com/openai/universe-starter-agent (TBH code is dirty but there are lots of things to learn)
        https://github.com/ikostrikov/pytorch-a3c
      • Etc: Neural Turing Machine
        https://github.com/loudinthecloud/pytorch-ntm
  42. More (nerdy) recommendations
      • from tensorflow.contrib.seq2seq import Helper:
        https://github.com/keithito/tacotron/blob/master/models/helpers.py
      • tf.while_loop:
        https://github.com/melodyguan/enas/blob/master/src/ptb/ptb_enas_child.py
      • import tensorflow.contrib.graph_editor as ge:
        https://github.com/openai/gradient-checkpointing/blob/master/memory_saving_gradients.py
      • Google's production-level code example:
        https://github.com/tensorflow/tensor2tensor
      • The Annotated Transformer:
        http://nlp.seas.harvard.edu/2018/04/03/attention.html
      • Relatively safe-to-read code:
        https://github.com/tensorflow/models
  43. Two useful code snippets
  44. https://github.com/carpedm20/SPIRAL-tensorflow/blob/master/utils/train.py
  45. https://github.com/carpedm20/SPIRAL-tensorflow/blob/master/utils/train.py
  46. https://github.com/carpedm20/SPIRAL-tensorflow/blob/master/utils/train.py
  47. https://github.com/carpedm20/SPIRAL-tensorflow/blob/master/utils/image.py
  48. https://github.com/carpedm20/SPIRAL-tensorflow/blob/master/utils/image.py
  49. https://github.com/carpedm20/SPIRAL-tensorflow/blob/master/utils/image.py · 64-image batch → 8 × 8 image grid summary
  50. https://github.com/carpedm20/SPIRAL-tensorflow/blob/master/utils/image.py · 64-image batch → 8 × 8 image grid summary
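The image.py snippet the slides point to tiles a 64-image batch into a single 8 × 8 grid so it can be logged as one TensorBoard image summary. A minimal numpy sketch of that merge; the function name and the NHWC layout are assumptions here, not necessarily the repo's exact code:

```python
import numpy as np

def merge(images, grid=(8, 8)):
    """Tile a batch of images [N, H, W, C] into one grid image
    of shape [grid_h * H, grid_w * W, C]."""
    n, h, w, c = images.shape
    gh, gw = grid
    assert n == gh * gw, "batch size must fill the grid exactly"
    out = np.zeros((gh * h, gw * w, c), dtype=images.dtype)
    for idx, img in enumerate(images):
        row, col = divmod(idx, gw)  # fill the grid row by row
        out[row * h:(row + 1) * h, col * w:(col + 1) * w, :] = img
    return out

batch = np.random.rand(64, 32, 32, 3)
print(merge(batch).shape)  # (256, 256, 3)
```

Logging one tiled image instead of 64 separate summaries keeps the TensorBoard event files small and makes the whole batch visible at a glance.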
  51. Read more recent and correct code. Never read carpedm20's code (no joke: BAD examples)
  52. 3. TensorFlow or PyTorch?
  53. or
  54. or · Mom / Dad
  55. TensorFlow:
      • Dirty, long code
      • More stressful debugging
      • Faster development (1.5.0: Jan '18, 1.6.0–1.7.0: Mar '18, 1.8.0: Apr '18; possibly more reliable)
      • Easier (partial) saving and loading of models (tf.Supervisor)
      • Harder dynamic computation (tf.fold)
      • XLA, TensorBoard, TPU, tf.eager, multi-node distribution, documentation
  56. "… compiling parts of the computational graph with XLA (a TensorFlow Just-In-Time compiler) and …" Espeholt, Lasse, et al. "IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures." arXiv preprint arXiv:1802.01561 (2018).
  57. tensorboard --port 6006 --debugger_port 6064 · https://github.com/tensorflow/tensorboard/tree/master/tensorboard/plugins/debugger
  58. https://github.com/tensorflow/tensorboard/tree/master/tensorboard/plugins/debugger
  59. https://github.com/tensorflow/tensorboard/tree/master/tensorboard/plugins/debugger
  60. https://twitter.com/jordannovet/status/977645002200252416
  61. PyTorch:
      • Clean, short code
      • Less stressful debugging
      • Slower development (0.2.0: Aug '17, 0.3.0: Dec '17)
      • Dirty (partial) saving and loading of models
      • Easier dynamic computation
  62. 4. Grants and Competitions
  63. This is not about
  64. https://blog.openai.com/openai-scholars/
  65. https://blog.openai.com/openai-scholars/
  66. Open Challenges
  67. https://blog.openai.com/retro-contest/
  68. https://contest.openai.com/
  69. Workshop Challenges: NIPS, ICML, ICLR
  70. http://www.cs.mcgill.ca/~jpineau/ICLR2018-ReproducibilityChallenge.html
  71. http://www.cs.mcgill.ca/~jpineau/ICLR2018-ReproducibilityChallenge.html
  72. https://openreview.net/forum?id=r1dHXnH6- · Feedback from Authors
  73. https://arxiv.org/abs/1802.03198
  74. So,
  75. Read a paper
  76. Implement the code
  77. Seize the day :) Great opportunities await you
  78. One more thing...
  79. 2015.06 ~
  80. https://www.slideshare.net/carpedm20/ai-67616630
  81. Teaching Machines to Understand Visual Manuals via Attention Supervision for Object Assembly
  82. Teaching Machines to Understand Visual Manuals via Attention Supervision for Object Assembly (work in progress)
  83. https://www.slideshare.net/carpedm20/deview-2017-80824162
  84. https://carpedm20.github.io/tacotron/ · Person A
  85. https://carpedm20.github.io/tacotron/ · Person A
  86. https://carpedm20.github.io/tacotron/ · Person A
  87. https://carpedm20.github.io/tacotron/ · Person B
  88. https://carpedm20.github.io/tacotron/ · Person B
  89. https://carpedm20.github.io/tacotron/ · Person B
  90. http://www.devsisters.com/jobs/
  91. Thank you @carpedm20
