The rise of neural networks has attracted a huge community of researchers and practitioners. However, not everyone entering the field is a deep learning expert, and the colorful variety of frameworks can be confusing, especially for newcomers. In this presentation, I demystify the leading deep learning frameworks and provide a guideline for choosing the most suitable option.
6. TensorFlow
● The most popular framework! (most job postings, abundant
learning resources)
● Developed by Google
● Used in major Google products like Gmail and Translate
● The most active framework on GitHub
7. TensorFlow
● Official support for Python
● Experimental support for C++ and Go
● Community support for C# and Julia
● Rich, detailed documentation
● TensorBoard: monitoring and visualizing models
● TensorFlow Serving: serve your models in production at scale
● Built-in support for distributed training
9. TensorBoard
● Track and visualize metrics such as loss and accuracy
● Visualize the model graph (ops and layers)
● View histograms of weights, biases, or other tensors as
they change over time
● Project embeddings to a lower dimensional space
● Display images, text, and audio data
● Profile TensorFlow programs
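The metric-tracking use case above can be sketched in a few lines with the TF 2.x summary API; the log directory name and the fake loss values are placeholders:

```python
import tensorflow as tf

# Write event files to a log directory; view them with:
#   tensorboard --logdir logs/demo
writer = tf.summary.create_file_writer("logs/demo")

with writer.as_default():
    for step in range(5):
        # Log a scalar metric (a made-up decreasing "loss") per step.
        tf.summary.scalar("loss", 1.0 / (step + 1), step=step)
    writer.flush()
```

The same `tf.summary` module has `image`, `text`, `audio`, and `histogram` counterparts for the other data types listed above.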
12. TensorFlow Serving
● Designed for production environments
● TensorFlow Serving makes it easy to deploy new
algorithms and experiments, while keeping the same
server architecture and APIs.
● TensorFlow Serving provides out-of-the-box integration
with TensorFlow models, but can be easily extended to
serve other types of models and data.
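A served model is queried over TensorFlow Serving's REST API. The sketch below only builds the request payload (the model name `my_model`, host, and input values are placeholders; the actual HTTP call is shown commented out since it needs a running server):

```python
import json

# TensorFlow Serving exposes predictions at:
#   http://<host>:8501/v1/models/<model_name>:predict
payload = {"instances": [[1.0, 2.0, 3.0, 4.0]]}
body = json.dumps(payload)

# With a server running, the request would look like:
# import requests
# resp = requests.post(
#     "http://localhost:8501/v1/models/my_model:predict", data=body)
# predictions = resp.json()["predictions"]
```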
14. TensorFlow Lite
Deploy machine learning models
on mobile and IoT devices
TensorFlow Lite is an open source
deep learning framework for
on-device inference.
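Converting a trained model for on-device inference is a short step. A minimal sketch, using a tiny placeholder Keras model and the standard `TFLiteConverter`:

```python
import tensorflow as tf

# A tiny placeholder model to convert; in practice you would
# convert your trained model.
model = tf.keras.Sequential([
    tf.keras.Input(shape=(8,)),
    tf.keras.layers.Dense(4, activation="relu"),
    tf.keras.layers.Dense(1),
])

# Convert to the TensorFlow Lite flat-buffer format.
converter = tf.lite.TFLiteConverter.from_keras_model(model)
tflite_model = converter.convert()

# The resulting bytes are what ships to the mobile/IoT device.
with open("model.tflite", "wb") as f:
    f.write(tflite_model)
```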
16. Cons of TensorFlow
● Pretty low-level
● Requires a ton of boilerplate code
● By default, creates static computation graphs at compile
time
● However, with eager execution, dynamic computation
graphs are available
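The static-vs-dynamic distinction in the last two bullets can be illustrated with `tf.function`, which traces eager Python code back into a static graph (the function and values here are made up for illustration):

```python
import tensorflow as tf

# Eager execution (the TF 2.x default): ops run immediately,
# like ordinary Python.
x = tf.constant([1.0, 2.0, 3.0])
total = tf.reduce_sum(x)  # a concrete tensor holding 6.0

# Wrapping a function in tf.function traces it into a static
# graph, recovering graph-mode performance.
@tf.function
def scaled_sum(t, factor):
    return tf.reduce_sum(t) * factor

result = scaled_sum(x, 2.0)
```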
18. PyTorch
● Created by Facebook
● A native define-by-run framework
● Used by Stanford NLP Group, Twitter, and Salesforce
19. PyTorch
● Feels more like traditional imperative programming
● Unlike TensorFlow's build-then-run model, PyTorch creates the
computational graph on each iteration; after the iteration, the
memory is freed
● Great for debugging: pdb, PyCharm, and other common Python
debuggers
● Mobile and large scale production via Caffe2
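Define-by-run means the graph is rebuilt as the code executes, so ordinary Python control flow can change the graph from one iteration to the next. A minimal sketch with made-up values:

```python
import torch

# The graph is built on the fly as operations execute, so a
# data-dependent branch shapes this iteration's graph.
x = torch.randn(3, requires_grad=True)
if x.sum() > 0:
    y = x.sum()
else:
    y = (x * 2).sum()

# Backprop flows through whichever branch actually ran.
y.backward()
grads = x.grad
```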
21. Cons of PyTorch
● Lack of model serving
● Lack of native monitoring and visualization
● Can be connected to TensorBoard
23. Sonnet
● Created by DeepMind
● Built on top of TensorFlow
● Designed to provide simple, composable abstractions for
machine learning research
● Creates native Python objects, then attaches them to the
computation graph, which makes modularity easier
25. Keras
● A minimalist Python-based library
● The best learning tool for beginners
● Can be run on top of TensorFlow and Microsoft CNTK
● Supports a wide range of neural network types
● Suitable for prototyping
● Very readable
● Built-in support for multiple GPUs
● Models can be converted to TensorFlow and trained on
Google Cloud
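Keras's readability is best shown by example. A minimal binary classifier, from definition to training, on random placeholder data (layer sizes and epochs are arbitrary):

```python
import numpy as np
import tensorflow as tf

# Define, compile, and train a small model in a few readable lines.
model = tf.keras.Sequential([
    tf.keras.Input(shape=(16,)),
    tf.keras.layers.Dense(32, activation="relu"),
    tf.keras.layers.Dense(1, activation="sigmoid"),
])
model.compile(optimizer="adam", loss="binary_crossentropy",
              metrics=["accuracy"])

# Random placeholder data, just to show the fit() call.
X = np.random.rand(64, 16).astype("float32")
y = np.random.randint(0, 2, size=(64, 1))
history = model.fit(X, y, epochs=2, batch_size=16, verbose=0)
```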
26. Cons of Keras
● Very high-level -> not that customizable
● Constrained to TensorFlow and CNTK
● Less functionality than lower-level libraries
28. MXNet
● Amazon’s framework
● Adopted by AWS
● Native support for a huge variety of programming
languages
● Designed to scale linearly with the number of processors
● High-performance imperative API
● Has the simplicity of Keras and the dynamism of PyTorch
● Allows hybridization: declarative like TensorFlow, imperative
like PyTorch; Gluon lets you switch between the two
32. CNTK
● Developed by Microsoft
● Supports Python, C++, C#, Java
● Support for CNNs and RNNs
● Used in Skype, Xbox, and Cortana
● Easily develop products for speech and image problems
● Integrates with Apache Spark
● Very good integration with Azure
● Handles sequential data, such as sentences, better than other
frameworks
33. Cons of CNTK
● No conventional open source licence
● Community consists mostly of Windows developers
35. Chainer
● Created by Preferred Networks, a Japanese startup
● Like PyTorch, has a define-by-run imperative API
● But difficult to debug
● Community is relatively small
● Supported by giants like IBM and Intel
● Runs on multiple GPUs with little effort
● Main use cases: speech recognition, machine translation,
sentiment analysis
38. DL4J
● Mainly for Java and Scala
● Supports a huge variety of neural networks
● Is made for enterprise scale
● Works with Apache Hadoop and Spark on distributed
CPUs and GPUs
● Great documentation
● Good for developing a full-stack Java pipeline, including
Android devices
39. Cons of DL4J
● Java is not popular for machine learning projects
● Hard to integrate with other machine learning libraries
41. Core ML
● Not a framework to build models
● Helps bring existing models created with other
frameworks to Apple devices
43. ONNX
● The open ecosystem for interchangeable AI models
● Created in a partnership between Microsoft and
Facebook
● Train a model in one tool stack and deploy it with
another
44. Factors to consider
● Model architecture
● Programming language
● Training device
● Target device
● Scale of the product
● Project deadline
45. Conclusion
● A beginner -> Keras
● Production on Google Cloud -> TensorFlow
● A researcher -> PyTorch but also try Sonnet
● Production on AWS -> MXNet
● Production on Azure -> CNTK
● A Java developer -> DL4J
● Some spare time -> Chainer
● Need to interpret models between frameworks -> ONNX
● An iOS developer -> Core ML