In 2010, ImageNet finally ended the AI winter and gave machines the sense of sight. Within the following years, dramatic improvements in tasks such as image classification and object detection led to innovations like Face ID and autonomous driving. Recently, similar developments have taken place in the field of natural language processing: using attention mechanisms and Transformers, tasks such as question answering and text summarization have reached new benchmark results.
This talk will not only explain these techniques, but also point out how transfer learning and open-source models such as Google's BERT will open the field to new innovations in AI.
Speaker: Nico Kreiling, inovex
Event: AIxIA, 01.10.2019
More tech talks: inovex.de/vortraege
More tech articles: inovex.de/blog
Language is important... ...but really difficult
› Most information is available in either written or spoken words
› Language projects fundamental real-world concepts
› Speech is everywhere
Why is language complicated?
› Language has many syntactic relationships
  Stoiber probably thought he was making a clear point
› Language understanding often requires external knowledge
  "Pippi feeds Mr. Nilsson regularly."
› Language heavily depends on the context
  "The president has a great vision"
› Irony and sarcasm are topics of their own
How does modern NLP work?
Word-Vectors
King + Princess − Queen = Prince
[Figure: 2-D plot of the word vectors for King, Queen, Princess and Prince]
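As a concrete illustration, here is a minimal sketch of this vector arithmetic using the gensim library and pretrained GloVe vectors (the model name and the exact top result are assumptions; the ranking depends on the embedding used):

```python
# Minimal sketch of word-vector arithmetic with gensim.
# Assumes pretrained GloVe vectors from gensim's model hub;
# results vary with the embedding chosen.
import gensim.downloader as api

vectors = api.load("glove-wiki-gigaword-100")  # pretrained word vectors

# king + princess - queen  ≈  prince
result = vectors.most_similar(positive=["king", "princess"],
                              negative=["queen"], topn=3)
print(result)
```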
source: https://explosion.ai/blog/deep-learning-formula-nlp
How does modern NLP work?
Encoder / Decoder Model
› Embed: represent the text as embeddings
› Encode: enrich the embeddings with contextual information
› Attend: reduce the feature matrix to a single vector
› Predict: use the vector to decide about the output
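A minimal PyTorch sketch of this embed-encode-attend-predict pipeline might look as follows (all layer sizes and names are illustrative assumptions, not the architecture from the talk):

```python
# Sketch of the embed-encode-attend-predict pipeline in PyTorch.
import torch
import torch.nn as nn

class TextClassifier(nn.Module):
    def __init__(self, vocab_size=10000, emb_dim=100, hidden=128, n_classes=2):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb_dim)            # Embed
        self.encode = nn.LSTM(emb_dim, hidden, batch_first=True,
                              bidirectional=True)                 # Encode
        self.attend = nn.Linear(2 * hidden, 1)                    # Attend
        self.predict = nn.Linear(2 * hidden, n_classes)           # Predict

    def forward(self, token_ids):                 # (batch, seq_len)
        x = self.embed(token_ids)                 # (batch, seq, emb)
        h, _ = self.encode(x)                     # (batch, seq, 2*hidden)
        weights = torch.softmax(self.attend(h), dim=1)  # attention weights
        context = (weights * h).sum(dim=1)        # single vector per text
        return self.predict(context)              # class logits

logits = TextClassifier()(torch.randint(0, 10000, (4, 12)))
print(logits.shape)  # torch.Size([4, 2])
```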
Grammar is also important.
source: http://ruder.io/nlp-imagenet/
How does modern NLP work?
Transformer and Self-Supervised Learning
› Self-attention (see the sketch after the figure below)
› Mask some words (masked language modeling; see the example below)
› Randomly add a second sentence (next-sentence prediction)
Example: "Language is more than words."
[Figure: bar chart of attention weights (0 to 1) over the tokens "Language", "is", "more", "than", "words"]
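For the self-attention step, a minimal NumPy sketch of scaled dot-product attention, the core operation of the Transformer, could look like this (shapes and random weights are purely illustrative):

```python
# Minimal sketch of scaled dot-product self-attention.
import numpy as np

def self_attention(x, wq, wk, wv):
    """x: (seq_len, d_model); wq/wk/wv: (d_model, d_k) projections."""
    q, k, v = x @ wq, x @ wk, x @ wv
    scores = q @ k.T / np.sqrt(k.shape[-1])        # (seq_len, seq_len)
    scores -= scores.max(axis=-1, keepdims=True)   # numerical stability
    weights = np.exp(scores) / np.exp(scores).sum(axis=-1, keepdims=True)
    return weights @ v                             # contextualized vectors

rng = np.random.default_rng(0)
x = rng.normal(size=(6, 16))  # 6 tokens, e.g. "Language is more than words ."
out = self_attention(x, *(rng.normal(size=(16, 8)) for _ in range(3)))
print(out.shape)  # (6, 8)
```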
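The masked-word objective can be tried out directly with a pretrained model, e.g. via the Hugging Face transformers library (the model choice is an assumption; any BERT variant works):

```python
# Minimal sketch of BERT's masked language modeling objective.
from transformers import pipeline

fill_mask = pipeline("fill-mask", model="bert-base-uncased")
for candidate in fill_mask("Language is more than [MASK]."):
    print(candidate["token_str"], round(candidate["score"], 3))
```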
How does modern NLP work?
Big Language Models + Transfer Learning
› BERT Large: 24 layers, 340M parameters
› BERT Base (104 languages): 12 layers, 110M parameters
› Trained on 64 Tesla V100 GPUs for 79.2 hours
› Estimated training cost: $2,074–$12,571
But we can build upon this model!
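Building on a pretrained BERT via transfer learning might look roughly like this with the Hugging Face transformers library (model name, label count and training data are assumptions for illustration):

```python
# Sketch of fine-tuning a pretrained multilingual BERT for classification.
from transformers import AutoTokenizer, AutoModelForSequenceClassification
import torch

tokenizer = AutoTokenizer.from_pretrained("bert-base-multilingual-cased")
model = AutoModelForSequenceClassification.from_pretrained(
    "bert-base-multilingual-cased", num_labels=2)  # fresh classification head

batch = tokenizer(["Language is more than words."],
                  return_tensors="pt", padding=True)
loss = model(**batch, labels=torch.tensor([1])).loss
loss.backward()  # fine-tune all weights on a small task-specific dataset
```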
Other applications we investigate
› Abstractive text summarization of arbitrarily long texts using bidirectional RNNs (see https://www.inovex.de/blog/text-summarization-seq2seq-neural-networks/)
› Fine-tuning general-purpose language models for text classification in specialized language domains
› Attention-based RNNs for Q&A on database information using reinforcement learning
› Extracting key aspects from long, complicated texts (medical records, legislative texts)
› Limiting offensive speech in public communication (bulletin boards, e-mails)
› Opening up previously inaccessible data sources for easy information access (hotlines, chatbots...)