Federated Learning: Budowanie modeli uczenia maszynowego bez wglądu w rozproszone dane

Federated Learning: Budowanie modeli
uczenia maszynowego bez wglądu w
rozproszone dane
Autorzy:
Rafał Wojdan
AI & NLP Day 2020 1

Scentralizowane uczenie (Centralised learning)
2
https://ekkono.ai/wp-content/uploads/2020/06/SWP_Federated_Lear
ning_Ekkono_Solutions_May_2020.pdf

Rośnie rynek IoT i Mobile
3
Wyzwania dla scentralizowanego
uczenia
● Łączność (Connectivity)
● Przepustowość łącza (Bandwidth)
● Opóźnienie (Latency)
● Prywatność (Privacy)

Zdecentralizowane uczenie (decentralised
learning)
4
https://ekkono.ai/wp-content/uploads/2020/06/SWP_Federated_Lear
ning_Ekkono_Solutions_May_2020.pdf
Brak
generalizacji

Federated Learning (Od 2019)
5
https://medium.com/accenture-the-dock/instilling-responsible-and-reliable-ai-development-with-federated-learning-d23c366c5efd

FL - korzyści
6
Mniejsze wymagania sprzętowe:
● Łączność (Connectivity)
● Przepustowość łącza (Bandwidth)
Predykcja w czasie rzeczywistym
● Opóźnienie (Latency)
Uczenie bez wglądu do danych
● Prywatność (Privacy)
Globalny model
● Generalizacja

Algorytmy update modelu w FL
7
● Federated Averaging
(FedAvg)
● Federated Stochastic
Gradient Descent
(FedSGD)
● Federated Learning
with Matched
Averaging (FedMA)
https://arxiv.org/pdf/1602.05629.pdf

Uśrednianie modelu (FedAvg) vs uśrednianie
gradientu (FedSGD)
8
Porównanie:
● FedSGD gwarantuje zbieżność
● FedAvg lżejszy komunikacyjnie (mniej
update’ów modelu globalnego)
Przykład CIFAR-10
Liczba update’ów dająca 82% dokładności
(Accuracy):
● FedSGD 6 600
● FedAvg 630
FedAvg 10x mniej rund niż
FedSGD

Nowe hiperparametry
9
Hiperparametry:
C - część klientów, którzy uczestniczą w uczeniu w każdej rundzie
E - liczba iteracji treningowych u każdego klienta na lokalnym zbiorze
B - rozmiar lokalnego mini-batcha
Więcej: https://arxiv.org/pdf/1602.05629.pdf

Wyzwania techniczne dla Federated Learning
10
● Komunikacja
○ Liczba aktualizacji modelu globalnego
○ Przesłanie nowego modelu globalnego na urządzenia lokalne
● Zmienny udział lokalnych urządzeń
○ Część urządzeń dostępnych do update’u
○ Ograniczenia sprzętowe lokalnych urządzeń
○ Odłączenie od sieci FL
● Ochrona prywatności

Zabezpieczone agregowanie
11
https://federated.withgoogle.com/#about

Wyzwania MLowe dla Federated Learning
12
● Inference attack
● Model poisoning
● Prywatność danych (Memorization)
● Heterogeniczność danych

Model poisoning
13
https://medium.com/datadriveninvestor/an-overview-of-federated-learning-8a1a62b0600d

GBoard
14
Wzrost o 26%
Poprawa CTR
o 10%
GBoard - predykcja następnego
słowa

Memorization
15
https://www.pdl.cmu.edu/SDI/2019/slides/2019-09
-05Federated%20Learning.pdf
A co jeśli model
językowy będzie za
dobry?

Differential privacy
16
https://medium.com/georgian-impact-blog/a-brief-introduction-t
o-differential-privacy-eacf8722283b
https://www.accessnow.org/understanding-differential-privacy-
matters-digital-rights/
Kwerenda: Liczba bad credit rating

Różne architektury i business case’y
17
https://www.arxiv-vanity.com/papers/1902.04885/

Problem dobry do zastosowania Federated
Learning
18
1) Dane ze źródła lepsza niż dane
przybliżone (proxy data) dostępne
centralnie
2) Wymóg prywatności
3) Dostępność etykiet - są
generowane w ramach procesu np.
przez użytkowników
Your problem and FL have liked each other
PROBLEM FEDERATED
LEARNING

Czemu korzystamy z FL w Sotrender?
19
Nasze wyzwania:
● Zróżnicowanie rozkładów w czasie
● Zróżnicowanie rozkładów pomiędzy klientami
● Zróżnicowany rozmiar danych
● Małe dane per klient
Separacja danych FB
Nie korzystamy z danych mobile.
Nie korzystamy z danych edge device.
Ale…
Jak szpitale musimy zachować
prywatność danych klientów.

Rozwiązania open source
20
● Xgboost https://github.com/mc2-project/secure-xgboost
● TFF - Tensorflow Federated
● PySyft dla PyTorcha
● IBM differential privacy
https://github.com/IBM/differential-privacy-library

Problem
22
https://www.pdl.cmu.edu/SDI/2019/slides/2019-09-05Federated%20Learning.pdf Coraz
więcej danych on the edge, rośnie prywatność. Ale co z analizami i ML.
Trochę historii 2014 -2019, trzy rozwiązania

Różnice między tradycyjnym rozproszonym
uczeniem, a Federated Learning
23
https://www.pdl.cmu.edu/SDI/2019/slides/2019-09-05Fede
rated%20Learning.pdf

Korzyści
24
Benefits
Here are some primary benefits of federated machine learning:
● FL enables devices like mobile phones to collaboratively learn a shared prediction model while keeping the training
data on the device instead of requiring the data to be uploaded and stored on a central server.
● Moves model training to the edge, namely devices such as smartphones, tablets, IoT, or even “organizations” like
hospitals that are required to operate under strict privacy constraints. Having personal data remain local is a strong
security benefit.
● Makes real-time prediction possible, since prediction happens on the device itself. FL reduces the time lag that
occurs due to transmitting raw data back to a central server and then shipping the results back to the device.
● Since the models reside on the device, the prediction process works even when there is no internet connectivity.
● FL reduces the amount of hardware infrastructure required. FL uses minimal hardware and what is available in
mobile devices is more than enough to run the FL models.

Wyzwania dla Federated Learning
25
Challenges
● There are a number of core challenges associated with FL. First, communication is a critical bottleneck in FL networks
where data generated on each device remain local. In order to train a model using data generated by the devices in the
network, it is necessary to develop communication-efficient methods that reduce the total number of communication
rounds, and also iteratively send small model updates as part of the training process, as opposed to sending the entire
data set.
● Additionally, FL methods must: anticipate low levels of device participation, i.e. only a small fraction of the devices being
active at once; tolerate variability in hardware that affects storage, computational, and communication capabilities of
each device in a federated network; and be able to handle dropped devices in the network.
● Finally, FL helps to protect data generated on a device by sharing model updates such as gradient data instead of raw
data. But communicating model updates throughout the training process can still reveal sensitive information, either to a
third party, or to the central server.

Model memorization
26
Understanding and mitigating the risks of model memorization is an active area of research. Techniques to measure memorization
are explored, e.g. in the 2018 paper The Secret Sharer: Measuring Unintended Neural Network Memorization & Extracting Secrets.
Memorization risk can be mitigated by pre-ﬁltering rare or sensitive information before training. More sophisticated mitigation
techniques include differentially private model training as explored, for example, in the 2018 paper Learning Differentially Private
Recurrent Language Models, which shows how to learn model weights that are not too dependent on any one device’s data. For more
information on differential privacy, the canonical textbook “The Algorithmic Foundations of Differential Privacy” by Cynthia Dwork and
Aaron Roth is available from NOW publishers and online.
https://federated.withgoogle.com/#about

Wyzwania dla Federated Learning
27
Tutaj są 3 fajne: ML, komunikacja i privacy
https://medium.com/accenture-the-dock/instilling-responsi
ble-and-reliable-ai-development-with-federated-learning-d
23c366c5efd
● Wyzwania dla FL https://medium.com/datadriveninvestor/an-overview-of-federated-learning-8a1a62b0600d
1. Inference attack - pozyskanie informacji o specyficznych userach -> rozwiązanie to differential privacy
https://medium.com/georgian-impact-blog/a-brief-introduction-to-differential-privacy-eacf8722283b
2. Model poisoning

Wyzwania ML dla Federated Learning
28
1. Problemy ML - moim zdaniem
1. Rozkład danych
2. Ile epoch uczenia lokalnie
3. Ogólnie hiperparameters tuning
4. Wagi udziału każdego klienta
https://www.pdl.cmu.edu/SDI/2019/slides/2019-09-05Federated%20Learning.pdf kilka ciekawych przykładów nie tylko ML

Differential privacy
29
https://towardsdatascience.com/ai-differential-privacy-and-
federated-learning-523146d46b85
Super przykład z kartą kredytową
https://www.pdl.cmu.edu/SDI/2019/slides/2019-09-05Fede
rated%20Learning.pdf

Przykłady użycia
30
https://www.pdl.cmu.edu/SDI/2019/
slides/2019-09-05Federated%20Le
arning.pdf dużo fajnych z Google’a

Federated Learning: Budowanie modeli uczenia maszynowego bez wglądu w rozproszone dane

Recommandé

Recommandé

Contenu connexe

Similaire à Federated Learning: Budowanie modeli uczenia maszynowego bez wglądu w rozproszone dane

Similaire à Federated Learning: Budowanie modeli uczenia maszynowego bez wglądu w rozproszone dane (20)

Plus de Sotrender

Plus de Sotrender (20)

Federated Learning: Budowanie modeli uczenia maszynowego bez wglądu w rozproszone dane