10. Distributed Online Learning
▪ Definition:
– Agent presents an example
– User responses with a reward r
– Agent updates the model w
11. Distributed Online Learning
▪ Definition:
– Agent presents an example
– User responses with a reward r
– Agent updates the model w
▪ Challenges:
– Users’ feedback data too few
▪ Distributed Learning
12. Distributed Online Learning
▪ Definition:
– Agent presents an example
– User responses with a reward r
– Agent updates the models
▪ Challenges:
– Users’ feedback data too few
▪ Distributed Learning
– Everyone has different preferences
▪ Personalization
14. ▪ Bulk Synchronous Parallel (Hadoop & Spark)
– ~ Thousands of interactions to converge
Distributed Gradient Descent
15. ▪ Stale Synchronous Parallel [Ho and etc. 13’]
– For some users, staleness is forever
Distributed Gradient Descent
What did I do?
16. ▪ Blessing
– It is one of the key reasons for PGDs to converge
fast
▪ Challenge
– It goes diminished, and the data comes later has
smaller and smaller impact
– Restart? Residue constant? Hard to manage
Learning Rate
28. Personalized Models
▪ The personalization strength:
– Allow divergence of personal models from the
consensus model
– Improve relevance
– Improve convergence (speed)