Kdd15 - distributed personalization

•Télécharger en tant que PPTX, PDF•

0 j'aime•904 vues

Xu Miao

Presentation slides for KDD15'

Présentations et discours publics

Aug 11st, 2015
Xu Miao, Lijun Tang, Yitong Zhou, Joel Young LinkedIn
Chun-te Chu, Microsoft
Anmol Bhasin Groupon
Distributed Personalization

Motivation
Distributed Learning
Personalization
Experiments

Common Solution
Apps Tracking ETL
DM
Delivering

Common Solution -- Cold Start
Apps Tracking ETL
DM
Delivering
minutes
hours
days
Apps
seconds
seconds
seconds

Common Solution -- Warm Start
Apps Tracking ETL
DM
Delivering
minutes
hours
days
seconds
seconds
seconds

seconds
seconds
seconds
Bring ML Closer to Users
Apps Tracking ETL
DM
Delivering
minutes
hours
days

Distributed Online Learning
▪ Definition:
– Agent presents an example
– User responses with a reward r
– Agent updates the model w

Distributed Online Learning
▪ Definition:
– Agent presents an example
– User responses with a reward r
– Agent updates the model w
▪ Challenges:
– Users’ feedback data too few
▪ Distributed Learning

Distributed Online Learning
▪ Definition:
– Agent presents an example
– User responses with a reward r
– Agent updates the models
▪ Challenges:
– Users’ feedback data too few
▪ Distributed Learning
– Everyone has different preferences
▪ Personalization

▪ Bulk Synchronous Parallel (Hadoop & Spark)
– ~ Thousands of interactions to converge
Distributed Gradient Descent

▪ Stale Synchronous Parallel [Ho and etc. 13’]
– For some users, staleness is forever
Distributed Gradient Descent
What did I do?

▪ Blessing
– It is one of the key reasons for PGDs to converge
fast
▪ Challenge
– It goes diminished, and the data comes later has
smaller and smaller impact
– Restart? Residue constant? Hard to manage
Learning Rate

Alternating Direction Method of Multipliers (ADMMs)

ADMMs -- Asynchronous Parallel
[Miao, Chu, Tang, Zhou, Young, Bhasin 15’]
timelines
V1
V1’
V1’’
t0
t1
t2

ADMMs -- Asynchronous Parallel
[Miao, Chu, Tang, Zhou, Young, Bhasin 15’]
timelines
V1
V1’
V1’’
t0
t1
t2
t3
t4
V2

V1’’
ADMMs -- Asynchronous Parallel
[Miao, Chu, Tang, Zhou, Young, Bhasin 15’]
Weighted Merge
1
1
timelines
V1
V1’
t0
t1
t2
V2 V3
t3
t4

ADMMs -- Asynchronous Parallel
[Miao, Chu, Tang, Zhou, Young, Bhasin 15’]
Master Versions
timelines

ADMMs -- Asynchronous Parallel
[Miao, Chu, Tang, Zhou, Young, Bhasin 15’]
▪ Same convergence rate as Bulk Synchronous Parallel
▪ No learning rate
– Out-of-order sequences of mini-optimizations
– Continuous Learning

Personalized Models
▪ The personalization strength:
– Allow divergence of personal models from the
consensus model
– Improve relevance
– Improve convergence (speed)

Conclusion
▪ Asynchronous ADMMs
– Continuous learning
▪ Personalized Models
– Fits users better
– Improves convergence speed

ADMMs -- Asynchronous Parallel
▪ Delay variations
– Weighted Merge (v.s. Stale Synchronous Parallel)
– Flexible to handle non-stationary distribution
▪ Crazy active users
▪ Passive important users
▪ Spammers

Recommandé

Configuration Optimization for Big Data SoftwarePooyan Jamshidi

Neural Networks and Deep Learning for PhysicistsHéloïse Nonne

Distributed machine learningStanley Wang

Scalable machine learningArnaud Rachez

Challenges in Large Scale Machine LearningSudarsun Santhiappan

UNET: Massive Scale DNN on SparkZhan Zhang

Using Gradient Descent for Optimization and LearningDr. Volkan OBAN

Distributed machine learning examplesStanley Wang

Recommandé

Configuration Optimization for Big Data SoftwarePooyan Jamshidi

Neural Networks and Deep Learning for PhysicistsHéloïse Nonne

Distributed machine learningStanley Wang

Scalable machine learningArnaud Rachez

Challenges in Large Scale Machine LearningSudarsun Santhiappan

UNET: Massive Scale DNN on SparkZhan Zhang

Using Gradient Descent for Optimization and LearningDr. Volkan OBAN

Distributed machine learning examplesStanley Wang

Challenges on Distributed Machine Learningjie cao

Josh Patterson, Advisor, Skymind – Deep learning for Industry at MLconf ATL 2016MLconf

Performance and scalability for machine learningArnaud Rachez

Machine learning pt.1: Artificial Neural Networks ® All Rights ReservedJonathan Mitchell

Using Topic Modeling to Study Everyday "Civic Talk" and Proto-political Engag...Tuukka Ylä-Anttila

Arun Rathinasabapathy, Senior Software Engineer, LexisNexis at MLconf ATL 2016MLconf

Déjà Vu: The Importance of Time and Causality in Recommender SystemsJustin Basilico

Adaptive Testing Methodology [ ATM ]Daniel Miessler

Recommender Systems from A to Z – Model TrainingCrossing Minds

Hanno Jarvet - VSM, Planning and Problem Solving - ConFuDevConFu

Artificial Intelligence at LinkedInBill Liu

kdd2015Deepak Agarwal

Crowdsourcing for Information Retrieval: From Statistics to EthicsMatthew Lease

How to Use Machine Learning as a Product Manager by Wework PMProduct School

CSC410-PresentationRahul Patidar

Machine Learning Experimentation at Sift ScienceSift Science

Past, present, and future of Recommender Systems: an industry perspectiveXavier Amatriain

Using Yammer & SharePoint Intranets to Drive Employee EngagementPerficient, Inc.

MeasureWorks - Shoppingtoday - 5 must-do's for the holiday seasonMeasureWorks

2021 Chrome Dev Summit: Web Performance 101Tammy Everts

Hanno Jarvet - The Lean Toolkit – Value Stream Mapping and Problem SolvingDevConFu

Web search-metrics-tutorial-www2010-section-1of7-introductionAli Dasdan

Contenu connexe

En vedette

Challenges on Distributed Machine Learningjie cao

Josh Patterson, Advisor, Skymind – Deep learning for Industry at MLconf ATL 2016MLconf

Performance and scalability for machine learningArnaud Rachez

Machine learning pt.1: Artificial Neural Networks ® All Rights ReservedJonathan Mitchell

Using Topic Modeling to Study Everyday "Civic Talk" and Proto-political Engag...Tuukka Ylä-Anttila

Arun Rathinasabapathy, Senior Software Engineer, LexisNexis at MLconf ATL 2016MLconf

En vedette (6)

Challenges on Distributed Machine Learning

Josh Patterson, Advisor, Skymind – Deep learning for Industry at MLconf ATL 2016

Performance and scalability for machine learning

Using Topic Modeling to Study Everyday "Civic Talk" and Proto-political Engag...

Arun Rathinasabapathy, Senior Software Engineer, LexisNexis at MLconf ATL 2016

Similaire à Kdd15 - distributed personalization

Déjà Vu: The Importance of Time and Causality in Recommender SystemsJustin Basilico

Adaptive Testing Methodology [ ATM ]Daniel Miessler

Recommender Systems from A to Z – Model TrainingCrossing Minds

Hanno Jarvet - VSM, Planning and Problem Solving - ConFuDevConFu

Artificial Intelligence at LinkedInBill Liu

kdd2015Deepak Agarwal

Crowdsourcing for Information Retrieval: From Statistics to EthicsMatthew Lease

How to Use Machine Learning as a Product Manager by Wework PMProduct School

CSC410-PresentationRahul Patidar

Machine Learning Experimentation at Sift ScienceSift Science

Past, present, and future of Recommender Systems: an industry perspectiveXavier Amatriain

Using Yammer & SharePoint Intranets to Drive Employee EngagementPerficient, Inc.

MeasureWorks - Shoppingtoday - 5 must-do's for the holiday seasonMeasureWorks

2021 Chrome Dev Summit: Web Performance 101Tammy Everts

Hanno Jarvet - The Lean Toolkit – Value Stream Mapping and Problem SolvingDevConFu

Web search-metrics-tutorial-www2010-section-1of7-introductionAli Dasdan

Tempo, Maneuverability, and InitiativeMichael Nygard

Competitive Intelligence ATSTincup & Co.

Enabling CD in Enterprises with TestingAnand Bagmar

Service pemanas air solahart hp 081313462267Service Solahart 081313462267

Similaire à Kdd15 - distributed personalization (20)

Déjà Vu: The Importance of Time and Causality in Recommender Systems

Adaptive Testing Methodology [ ATM ]

Recommender Systems from A to Z – Model Training

Hanno Jarvet - VSM, Planning and Problem Solving - ConFu

Artificial Intelligence at LinkedIn

kdd2015

Crowdsourcing for Information Retrieval: From Statistics to Ethics

How to Use Machine Learning as a Product Manager by Wework PM

CSC410-Presentation

Machine Learning Experimentation at Sift Science

Past, present, and future of Recommender Systems: an industry perspective

Using Yammer & SharePoint Intranets to Drive Employee Engagement

MeasureWorks - Shoppingtoday - 5 must-do's for the holiday season

2021 Chrome Dev Summit: Web Performance 101

Hanno Jarvet - The Lean Toolkit – Value Stream Mapping and Problem Solving

Web search-metrics-tutorial-www2010-section-1of7-introduction

Tempo, Maneuverability, and Initiative

Competitive Intelligence ATS

Enabling CD in Enterprises with Testing

Service pemanas air solahart hp 081313462267

Dernier

Event 4 Introduction to Open Source.pptxaryanv1753

Mathan flower ppt.pptx slide orchids ✨🌸mathanramanathan2005

SaaStr Workshop Wednesday w/ Kyle Norton, Owner.comsaastr

Genshin Impact PPT Template by EaTemp.pptxJohnree4

THE COUNTRY WHO SOLVED THE WORLD_HOW CHINA LAUNCHED THE CIVILIZATION REVOLUTI...漢銘謝

call girls in delhi malviya nagar @9811711561@vikas rana

Work Remotely with Confluence ACE 2.pptxmavinoikein

Presentation for the Strategic Dialogue on the Future of Agriculture, Brussel...Krijn Poppe

PHYSICS PROJECT BY MSC - NANOTECHNOLOGYpruthirajnayak525

Call Girls In Aerocity 🤳 Call Us +919599264170Escort Service

PAG-UNLAD NG EKONOMIYA na dapat isaalang alang sa pag-aaral.KathleenAnnCordero2

Genesis part 2 Isaiah Scudder 04-24-2024.pptxFamilyWorshipCenterD

The Ten Facts About People With Autism PresentationNathan Young

Anne Frank A Beacon of Hope amidst darkness ppt.pptxnoorehahmad

Simulation-based Testing of Unmanned Aerial Vehicles with AerialistSebastiano Panichella

The 3rd Intl. Workshop on NL-based Software EngineeringSebastiano Panichella

Call Girls in Rohini Delhi 💯Call Us 🔝8264348440🔝soniya singh

miladyskindiseases-200705210221 2.!!pptxCarrieButtitta

James Joyce, Dubliners and Ulysses.ppt !risocarla2016

Dutch Power - 26 maart 2024 - Henk Kras - Circular PlasticsDutch Power

Dernier (20)

Event 4 Introduction to Open Source.pptx

Mathan flower ppt.pptx slide orchids ✨🌸

SaaStr Workshop Wednesday w/ Kyle Norton, Owner.com

Genshin Impact PPT Template by EaTemp.pptx

THE COUNTRY WHO SOLVED THE WORLD_HOW CHINA LAUNCHED THE CIVILIZATION REVOLUTI...

call girls in delhi malviya nagar @9811711561@

Work Remotely with Confluence ACE 2.pptx

Presentation for the Strategic Dialogue on the Future of Agriculture, Brussel...

PHYSICS PROJECT BY MSC - NANOTECHNOLOGY

Call Girls In Aerocity 🤳 Call Us +919599264170

PAG-UNLAD NG EKONOMIYA na dapat isaalang alang sa pag-aaral.

Genesis part 2 Isaiah Scudder 04-24-2024.pptx

The Ten Facts About People With Autism Presentation

Anne Frank A Beacon of Hope amidst darkness ppt.pptx

Simulation-based Testing of Unmanned Aerial Vehicles with Aerialist

The 3rd Intl. Workshop on NL-based Software Engineering

Call Girls in Rohini Delhi 💯Call Us 🔝8264348440🔝

miladyskindiseases-200705210221 2.!!pptx

James Joyce, Dubliners and Ulysses.ppt !

Dutch Power - 26 maart 2024 - Henk Kras - Circular Plastics

Kdd15 - distributed personalization

1. Aug 11st, 2015 Xu Miao, Lijun Tang, Yitong Zhou, Joel Young LinkedIn Chun-te Chu, Microsoft Anmol Bhasin Groupon Distributed Personalization

2. Motivation Distributed Learning Personalization Experiments

3. Recommendation

4. Recommendation

5. Recommendation

6. Common Solution Apps Tracking ETL DM Delivering

7. Common Solution -- Cold Start Apps Tracking ETL DM Delivering minutes hours days Apps seconds seconds seconds

8. Common Solution -- Warm Start Apps Tracking ETL DM Delivering minutes hours days seconds seconds seconds

9. seconds seconds seconds Bring ML Closer to Users Apps Tracking ETL DM Delivering minutes hours days

10. Distributed Online Learning ▪ Definition: – Agent presents an example – User responses with a reward r – Agent updates the model w

11. Distributed Online Learning ▪ Definition: – Agent presents an example – User responses with a reward r – Agent updates the model w ▪ Challenges: – Users’ feedback data too few ▪ Distributed Learning

12. Distributed Online Learning ▪ Definition: – Agent presents an example – User responses with a reward r – Agent updates the models ▪ Challenges: – Users’ feedback data too few ▪ Distributed Learning – Everyone has different preferences ▪ Personalization

13. Motivation Distributed Learning Personalization Experiments

14. ▪ Bulk Synchronous Parallel (Hadoop & Spark) – ~ Thousands of interactions to converge Distributed Gradient Descent

15. ▪ Stale Synchronous Parallel [Ho and etc. 13’] – For some users, staleness is forever Distributed Gradient Descent What did I do?

16. ▪ Blessing – It is one of the key reasons for PGDs to converge fast ▪ Challenge – It goes diminished, and the data comes later has smaller and smaller impact – Restart? Residue constant? Hard to manage Learning Rate

17. Alternating Direction Method of Multipliers (ADMMs)

18. ADMMs -- Bulk Synchronous Parallel

19. ADMMs -- Bulk Synchronous Parallel

20. ADMMs -- Asynchronous Parallel [Miao, Chu, Tang, Zhou, Young, Bhasin 15’] timelines V1 V1’ V1’’ t0 t1 t2

21. ADMMs -- Asynchronous Parallel [Miao, Chu, Tang, Zhou, Young, Bhasin 15’] timelines V1 V1’ V1’’ t0 t1 t2 t3 t4 V2

22. V1’’ ADMMs -- Asynchronous Parallel [Miao, Chu, Tang, Zhou, Young, Bhasin 15’] Weighted Merge 1 1 timelines V1 V1’ t0 t1 t2 V2 V3 t3 t4

23. ADMMs -- Asynchronous Parallel [Miao, Chu, Tang, Zhou, Young, Bhasin 15’] Master Versions timelines

24. ADMMs -- Asynchronous Parallel [Miao, Chu, Tang, Zhou, Young, Bhasin 15’] ▪ Same convergence rate as Bulk Synchronous Parallel ▪ No learning rate – Out-of-order sequences of mini-optimizations – Continuous Learning

25. Motivation Distributed Learning Personalization Experiments

26. Personalized Models

27. Personalized Models

28. Personalized Models ▪ The personalization strength: – Allow divergence of personal models from the consensus model – Improve relevance – Improve convergence (speed)

29. Motivation Distributed Learning Personalization Experiments

30. Facial Expression Recognition

31. Facial Expression Recognition

32. Facial Expression Recognition

33. Job Recommendation

34. Job Recommendation

35. Speed

36. Conclusion ▪ Asynchronous ADMMs – Continuous learning ▪ Personalized Models – Fits users better – Improves convergence speed

37. Thank You and Questions

38. ADMMs -- Asynchronous Parallel ▪ Delay variations – Weighted Merge (v.s. Stale Synchronous Parallel) – Flexible to handle non-stationary distribution ▪ Crazy active users ▪ Passive important users ▪ Spammers