What does it take to win the Kaggle/Yandex competition

•Télécharger en tant que PPTX, PDF•

6 j'aime•4,614 vues

Kenji LEFEVRE

A feedback on how we won Kaggle/Yandex competition

Technologie Business

WHAT DOES IT TAKE TO WIN THE
KAGGLE/YANDEX COMPETITION

Christophe Bourguignat
Kenji Lefèvre-Hasegawa
Paul Masurel @Dataiku
Matthieu Scordia @Dataiku

OUTLINE OF THE TALK

• Review of the Kaggle/Yandex Challenge
• How we worked (team work & tools)
• The winning model

GOAL Re-rank URLs returned by Yandex according to
the personal preferences of the users
url1

url3

url2

url2
GOAL

url3

url1

url4

url4

ML CHALLENGE Predict user’s pertinence

for urls and rerank result set accordingly
The Kaggle/Yandex challenge

GIVEN
• 30 days logs test: 3 days, train: 27 days
• Users historic queries, clicks & dwell-times
Q

Q

Q

Q

• Test session prior activity queries, clicks & dwell-times
Test session :

SIZE
• 15Gb size
The Kaggle/Yandex challenge

Q

Q

T

?

QUALITY METRIC
• One query test / user on the last 3 days
• NDCG metric penalize error of pertinence on top ranked
urls

• No A/B test
url1
url2

OK

BAD

url4
url3

Kaggle
The Kaggle/Yandex challenge

Prediction

Another ranking

TEAM DATAIKU SCIENCE STUDIO / KAGGLE

•
•
•
•

Christophe Bourguignat Engineer, Data enthusiastic
Kenji Lefèvre-Hasegawa Ph.D. math, new to ML
Paul Masurel Software Engineer @dataiku
Matthieu Scordia Data Scientist @dataiku
First meeting : October16th 2013

How we worked (Team work & tools)

WE’VE USED
•
•
•
•
•

Related papers (mainly Microsoft’s)
12 core, 64 Gb
Python scikit-learn
Dataiku Science Studio
Java Ranklib

How we worked (Team work & tools)

DATAIKU SCIENCE STUDIO
Features & labels

Features

Labels

Split train & validation

Original train

LEARNING
Team members
work independantly

FEATURES CONSTRUCTION
Team members work
independantly
DATA DRIVEN
COMPUTATION
How we worked (Team work & tools)

HOW MUCH WORK ?
• 960+ emails
• 360+ features
• 50+ ideas grid tuned (300+ models fitted)
• Server heavily loaded the last 3 weeks
• 56 kaggle submissions
• 196 teams, 264 players, 3570 submissions

How we worked (Team work & tools)

2014-01-01

1st

Future top 2 & 3
enter race

1 week

3rd

1 week

1st

5th

Top 10

Top 25

1/2 month

1 week

PROBLEM ANALYSIS
Query

Result Set
• Rank
• URL Snippet Quality
• URL is skipped, clicked or missed

CLICK
Reading URL
• URL & Domain pertinence with dwell-time

The winning model

FEATURES
Features :
• Rank
• User habits, query specificity (entropy, frequency,…)
• Snippet pertinence
• Missed, Skipped, Clicked
• URL & Domain Pertinence
Declinaison of
& Clicked
• Probability, Stimuli freq., Mean Reciprocal Rank (MRR)
• For each user : historic & previous activity in test session &
aggregate
• For all user
• Declined for all queries & same query
The winning model

MODELS
• Random Forest (predict proba)
+ maximize E(NDCG)
Kaggle/Yandex Top 1
then 3rd

• Lambda MART (Gradient Boosting Tree
optimized for NDCG) WINS !
The winning model

Recommandé

Ben Hamner, Co-founder and CTO, Kaggle at MLconf SF - 11/13/15MLconf

Kaggle presentationHJ van Veen

Kaggle Winning Solution Xgboost algorithm -- Let us learn from its authorVivian S. Zhang

Tips for data science competitionsOwen Zhang

Dataiku at SF DataMining Meetup - Kaggle Yandex ChallengeDataiku

Personalized Re-Ranking of Documentskswapna9

Quality Jam 2017: Jesse Reed & Kyle McMeekin "Test Case Management & Explorat...QASymphony

Florian Douetteau @ DataikuPAPIs.io

Recommandé

Ben Hamner, Co-founder and CTO, Kaggle at MLconf SF - 11/13/15MLconf

Kaggle presentationHJ van Veen

Kaggle Winning Solution Xgboost algorithm -- Let us learn from its authorVivian S. Zhang

Tips for data science competitionsOwen Zhang

Dataiku at SF DataMining Meetup - Kaggle Yandex ChallengeDataiku

Personalized Re-Ranking of Documentskswapna9

Quality Jam 2017: Jesse Reed & Kyle McMeekin "Test Case Management & Explorat...QASymphony

Florian Douetteau @ DataikuPAPIs.io

Emptying Your Cup an Agile Primer Todd Shelton

Agile testing experimentsBaiju Joseph

SenchaCon 2016: Creating a Flexible and Usable Industry Specific Solution - D...Sencha

Continuous Delivery without Significant Test AutomationMaaret Pyhäjärvi

My Experiments In Agile Testing in Yahoo.pptxBaiju Joseph

Play with KaggleMatthieu Scordia

Moving from fast to solr on atglucenerevolution

CommerceSearch: Moving from FAST to Solr on ATGlucenerevolution

Agile Testingdanielbilling

20121213 qa introduction smileryangnetdbncku

User Centered Agile Development at NASA - One Groups Path to Better SoftwareBalanced Team

User centered agile dev balanced team 2013Jay Trimble

Agile by KDKarl Dickman

Journey of Migrating Millions of Queries on The Cloudtakezoe

SCQAA-SF Meeting on May 21 2014 Sujit Ghosh

ResumeAkansha Khanna

Scalability and performance for e commerceEcommerce Solution Provider SysIQ

Effective ScrumSándor Zolta Székely Sipos

Java Certification by HUJAK - 2015-05-12 - at JavaCro'15 conferenceHUJAK - Hrvatska udruga Java korisnika / Croatian Java User Association

"ML in Production",Oleksandr BaganFwdays

Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3

Contenu connexe

Similaire à What does it take to win the Kaggle/Yandex competition

Emptying Your Cup an Agile Primer Todd Shelton

Agile testing experimentsBaiju Joseph

SenchaCon 2016: Creating a Flexible and Usable Industry Specific Solution - D...Sencha

Continuous Delivery without Significant Test AutomationMaaret Pyhäjärvi

My Experiments In Agile Testing in Yahoo.pptxBaiju Joseph

Play with KaggleMatthieu Scordia

Moving from fast to solr on atglucenerevolution

CommerceSearch: Moving from FAST to Solr on ATGlucenerevolution

Agile Testingdanielbilling

20121213 qa introduction smileryangnetdbncku

User Centered Agile Development at NASA - One Groups Path to Better SoftwareBalanced Team

User centered agile dev balanced team 2013Jay Trimble

Agile by KDKarl Dickman

Journey of Migrating Millions of Queries on The Cloudtakezoe

SCQAA-SF Meeting on May 21 2014 Sujit Ghosh

ResumeAkansha Khanna

Scalability and performance for e commerceEcommerce Solution Provider SysIQ

Effective ScrumSándor Zolta Székely Sipos

Java Certification by HUJAK - 2015-05-12 - at JavaCro'15 conferenceHUJAK - Hrvatska udruga Java korisnika / Croatian Java User Association

Similaire à What does it take to win the Kaggle/Yandex competition (20)

Emptying Your Cup an Agile Primer

Agile testing experiments

SenchaCon 2016: Creating a Flexible and Usable Industry Specific Solution - D...

Continuous Delivery without Significant Test Automation

My Experiments In Agile Testing in Yahoo.pptx

Play with Kaggle

Moving from fast to solr on atg

CommerceSearch: Moving from FAST to Solr on ATG

Agile Testing

20121213 qa introduction smileryang

User Centered Agile Development at NASA - One Groups Path to Better Software

User centered agile dev balanced team 2013

Agile by KD

Journey of Migrating Millions of Queries on The Cloud

SCQAA-SF Meeting on May 21 2014

Resume

Scalability and performance for e commerce

Effective Scrum

Java Certification by HUJAK - 2015-05-12 - at JavaCro'15 conference

Dernier

"ML in Production",Oleksandr BaganFwdays

Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3

SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal

Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation

From Family Reminiscence to Scholarly Archive .Alan Dix

DevEX - reference for building teams, processes, and platformsSergiu Bodiu

New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada

Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada

Advanced Computer Architecture – An IntroductionDilum Bandara

Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos

Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed

The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3

The State of Passkeys with FIDO Alliance.pptxLoriGlavin3

Take control of your SAP testing with UiPath Test SuiteDianaGray10

A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3

What is DBT - The Ultimate Data Build Tool.pdfMounikaPolabathina

TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc

SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero

Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited

Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm

Dernier (20)

"ML in Production",Oleksandr Bagan

Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx

SAP Build Work Zone - Overview L2-L3.pptx

Connect Wave/ connectwave Pitch Deck Presentation

From Family Reminiscence to Scholarly Archive .

DevEX - reference for building teams, processes, and platforms

New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024

Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024

Advanced Computer Architecture – An Introduction

Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)

Scanning the Internet for External Cloud Exposures via SSL Certs

The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx

The State of Passkeys with FIDO Alliance.pptx

Take control of your SAP testing with UiPath Test Suite

A Deep Dive on Passkeys: FIDO Paris Seminar.pptx

What is DBT - The Ultimate Data Build Tool.pdf

TrustArc Webinar - How to Build Consumer Trust Through Data Privacy

SIP trunking in Janus @ Kamailio World 2024

Ensuring Technical Readiness For Copilot in Microsoft 365

Streamlining Python Development: A Guide to a Modern Project Setup

What does it take to win the Kaggle/Yandex competition

1. WHAT DOES IT TAKE TO WIN THE KAGGLE/YANDEX COMPETITION Christophe Bourguignat Kenji Lefèvre-Hasegawa Paul Masurel @Dataiku Matthieu Scordia @Dataiku

2. OUTLINE OF THE TALK • Review of the Kaggle/Yandex Challenge • How we worked (team work & tools) • The winning model

3. GOAL Re-rank URLs returned by Yandex according to the personal preferences of the users url1 url3 url2 url2 GOAL url3 url1 url4 url4 ML CHALLENGE Predict user’s pertinence for urls and rerank result set accordingly The Kaggle/Yandex challenge

4. GIVEN • 30 days logs test: 3 days, train: 27 days • Users historic queries, clicks & dwell-times Q Q Q Q • Test session prior activity queries, clicks & dwell-times Test session : SIZE • 15Gb size The Kaggle/Yandex challenge Q Q T ?

5. QUALITY METRIC • One query test / user on the last 3 days • NDCG metric penalize error of pertinence on top ranked urls • No A/B test url1 url2 OK BAD url4 url3 Kaggle The Kaggle/Yandex challenge Prediction Another ranking

6. TEAM DATAIKU SCIENCE STUDIO / KAGGLE • • • • Christophe Bourguignat Engineer, Data enthusiastic Kenji Lefèvre-Hasegawa Ph.D. math, new to ML Paul Masurel Software Engineer @dataiku Matthieu Scordia Data Scientist @dataiku First meeting : October16th 2013 How we worked (Team work & tools)

7. WE’VE USED • • • • • Related papers (mainly Microsoft’s) 12 core, 64 Gb Python scikit-learn Dataiku Science Studio Java Ranklib How we worked (Team work & tools)

8. DATAIKU SCIENCE STUDIO Features & labels Features Labels Split train & validation Original train LEARNING Team members work independantly FEATURES CONSTRUCTION Team members work independantly DATA DRIVEN COMPUTATION How we worked (Team work & tools)

9. HOW MUCH WORK ? • 960+ emails • 360+ features • 50+ ideas grid tuned (300+ models fitted) • Server heavily loaded the last 3 weeks • 56 kaggle submissions • 196 teams, 264 players, 3570 submissions How we worked (Team work & tools) 2014-01-01 1st Future top 2 & 3 enter race 1 week 3rd 1 week 1st 5th Top 10 Top 25 1/2 month 1 week

10. PROBLEM ANALYSIS Query Result Set • Rank • URL Snippet Quality • URL is skipped, clicked or missed CLICK Reading URL • URL & Domain pertinence with dwell-time The winning model

11. FEATURES Features : • Rank • User habits, query specificity (entropy, frequency,…) • Snippet pertinence • Missed, Skipped, Clicked • URL & Domain Pertinence Declinaison of & Clicked • Probability, Stimuli freq., Mean Reciprocal Rank (MRR) • For each user : historic & previous activity in test session & aggregate • For all user • Declined for all queries & same query The winning model

12. MODELS • Random Forest (predict proba) + maximize E(NDCG) Kaggle/Yandex Top 1 then 3rd • Lambda MART (Gradient Boosting Tree optimized for NDCG) WINS ! The winning model

13. QUESTIONS ? Thank you !