ML at Quora: How Machine Learning Powers Recommendations & Ranking

•

3 j'aime•1,719 vues

ML algorithms are crucial to Quora's mission of growing the world's knowledge. They help rank answers by quality, relevance, and other factors. They also personalize the question feed, recommend new topics and users to follow, find related questions, and detect duplicate or spam content. Models used include logistic regression, gradient boosted trees, neural networks and more. ML helps scale content moderation and predict user contributions, both key to Quora's success in optimizing for an engaging experience.

Technologie

ML @ Quora
ML Algorithms for Growing the World’s Knowledge
Seattle, 05/01/2015Xavier Amatriain (@xamat)

Our Mission
“To share and grow the world’s
knowledge”
• Millions of questions & answers
• Millions of users
• Thousands of topics
• ...

Demand
What we care about
Quality
Relevance

Ranking - Answer ranking
What is a good Quora answer?
• truthful
• reusable
• provides explanation
• well formatted
• ...

Ranking - Answer ranking
How are those dimensions translated
into features?
• Features that relate to the text
quality itself
• Interaction features
(upvotes/downvotes, clicks,
comments…)
• User features (e.g. expertise in topic)

Ranking - Feed
• Personalized learning-to-rank
approach
• Goal: Present most interesting stories
for a user at a given time
• Interesting = topical relevance +
social relevance + timeliness
• Stories = questions + answers

Ranking - Feed
• Features
• Quality of question/answer
• Topics the user is interested on/
knows about
• Users the user is following
• What is trending/popular
• …
• Different temporal windows
• Multi-stage solution with different
“streams”

Recommendations - Topics
Goal: Recommend new topics for the
user to follow
• Based on
• Other topics followed
• Users followed
• User interactions
• Topic-related features
• ...

Recommendations - Users
Goal: Recommend new users to follow
• Based on:
• Other users followed
• Topics followed
• User interactions
• User-related features
• ...

Related Questions
• Given interest in question A (source) what other
questions will be interesting?
• Not only about similarity, but also “interestingness”
• Features such as:
• Textual
• Co-visit
• Topics
• …
• Important for logged-out use case

Duplicate Questions
• Important issue for Quora
• Want to make sure we don’t disperse
knowledge to the same question
• Solution: binary classifier trained with
labelled data
• Features
• Textual vector space models
• Usage-based features
• ...

User Trust/Expertise Inference
Goal: Infer user’s trustworthiness in relation
to a given topic
• We take into account:
• Answers written on topic
• Upvotes/downvotes received
• Endorsements
• ...
• Trust/expertise propagates through the network
• Must be taken into account by other algorithms

Trending Topics
Goal: Highlight current events that are
interesting for the user
• We take into account:
• Global “Trendiness”
• Social “Trendiness”
• User’s interest
• ...
• Trending topics are a great discovery mechanism

Spam Detection/Moderation
• Very important for Quora to keep quality of
content
• Pure manual approaches do not scale
• Hard to get algorithms 100% right
• ML algorithms detect content/user issues
• Output of the algorithms feed manually
curated moderation queues

Content Creation Prediction
• Quora’s algorithms not only optimize for
probability of reading
• Important to predict probability of a user
answering a question
• Parts of our system completely rely on
that prediction
• E.g. A2A (ask to answer) suggestions

Models
● Logistic Regression
● Elastic Nets
● Gradient Boosted Decision
Trees
● Random Forests
● Neural Networks
● LambdaMART
● Matrix Factorization
● LDA
● ...

Conclusions
• At Quora we have not only Big, but also “rich” data
• Our algorithms need to understand and optimize complex aspects
such as quality, interestingness, or user expertise
• We believe ML will be one of the keys to our success
• We have many interesting problems, and many unsolved challenges

We’re Hiring!
http://www.quora.com/careers/

Recommandé

MLConf Seattle 2015 - ML@QuoraXavier Amatriain

Machine Learning for Q&A Sites: The Quora ExampleXavier Amatriain

Machine Learning to Grow the World's KnowledgeXavier Amatriain

Past present and future of Recommender Systems: an Industry PerspectiveXavier Amatriain

Recsys 2016Mindaugas Zickus

BIG2016- Lessons Learned from building real-life user-focused Big Data systemsXavier Amatriain

Michael Gage SOED 2016Colleen Ganley

Past, present, and future of Recommender Systems: an industry perspectiveXavier Amatriain

Recommandé

MLConf Seattle 2015 - ML@QuoraXavier Amatriain

Machine Learning for Q&A Sites: The Quora ExampleXavier Amatriain

Machine Learning to Grow the World's KnowledgeXavier Amatriain

Past present and future of Recommender Systems: an Industry PerspectiveXavier Amatriain

Recsys 2016Mindaugas Zickus

BIG2016- Lessons Learned from building real-life user-focused Big Data systemsXavier Amatriain

Michael Gage SOED 2016Colleen Ganley

Past, present, and future of Recommender Systems: an industry perspectiveXavier Amatriain

Recsys 2014 Tutorial - The Recommender Problem RevisitedXavier Amatriain

Qcon SF 2013 - Machine Learning & Recommender Systems @ Netflix ScaleXavier Amatriain

Kdd 2014 Tutorial - the recommender problem revisitedXavier Amatriain

Déjà Vu: The Importance of Time and Causality in Recommender SystemsJustin Basilico

MLConf - Emmys, Oscars & Machine Learning Algorithms at NetflixXavier Amatriain

Cikm 2013 - Beyond Data From User Information to Business ValueXavier Amatriain

Recsys 2016 tutorial: Lessons learned from building real-life recommender sys...Xavier Amatriain

Big & Personal: the data and the models behind Netflix recommendations by Xa...BigMine

H2O World - Quora: Machine Learning Algorithms to Grow the World's Knowledge ...Sri Ambati

Machine Learning at Quora (2/26/2016)Nikhil Dandekar

Search, Discovery and Questions at QuoraNikhil Dandekar

Engaging with Users on Public Social MediaJeffrey Nichols

The Hive Think Tank: Machine Learning at Pinterest by Jure LeskovecThe Hive

Scaling Quality on Quora Using Machine LearningVo Viet Anh

The Costs Associated with Buying an LMS (June 2017)Lambda Solutions

Social Media for Learning: A Balanced ApproachQuickLessons LLC

Designing Mobile UXFarah Nuraini

Lecture 5: How to make the Social Web Personalized? (VU Amsterdam Social Web ...Lora Aroyo

When Mobile meets UX/UI powered by Growth Hacking AsiaGrowth Hacking Asia

Towards identifying Collaborative Learning groups using Social MediaSelver Softic

Dlf 2012sherriberger

Machine Learning Applications in E-learning - Bias, Risks, and MitigationsStella Lee

Contenu connexe

Tendances