Personal Information
Company/Workplace
London, United Kingdom
Occupation
Data Science and Big Data
Industry
Technology / Software / Internet
About
Problem solver. Python/Hadoop coder. I have done end-to-end work in Big Data, spanning development, administration, and data science.
I have set up Hadoop clusters, built ETL pipelines in MapReduce and Spark, and worked on data science problems. I have used a variety of technologies, including Spark, Hive, Pig, HBase, and R.
I work with Big Data every day and use Hadoop's MapReduce features to solve big data problems and extract useful information from the data. I have done expert work in search quality by analyzing the millions of queries that users search every day.
Here are some data science problems I have worked on so far:
1) Understand the relationships between users wh...
Tags
newbie
pycon
python
programming
pycon2010
Presentations (2)
Documents (1)
Likes (24)
Netezza Architecture and Administration
Braja Krishna Das
•
7 years ago
Netezza Deep Dives
Rush Shah
•
7 years ago
Notes from Coursera Deep Learning courses by Andrew Ng
Tess Ferrandez
•
6 years ago
Strata NYC 2015: Sketching Big Data with Spark: randomized algorithms for large-scale data analytics
Databricks
•
8 years ago
Developing Real-Time Data Pipelines with Apache Kafka
Joe Stein
•
8 years ago
Scala - The Simple Parts, SFScala presentation
Martin Odersky
•
9 years ago
Pragmatic Real-World Scala (short version)
Jonas Bonér
•
15 years ago
Scala Data Pipelines @ Spotify
Neville Li
•
8 years ago
Recommender Systems (Machine Learning Summer School 2014 @ CMU)
Xavier Amatriain
•
9 years ago
Hive tuning
Michael Zhang
•
10 years ago
Spark SQL Deep Dive @ Melbourne Spark Meetup
Databricks
•
8 years ago
Spark Summit East 2015 Advanced Devops Student Slides
Databricks
•
9 years ago
DTCC '14 Spark Runtime Internals
Cheng Lian
•
10 years ago
Tuning and Debugging in Apache Spark
Databricks
•
9 years ago
Everyday I'm Shuffling - Tips for Writing Better Spark Programs, Strata San Jose 2015
Databricks
•
9 years ago
Why Scala Is Taking Over the Big Data World
Dean Wampler
•
9 years ago
storm at twitter
Krishna Gade
•
10 years ago
Collaborative Filtering with Spark
Chris Johnson
•
9 years ago
DataFu @ ApacheCon 2014
William Vaughan
•
10 years ago
Tiny Batches, in the wine: Shiny New Bits in Spark Streaming
Paco Nathan
•
9 years ago
Hadoop World 2011: Advanced HBase Schema Design - Lars George, Cloudera
Cloudera, Inc.
•
12 years ago
HBase schema design Big Data TechCon Boston
amansk
•
11 years ago
HBaseCon 2012 | HBase Schema Design - Ian Varley, Salesforce
Cloudera, Inc.
•
11 years ago
The 21 Coolest Internet Of Things Gadgets
Bernard Marr
•
9 years ago