Personal Information
Entreprise/Lieu de travail
San Francisco Bay Area United States
Secteur d’activité
Electronics / Computer Hardware
À propos
I've built data pipeline using Apache Spark, Hadoop & scikit-learn & I've done data munging - cleaning up of the data for processing, feature engineering as well as creating ML model with the clean & transformed data. I've solved ML both supervised & unsupervised ML. using Python, Scala & R.
Additionally, I've done sentiment analysis, text analysis, & ML projects.
As I was the only engineer in my team & picked up required technology like Hadoop, Apache Spark on my own & evaluated Python, R & Scala programming language.
Automate the Sales Credit Allocation for the sales transaction. Nearly 5% of X million sales transactions need to be manually allocated to the right Sales Account Team f...
Mots-clés
collaborative computing
hadoop
networking
apache mahout
association rule
clustering
data mining
Tout plus
Présentations
(8)J’aime
(8)Frustration-Reduced PySpark: Data engineering with DataFrames
Ilya Ganelin
•
il y a 8 ans
sparklyr - Jeff Allen
Sri Ambati
•
il y a 7 ans
A lightweight browser start page - 3x3 Links
Federico Elles
•
il y a 15 ans
The Secret Sauce of Successful Teams
Sven Peters
•
il y a 7 ans
Web Services Testing
Vladimir Soghoyan
•
il y a 10 ans
Network Intrusion Detection Analysis using Random Forest Algorithm on Apache Mahout
Cisco
•
il y a 9 ans
Clustering and Association Rule
Cisco
•
il y a 9 ans
Time Series Forecasting for Google Inc. and Break-even analysis for Google glass.
Cisco
•
il y a 9 ans
Personal Information
Entreprise/Lieu de travail
San Francisco Bay Area United States
Secteur d’activité
Electronics / Computer Hardware
À propos
I've built data pipeline using Apache Spark, Hadoop & scikit-learn & I've done data munging - cleaning up of the data for processing, feature engineering as well as creating ML model with the clean & transformed data. I've solved ML both supervised & unsupervised ML. using Python, Scala & R.
Additionally, I've done sentiment analysis, text analysis, & ML projects.
As I was the only engineer in my team & picked up required technology like Hadoop, Apache Spark on my own & evaluated Python, R & Scala programming language.
Automate the Sales Credit Allocation for the sales transaction. Nearly 5% of X million sales transactions need to be manually allocated to the right Sales Account Team f...
Mots-clés
collaborative computing
hadoop
networking
apache mahout
association rule
clustering
data mining
Tout plus