SlideShare une entreprise Scribd logo
1  sur  14
Outline About Me The VivaSmart Story The EIR Experience The Cloudera Story .. so far What is Hadoop? Open Source Business Models Lessons Learned & Advice
About Me I got BS and MS from Cairo University in Egypt. I came to US in 1995 to get my PhD from Stanford, with goal to go back to Egypt and teach. I got infected by the Entrepreneurship bug, it is rampant at Stanford, hopefully you’ll get infected too In 1999 I took a leave of absence from PhD to start VivaSmart, which I sold to Yahoo in 2000. I stayed with Yahoo till mid-2008, also finished my PhD in mid-2007 with Mendel Rosenblum. I started Cloudera in fall of 2008.
The VivaSmart Story It started as Booksmart in Spring of 1999. Initial prototype was built by Thai Tran. We got funded by a great angel (Frank Marshall). Couldn’t raise VC money, but we were able to raise more angel money, and got lighthouse customers. Noticed that it is hard to drive traffic, decided to focus on catalog management technology (Aptivia). Got initial acquisition termsheet from Excite@Home for $12M but they reneged at last minute (4/2000) Yahoo Shopping acquired us for $9M in June 2000.
The EIR Experience EIR = Entrepreneur in Residence. Joined Accel Partners in June 2008 as an EIR. Spent most of the summer researching possible ideas for my next venture, also helped with due diligence for a number of companies. Experienced the fund raising process from the VC side, very useful to see how they think. Met my Cloudera co-founders through Accel Andrew Braccia (agb) and Ping Li (pli) from Accel Partners joined the Cloudera Board of Directors.
The Cloudera Story … so far Oct 2008: Got $5M round A funding from Accel Partners and a number of strategic angel investors. Four founders (too many?):  Mike Olson (Oracle) Jeff Hammerbacher (Facebook) Christophe Bisciglia (Google) AmrAwadallah (Yahoo) Announced the company in March of 2009. May 2009: Got $6M in funding from Greylock Ventures (opportunistic B round) AneelBhusri joined our board from Greylock
Cloudera’s Elevator Pitch A single,consolidated repository to enable insights across complex and structured data. Complex Data Documents Web feeds System logs Online forums SharePoint Sensor data EMB archives Photo/Video Structured Data (“relational”)  CRM Financials Logistics Inventory Sales records HR records
What is Hadoop? The foundation of our system is built on top of Apache Hadoop, which is a scalable distributed data processing system. The scalability of Hadoop comes from marriage of: HDFS: Self-Healing High-Bandwidth Clustered Storage. MapReduce: Fault-Tolerant Distributed Processing. The software manages and heals it self. Leverages the economies of scale of commodity hardware (multi-core chips, many disks per system) Compute moves to data (not other way around).
Hadoop History 2002-2004: Doug Cutting and Mike Cafarella started working on Nutch 2003-2004: Google publishes GFS and MapReduce papers  2004: Cutting adds DFS & MapReduce support to Nutch 2006: Yahoo! hires Cutting, Hadoop spins out of Nutch 2007: NY Times converts 4TB of archives over 100 EC2s 2008: Web-scale deployments at Y!, Facebook, Last.fm April 2008: Yahoo does fastest sort of a TB, 3.5mins over 910 nodes May 2009: Yahoo does fastest sort of a TB, 62secs over 1460 nodes Yahoo sorts a PB in 16.25hours over 3658 nodes June 2009, Oct 2009: Hadoop Summit (750), Hadoop World (500) September 2009: Doug Cutting joins Cloudera
Open Source Software Business Models Open Source is attractive since it gets you: Free Distribution: People can download and try it out Darwinian Effect: Lots of developers try to solve the problem, best solution wins. Faster Innovation: Customers build the product with you! OSS Business Models: Support/Maintenance/Service agreements Open Core: core is free, but there is value-add proprietary technology around it (“Community” vs “Enterprise” Edition) Monetization through enablement of other services (e.g. Firefox makes money from Google Search).
Lessons Learned & Advice Make sure your idea can actually make money! Hire great people (corollary: Fire swiftly). Make sure you are passionate about your idea. Listen to customers, but look for the problems, it is your job to come up with solutions. Be agile, iterate quickly, don’t spend a year planning, don’t be afraid to make mistakes. Don’t be afraid to fail, but don’t persist in your failing ways, learn from failure quickly and evolve (Moore) Have faith, but don’t let it blind you from reality
Books I Recommend “Blue Ocean Strategy”, W. Chan Kim, Renée Mauborgne. “The Innovator’s Dilemma”, Clayton Christensen “The Innovator’s Solution”, Clayton Christensen, and Michael Raynor “Good to Great”, Jim Collins “The Seven Habits of Highly Effective People”, Stephen Covey “Crossing the Chasm”,“Tornado”, Geoffrey Moore “The Black Swan”, NassimTaleb.
Contact Information We Are Hiring: jobs+ee203@cloudera.com AmrAwadallah CTO, Cloudera Inc. http://twitter.com/awadallah Online Training Videos and Info: http://cloudera.com/hadoop-training http://cloudera.com/blog http://twitter.com/cloudera
Cloudera/Stanford EE203 (Entrepreneurial Engineer)

Contenu connexe

Tendances

Why hadoop for data science?
Why hadoop for data science?Why hadoop for data science?
Why hadoop for data science?
Hortonworks
 
Spark tutorial @ KCC 2015
Spark tutorial @ KCC 2015Spark tutorial @ KCC 2015
Spark tutorial @ KCC 2015
Jongwook Woo
 
The Evolution of Big Data Frameworks
The Evolution of Big Data FrameworksThe Evolution of Big Data Frameworks
The Evolution of Big Data Frameworks
eXascale Infolab
 
Apache hadoop bigdata-in-banking
Apache hadoop bigdata-in-bankingApache hadoop bigdata-in-banking
Apache hadoop bigdata-in-banking
m_hepburn
 
Big data, map reduce and beyond
Big data, map reduce and beyondBig data, map reduce and beyond
Big data, map reduce and beyond
datasalt
 

Tendances (20)

Big data with Hadoop - Introduction
Big data with Hadoop - IntroductionBig data with Hadoop - Introduction
Big data with Hadoop - Introduction
 
Why hadoop for data science?
Why hadoop for data science?Why hadoop for data science?
Why hadoop for data science?
 
Cisco event 6 05 2014v3 wwt only
Cisco event 6 05 2014v3 wwt onlyCisco event 6 05 2014v3 wwt only
Cisco event 6 05 2014v3 wwt only
 
Apache Hadoop Tutorial | Hadoop Tutorial For Beginners | Big Data Hadoop | Ha...
Apache Hadoop Tutorial | Hadoop Tutorial For Beginners | Big Data Hadoop | Ha...Apache Hadoop Tutorial | Hadoop Tutorial For Beginners | Big Data Hadoop | Ha...
Apache Hadoop Tutorial | Hadoop Tutorial For Beginners | Big Data Hadoop | Ha...
 
Introduction To Big Data & Hadoop
Introduction To Big Data & HadoopIntroduction To Big Data & Hadoop
Introduction To Big Data & Hadoop
 
Introduction to Big Data & Hadoop
Introduction to Big Data & HadoopIntroduction to Big Data & Hadoop
Introduction to Big Data & Hadoop
 
Spark tutorial @ KCC 2015
Spark tutorial @ KCC 2015Spark tutorial @ KCC 2015
Spark tutorial @ KCC 2015
 
Introduction to Big data & Hadoop -I
Introduction to Big data & Hadoop -IIntroduction to Big data & Hadoop -I
Introduction to Big data & Hadoop -I
 
The Evolution of Big Data Frameworks
The Evolution of Big Data FrameworksThe Evolution of Big Data Frameworks
The Evolution of Big Data Frameworks
 
Big Data: An Overview
Big Data: An OverviewBig Data: An Overview
Big Data: An Overview
 
Apache hadoop bigdata-in-banking
Apache hadoop bigdata-in-bankingApache hadoop bigdata-in-banking
Apache hadoop bigdata-in-banking
 
Big Data Analytics with Hadoop
Big Data Analytics with HadoopBig Data Analytics with Hadoop
Big Data Analytics with Hadoop
 
Hadoop at the Center: The Next Generation of Hadoop
Hadoop at the Center: The Next Generation of HadoopHadoop at the Center: The Next Generation of Hadoop
Hadoop at the Center: The Next Generation of Hadoop
 
Hadoop Tutorial | What is Hadoop | Hadoop Project on Reddit | Edureka
Hadoop Tutorial | What is Hadoop | Hadoop Project on Reddit | EdurekaHadoop Tutorial | What is Hadoop | Hadoop Project on Reddit | Edureka
Hadoop Tutorial | What is Hadoop | Hadoop Project on Reddit | Edureka
 
Hadoop Training For Beginners | Hadoop Tutorial | Big Data Training |Edureka
Hadoop Training For Beginners | Hadoop Tutorial | Big Data Training |EdurekaHadoop Training For Beginners | Hadoop Tutorial | Big Data Training |Edureka
Hadoop Training For Beginners | Hadoop Tutorial | Big Data Training |Edureka
 
Webinar: Big Data & Hadoop - When not to use Hadoop
Webinar: Big Data & Hadoop - When not to use HadoopWebinar: Big Data & Hadoop - When not to use Hadoop
Webinar: Big Data & Hadoop - When not to use Hadoop
 
Big data, map reduce and beyond
Big data, map reduce and beyondBig data, map reduce and beyond
Big data, map reduce and beyond
 
Data Science with Hadoop: A Primer
Data Science with Hadoop: A PrimerData Science with Hadoop: A Primer
Data Science with Hadoop: A Primer
 
Big data 101
Big data 101Big data 101
Big data 101
 
What is Hadoop? Oct 17 2013
What is Hadoop? Oct 17 2013What is Hadoop? Oct 17 2013
What is Hadoop? Oct 17 2013
 

En vedette

How Apache Hadoop is Revolutionizing Business Intelligence and Data Analytics...
How Apache Hadoop is Revolutionizing Business Intelligence and Data Analytics...How Apache Hadoop is Revolutionizing Business Intelligence and Data Analytics...
How Apache Hadoop is Revolutionizing Business Intelligence and Data Analytics...
Amr Awadallah
 
Brief introduction on Hadoop,Dremel, Pig, FlumeJava and Cassandra
Brief introduction on Hadoop,Dremel, Pig, FlumeJava and CassandraBrief introduction on Hadoop,Dremel, Pig, FlumeJava and Cassandra
Brief introduction on Hadoop,Dremel, Pig, FlumeJava and Cassandra
Somnath Mazumdar
 
MapR-DB Elasticsearch Integration
MapR-DB Elasticsearch IntegrationMapR-DB Elasticsearch Integration
MapR-DB Elasticsearch Integration
MapR Technologies
 
Introduction to Apache Hadoop
Introduction to Apache HadoopIntroduction to Apache Hadoop
Introduction to Apache Hadoop
Christopher Pezza
 

En vedette (20)

How Apache Hadoop is Revolutionizing Business Intelligence and Data Analytics...
How Apache Hadoop is Revolutionizing Business Intelligence and Data Analytics...How Apache Hadoop is Revolutionizing Business Intelligence and Data Analytics...
How Apache Hadoop is Revolutionizing Business Intelligence and Data Analytics...
 
Schema-on-Read vs Schema-on-Write
Schema-on-Read vs Schema-on-WriteSchema-on-Read vs Schema-on-Write
Schema-on-Read vs Schema-on-Write
 
Service Primitives for Internet Scale Applications
Service Primitives for Internet Scale ApplicationsService Primitives for Internet Scale Applications
Service Primitives for Internet Scale Applications
 
How Hadoop Revolutionized Data Warehousing at Yahoo and Facebook
How Hadoop Revolutionized Data Warehousing at Yahoo and FacebookHow Hadoop Revolutionized Data Warehousing at Yahoo and Facebook
How Hadoop Revolutionized Data Warehousing at Yahoo and Facebook
 
Hadoop World 2011: How Hadoop Revolutionized Business Intelligence and Advanc...
Hadoop World 2011: How Hadoop Revolutionized Business Intelligence and Advanc...Hadoop World 2011: How Hadoop Revolutionized Business Intelligence and Advanc...
Hadoop World 2011: How Hadoop Revolutionized Business Intelligence and Advanc...
 
Brief introduction on Hadoop,Dremel, Pig, FlumeJava and Cassandra
Brief introduction on Hadoop,Dremel, Pig, FlumeJava and CassandraBrief introduction on Hadoop,Dremel, Pig, FlumeJava and Cassandra
Brief introduction on Hadoop,Dremel, Pig, FlumeJava and Cassandra
 
ElasticES-Hadoop: Bridging the world of Hadoop and Elasticsearch
ElasticES-Hadoop: Bridging the world of Hadoop and ElasticsearchElasticES-Hadoop: Bridging the world of Hadoop and Elasticsearch
ElasticES-Hadoop: Bridging the world of Hadoop and Elasticsearch
 
MapR-DB Elasticsearch Integration
MapR-DB Elasticsearch IntegrationMapR-DB Elasticsearch Integration
MapR-DB Elasticsearch Integration
 
Hadoop in the Enterprise - Dr. Amr Awadallah @ Microstrategy World 2011
Hadoop in the Enterprise - Dr. Amr Awadallah @ Microstrategy World 2011Hadoop in the Enterprise - Dr. Amr Awadallah @ Microstrategy World 2011
Hadoop in the Enterprise - Dr. Amr Awadallah @ Microstrategy World 2011
 
Real-time Puppies and Ponies - Evolving Indicator Recommendations in Real-time
Real-time Puppies and Ponies - Evolving Indicator Recommendations in Real-timeReal-time Puppies and Ponies - Evolving Indicator Recommendations in Real-time
Real-time Puppies and Ponies - Evolving Indicator Recommendations in Real-time
 
Big Data Modeling and Analytic Patterns – Beyond Schema on Read
Big Data Modeling and Analytic Patterns – Beyond Schema on ReadBig Data Modeling and Analytic Patterns – Beyond Schema on Read
Big Data Modeling and Analytic Patterns – Beyond Schema on Read
 
Hadoop: An Industry Perspective
Hadoop: An Industry PerspectiveHadoop: An Industry Perspective
Hadoop: An Industry Perspective
 
Introduction to Apache Hadoop
Introduction to Apache HadoopIntroduction to Apache Hadoop
Introduction to Apache Hadoop
 
Baptist Health: Solving Healthcare Problems with Big Data
Baptist Health: Solving Healthcare Problems with Big DataBaptist Health: Solving Healthcare Problems with Big Data
Baptist Health: Solving Healthcare Problems with Big Data
 
Apache Drill - Why, What, How
Apache Drill - Why, What, HowApache Drill - Why, What, How
Apache Drill - Why, What, How
 
The Future of Data Management: The Enterprise Data Hub
The Future of Data Management: The Enterprise Data HubThe Future of Data Management: The Enterprise Data Hub
The Future of Data Management: The Enterprise Data Hub
 
Big Data Hadoop Briefing Hosted by Cisco, WWT and MapR: MapR Overview Present...
Big Data Hadoop Briefing Hosted by Cisco, WWT and MapR: MapR Overview Present...Big Data Hadoop Briefing Hosted by Cisco, WWT and MapR: MapR Overview Present...
Big Data Hadoop Briefing Hosted by Cisco, WWT and MapR: MapR Overview Present...
 
Hadoop application architectures - using Customer 360 as an example
Hadoop application architectures - using Customer 360 as an exampleHadoop application architectures - using Customer 360 as an example
Hadoop application architectures - using Customer 360 as an example
 
Introduction to Apache HBase, MapR Tables and Security
Introduction to Apache HBase, MapR Tables and SecurityIntroduction to Apache HBase, MapR Tables and Security
Introduction to Apache HBase, MapR Tables and Security
 
Apache Drill: Building Highly Flexible, High Performance Query Engines by M.C...
Apache Drill: Building Highly Flexible, High Performance Query Engines by M.C...Apache Drill: Building Highly Flexible, High Performance Query Engines by M.C...
Apache Drill: Building Highly Flexible, High Performance Query Engines by M.C...
 

Similaire à Cloudera/Stanford EE203 (Entrepreneurial Engineer)

A Self Funding Agile Transformation
A Self Funding Agile TransformationA Self Funding Agile Transformation
A Self Funding Agile Transformation
Daniel Poon
 
Dropbox Startup Lessons Learned
Dropbox Startup Lessons LearnedDropbox Startup Lessons Learned
Dropbox Startup Lessons Learned
gueste94e4c
 
Dropbox Startuplessonslearned 100423230315 Phpapp02
Dropbox Startuplessonslearned 100423230315 Phpapp02Dropbox Startuplessonslearned 100423230315 Phpapp02
Dropbox Startuplessonslearned 100423230315 Phpapp02
Nateal33t
 
A Delicious Tale
A Delicious TaleA Delicious Tale
A Delicious Tale
gwmm
 

Similaire à Cloudera/Stanford EE203 (Entrepreneurial Engineer) (20)

Scaling from new start to enterprise platform
Scaling from new start to enterprise platformScaling from new start to enterprise platform
Scaling from new start to enterprise platform
 
Hofmockel ignite ames2010
Hofmockel ignite ames2010Hofmockel ignite ames2010
Hofmockel ignite ames2010
 
A Self Funding Agile Transformation
A Self Funding Agile TransformationA Self Funding Agile Transformation
A Self Funding Agile Transformation
 
Dropbox Startup Lessons Learned
Dropbox Startup Lessons LearnedDropbox Startup Lessons Learned
Dropbox Startup Lessons Learned
 
Dropbox Startuplessonslearned 100423230315 Phpapp02
Dropbox Startuplessonslearned 100423230315 Phpapp02Dropbox Startuplessonslearned 100423230315 Phpapp02
Dropbox Startuplessonslearned 100423230315 Phpapp02
 
A Delicious Tale
A Delicious TaleA Delicious Tale
A Delicious Tale
 
cloud-application-architectures-oreilly-media.pdf
cloud-application-architectures-oreilly-media.pdfcloud-application-architectures-oreilly-media.pdf
cloud-application-architectures-oreilly-media.pdf
 
Convergence - Diverse Journeys to the Same Truth
Convergence - Diverse Journeys to the Same TruthConvergence - Diverse Journeys to the Same Truth
Convergence - Diverse Journeys to the Same Truth
 
Open for Business: A Quick Guide to Starting Your Venture in the Cloud
Open for Business: A Quick Guide to Starting Your Venture in the CloudOpen for Business: A Quick Guide to Starting Your Venture in the Cloud
Open for Business: A Quick Guide to Starting Your Venture in the Cloud
 
Why Open Always Trumps Closed (Eventually) - Drupalcamp Finland Keynote
Why Open Always Trumps Closed (Eventually) - Drupalcamp Finland KeynoteWhy Open Always Trumps Closed (Eventually) - Drupalcamp Finland Keynote
Why Open Always Trumps Closed (Eventually) - Drupalcamp Finland Keynote
 
Cloud Transformations
Cloud TransformationsCloud Transformations
Cloud Transformations
 
Absi Presentation at BBQ 2013
Absi Presentation at BBQ 2013 Absi Presentation at BBQ 2013
Absi Presentation at BBQ 2013
 
Opening Keynote by Dr. Werner Vogels
Opening Keynote by Dr. Werner VogelsOpening Keynote by Dr. Werner Vogels
Opening Keynote by Dr. Werner Vogels
 
Cloudsourcing2013
Cloudsourcing2013Cloudsourcing2013
Cloudsourcing2013
 
Google's company profile
Google's company profileGoogle's company profile
Google's company profile
 
Special talk: Introduction to Big Data and FinTech at Financial Supervisory S...
Special talk: Introduction to Big Data and FinTech at Financial Supervisory S...Special talk: Introduction to Big Data and FinTech at Financial Supervisory S...
Special talk: Introduction to Big Data and FinTech at Financial Supervisory S...
 
DrupalCon Chicago 2011 ReportBack (11/03/30 - G. Bedford)
DrupalCon Chicago 2011 ReportBack (11/03/30 - G. Bedford)DrupalCon Chicago 2011 ReportBack (11/03/30 - G. Bedford)
DrupalCon Chicago 2011 ReportBack (11/03/30 - G. Bedford)
 
Intelligence artificielle. Pourquoi et comment. Web à Québec 2017.
Intelligence artificielle. Pourquoi et comment. Web à Québec 2017.Intelligence artificielle. Pourquoi et comment. Web à Québec 2017.
Intelligence artificielle. Pourquoi et comment. Web à Québec 2017.
 
A study on Google
A study on GoogleA study on Google
A study on Google
 
Built to Thrive
Built to ThriveBuilt to Thrive
Built to Thrive
 

Dernier

IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
Enterprise Knowledge
 

Dernier (20)

Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 

Cloudera/Stanford EE203 (Entrepreneurial Engineer)

  • 1.
  • 2. Outline About Me The VivaSmart Story The EIR Experience The Cloudera Story .. so far What is Hadoop? Open Source Business Models Lessons Learned & Advice
  • 3. About Me I got BS and MS from Cairo University in Egypt. I came to US in 1995 to get my PhD from Stanford, with goal to go back to Egypt and teach. I got infected by the Entrepreneurship bug, it is rampant at Stanford, hopefully you’ll get infected too In 1999 I took a leave of absence from PhD to start VivaSmart, which I sold to Yahoo in 2000. I stayed with Yahoo till mid-2008, also finished my PhD in mid-2007 with Mendel Rosenblum. I started Cloudera in fall of 2008.
  • 4. The VivaSmart Story It started as Booksmart in Spring of 1999. Initial prototype was built by Thai Tran. We got funded by a great angel (Frank Marshall). Couldn’t raise VC money, but we were able to raise more angel money, and got lighthouse customers. Noticed that it is hard to drive traffic, decided to focus on catalog management technology (Aptivia). Got initial acquisition termsheet from Excite@Home for $12M but they reneged at last minute (4/2000) Yahoo Shopping acquired us for $9M in June 2000.
  • 5. The EIR Experience EIR = Entrepreneur in Residence. Joined Accel Partners in June 2008 as an EIR. Spent most of the summer researching possible ideas for my next venture, also helped with due diligence for a number of companies. Experienced the fund raising process from the VC side, very useful to see how they think. Met my Cloudera co-founders through Accel Andrew Braccia (agb) and Ping Li (pli) from Accel Partners joined the Cloudera Board of Directors.
  • 6. The Cloudera Story … so far Oct 2008: Got $5M round A funding from Accel Partners and a number of strategic angel investors. Four founders (too many?): Mike Olson (Oracle) Jeff Hammerbacher (Facebook) Christophe Bisciglia (Google) AmrAwadallah (Yahoo) Announced the company in March of 2009. May 2009: Got $6M in funding from Greylock Ventures (opportunistic B round) AneelBhusri joined our board from Greylock
  • 7. Cloudera’s Elevator Pitch A single,consolidated repository to enable insights across complex and structured data. Complex Data Documents Web feeds System logs Online forums SharePoint Sensor data EMB archives Photo/Video Structured Data (“relational”) CRM Financials Logistics Inventory Sales records HR records
  • 8. What is Hadoop? The foundation of our system is built on top of Apache Hadoop, which is a scalable distributed data processing system. The scalability of Hadoop comes from marriage of: HDFS: Self-Healing High-Bandwidth Clustered Storage. MapReduce: Fault-Tolerant Distributed Processing. The software manages and heals it self. Leverages the economies of scale of commodity hardware (multi-core chips, many disks per system) Compute moves to data (not other way around).
  • 9. Hadoop History 2002-2004: Doug Cutting and Mike Cafarella started working on Nutch 2003-2004: Google publishes GFS and MapReduce papers 2004: Cutting adds DFS & MapReduce support to Nutch 2006: Yahoo! hires Cutting, Hadoop spins out of Nutch 2007: NY Times converts 4TB of archives over 100 EC2s 2008: Web-scale deployments at Y!, Facebook, Last.fm April 2008: Yahoo does fastest sort of a TB, 3.5mins over 910 nodes May 2009: Yahoo does fastest sort of a TB, 62secs over 1460 nodes Yahoo sorts a PB in 16.25hours over 3658 nodes June 2009, Oct 2009: Hadoop Summit (750), Hadoop World (500) September 2009: Doug Cutting joins Cloudera
  • 10. Open Source Software Business Models Open Source is attractive since it gets you: Free Distribution: People can download and try it out Darwinian Effect: Lots of developers try to solve the problem, best solution wins. Faster Innovation: Customers build the product with you! OSS Business Models: Support/Maintenance/Service agreements Open Core: core is free, but there is value-add proprietary technology around it (“Community” vs “Enterprise” Edition) Monetization through enablement of other services (e.g. Firefox makes money from Google Search).
  • 11. Lessons Learned & Advice Make sure your idea can actually make money! Hire great people (corollary: Fire swiftly). Make sure you are passionate about your idea. Listen to customers, but look for the problems, it is your job to come up with solutions. Be agile, iterate quickly, don’t spend a year planning, don’t be afraid to make mistakes. Don’t be afraid to fail, but don’t persist in your failing ways, learn from failure quickly and evolve (Moore) Have faith, but don’t let it blind you from reality
  • 12. Books I Recommend “Blue Ocean Strategy”, W. Chan Kim, Renée Mauborgne. “The Innovator’s Dilemma”, Clayton Christensen “The Innovator’s Solution”, Clayton Christensen, and Michael Raynor “Good to Great”, Jim Collins “The Seven Habits of Highly Effective People”, Stephen Covey “Crossing the Chasm”,“Tornado”, Geoffrey Moore “The Black Swan”, NassimTaleb.
  • 13. Contact Information We Are Hiring: jobs+ee203@cloudera.com AmrAwadallah CTO, Cloudera Inc. http://twitter.com/awadallah Online Training Videos and Info: http://cloudera.com/hadoop-training http://cloudera.com/blog http://twitter.com/cloudera

Notes de l'éditeur

  1. http://developer.yahoo.net/blogs/hadoop/2009/05/hadoop_sorts_a_petabyte_in_162.html100s of deployments worldwide (http://wiki.apache.org/hadoop/PoweredBy)