Justin Erickson, Cloudera: Hadoop & SQL
The Hive
1. The Platform for Big Data: It's Not Just About SQL on Hadoop
[Architecture diagram]
• Engines
  • Batch Processing: MapReduce, Hive & Pig, …
  • Interactive SQL: Impala
  • Interactive Search: Solr
  • Interactive Analytics: SAS, R, …
• Storage
  • HDFS: Text, RCFile, Parquet, Avro, …
  • HBase: records
• Integration
• Resource Management
• Metadata
• Management | Support
[Platform characteristics]
• Single platform for processing: ML, SQL, Search, SAS, R, …
• Scales to '000s of servers
• No upfront schema
• 10% the cost per TB
• Open source platform
©2013 Cloudera, Inc. All Rights Reserved.
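The "no upfront schema" point can be illustrated with a minimal sketch of schema-on-read: data is stored as raw text, and each reader supplies its own interpretation when it reads. The sample records, the `read_with_schema` helper, and the column names are all hypothetical and are not taken from the slides or from any Hadoop API.

```python
import csv
import io

# Hypothetical raw file contents, stored exactly as they arrived.
# No schema was declared when the data was written.
RAW = "2013-10-01,alice,42.50\n2013-10-02,bob,17.00\n"

def read_with_schema(raw_text, schema):
    """Apply (column name, converter) pairs to each raw record at read time."""
    rows = []
    for record in csv.reader(io.StringIO(raw_text)):
        rows.append(
            {name: convert(value) for (name, convert), value in zip(schema, record)}
        )
    return rows

# Two readers interpret the same stored text differently.
# Neither interpretation required rewriting or reloading the data.
as_strings = read_with_schema(RAW, [("day", str), ("user", str), ("amount", str)])
as_typed = read_with_schema(RAW, [("day", str), ("user", str), ("amount", float)])

total = sum(row["amount"] for row in as_typed)
print(total)
```

The design choice this models is that the cost of choosing a schema moves from load time to query time, so a new interpretation of existing data needs no migration.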
2. Impala Today
• Interactive SQL
  • Typically 4-65x faster than the latest Hive (observed 100x faster)
  • Responses in seconds instead of minutes (sometimes sub-second)
• ANSI-92 standard SQL queries with HiveQL
  • Compatible SQL interface for existing Hadoop/CDH applications
  • Industry standard SQL
• Natively on Hadoop/HBase storage and metadata
  • Flexibility, scale, and cost advantages of Hadoop
  • No duplication/synchronization of data and metadata
  • Local processing to avoid network bottlenecks
• Separate runtime from batch Hive, Pig, or MapReduce
  • Hive is designed for, and great at, batch processing
  • Impala is purpose-built for low-latency SQL queries on Hadoop
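To make "seconds instead of minutes" concrete, here is a small worked calculation applying the speedup factors quoted on the slide. The 10-minute Hive baseline is an assumption chosen for illustration and is not a figure from the deck.

```python
# Hypothetical baseline: a Hive query that takes 10 minutes.
# This number is an assumption, not a measurement from the slides.
HIVE_SECONDS = 10 * 60

# Speedup factors quoted on the slide: typical range 4x to 65x, observed up to 100x.
SPEEDUPS = {"typical low": 4, "typical high": 65, "best observed": 100}

def impala_seconds(hive_seconds, speedup):
    """Estimated response time if the same query ran `speedup` times faster."""
    return hive_seconds / speedup

estimates = {
    label: impala_seconds(HIVE_SECONDS, factor) for label, factor in SPEEDUPS.items()
}

for label, seconds in estimates.items():
    print(f"{label:>13}: {seconds:6.1f} s")
```

Under this assumed baseline, the low end of the typical range still takes a couple of minutes, while the high end and the best observed case land in single-digit seconds.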
3. Impala's Benefits Today
• Unlocks BI/analytics on Hadoop
  • Interactive SQL in seconds/milliseconds
  • Highly concurrent to handle 100s and 1000s of users
• Native Hadoop flexibility
  • No data migration, conversion, or duplication required
  • Query across existing Hadoop data
  • Run multiple frameworks on the same data at the same time
  • Supports Parquet for best-of-breed columnar performance
• Native MPP query engine designed into Hadoop:
  • Unified Hadoop storage
  • Unified Hadoop metadata (uses Hive and HCatalog)
  • Unified Hadoop security
  • Fine-grained role-based access controls with Sentry
• Apache-licensed open source
• Deployed and proven across many customers today
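The columnar-performance point can be sketched with a toy comparison of row and column layouts for a single-column aggregate. This is only a model of how many values each layout has to touch. It does not use or reproduce Parquet's actual file format, and the table contents and function names are made up for illustration.

```python
# Toy table: 4 rows, 3 columns. All values are invented for illustration.
ROWS = [
    {"user": "alice", "country": "US", "amount": 42.5},
    {"user": "bob", "country": "FR", "amount": 17.0},
    {"user": "carol", "country": "US", "amount": 8.25},
    {"user": "dave", "country": "IN", "amount": 30.0},
]

# Columnar layout: one contiguous list per column.
COLUMNS = {name: [row[name] for row in ROWS] for name in ROWS[0]}

def sum_row_layout(rows, column):
    """Row layout: model every field of every row as read to reach one column."""
    values_read = sum(len(row) for row in rows)
    return sum(row[column] for row in rows), values_read

def sum_column_layout(columns, column):
    """Columnar layout: only the requested column is read."""
    values = columns[column]
    return sum(values), len(values)

row_total, row_reads = sum_row_layout(ROWS, "amount")
col_total, col_reads = sum_column_layout(COLUMNS, "amount")

print(row_total, row_reads)
print(col_total, col_reads)
```

Both layouts return the same total, but in this model the columnar scan touches one value per row instead of one value per field, which is the basic reason analytic queries over a few columns tend to favor columnar storage.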