SlideShare une entreprise Scribd logo
1  sur  29
Télécharger pour lire hors ligne
Curb	
  Your	
  Insecurity	
  with	
  	
  HDP	
  
Tips	
  for	
  a	
  Secure	
  Cluster	
  (with	
  Spark	
  too)	
  	
  
	
  
Ancil	
  McBarneA	
  
Senior	
  Solu*ons	
  Engineer	
  –	
  Security	
  &	
  
Governance	
  
	
  
Future	
  of	
  Data	
  Meetup	
  –	
  New	
  York	
  
June	
  2nd,	
  2016	
  
2	
   ©	
  Hortonworks	
  Inc.	
  2011	
  –	
  2016.	
  All	
  Rights	
  Reserved	
  
Hadoop Security in 4 Steps
3	
   ©	
  Hortonworks	
  Inc.	
  2011	
  –	
  2016.	
  All	
  Rights	
  Reserved	
  
Agenda
•  Introduction to Hadoop Security
–  The	
  4	
  Steps	
  to	
  Hadoop	
  Security	
  
•  Authentication with Kerbeos
–  Integra*on	
  with	
  LDAP	
  
•  Authorization with Apache Ranger
–  Hive,	
  HDFS,	
  YARN	
  
•  Rest API Security with Apache Knox
–  WebHDFS	
  
–  Hive	
  
•  Encrypt the Data/ Data Protection
–  Transparent	
  Data	
  Encryp*on	
  and	
  KMS	
  
4	
   ©	
  Hortonworks	
  Inc.	
  2011	
  –	
  2016.	
  All	
  Rights	
  Reserved	
  
How	
  do	
  I	
  set	
  policy	
  across	
  the	
  en*re	
  cluster?	
  
Who	
  am	
  I/prove	
  it?	
  
What	
  can	
  I	
  do?	
  
What	
  did	
  I	
  do?	
  
How	
  can	
  I	
  encrypt	
  at	
  rest	
  and	
  over	
  the	
  wire?	
  
Comprehensive	
  Approach	
  to	
  Security	
  
Data	
  ProtecDon	
  
Protect	
  data	
  at	
  rest	
  and	
  in	
  mo*on	
  
In	
  order	
  to	
  protect	
  any	
  data	
  system	
  you	
  must	
  implement	
  the	
  following:	
  
Audit	
  
Maintain	
  a	
  record	
  of	
  data	
  access	
  
AuthorizaDon	
  
Provision	
  access	
  to	
  data	
  
AuthenDcaDon	
  
Authen*cate	
  users	
  and	
  systems	
  
AdministraDon	
  
Central	
  management	
  and	
  consistent	
  security	
  
5	
   ©	
  Hortonworks	
  Inc.	
  2011	
  –	
  2016.	
  All	
  Rights	
  Reserved	
  
HDP	
  Security:	
  Comprehensive,	
  Complete,	
  Extensible	
  
Data	
  ProtecDon	
  
Protect	
  data	
  at	
  rest	
  and	
  in	
  mo*on	
  
Security	
  in	
  HDP	
  is	
  the	
  most	
  comprehensive,	
  complete	
  and	
  extensible	
  for	
  Hadoop	
  
Audit	
  
Maintain	
  a	
  record	
  of	
  data	
  access	
  
AuthorizaDon	
  
Provision	
  access	
  to	
  data	
  
AuthenDcaDon	
  
Authen*cate	
  users	
  and	
  systems	
  
AdministraDon	
  
Central	
  management	
  and	
  consistent	
  security	
  
Single	
  administra*ve	
  console	
  to	
  set	
  policy	
  across	
  
the	
  en*re	
  cluster:	
  Apache	
  Ranger	
  
Authen*ca*on	
  for	
  perimeter	
  and	
  cluster;	
  
integrates	
  with	
  exis*ng	
  Ac*ve	
  Directory	
  and	
  
LDAP	
  solu*ons:	
  Kerberos	
  	
  |	
  	
  Apache	
  Knox	
  
Consistent	
  authoriza*on	
  controls	
  across	
  all	
  
Apache	
  components	
  within	
  HDP:	
  Apache	
  Ranger	
  
Record	
  of	
  data	
  access	
  events	
  across	
  all	
  
components	
  that	
  is	
  consistent	
  and	
  accessible:	
  
Apache	
  Ranger	
  	
  	
  
Encrypts	
  data	
  in	
  mo*on	
  and	
  data	
  at	
  rest;	
  refer	
  
partner	
  encryp*on	
  solu*ons	
  for	
  broader	
  needs:	
  
HDFS	
  TDE	
  with	
  Ranger	
  KMS	
  
6	
   ©	
  Hortonworks	
  Inc.	
  2011	
  –	
  2016.	
  All	
  Rights	
  Reserved	
  
Security: Rings of Defense
Perimeter	
  Level	
  Security	
  
•  Network	
  Security	
  (i.e.	
  Firewalls)	
  
•  Apache	
  Knox	
  (i.e.	
  Gateways)	
  
AuthenDcaDon	
  
•  Kerberos	
  
OS	
  Security	
  
AuthorizaDon	
  
•  MR	
  ACLs	
  
•  HDFS	
  Permissions	
  
•  HDFS	
  ACLs	
  
•  HiveATZ-­‐NG	
  
•  HBase	
  ACLs	
  
•  Accumulo	
  Label	
  Security	
  
7	
   ©	
  Hortonworks	
  Inc.	
  2011	
  –	
  2016.	
  All	
  Rights	
  Reserved	
  
AuthenDcaDon	
  with	
  Kerberos	
  
8	
   ©	
  Hortonworks	
  Inc.	
  2011	
  –	
  2016.	
  All	
  Rights	
  Reserved	
  
Security	
  Without	
  Kerberos	
  
9	
   ©	
  Hortonworks	
  Inc.	
  2011	
  –	
  2016.	
  All	
  Rights	
  Reserved	
  
Configure	
  Kerberos	
  –	
  Ambari	
  Wizard	
  
10	
   ©	
  Hortonworks	
  Inc.	
  2011	
  –	
  2016.	
  All	
  Rights	
  Reserved	
  
Security	
  With	
  Kerberos	
  
11	
   ©	
  Hortonworks	
  Inc.	
  2011	
  –	
  2016.	
  All	
  Rights	
  Reserved	
  
Apache	
  Ranger	
  	
  
12	
   ©	
  Hortonworks	
  Inc.	
  2011	
  –	
  2016.	
  All	
  Rights	
  Reserved	
  
Apache	
  Ranger	
  
13	
   ©	
  Hortonworks	
  Inc.	
  2011	
  –	
  2016.	
  All	
  Rights	
  Reserved	
  
Centralized	
  Security	
  with	
  Ranger	
  
•  Administrators have complete
visibility into the security
administration process
Deep	
  Visibility	
  Centralized	
  PlaVorm	
  
•  Administer security for:
– Database	
  
– Table	
  
– Column	
  
– LDAP	
  Groups	
  
– Specific	
  Users	
  
Fine-­‐Grained	
  Security	
  
DefiniDon	
  
•  Centralized platform to define,
administer and manage security
policies consistently
•  Define security policy once and
apply it to all the applicable
components across the stack
14	
   ©	
  Hortonworks	
  Inc.	
  2011	
  –	
  2016.	
  All	
  Rights	
  Reserved	
  
HDFS	
  File	
  Security	
  
15	
   ©	
  Hortonworks	
  Inc.	
  2011	
  –	
  2016.	
  All	
  Rights	
  Reserved	
  
Hive	
  Database	
  and	
  Table	
  Security	
  
16	
   ©	
  Hortonworks	
  Inc.	
  2011	
  –	
  2016.	
  All	
  Rights	
  Reserved	
  
Authorization and Audit
Authorization
Fine	
  grain	
  access	
  control	
  
•  HDFS	
  –	
  Folder,	
  File	
  
•  Hive	
  –	
  Database,	
  Table,	
  Column	
  
•  HBase	
  –	
  Table,	
  Column	
  Family,	
  Column	
  
•  Storm,	
  Knox	
  and	
  more	
  
	
  
Audit
Extensive	
  user	
  access	
  audi*ng	
  in	
  	
  
HDFS,	
  Hive	
  and	
  HBase	
  
•  IP	
  Address	
  
•  Resource	
  type/	
  resource	
  
•  Timestamp	
  
•  Access	
  granted	
  or	
  denied	
  
Control	
  access	
  
into	
  system	
  
Flexibility	
  
in	
  defining	
  
policies	
  
17	
   ©	
  Hortonworks	
  Inc.	
  2011	
  –	
  2016.	
  All	
  Rights	
  Reserved	
  
Rest	
  API	
  Security	
  with	
  Apache	
  Knox	
  	
  
18	
   ©	
  Hortonworks	
  Inc.	
  2011	
  –	
  2016.	
  All	
  Rights	
  Reserved	
  
AuthenDcaDon—API	
  Security	
  with	
  Knox	
  
Eliminates SSH “edge node”
Central API management
Central audit control
Service level authorization
SSO Integration—Siteminder
and OAM
LDAP and AD integration
Incubated	
  and	
  led	
  by	
  Hortonworks,	
  	
  
Apache	
  Knox	
  extends	
  the	
  reach	
  of	
  Hadoop	
  REST	
  API	
  
without	
  Kerberos	
  complexi*es	
  
Integrated	
  with	
  exisDng	
  systems	
  to	
  
simplify	
  idenDty	
  maintenance	
  
Single,	
  simple	
  point	
  of	
  access	
  for	
  a	
  
cluster	
  
Central	
  controls	
  ensure	
  consistency	
  
across	
  one	
  or	
  more	
  clusters	
  
Kerberos Encapsulation
Single Hadoop access point
REST API hierarchy
Consolidated API calls
Multi-cluster support
19	
   ©	
  Hortonworks	
  Inc.	
  2011	
  –	
  2016.	
  All	
  Rights	
  Reserved	
  
Load	
  Balancer	
  
Extend Hadoop API reach with Knox
Hadoop	
  Cluster	
  
Applica*on	
  Tier	
  App	
  A	
   App	
  N	
  App	
  B	
   App	
  C	
  
Data	
  Ingest	
  
ETL	
  
Admin/	
  
Operators	
  
Bas*an	
  Node	
  
SSH	
  
RPC	
  Call	
  
Falcon	
  
Oozie	
  
Scoop	
  
Flume	
  
Data	
  
Operator	
  
Business	
  
User	
  
Hadoop	
  
	
  Admin	
  
JDBC/ODBC	
  REST/HTTP	
  
Knox	
  
20	
   ©	
  Hortonworks	
  Inc.	
  2011	
  –	
  2016.	
  All	
  Rights	
  Reserved	
  
Hadoop REST APIs
Ã Useful for connecting to Hadoop from the outside the cluster
Ã When more client language flexibility is required
–  i.e.	
  Java	
  binding	
  not	
  an	
  op*on	
  
Ã Challenges
–  Client	
  must	
  have	
  knowledge	
  of	
  cluster	
  topology	
  
–  Required	
  to	
  open	
  ports	
  (and	
  in	
  some	
  cases,	
  on	
  every	
  host)	
  outside	
  the	
  cluster	
  
Service	
   API	
  
WebHDFS	
   Supports	
  HDFS	
  user	
  opera*ons	
  including	
  reading	
  files,	
  wri*ng	
  to	
  
files,	
  making	
  directories,	
  changing	
  permissions	
  and	
  renaming.	
  
WebHCat	
   Job	
  control	
  for	
  MapReduce,	
  Pig	
  and	
  Hive	
  jobs,	
  and	
  HCatalog	
  DDL	
  
commands.	
  Learn	
  more	
  about	
  WebHCat.	
  
Hive	
   Hive	
  REST	
  API	
  opera*ons	
  
HBase	
   HBase	
  REST	
  API	
  opera*ons	
  
Oozie	
   Job	
  submission	
  and	
  management,	
  and	
  Oozie	
  administra*on.	
  
21	
   ©	
  Hortonworks	
  Inc.	
  2011	
  –	
  2016.	
  All	
  Rights	
  Reserved	
  
Hadoop REST API with Knox – Representative Examples
Service	
   Direct	
  URL	
   Knox	
  URL	
  
WebHDFS	
   hkp://namenode-­‐host:50070/webhdfs	
  
	
  
hkps://knox-­‐host:8443/webhdfs	
  
WebHCat	
   hkp://webhcat-­‐host:50111/templeton	
  
	
  
hkps://knox-­‐host:8443/templeton	
  
	
  
Oozie	
   hkp://ooziehost:11000/oozie	
  
	
  
hkps://knox-­‐host:8443/oozie	
  
	
  
Hbase/
Stargate	
  
hkp://hbasehost:60080	
  
	
  
hkps://knox-­‐host:8443/hbase	
  
	
  
Hive	
   hkp://hivehost:10001/cliservice	
   hkps://knox-­‐host:8443/hive	
  
YARN	
   hkp://yarn-­‐host:yarn-­‐port/ws	
   hkps://knox-­‐host:8443/resourcemanager	
  
23	
   ©	
  Hortonworks	
  Inc.	
  2011	
  –	
  2016.	
  All	
  Rights	
  Reserved	
  
Data	
  ProtecDon	
  
24	
   ©	
  Hortonworks	
  Inc.	
  2011	
  –	
  2016.	
  All	
  Rights	
  Reserved	
  
Security	
  in	
  Hadoop	
  with	
  HDP	
  
	
  
•  Wire	
  encryp*on	
  in	
  
Hadoop	
  
•  HDFS	
  Encryp*on	
  
with	
  Ranger	
  KMS	
  
	
  
•  Centralized	
  audit	
  
repor*ng	
  with	
  
Apache	
  Ranger	
  
	
  
•  Fine-­‐grain	
  access	
  
control	
  with	
  	
  
Apache	
  Ranger	
  
AuthorizaDon	
  
What	
  can	
  I	
  do?	
  
Audit	
  
What	
  did	
  I	
  do?	
  
Data	
  ProtecDon	
  
Can	
  data	
  be	
  encrypted	
  at	
  rest	
  
and	
  over	
  the	
  wire?	
  
•  Kerberos	
  
•  API	
  security	
  with	
  Apache	
  
Knox	
  
AuthenDcaDon	
  
Who	
  am	
  I/prove	
  it?	
  
HDP	
  2.4	
  	
  
Centralized	
  Security	
  AdministraDon	
  with	
  Ranger	
  	
  	
  
25	
   ©	
  Hortonworks	
  Inc.	
  2011	
  –	
  2016.	
  All	
  Rights	
  Reserved	
  
Data Protection
HDP allows you to apply data protection policy at
different layers across the Hadoop stack
Layer	
   What?	
   How	
  ?	
  
Storage	
  and	
  
Access	
  
Encrypt	
  data	
  while	
  it	
  is	
  at	
  rest	
  
HDFS	
  Transparent	
  Data	
  Encryp*on,	
  Partners,	
  
Hbase	
  encryp*on,	
  OS	
  level	
  encrypt,	
  	
  
Transmission	
   Encrypt	
  data	
  as	
  it	
  moves	
   SSL,	
  SASL,	
  Supported	
  from	
  HDP	
  2.1	
  
26	
   ©	
  Hortonworks	
  Inc.	
  2011	
  –	
  2016.	
  All	
  Rights	
  Reserved	
  
Points	
  of	
  CommunicaDon	
  
Page	
  26	
  
WebHDFS	
  
DataTransferProtocol	
  
Nodes	
  
M/R	
  Shuffle	
  
Client	
  
1	
  
2	
  
4	
  
RPC	
  3	
  
Nodes	
  
DataTransfer	
  2	
  
JDBC/ODBC	
  
3	
  
Hadoop	
  Cluster	
  
RPC	
  
4	
  
27	
   ©	
  Hortonworks	
  Inc.	
  2011	
  –	
  2016.	
  All	
  Rights	
  Reserved	
  
Data	
  ProtecDon	
  -­‐	
  HDFS	
  EncrypDon	
  
DATA	
  	
  ACCESS	
  
	
  	
  
DATA	
  	
  MANAGEMENT	
  
SECURITY	
  PARTNERS	
  
	
  	
  YARN	
  
KeyProvider	
  API	
  
(partner	
  integra*on	
  point)	
  
	
  	
  	
  	
  	
  
	
  
Key	
  Management	
  System	
  (KMS)	
  
Stateless	
  Key	
  Management	
  
°	
  
1	
  
°	
  
°	
  
°	
  
°	
  
°	
   °	
  
°	
   °	
  
°	
   °	
  
°	
   °	
  
°	
   N	
  °	
  
1	
   °	
   °	
   °	
   °	
   °	
  
°	
   °	
   °	
   °	
   °	
   °	
  
°	
   °	
   °	
   °	
   °	
   °	
  
°	
   °	
   °	
   °	
   °	
   °	
  
°	
   °	
   °	
   °	
   °	
   °	
  
°	
   °	
  
°	
   °	
  
°	
   °	
  
°	
   °	
  
°	
  
HDFS	
  	
  
EncrypDon	
  Zone	
  	
  
Encrypted	
  
	
  File	
  
Encrypted	
  
	
  File	
  
Encrypted	
  
	
  File	
  
Encrypted	
  
	
  File	
  
Encrypted	
  
	
  Files	
  
Name	
  
Node	
  
HDFS	
  
Client	
  
HDFS	
  
Client	
  
•  Hortonworks	
  collabora*ng	
  with	
  partners	
  to	
  deliver	
  enterprise	
  scale	
  	
  	
  
Key	
  Management	
  ,	
  deliver	
  more	
  choices	
  to	
  customers	
  
•  Open	
  source	
  KMS	
  	
  with	
  Ranger	
  
•  Or	
  Partner	
  with	
  Voltage	
  KMS	
  
-  Partner	
  joint	
  engineering	
  resources	
  
-  Voltage	
  Stateless	
  Key	
  Management	
  integrated	
  with	
  KeyProvider	
  API	
  	
  
Only	
  HDP	
  offers	
  open	
  
source	
  and	
  
commercial	
  choices	
  
for	
  key	
  management	
  Open	
  Source	
  Key	
  Management	
  
28	
   ©	
  Hortonworks	
  Inc.	
  2011	
  –	
  2016.	
  All	
  Rights	
  Reserved	
  
Demo	
  Transparent	
  Data	
  EncrypDon	
  
29	
   ©	
  Hortonworks	
  Inc.	
  2011	
  –	
  2016.	
  All	
  Rights	
  Reserved	
  
Security in Spark?
Spark supports running in a Kerberized Cluster
Only Spark on YARN supports security (Kerberos support)
From command line run kinit before submitting spark jobs
Spark reads data from HDFS & ORC
•  HDFS file permissions (& Ranger integration) applicable to Spark jobs
Spark submits job to YARN queue
•  YARN queue ACL (& Ranger integration) applicable to Spark jobs
Wire Encryption
•  Spark has some coverage, not all channels are covered
LDAP Authentication
•  No Authentication in Spark UI OOB, supports filter for hooking in LDAP
30	
   ©	
  Hortonworks	
  Inc.	
  2011	
  –	
  2016.	
  All	
  Rights	
  Reserved	
  
What	
  makes	
  Hadoop	
  Summit	
  Different?	
  
– Deep	
  technical	
  sessions	
  chosen	
  by	
  the	
  community	
  	
  
– Business	
  Track	
  based	
  on	
  real-­‐world	
  implementa*ons	
  
– Keynotes	
  from	
  Progressive	
  Insurance,	
  Ford,	
  Macy’s,	
  	
  
MD	
  Anderson,	
  GE,	
  Capital	
  One,	
  …	
  	
  
– Free	
  Hands-­‐on	
  labs	
  
– Networking	
  events	
  and	
  10	
  Year	
  Celebra*on!	
  	
  
– 	
  20%	
  Off	
  Code:	
  16SJext20x	
  
Apache	
  Hadoop,	
  SPARK,	
  IoT,	
  Streaming,	
  Data	
  Science	
  
EVERYTHING	
  DATA!	
  	
  	
  

Contenu connexe

Tendances

Apache ranger meetup
Apache ranger meetupApache ranger meetup
Apache ranger meetupnvvrajesh
 
Extending Apache Ranger Authorization Beyond Hadoop: Review of Apache Ranger ...
Extending Apache Ranger Authorization Beyond Hadoop: Review of Apache Ranger ...Extending Apache Ranger Authorization Beyond Hadoop: Review of Apache Ranger ...
Extending Apache Ranger Authorization Beyond Hadoop: Review of Apache Ranger ...DataWorks Summit
 
Securing Hadoop's REST APIs with Apache Knox Gateway Hadoop Summit June 6th, ...
Securing Hadoop's REST APIs with Apache Knox Gateway Hadoop Summit June 6th, ...Securing Hadoop's REST APIs with Apache Knox Gateway Hadoop Summit June 6th, ...
Securing Hadoop's REST APIs with Apache Knox Gateway Hadoop Summit June 6th, ...Kevin Minder
 
Hadoop Security Architecture
Hadoop Security ArchitectureHadoop Security Architecture
Hadoop Security ArchitectureOwen O'Malley
 
Distilling Hadoop Patterns of Use and How You Can Use Them for Your Big Data ...
Distilling Hadoop Patterns of Use and How You Can Use Them for Your Big Data ...Distilling Hadoop Patterns of Use and How You Can Use Them for Your Big Data ...
Distilling Hadoop Patterns of Use and How You Can Use Them for Your Big Data ...Hortonworks
 
Ranger admin dev overview
Ranger admin dev overviewRanger admin dev overview
Ranger admin dev overviewTushar Dudhatra
 
Hadoop Security Today & Tomorrow with Apache Knox
Hadoop Security Today & Tomorrow with Apache KnoxHadoop Security Today & Tomorrow with Apache Knox
Hadoop Security Today & Tomorrow with Apache KnoxVinay Shukla
 
Hadoop Security Today and Tomorrow
Hadoop Security Today and TomorrowHadoop Security Today and Tomorrow
Hadoop Security Today and TomorrowDataWorks Summit
 
Hortonworks Technical Workshop: Interactive Query with Apache Hive
Hortonworks Technical Workshop: Interactive Query with Apache Hive Hortonworks Technical Workshop: Interactive Query with Apache Hive
Hortonworks Technical Workshop: Interactive Query with Apache Hive Hortonworks
 
Hadoop & Security - Past, Present, Future
Hadoop & Security - Past, Present, FutureHadoop & Security - Past, Present, Future
Hadoop & Security - Past, Present, FutureUwe Printz
 
Open Source Security Tools for Big Data
Open Source Security Tools for Big DataOpen Source Security Tools for Big Data
Open Source Security Tools for Big DataRommel Garcia
 
Fortifying Multi-Cluster Hybrid Cloud Data Lakes using Apache Knox
Fortifying Multi-Cluster Hybrid Cloud Data Lakes using Apache KnoxFortifying Multi-Cluster Hybrid Cloud Data Lakes using Apache Knox
Fortifying Multi-Cluster Hybrid Cloud Data Lakes using Apache KnoxDataWorks Summit
 
Apache Argus - How do I secure my entire Hadoop cluster? Olivier Renault @ Ho...
Apache Argus - How do I secure my entire Hadoop cluster? Olivier Renault @ Ho...Apache Argus - How do I secure my entire Hadoop cluster? Olivier Renault @ Ho...
Apache Argus - How do I secure my entire Hadoop cluster? Olivier Renault @ Ho...huguk
 
Security and Governance on Hadoop with Apache Atlas and Apache Ranger by Srik...
Security and Governance on Hadoop with Apache Atlas and Apache Ranger by Srik...Security and Governance on Hadoop with Apache Atlas and Apache Ranger by Srik...
Security and Governance on Hadoop with Apache Atlas and Apache Ranger by Srik...Artem Ervits
 
Securing the Hadoop Ecosystem
Securing the Hadoop EcosystemSecuring the Hadoop Ecosystem
Securing the Hadoop EcosystemDataWorks Summit
 
Hadoop Security: Overview
Hadoop Security: OverviewHadoop Security: Overview
Hadoop Security: OverviewCloudera, Inc.
 
Hadoop Operations: How to Secure and Control Cluster Access
Hadoop Operations: How to Secure and Control Cluster AccessHadoop Operations: How to Secure and Control Cluster Access
Hadoop Operations: How to Secure and Control Cluster AccessCloudera, Inc.
 
Managing enterprise users in Hadoop ecosystem
Managing enterprise users in Hadoop ecosystemManaging enterprise users in Hadoop ecosystem
Managing enterprise users in Hadoop ecosystemDataWorks Summit
 
Hadoop security @ Philly Hadoop Meetup May 2015
Hadoop security @ Philly Hadoop Meetup May 2015Hadoop security @ Philly Hadoop Meetup May 2015
Hadoop security @ Philly Hadoop Meetup May 2015Shravan (Sean) Pabba
 
A First-Hand Look at What's New in HDP 2.3
A First-Hand Look at What's New in HDP 2.3 A First-Hand Look at What's New in HDP 2.3
A First-Hand Look at What's New in HDP 2.3 DataWorks Summit
 

Tendances (20)

Apache ranger meetup
Apache ranger meetupApache ranger meetup
Apache ranger meetup
 
Extending Apache Ranger Authorization Beyond Hadoop: Review of Apache Ranger ...
Extending Apache Ranger Authorization Beyond Hadoop: Review of Apache Ranger ...Extending Apache Ranger Authorization Beyond Hadoop: Review of Apache Ranger ...
Extending Apache Ranger Authorization Beyond Hadoop: Review of Apache Ranger ...
 
Securing Hadoop's REST APIs with Apache Knox Gateway Hadoop Summit June 6th, ...
Securing Hadoop's REST APIs with Apache Knox Gateway Hadoop Summit June 6th, ...Securing Hadoop's REST APIs with Apache Knox Gateway Hadoop Summit June 6th, ...
Securing Hadoop's REST APIs with Apache Knox Gateway Hadoop Summit June 6th, ...
 
Hadoop Security Architecture
Hadoop Security ArchitectureHadoop Security Architecture
Hadoop Security Architecture
 
Distilling Hadoop Patterns of Use and How You Can Use Them for Your Big Data ...
Distilling Hadoop Patterns of Use and How You Can Use Them for Your Big Data ...Distilling Hadoop Patterns of Use and How You Can Use Them for Your Big Data ...
Distilling Hadoop Patterns of Use and How You Can Use Them for Your Big Data ...
 
Ranger admin dev overview
Ranger admin dev overviewRanger admin dev overview
Ranger admin dev overview
 
Hadoop Security Today & Tomorrow with Apache Knox
Hadoop Security Today & Tomorrow with Apache KnoxHadoop Security Today & Tomorrow with Apache Knox
Hadoop Security Today & Tomorrow with Apache Knox
 
Hadoop Security Today and Tomorrow
Hadoop Security Today and TomorrowHadoop Security Today and Tomorrow
Hadoop Security Today and Tomorrow
 
Hortonworks Technical Workshop: Interactive Query with Apache Hive
Hortonworks Technical Workshop: Interactive Query with Apache Hive Hortonworks Technical Workshop: Interactive Query with Apache Hive
Hortonworks Technical Workshop: Interactive Query with Apache Hive
 
Hadoop & Security - Past, Present, Future
Hadoop & Security - Past, Present, FutureHadoop & Security - Past, Present, Future
Hadoop & Security - Past, Present, Future
 
Open Source Security Tools for Big Data
Open Source Security Tools for Big DataOpen Source Security Tools for Big Data
Open Source Security Tools for Big Data
 
Fortifying Multi-Cluster Hybrid Cloud Data Lakes using Apache Knox
Fortifying Multi-Cluster Hybrid Cloud Data Lakes using Apache KnoxFortifying Multi-Cluster Hybrid Cloud Data Lakes using Apache Knox
Fortifying Multi-Cluster Hybrid Cloud Data Lakes using Apache Knox
 
Apache Argus - How do I secure my entire Hadoop cluster? Olivier Renault @ Ho...
Apache Argus - How do I secure my entire Hadoop cluster? Olivier Renault @ Ho...Apache Argus - How do I secure my entire Hadoop cluster? Olivier Renault @ Ho...
Apache Argus - How do I secure my entire Hadoop cluster? Olivier Renault @ Ho...
 
Security and Governance on Hadoop with Apache Atlas and Apache Ranger by Srik...
Security and Governance on Hadoop with Apache Atlas and Apache Ranger by Srik...Security and Governance on Hadoop with Apache Atlas and Apache Ranger by Srik...
Security and Governance on Hadoop with Apache Atlas and Apache Ranger by Srik...
 
Securing the Hadoop Ecosystem
Securing the Hadoop EcosystemSecuring the Hadoop Ecosystem
Securing the Hadoop Ecosystem
 
Hadoop Security: Overview
Hadoop Security: OverviewHadoop Security: Overview
Hadoop Security: Overview
 
Hadoop Operations: How to Secure and Control Cluster Access
Hadoop Operations: How to Secure and Control Cluster AccessHadoop Operations: How to Secure and Control Cluster Access
Hadoop Operations: How to Secure and Control Cluster Access
 
Managing enterprise users in Hadoop ecosystem
Managing enterprise users in Hadoop ecosystemManaging enterprise users in Hadoop ecosystem
Managing enterprise users in Hadoop ecosystem
 
Hadoop security @ Philly Hadoop Meetup May 2015
Hadoop security @ Philly Hadoop Meetup May 2015Hadoop security @ Philly Hadoop Meetup May 2015
Hadoop security @ Philly Hadoop Meetup May 2015
 
A First-Hand Look at What's New in HDP 2.3
A First-Hand Look at What's New in HDP 2.3 A First-Hand Look at What's New in HDP 2.3
A First-Hand Look at What's New in HDP 2.3
 

En vedette

Simplify and Secure your Hadoop Environment with Hortonworks and Centrify
Simplify and Secure your Hadoop Environment with Hortonworks and CentrifySimplify and Secure your Hadoop Environment with Hortonworks and Centrify
Simplify and Secure your Hadoop Environment with Hortonworks and CentrifyHortonworks
 
Deep learning with Hortonworks and Apache Spark - Hortonworks technical workshop
Deep learning with Hortonworks and Apache Spark - Hortonworks technical workshopDeep learning with Hortonworks and Apache Spark - Hortonworks technical workshop
Deep learning with Hortonworks and Apache Spark - Hortonworks technical workshopHortonworks
 
Discover HDP 2.2: Comprehensive Hadoop Security with Apache Ranger and Apache...
Discover HDP 2.2: Comprehensive Hadoop Security with Apache Ranger and Apache...Discover HDP 2.2: Comprehensive Hadoop Security with Apache Ranger and Apache...
Discover HDP 2.2: Comprehensive Hadoop Security with Apache Ranger and Apache...Hortonworks
 
Protecting Enterprise Data in Apache Hadoop
Protecting Enterprise Data in Apache HadoopProtecting Enterprise Data in Apache Hadoop
Protecting Enterprise Data in Apache HadoopHortonworks
 
Securing Hadoop with Apache Ranger
Securing Hadoop with Apache RangerSecuring Hadoop with Apache Ranger
Securing Hadoop with Apache RangerDataWorks Summit
 
Hadoop and Data Virtualization - A Case Study by VHA
Hadoop and Data Virtualization - A Case Study by VHAHadoop and Data Virtualization - A Case Study by VHA
Hadoop and Data Virtualization - A Case Study by VHAHortonworks
 
Hortonworks Data in Motion Webinar Series Part 7 Apache Kafka Nifi Better Tog...
Hortonworks Data in Motion Webinar Series Part 7 Apache Kafka Nifi Better Tog...Hortonworks Data in Motion Webinar Series Part 7 Apache Kafka Nifi Better Tog...
Hortonworks Data in Motion Webinar Series Part 7 Apache Kafka Nifi Better Tog...Hortonworks
 
Enabling the Real Time Analytical Enterprise
Enabling the Real Time Analytical EnterpriseEnabling the Real Time Analytical Enterprise
Enabling the Real Time Analytical EnterpriseHortonworks
 

En vedette (8)

Simplify and Secure your Hadoop Environment with Hortonworks and Centrify
Simplify and Secure your Hadoop Environment with Hortonworks and CentrifySimplify and Secure your Hadoop Environment with Hortonworks and Centrify
Simplify and Secure your Hadoop Environment with Hortonworks and Centrify
 
Deep learning with Hortonworks and Apache Spark - Hortonworks technical workshop
Deep learning with Hortonworks and Apache Spark - Hortonworks technical workshopDeep learning with Hortonworks and Apache Spark - Hortonworks technical workshop
Deep learning with Hortonworks and Apache Spark - Hortonworks technical workshop
 
Discover HDP 2.2: Comprehensive Hadoop Security with Apache Ranger and Apache...
Discover HDP 2.2: Comprehensive Hadoop Security with Apache Ranger and Apache...Discover HDP 2.2: Comprehensive Hadoop Security with Apache Ranger and Apache...
Discover HDP 2.2: Comprehensive Hadoop Security with Apache Ranger and Apache...
 
Protecting Enterprise Data in Apache Hadoop
Protecting Enterprise Data in Apache HadoopProtecting Enterprise Data in Apache Hadoop
Protecting Enterprise Data in Apache Hadoop
 
Securing Hadoop with Apache Ranger
Securing Hadoop with Apache RangerSecuring Hadoop with Apache Ranger
Securing Hadoop with Apache Ranger
 
Hadoop and Data Virtualization - A Case Study by VHA
Hadoop and Data Virtualization - A Case Study by VHAHadoop and Data Virtualization - A Case Study by VHA
Hadoop and Data Virtualization - A Case Study by VHA
 
Hortonworks Data in Motion Webinar Series Part 7 Apache Kafka Nifi Better Tog...
Hortonworks Data in Motion Webinar Series Part 7 Apache Kafka Nifi Better Tog...Hortonworks Data in Motion Webinar Series Part 7 Apache Kafka Nifi Better Tog...
Hortonworks Data in Motion Webinar Series Part 7 Apache Kafka Nifi Better Tog...
 
Enabling the Real Time Analytical Enterprise
Enabling the Real Time Analytical EnterpriseEnabling the Real Time Analytical Enterprise
Enabling the Real Time Analytical Enterprise
 

Similaire à Curb your insecurity with HDP - Tips for a Secure Cluster

Treat your enterprise data lake indigestion: Enterprise ready security and go...
Treat your enterprise data lake indigestion: Enterprise ready security and go...Treat your enterprise data lake indigestion: Enterprise ready security and go...
Treat your enterprise data lake indigestion: Enterprise ready security and go...DataWorks Summit
 
Bridle your Flying Islands and Castles in the Sky: Built-in Governance and Se...
Bridle your Flying Islands and Castles in the Sky: Built-in Governance and Se...Bridle your Flying Islands and Castles in the Sky: Built-in Governance and Se...
Bridle your Flying Islands and Castles in the Sky: Built-in Governance and Se...DataWorks Summit
 
August 2014 HUG : Comprehensive Security for Hadoop
August 2014 HUG : Comprehensive Security for HadoopAugust 2014 HUG : Comprehensive Security for Hadoop
August 2014 HUG : Comprehensive Security for HadoopYahoo Developer Network
 
An Apache Hive Based Data Warehouse
An Apache Hive Based Data WarehouseAn Apache Hive Based Data Warehouse
An Apache Hive Based Data WarehouseDataWorks Summit
 
Built-In Security for the Cloud
Built-In Security for the CloudBuilt-In Security for the Cloud
Built-In Security for the CloudDataWorks Summit
 
Discover HDP 2.1: Apache Hadoop 2.4.0, YARN & HDFS
Discover HDP 2.1: Apache Hadoop 2.4.0, YARN & HDFSDiscover HDP 2.1: Apache Hadoop 2.4.0, YARN & HDFS
Discover HDP 2.1: Apache Hadoop 2.4.0, YARN & HDFSHortonworks
 
Realtime analytics + hadoop 2.0
Realtime analytics + hadoop 2.0Realtime analytics + hadoop 2.0
Realtime analytics + hadoop 2.0Rommel Garcia
 
Realtime Analytics in Hadoop
Realtime Analytics in HadoopRealtime Analytics in Hadoop
Realtime Analytics in HadoopRommel Garcia
 
Improvements in Hadoop Security
Improvements in Hadoop SecurityImprovements in Hadoop Security
Improvements in Hadoop SecurityChris Nauroth
 
2014 sept 4_hadoop_security
2014 sept 4_hadoop_security2014 sept 4_hadoop_security
2014 sept 4_hadoop_securityAdam Muise
 
Big data spain keynote nov 2016
Big data spain keynote nov 2016Big data spain keynote nov 2016
Big data spain keynote nov 2016alanfgates
 
Discover HDP 2.1: Apache Falcon for Data Governance in Hadoop
Discover HDP 2.1: Apache Falcon for Data Governance in HadoopDiscover HDP 2.1: Apache Falcon for Data Governance in Hadoop
Discover HDP 2.1: Apache Falcon for Data Governance in HadoopHortonworks
 
Discover Enterprise Security Features in Hortonworks Data Platform 2.1: Apach...
Discover Enterprise Security Features in Hortonworks Data Platform 2.1: Apach...Discover Enterprise Security Features in Hortonworks Data Platform 2.1: Apach...
Discover Enterprise Security Features in Hortonworks Data Platform 2.1: Apach...Hortonworks
 
The Enterprise and Connected Data, Trends in the Apache Hadoop Ecosystem by A...
The Enterprise and Connected Data, Trends in the Apache Hadoop Ecosystem by A...The Enterprise and Connected Data, Trends in the Apache Hadoop Ecosystem by A...
The Enterprise and Connected Data, Trends in the Apache Hadoop Ecosystem by A...Big Data Spain
 
Hive edw-dataworks summit-eu-april-2017
Hive edw-dataworks summit-eu-april-2017Hive edw-dataworks summit-eu-april-2017
Hive edw-dataworks summit-eu-april-2017alanfgates
 
An Apache Hive Based Data Warehouse
An Apache Hive Based Data WarehouseAn Apache Hive Based Data Warehouse
An Apache Hive Based Data WarehouseDataWorks Summit
 
Saving the elephant—now, not later
Saving the elephant—now, not laterSaving the elephant—now, not later
Saving the elephant—now, not laterDataWorks Summit
 
Micro services vs hadoop
Micro services vs hadoopMicro services vs hadoop
Micro services vs hadoopGergely Devenyi
 

Similaire à Curb your insecurity with HDP - Tips for a Secure Cluster (20)

Curb Your Insecurity - Tips for a Secure Cluster (with Spark too)!!
Curb Your Insecurity - Tips for a Secure Cluster (with Spark too)!!Curb Your Insecurity - Tips for a Secure Cluster (with Spark too)!!
Curb Your Insecurity - Tips for a Secure Cluster (with Spark too)!!
 
Curb your insecurity with HDP
Curb your insecurity with HDPCurb your insecurity with HDP
Curb your insecurity with HDP
 
Treat your enterprise data lake indigestion: Enterprise ready security and go...
Treat your enterprise data lake indigestion: Enterprise ready security and go...Treat your enterprise data lake indigestion: Enterprise ready security and go...
Treat your enterprise data lake indigestion: Enterprise ready security and go...
 
Bridle your Flying Islands and Castles in the Sky: Built-in Governance and Se...
Bridle your Flying Islands and Castles in the Sky: Built-in Governance and Se...Bridle your Flying Islands and Castles in the Sky: Built-in Governance and Se...
Bridle your Flying Islands and Castles in the Sky: Built-in Governance and Se...
 
August 2014 HUG : Comprehensive Security for Hadoop
August 2014 HUG : Comprehensive Security for HadoopAugust 2014 HUG : Comprehensive Security for Hadoop
August 2014 HUG : Comprehensive Security for Hadoop
 
An Apache Hive Based Data Warehouse
An Apache Hive Based Data WarehouseAn Apache Hive Based Data Warehouse
An Apache Hive Based Data Warehouse
 
Built-In Security for the Cloud
Built-In Security for the CloudBuilt-In Security for the Cloud
Built-In Security for the Cloud
 
Discover HDP 2.1: Apache Hadoop 2.4.0, YARN & HDFS
Discover HDP 2.1: Apache Hadoop 2.4.0, YARN & HDFSDiscover HDP 2.1: Apache Hadoop 2.4.0, YARN & HDFS
Discover HDP 2.1: Apache Hadoop 2.4.0, YARN & HDFS
 
Realtime analytics + hadoop 2.0
Realtime analytics + hadoop 2.0Realtime analytics + hadoop 2.0
Realtime analytics + hadoop 2.0
 
Realtime Analytics in Hadoop
Realtime Analytics in HadoopRealtime Analytics in Hadoop
Realtime Analytics in Hadoop
 
Improvements in Hadoop Security
Improvements in Hadoop SecurityImprovements in Hadoop Security
Improvements in Hadoop Security
 
2014 sept 4_hadoop_security
2014 sept 4_hadoop_security2014 sept 4_hadoop_security
2014 sept 4_hadoop_security
 
Big data spain keynote nov 2016
Big data spain keynote nov 2016Big data spain keynote nov 2016
Big data spain keynote nov 2016
 
Discover HDP 2.1: Apache Falcon for Data Governance in Hadoop
Discover HDP 2.1: Apache Falcon for Data Governance in HadoopDiscover HDP 2.1: Apache Falcon for Data Governance in Hadoop
Discover HDP 2.1: Apache Falcon for Data Governance in Hadoop
 
Discover Enterprise Security Features in Hortonworks Data Platform 2.1: Apach...
Discover Enterprise Security Features in Hortonworks Data Platform 2.1: Apach...Discover Enterprise Security Features in Hortonworks Data Platform 2.1: Apach...
Discover Enterprise Security Features in Hortonworks Data Platform 2.1: Apach...
 
The Enterprise and Connected Data, Trends in the Apache Hadoop Ecosystem by A...
The Enterprise and Connected Data, Trends in the Apache Hadoop Ecosystem by A...The Enterprise and Connected Data, Trends in the Apache Hadoop Ecosystem by A...
The Enterprise and Connected Data, Trends in the Apache Hadoop Ecosystem by A...
 
Hive edw-dataworks summit-eu-april-2017
Hive edw-dataworks summit-eu-april-2017Hive edw-dataworks summit-eu-april-2017
Hive edw-dataworks summit-eu-april-2017
 
An Apache Hive Based Data Warehouse
An Apache Hive Based Data WarehouseAn Apache Hive Based Data Warehouse
An Apache Hive Based Data Warehouse
 
Saving the elephant—now, not later
Saving the elephant—now, not laterSaving the elephant—now, not later
Saving the elephant—now, not later
 
Micro services vs hadoop
Micro services vs hadoopMicro services vs hadoop
Micro services vs hadoop
 

Dernier

[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessPixlogix Infotech
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 

Dernier (20)

[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 

Curb your insecurity with HDP - Tips for a Secure Cluster

  • 1. Curb  Your  Insecurity  with    HDP   Tips  for  a  Secure  Cluster  (with  Spark  too)       Ancil  McBarneA   Senior  Solu*ons  Engineer  –  Security  &   Governance     Future  of  Data  Meetup  –  New  York   June  2nd,  2016  
  • 2. 2   ©  Hortonworks  Inc.  2011  –  2016.  All  Rights  Reserved   Hadoop Security in 4 Steps
  • 3. 3   ©  Hortonworks  Inc.  2011  –  2016.  All  Rights  Reserved   Agenda •  Introduction to Hadoop Security –  The  4  Steps  to  Hadoop  Security   •  Authentication with Kerbeos –  Integra*on  with  LDAP   •  Authorization with Apache Ranger –  Hive,  HDFS,  YARN   •  Rest API Security with Apache Knox –  WebHDFS   –  Hive   •  Encrypt the Data/ Data Protection –  Transparent  Data  Encryp*on  and  KMS  
  • 4. 4   ©  Hortonworks  Inc.  2011  –  2016.  All  Rights  Reserved   How  do  I  set  policy  across  the  en*re  cluster?   Who  am  I/prove  it?   What  can  I  do?   What  did  I  do?   How  can  I  encrypt  at  rest  and  over  the  wire?   Comprehensive  Approach  to  Security   Data  ProtecDon   Protect  data  at  rest  and  in  mo*on   In  order  to  protect  any  data  system  you  must  implement  the  following:   Audit   Maintain  a  record  of  data  access   AuthorizaDon   Provision  access  to  data   AuthenDcaDon   Authen*cate  users  and  systems   AdministraDon   Central  management  and  consistent  security  
  • 5. 5   ©  Hortonworks  Inc.  2011  –  2016.  All  Rights  Reserved   HDP  Security:  Comprehensive,  Complete,  Extensible   Data  ProtecDon   Protect  data  at  rest  and  in  mo*on   Security  in  HDP  is  the  most  comprehensive,  complete  and  extensible  for  Hadoop   Audit   Maintain  a  record  of  data  access   AuthorizaDon   Provision  access  to  data   AuthenDcaDon   Authen*cate  users  and  systems   AdministraDon   Central  management  and  consistent  security   Single  administra*ve  console  to  set  policy  across   the  en*re  cluster:  Apache  Ranger   Authen*ca*on  for  perimeter  and  cluster;   integrates  with  exis*ng  Ac*ve  Directory  and   LDAP  solu*ons:  Kerberos    |    Apache  Knox   Consistent  authoriza*on  controls  across  all   Apache  components  within  HDP:  Apache  Ranger   Record  of  data  access  events  across  all   components  that  is  consistent  and  accessible:   Apache  Ranger       Encrypts  data  in  mo*on  and  data  at  rest;  refer   partner  encryp*on  solu*ons  for  broader  needs:   HDFS  TDE  with  Ranger  KMS  
  • 6. 6   ©  Hortonworks  Inc.  2011  –  2016.  All  Rights  Reserved   Security: Rings of Defense Perimeter  Level  Security   •  Network  Security  (i.e.  Firewalls)   •  Apache  Knox  (i.e.  Gateways)   AuthenDcaDon   •  Kerberos   OS  Security   AuthorizaDon   •  MR  ACLs   •  HDFS  Permissions   •  HDFS  ACLs   •  HiveATZ-­‐NG   •  HBase  ACLs   •  Accumulo  Label  Security  
  • 7. 7   ©  Hortonworks  Inc.  2011  –  2016.  All  Rights  Reserved   AuthenDcaDon  with  Kerberos  
  • 8. 8   ©  Hortonworks  Inc.  2011  –  2016.  All  Rights  Reserved   Security  Without  Kerberos  
  • 9. 9   ©  Hortonworks  Inc.  2011  –  2016.  All  Rights  Reserved   Configure  Kerberos  –  Ambari  Wizard  
  • 10. 10   ©  Hortonworks  Inc.  2011  –  2016.  All  Rights  Reserved   Security  With  Kerberos  
  • 11. 11   ©  Hortonworks  Inc.  2011  –  2016.  All  Rights  Reserved   Apache  Ranger    
  • 12. 12   ©  Hortonworks  Inc.  2011  –  2016.  All  Rights  Reserved   Apache  Ranger  
  • 13. 13   ©  Hortonworks  Inc.  2011  –  2016.  All  Rights  Reserved   Centralized  Security  with  Ranger   •  Administrators have complete visibility into the security administration process Deep  Visibility  Centralized  PlaVorm   •  Administer security for: – Database   – Table   – Column   – LDAP  Groups   – Specific  Users   Fine-­‐Grained  Security   DefiniDon   •  Centralized platform to define, administer and manage security policies consistently •  Define security policy once and apply it to all the applicable components across the stack
  • 14. 14   ©  Hortonworks  Inc.  2011  –  2016.  All  Rights  Reserved   HDFS  File  Security  
  • 15. 15   ©  Hortonworks  Inc.  2011  –  2016.  All  Rights  Reserved   Hive  Database  and  Table  Security  
  • 16. 16   ©  Hortonworks  Inc.  2011  –  2016.  All  Rights  Reserved   Authorization and Audit Authorization Fine  grain  access  control   •  HDFS  –  Folder,  File   •  Hive  –  Database,  Table,  Column   •  HBase  –  Table,  Column  Family,  Column   •  Storm,  Knox  and  more     Audit Extensive  user  access  audi*ng  in     HDFS,  Hive  and  HBase   •  IP  Address   •  Resource  type/  resource   •  Timestamp   •  Access  granted  or  denied   Control  access   into  system   Flexibility   in  defining   policies  
  • 17. 17   ©  Hortonworks  Inc.  2011  –  2016.  All  Rights  Reserved   Rest  API  Security  with  Apache  Knox    
  • 18. 18   ©  Hortonworks  Inc.  2011  –  2016.  All  Rights  Reserved   AuthenDcaDon—API  Security  with  Knox   Eliminates SSH “edge node” Central API management Central audit control Service level authorization SSO Integration—Siteminder and OAM LDAP and AD integration Incubated  and  led  by  Hortonworks,     Apache  Knox  extends  the  reach  of  Hadoop  REST  API   without  Kerberos  complexi*es   Integrated  with  exisDng  systems  to   simplify  idenDty  maintenance   Single,  simple  point  of  access  for  a   cluster   Central  controls  ensure  consistency   across  one  or  more  clusters   Kerberos Encapsulation Single Hadoop access point REST API hierarchy Consolidated API calls Multi-cluster support
  • 19. 19   ©  Hortonworks  Inc.  2011  –  2016.  All  Rights  Reserved   Load  Balancer   Extend Hadoop API reach with Knox Hadoop  Cluster   Applica*on  Tier  App  A   App  N  App  B   App  C   Data  Ingest   ETL   Admin/   Operators   Bas*an  Node   SSH   RPC  Call   Falcon   Oozie   Scoop   Flume   Data   Operator   Business   User   Hadoop    Admin   JDBC/ODBC  REST/HTTP   Knox  
  • 20. 20   ©  Hortonworks  Inc.  2011  –  2016.  All  Rights  Reserved   Hadoop REST APIs Ã Useful for connecting to Hadoop from the outside the cluster Ã When more client language flexibility is required –  i.e.  Java  binding  not  an  op*on   Ã Challenges –  Client  must  have  knowledge  of  cluster  topology   –  Required  to  open  ports  (and  in  some  cases,  on  every  host)  outside  the  cluster   Service   API   WebHDFS   Supports  HDFS  user  opera*ons  including  reading  files,  wri*ng  to   files,  making  directories,  changing  permissions  and  renaming.   WebHCat   Job  control  for  MapReduce,  Pig  and  Hive  jobs,  and  HCatalog  DDL   commands.  Learn  more  about  WebHCat.   Hive   Hive  REST  API  opera*ons   HBase   HBase  REST  API  opera*ons   Oozie   Job  submission  and  management,  and  Oozie  administra*on.  
  • 21. 21   ©  Hortonworks  Inc.  2011  –  2016.  All  Rights  Reserved   Hadoop REST API with Knox – Representative Examples Service   Direct  URL   Knox  URL   WebHDFS   hkp://namenode-­‐host:50070/webhdfs     hkps://knox-­‐host:8443/webhdfs   WebHCat   hkp://webhcat-­‐host:50111/templeton     hkps://knox-­‐host:8443/templeton     Oozie   hkp://ooziehost:11000/oozie     hkps://knox-­‐host:8443/oozie     Hbase/ Stargate   hkp://hbasehost:60080     hkps://knox-­‐host:8443/hbase     Hive   hkp://hivehost:10001/cliservice   hkps://knox-­‐host:8443/hive   YARN   hkp://yarn-­‐host:yarn-­‐port/ws   hkps://knox-­‐host:8443/resourcemanager  
  • 22. 23   ©  Hortonworks  Inc.  2011  –  2016.  All  Rights  Reserved   Data  ProtecDon  
  • 23. 24   ©  Hortonworks  Inc.  2011  –  2016.  All  Rights  Reserved   Security  in  Hadoop  with  HDP     •  Wire  encryp*on  in   Hadoop   •  HDFS  Encryp*on   with  Ranger  KMS     •  Centralized  audit   repor*ng  with   Apache  Ranger     •  Fine-­‐grain  access   control  with     Apache  Ranger   AuthorizaDon   What  can  I  do?   Audit   What  did  I  do?   Data  ProtecDon   Can  data  be  encrypted  at  rest   and  over  the  wire?   •  Kerberos   •  API  security  with  Apache   Knox   AuthenDcaDon   Who  am  I/prove  it?   HDP  2.4     Centralized  Security  AdministraDon  with  Ranger      
  • 24. 25   ©  Hortonworks  Inc.  2011  –  2016.  All  Rights  Reserved   Data Protection HDP allows you to apply data protection policy at different layers across the Hadoop stack Layer   What?   How  ?   Storage  and   Access   Encrypt  data  while  it  is  at  rest   HDFS  Transparent  Data  Encryp*on,  Partners,   Hbase  encryp*on,  OS  level  encrypt,     Transmission   Encrypt  data  as  it  moves   SSL,  SASL,  Supported  from  HDP  2.1  
  • 25. 26   ©  Hortonworks  Inc.  2011  –  2016.  All  Rights  Reserved   Points  of  CommunicaDon   Page  26   WebHDFS   DataTransferProtocol   Nodes   M/R  Shuffle   Client   1   2   4   RPC  3   Nodes   DataTransfer  2   JDBC/ODBC   3   Hadoop  Cluster   RPC   4  
  • 26. 27   ©  Hortonworks  Inc.  2011  –  2016.  All  Rights  Reserved   Data  ProtecDon  -­‐  HDFS  EncrypDon   DATA    ACCESS       DATA    MANAGEMENT   SECURITY  PARTNERS      YARN   KeyProvider  API   (partner  integra*on  point)               Key  Management  System  (KMS)   Stateless  Key  Management   °   1   °   °   °   °   °   °   °   °   °   °   °   °   °   N  °   1   °   °   °   °   °   °   °   °   °   °   °   °   °   °   °   °   °   °   °   °   °   °   °   °   °   °   °   °   °   °   °   °   °   °   °   °   °   °   HDFS     EncrypDon  Zone     Encrypted    File   Encrypted    File   Encrypted    File   Encrypted    File   Encrypted    Files   Name   Node   HDFS   Client   HDFS   Client   •  Hortonworks  collabora*ng  with  partners  to  deliver  enterprise  scale       Key  Management  ,  deliver  more  choices  to  customers   •  Open  source  KMS    with  Ranger   •  Or  Partner  with  Voltage  KMS   -  Partner  joint  engineering  resources   -  Voltage  Stateless  Key  Management  integrated  with  KeyProvider  API     Only  HDP  offers  open   source  and   commercial  choices   for  key  management  Open  Source  Key  Management  
  • 27. 28   ©  Hortonworks  Inc.  2011  –  2016.  All  Rights  Reserved   Demo  Transparent  Data  EncrypDon  
  • 28. 29   ©  Hortonworks  Inc.  2011  –  2016.  All  Rights  Reserved   Security in Spark? Spark supports running in a Kerberized Cluster Only Spark on YARN supports security (Kerberos support) From command line run kinit before submitting spark jobs Spark reads data from HDFS & ORC •  HDFS file permissions (& Ranger integration) applicable to Spark jobs Spark submits job to YARN queue •  YARN queue ACL (& Ranger integration) applicable to Spark jobs Wire Encryption •  Spark has some coverage, not all channels are covered LDAP Authentication •  No Authentication in Spark UI OOB, supports filter for hooking in LDAP
  • 29. 30   ©  Hortonworks  Inc.  2011  –  2016.  All  Rights  Reserved   What  makes  Hadoop  Summit  Different?   – Deep  technical  sessions  chosen  by  the  community     – Business  Track  based  on  real-­‐world  implementa*ons   – Keynotes  from  Progressive  Insurance,  Ford,  Macy’s,     MD  Anderson,  GE,  Capital  One,  …     – Free  Hands-­‐on  labs   – Networking  events  and  10  Year  Celebra*on!     –   20%  Off  Code:  16SJext20x   Apache  Hadoop,  SPARK,  IoT,  Streaming,  Data  Science   EVERYTHING  DATA!