SlideShare une entreprise Scribd logo
1  sur  44
Télécharger pour lire hors ligne
Scalable L2 overlay
networks with routed VXLAN
/ MP-BGP EVPN
Stefan Kooman
AKA
HELP! Network Operations took
over Hypervisor networking!
Welcome

About me

About BIT

About this talk
Benefits of BGP EVPN / VXLAN

Overcome 4K VLAN limit

Customers can use own V(x)LAN schema

Strech L2 Segment accross DC

Reduces flooding traffic (Arp suppresion)

MAC mobility (VM migration)

Egress load balancing (ECMP)
Our “old” DC network setup
Limitations and problems
old DC network

Bandwidth limitation

STP

Slow convergence

IPv6 routing in CPU

Unsupported gear (too old)
Requirements for
new DC network

Open standards and protocols

Must work in a IPv6 only setup

No more STP (all active links ?? + L3 only ??)

Bandwidth on demand

All current customer setups must be supported

Must work with our current billing software
What if...

Every switch tracks all attached hosts / IP addresses

Switch creates a host route (/32 in IPv4 world, /128 in IPv6
world) for every directly-attached IP host

Host routes are redistributed into a routing protocol,
allowing every other switch in the network to route towards
any other host

Traffic to unknown destinations is dropped instead of
forwarded out all ports

Possible ???
BGP in control
VXLAN in summary
VXLAN in summary

RFC7348

24 bits VNI field

Minimum recommended L2 MTU = VM MTU + 50 bytes

Recommended L2 >1600 bytes (VLAN tags and IPv6)

SRC UDP port = payload hash (inner-ethernet header)

DST UDP port = 4789 (linux default is 8472)
EVPN in summary

RFC7432

Multi-tenant control plane for L2/L3 VPNs

Uses a new BGP address family

Works with many data-plane encapsulations

Carries IP+MAC reachability information

MAC/IP advertisement (EVPN route type 2)

VTEP advertisments (EVPN route type 3)

IP prefix route (EVPN route type 5)
Underlay
IP network
(BGP)
Hypervisor
VTEP
Virtual
Ethernet
Segments
(VXLAN)
VM
https://vincent.bernat.ch/en/blog/2017-vxlan-bgp-evpn
Underlay network
Underlay network single purpose is to ensure reachability
of the loopback interfaces, because there are used as
VxLAN tunnel endpoints!
BIT approach
Our switch hardware
7050QX
7050SX
Underlay design

Only one address family needed in the underlay

Only p2p /31 links between spines and leafs

One AS for all spines and one unique AS per switch pair

eBGP to make loopbacks (VTEPs) reachable

BGP timers tweaked; no BFD needed

Very simple to setup and troubleshoot

MTU >9000

Less than 300 routes in BGP for whole underlay setup which
makes convergence really Speedy (Gonzales)
Overlay design

Dual stack address family

One overlay AS (65101) for all spines and leafs

Spines are BGP RR’s for overlay network

BGP timers tweaked; no BFD needed

Line failure in core network has no impact on overlay RIB
which makes convergence sub-second.

All overlay (production) traffic in non-defaultVrf

VRF_ID * 10000 + VLAN_ID = VNI_ID
Overlay design

MC-LAG as first hop redundancy

vARP (all active gateway)

No access to defaultVrf; No unwanted tunnel access

Loopback as source for icmp replies

Filter advertised routes learned from spines
EVPN + VXLAN on Hypervisor
Required:
Linux distro with kernel >= 4.5
FRRouting >= 5.1dev build with cumulus option
Recommended:
ifupdown2 >=1.0
Step 1: Underlay

1 or more /31 uplink(s)

1 loopback address in defaultVrf for VTEP endpoint

MgmtVrf for in-band management (netns)

BGP session(s) on uplink(s) to leaf switches

Make loopback reachable to all other loopbacks/vteps

MTU >1600
Step 2: Overlay Data Plane

Create VRF (internetVrf)

Create at least two bridges (L2VNI + L3VNI) per VRF

Create 1 VTEP for each bridge with ip address of loopback

Attach VTEP interface to bridge

Attach VNET interface to bridge

Configure mac + ip address on L2VNI bridge

Filter ARP traffic on VTEP interface

Enable forwarding + sysctl tuning
Create VRF
ip link add internetVrf type vrf table 1000
ip link set internetVrf up
Create two bridges
brctl addbr br-vlan601
ip link set br-vlan601 master internetVrf
ip link set br-vlan601 up
brctl addbr br-vlan4003
ip link set br-vlan4003 master internetVrf
ip link set br-vlan4003 up
Create VTEP for each bridge
ip link add vtep10601 type vxlan id 10601 proxy
nolearning dstport 4789 local 213.136.24.130
ip link add vtep20003 type vxlan id 20003 proxy
nolearning dstport 4789 local 213.136.24.130
Attach VTEP + VNET to bridge
ip link set vtep10601 mtu 9000
ip link set vtep10601 up
brctl addif br-vlan601 vtep10601
ip link set vtep20003 mtu 9000
ip link set vtep20003 up
brctl addif br-vlan4003 vtep20003
Configure MAC + IP, drop ARP
ip addr add 213.136.24.161/28 dev br-vlan601
ip link set dev br-vlan601 address 02:62:69:74:67:77
ebtables -A OUTPUT -p arp -o vtep+ -j DROP
Sysctl tuning
sysctl -qw net.ipv4.conf.vnet.proxy_arp=1
sysctl -qw net.ipv4.neigh.vnet.proxy_delay=0
sysctl -qw net.ipv4.conf.all.forwarding=1
sysctl -qw net.ipv6.conf.all.forwarding=1
sysctl -qw net.ipv6.route.max_size=4194304
OpenNebula support
cat /var/lib/one/remotes/etc/vnm/OpenNebulaNetwork.conf

...
# Multicast protocol for multi destination BUM traffic. Options:
# - multicast, for IP multicast
# - evpn, for BGP EVPN control plane
:vxlan_mode: evpn
# Tunnel endpoint communication type. Only for evpn vxlan_mode.
# - dev, tunnel endpoint communication is sent to PHYDEV
# - local_ip, first ip addr of PHYDEV is used as address for the communiation
:vxlan_tep: local_ip
# Additional ip link options, uncomment the following to disable learning for EVPN mode
:ip_link_conf:
:nolearning:
:proxy:
:srcport: 49152 65535
:dstport: 4789
Step 3: Overlay Control Plane
/etc/network/interfaces
Credits
Jeroen Louwes (BIT) ← as he did all the work
Sebastian Mangelkramer (convince ONE to
integrate in ONE, issue #2161)
Vincent Bernat (great blogs posts about everything
networking, including numerous examples on L3
routing to hypervisors)
Symmetric IRB vs Asymmetric IRB

Asymmetrical IRB
Route on ingress switch
Bridge from ingress switch to destination MAC
Ingress switch needs MAC-IP entries for all destinations
More easy to troubleshoot
Symmetric IRB vs Asymmetric IRB

Symmetrical IRB
Route on ingress switch
intermediate segment across network (L3VNI)
Route on egress switch
Requires extra intermediate VNI per VRF
Scalable
Future wishes and plans

Migrating BGP RR role from spine switches to external

Unknown unicast block

IPv6 only underlay

All hypervisors (and AP’s) are VTEPs (this talk)

Upgrade current core-ring from EAPS (Extreme) to VXLAN
Spines BGP config (Arista)
Leafs BGP config (Arista)

Contenu connexe

Tendances

Chassis Cluster Configuration
Chassis Cluster ConfigurationChassis Cluster Configuration
Chassis Cluster Configuration
Kashif Latif
 
[OpenStack 하반기 스터디] Interoperability with ML2: LinuxBridge, OVS and SDN
[OpenStack 하반기 스터디] Interoperability with ML2: LinuxBridge, OVS and SDN[OpenStack 하반기 스터디] Interoperability with ML2: LinuxBridge, OVS and SDN
[OpenStack 하반기 스터디] Interoperability with ML2: LinuxBridge, OVS and SDN
OpenStack Korea Community
 
Virtualized network with openvswitch
Virtualized network with openvswitchVirtualized network with openvswitch
Virtualized network with openvswitch
Sim Janghoon
 

Tendances (20)

Vxlan control plane and routing
Vxlan control plane and routingVxlan control plane and routing
Vxlan control plane and routing
 
OpenvSwitch Deep Dive
OpenvSwitch Deep DiveOpenvSwitch Deep Dive
OpenvSwitch Deep Dive
 
The Basic Introduction of Open vSwitch
The Basic Introduction of Open vSwitchThe Basic Introduction of Open vSwitch
The Basic Introduction of Open vSwitch
 
Openstack Neutron & Interconnections with BGP/MPLS VPNs
Openstack Neutron & Interconnections with BGP/MPLS VPNsOpenstack Neutron & Interconnections with BGP/MPLS VPNs
Openstack Neutron & Interconnections with BGP/MPLS VPNs
 
Introduction to vxlan
Introduction to vxlanIntroduction to vxlan
Introduction to vxlan
 
Interconnecting Neutron and Network Operators' BGP VPNs
Interconnecting Neutron and Network Operators' BGP VPNsInterconnecting Neutron and Network Operators' BGP VPNs
Interconnecting Neutron and Network Operators' BGP VPNs
 
VLAN vs VXLAN
VLAN vs VXLANVLAN vs VXLAN
VLAN vs VXLAN
 
Building DataCenter networks with VXLAN BGP-EVPN
Building DataCenter networks with VXLAN BGP-EVPNBuilding DataCenter networks with VXLAN BGP-EVPN
Building DataCenter networks with VXLAN BGP-EVPN
 
Chassis Cluster Configuration
Chassis Cluster ConfigurationChassis Cluster Configuration
Chassis Cluster Configuration
 
[OpenStack 하반기 스터디] Interoperability with ML2: LinuxBridge, OVS and SDN
[OpenStack 하반기 스터디] Interoperability with ML2: LinuxBridge, OVS and SDN[OpenStack 하반기 스터디] Interoperability with ML2: LinuxBridge, OVS and SDN
[OpenStack 하반기 스터디] Interoperability with ML2: LinuxBridge, OVS and SDN
 
Overview of Distributed Virtual Router (DVR) in Openstack/Neutron
Overview of Distributed Virtual Router (DVR) in Openstack/NeutronOverview of Distributed Virtual Router (DVR) in Openstack/Neutron
Overview of Distributed Virtual Router (DVR) in Openstack/Neutron
 
MPLS L3 VPN Deployment
MPLS L3 VPN DeploymentMPLS L3 VPN Deployment
MPLS L3 VPN Deployment
 
Border Gatway Protocol
Border Gatway ProtocolBorder Gatway Protocol
Border Gatway Protocol
 
Site-to-Site IPSEC VPN Between Cisco ASA and Pfsense
Site-to-Site IPSEC VPN Between Cisco ASA and PfsenseSite-to-Site IPSEC VPN Between Cisco ASA and Pfsense
Site-to-Site IPSEC VPN Between Cisco ASA and Pfsense
 
IPSec VPN & IPSec Protocols
IPSec VPN & IPSec ProtocolsIPSec VPN & IPSec Protocols
IPSec VPN & IPSec Protocols
 
Data center network reference architecture with hpe flex fabric
Data center network reference architecture with hpe flex fabricData center network reference architecture with hpe flex fabric
Data center network reference architecture with hpe flex fabric
 
Virtualized network with openvswitch
Virtualized network with openvswitchVirtualized network with openvswitch
Virtualized network with openvswitch
 
OpenShift Kubernetes Native Infrastructure for 5GC and Telco Edge Cloud
OpenShift  Kubernetes Native Infrastructure for 5GC and Telco Edge Cloud OpenShift  Kubernetes Native Infrastructure for 5GC and Telco Edge Cloud
OpenShift Kubernetes Native Infrastructure for 5GC and Telco Edge Cloud
 
VXLAN and FRRouting
VXLAN and FRRoutingVXLAN and FRRouting
VXLAN and FRRouting
 
News And Development Update Of The CloudStack Tungsten Fabric SDN Plug-in
News And Development Update Of The CloudStack Tungsten Fabric SDN Plug-inNews And Development Update Of The CloudStack Tungsten Fabric SDN Plug-in
News And Development Update Of The CloudStack Tungsten Fabric SDN Plug-in
 

Similaire à OpenNebulaConf2018 - Scalable L2 overlay networks with routed VXLAN / BGP EVPN - Stefan Kooman - BIT

Cisco data center support
Cisco data center supportCisco data center support
Cisco data center support
Krunal Shah
 

Similaire à OpenNebulaConf2018 - Scalable L2 overlay networks with routed VXLAN / BGP EVPN - Stefan Kooman - BIT (20)

Day one-poster-vpns
Day one-poster-vpnsDay one-poster-vpns
Day one-poster-vpns
 
VXLAN BGP EVPN: Technology Building Blocks
VXLAN BGP EVPN: Technology Building BlocksVXLAN BGP EVPN: Technology Building Blocks
VXLAN BGP EVPN: Technology Building Blocks
 
Flexible NFV WAN interconnections with Neutron BGP VPN
 Flexible NFV WAN interconnections with Neutron BGP VPN Flexible NFV WAN interconnections with Neutron BGP VPN
Flexible NFV WAN interconnections with Neutron BGP VPN
 
Xpress path vxlan_bgp_evpn_appricot2019-v2_
Xpress path vxlan_bgp_evpn_appricot2019-v2_Xpress path vxlan_bgp_evpn_appricot2019-v2_
Xpress path vxlan_bgp_evpn_appricot2019-v2_
 
ODA000017 MPLS VPN(L3).ppt
ODA000017 MPLS VPN(L3).pptODA000017 MPLS VPN(L3).ppt
ODA000017 MPLS VPN(L3).ppt
 
evpn_in_service_provider_network-web.pdf
evpn_in_service_provider_network-web.pdfevpn_in_service_provider_network-web.pdf
evpn_in_service_provider_network-web.pdf
 
Mpls vpn.rip
Mpls vpn.ripMpls vpn.rip
Mpls vpn.rip
 
OpenStack MeetUp - OpenContrail Presentation
OpenStack MeetUp - OpenContrail PresentationOpenStack MeetUp - OpenContrail Presentation
OpenStack MeetUp - OpenContrail Presentation
 
Cisco data center support
Cisco data center supportCisco data center support
Cisco data center support
 
Thebasicintroductionofopenvswitch
ThebasicintroductionofopenvswitchThebasicintroductionofopenvswitch
Thebasicintroductionofopenvswitch
 
Openstack openswitch basics
Openstack openswitch basicsOpenstack openswitch basics
Openstack openswitch basics
 
EVPN-Applications.pdf
EVPN-Applications.pdfEVPN-Applications.pdf
EVPN-Applications.pdf
 
PLNOG 13: Emil Gągała: EVPN – rozwiązanie nie tylko dla Data Center
PLNOG 13: Emil Gągała: EVPN – rozwiązanie nie tylko dla Data CenterPLNOG 13: Emil Gągała: EVPN – rozwiązanie nie tylko dla Data Center
PLNOG 13: Emil Gągała: EVPN – rozwiązanie nie tylko dla Data Center
 
VXLAN Distributed Service Node
VXLAN Distributed Service NodeVXLAN Distributed Service Node
VXLAN Distributed Service Node
 
Dc fabric path
Dc fabric pathDc fabric path
Dc fabric path
 
IP Infusion Application Note for 4G LTE Fixed Wireless Access
IP Infusion Application Note for 4G LTE Fixed Wireless AccessIP Infusion Application Note for 4G LTE Fixed Wireless Access
IP Infusion Application Note for 4G LTE Fixed Wireless Access
 
20 - IDNOG03 - Franki Lim (ARISTA) - Overlay Networking with VXLAN
20 - IDNOG03 - Franki Lim (ARISTA) - Overlay Networking with VXLAN20 - IDNOG03 - Franki Lim (ARISTA) - Overlay Networking with VXLAN
20 - IDNOG03 - Franki Lim (ARISTA) - Overlay Networking with VXLAN
 
Branching out with SDN
Branching out with SDNBranching out with SDN
Branching out with SDN
 
PLNOG15: Is there something less complicated than connecting two LAN networks...
PLNOG15: Is there something less complicated than connecting two LAN networks...PLNOG15: Is there something less complicated than connecting two LAN networks...
PLNOG15: Is there something less complicated than connecting two LAN networks...
 
Mpls
MplsMpls
Mpls
 

Plus de OpenNebula Project

OpenNebulaConf2019 - Building Virtual Environments for Security Analyses of C...
OpenNebulaConf2019 - Building Virtual Environments for Security Analyses of C...OpenNebulaConf2019 - Building Virtual Environments for Security Analyses of C...
OpenNebulaConf2019 - Building Virtual Environments for Security Analyses of C...
OpenNebula Project
 

Plus de OpenNebula Project (20)

OpenNebulaConf2019 - Welcome and Project Update - Ignacio M. Llorente, Rubén ...
OpenNebulaConf2019 - Welcome and Project Update - Ignacio M. Llorente, Rubén ...OpenNebulaConf2019 - Welcome and Project Update - Ignacio M. Llorente, Rubén ...
OpenNebulaConf2019 - Welcome and Project Update - Ignacio M. Llorente, Rubén ...
 
OpenNebulaConf2019 - Building Virtual Environments for Security Analyses of C...
OpenNebulaConf2019 - Building Virtual Environments for Security Analyses of C...OpenNebulaConf2019 - Building Virtual Environments for Security Analyses of C...
OpenNebulaConf2019 - Building Virtual Environments for Security Analyses of C...
 
OpenNebulaConf2019 - CORD and Edge computing with OpenNebula - Alfonso Aureli...
OpenNebulaConf2019 - CORD and Edge computing with OpenNebula - Alfonso Aureli...OpenNebulaConf2019 - CORD and Edge computing with OpenNebula - Alfonso Aureli...
OpenNebulaConf2019 - CORD and Edge computing with OpenNebula - Alfonso Aureli...
 
OpenNebulaConf2019 - 6 years (+) OpenNebula - Lessons learned - Sebastian Man...
OpenNebulaConf2019 - 6 years (+) OpenNebula - Lessons learned - Sebastian Man...OpenNebulaConf2019 - 6 years (+) OpenNebula - Lessons learned - Sebastian Man...
OpenNebulaConf2019 - 6 years (+) OpenNebula - Lessons learned - Sebastian Man...
 
OpenNebulaConf2019 - Performant and Resilient Storage the Open Source & Linux...
OpenNebulaConf2019 - Performant and Resilient Storage the Open Source & Linux...OpenNebulaConf2019 - Performant and Resilient Storage the Open Source & Linux...
OpenNebulaConf2019 - Performant and Resilient Storage the Open Source & Linux...
 
OpenNebulaConf2019 - Image Backups in OpenNebula - Momčilo Medić - ITAF
OpenNebulaConf2019 - Image Backups in OpenNebula - Momčilo Medić - ITAFOpenNebulaConf2019 - Image Backups in OpenNebula - Momčilo Medić - ITAF
OpenNebulaConf2019 - Image Backups in OpenNebula - Momčilo Medić - ITAF
 
OpenNebulaConf2019 - How We Use GOCA to Manage our OpenNebula Cloud - Jean-Ph...
OpenNebulaConf2019 - How We Use GOCA to Manage our OpenNebula Cloud - Jean-Ph...OpenNebulaConf2019 - How We Use GOCA to Manage our OpenNebula Cloud - Jean-Ph...
OpenNebulaConf2019 - How We Use GOCA to Manage our OpenNebula Cloud - Jean-Ph...
 
OpenNebulaConf2019 - Crytek: A Video gaming Edge Implementation "on the shoul...
OpenNebulaConf2019 - Crytek: A Video gaming Edge Implementation "on the shoul...OpenNebulaConf2019 - Crytek: A Video gaming Edge Implementation "on the shoul...
OpenNebulaConf2019 - Crytek: A Video gaming Edge Implementation "on the shoul...
 
Replacing vCloud with OpenNebula
Replacing vCloud with OpenNebulaReplacing vCloud with OpenNebula
Replacing vCloud with OpenNebula
 
NTS: What We Do With OpenNebula - and Why We Do It
NTS: What We Do With OpenNebula - and Why We Do ItNTS: What We Do With OpenNebula - and Why We Do It
NTS: What We Do With OpenNebula - and Why We Do It
 
OpenNebula from the Perspective of an ISP
OpenNebula from the Perspective of an ISPOpenNebula from the Perspective of an ISP
OpenNebula from the Perspective of an ISP
 
NTS CAPTAIN / OpenNebula at Julius Blum GmbH
NTS CAPTAIN / OpenNebula at Julius Blum GmbHNTS CAPTAIN / OpenNebula at Julius Blum GmbH
NTS CAPTAIN / OpenNebula at Julius Blum GmbH
 
Performant and Resilient Storage: The Open Source & Linux Way
Performant and Resilient Storage: The Open Source & Linux WayPerformant and Resilient Storage: The Open Source & Linux Way
Performant and Resilient Storage: The Open Source & Linux Way
 
NetApp Hybrid Cloud with OpenNebula
NetApp Hybrid Cloud with OpenNebulaNetApp Hybrid Cloud with OpenNebula
NetApp Hybrid Cloud with OpenNebula
 
NSX with OpenNebula - upcoming 5.10
NSX with OpenNebula - upcoming 5.10NSX with OpenNebula - upcoming 5.10
NSX with OpenNebula - upcoming 5.10
 
Security for Private Cloud Environments
Security for Private Cloud EnvironmentsSecurity for Private Cloud Environments
Security for Private Cloud Environments
 
CheckPoint R80.30 Installation on OpenNebula
CheckPoint R80.30 Installation on OpenNebulaCheckPoint R80.30 Installation on OpenNebula
CheckPoint R80.30 Installation on OpenNebula
 
DE-CIX: CloudConnectivity
DE-CIX: CloudConnectivityDE-CIX: CloudConnectivity
DE-CIX: CloudConnectivity
 
DDC Demo
DDC DemoDDC Demo
DDC Demo
 
Cloud Disaggregation with OpenNebula
Cloud Disaggregation with OpenNebulaCloud Disaggregation with OpenNebula
Cloud Disaggregation with OpenNebula
 

Dernier

%+27788225528 love spells in Toronto Psychic Readings, Attraction spells,Brin...
%+27788225528 love spells in Toronto Psychic Readings, Attraction spells,Brin...%+27788225528 love spells in Toronto Psychic Readings, Attraction spells,Brin...
%+27788225528 love spells in Toronto Psychic Readings, Attraction spells,Brin...
masabamasaba
 
Abortion Pills In Pretoria ](+27832195400*)[ 🏥 Women's Abortion Clinic In Pre...
Abortion Pills In Pretoria ](+27832195400*)[ 🏥 Women's Abortion Clinic In Pre...Abortion Pills In Pretoria ](+27832195400*)[ 🏥 Women's Abortion Clinic In Pre...
Abortion Pills In Pretoria ](+27832195400*)[ 🏥 Women's Abortion Clinic In Pre...
Medical / Health Care (+971588192166) Mifepristone and Misoprostol tablets 200mg
 
%+27788225528 love spells in Huntington Beach Psychic Readings, Attraction sp...
%+27788225528 love spells in Huntington Beach Psychic Readings, Attraction sp...%+27788225528 love spells in Huntington Beach Psychic Readings, Attraction sp...
%+27788225528 love spells in Huntington Beach Psychic Readings, Attraction sp...
masabamasaba
 
%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...
%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...
%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...
masabamasaba
 

Dernier (20)

%+27788225528 love spells in Toronto Psychic Readings, Attraction spells,Brin...
%+27788225528 love spells in Toronto Psychic Readings, Attraction spells,Brin...%+27788225528 love spells in Toronto Psychic Readings, Attraction spells,Brin...
%+27788225528 love spells in Toronto Psychic Readings, Attraction spells,Brin...
 
Devoxx UK 2024 - Going serverless with Quarkus, GraalVM native images and AWS...
Devoxx UK 2024 - Going serverless with Quarkus, GraalVM native images and AWS...Devoxx UK 2024 - Going serverless with Quarkus, GraalVM native images and AWS...
Devoxx UK 2024 - Going serverless with Quarkus, GraalVM native images and AWS...
 
WSO2CON 2024 - Building the API First Enterprise – Running an API Program, fr...
WSO2CON 2024 - Building the API First Enterprise – Running an API Program, fr...WSO2CON 2024 - Building the API First Enterprise – Running an API Program, fr...
WSO2CON 2024 - Building the API First Enterprise – Running an API Program, fr...
 
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...
 
%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein
%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein
%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein
 
OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...
OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...
OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...
 
Abortion Pills In Pretoria ](+27832195400*)[ 🏥 Women's Abortion Clinic In Pre...
Abortion Pills In Pretoria ](+27832195400*)[ 🏥 Women's Abortion Clinic In Pre...Abortion Pills In Pretoria ](+27832195400*)[ 🏥 Women's Abortion Clinic In Pre...
Abortion Pills In Pretoria ](+27832195400*)[ 🏥 Women's Abortion Clinic In Pre...
 
WSO2Con2024 - Enabling Transactional System's Exponential Growth With Simplicity
WSO2Con2024 - Enabling Transactional System's Exponential Growth With SimplicityWSO2Con2024 - Enabling Transactional System's Exponential Growth With Simplicity
WSO2Con2024 - Enabling Transactional System's Exponential Growth With Simplicity
 
Architecture decision records - How not to get lost in the past
Architecture decision records - How not to get lost in the pastArchitecture decision records - How not to get lost in the past
Architecture decision records - How not to get lost in the past
 
%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisa%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisa
 
Direct Style Effect Systems - The Print[A] Example - A Comprehension Aid
Direct Style Effect Systems -The Print[A] Example- A Comprehension AidDirect Style Effect Systems -The Print[A] Example- A Comprehension Aid
Direct Style Effect Systems - The Print[A] Example - A Comprehension Aid
 
%in Soweto+277-882-255-28 abortion pills for sale in soweto
%in Soweto+277-882-255-28 abortion pills for sale in soweto%in Soweto+277-882-255-28 abortion pills for sale in soweto
%in Soweto+277-882-255-28 abortion pills for sale in soweto
 
%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain
%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain
%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain
 
WSO2CON 2024 - Freedom First—Unleashing Developer Potential with Open Source
WSO2CON 2024 - Freedom First—Unleashing Developer Potential with Open SourceWSO2CON 2024 - Freedom First—Unleashing Developer Potential with Open Source
WSO2CON 2024 - Freedom First—Unleashing Developer Potential with Open Source
 
WSO2CON 2024 - API Management Usage at La Poste and Its Impact on Business an...
WSO2CON 2024 - API Management Usage at La Poste and Its Impact on Business an...WSO2CON 2024 - API Management Usage at La Poste and Its Impact on Business an...
WSO2CON 2024 - API Management Usage at La Poste and Its Impact on Business an...
 
WSO2CON 2024 - Cloud Native Middleware: Domain-Driven Design, Cell-Based Arch...
WSO2CON 2024 - Cloud Native Middleware: Domain-Driven Design, Cell-Based Arch...WSO2CON 2024 - Cloud Native Middleware: Domain-Driven Design, Cell-Based Arch...
WSO2CON 2024 - Cloud Native Middleware: Domain-Driven Design, Cell-Based Arch...
 
%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisa%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisa
 
MarTech Trend 2024 Book : Marketing Technology Trends (2024 Edition) How Data...
MarTech Trend 2024 Book : Marketing Technology Trends (2024 Edition) How Data...MarTech Trend 2024 Book : Marketing Technology Trends (2024 Edition) How Data...
MarTech Trend 2024 Book : Marketing Technology Trends (2024 Edition) How Data...
 
%+27788225528 love spells in Huntington Beach Psychic Readings, Attraction sp...
%+27788225528 love spells in Huntington Beach Psychic Readings, Attraction sp...%+27788225528 love spells in Huntington Beach Psychic Readings, Attraction sp...
%+27788225528 love spells in Huntington Beach Psychic Readings, Attraction sp...
 
%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...
%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...
%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...
 

OpenNebulaConf2018 - Scalable L2 overlay networks with routed VXLAN / BGP EVPN - Stefan Kooman - BIT

  • 1. Scalable L2 overlay networks with routed VXLAN / MP-BGP EVPN Stefan Kooman
  • 2. AKA HELP! Network Operations took over Hypervisor networking!
  • 3.
  • 5. Benefits of BGP EVPN / VXLAN  Overcome 4K VLAN limit  Customers can use own V(x)LAN schema  Strech L2 Segment accross DC  Reduces flooding traffic (Arp suppresion)  MAC mobility (VM migration)  Egress load balancing (ECMP)
  • 6. Our “old” DC network setup
  • 7. Limitations and problems old DC network  Bandwidth limitation  STP  Slow convergence  IPv6 routing in CPU  Unsupported gear (too old)
  • 8. Requirements for new DC network  Open standards and protocols  Must work in a IPv6 only setup  No more STP (all active links ?? + L3 only ??)  Bandwidth on demand  All current customer setups must be supported  Must work with our current billing software
  • 9. What if...  Every switch tracks all attached hosts / IP addresses  Switch creates a host route (/32 in IPv4 world, /128 in IPv6 world) for every directly-attached IP host  Host routes are redistributed into a routing protocol, allowing every other switch in the network to route towards any other host  Traffic to unknown destinations is dropped instead of forwarded out all ports  Possible ???
  • 12. VXLAN in summary  RFC7348  24 bits VNI field  Minimum recommended L2 MTU = VM MTU + 50 bytes  Recommended L2 >1600 bytes (VLAN tags and IPv6)  SRC UDP port = payload hash (inner-ethernet header)  DST UDP port = 4789 (linux default is 8472)
  • 13. EVPN in summary  RFC7432  Multi-tenant control plane for L2/L3 VPNs  Uses a new BGP address family  Works with many data-plane encapsulations  Carries IP+MAC reachability information  MAC/IP advertisement (EVPN route type 2)  VTEP advertisments (EVPN route type 3)  IP prefix route (EVPN route type 5)
  • 15. Underlay network Underlay network single purpose is to ensure reachability of the loopback interfaces, because there are used as VxLAN tunnel endpoints!
  • 18.
  • 19. Underlay design  Only one address family needed in the underlay  Only p2p /31 links between spines and leafs  One AS for all spines and one unique AS per switch pair  eBGP to make loopbacks (VTEPs) reachable  BGP timers tweaked; no BFD needed  Very simple to setup and troubleshoot  MTU >9000  Less than 300 routes in BGP for whole underlay setup which makes convergence really Speedy (Gonzales)
  • 20. Overlay design  Dual stack address family  One overlay AS (65101) for all spines and leafs  Spines are BGP RR’s for overlay network  BGP timers tweaked; no BFD needed  Line failure in core network has no impact on overlay RIB which makes convergence sub-second.  All overlay (production) traffic in non-defaultVrf  VRF_ID * 10000 + VLAN_ID = VNI_ID
  • 21. Overlay design  MC-LAG as first hop redundancy  vARP (all active gateway)  No access to defaultVrf; No unwanted tunnel access  Loopback as source for icmp replies  Filter advertised routes learned from spines
  • 22.
  • 23.
  • 24. EVPN + VXLAN on Hypervisor Required: Linux distro with kernel >= 4.5 FRRouting >= 5.1dev build with cumulus option Recommended: ifupdown2 >=1.0
  • 25. Step 1: Underlay  1 or more /31 uplink(s)  1 loopback address in defaultVrf for VTEP endpoint  MgmtVrf for in-band management (netns)  BGP session(s) on uplink(s) to leaf switches  Make loopback reachable to all other loopbacks/vteps  MTU >1600
  • 26. Step 2: Overlay Data Plane  Create VRF (internetVrf)  Create at least two bridges (L2VNI + L3VNI) per VRF  Create 1 VTEP for each bridge with ip address of loopback  Attach VTEP interface to bridge  Attach VNET interface to bridge  Configure mac + ip address on L2VNI bridge  Filter ARP traffic on VTEP interface  Enable forwarding + sysctl tuning
  • 27. Create VRF ip link add internetVrf type vrf table 1000 ip link set internetVrf up
  • 28. Create two bridges brctl addbr br-vlan601 ip link set br-vlan601 master internetVrf ip link set br-vlan601 up brctl addbr br-vlan4003 ip link set br-vlan4003 master internetVrf ip link set br-vlan4003 up
  • 29. Create VTEP for each bridge ip link add vtep10601 type vxlan id 10601 proxy nolearning dstport 4789 local 213.136.24.130 ip link add vtep20003 type vxlan id 20003 proxy nolearning dstport 4789 local 213.136.24.130
  • 30. Attach VTEP + VNET to bridge ip link set vtep10601 mtu 9000 ip link set vtep10601 up brctl addif br-vlan601 vtep10601 ip link set vtep20003 mtu 9000 ip link set vtep20003 up brctl addif br-vlan4003 vtep20003
  • 31. Configure MAC + IP, drop ARP ip addr add 213.136.24.161/28 dev br-vlan601 ip link set dev br-vlan601 address 02:62:69:74:67:77 ebtables -A OUTPUT -p arp -o vtep+ -j DROP
  • 32. Sysctl tuning sysctl -qw net.ipv4.conf.vnet.proxy_arp=1 sysctl -qw net.ipv4.neigh.vnet.proxy_delay=0 sysctl -qw net.ipv4.conf.all.forwarding=1 sysctl -qw net.ipv6.conf.all.forwarding=1 sysctl -qw net.ipv6.route.max_size=4194304
  • 33. OpenNebula support cat /var/lib/one/remotes/etc/vnm/OpenNebulaNetwork.conf  ... # Multicast protocol for multi destination BUM traffic. Options: # - multicast, for IP multicast # - evpn, for BGP EVPN control plane :vxlan_mode: evpn # Tunnel endpoint communication type. Only for evpn vxlan_mode. # - dev, tunnel endpoint communication is sent to PHYDEV # - local_ip, first ip addr of PHYDEV is used as address for the communiation :vxlan_tep: local_ip # Additional ip link options, uncomment the following to disable learning for EVPN mode :ip_link_conf: :nolearning: :proxy: :srcport: 49152 65535 :dstport: 4789
  • 34. Step 3: Overlay Control Plane
  • 36.
  • 37.
  • 38.
  • 39. Credits Jeroen Louwes (BIT) ← as he did all the work Sebastian Mangelkramer (convince ONE to integrate in ONE, issue #2161) Vincent Bernat (great blogs posts about everything networking, including numerous examples on L3 routing to hypervisors)
  • 40. Symmetric IRB vs Asymmetric IRB  Asymmetrical IRB Route on ingress switch Bridge from ingress switch to destination MAC Ingress switch needs MAC-IP entries for all destinations More easy to troubleshoot
  • 41. Symmetric IRB vs Asymmetric IRB  Symmetrical IRB Route on ingress switch intermediate segment across network (L3VNI) Route on egress switch Requires extra intermediate VNI per VRF Scalable
  • 42. Future wishes and plans  Migrating BGP RR role from spine switches to external  Unknown unicast block  IPv6 only underlay  All hypervisors (and AP’s) are VTEPs (this talk)  Upgrade current core-ring from EAPS (Extreme) to VXLAN
  • 43. Spines BGP config (Arista)
  • 44. Leafs BGP config (Arista)