SlideShare une entreprise Scribd logo
1  sur  18
Télécharger pour lire hors ligne
Sakir Sezer – CTO
29 January 2019, at Global Foundries , Santa Clara
1
ODSA use case – Smart NIC
Harnessing Domain Specific Acceleration at the Datacenter
What is a Smart NIC
• Support of baseline NIC functions and features, such as MAC and L3/L4
packet filtering and forwarding
• “Smartness” of a NIC implies its capability of making semi-autonomous
decisions based on IP traffic
• However, “smartness” will NOT enable any critical advantage for
networking or the host, if major workload and network related
responsibilities cannot be effectively offloaded from the host.
• Offloaded workload does not necessarily have to be networking
related, but must be network enabled so that tasks executed on the
NIC enable performance and/or feature advantage that otherwise
would be too expensive or impossible to execute on the local host.
2
Key Features Defining a Smart NIC
• High-level programmability of the NIC enabling in-field
customisation and extension of NIC features.
• On device processing of upper-layer functions (up to layer 7)
and applications that will enable localized decision making,
critical for services, networking and security.
• On device acceleration for offloading of heavy-duty tasks on
network traffic before it is forwarded to the host or to the
network, such as encryption, switching, inspection etc.
3
Generic ”Smart NIC” Architecture
nxGigE PHY
SERDES
Standard
Host Interface
PCIe Gen4
Embedded
General Purpose
Processing
Embedded General
Purpose Processing
ARM – RISC-V
Standard NIC
Functions
MAC + IP Header
processing
Accelerated Flow
Processing
(Flow
classification/tracking,
Firewall/ACL, NAT, etc.)
Domain Specific
Accelerators
(DPI, Crypto (ECC/AES),
Compression, TCP
offload
Network
Interface
External
Memory
Interface
External
Accelerator
Interfaced
Storage
Network on Chip
Host Interface
PCIe Gen 4
AI/ML Chip
SSD (NVMe)
(PCIe / CCIX
HMC or
DDR4
Embedded
Memory
(fast storage)
Last Level
Cache (LLC)
4
Tradition vs Smart NIC approach
for Domain Specific Offload Acceleration
Traditional NIC with separate offload PCIe card
based on GPU or FPGA (e.g. AWS F1)
Highly constrained by the host interface acting as a “bridge”
between the network and the accelerators
Smart NIC
with emended
processing
(ARM/RISC-V)
& accelerator
Host
2 x Xeon
Accelerators
AI/ML, FPGA
NVMe SSD
Storage
DDR4
(64-bit)
PCIe
Accelerator
Searching
Computing
AI/ML
(GPU / FPGA)
Host
2 x Xeon
SSD
Storage
NIC
DDR4
(64-bit)
PCIe
PCIe
Smart NIC with offload acceleration
without involving the host
Accelerators can access and process data directly
from the network before forwarding the original data
(and/or any extracted information) to the host
e.g. Storage, Security, OVS, AI/ML
5
What is Titan IC Regular Expression Processor
• Our RXP, Regular Expression Processor is a programmable custom-purpose content
processor for high-speed pattern matching, supporting PCRE/POSIX regular expressions
• Optimised for matching large number of regex rules in parallel
• Scalable single regex processor core capable of supporting beyond 100Gb/s pattern
matching bandwidth
• Rich set of software support, compiler, API, etc.
• Customisable for target applications, Memory, Performance, Footprint, Power(ASIC)
• Complex regex-based pattern matching for:
– Traditional (ACL) and NextGen Firewall (DPI, Intrusion Detection/Prevention
(IDS/IPS), e.g. Snort
– Application Recognition, Protocol Recognition,
– Application Firewall, detection of SQL injection, Application DoS
– SDN rule lookup/matching (Multi-Table), …………….
6
Titan IC - 100Gb/s RXP Processor
Parameter Value
Data width 128-bit
Clock frequency 800 MHz
Prefix capacity 16K
Number of clusters 8
TCM:CACHE 2K:2K
Total memory 27,132,864 bits
Memory macro
area
14.628 mm2
Standard cell area 0.935 mm2
Total post P&R area 19.665 mm2
Power 4.55 W
Technology: GlobalFoundries, 28nm HPP
7
Centralised vs Smart NIC based Network Security
Switch
NIC
NIC
NIC
NIC
“Middle Box”
Security
Appliance
Physical or Virtualised
as NFV or AWS Virtual Appliance
Security Management
Switch
Smart NIC
Smart NIC
Smart NIC
Smart NIC
SEC
SEC
SEC
SEC
Security is an embedded function and integral part of
a NIC, customised for the applications on the server
Key Advantages
- Distributed, inherently resilient
- No single point of failure
- Smaller attack surface
- Tailored to the application
- Fully virtualizable without
the compute overhead (Advanced NFV)
8
ARM
Rule
alert tcp $EXTERNAL_NET any -> $HOME_NET 1978
(msg:"APP-DETECT Apple OSX Remote Mouse usage";
flow:to_server,established;
content:"mos "; fast_pattern:only;
pcre:"/moss{2}dmsd/";
reference:url,pastebin.com/F81NCiYE;
classtype:policy-violation; sid:20443; rev:2;)
Snort Use-Case For RXP
Expected Snort Performance: between 4 to 5 x better
performance with Content Scanning Offload
PCRE Rule
MatchJob
DPDK Framework
Snort Application
RXP Plugin RXP API
Core-1
Fast Pattern
Rules
RXPI/O
9
Smart NIC HW accelerated IDS/IP
• Asynchronous operation
- Supporting multiple packets in-flight
• Multithreading
- Sub-blocks can be an independent HW offload or SW thread
Packet
Acquisition
Packet Decoding
Packet Pre-
Processors
OutputInspection
Titan IC
RXP
HW-NIC
HW Packet
Processing SW/SW Thread SW Thread SW Thread
Multiple packet in-flight HW Plug-in
10
ODSA Reference Multi-Die Architecture
Source: ODSA
11
Design Considerations
• Standard on-device communication related driver stack
– Open sources and part of the standard platform for on-chip embedded
processors and external (PCIe) hosts
• On-device communication (NOC)
– Scalable bandwidth
– low latency
– Technology independent high-speed on-chip interface (28nm TSMC <-> 14 nm GF)
• Low-latency high-bandwidth external memory access, preferably
with embedded LLC
12
Software Drivers
13
Low-level Drivers and API - Kernel vs Userspace
Application
Hyperion
PCIe
driver
RXP
Userspace API
AWS F1
EDMA/PCIe
driver
RXP driver
common
functions
Code
common
across all
platforms
Userspace
Kernel
Platform specific drivers
ODSA
NOC
driver
14
ODSA Ref Model - RXP Native Interface
• Data Plane,
• Control Plane,
• Programming Plane,
• External (Shared) Memory
Comm Agent
NOC Fabric
Comm
Agent
N x ARM A72
Cores
N x ARM A72
Cores
N x ARM A72
Cores
N x RISC
Cores
RXP Native Interface
Shim Layer
Interface, could be
AWS F1 type I/O Shell Architecture
15
External Memory
• External memory architecture underpins overall systems
performance and require a CPU centric approach
• Memory management is essential to deal with:
– Effective utilization of memory resources
– Software integration
• Most high-performance use-cases require cache deployment
– LLC tightly coupled to external DDR and to embedded processor
• Simple external private memory integration
16
Reducing Complexity
Network on Chip
Comm
Agent
LLC
Internal Bus
DDR4 / HMC
Controller
N x ARM A72
Cores
N x ARM A72
Cores
N x ARM A72
Cores
N x RISC
Cores
Accelerator
(RXP)
Host Interface
PCIe
DDR4 / HMC
Controller
Private Mem
Controller
DDR4 or HMCDDR4 or HMC
Comm AgentComm Agent
Comm Agent
Host (x86)
Chiplets
Other
Accelerator
or interface
Chiplest
17
In Summary
• Exciting new opportunities for rapid development and
deployment of high-performance and highly tailored solutions
• Reducing redesign impact on SW (drivers and device specific API)
interfaces may have to more restrictive than any NOC type state-
of-the-art deployments.
• NOC interface and interface adaptation layer (shim) must be
provided for various use-cases (open source Verilog/System-C)
• Tools generating low-level drivers (pre-allocated common and
custom register maps) will reduce software integration efforts and
cost, enabling independent third parties software development
18

Contenu connexe

Tendances

Design and Testing Challenges for Chiplet Based Design: Assembly and Test View
Design and Testing Challenges for Chiplet Based Design: Assembly and Test ViewDesign and Testing Challenges for Chiplet Based Design: Assembly and Test View
Design and Testing Challenges for Chiplet Based Design: Assembly and Test ViewODSA Workgroup
 
An open flow for dn ns on ultra low-power RISC-V cores
An open flow for dn ns on ultra low-power RISC-V coresAn open flow for dn ns on ultra low-power RISC-V cores
An open flow for dn ns on ultra low-power RISC-V coresRISC-V International
 
00 opencapi acceleration framework yonglu_ver2
00 opencapi acceleration framework yonglu_ver200 opencapi acceleration framework yonglu_ver2
00 opencapi acceleration framework yonglu_ver2Yutaka Kawai
 
OCP U.S. Summit 2017 Presentation
OCP U.S. Summit 2017 PresentationOCP U.S. Summit 2017 Presentation
OCP U.S. Summit 2017 PresentationNetronome
 
BUD17 Socionext SC2A11 ARM Server SoC
BUD17 Socionext SC2A11 ARM Server SoCBUD17 Socionext SC2A11 ARM Server SoC
BUD17 Socionext SC2A11 ARM Server SoCLinaro
 
Unifying Network Filtering Rules for the Linux Kernel with eBPF
Unifying Network Filtering Rules for the Linux Kernel with eBPFUnifying Network Filtering Rules for the Linux Kernel with eBPF
Unifying Network Filtering Rules for the Linux Kernel with eBPFNetronome
 
DPDK Summit 2015 - Sprint - Arun Rajagopal
DPDK Summit 2015 - Sprint - Arun RajagopalDPDK Summit 2015 - Sprint - Arun Rajagopal
DPDK Summit 2015 - Sprint - Arun RajagopalJim St. Leger
 
Fueling the datasphere how RISC-V enables the storage ecosystem
Fueling the datasphere   how RISC-V enables the storage ecosystemFueling the datasphere   how RISC-V enables the storage ecosystem
Fueling the datasphere how RISC-V enables the storage ecosystemRISC-V International
 
Netsft2017 day in_life_of_nfv
Netsft2017 day in_life_of_nfvNetsft2017 day in_life_of_nfv
Netsft2017 day in_life_of_nfvIntel
 
Disaggregation a Primer: Optimizing design for Edge Cloud & Bare Metal applic...
Disaggregation a Primer: Optimizing design for Edge Cloud & Bare Metal applic...Disaggregation a Primer: Optimizing design for Edge Cloud & Bare Metal applic...
Disaggregation a Primer: Optimizing design for Edge Cloud & Bare Metal applic...Netronome
 
An Automatic Generation of NoC Architectures: An Application-Mapping Approach
An Automatic Generation of NoC Architectures: An Application-Mapping ApproachAn Automatic Generation of NoC Architectures: An Application-Mapping Approach
An Automatic Generation of NoC Architectures: An Application-Mapping ApproachMostafa Khamis
 
DPDK Summit 2015 - Intel - Keith Wiles
DPDK Summit 2015 - Intel - Keith WilesDPDK Summit 2015 - Intel - Keith Wiles
DPDK Summit 2015 - Intel - Keith WilesJim St. Leger
 
The Power of SmartNICs
The Power of SmartNICsThe Power of SmartNICs
The Power of SmartNICsNetronome
 
DPDK Summit 2015 - Aspera - Charles Shiflett
DPDK Summit 2015 - Aspera - Charles ShiflettDPDK Summit 2015 - Aspera - Charles Shiflett
DPDK Summit 2015 - Aspera - Charles ShiflettJim St. Leger
 
Andes building a secure platform with the enhanced iopmp
Andes building a secure platform with the enhanced iopmpAndes building a secure platform with the enhanced iopmp
Andes building a secure platform with the enhanced iopmpRISC-V International
 
Software Network Data Plane - Satisfying the need for speed - FD.io - VPP and...
Software Network Data Plane - Satisfying the need for speed - FD.io - VPP and...Software Network Data Plane - Satisfying the need for speed - FD.io - VPP and...
Software Network Data Plane - Satisfying the need for speed - FD.io - VPP and...Haidee McMahon
 
LF_DPDK17_mediated devices: better userland IO
LF_DPDK17_mediated devices: better userland IOLF_DPDK17_mediated devices: better userland IO
LF_DPDK17_mediated devices: better userland IOLF_DPDK
 
RISC-V growth and successes in technology and industry - embedded world 2021
RISC-V growth and successes in technology and industry - embedded world 2021RISC-V growth and successes in technology and industry - embedded world 2021
RISC-V growth and successes in technology and industry - embedded world 2021RISC-V International
 
DPDK Summit - 08 Sept 2014 - Intel - Networking Workloads on Intel Architecture
DPDK Summit - 08 Sept 2014 - Intel - Networking Workloads on Intel ArchitectureDPDK Summit - 08 Sept 2014 - Intel - Networking Workloads on Intel Architecture
DPDK Summit - 08 Sept 2014 - Intel - Networking Workloads on Intel ArchitectureJim St. Leger
 
DPDK Summit - 08 Sept 2014 - Ericsson - A Multi-Socket Ferrari for NFV
DPDK Summit - 08 Sept 2014 - Ericsson - A Multi-Socket Ferrari for NFVDPDK Summit - 08 Sept 2014 - Ericsson - A Multi-Socket Ferrari for NFV
DPDK Summit - 08 Sept 2014 - Ericsson - A Multi-Socket Ferrari for NFVJim St. Leger
 

Tendances (20)

Design and Testing Challenges for Chiplet Based Design: Assembly and Test View
Design and Testing Challenges for Chiplet Based Design: Assembly and Test ViewDesign and Testing Challenges for Chiplet Based Design: Assembly and Test View
Design and Testing Challenges for Chiplet Based Design: Assembly and Test View
 
An open flow for dn ns on ultra low-power RISC-V cores
An open flow for dn ns on ultra low-power RISC-V coresAn open flow for dn ns on ultra low-power RISC-V cores
An open flow for dn ns on ultra low-power RISC-V cores
 
00 opencapi acceleration framework yonglu_ver2
00 opencapi acceleration framework yonglu_ver200 opencapi acceleration framework yonglu_ver2
00 opencapi acceleration framework yonglu_ver2
 
OCP U.S. Summit 2017 Presentation
OCP U.S. Summit 2017 PresentationOCP U.S. Summit 2017 Presentation
OCP U.S. Summit 2017 Presentation
 
BUD17 Socionext SC2A11 ARM Server SoC
BUD17 Socionext SC2A11 ARM Server SoCBUD17 Socionext SC2A11 ARM Server SoC
BUD17 Socionext SC2A11 ARM Server SoC
 
Unifying Network Filtering Rules for the Linux Kernel with eBPF
Unifying Network Filtering Rules for the Linux Kernel with eBPFUnifying Network Filtering Rules for the Linux Kernel with eBPF
Unifying Network Filtering Rules for the Linux Kernel with eBPF
 
DPDK Summit 2015 - Sprint - Arun Rajagopal
DPDK Summit 2015 - Sprint - Arun RajagopalDPDK Summit 2015 - Sprint - Arun Rajagopal
DPDK Summit 2015 - Sprint - Arun Rajagopal
 
Fueling the datasphere how RISC-V enables the storage ecosystem
Fueling the datasphere   how RISC-V enables the storage ecosystemFueling the datasphere   how RISC-V enables the storage ecosystem
Fueling the datasphere how RISC-V enables the storage ecosystem
 
Netsft2017 day in_life_of_nfv
Netsft2017 day in_life_of_nfvNetsft2017 day in_life_of_nfv
Netsft2017 day in_life_of_nfv
 
Disaggregation a Primer: Optimizing design for Edge Cloud & Bare Metal applic...
Disaggregation a Primer: Optimizing design for Edge Cloud & Bare Metal applic...Disaggregation a Primer: Optimizing design for Edge Cloud & Bare Metal applic...
Disaggregation a Primer: Optimizing design for Edge Cloud & Bare Metal applic...
 
An Automatic Generation of NoC Architectures: An Application-Mapping Approach
An Automatic Generation of NoC Architectures: An Application-Mapping ApproachAn Automatic Generation of NoC Architectures: An Application-Mapping Approach
An Automatic Generation of NoC Architectures: An Application-Mapping Approach
 
DPDK Summit 2015 - Intel - Keith Wiles
DPDK Summit 2015 - Intel - Keith WilesDPDK Summit 2015 - Intel - Keith Wiles
DPDK Summit 2015 - Intel - Keith Wiles
 
The Power of SmartNICs
The Power of SmartNICsThe Power of SmartNICs
The Power of SmartNICs
 
DPDK Summit 2015 - Aspera - Charles Shiflett
DPDK Summit 2015 - Aspera - Charles ShiflettDPDK Summit 2015 - Aspera - Charles Shiflett
DPDK Summit 2015 - Aspera - Charles Shiflett
 
Andes building a secure platform with the enhanced iopmp
Andes building a secure platform with the enhanced iopmpAndes building a secure platform with the enhanced iopmp
Andes building a secure platform with the enhanced iopmp
 
Software Network Data Plane - Satisfying the need for speed - FD.io - VPP and...
Software Network Data Plane - Satisfying the need for speed - FD.io - VPP and...Software Network Data Plane - Satisfying the need for speed - FD.io - VPP and...
Software Network Data Plane - Satisfying the need for speed - FD.io - VPP and...
 
LF_DPDK17_mediated devices: better userland IO
LF_DPDK17_mediated devices: better userland IOLF_DPDK17_mediated devices: better userland IO
LF_DPDK17_mediated devices: better userland IO
 
RISC-V growth and successes in technology and industry - embedded world 2021
RISC-V growth and successes in technology and industry - embedded world 2021RISC-V growth and successes in technology and industry - embedded world 2021
RISC-V growth and successes in technology and industry - embedded world 2021
 
DPDK Summit - 08 Sept 2014 - Intel - Networking Workloads on Intel Architecture
DPDK Summit - 08 Sept 2014 - Intel - Networking Workloads on Intel ArchitectureDPDK Summit - 08 Sept 2014 - Intel - Networking Workloads on Intel Architecture
DPDK Summit - 08 Sept 2014 - Intel - Networking Workloads on Intel Architecture
 
DPDK Summit - 08 Sept 2014 - Ericsson - A Multi-Socket Ferrari for NFV
DPDK Summit - 08 Sept 2014 - Ericsson - A Multi-Socket Ferrari for NFVDPDK Summit - 08 Sept 2014 - Ericsson - A Multi-Socket Ferrari for NFV
DPDK Summit - 08 Sept 2014 - Ericsson - A Multi-Socket Ferrari for NFV
 

Similaire à ODSA Use Case - SmartNIC

Introduction to DPDK
Introduction to DPDKIntroduction to DPDK
Introduction to DPDKKernel TLV
 
Sharing High-Performance Interconnects Across Multiple Virtual Machines
Sharing High-Performance Interconnects Across Multiple Virtual MachinesSharing High-Performance Interconnects Across Multiple Virtual Machines
Sharing High-Performance Interconnects Across Multiple Virtual Machinesinside-BigData.com
 
Dataplane networking acceleration with OpenDataplane / Максим Уваров (Linaro)
Dataplane networking acceleration with OpenDataplane / Максим Уваров (Linaro)Dataplane networking acceleration with OpenDataplane / Максим Уваров (Linaro)
Dataplane networking acceleration with OpenDataplane / Максим Уваров (Linaro)Ontico
 
LEGaTO Heterogeneous Hardware
LEGaTO Heterogeneous HardwareLEGaTO Heterogeneous Hardware
LEGaTO Heterogeneous HardwareLEGATO project
 
SOUG_GV_Flashgrid_V4
SOUG_GV_Flashgrid_V4SOUG_GV_Flashgrid_V4
SOUG_GV_Flashgrid_V4UniFabric
 
6 open capi_meetup_in_japan_final
6 open capi_meetup_in_japan_final6 open capi_meetup_in_japan_final
6 open capi_meetup_in_japan_finalYutaka Kawai
 
Building efficient 5G NR base stations with Intel® Xeon® Scalable Processors
Building efficient 5G NR base stations with Intel® Xeon® Scalable Processors Building efficient 5G NR base stations with Intel® Xeon® Scalable Processors
Building efficient 5G NR base stations with Intel® Xeon® Scalable Processors Michelle Holley
 
Heterogeneous Computing : The Future of Systems
Heterogeneous Computing : The Future of SystemsHeterogeneous Computing : The Future of Systems
Heterogeneous Computing : The Future of SystemsAnand Haridass
 
From Rack scale computers to Warehouse scale computers
From Rack scale computers to Warehouse scale computersFrom Rack scale computers to Warehouse scale computers
From Rack scale computers to Warehouse scale computersRyousei Takano
 
Optimized HPC/AI cloud with OpenStack acceleration service and composable har...
Optimized HPC/AI cloud with OpenStack acceleration service and composable har...Optimized HPC/AI cloud with OpenStack acceleration service and composable har...
Optimized HPC/AI cloud with OpenStack acceleration service and composable har...Shuquan Huang
 
PLNOG16: Obsługa 100M pps na platformie PC , Przemysław Frasunek, Paweł Mała...
PLNOG16: Obsługa 100M pps na platformie PC, Przemysław Frasunek, Paweł Mała...PLNOG16: Obsługa 100M pps na platformie PC, Przemysław Frasunek, Paweł Mała...
PLNOG16: Obsługa 100M pps na platformie PC , Przemysław Frasunek, Paweł Mała...PROIDEA
 
AMD EPYC™ Microprocessor Architecture
AMD EPYC™ Microprocessor ArchitectureAMD EPYC™ Microprocessor Architecture
AMD EPYC™ Microprocessor ArchitectureAMD
 
Maxwell siuc hpc_description_tutorial
Maxwell siuc hpc_description_tutorialMaxwell siuc hpc_description_tutorial
Maxwell siuc hpc_description_tutorialmadhuinturi
 
SYBSC IT SEM IV EMBEDDED SYSTEMS UNIT I Core of Embedded Systems
SYBSC IT SEM IV EMBEDDED SYSTEMS UNIT I   Core of Embedded SystemsSYBSC IT SEM IV EMBEDDED SYSTEMS UNIT I   Core of Embedded Systems
SYBSC IT SEM IV EMBEDDED SYSTEMS UNIT I Core of Embedded SystemsArti Parab Academics
 
Arm DynamIQ: Intelligent Solutions Using Cluster Based Multiprocessing
Arm DynamIQ: Intelligent Solutions Using Cluster Based MultiprocessingArm DynamIQ: Intelligent Solutions Using Cluster Based Multiprocessing
Arm DynamIQ: Intelligent Solutions Using Cluster Based MultiprocessingArm
 
ETHERNET PACKET PROCESSOR FOR SOC APPLICATION
ETHERNET PACKET PROCESSOR FOR SOC APPLICATIONETHERNET PACKET PROCESSOR FOR SOC APPLICATION
ETHERNET PACKET PROCESSOR FOR SOC APPLICATIONcscpconf
 
Recent Developments in Donard
Recent Developments in DonardRecent Developments in Donard
Recent Developments in DonardPMC-Sierra Inc.
 

Similaire à ODSA Use Case - SmartNIC (20)

Introduction to DPDK
Introduction to DPDKIntroduction to DPDK
Introduction to DPDK
 
Sharing High-Performance Interconnects Across Multiple Virtual Machines
Sharing High-Performance Interconnects Across Multiple Virtual MachinesSharing High-Performance Interconnects Across Multiple Virtual Machines
Sharing High-Performance Interconnects Across Multiple Virtual Machines
 
Dataplane networking acceleration with OpenDataplane / Максим Уваров (Linaro)
Dataplane networking acceleration with OpenDataplane / Максим Уваров (Linaro)Dataplane networking acceleration with OpenDataplane / Максим Уваров (Linaro)
Dataplane networking acceleration with OpenDataplane / Максим Уваров (Linaro)
 
Cloud Networking Trends
Cloud Networking TrendsCloud Networking Trends
Cloud Networking Trends
 
LEGaTO Heterogeneous Hardware
LEGaTO Heterogeneous HardwareLEGaTO Heterogeneous Hardware
LEGaTO Heterogeneous Hardware
 
SOUG_GV_Flashgrid_V4
SOUG_GV_Flashgrid_V4SOUG_GV_Flashgrid_V4
SOUG_GV_Flashgrid_V4
 
6 open capi_meetup_in_japan_final
6 open capi_meetup_in_japan_final6 open capi_meetup_in_japan_final
6 open capi_meetup_in_japan_final
 
Building efficient 5G NR base stations with Intel® Xeon® Scalable Processors
Building efficient 5G NR base stations with Intel® Xeon® Scalable Processors Building efficient 5G NR base stations with Intel® Xeon® Scalable Processors
Building efficient 5G NR base stations with Intel® Xeon® Scalable Processors
 
Heterogeneous Computing : The Future of Systems
Heterogeneous Computing : The Future of SystemsHeterogeneous Computing : The Future of Systems
Heterogeneous Computing : The Future of Systems
 
From Rack scale computers to Warehouse scale computers
From Rack scale computers to Warehouse scale computersFrom Rack scale computers to Warehouse scale computers
From Rack scale computers to Warehouse scale computers
 
Optimized HPC/AI cloud with OpenStack acceleration service and composable har...
Optimized HPC/AI cloud with OpenStack acceleration service and composable har...Optimized HPC/AI cloud with OpenStack acceleration service and composable har...
Optimized HPC/AI cloud with OpenStack acceleration service and composable har...
 
PLNOG16: Obsługa 100M pps na platformie PC , Przemysław Frasunek, Paweł Mała...
PLNOG16: Obsługa 100M pps na platformie PC, Przemysław Frasunek, Paweł Mała...PLNOG16: Obsługa 100M pps na platformie PC, Przemysław Frasunek, Paweł Mała...
PLNOG16: Obsługa 100M pps na platformie PC , Przemysław Frasunek, Paweł Mała...
 
AMD EPYC™ Microprocessor Architecture
AMD EPYC™ Microprocessor ArchitectureAMD EPYC™ Microprocessor Architecture
AMD EPYC™ Microprocessor Architecture
 
Maxwell siuc hpc_description_tutorial
Maxwell siuc hpc_description_tutorialMaxwell siuc hpc_description_tutorial
Maxwell siuc hpc_description_tutorial
 
SYBSC IT SEM IV EMBEDDED SYSTEMS UNIT I Core of Embedded Systems
SYBSC IT SEM IV EMBEDDED SYSTEMS UNIT I   Core of Embedded SystemsSYBSC IT SEM IV EMBEDDED SYSTEMS UNIT I   Core of Embedded Systems
SYBSC IT SEM IV EMBEDDED SYSTEMS UNIT I Core of Embedded Systems
 
Arm DynamIQ: Intelligent Solutions Using Cluster Based Multiprocessing
Arm DynamIQ: Intelligent Solutions Using Cluster Based MultiprocessingArm DynamIQ: Intelligent Solutions Using Cluster Based Multiprocessing
Arm DynamIQ: Intelligent Solutions Using Cluster Based Multiprocessing
 
uCluster
uClusteruCluster
uCluster
 
Smart NIC
Smart NICSmart NIC
Smart NIC
 
ETHERNET PACKET PROCESSOR FOR SOC APPLICATION
ETHERNET PACKET PROCESSOR FOR SOC APPLICATIONETHERNET PACKET PROCESSOR FOR SOC APPLICATION
ETHERNET PACKET PROCESSOR FOR SOC APPLICATION
 
Recent Developments in Donard
Recent Developments in DonardRecent Developments in Donard
Recent Developments in Donard
 

Plus de ODSA Workgroup

ODSA - Business Workstream
ODSA - Business WorkstreamODSA - Business Workstream
ODSA - Business WorkstreamODSA Workgroup
 
ODSA - PoC Planning & Call for Help
ODSA - PoC Planning & Call for HelpODSA - PoC Planning & Call for Help
ODSA - PoC Planning & Call for HelpODSA Workgroup
 
ODSA - DirectConnect Options for the PoC
ODSA - DirectConnect Options for the PoCODSA - DirectConnect Options for the PoC
ODSA - DirectConnect Options for the PoCODSA Workgroup
 
ODSA - PoC Physical Design Elements
ODSA - PoC Physical Design ElementsODSA - PoC Physical Design Elements
ODSA - PoC Physical Design ElementsODSA Workgroup
 
ODSA - NXP PoC Presentation
ODSA - NXP PoC PresentationODSA - NXP PoC Presentation
ODSA - NXP PoC PresentationODSA Workgroup
 
ODSA - PoC Requirements and Use Cases
ODSA - PoC Requirements and Use CasesODSA - PoC Requirements and Use Cases
ODSA - PoC Requirements and Use CasesODSA Workgroup
 
ODSA - Speedster22i FPGA for POC
ODSA - Speedster22i FPGA for POCODSA - Speedster22i FPGA for POC
ODSA - Speedster22i FPGA for POCODSA Workgroup
 
ODSA Design Tools for Chiplet-Based Design
ODSA Design Tools for Chiplet-Based DesignODSA Design Tools for Chiplet-Based Design
ODSA Design Tools for Chiplet-Based DesignODSA Workgroup
 
OODSA - BoW: Basic, Fast, Turbo Die-to Die Open Interface Solutions
OODSA - BoW: Basic, Fast, Turbo Die-to Die Open Interface SolutionsOODSA - BoW: Basic, Fast, Turbo Die-to Die Open Interface Solutions
OODSA - BoW: Basic, Fast, Turbo Die-to Die Open Interface SolutionsODSA Workgroup
 
ODSA - Samsung Welcomes OCP's ODSA
ODSA - Samsung Welcomes OCP's ODSAODSA - Samsung Welcomes OCP's ODSA
ODSA - Samsung Welcomes OCP's ODSAODSA Workgroup
 
ODSA - Technical Introduction
ODSA - Technical IntroductionODSA - Technical Introduction
ODSA - Technical IntroductionODSA Workgroup
 
ODSA - OCP Accelerator Module and the Infrastructure
ODSA - OCP Accelerator Module and the InfrastructureODSA - OCP Accelerator Module and the Infrastructure
ODSA - OCP Accelerator Module and the InfrastructureODSA Workgroup
 
ODSA Sub-Project Launch
ODSA Sub-Project LaunchODSA Sub-Project Launch
ODSA Sub-Project LaunchODSA Workgroup
 

Plus de ODSA Workgroup (18)

ODSA - Business Workstream
ODSA - Business WorkstreamODSA - Business Workstream
ODSA - Business Workstream
 
ODSA - PoC Planning & Call for Help
ODSA - PoC Planning & Call for HelpODSA - PoC Planning & Call for Help
ODSA - PoC Planning & Call for Help
 
ODSA - DirectConnect Options for the PoC
ODSA - DirectConnect Options for the PoCODSA - DirectConnect Options for the PoC
ODSA - DirectConnect Options for the PoC
 
ODSA - PoC Physical Design Elements
ODSA - PoC Physical Design ElementsODSA - PoC Physical Design Elements
ODSA - PoC Physical Design Elements
 
ODSA - NXP PoC Presentation
ODSA - NXP PoC PresentationODSA - NXP PoC Presentation
ODSA - NXP PoC Presentation
 
ODSA - PoC Requirements and Use Cases
ODSA - PoC Requirements and Use CasesODSA - PoC Requirements and Use Cases
ODSA - PoC Requirements and Use Cases
 
ODSA - Speedster22i FPGA for POC
ODSA - Speedster22i FPGA for POCODSA - Speedster22i FPGA for POC
ODSA - Speedster22i FPGA for POC
 
ODSA Design Tools for Chiplet-Based Design
ODSA Design Tools for Chiplet-Based DesignODSA Design Tools for Chiplet-Based Design
ODSA Design Tools for Chiplet-Based Design
 
ODSA - Power and Heat
ODSA - Power and HeatODSA - Power and Heat
ODSA - Power and Heat
 
ODSA - PIPE Adapters
ODSA - PIPE Adapters ODSA - PIPE Adapters
ODSA - PIPE Adapters
 
ODSA - PHY Layer
ODSA - PHY LayerODSA - PHY Layer
ODSA - PHY Layer
 
OODSA - BoW: Basic, Fast, Turbo Die-to Die Open Interface Solutions
OODSA - BoW: Basic, Fast, Turbo Die-to Die Open Interface SolutionsOODSA - BoW: Basic, Fast, Turbo Die-to Die Open Interface Solutions
OODSA - BoW: Basic, Fast, Turbo Die-to Die Open Interface Solutions
 
ODSA - Samsung Welcomes OCP's ODSA
ODSA - Samsung Welcomes OCP's ODSAODSA - Samsung Welcomes OCP's ODSA
ODSA - Samsung Welcomes OCP's ODSA
 
ODSA - Technical Introduction
ODSA - Technical IntroductionODSA - Technical Introduction
ODSA - Technical Introduction
 
ODSA - OCP Welcome
ODSA - OCP WelcomeODSA - OCP Welcome
ODSA - OCP Welcome
 
ODSA - OCP Accelerator Module and the Infrastructure
ODSA - OCP Accelerator Module and the InfrastructureODSA - OCP Accelerator Module and the Infrastructure
ODSA - OCP Accelerator Module and the Infrastructure
 
ODSA Sub-Project Launch
ODSA Sub-Project LaunchODSA Sub-Project Launch
ODSA Sub-Project Launch
 
ODSA Workshop
ODSA WorkshopODSA Workshop
ODSA Workshop
 

Dernier

Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...apidays
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyKhushali Kathiriya
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesBoston Institute of Analytics
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 

Dernier (20)

Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 

ODSA Use Case - SmartNIC

  • 1. Sakir Sezer – CTO 29 January 2019, at Global Foundries , Santa Clara 1 ODSA use case – Smart NIC Harnessing Domain Specific Acceleration at the Datacenter
  • 2. What is a Smart NIC • Support of baseline NIC functions and features, such as MAC and L3/L4 packet filtering and forwarding • “Smartness” of a NIC implies its capability of making semi-autonomous decisions based on IP traffic • However, “smartness” will NOT enable any critical advantage for networking or the host, if major workload and network related responsibilities cannot be effectively offloaded from the host. • Offloaded workload does not necessarily have to be networking related, but must be network enabled so that tasks executed on the NIC enable performance and/or feature advantage that otherwise would be too expensive or impossible to execute on the local host. 2
  • 3. Key Features Defining a Smart NIC • High-level programmability of the NIC enabling in-field customisation and extension of NIC features. • On device processing of upper-layer functions (up to layer 7) and applications that will enable localized decision making, critical for services, networking and security. • On device acceleration for offloading of heavy-duty tasks on network traffic before it is forwarded to the host or to the network, such as encryption, switching, inspection etc. 3
  • 4. Generic ”Smart NIC” Architecture nxGigE PHY SERDES Standard Host Interface PCIe Gen4 Embedded General Purpose Processing Embedded General Purpose Processing ARM – RISC-V Standard NIC Functions MAC + IP Header processing Accelerated Flow Processing (Flow classification/tracking, Firewall/ACL, NAT, etc.) Domain Specific Accelerators (DPI, Crypto (ECC/AES), Compression, TCP offload Network Interface External Memory Interface External Accelerator Interfaced Storage Network on Chip Host Interface PCIe Gen 4 AI/ML Chip SSD (NVMe) (PCIe / CCIX HMC or DDR4 Embedded Memory (fast storage) Last Level Cache (LLC) 4
  • 5. Tradition vs Smart NIC approach for Domain Specific Offload Acceleration Traditional NIC with separate offload PCIe card based on GPU or FPGA (e.g. AWS F1) Highly constrained by the host interface acting as a “bridge” between the network and the accelerators Smart NIC with emended processing (ARM/RISC-V) & accelerator Host 2 x Xeon Accelerators AI/ML, FPGA NVMe SSD Storage DDR4 (64-bit) PCIe Accelerator Searching Computing AI/ML (GPU / FPGA) Host 2 x Xeon SSD Storage NIC DDR4 (64-bit) PCIe PCIe Smart NIC with offload acceleration without involving the host Accelerators can access and process data directly from the network before forwarding the original data (and/or any extracted information) to the host e.g. Storage, Security, OVS, AI/ML 5
  • 6. What is Titan IC Regular Expression Processor • Our RXP, Regular Expression Processor is a programmable custom-purpose content processor for high-speed pattern matching, supporting PCRE/POSIX regular expressions • Optimised for matching large number of regex rules in parallel • Scalable single regex processor core capable of supporting beyond 100Gb/s pattern matching bandwidth • Rich set of software support, compiler, API, etc. • Customisable for target applications, Memory, Performance, Footprint, Power(ASIC) • Complex regex-based pattern matching for: – Traditional (ACL) and NextGen Firewall (DPI, Intrusion Detection/Prevention (IDS/IPS), e.g. Snort – Application Recognition, Protocol Recognition, – Application Firewall, detection of SQL injection, Application DoS – SDN rule lookup/matching (Multi-Table), ……………. 6
  • 7. Titan IC - 100Gb/s RXP Processor Parameter Value Data width 128-bit Clock frequency 800 MHz Prefix capacity 16K Number of clusters 8 TCM:CACHE 2K:2K Total memory 27,132,864 bits Memory macro area 14.628 mm2 Standard cell area 0.935 mm2 Total post P&R area 19.665 mm2 Power 4.55 W Technology: GlobalFoundries, 28nm HPP 7
  • 8. Centralised vs Smart NIC based Network Security Switch NIC NIC NIC NIC “Middle Box” Security Appliance Physical or Virtualised as NFV or AWS Virtual Appliance Security Management Switch Smart NIC Smart NIC Smart NIC Smart NIC SEC SEC SEC SEC Security is an embedded function and integral part of a NIC, customised for the applications on the server Key Advantages - Distributed, inherently resilient - No single point of failure - Smaller attack surface - Tailored to the application - Fully virtualizable without the compute overhead (Advanced NFV) 8
  • 9. ARM Rule alert tcp $EXTERNAL_NET any -> $HOME_NET 1978 (msg:"APP-DETECT Apple OSX Remote Mouse usage"; flow:to_server,established; content:"mos "; fast_pattern:only; pcre:"/moss{2}dmsd/"; reference:url,pastebin.com/F81NCiYE; classtype:policy-violation; sid:20443; rev:2;) Snort Use-Case For RXP Expected Snort Performance: between 4 to 5 x better performance with Content Scanning Offload PCRE Rule MatchJob DPDK Framework Snort Application RXP Plugin RXP API Core-1 Fast Pattern Rules RXPI/O 9
  • 10. Smart NIC HW accelerated IDS/IP • Asynchronous operation - Supporting multiple packets in-flight • Multithreading - Sub-blocks can be an independent HW offload or SW thread Packet Acquisition Packet Decoding Packet Pre- Processors OutputInspection Titan IC RXP HW-NIC HW Packet Processing SW/SW Thread SW Thread SW Thread Multiple packet in-flight HW Plug-in 10
  • 11. ODSA Reference Multi-Die Architecture Source: ODSA 11
  • 12. Design Considerations • Standard on-device communication related driver stack – Open sources and part of the standard platform for on-chip embedded processors and external (PCIe) hosts • On-device communication (NOC) – Scalable bandwidth – low latency – Technology independent high-speed on-chip interface (28nm TSMC <-> 14 nm GF) • Low-latency high-bandwidth external memory access, preferably with embedded LLC 12
  • 14. Low-level Drivers and API - Kernel vs Userspace Application Hyperion PCIe driver RXP Userspace API AWS F1 EDMA/PCIe driver RXP driver common functions Code common across all platforms Userspace Kernel Platform specific drivers ODSA NOC driver 14
  • 15. ODSA Ref Model - RXP Native Interface • Data Plane, • Control Plane, • Programming Plane, • External (Shared) Memory Comm Agent NOC Fabric Comm Agent N x ARM A72 Cores N x ARM A72 Cores N x ARM A72 Cores N x RISC Cores RXP Native Interface Shim Layer Interface, could be AWS F1 type I/O Shell Architecture 15
  • 16. External Memory • External memory architecture underpins overall systems performance and require a CPU centric approach • Memory management is essential to deal with: – Effective utilization of memory resources – Software integration • Most high-performance use-cases require cache deployment – LLC tightly coupled to external DDR and to embedded processor • Simple external private memory integration 16
  • 17. Reducing Complexity Network on Chip Comm Agent LLC Internal Bus DDR4 / HMC Controller N x ARM A72 Cores N x ARM A72 Cores N x ARM A72 Cores N x RISC Cores Accelerator (RXP) Host Interface PCIe DDR4 / HMC Controller Private Mem Controller DDR4 or HMCDDR4 or HMC Comm AgentComm Agent Comm Agent Host (x86) Chiplets Other Accelerator or interface Chiplest 17
  • 18. In Summary • Exciting new opportunities for rapid development and deployment of high-performance and highly tailored solutions • Reducing redesign impact on SW (drivers and device specific API) interfaces may have to more restrictive than any NOC type state- of-the-art deployments. • NOC interface and interface adaptation layer (shim) must be provided for various use-cases (open source Verilog/System-C) • Tools generating low-level drivers (pre-allocated common and custom register maps) will reduce software integration efforts and cost, enabling independent third parties software development 18