SlideShare une entreprise Scribd logo
1  sur  48
Télécharger pour lire hors ligne
HSAemu ‐ A F ll S t
HSA
A Full System Emulator 
E l t
for HSA Platform
for HSA Platform
Prof. Yeh‐Ching Chung
System Software Laboratory
Department of Computer science 
Department of Computer science
National Tsing Hua University

National Tsing Hua University ® copyright OIA
National Tsing Hua University

1
Outline





Introduction to HSA
Introduction to HSA
Design of HSAemu
Performance Evaluation
P f
E l ti
Conclusions and Future Work

National Tsing Hua University ® copyright OIA
National Tsing Hua University

2
Introduction to HSA 

HSA is an industry standard to 
f
g
define next‐generation 
hardware/software architecture 
for heterogeneous computing 
for heterogeneous computing

National Tsing Hua University ® copyright OIA
National Tsing Hua University

3
Hardware Platform of HSA

National Tsing Hua University ® copyright OIA
National Tsing Hua University

4
Simplified HSA Software Stack
Application
Domain Specific Libs
(Bolt, OpenCV™, … many others)

Application 
SW

Renderscript
p
/OpenCL
Runtime

HSA Runtime

HSA Software

HSAIL

OpenGL‐ES
O
GL ES
Runtime

Legacy 
Driver

Other
Oth
Runtime

Legacy 
Driver

Ctl

Drivers
HSA Finalizer

Kernel Driver

GPU ISA

Differentiated HW

National Tsing Hua University ® copyright OIA
National Tsing Hua University

CPU(s)

GPU(s)

Other 
Accelerators

5
Specification of Simple HSA Platform


Hardware
– Memory
Memory 
• Shared Virtual Memory (hUMA)
• Cache Coherency Domains
• Memory‐Based Signaling and 
Synchronization for CPU and GPU

– Task Control
• Architected Queuing Language (AQL)
• Efficient Syscall Infrastructure
• Preemptive Context Switching 

– Debugging Infrastructure
gg g
• Allow system software to set 
Instruction/ Memory/ Conditional, etc., 
breakpoints



Software
– HSA R ti
HSA Runtime APIs
API
•
•
•
•
•
•

Initialization of HSA components
Topology discovery
Manage AQL packets
Manage AQL packets
Dispatch application tasks
Signal HW and wait for result
Recycle available resources

– User Mode Queue
• Store AQL packets

– Virtual ISA ‐ HSAIL
Virtual ISA 
• A low level instruction set designed for 
parallel computing

– E
Exception Handling
ti H dli
• GPU trap handler to trigger GPU 
interrupt for GPU exception

National Tsing Hua University ® copyright OIA
National Tsing Hua University

6
What Is HSAemu


HSAemu is a full system emulator that supports 
the following HSA features
–
–
–
–
–
–
–




Shared virtual memory between CPU and GPU
Memory based signaling and synchronization
Memory based signaling and synchronization
Multiple user level command queues
Preemptive GPU context switching
Concurrent execution of CPU threads and GPU threads
Concurrent execution of CPU threads and GPU threads
HSA runtime
Finalizer

A Project Sponsored by MediaTek (MTK)
AP j S
d b M di T k
Currently, it supports simple HSA platform 
simulation
–
–

Functional‐accurate simulation
Cycle‐accurate simulation

National Tsing Hua University ® copyright OIA
National Tsing Hua University

7
Architecture of HSAemu


HSAemu consists of 6 components
– HSA Runtime
– CPU Simulation Module
– GPU Task Dispatcher
– Functional‐Accurate GPU Simulator (Fast‐

GPU Simulator)
– Cycle‐Accurate GPU Simulator (Mult2sim)
– GPU Helper Functions

National Tsing Hua University ® copyright OIA
National Tsing Hua University

8
HSAemu Runtime



User Mode Queue
– Store AQL packets
Store AQL packets



AQL Queue Manager 
– Manage AQL packets in User Mode  

Queue


AQL Command Dispatcher 
– Launch the execution of kernel jobs on
Launch the execution of kernel jobs on  

HSAemu


National Tsing Hua University ® copyright OIA
National Tsing Hua University

Support OpenCL runtime
pp
p
9
CPU Simulation Module (1)

PQEMU – Perform multicore CPU simulation
 HSA Signal Handler – Receive AQL command 
from HSA Runtime and launch GPU simulation


National Tsing Hua University ® copyright OIA
National Tsing Hua University

10
CPU Simulation Module (2)


PQEMU
– A parallel system emulator based on QEMU
A parallel system emulator based on QEMU
– Tow efficient synchronization models (UCC/SCC)
– Dynamic binary translation (DBT) technique
– A project sponsored by MTK



Agent code, HSA runtime, and operating 
system are run on PQEMU
system are run on PQEMU
Code Cache
DBT

DBT

DBT

DBT

CPU

CPU

CPU

CPU

Unified Code Cache (UCC) Model
“PQEMU: A Parallel System Emulator Based on QEMU” (ICPADS 2011)
National Tsing Hua University ® copyright OIA
National Tsing Hua University

11
GPU Task Dispatcher (1)


AQL Command Monitor
– Receive signal from HSA Signal Handler
– Copy AQL packets from User Mode Queue 

to HW AQL Queue
– Launch AQL Packet Worker


AQL Packet Worker
– Dequeue AQL packets from HW AQL Queue
– Parse AQL packet
Parse AQL packet
– Dispatch kernel jobs to Fast‐GPU Simulator 

or M2S‐GPU Simulator according to the 
kernel information
kernel information

National Tsing Hua University ® copyright OIA
National Tsing Hua University

12
GPU Task Dispatcher (2)


Execution Flow 

National Tsing Hua University ® copyright OIA
National Tsing Hua University
GPU Task Dispatcher (3)

Signal from
HAS Signal Handler

National Tsing Hua University ® copyright OIA
National Tsing Hua University
GPU Task Dispatcher (4)

Copy AQL packets from
User Mode Queue

National Tsing Hua University ® copyright OIA
National Tsing Hua University
GPU Task Dispatcher (5)

Ask AQL Packet Worker
to parse AQL Packet

National Tsing Hua University ® copyright OIA
National Tsing Hua University
GPU Task Dispatcher (6)

Launch Fast-GPU
Simulator

National Tsing Hua University ® copyright OIA
National Tsing Hua University
GPU Task Dispatcher (7)

Launch M2S-GPU
Simulation

National Tsing Hua University ® copyright OIA
National Tsing Hua University
Fast‐GPU Simulator


A functional‐accurate simulator 
for generic GPU model simulation
– HSAIL Translator 
• Act as a Finalizer
• Use static binary translation technique  
to translate BRIG file to host executable 
to translate BRIG file to host executable
binary file (x86) based on LLVM
• Host SSE instruction optimization

– GPU Thread Scheduler
• Simulate a generic GPU model

National Tsing Hua University ® copyright OIA
National Tsing Hua University

19
HSAIL Translator (1)


Architecture

National Tsing Hua University ® copyright OIA
National Tsing Hua University
HSAIL Translator (2)

Launch LLVM
HSAIL Translator

National Tsing Hua University ® copyright OIA
National Tsing Hua University
HSAIL Translator (3)

Construct
Control Flow
Graph of HSAIL

National Tsing Hua University ® copyright OIA
National Tsing Hua University
HSAIL Translator (4)

Translate HSAIL
to LLVM IR

National Tsing Hua University ® copyright OIA
National Tsing Hua University
HSAIL Translator (5)

Translate LLVM IR
to Host Executable
Object File

National Tsing Hua University ® copyright OIA
National Tsing Hua University
HSAIL Translator (6)

Load Host Executable
Object File
to memory

National Tsing Hua University ® copyright OIA
National Tsing Hua University
HSAIL Translator (7)

Link to GPU
Helper Functions

National Tsing Hua University ® copyright OIA
National Tsing Hua University
HSAIL Translator (8)

Store the translation result
to GPU Code Cache

National Tsing Hua University ® copyright OIA
National Tsing Hua University
HSAIL Translator (2)


Host SSE instruction Optimization
– Reconstruct the control flow graph of kernel 

function
– Use bitmap masking and packing/unpacking 
algorithms to generate host SSE instructions 
algorithms to generate host SSE instructions

National Tsing Hua University ® copyright OIA
National Tsing Hua University

28
HSAIL Translator (3)


Example : The control flow graph for kernel 
function $foo

National Tsing Hua University ® copyright OIA
National Tsing Hua University

29
HSAIL Translator (4)


Reconstruct the control flow graph by depth‐first traversal



Perform bitmap masking
and packing & unpacking
algorithms 
algorithms

National Tsing Hua University ® copyright OIA
National Tsing Hua University

30
GPU Thread Scheduler



Simulate a generic GPU model
– GPU Thread Scheduler assigns work groups 

to free CU threads in the GPU Thread Pool
to free CU threads in the GPU Thread Pool
– Each CU thread executes all work items in a 
work group 
– The maximum number of CU threads is
The maximum number of CU threads is 
limited by host operating system   

National Tsing Hua University ® copyright OIA
National Tsing Hua University

31
M2S‐GPU Simulator (1)


A cycle‐accurate simulator for 
AMD Southern Islands GPU 
model simulation
– HSAIL Translator 
• Translate BRIG file to GPU binary

– M2S Bridge
• Bridge Multi2Sim GPU Model to 
HSAemu

– M2S GPU Module
• Simulate a cycle‐accurate GPU model
Simulate a cycle accurate GPU model

National Tsing Hua University ® copyright OIA
National Tsing Hua University

32
M2S‐GPU Simulator (2)


HSAIL Translator
– Act as a Finalizer
– Translate HSAIL to AMD Southern 

Islands GPU binary
– Use static binary translation 
technique based on LLVM

National Tsing Hua University ® copyright OIA
National Tsing Hua University

33
M2S‐GPU Simulator (3)


M2S Bridge : An interface to launch 
M2S GPU Module
M2S GPU M d l
– Initialize the data structures used by 

AMD Southern Islands GPU, including a 
AMD Southern Islands GPU, including a
memory register for AMD Southern 
Islands GPU to access the shared system 
memory in HSAemu
memory in HSAemu
– Invoke M2S GPU Module (the AMD 
Southern Islands GPU module in 
Multi2Sim)  

National Tsing Hua University ® copyright OIA
National Tsing Hua University

34
M2S‐GPU Simulator (4)


M2S GPU Module
– A cycle‐accurate AMD Southern 

Islands GPU simulator in Multi2Sim


National Tsing Hua University ® copyright OIA
National Tsing Hua University

Memory access is performed by 
y
p
y
HSAemu memory helper function 
to comply the hUMA model
py

35
GPU Helper Functions (1)


Memory Helper Function
– A soft‐mmu of GPU with a page table 

worker and a TLB to enable hUMA model
– Support the redirect access of a local 
pp
segment memory to a non‐shared private 
memory in GPU 


Kernel Information Helper Function
K
lI f
ti H l
F ti
– Collect and return information of GPU 

s u at o a d cu e t e ecut o state
simulation and current execution state 
– Retrieve kernel information such as 
working item ID, work group size, etc, from 
AQL packet
AQL packet

National Tsing Hua University ® copyright OIA
National Tsing Hua University

36
GPU Helper Functions (2)


Mathematic Helper Function
– Simulate special mathematical instructions 

such as trigonometric instructions by 
calling the corresponding mathematical 
functions in standard library 


Synchronization Helper Function
– Barrier synchronization implementation for 

generic GPU model simulation 

National Tsing Hua University ® copyright OIA
National Tsing Hua University

37
hUMA Model in HSAemu


Unified coherent address space 
– GPU can access a  virtual memory 

page allocated by CPU


Soft‐mmu is simulated for GPU
– TLB hit/miss events can be traced



Memory segment access
– Global memory segment access is 

handled by memory helper function
– Group memory segment access is 
handled by host ld/st instructions

National Tsing Hua University ® copyright OIA
National Tsing Hua University

38
Recall: Hardware Simulation of HSAemu


HSA hardware components simulated
– Multicore CPU: A parallel multicore CPU model simulation
– Functional‐Accrate GPU: A generic GPU model simulation
– Cycle‐Accurate GPU: AMD Southern Islands GPU model 

simulation
– hUMA: A unified address space between CPU and GPU 
simulation
– Synchronization Primitive: Barrier instruction simulation
– Hardware AQL Queue: A HW dispatch queue for GPU 
simulation
i l ti

National Tsing Hua University ® copyright OIA
National Tsing Hua University

39
Recall: Software Utilities of HSAemu


HSA software utilities designed
– HAS Runtime: HSA runtime library (OpenCL runtime)
– Topology Discovery: Discover the current platform topology
– User Mode Queue: A queue for each user application
– Signal Event: Notify GPU to work
– HSAIL Generator: A PTX to HSAIL source level translator
– BRIG Generator: Generate a binary format from a Kernel file
– HSAIL Translator: Translate HSAIL to host executable binary
– GPU Code Cache: store translated host binaries

National Tsing Hua University ® copyright OIA
National Tsing Hua University

40
Performance Evaluation


Experimental Environment



Benchmarks: 
– Nearest Neightbor (NN), K‐Means, FFT, FWT, N‐Body
– Binary Search, Bitonic Sort, Reduction, FWT
y
,
,
,

National Tsing Hua University ® copyright OIA
National Tsing Hua University

41
Scalability of Fast‐GPU Simulator



Comparison of NN, K‐means and FWT benchmarks on 32 
physical cores
physical cores
The speedup is scalable when # of CU threads < # of host 
physical cores
physical cores

National Tsing Hua University ® copyright OIA
National Tsing Hua University

42
SSE Optimization of Fast‐GPU Simulator


Performance comparison of FFT when turn on/off 
SSE optimization
SSE
i i i

National Tsing Hua University ® copyright OIA
National Tsing Hua University

43
N‐Body Simulation by Fast‐GPU Simulator


N‐Body Simulation 

All of host physical 
CPUs are running

National Tsing Hua University ® copyright OIA
National Tsing Hua University

44
Comparison of HSAemu and Multi2Sim
benchmark speedup
20
18
16

Fast‐GPU Sim > M2S‐GPU sim > Multi2Sim

14
12
10
8
6
4
2
0

multi2sim
HSAemu
Hybrid

BinarySearch
1
2.931317
2.873768

BitonicSort
1
18.88827
0.921835
multi2sim

National Tsing Hua University ® copyright OIA
National Tsing Hua University

HSAemu

FastWalshTransform
1
8.645516
2.407809

Reduction
1
6.294213
2.105663

Hybrid

45
Conclusions 


An HSA‐compliant full system emulator has been 
implemented
– A functional‐accurate simulator for generic GPU model
– A cycle‐accurate simulator for AMD Southern Islands GPU 

model (from Multi2Sim)

The HSAIL Translator acts as a finalizer that enables 
the integration of HSAemu with existing simulators, 
for example, Multi2Sim
 Open source – Nov. 12, 2013
p
,



– http://hsaemu.org/

National Tsing Hua University ® copyright OIA
National Tsing Hua University

46
Future work


Enhance HSAemu by implementing more HSA 
features
f t



Integrate HSAemu with some existing cycle‐accurate 
I
HSA
ih
i i
l
GPU simulators



Design a cycle‐accurate simulator based on PQEMU 
for generic CPU model



Deisgn a cycle‐accurate simulator based on PQEMU 
for big.LITTLE CPU model

National Tsing Hua University ® copyright OIA
National Tsing Hua University

47
Q & A
Q&A
National Tsing Hua University ® copyright OIA
National Tsing Hua University

48

Contenu connexe

Plus de AMD Developer Central

Webinar: Whats New in Java 8 with Develop Intelligence
Webinar: Whats New in Java 8 with Develop IntelligenceWebinar: Whats New in Java 8 with Develop Intelligence
Webinar: Whats New in Java 8 with Develop IntelligenceAMD Developer Central
 
The Small Batch (and other) solutions in Mantle API, by Guennadi Riguer, Mant...
The Small Batch (and other) solutions in Mantle API, by Guennadi Riguer, Mant...The Small Batch (and other) solutions in Mantle API, by Guennadi Riguer, Mant...
The Small Batch (and other) solutions in Mantle API, by Guennadi Riguer, Mant...AMD Developer Central
 
TressFX The Fast and The Furry by Nicolas Thibieroz
TressFX The Fast and The Furry by Nicolas ThibierozTressFX The Fast and The Furry by Nicolas Thibieroz
TressFX The Fast and The Furry by Nicolas ThibierozAMD Developer Central
 
Rendering Battlefield 4 with Mantle by Yuriy ODonnell
Rendering Battlefield 4 with Mantle by Yuriy ODonnellRendering Battlefield 4 with Mantle by Yuriy ODonnell
Rendering Battlefield 4 with Mantle by Yuriy ODonnellAMD Developer Central
 
Low-level Shader Optimization for Next-Gen and DX11 by Emil Persson
Low-level Shader Optimization for Next-Gen and DX11 by Emil PerssonLow-level Shader Optimization for Next-Gen and DX11 by Emil Persson
Low-level Shader Optimization for Next-Gen and DX11 by Emil PerssonAMD Developer Central
 
Direct3D12 and the Future of Graphics APIs by Dave Oldcorn
Direct3D12 and the Future of Graphics APIs by Dave OldcornDirect3D12 and the Future of Graphics APIs by Dave Oldcorn
Direct3D12 and the Future of Graphics APIs by Dave OldcornAMD Developer Central
 
Introduction to Direct 3D 12 by Ivan Nevraev
Introduction to Direct 3D 12 by Ivan NevraevIntroduction to Direct 3D 12 by Ivan Nevraev
Introduction to Direct 3D 12 by Ivan NevraevAMD Developer Central
 
Holy smoke! Faster Particle Rendering using Direct Compute by Gareth Thomas
Holy smoke! Faster Particle Rendering using Direct Compute by Gareth ThomasHoly smoke! Faster Particle Rendering using Direct Compute by Gareth Thomas
Holy smoke! Faster Particle Rendering using Direct Compute by Gareth ThomasAMD Developer Central
 
Computer Vision Powered by Heterogeneous System Architecture (HSA) by Dr. Ha...
Computer Vision Powered by Heterogeneous System Architecture (HSA) by  Dr. Ha...Computer Vision Powered by Heterogeneous System Architecture (HSA) by  Dr. Ha...
Computer Vision Powered by Heterogeneous System Architecture (HSA) by Dr. Ha...AMD Developer Central
 
Productive OpenCL Programming An Introduction to OpenCL Libraries with Array...
Productive OpenCL Programming An Introduction to OpenCL Libraries  with Array...Productive OpenCL Programming An Introduction to OpenCL Libraries  with Array...
Productive OpenCL Programming An Introduction to OpenCL Libraries with Array...AMD Developer Central
 
Rendering Battlefield 4 with Mantle by Johan Andersson - AMD at GDC14
Rendering Battlefield 4 with Mantle by Johan Andersson - AMD at GDC14Rendering Battlefield 4 with Mantle by Johan Andersson - AMD at GDC14
Rendering Battlefield 4 with Mantle by Johan Andersson - AMD at GDC14AMD Developer Central
 
RapidFire - the Easy Route to low Latency Cloud Gaming Solutions - AMD at GDC14
RapidFire - the Easy Route to low Latency Cloud Gaming Solutions - AMD at GDC14RapidFire - the Easy Route to low Latency Cloud Gaming Solutions - AMD at GDC14
RapidFire - the Easy Route to low Latency Cloud Gaming Solutions - AMD at GDC14AMD Developer Central
 
Mantle and Nitrous - Combining Efficient Engine Design with a modern API - AM...
Mantle and Nitrous - Combining Efficient Engine Design with a modern API - AM...Mantle and Nitrous - Combining Efficient Engine Design with a modern API - AM...
Mantle and Nitrous - Combining Efficient Engine Design with a modern API - AM...AMD Developer Central
 
Mantle - Introducing a new API for Graphics - AMD at GDC14
Mantle - Introducing a new API for Graphics - AMD at GDC14Mantle - Introducing a new API for Graphics - AMD at GDC14
Mantle - Introducing a new API for Graphics - AMD at GDC14AMD Developer Central
 
Direct3D and the Future of Graphics APIs - AMD at GDC14
Direct3D and the Future of Graphics APIs - AMD at GDC14Direct3D and the Future of Graphics APIs - AMD at GDC14
Direct3D and the Future of Graphics APIs - AMD at GDC14AMD Developer Central
 
Vertex Shader Tricks by Bill Bilodeau - AMD at GDC14
Vertex Shader Tricks by Bill Bilodeau - AMD at GDC14Vertex Shader Tricks by Bill Bilodeau - AMD at GDC14
Vertex Shader Tricks by Bill Bilodeau - AMD at GDC14AMD Developer Central
 
GS-4106 The AMD GCN Architecture - A Crash Course, by Layla Mah
GS-4106 The AMD GCN Architecture - A Crash Course, by Layla MahGS-4106 The AMD GCN Architecture - A Crash Course, by Layla Mah
GS-4106 The AMD GCN Architecture - A Crash Course, by Layla MahAMD Developer Central
 

Plus de AMD Developer Central (20)

Webinar: Whats New in Java 8 with Develop Intelligence
Webinar: Whats New in Java 8 with Develop IntelligenceWebinar: Whats New in Java 8 with Develop Intelligence
Webinar: Whats New in Java 8 with Develop Intelligence
 
The Small Batch (and other) solutions in Mantle API, by Guennadi Riguer, Mant...
The Small Batch (and other) solutions in Mantle API, by Guennadi Riguer, Mant...The Small Batch (and other) solutions in Mantle API, by Guennadi Riguer, Mant...
The Small Batch (and other) solutions in Mantle API, by Guennadi Riguer, Mant...
 
Inside XBox- One, by Martin Fuller
Inside XBox- One, by Martin FullerInside XBox- One, by Martin Fuller
Inside XBox- One, by Martin Fuller
 
TressFX The Fast and The Furry by Nicolas Thibieroz
TressFX The Fast and The Furry by Nicolas ThibierozTressFX The Fast and The Furry by Nicolas Thibieroz
TressFX The Fast and The Furry by Nicolas Thibieroz
 
Rendering Battlefield 4 with Mantle by Yuriy ODonnell
Rendering Battlefield 4 with Mantle by Yuriy ODonnellRendering Battlefield 4 with Mantle by Yuriy ODonnell
Rendering Battlefield 4 with Mantle by Yuriy ODonnell
 
Low-level Shader Optimization for Next-Gen and DX11 by Emil Persson
Low-level Shader Optimization for Next-Gen and DX11 by Emil PerssonLow-level Shader Optimization for Next-Gen and DX11 by Emil Persson
Low-level Shader Optimization for Next-Gen and DX11 by Emil Persson
 
Gcn performance ftw by stephan hodes
Gcn performance ftw by stephan hodesGcn performance ftw by stephan hodes
Gcn performance ftw by stephan hodes
 
Inside XBOX ONE by Martin Fuller
Inside XBOX ONE by Martin FullerInside XBOX ONE by Martin Fuller
Inside XBOX ONE by Martin Fuller
 
Direct3D12 and the Future of Graphics APIs by Dave Oldcorn
Direct3D12 and the Future of Graphics APIs by Dave OldcornDirect3D12 and the Future of Graphics APIs by Dave Oldcorn
Direct3D12 and the Future of Graphics APIs by Dave Oldcorn
 
Introduction to Direct 3D 12 by Ivan Nevraev
Introduction to Direct 3D 12 by Ivan NevraevIntroduction to Direct 3D 12 by Ivan Nevraev
Introduction to Direct 3D 12 by Ivan Nevraev
 
Holy smoke! Faster Particle Rendering using Direct Compute by Gareth Thomas
Holy smoke! Faster Particle Rendering using Direct Compute by Gareth ThomasHoly smoke! Faster Particle Rendering using Direct Compute by Gareth Thomas
Holy smoke! Faster Particle Rendering using Direct Compute by Gareth Thomas
 
Computer Vision Powered by Heterogeneous System Architecture (HSA) by Dr. Ha...
Computer Vision Powered by Heterogeneous System Architecture (HSA) by  Dr. Ha...Computer Vision Powered by Heterogeneous System Architecture (HSA) by  Dr. Ha...
Computer Vision Powered by Heterogeneous System Architecture (HSA) by Dr. Ha...
 
Productive OpenCL Programming An Introduction to OpenCL Libraries with Array...
Productive OpenCL Programming An Introduction to OpenCL Libraries  with Array...Productive OpenCL Programming An Introduction to OpenCL Libraries  with Array...
Productive OpenCL Programming An Introduction to OpenCL Libraries with Array...
 
Rendering Battlefield 4 with Mantle by Johan Andersson - AMD at GDC14
Rendering Battlefield 4 with Mantle by Johan Andersson - AMD at GDC14Rendering Battlefield 4 with Mantle by Johan Andersson - AMD at GDC14
Rendering Battlefield 4 with Mantle by Johan Andersson - AMD at GDC14
 
RapidFire - the Easy Route to low Latency Cloud Gaming Solutions - AMD at GDC14
RapidFire - the Easy Route to low Latency Cloud Gaming Solutions - AMD at GDC14RapidFire - the Easy Route to low Latency Cloud Gaming Solutions - AMD at GDC14
RapidFire - the Easy Route to low Latency Cloud Gaming Solutions - AMD at GDC14
 
Mantle and Nitrous - Combining Efficient Engine Design with a modern API - AM...
Mantle and Nitrous - Combining Efficient Engine Design with a modern API - AM...Mantle and Nitrous - Combining Efficient Engine Design with a modern API - AM...
Mantle and Nitrous - Combining Efficient Engine Design with a modern API - AM...
 
Mantle - Introducing a new API for Graphics - AMD at GDC14
Mantle - Introducing a new API for Graphics - AMD at GDC14Mantle - Introducing a new API for Graphics - AMD at GDC14
Mantle - Introducing a new API for Graphics - AMD at GDC14
 
Direct3D and the Future of Graphics APIs - AMD at GDC14
Direct3D and the Future of Graphics APIs - AMD at GDC14Direct3D and the Future of Graphics APIs - AMD at GDC14
Direct3D and the Future of Graphics APIs - AMD at GDC14
 
Vertex Shader Tricks by Bill Bilodeau - AMD at GDC14
Vertex Shader Tricks by Bill Bilodeau - AMD at GDC14Vertex Shader Tricks by Bill Bilodeau - AMD at GDC14
Vertex Shader Tricks by Bill Bilodeau - AMD at GDC14
 
GS-4106 The AMD GCN Architecture - A Crash Course, by Layla Mah
GS-4106 The AMD GCN Architecture - A Crash Course, by Layla MahGS-4106 The AMD GCN Architecture - A Crash Course, by Layla Mah
GS-4106 The AMD GCN Architecture - A Crash Course, by Layla Mah
 

Dernier

Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Neo4j
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksSoftradix Technologies
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxnull - The Open Security Community
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024BookNet Canada
 

Dernier (20)

Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptxVulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
 
Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other Frameworks
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
 

HSA-4138, HSAemu – A Full System Emulator for HSA Platform, by Yeh Ching Chung and Jiun-Hung Ding