SlideShare une entreprise Scribd logo
1  sur  26
Online Demo: Accelerate Your Video Format Conversion Using the 1080p Video Framework 2009 Video Framework Online Series Part 1: Upgrade Your Broadcast System to PCIe Gen2 Part 2: Remove the External Memory Bottleneck in  Your Video Design
Agenda ,[object Object],[object Object],[object Object],[object Object]
Video Framework Target Apps ,[object Object],[object Object],[object Object],[object Object],Need for an efficient memory arbiter and controllers Video servers Switchers Video I/O cards Digital video effects Multiviewers Monitors Video walls Dramatic rise in bandwidth Image size Frame size: (Total # of pixels) Frame size: (Assume 30 bits per pixel) Data rate: (Assuming 60FPS) 1920x1080p 1920 x 1080 = 2.08M pixels 62 Mbits or 7.78 MBytes 3,732 Mbps 1920x1080i 1920 x 1080 x 0.5 = 1.04M pixels 31 Mbits or 3.89 MBytes 1,866 Mbps 1920x720p 1280 x 720 = 921K pixels 27.7 Mbits or 3.46 MBytes 1,659 Mbps SD 1920x480i 720 x 480 x 0.5 = 173K pixels 5.2 Mbits or 0.65 MBytes 311 Mbps
Current Broadcast Video Solution (UDX3.0) Memory Subsystem Two-Channel Format Conversion, OSD, Nios II Processor UDX3.0 memory access efficiency is > 42.35/51.2 = 82.7% DDR3 SDRAM (51.2 Gbps) Mem ctrl Master1 Master2 Master3 Master 18 … .... 400 MHz * 64 bits * 2 UDX3.0 memory bandwidth:   42.35 Gbps 256-bit wide System Memory Breakdown MA deinterlacer1: 10.83 Gbps MA deinterlacer2: 10.83 Gbps Frame buffer1:  7.64 Gbps Frame buffer1:  7.64 Gbps OSD:   4.9 Gbps Nios II  ®  processor:  0.5 Gbps Total:   42.35 Gbps
Altera’s Multi-Port Front End Memory Controller  ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Master 1 Master 2 Master 3 Master 4 Video datapath Video in Video out Time-critical data slave Time-critical data slave Time-critical data slave Time-critical data slave Non-critical data slave Non-critical data slave Non-critical data slave Non-critical data slave Multi-port front end Master port DDRx  SDRAM Memory  controller Nios II Processor Width-adapting block SOPCB Pipeline bridge Write master Read master SG DMA for the OSD Data master Instr. master
Video Framework With Multi-Port Front End DVI/HDMI SDI Format conversion BT656 Deinterlacer (VIP) Deinterlacer (VIP) Scaler (VIP) Scaler (VIP) Input CODEC SDI BT656 Output DVI/HDMI Audio Sample rate converter  CODEC Audio Video Delay frame sync Ethernet PCIe TS  multiplexer Video over IP PCIe Deinterlacer (VIP) Scaler (VIP) Altera IP Custom IP  DDR2/3 Memory control DMA control DMA control Display  port Deinterlacer (VIP) Scaler (VIP) Display port Third-party IP PCIe PCIe MPEG2 H.264 JPEG2000 Custom video functions
Video Framework: SOPC/Avalon-ST Video   Avalon ® -ST Video SOPC-ready function SOPC  Builder ready SDI/HDMI/DP Clocked video input AV-ST-V vip1 AV-ST-V vip2 AV-ST-V Clocked video output SDI/HDMI/DP
Example: Multi-Channel Format Conversion ASSP based FPGA +  video framework  SDI Tx SDI Rx Format  conversion ASSP DDR memory Channel 1 ,[object Object],[object Object],[object Object],DDR memory SDI SDI Multi-channel format  conversion SDI 1 SDI 2 SDI 1 SDI 2 Memory control ,[object Object],[object Object],[object Object],[object Object],SDI Tx SDI Rx Format  conversion ASSP DDR memory Channel 1
Devices’ Memory Bandwidth for Video Processing   *Maximum memory bandwidth per side using DDR3 interface on the largest FPGA in the family ** Altera FPGAs also support RLDRAM and QDR II Device DDR3 DDR2 DDR Max. memory bandwidth/ side * Arria ®  II GX FPGA 600 Mbps 300 MHz 600 Mbps 300 MHz 400 Mbps 200 MHz 57 Gbps Stratix ®  IV GX FPGA 1,067 Mbps 533 MHz 800 Mbps 400 MHz 400 Mbps 200 MHz 153 Gbps HardCopy ®  IV GX ASIC 1,067 Mbps 533 MHz 667 Mbps 333 MHz 400 Mbps 200 MHz 153 Gbps
Example: Format Conversion Using Video Framework and FPGAs SD/HD/3G-SDI CLIP Motion adaptive Polyphase scaler  6x6 taps, 4:4:4 mode Deinterlacer 4:2:2 Frame rate conversion SCL Frame buffer DDR2 HP memory Avalon-ST video Clocked video Nios II processor Run-time configuration through Nios II processor CRS 4:2:2 to 4:4:4 CRS 4:4:4 to 4:2:2 Interlacer 4:2:2 Res.: 480i to 1080p SDI SD/HD/3G-SDI SD/HD/3G-SDI CLIP Motion adaptive Polyphase scaler  6x6 taps, 4:4:4 mode Deinterlacer 4:2:2 Frame rate conversion SCL Frame buffer CRS 4:2:2 to 4:4:4 CRS 4:4:4 to 4:2:2 Interlacer 4:2:2 SDI SD/HD/3G-SDI CVI CVO CVI CVO
Remove the External Memory Bottleneck in Your Video Design
Efficient Memory Access Share ,[object Object],[object Object],[object Object],[object Object],Y Cb Y Cr SD, HD, 3G SDI  Chroma resampler  MA deinterlacer DDRx memory SDI  Nios II SG-DMA SDI  SDI  Alpha blending mixer  DVI Tx Frame buffer Color space converter Background Scaler B G R Frame reader  Frame buffer Chroma resampler  Weave deinterlacer Scaler Color space converter SD, HD, 3G Multiple bus masters  access single external memory Input switch
Generate a Multi-Port Front End in SOPC Builder SDI  Chroma resampler SDI Frame buffer DDRx memory controller Interlacer Scaler MA deinterlacer  Chroma resampler SDI Frame buffer Interlacer Scaler MA deinterlacer  ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],SDI
Generate a Multi-Port Front End in SOPC Builder ,[object Object],MA deinterlacer (bank0) Frame buffer (bank 1)  SOPC Builder switch fabric Pipeline bridge  Altera DDRx High-Performance Memory Controller II Pipeline bridge  Pipeline bridge  Pipeline bridge  MA deinterlacer (bank2) Frame buffer (bank 3)  Multi-port front end to DDRx memory controller built in SOPC Builder Multi-port front end connects directly to the Altera DDRx  High-Performance Memory Controller I/II
Generate a Multi-Port Front End in SOPC Builder ,[object Object],[object Object],Separate data processing and memory clock domain Long bursts (32, 64, 128) within a row minimizes the penalty for switching rows  Large buffers;  on-chip buffers twice the burst size
[object Object],[object Object],[object Object],Generate a Multi-Port Front End in SOPC Builder Refer to DDR2 High-Performance Memory Controller User Guide to map local address to bank address Set the base address of each frame buffer and deinterlacer to a different bank
Generate a Multi-Port Front End in SOPC Builder ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Multi-Port Front End Component ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Multi-Port Front End Component ,[object Object],[object Object]
Multi-Port Front End Component ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Multi-Port Front End Component ,[object Object],[object Object],Write master Read master Width-adapting block Video datapath Master 1 Master 2 Master 3 Master 4 Video in Video out Time-critical data slave Time-critical data slave Time-critical data slave Time-critical data slave Multi-port front end Non-critical data slave Non-critical data slave Non-critical data slave Debug slave SG DMA for the OSD Master port DDRx SDRAM memory controller SOPCB pipeline bridge Data master Instr. master Nios II processor
Multi-Port Front End Component ,[object Object],[object Object],Time-critical data slave Time-critical data slave Time-critical data slave Time-critical data slave Debug slave Non-critical data slave Non-critical data slave Master port DDRx SDRAM memory controller 1 4 3 2 ? 1 3 2 Non-critical data slave
UDX3 Reference Design ,[object Object],[object Object],[object Object],[object Object],[object Object],Frame buffer Frame reader DMA controller SOPC Builder switch fabric Nios II processor DDR3 HP memory controller MA deinterlacer Frame buffer MA deinterlacer Clock crossing bridge Master Memory access efficiency (%) (Bandwidth required/theoretical max bandwidth)*100 Masters sharing external memory 92% Video, Nios II program code, and 60 fps on-screen display sharing external memory
Summary  ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Next Steps ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Thank You For more information, visit www.altera.com

Contenu connexe

Dernier

Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfRankYa
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DayH2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DaySri Ambati
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 

Dernier (20)

Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdf
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DayH2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 

En vedette

2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by HubspotMarius Sescu
 
Everything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTEverything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTExpeed Software
 
Product Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsProduct Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsPixeldarts
 
How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthThinkNow
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfmarketingartwork
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024Neil Kimberley
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)contently
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024Albert Qian
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsKurio // The Social Media Age(ncy)
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Search Engine Journal
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summarySpeakerHub
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next Tessa Mero
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentLily Ray
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best PracticesVit Horky
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project managementMindGenius
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...RachelPearson36
 

En vedette (20)

2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot
 
Everything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTEverything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPT
 
Product Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsProduct Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage Engineerings
 
How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental Health
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
 
Skeleton Culture Code
Skeleton Culture CodeSkeleton Culture Code
Skeleton Culture Code
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
 

Remove the External Memory Bottleneck in Your Video Design

  • 1. Online Demo: Accelerate Your Video Format Conversion Using the 1080p Video Framework 2009 Video Framework Online Series Part 1: Upgrade Your Broadcast System to PCIe Gen2 Part 2: Remove the External Memory Bottleneck in Your Video Design
  • 2.
  • 3.
  • 4. Current Broadcast Video Solution (UDX3.0) Memory Subsystem Two-Channel Format Conversion, OSD, Nios II Processor UDX3.0 memory access efficiency is > 42.35/51.2 = 82.7% DDR3 SDRAM (51.2 Gbps) Mem ctrl Master1 Master2 Master3 Master 18 … .... 400 MHz * 64 bits * 2 UDX3.0 memory bandwidth: 42.35 Gbps 256-bit wide System Memory Breakdown MA deinterlacer1: 10.83 Gbps MA deinterlacer2: 10.83 Gbps Frame buffer1: 7.64 Gbps Frame buffer1: 7.64 Gbps OSD: 4.9 Gbps Nios II ® processor: 0.5 Gbps Total: 42.35 Gbps
  • 5.
  • 6. Video Framework With Multi-Port Front End DVI/HDMI SDI Format conversion BT656 Deinterlacer (VIP) Deinterlacer (VIP) Scaler (VIP) Scaler (VIP) Input CODEC SDI BT656 Output DVI/HDMI Audio Sample rate converter CODEC Audio Video Delay frame sync Ethernet PCIe TS multiplexer Video over IP PCIe Deinterlacer (VIP) Scaler (VIP) Altera IP Custom IP DDR2/3 Memory control DMA control DMA control Display port Deinterlacer (VIP) Scaler (VIP) Display port Third-party IP PCIe PCIe MPEG2 H.264 JPEG2000 Custom video functions
  • 7. Video Framework: SOPC/Avalon-ST Video Avalon ® -ST Video SOPC-ready function SOPC Builder ready SDI/HDMI/DP Clocked video input AV-ST-V vip1 AV-ST-V vip2 AV-ST-V Clocked video output SDI/HDMI/DP
  • 8.
  • 9. Devices’ Memory Bandwidth for Video Processing *Maximum memory bandwidth per side using DDR3 interface on the largest FPGA in the family ** Altera FPGAs also support RLDRAM and QDR II Device DDR3 DDR2 DDR Max. memory bandwidth/ side * Arria ® II GX FPGA 600 Mbps 300 MHz 600 Mbps 300 MHz 400 Mbps 200 MHz 57 Gbps Stratix ® IV GX FPGA 1,067 Mbps 533 MHz 800 Mbps 400 MHz 400 Mbps 200 MHz 153 Gbps HardCopy ® IV GX ASIC 1,067 Mbps 533 MHz 667 Mbps 333 MHz 400 Mbps 200 MHz 153 Gbps
  • 10. Example: Format Conversion Using Video Framework and FPGAs SD/HD/3G-SDI CLIP Motion adaptive Polyphase scaler 6x6 taps, 4:4:4 mode Deinterlacer 4:2:2 Frame rate conversion SCL Frame buffer DDR2 HP memory Avalon-ST video Clocked video Nios II processor Run-time configuration through Nios II processor CRS 4:2:2 to 4:4:4 CRS 4:4:4 to 4:2:2 Interlacer 4:2:2 Res.: 480i to 1080p SDI SD/HD/3G-SDI SD/HD/3G-SDI CLIP Motion adaptive Polyphase scaler 6x6 taps, 4:4:4 mode Deinterlacer 4:2:2 Frame rate conversion SCL Frame buffer CRS 4:2:2 to 4:4:4 CRS 4:4:4 to 4:2:2 Interlacer 4:2:2 SDI SD/HD/3G-SDI CVI CVO CVI CVO
  • 11. Remove the External Memory Bottleneck in Your Video Design
  • 12.
  • 13.
  • 14.
  • 15.
  • 16.
  • 17.
  • 18.
  • 19.
  • 20.
  • 21.
  • 22.
  • 23.
  • 24.
  • 25.
  • 26. Thank You For more information, visit www.altera.com

Notes de l'éditeur

  1. Hello and welcome to webcast on how to Remove the External Memory Bottleneck in Your video design, brought to you by Altera Corporation. My name is Girish Malipeddi, technical marketing manager at Altera focused on broadcast and video applications. Today I will be discussing the second part of the video framework series - Remove the External Memory Bottleneck in Your video design. Please refer to the earlier webcast, Upgrade Your Broadcast System to PCIe Gen2, on Altera.com for the first part of this series.
  2. First, we will look at the Video processing applications and their memory bandwidth requirement challenges. Next, we will discuss altera’s video framework that simplifies hardware design for video systems. Then, Pete Brookes, Senior Engineering Manager of the Multimedia System Solutions Group will discuss technical details of the multiport front end memory controller and how the users can use it improve the efficiency of their external memory system. Finally, we will look at next steps you can take to find out more and evaluate this solution.
  3. Altera’s Video framework targets variety of broadcast application including Video servers, switchers, Video Input and Output cards, Video effects cards, Multiviewers and Broadcast Monitoring applications. All these applications have common requirements – Format conversion handling data upto full HD data rates. The multichannel applications such as switchers, multiviewers and video effects cards also need to implement multiple channels of video processing handling full HD and channel sync. All these video functions are memory hungry as they need to cache multiple frames of video for manipulation at the same time the data per frame significantly increases as you go from SD to HD to full HD as you can see in the table here hence the need for an efficient memory arbiter and controller for getting the most of the external memory available in the system. Having an efficient memory controller with high performance interfaces can reduce the external memory requirements costs and hence the system costs.
  4. Here we show memory bandwidth requirement for 2-channel video format conversion. The estimates on this slide are based on Altera 2-ch format conversion reference design which includes OSD and an on chip processor and does basic frame rate conversion. One of the versions of the design has 18 masters accessing the external memory and uses Altera’s hi-performance DDR3 memory controller. The design is implemented on a Stratix IV GX FPGA development kit. The memory interface is running at 400Mhz, DDR3, 64-bit wide for maximum bandwidth of 51.2Gbps. The estimated memory bw requirements for the design as you see in the table here is about 42.35 Gbps. So in effect, the memory interface has to be at least 82.7% efficient to ensure successful implementation of the video system. The newer version of this design requires even higher memory bw efficiency in excess of 90%. This is a significant challenge which led us to build an memory arbiter in which the user can set priorities and weights on the masters in order to avoid starving critical masters. This memory controller and arbiter are targeted broadcast system but can be leveraged at other applications.
  5. Here’s an overview altera’s multiport front end memory arbiter solution which addresses the aforementioned challenges of building a video system. The multi-port front end for the memory controller today supports 16 masters but can easily be exapanded to more, and users can assign masters as time crictical or non-critical masters. It also allows round-robin weighted arbitration in which masters are allowed to share available bw at user set ratios Altera reference design UDX3.0 shows that you can achieve > 90% efficiency and support various memory types such as DDR2, DDR3 and RLDRAM Finally the design is available as reference design upon request as part of the UDX3.0 design
  6. This diagram shows a typical broadcast system and solutions that exist from Altera and its 3 rd party network. SDI is the primary I/O standard for broadcast; but other standards co-exist as well. For example, many multiviewers have DVI or HDMI outputs today and may add Display Port in the future. Today Altera hardware supports SDI, DVI and Display port interfaces and has IP to enable the support for these interaces More and more, studios want all their equipment to support any format; Thus, format conversion is moving from an optional feature to a standard requirement on all I/O. To implement Format conversion, altera offers a comprehensive set of images processing functions such as scaler, deinterlacer, color space conversion and others. More details on Altera;s Video and image processing suite of fucntions can be found on Altera.com Video Codecs are implemented in FPGAs in video server, contribution encoders, distribution encoders, and IRDs. Altera has a network of companies offering solutions in that space. A number of solutions for Audio exist, including audio de-embed, sample rate converter. Altera has been a leader in enabling video over IP in studio and headend applications, first introducing its first video over IP reference design in 2006. As we talked in earlier slides, all video systems require high performance multi-master memory controller. This webcast primarily focuses on the high performance Multi port front end memory controller.
  7. Altera has various video and image processing functions as seen in here in the green box. They include scaler, deinteracer and other functions required to implement format conversion. Altera also introduced an open standard called Avalon Streaming Video. It defines a connectivity and lightweight protocol to enable interoperability for functions developed by different design teams. Avalon streaming video is the basis of a video framework that speeds video design. Altera also offers two key IO blocks namely CVI and CVO as seen at the bottom of the green box. These functions make it easy to get any external video interface such as SDI, HDMI or display port into the avalon streaming video format. There are also a number of 3 rd party video processing cores that support Avalon Video Streaming. Altera has a tool namely SOPC builder that recognizes this standard and enables integration of this blocks through a graphical tool. This tool creates interconnect that is correct by construction. This simplifies FPGA design for video systems.
  8. Using off the shelf ASSPs to implement format conversion or multi-channel video processing has many limitations. They typically don’t scale well, more channels leads to more components on board and very soon the board can get ugly. Also systems designers are stuck with very limited memory type such DDR1 or DDR2 support and not necessarily the latest or the most cost effective memory solutions. Lastly, as the memory cannot be shared between channels, it can result in very inefficient and costly memory sub system. On the other hand using video frameowrk and FPGAs, you can easily integrate a lot of external components into the fpga – such as the SDI serializer and deserializer. Mutilple channels of format convaerion can be implemented in one FPGA and a signle external memroy sub system that can be shared across multiple channels.
  9. There are several different choices of FPGAs for video processing applications. As an example, Altera’s 40-nm based Stratix IV GX and Arria II GX devices support various memory interface standards such as DDR3, DDR2, DDR, RLDRAM, QDR II. You can see that performace of the external interface are the best in the industry. For example DDR3 533 MHz external memory performance is the FPGA industry’s highest performance. The startix IV GX family can effectively support external memory bw in excess of 150 Gbps per side Altera’s 40-nm Arria II GX family delivers cost and power-optimized silicon formulti-channel video processing, featuring up to 16 3.75 Gbps transceivers and 300 MHz external memory performance. The arris II GX device can effectively sypport external mmeory bw in excess of 50 Gbps. The high performance external memory interfaces makes it easy to implement memory hungry video processing application such as format conversion.
  10. Format conversion is a design that Altera developed which leverages this video framework plus internally developed video processing blocks to implement up/down/cross conversion. This design features two channels with support from 480i up to 1080p with full motion adaptive deinterlacing. A risc processor embedded in the FPGA is called Nios and it allows run-time configuration of all of the functions. The design integartes SDI functionality eliminating the need for SDI receivers and transmitters. This designs also offers efficient memory arbiter which enables memory sharing and scalability to multiple channels eliminating the need for separate memories for separate channels. For example using the MPFE and the video framework, you can implement this design in an Arria II GX device making it very cost effective versus other solutions. Also, using the video framework you can easily scale this solution to multiple channels in larger FPGA without increasing the board complexity linearly. Next Pete Brookes will provide more details on the Altera’s multi-port front end memory controller.
  11. Thank you Girish. Hello, my name is Peter Brookes, and I’m going to describe how to design video processing systems which efficiently share access to external memory.
  12. As Girish mentioned, a typical multi-channel SD/HD or 3G video format conversion system will contain multiple bus masters sharing access to a single external memory, such as DDR2 or DDR3. This is often because of the temporal nature of the video processing functions or to synchronize multiple video channels. By using system level design tools and off the shelf parameterizable IP you can easily and efficiently utilise available external memory bandwidth and focus an increased amount of engineering development time on *your* differentiating technology. I will be describing two approaches to designing a video memory system, showing how each solution works, their best use cases and their limitations. The first approach uses the system level tool, SOPC Builder, to automatically generate an application specific bus interconnect. The second approach shows how the SOPC Builder bus interconnect can easily be replaced by a Multi-Port Front End component to improve performance and support more complex memory access patterns.
  13. Let’s start with the first approach where the interconnect is generated by SOPC Builder. To demonstrate the solution we’ve developed a reference design, called V2, which performs high-quality up, down, and cross conversion of two SD, HD and 3G video streams in interlaced or progressive format . The V2 design uses the Altera Video and Image Processing Suite of IP Functions as well as triple rate SDI and the High Performance DDR2 Memory Controller. The design provides an easy starting p oint for further customisation. For example, you could replace one of Altera’s video processing functions with your own differentiating IP and re-use the memory infrastructure and video interfaces. In V2 there are 14 bus masters which read and write video data from and to a single DDR2 64 bit wide external memory. The external memory is clocked at 266 Mhz providing a theoretical bandwidth of 34 GigaBits per second. In this design, deinterlacing two 1080i60 video sources requires a bandwidth of 26 GBit/s, which equates to a memory access efficiency requirement of 77.4%. Each master also has a memory access latency requirement which must be satisfied to ensure video is always streamed at the required rate.
  14. In V2, the MPFE is constructed from the switch fabric generated by SOPC Builder that connects the video store bus masters to the memory controller and bridge components in the system. The generated interconnect includes the address decode logic and the logic to arbitrate between multiple masters requesting access to a single external memory component. The slave side arbitration scheme is round robin, so when multiple masters contend for access to the memory slave, the arbiter grants shares in round robin order. The pipeline bridge components are inserted into the design for two reasons. The bridges insert registers in the path between its master and slave so can help reduce register to register delay and improve system FMax. Secondly, the pipeline bridges are also used to control the topology of the generated interconnect and the arbitration. Specifically this provides the control to group masters that access a particular memory bank. This allows the arbitration algorithm to visit each pipeline bridge in turn, servicing a different bank each time. The DDR2 memory controller provides a simplified interface to industry standard DDR2 memory and instantiates the external memory PHY interface. The goal of the MPFE is really to reduce the impact of memory data management commands, such as activate and pre-charge cycles. We can achieve this in a number of ways.
  15. In this design we co nfigure the bus masters associated with the frame buffer and deinterlacer to perform large burst transfers of length 64 with the initial transfer aligned to the start of a memory row. B y making the burst size a factor of the row size you ensure that a burst doesn’t cross a row boundary, which minimizes the penalty for switching rows. This requires large on-chip buffers (inside the deinterlacer and frame buffer) to create and receive the burst. B y making the on-chip buffers twice the burst size the video function can process a burst whilst transferring another to/from memory. This allows the video processing path to cope with the longer latency caused by waiting for access to the memory controller. Also using a separate clock domain for the memory system, with sufficient on-chip buffering, allows the memory to be run at a higher rate without being limited by the speed of the datapath; this allows more masters to be handled efficiently.
  16. We can further minimize the penalty for switching rows in a bank, by overlapping the bank management commands of one bank with the data transfer to/from another bank. We achieve this by setting the base address, at compile time, of each frame buffer and deinterlacer to a different memory bank, and by grouping the masters that are on the same bank together using a pipeline bridge. The round robin arbitration scheme of SOPC Builder will then switch between the pipeline bridges enabling bank interleaving. The Altera DDR2 High Performance memory controller user guide describes which bits, set in a local address, map to a particular memory bank. Efficient DDR High Performance Memory Controllers improve this further by providing predictive bank management, which allows the bank management commands to be issued earlier.
  17. This solution works well for video processing systems when the memory bandwidth requirements for each master or group of masters are equal. This is because of the round robin arbitration logic in the system interconnect. However, when the bandwidth requirements of the masters or groups of masters are not balanced, or when frame store functions share memory banks, it becomes difficult to satisfy both the bandwidth and latency requirements of all the masters in the system. The solution is also well suited to systems where the read and write data transfers can be performed in large bursts such as streaming video data. This system does not provide efficient memory access for bus masters with random memory access patterns such as a processor performing a cache line fill or a scatter gather DMA controller copying graphic sprites around in memory. Typically, for video systems with 14 or more masters, the auto-generated switch fabric can achieve an FMax of up to 160MHz on a Stratix III or Stratix IV device. For the V2 reference design, reducing the external memory clock frequency shows the method can achieve up to 88.5% memory access efficiency.
  18. A second system design method to efficiently share memory bandwidth is to use a dedicated Multi-Port Front End IP component. The MPFE is optimized for high data rate video processing applications, as in the previous approach, but also includes support for small, random address accesses. The component has been designed to specifically target video processing applications and has been proven using a representative reference design, UDX3. The MPFE includes a multi-class weighted arbitration scheme that allows you to control the traffic flow to and from the external memory interface. This enables efficient video systems to be designed when bus masters have *different* bandwidth requirements. This also simplifies the design methodology by often removing the need for pipeline bridge components to control the topology of the system. Also , by defining which ports are critical, you can ensure that the time-critical masters in your system, such as video functions, have priority over other less time sensitive blocks. Another important consideration is the maximum clock rate that can be achieved by the system interconnect, because the system critical path can often be in the switch fabric. The MPFE component supports clock rates over 200MHz on a Stratix IV GX C3 speed grade device, which allows you to connect it directly to, say, a DDR3 memory controller in half rate mode with a memory clock frequency of more than 400MHz.
  19. The MPFE component replaces the auto-generated SOPC Builder interconnect logic, including the round robin arbitration scheme. In SOPC Builder the component is selectable from the Component Library, and can be parameterised to suit the requirements of the application. For example, the component can be configured with the number and width of the slave ports, and the maximum supported burst size. Because the component is available in the SOPC Builder environment, it allows easy connection to the Altera Video and Image Processing Suite, DMA controllers and the Altera High Performance Memory Controller. However, if you want to use the MPFE component outside of the SOPC Builder environment, you can. You can parameterise, instantiate, and connect the component directly in your HDL.
  20. The MPFE can be configured with up to 16 Avalon Memory Mapped read or write ports. Each slave port can then be connected to the system bus masters. The slave port data width is configurable at compile time, upto 512 bits. When developing complex video systems it is very important that the user can monitor the system performance; this is particularly true when considering multi-master access to external memory. The MPFE provides visibility of the behaviour of the arbitration at run time by exposing useful count and wait data. For example, the number of times a slave has been granted, the number of words of read data a slave has received, or the worst case number of cycles a slave has had to wait between requesting and being granted. This data can allow the user to tune the performance of their system. The MPFE, which connects directly to the Altera High Performance Memory Controller, is available as clear text RTL as part of the UDX3 reference design.
  21. Each of the slave ports can be configured to be either time critical or non critical. By defining which ports are critical, you can ensure that the time-critical masters in your system, such as video blocks, have priority over other less time sensitive blocks. It also allows these non-critical masters to use any available bandwidth in times when the critical masters are not requesting access. The video processing function masters will typically be defined as critical bus masters. The processor instruction and data masters as well as the scatter gather DMA masters for On Screen Display are considered non-critical.
  22. Whenever there are no time-critical requests outstanding, the arbiter will accept the next pending transaction from the non-critical ring. Once that has been serviced, the arbiter will check to see if there are any new requests on the time-critical ring and will return to service that request. If more than one request is present on the time-critical ring, the arbiter will continue to service them. Once all the pending time-critical requests have been serviced, the arbiter will switch back to servicing the non-critical ring. The MPFE uses an enhancement of a weighted round-robin scheme to share the available bandwidth between the slave ports at the ratios set by the user. The Bandwidth Settings tab, in the parameterization GUI, allows you to control the ratio of bandwidth that each port is given. By setting weights for each port, you can restrict how often a port is allowed access to the external memory interface. If you assign larger numbers to a port, it will be allowed a larger proportion of the external memory bandwidth, while smaller numbers will allow a port less bandwidth. The arbiter will cycle around the slave ports, granting each the ability to issue a read or write burst to the memory controller. The arbiter will continue to go around the ring, servicing slave ports that have not exceeded their bandwidth allowance. The MPFE uses a sliding window bandwidth allocation system to distribute the accesses more evenly to reduce the worst case latency. This in turn means less buffering in the user’s design.
  23. The UDX3 reference design is implemented on a Stratix IV GX230 device, and demonstrates both high-quality up, down, cross conversion and on screen display. There are a total of 19 bus masters sharing one 64-bit DDR3 Memory. The Altera High Performance DDR3 memory controller is running in half rate mode with a memory clock frequency of 400 MHz. As you can see in the diagram, the bus masters connected to the MPFE perform a combination of video frame buffering, processor instruction and data bus accesses and DMA reads and writes for on-screen display. The performance capability of the MPFE is demonstrated by increasing the rate of the OSD layer written to external memory. To write the OSD layer to external memory at 60 frames per second, a memory access efficiency of 92% is required. Memory access efficiency is again defined as the ratio between the required bandwidth and the theoretical maximum memory bandwidth. The MPFE satisfies this requirement and ensures that all the critical masters satisfy their read and write latency requirements.
  24. So, in summary, a multi-port front end to the Altera High Performance Memory Controller has been designed targeting video processing applications. This solution enables memory access efficiency of over 90% even when combining buffering video data in large bursts and performing random memory accesses. Reference designs demonstrate how to use the solutions today and provide a great starting point for further system development. The V2 design shows how to use SOPC Builder to generate an efficient interconnect for high data rate video streaming applications. The MPFE component, including parameterization GUI and HDL source code, is available as part of the UDX3 reference design. The UDX3 design shows how to use this component in a complex, multi-channel video system. I’ll now hand you back to Girish for the remainder of the presentation….
  25. You can go to Altera’s website to get access to the design for evaluation. You can also download user guides and purchase audio video development kits at the links provided here. As mentioned during the webcast, this design runs on Stratix IV GX development kit and can be purchased at the link provided. There are also similar designs targeting Arria II GX audio video kit.