Investment banks rely extensively on compute grids to dramatically increase the throughput of their analytics calculations, especially for risk. The traditional design pattern executes compute-intensive workflows in which jobs require large data files to be moved to the compute nodes, and calculation results are written to files that are in turn consumed by the next job in the flow. Increasingly, the pattern is shifting to short-lived tasks where data is the bottleneck: the time spent moving data back and forth between compute nodes can be overwhelming, turning a compute-bound job into an I/O-bound one. For instance, real-time pricing of a financial derivative instrument may take only a few milliseconds, while the required data transfer can take hundreds of milliseconds.

The talk focuses on an architectural pattern gaining popularity: move the compute to the data. The data is partitioned in grid memory across many nodes, and each compute task is routed to the node provisioned with the right data set, based on the data hints the task provides at launch. We discuss the features of a main-memory data grid solution that uses different data partitioning policies, such as hashing or relationship-based partitioning, to manage data across a large cluster of nodes. We also discuss techniques for rebalancing data and behavior across the grid nodes to achieve the best throughput and lowest latency.
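To make the routing and partitioning ideas concrete, here is a minimal sketch using Hazelcast (3.x API) as a stand-in for the unnamed in-memory data grid; the map names, MarketDataKey, PriceTask, and pricing logic are all hypothetical. Keys are hash-partitioned by default, implementing PartitionAware gives relationship-based placement (co-locating a trade's market data with the trade itself), and submitToKeyOwner routes the pricing task to the partition owner so the computation runs next to its data.

```java
// Sketch only: Hazelcast 3.x stands in for the unnamed in-memory data grid.
// The trade/market-data layout, map names, and pricing logic are hypothetical.
import java.io.Serializable;
import java.util.concurrent.Callable;
import java.util.concurrent.Future;

import com.hazelcast.core.Hazelcast;
import com.hazelcast.core.HazelcastInstance;
import com.hazelcast.core.HazelcastInstanceAware;
import com.hazelcast.core.IExecutorService;
import com.hazelcast.core.IMap;
import com.hazelcast.core.PartitionAware;

public class ComputeToDataSketch {

    /** Market-data key that co-locates with its trade (relationship-based partitioning). */
    static class MarketDataKey implements PartitionAware<String>, Serializable {
        final String tradeId;
        MarketDataKey(String tradeId) { this.tradeId = tradeId; }
        // Partition on the trade id rather than this key's own hash, so the
        // trade entry and its market data land in the same partition.
        @Override public String getPartitionKey() { return tradeId; }
        @Override public boolean equals(Object o) {
            return o instanceof MarketDataKey && ((MarketDataKey) o).tradeId.equals(tradeId);
        }
        @Override public int hashCode() { return tradeId.hashCode(); }
    }

    /** Pricing task shipped to the member that owns the trade's partition. */
    static class PriceTask implements Callable<Double>, Serializable, HazelcastInstanceAware {
        final String tradeId;
        private transient HazelcastInstance hz;
        PriceTask(String tradeId) { this.tradeId = tradeId; }
        @Override public void setHazelcastInstance(HazelcastInstance hz) { this.hz = hz; }
        @Override public Double call() {
            // Both reads are partition-local thanks to the shared partition key:
            // only the small task object crossed the wire, not the data set.
            Object trade = hz.getMap("trades").get(tradeId);
            Object md = hz.getMap("marketData").get(new MarketDataKey(tradeId));
            return 42.0; // placeholder for the real pricing calculation
        }
    }

    public static void main(String[] args) throws Exception {
        HazelcastInstance hz = Hazelcast.newHazelcastInstance(); // one member; start more for a real cluster

        IMap<String, String> trades = hz.getMap("trades");        // hash-partitioned by default
        IMap<MarketDataKey, String> md = hz.getMap("marketData"); // affinity-partitioned via the key
        trades.put("T-42", "5y EUR swap");
        md.put(new MarketDataKey("T-42"), "EUR curve snapshot");

        // "Move the compute to the data": the trade id is the data hint,
        // and the grid routes the task to the node owning that partition.
        IExecutorService pricing = hz.getExecutorService("pricing");
        Future<Double> pv = pricing.submitToKeyOwner(new PriceTask("T-42"), "T-42");
        System.out.println("PV = " + pv.get());
        hz.shutdown();
    }
}
```

The design point is that the task carries only a key, so the per-task network cost stays in the microsecond-to-millisecond range regardless of how large the co-located data set is.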
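On rebalancing: grids in this class migrate partitions automatically when members join or leave, and the routed work follows the data. A listener like the following (again Hazelcast 3.x, as an assumed stand-in) lets you observe that movement:

```java
// Sketch only (Hazelcast 3.x): observe the automatic partition rebalancing
// that redistributes data -- and therefore routed compute -- across members.
import com.hazelcast.core.Hazelcast;
import com.hazelcast.core.HazelcastInstance;
import com.hazelcast.core.MigrationEvent;
import com.hazelcast.core.MigrationListener;

public class RebalanceWatcher {
    public static void main(String[] args) {
        HazelcastInstance hz = Hazelcast.newHazelcastInstance();
        hz.getPartitionService().addMigrationListener(new MigrationListener() {
            @Override public void migrationStarted(MigrationEvent e) {
                System.out.println("partition " + e.getPartitionId() + " moving "
                        + e.getOldOwner() + " -> " + e.getNewOwner());
            }
            @Override public void migrationCompleted(MigrationEvent e) {
                System.out.println("partition " + e.getPartitionId() + " moved");
            }
            @Override public void migrationFailed(MigrationEvent e) {
                System.out.println("partition " + e.getPartitionId() + " migration failed");
            }
        });
        // Start or stop other members and watch partitions (and the tasks
        // routed to them) rebalance across the cluster.
    }
}
```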