SlideShare une entreprise Scribd logo
1  sur  13
Télécharger pour lire hors ligne
August 2013

[research overview]
University tutor:

Università di Catania

A distributed algorithm for

Medilink srl

STATELESS LOAD
BALANCING

Eng. A. Maddalena

Abstract: Distributing data-packets on stations with

Dipartimento di Ingegneria Elettrica,
Elettronica e Informatica

Prof. Eng. O. Tomarchio
Company supervisor:
Team Leader - R&D Manager

Università di Catania

scalable and optimal store and retrival functionalities.
Ensuring load balance without collecting load-info
from stations.

Dr. A. Tino

Keywords: Distributed-Systems, Algorithms, Big-Data,
Cloud, Balancing

Trainee:

Facoltà di Ingegneria Informatica
Specialistica

Medilink srl

Sezione Ricerca e Sviluppo
August 2013

PROBLEM DESCRIPTION
Many stations & data to store. Data can be fregmented into little units (packets) and sent to
stations. When balancing load, some problems occur.

problems

solutions of modern algorithms

Which station to choose for a packet?

Basing on info collected from stations or by
uniformly distributed random algorithms.

How to send a packet to a station?

IP address database, centralized
solutions, distributed ip tables.

How to retireve a packet? How to locate
the station it is stored in?

Need to memorize couple (packet-id,
station-id) after choosing dst station.

How to balance packets among
different stations?

Round-robin (stateless) approaches or
basing on station loads.

Medilink srl

Sezione Ricerca e Sviluppo

Tutor:

Prof. Eng. Orazio Tomarchio
DIIEI
Università di Catania

Supervisor:

Eng. Andrea Maddalena
Software Development
Medilink srl

Research trainee:

Dr. Andrea Tino
Università degli Studi di Catania
Ingegneria Informatica
August 2013

DEFINING TARGETS
What we want to find is an algorithm for load balancing able to reach the following objectives.

distributed system
No centralization. If one station
faults, the system will still be
running. Performance decay is,
however, allowed.

The algorithm does not need any
info regarding station current load
to perform station selection.

Sezione Ricerca e Sviluppo

When retrieving a packet from a
station, the process must be the
most efficient possible.

scalability

stateless

Medilink srl

packet lookup

Tutor:

Prof. Eng. Orazio Tomarchio
DIIEI
Università di Catania

The architecture must be scalable.
More stations can be added (also at
runtime). Detached stations must
not cause the system to fault.
Supervisor:

Eng. Andrea Maddalena
Software Development
Medilink srl

Research trainee:

Dr. Andrea Tino
Università degli Studi di Catania
Ingegneria Informatica
August 2013

WHAT ABOUT THE OTHERS?

つづく

Load balancing is a known field in literature. Common practices exist.
A typical architecture is centralizing load balancing into a
single network component responsible for that task.
The Load Balancer typically knows everything about all
stations. Its task is to open connections on stations
upon requests. The decision is selecting a station to
open a connection to.
Very often, common architectures like Cysco and IBM,
organize servers in clusters and pools to handle group
configurations.
The balancer is not physically connected to stations.
Everything is done through TCP/IP and a list of IPs is kept.
In any case, the balancer has a complete knowledge.

Medilink srl

Sezione Ricerca e Sviluppo

Tutor:

Prof. Eng. Orazio Tomarchio
DIIEI
Università di Catania

Supervisor:

Eng. Andrea Maddalena
Software Development
Medilink srl

Research trainee:

Dr. Andrea Tino
Università degli Studi di Catania
Ingegneria Informatica
August 2013

WHAT ABOUT THE OTHERS?

終わり

Load balancing is a known field in literature. There are famous algorithms out there.

dummy/naive

station state

weighted r-r

Decision took basing on
each station state (e.g.
current load). Introduces
overhead on net. Good
balancing in all conditions.

Sezione Ricerca e Sviluppo

Rotating IP-addrs.
Stateless. Need to keep
track of dst station. Good
balancing on servers with
uniform capabilities.

Using hashes of IP-header
entries to calculate
destination station. Stateless.
Direct data-retrieval, bad
balancing.

First alive, static
assignment. Stateless
approach. Provides poor
balancing.

Medilink srl

round-robin

hash oriented

predictive

Like round-robin but halting
rotation on stations with
higher weights. Keep track
of dst station. Good
balancing on static conds.

Tutor:

Prof. Eng. Orazio Tomarchio
DIIEI
Università di Catania

Supervisor:

Eng. Andrea Maddalena
Software Development
Medilink srl

Station state is monitored
on few fixed periods.
Predictions on current state
are made basing on
asc/desc trends.
Research trainee:

Dr. Andrea Tino
Università degli Studi di Catania
Ingegneria Informatica
August 2013

KEY CONCEPT: NO CENTRALIZATION
The architecture must not include any centralized device or station. Think about P2P, but a little
bit more reliable and less chaotic.
Topology must ensure
the absence of
centralized schemes.
System deployed in
each station as a
ditributed architecture.
Networking like P2P but
data exchange and
stations are more reliable.
Packets are routed!

Medilink srl

Sezione Ricerca e Sviluppo

Tutor:

Prof. Eng. Orazio Tomarchio
DIIEI
Università di Catania

Supervisor:

Eng. Andrea Maddalena
Software Development
Medilink srl

Research trainee:

Dr. Andrea Tino
Università degli Studi di Catania
Ingegneria Informatica
August 2013

KEY CONCEPT: DIRECT ADDRESSING
When assigning a station to a packet, the system will not save data about this association
anywhere. At retrieval, given the packet-id, the station must be located immediately.
On packet forwarding: destination station is
computed but not memorized anywhere. The
packet will be stored at the corresponding
station with no further overhead.
On packet retrieval: destination station is
computed without relying on other info.
Destination station is reached and packet
correctly fetched.

Medilink srl

Sezione Ricerca e Sviluppo

Tutor:

Prof. Eng. Orazio Tomarchio
DIIEI
Università di Catania

Supervisor:

Eng. Andrea Maddalena
Software Development
Medilink srl

Research trainee:

Dr. Andrea Tino
Università degli Studi di Catania
Ingegneria Informatica
August 2013

KEY CONCEPT: STATELESS BALANCING
To balance data-load on stations, no info is required from stations. The packet is assigned with
a station without any further operation.
Data load balancing does not require
data from stations prior to station
assignment or in any further moment.
Stations keep (almost) the same
amount of packets all the time.
No overhead is generated on the
network and in time evaluations when
balancing data-loads.

Medilink srl

Sezione Ricerca e Sviluppo

Tutor:

Prof. Eng. Orazio Tomarchio
DIIEI
Università di Catania

Supervisor:

Eng. Andrea Maddalena
Software Development
Medilink srl

Research trainee:

Dr. Andrea Tino
Università degli Studi di Catania
Ingegneria Informatica
August 2013

SUMMARIZING KEY CONCEPTS
To balance data-load on stations, no info is required from stations. The packet is assigned with
a station without any further operation.

distributed system

Allows the architecture to benefit from
P2P properties: scalability, flexibility
and fault tolerance.

direct addressing

Fast resource management. Packets
can be located with constant
complexity algorithms.

stateless balancing

No need to introduce overhead in
communications. No need to wait for
or store state data from stations.

Medilink srl

Sezione Ricerca e Sviluppo

Tutor:

Prof. Eng. Orazio Tomarchio
DIIEI
Università di Catania

Supervisor:

Eng. Andrea Maddalena
Software Development
Medilink srl

Research trainee:

Dr. Andrea Tino
Università degli Studi di Catania
Ingegneria Informatica
August 2013

SHOWING EARLY RESULTS
Most simplistic simulations show very good load balancing on basic station pools.
10 station basic pool simulations. 1000 pkts fed to the pool. Difference shown.

Medilink srl

Sezione Ricerca e Sviluppo

Tutor:

Prof. Eng. Orazio Tomarchio
DIIEI
Università di Catania

Supervisor:

Eng. Andrea Maddalena
Software Development
Medilink srl

Research trainee:

Dr. Andrea Tino
Università degli Studi di Catania
Ingegneria Informatica
August 2013

NOT A FIELD OF DAISES
There are many problems to solve. In particular, accurate simulations are needed.
Good simulations should try to emulate real scenarions with hundreds of thousands
of packets => big loads sent to stations and many more stations => big station pools.
Current developed simulations are slow (Mathworks Matlab, Wolfram
Mathematica). Mathematical environments + functional languages cannot provide
good performance. Need for better simulations => parallelization is possible!
Numerical problems on the way. Need for numerical methods => Need for good
and fast libraries!
Parallelization would definitely fasten simulations. Need for coded simulations =>
C/C++: good performance. Parallelization libraries + good performance:
architecture dependent parallel libraries.

Medilink srl

Sezione Ricerca e Sviluppo

Tutor:

Prof. Eng. Orazio Tomarchio
DIIEI
Università di Catania

Supervisor:

Eng. Andrea Maddalena
Software Development
Medilink srl

Research trainee:

Dr. Andrea Tino
Università degli Studi di Catania
Ingegneria Informatica
August 2013

WHERE TO GO FROM HERE
Most simplistic simulations show very good load balancing on basic station pools.
Coding new simulations in C/C++. Very fast, but also difficult!
Integrating libraries for numerical methods.
Integrating libraries for cryptography and networking.
Integrating Intel Cilk or Intel TBB libraries for multi-core parallelization.
Need for high performance architectures: 4-core or 6-core.

Medilink srl

Sezione Ricerca e Sviluppo

Tutor:

Prof. Eng. Orazio Tomarchio
DIIEI
Università di Catania

Supervisor:

Eng. Andrea Maddalena
Software Development
Medilink srl

Research trainee:

Dr. Andrea Tino
Università degli Studi di Catania
Ingegneria Informatica
August 2013

MORE THINGS TO HANDLE
The balancing architecture discovered so far is good, but more questions arise.
What if packets have not the same size? => Balancing with a known packet size
(continuos?) distribution.
How to handle dynamic station attachment/detachment from the pool?

Naive simulations show quite interesting (undesired) behaviors. What the causes?
How to solve these problems?

Medilink srl

Sezione Ricerca e Sviluppo

Tutor:

Prof. Eng. Orazio Tomarchio
DIIEI
Università di Catania

Supervisor:

Eng. Andrea Maddalena
Software Development
Medilink srl

Research trainee:

Dr. Andrea Tino
Università degli Studi di Catania
Ingegneria Informatica

Contenu connexe

Tendances

Understanding rf circuits with multisim 10
Understanding rf circuits with multisim 10Understanding rf circuits with multisim 10
Understanding rf circuits with multisim 10
Dikshya Rath
 
VLSI IN NEURAL NETWORKS
VLSI IN NEURAL NETWORKSVLSI IN NEURAL NETWORKS
VLSI IN NEURAL NETWORKS
Mohan Moki
 

Tendances (10)

Constructing Operating Systems and E-Commerce
Constructing Operating Systems and E-CommerceConstructing Operating Systems and E-Commerce
Constructing Operating Systems and E-Commerce
 
Application of VLSI In Artificial Intelligence
Application of VLSI In Artificial IntelligenceApplication of VLSI In Artificial Intelligence
Application of VLSI In Artificial Intelligence
 
The Effect of Semantic Technology on Wireless Pipelined Complexity Theory
The Effect of Semantic Technology on Wireless Pipelined Complexity TheoryThe Effect of Semantic Technology on Wireless Pipelined Complexity Theory
The Effect of Semantic Technology on Wireless Pipelined Complexity Theory
 
Understanding rf circuits with multisim 10
Understanding rf circuits with multisim 10Understanding rf circuits with multisim 10
Understanding rf circuits with multisim 10
 
Energy efficient routing protocol for enhancing network lifetime and performa...
Energy efficient routing protocol for enhancing network lifetime and performa...Energy efficient routing protocol for enhancing network lifetime and performa...
Energy efficient routing protocol for enhancing network lifetime and performa...
 
Artificial Neural Network Based Load Forecasting
Artificial Neural Network Based Load ForecastingArtificial Neural Network Based Load Forecasting
Artificial Neural Network Based Load Forecasting
 
VLSI IN NEURAL NETWORKS
VLSI IN NEURAL NETWORKSVLSI IN NEURAL NETWORKS
VLSI IN NEURAL NETWORKS
 
A methodology for the study of fiber optic cables
A methodology for the study of fiber optic cablesA methodology for the study of fiber optic cables
A methodology for the study of fiber optic cables
 
International journal of engineering issues vol 2015 - no 2 - paper4
International journal of engineering issues   vol 2015 - no 2 - paper4International journal of engineering issues   vol 2015 - no 2 - paper4
International journal of engineering issues vol 2015 - no 2 - paper4
 
IRJET- Methodologies to the Strategy of Computer Networking Research laboratory
IRJET- Methodologies to the Strategy of Computer Networking Research laboratoryIRJET- Methodologies to the Strategy of Computer Networking Research laboratory
IRJET- Methodologies to the Strategy of Computer Networking Research laboratory
 

Similaire à Stateless load balancing - Research overview

Harvesting dispersed computational resources with Openstack
Harvesting dispersed computational resources with OpenstackHarvesting dispersed computational resources with Openstack
Harvesting dispersed computational resources with Openstack
Mirko Mariotti
 
IntelNTU PDF
IntelNTU PDFIntelNTU PDF
IntelNTU PDF
Josh Shih
 
Seed block algorithm
Seed block algorithmSeed block algorithm
Seed block algorithm
Dipak Badhe
 
Thesis topics in networking
Thesis topics in networkingThesis topics in networking
Thesis topics in networking
Rudi Arif candra
 
Power Consumption Prediction based on Statistical Learning Techniques - David...
Power Consumption Prediction based on Statistical Learning Techniques - David...Power Consumption Prediction based on Statistical Learning Techniques - David...
Power Consumption Prediction based on Statistical Learning Techniques - David...
Data Science Milan
 

Similaire à Stateless load balancing - Research overview (20)

Stateless load balancing - Early results
Stateless load balancing - Early resultsStateless load balancing - Early results
Stateless load balancing - Early results
 
Lan network with Redundancy
Lan network with RedundancyLan network with Redundancy
Lan network with Redundancy
 
Implementation of area optimized low power multiplication and accumulation
Implementation of area optimized low power multiplication and accumulationImplementation of area optimized low power multiplication and accumulation
Implementation of area optimized low power multiplication and accumulation
 
Harvesting dispersed computational resources with Openstack
Harvesting dispersed computational resources with OpenstackHarvesting dispersed computational resources with Openstack
Harvesting dispersed computational resources with Openstack
 
Rain Technology.pptx
Rain Technology.pptxRain Technology.pptx
Rain Technology.pptx
 
A DHT Chord-like mannered overlay-network to store and retrieve data
A DHT Chord-like mannered overlay-network to store and retrieve dataA DHT Chord-like mannered overlay-network to store and retrieve data
A DHT Chord-like mannered overlay-network to store and retrieve data
 
IntelNTU PDF
IntelNTU PDFIntelNTU PDF
IntelNTU PDF
 
Data Analysis In The Cloud
Data Analysis In The CloudData Analysis In The Cloud
Data Analysis In The Cloud
 
Automated LiveMigration of VMs
Automated LiveMigration of VMs Automated LiveMigration of VMs
Automated LiveMigration of VMs
 
Cluster Setup Manual Using Ubuntu and MPICH
Cluster Setup Manual Using Ubuntu and MPICHCluster Setup Manual Using Ubuntu and MPICH
Cluster Setup Manual Using Ubuntu and MPICH
 
Seed block algorithm
Seed block algorithmSeed block algorithm
Seed block algorithm
 
SHORTEST PATH FINDING VISUALIZER
SHORTEST PATH FINDING VISUALIZERSHORTEST PATH FINDING VISUALIZER
SHORTEST PATH FINDING VISUALIZER
 
Development of Computer Aided Learning Software for Use in Electric Circuit A...
Development of Computer Aided Learning Software for Use in Electric Circuit A...Development of Computer Aided Learning Software for Use in Electric Circuit A...
Development of Computer Aided Learning Software for Use in Electric Circuit A...
 
R.E.M.O.T.E. SACNAS Poster
R.E.M.O.T.E. SACNAS PosterR.E.M.O.T.E. SACNAS Poster
R.E.M.O.T.E. SACNAS Poster
 
Automatic Fault Detection System with IOT Based
Automatic Fault Detection System with IOT BasedAutomatic Fault Detection System with IOT Based
Automatic Fault Detection System with IOT Based
 
Thesis topics in networking
Thesis topics in networkingThesis topics in networking
Thesis topics in networking
 
Lecture_IIITD.pptx
Lecture_IIITD.pptxLecture_IIITD.pptx
Lecture_IIITD.pptx
 
IRJET - Rain Technology
IRJET - Rain TechnologyIRJET - Rain Technology
IRJET - Rain Technology
 
Power Consumption Prediction based on Statistical Learning Techniques - David...
Power Consumption Prediction based on Statistical Learning Techniques - David...Power Consumption Prediction based on Statistical Learning Techniques - David...
Power Consumption Prediction based on Statistical Learning Techniques - David...
 
Cv
CvCv
Cv
 

Plus de Andrea Tino

Plus de Andrea Tino (20)

Our Journey: from Waterfall to Agile to DevOps
Our Journey: from Waterfall to Agile to DevOpsOur Journey: from Waterfall to Agile to DevOps
Our Journey: from Waterfall to Agile to DevOps
 
Development & GDPR (v2)
Development & GDPR (v2)Development & GDPR (v2)
Development & GDPR (v2)
 
Development & GDPR
Development & GDPRDevelopment & GDPR
Development & GDPR
 
Cutting Edge on Development Methodologies in IT
Cutting Edge on Development Methodologies in ITCutting Edge on Development Methodologies in IT
Cutting Edge on Development Methodologies in IT
 
An introduction to DevOps
An introduction to DevOpsAn introduction to DevOps
An introduction to DevOps
 
Continuous Everything
Continuous EverythingContinuous Everything
Continuous Everything
 
Modern Trends in UI Decoupling Design
Modern Trends in UI Decoupling DesignModern Trends in UI Decoupling Design
Modern Trends in UI Decoupling Design
 
Javascript cheatsheet
Javascript cheatsheetJavascript cheatsheet
Javascript cheatsheet
 
Workshop on Cryptography - Frequency Analysis (basic)
Workshop on Cryptography - Frequency Analysis (basic)Workshop on Cryptography - Frequency Analysis (basic)
Workshop on Cryptography - Frequency Analysis (basic)
 
Master Thesis - A Distributed Algorithm for Stateless Load Balancing
Master Thesis - A Distributed Algorithm for Stateless Load BalancingMaster Thesis - A Distributed Algorithm for Stateless Load Balancing
Master Thesis - A Distributed Algorithm for Stateless Load Balancing
 
Modern web applications
Modern web applicationsModern web applications
Modern web applications
 
Visual Studio Tools for Cordova
Visual Studio Tools for CordovaVisual Studio Tools for Cordova
Visual Studio Tools for Cordova
 
Microsoft + Agile (light)
Microsoft + Agile (light)Microsoft + Agile (light)
Microsoft + Agile (light)
 
Microsoft + Agile
Microsoft + AgileMicrosoft + Agile
Microsoft + Agile
 
The World of Stylesheet Languages
The World of Stylesheet LanguagesThe World of Stylesheet Languages
The World of Stylesheet Languages
 
How to Develop Cross-Platform Apps
How to Develop Cross-Platform AppsHow to Develop Cross-Platform Apps
How to Develop Cross-Platform Apps
 
The Asynchronous Pattern (for beginners)
The Asynchronous Pattern (for beginners)The Asynchronous Pattern (for beginners)
The Asynchronous Pattern (for beginners)
 
Designing an effective hybrid apps automation framework
Designing an effective hybrid apps automation frameworkDesigning an effective hybrid apps automation framework
Designing an effective hybrid apps automation framework
 
7 tips for more effective morning SCRUM
7 tips for more effective morning SCRUM7 tips for more effective morning SCRUM
7 tips for more effective morning SCRUM
 
Powerful tools for building web solutions
Powerful tools for building web solutionsPowerful tools for building web solutions
Powerful tools for building web solutions
 

Dernier

EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
Earley Information Science
 

Dernier (20)

Evaluating the top large language models.pdf
Evaluating the top large language models.pdfEvaluating the top large language models.pdf
Evaluating the top large language models.pdf
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 

Stateless load balancing - Research overview

  • 1. August 2013 [research overview] University tutor: Università di Catania A distributed algorithm for Medilink srl STATELESS LOAD BALANCING Eng. A. Maddalena Abstract: Distributing data-packets on stations with Dipartimento di Ingegneria Elettrica, Elettronica e Informatica Prof. Eng. O. Tomarchio Company supervisor: Team Leader - R&D Manager Università di Catania scalable and optimal store and retrival functionalities. Ensuring load balance without collecting load-info from stations. Dr. A. Tino Keywords: Distributed-Systems, Algorithms, Big-Data, Cloud, Balancing Trainee: Facoltà di Ingegneria Informatica Specialistica Medilink srl Sezione Ricerca e Sviluppo
  • 2. August 2013 PROBLEM DESCRIPTION Many stations & data to store. Data can be fregmented into little units (packets) and sent to stations. When balancing load, some problems occur. problems solutions of modern algorithms Which station to choose for a packet? Basing on info collected from stations or by uniformly distributed random algorithms. How to send a packet to a station? IP address database, centralized solutions, distributed ip tables. How to retireve a packet? How to locate the station it is stored in? Need to memorize couple (packet-id, station-id) after choosing dst station. How to balance packets among different stations? Round-robin (stateless) approaches or basing on station loads. Medilink srl Sezione Ricerca e Sviluppo Tutor: Prof. Eng. Orazio Tomarchio DIIEI Università di Catania Supervisor: Eng. Andrea Maddalena Software Development Medilink srl Research trainee: Dr. Andrea Tino Università degli Studi di Catania Ingegneria Informatica
  • 3. August 2013 DEFINING TARGETS What we want to find is an algorithm for load balancing able to reach the following objectives. distributed system No centralization. If one station faults, the system will still be running. Performance decay is, however, allowed. The algorithm does not need any info regarding station current load to perform station selection. Sezione Ricerca e Sviluppo When retrieving a packet from a station, the process must be the most efficient possible. scalability stateless Medilink srl packet lookup Tutor: Prof. Eng. Orazio Tomarchio DIIEI Università di Catania The architecture must be scalable. More stations can be added (also at runtime). Detached stations must not cause the system to fault. Supervisor: Eng. Andrea Maddalena Software Development Medilink srl Research trainee: Dr. Andrea Tino Università degli Studi di Catania Ingegneria Informatica
  • 4. August 2013 WHAT ABOUT THE OTHERS? つづく Load balancing is a known field in literature. Common practices exist. A typical architecture is centralizing load balancing into a single network component responsible for that task. The Load Balancer typically knows everything about all stations. Its task is to open connections on stations upon requests. The decision is selecting a station to open a connection to. Very often, common architectures like Cysco and IBM, organize servers in clusters and pools to handle group configurations. The balancer is not physically connected to stations. Everything is done through TCP/IP and a list of IPs is kept. In any case, the balancer has a complete knowledge. Medilink srl Sezione Ricerca e Sviluppo Tutor: Prof. Eng. Orazio Tomarchio DIIEI Università di Catania Supervisor: Eng. Andrea Maddalena Software Development Medilink srl Research trainee: Dr. Andrea Tino Università degli Studi di Catania Ingegneria Informatica
  • 5. August 2013 WHAT ABOUT THE OTHERS? 終わり Load balancing is a known field in literature. There are famous algorithms out there. dummy/naive station state weighted r-r Decision took basing on each station state (e.g. current load). Introduces overhead on net. Good balancing in all conditions. Sezione Ricerca e Sviluppo Rotating IP-addrs. Stateless. Need to keep track of dst station. Good balancing on servers with uniform capabilities. Using hashes of IP-header entries to calculate destination station. Stateless. Direct data-retrieval, bad balancing. First alive, static assignment. Stateless approach. Provides poor balancing. Medilink srl round-robin hash oriented predictive Like round-robin but halting rotation on stations with higher weights. Keep track of dst station. Good balancing on static conds. Tutor: Prof. Eng. Orazio Tomarchio DIIEI Università di Catania Supervisor: Eng. Andrea Maddalena Software Development Medilink srl Station state is monitored on few fixed periods. Predictions on current state are made basing on asc/desc trends. Research trainee: Dr. Andrea Tino Università degli Studi di Catania Ingegneria Informatica
  • 6. August 2013 KEY CONCEPT: NO CENTRALIZATION The architecture must not include any centralized device or station. Think about P2P, but a little bit more reliable and less chaotic. Topology must ensure the absence of centralized schemes. System deployed in each station as a ditributed architecture. Networking like P2P but data exchange and stations are more reliable. Packets are routed! Medilink srl Sezione Ricerca e Sviluppo Tutor: Prof. Eng. Orazio Tomarchio DIIEI Università di Catania Supervisor: Eng. Andrea Maddalena Software Development Medilink srl Research trainee: Dr. Andrea Tino Università degli Studi di Catania Ingegneria Informatica
  • 7. August 2013 KEY CONCEPT: DIRECT ADDRESSING When assigning a station to a packet, the system will not save data about this association anywhere. At retrieval, given the packet-id, the station must be located immediately. On packet forwarding: destination station is computed but not memorized anywhere. The packet will be stored at the corresponding station with no further overhead. On packet retrieval: destination station is computed without relying on other info. Destination station is reached and packet correctly fetched. Medilink srl Sezione Ricerca e Sviluppo Tutor: Prof. Eng. Orazio Tomarchio DIIEI Università di Catania Supervisor: Eng. Andrea Maddalena Software Development Medilink srl Research trainee: Dr. Andrea Tino Università degli Studi di Catania Ingegneria Informatica
  • 8. August 2013 KEY CONCEPT: STATELESS BALANCING To balance data-load on stations, no info is required from stations. The packet is assigned with a station without any further operation. Data load balancing does not require data from stations prior to station assignment or in any further moment. Stations keep (almost) the same amount of packets all the time. No overhead is generated on the network and in time evaluations when balancing data-loads. Medilink srl Sezione Ricerca e Sviluppo Tutor: Prof. Eng. Orazio Tomarchio DIIEI Università di Catania Supervisor: Eng. Andrea Maddalena Software Development Medilink srl Research trainee: Dr. Andrea Tino Università degli Studi di Catania Ingegneria Informatica
  • 9. August 2013 SUMMARIZING KEY CONCEPTS To balance data-load on stations, no info is required from stations. The packet is assigned with a station without any further operation. distributed system Allows the architecture to benefit from P2P properties: scalability, flexibility and fault tolerance. direct addressing Fast resource management. Packets can be located with constant complexity algorithms. stateless balancing No need to introduce overhead in communications. No need to wait for or store state data from stations. Medilink srl Sezione Ricerca e Sviluppo Tutor: Prof. Eng. Orazio Tomarchio DIIEI Università di Catania Supervisor: Eng. Andrea Maddalena Software Development Medilink srl Research trainee: Dr. Andrea Tino Università degli Studi di Catania Ingegneria Informatica
  • 10. August 2013 SHOWING EARLY RESULTS Most simplistic simulations show very good load balancing on basic station pools. 10 station basic pool simulations. 1000 pkts fed to the pool. Difference shown. Medilink srl Sezione Ricerca e Sviluppo Tutor: Prof. Eng. Orazio Tomarchio DIIEI Università di Catania Supervisor: Eng. Andrea Maddalena Software Development Medilink srl Research trainee: Dr. Andrea Tino Università degli Studi di Catania Ingegneria Informatica
  • 11. August 2013 NOT A FIELD OF DAISES There are many problems to solve. In particular, accurate simulations are needed. Good simulations should try to emulate real scenarions with hundreds of thousands of packets => big loads sent to stations and many more stations => big station pools. Current developed simulations are slow (Mathworks Matlab, Wolfram Mathematica). Mathematical environments + functional languages cannot provide good performance. Need for better simulations => parallelization is possible! Numerical problems on the way. Need for numerical methods => Need for good and fast libraries! Parallelization would definitely fasten simulations. Need for coded simulations => C/C++: good performance. Parallelization libraries + good performance: architecture dependent parallel libraries. Medilink srl Sezione Ricerca e Sviluppo Tutor: Prof. Eng. Orazio Tomarchio DIIEI Università di Catania Supervisor: Eng. Andrea Maddalena Software Development Medilink srl Research trainee: Dr. Andrea Tino Università degli Studi di Catania Ingegneria Informatica
  • 12. August 2013 WHERE TO GO FROM HERE Most simplistic simulations show very good load balancing on basic station pools. Coding new simulations in C/C++. Very fast, but also difficult! Integrating libraries for numerical methods. Integrating libraries for cryptography and networking. Integrating Intel Cilk or Intel TBB libraries for multi-core parallelization. Need for high performance architectures: 4-core or 6-core. Medilink srl Sezione Ricerca e Sviluppo Tutor: Prof. Eng. Orazio Tomarchio DIIEI Università di Catania Supervisor: Eng. Andrea Maddalena Software Development Medilink srl Research trainee: Dr. Andrea Tino Università degli Studi di Catania Ingegneria Informatica
  • 13. August 2013 MORE THINGS TO HANDLE The balancing architecture discovered so far is good, but more questions arise. What if packets have not the same size? => Balancing with a known packet size (continuos?) distribution. How to handle dynamic station attachment/detachment from the pool? Naive simulations show quite interesting (undesired) behaviors. What the causes? How to solve these problems? Medilink srl Sezione Ricerca e Sviluppo Tutor: Prof. Eng. Orazio Tomarchio DIIEI Università di Catania Supervisor: Eng. Andrea Maddalena Software Development Medilink srl Research trainee: Dr. Andrea Tino Università degli Studi di Catania Ingegneria Informatica