SlideShare une entreprise Scribd logo
1  sur  45
Approximation Algorithms for Problems
on Networks and Streams of Data
Luca Foschini - Ph.D. Defense

Committee: Subhash Suri (chair), John Gilbert, Teofilo Gonzalez
Why Approximation Algorithms?
Why Approximation Algorithms?

Exact algorithms require many resources
Why Approximation Algorithms?
                                                 Hardware
Exact algorithms require many resources




                                          Apps




                                                 Data
Why Approximation Algorithms?
                                                 Hardware
Exact algorithms require many resources




                                          Apps



  Problems solvable
       exactly                                   Data
A Long History,
and Work in Progress




                       © Original Artist
A Long History,
and Work in Progress

✤   Early ‘70s - many combinatorial
    problems found to be NP-hard

✤   Recently - more restricting
    computation models proposed e.g.,
    data stream

                                        © Original Artist
A Long History,
and Work in Progress

✤   Early ‘70s - many combinatorial
    problems found to be NP-hard

✤   Recently - more restricting
    computation models proposed e.g.,
    data stream

                                                            © Original Artist




          Heuristics not sufficient, provable guarantees needed
Content of the Dissertation
Content of the Dissertation




"
Content of the Dissertation


    Networks



"

    Data Streams
Content of the Dissertation

                                      STACS12 +
                   Partitioning
                                     Algorithmica
    Networks
                                   SODA11 +
                   Shortest Paths
                                  Algorithmica
"
                   Time Series         ICDE10

    Data Streams

                   Burst Detection     NSDI11
Content of the Dissertation

                                      STACS12 +      ICISS08
                   Partitioning
                                     Algorithmica
    Networks                                         ICIP11
                                   SODA11 +
                   Shortest Paths                   ALENEX10
                                  Algorithmica
"
                                                     ESA11
                   Time Series         ICDE10

    Data Streams                                    WOOT11

                   Burst Detection     NSDI11        WAW09
Roadmap

                                         STACS12 +
                      Partitioning
                                        Algorithmica
       Networks
                                      SODA11 +
                      Shortest Paths
                                     Algorithmica
   "
                      Time Series         ICDE10

       Data Streams

                      Burst Detection     NSDI11
k-Balanced Partitioning Problem
 Given: an unweighted graph G on n
 vertices; an integer k

 Find: a partition of the vertices of G
 into k sets Vi s.t.

    ✤   |Vi |  dn/ke
    ✤   Cut size (number of edges
        connecting vertices in
        different Vi) is minimized


                  joint work with Andi Feldmann (ETHz)
             (appeared in STACS12, submitted to Algorithmica)
Motivation & Complexity

✤   Divide-and-conquer algorithms

✤   VLSI design

✤   Parallel computing



✤   NP-hard to approximate cut size within any finite value alpha
    [Andreev and Räcke 2006]
Related Work
General Graphs & Trees

✤   Algorithm is !-approximation if
    finds a cut at most ! times optimal

✤   NP-hard to approximate cut size
    within any finite ! [Andreev and
    Räcke 2006]
General Graphs & Trees

✤   Algorithm is !-approximation if
    finds a cut at most ! times optimal

✤   NP-hard to approximate cut size
    within any finite ! [Andreev and
    Räcke 2006]


        Trees - simple instances?
General Graphs & Trees

✤   Algorithm is !-approximation if
    finds a cut at most ! times optimal

✤   NP-hard to approximate cut size      n=31, k=8 cut size = 10
    within any finite ! [Andreev and
    Räcke 2006]


        Trees - simple instances?


                                         n=31, k=9 cut size = 8
Trees Are Hard
Trees Are Hard

✤   NP-hard to approx. cut size for !=nc
    (for any c<1) even if constant diameter
Trees Are Hard

✤   NP-hard to approx. cut size for !=nc
    (for any c<1) even if constant diameter

✤   APX-hard to approx. cut-size even if
    constant degree
Trees Are Hard

✤   NP-hard to approx. cut size for !=nc
    (for any c<1) even if constant diameter

✤   APX-hard to approx. cut-size even if
    constant degree




            Most NP-hard problems become trivial on trees
Relax!
Relax!

 Balance constraint relaxed:
   |Vi |  (1 + ")dn/ke
Relax!

 Balance constraint relaxed:
   |Vi |  (1 + ")dn/ke


                               Balance relaxed
Perfect balance
Optimal cut size
                                                   Cut size
                                                 approximated
                                  !
Relax!

 Balance constraint relaxed:          Bicriteria Approximation: cut
                                      size approximation ! measured
   |Vi |  (1 + ")dn/ke
                                      w.r.t perfectly balanced optimum


                               Balance relaxed
Perfect balance
Optimal cut size
                                                         Cut size
                                                       approximated
                                  !
0<eps<1 on general graphs



✤   eps>1 -- alpha in .... spreading metric techniques

✤   0<eps < 1 not much improvement. 1/epsˆ2 log ^1.5 n

✤   What about trees?
Summary of PTAS for Trees


✤   Compute optimal cut size for each coarse signature using DP

✤   Pack each coarse signatures into bins of size (1 + ")dn/ke

✤   Pick solution with smallest cut size among those fitting into k bins
                                4       1+3d 1 log( 1 )e
✤   Total time complexity O(n (k/")          "      "      )
Summary of PTAS for Trees


✤   Compute optimal cut size for each coarse signature using DP

✤   Pack each coarse signatures into bins of size (1 + ")dn/ke

✤   Pick solution with smallest cut size among those fitting into k bins
                                4       1+3d 1 log( 1 )e
✤   Total time complexity O(n (k/")          "      "      )


                               Show that ! =1
Extension to General Graphs


✤   Decomposition of graph into collection of trees [Räcke, Madry], cut
    size worsen by at most O(log n) for at least 1 tree

✤   Apply PTAS for trees to each instance

✤   Return partition for tree with minimum cut

✤   alpha = O(log n) improves
Tree Decomposition
Analysis of Embedding
Extensions & Open Problems
✤   Tree embedding techniques allow the !=1 tree PTAS to translate to a
    !=O(log n) approx for general weighted graphs

✤   Improves on previous best != O(log 1.5 n/"2 )
Extensions & Open Problems
✤   Tree embedding techniques allow the !=1 tree PTAS to translate to a
    !=O(log n) approx for general weighted graphs

✤   Improves on previous best != O(log 1.5 n/"2 )


                                             



                             




                                                            
                Graphs                              Trees
Roadmap

                                         STACS12 +
                      Partitioning
                                        Algorithmica
       Networks
                                      SODA11 +
                      Shortest Paths
                                     Algorithmica
   "
                      Time Series         ICDE10

       Data Streams

                      Burst Detection     NSDI11
Approximating Time Series



✤   Represent a time series with B
    linear segments

✤   New value arrives to the time
    series, need to reallocate
    segments
Approximating Time Series



✤   Represent a time series with B
    linear segments

✤   New value arrives to the time
    series, need to reallocate
    segments
Approximating Time Series



✤   Represent a time series with B
    linear segments

✤   New value arrives to the time
    series, need to reallocate
    segments
Old Algorithms, New Proofs
Old Algorithms, New Proofs

✤   We prove that a popular greedy merge
    scheme gives constant (bicriteria)
    approx. for many L_p norms. (ICDE10;
    joint with Gandhi, Suri)
Old Algorithms, New Proofs

✤   We prove that a popular greedy merge
    scheme gives constant (bicriteria)
    approx. for many L_p norms. (ICDE10;
    joint with Gandhi, Suri)

✤   Results implemented in Linux Kernel
    and used to detect traffic bursts in
    networks (NSDI11, joint with Uyeda,
    Suri, Varghese, Baker)
Old Algorithms, New Proofs

✤   We prove that a popular greedy merge
    scheme gives constant (bicriteria)
    approx. for many L_p norms. (ICDE10;
    joint with Gandhi, Suri)

✤   Results implemented in Linux Kernel
    and used to detect traffic bursts in
    networks (NSDI11, joint with Uyeda,
    Suri, Varghese, Baker)


           Next steps: Extend results in ICDE10 to other norms
Conclusion


✤   Approximation is necessary to reduce resource utilization

✤   Presented approximation algorithms for problems from different
    domains that we cannot afford to solve exactly

✤   Presented basic building blocks that can be used across the board to
    design approximation algorithms

Contenu connexe

En vedette

Robert h l ig2 task 1 work sheet v3
Robert h l ig2 task 1 work sheet v3Robert h l ig2 task 1 work sheet v3
Robert h l ig2 task 1 work sheet v3halo4robo
 
Report from QS europe 2013
Report from QS europe 2013 Report from QS europe 2013
Report from QS europe 2013 Luca Foschini
 
Dancing baby meme
Dancing baby memeDancing baby meme
Dancing baby memepinal003
 
2014.12.6新興華語文翻譯專科學校103漢字文化節書法創藝比賽評審會議
2014.12.6新興華語文翻譯專科學校103漢字文化節書法創藝比賽評審會議2014.12.6新興華語文翻譯專科學校103漢字文化節書法創藝比賽評審會議
2014.12.6新興華語文翻譯專科學校103漢字文化節書法創藝比賽評審會議Heng Sin
 
Unit 73 ig3 assignment programming sound assets to a game 2013_y2
Unit 73 ig3 assignment programming sound assets to a game 2013_y2Unit 73 ig3 assignment programming sound assets to a game 2013_y2
Unit 73 ig3 assignment programming sound assets to a game 2013_y2halo4robo
 
Flash (the ultimatum)
Flash (the ultimatum)Flash (the ultimatum)
Flash (the ultimatum)halo4robo
 
Artistic styles
Artistic stylesArtistic styles
Artistic styleshalo4robo
 
Edl 669 action plan
Edl 669 action planEdl 669 action plan
Edl 669 action planmpickard9
 
Future of mobility - Featuring the NS-Business Card
Future of mobility - Featuring the NS-Business CardFuture of mobility - Featuring the NS-Business Card
Future of mobility - Featuring the NS-Business CardHUMANIS Consulting Group
 
Unit 73 ig1 assignment computer game audio cut sequence production 2013_y2
Unit 73 ig1 assignment computer game audio cut sequence production 2013_y2Unit 73 ig1 assignment computer game audio cut sequence production 2013_y2
Unit 73 ig1 assignment computer game audio cut sequence production 2013_y2halo4robo
 
Geometry theory
Geometry theoryGeometry theory
Geometry theoryhalo4robo
 
Light bot progression
Light bot progressionLight bot progression
Light bot progressionhalo4robo
 

En vedette (20)

Robert h l ig2 task 1 work sheet v3
Robert h l ig2 task 1 work sheet v3Robert h l ig2 task 1 work sheet v3
Robert h l ig2 task 1 work sheet v3
 
Cuidadosdelagua
CuidadosdelaguaCuidadosdelagua
Cuidadosdelagua
 
Report from QS europe 2013
Report from QS europe 2013 Report from QS europe 2013
Report from QS europe 2013
 
Presentasi tugas
Presentasi tugasPresentasi tugas
Presentasi tugas
 
Dancing baby meme
Dancing baby memeDancing baby meme
Dancing baby meme
 
2014.12.6新興華語文翻譯專科學校103漢字文化節書法創藝比賽評審會議
2014.12.6新興華語文翻譯專科學校103漢字文化節書法創藝比賽評審會議2014.12.6新興華語文翻譯專科學校103漢字文化節書法創藝比賽評審會議
2014.12.6新興華語文翻譯專科學校103漢字文化節書法創藝比賽評審會議
 
Business presentation
Business presentationBusiness presentation
Business presentation
 
Unit 73 ig3 assignment programming sound assets to a game 2013_y2
Unit 73 ig3 assignment programming sound assets to a game 2013_y2Unit 73 ig3 assignment programming sound assets to a game 2013_y2
Unit 73 ig3 assignment programming sound assets to a game 2013_y2
 
Flash (the ultimatum)
Flash (the ultimatum)Flash (the ultimatum)
Flash (the ultimatum)
 
Grande bouquet
Grande bouquetGrande bouquet
Grande bouquet
 
Artistic styles
Artistic stylesArtistic styles
Artistic styles
 
TED slideshow
TED slideshowTED slideshow
TED slideshow
 
Edl 669 action plan
Edl 669 action planEdl 669 action plan
Edl 669 action plan
 
Future of mobility - Featuring the NS-Business Card
Future of mobility - Featuring the NS-Business CardFuture of mobility - Featuring the NS-Business Card
Future of mobility - Featuring the NS-Business Card
 
Unit 73 ig1 assignment computer game audio cut sequence production 2013_y2
Unit 73 ig1 assignment computer game audio cut sequence production 2013_y2Unit 73 ig1 assignment computer game audio cut sequence production 2013_y2
Unit 73 ig1 assignment computer game audio cut sequence production 2013_y2
 
Maturski ispit iz mat opsti
Maturski ispit iz mat opstiMaturski ispit iz mat opsti
Maturski ispit iz mat opsti
 
Geometry theory
Geometry theoryGeometry theory
Geometry theory
 
Survey
SurveySurvey
Survey
 
oral candida
oral candidaoral candida
oral candida
 
Light bot progression
Light bot progressionLight bot progression
Light bot progression
 

Similaire à Defense

Mesh Generation and Topological Data Analysis
Mesh Generation and Topological Data AnalysisMesh Generation and Topological Data Analysis
Mesh Generation and Topological Data AnalysisDon Sheehy
 
SVD and the Netflix Dataset
SVD and the Netflix DatasetSVD and the Netflix Dataset
SVD and the Netflix DatasetBen Mabey
 
design-compiler.pdf
design-compiler.pdfdesign-compiler.pdf
design-compiler.pdfFrangoCamila
 
SpectrumEstimation.ppt
SpectrumEstimation.pptSpectrumEstimation.ppt
SpectrumEstimation.pptMaryanne678733
 
Parallelising Dynamic Programming
Parallelising Dynamic ProgrammingParallelising Dynamic Programming
Parallelising Dynamic ProgrammingRaphael Reitzig
 
High-Performance Graph Analysis and Modeling
High-Performance Graph Analysis and ModelingHigh-Performance Graph Analysis and Modeling
High-Performance Graph Analysis and ModelingNesreen K. Ahmed
 
Cycle’s topological optimizations and the iterative decoding problem on gener...
Cycle’s topological optimizations and the iterative decoding problem on gener...Cycle’s topological optimizations and the iterative decoding problem on gener...
Cycle’s topological optimizations and the iterative decoding problem on gener...Usatyuk Vasiliy
 
Breaking the Nonsmooth Barrier: A Scalable Parallel Method for Composite Opti...
Breaking the Nonsmooth Barrier: A Scalable Parallel Method for Composite Opti...Breaking the Nonsmooth Barrier: A Scalable Parallel Method for Composite Opti...
Breaking the Nonsmooth Barrier: A Scalable Parallel Method for Composite Opti...Fabian Pedregosa
 
Data Compression Project Presentation
Data Compression Project PresentationData Compression Project Presentation
Data Compression Project PresentationMyuran Kanga, MS, MBA
 
Relaxation methods for the matrix exponential on large networks
Relaxation methods for the matrix exponential on large networksRelaxation methods for the matrix exponential on large networks
Relaxation methods for the matrix exponential on large networksDavid Gleich
 
MapReduce: A useful parallel tool that still has room for improvement
MapReduce: A useful parallel tool that still has room for improvementMapReduce: A useful parallel tool that still has room for improvement
MapReduce: A useful parallel tool that still has room for improvementKyong-Ha Lee
 
P9 addressing signal_integrity_ in_ew_2015_final
P9 addressing signal_integrity_ in_ew_2015_finalP9 addressing signal_integrity_ in_ew_2015_final
P9 addressing signal_integrity_ in_ew_2015_finalAamir Habib
 
H2O.ai's Distributed Deep Learning by Arno Candel 04/03/14
H2O.ai's Distributed Deep Learning by Arno Candel 04/03/14H2O.ai's Distributed Deep Learning by Arno Candel 04/03/14
H2O.ai's Distributed Deep Learning by Arno Candel 04/03/14Sri Ambati
 
Session 1 - Silva, Singh, Richardson at MLconf NYC
Session 1 - Silva, Singh, Richardson at MLconf NYCSession 1 - Silva, Singh, Richardson at MLconf NYC
Session 1 - Silva, Singh, Richardson at MLconf NYCMLconf
 
Improving Hardware Efficiency for DNN Applications
Improving Hardware Efficiency for DNN ApplicationsImproving Hardware Efficiency for DNN Applications
Improving Hardware Efficiency for DNN ApplicationsChester Chen
 
Intro to Machine Learning for GPUs
Intro to Machine Learning for GPUsIntro to Machine Learning for GPUs
Intro to Machine Learning for GPUsSri Ambati
 

Similaire à Defense (20)

Mesh Generation and Topological Data Analysis
Mesh Generation and Topological Data AnalysisMesh Generation and Topological Data Analysis
Mesh Generation and Topological Data Analysis
 
SVD and the Netflix Dataset
SVD and the Netflix DatasetSVD and the Netflix Dataset
SVD and the Netflix Dataset
 
design-compiler.pdf
design-compiler.pdfdesign-compiler.pdf
design-compiler.pdf
 
Realtime Analytics
Realtime AnalyticsRealtime Analytics
Realtime Analytics
 
SpectrumEstimation.ppt
SpectrumEstimation.pptSpectrumEstimation.ppt
SpectrumEstimation.ppt
 
Parallelising Dynamic Programming
Parallelising Dynamic ProgrammingParallelising Dynamic Programming
Parallelising Dynamic Programming
 
High-Performance Graph Analysis and Modeling
High-Performance Graph Analysis and ModelingHigh-Performance Graph Analysis and Modeling
High-Performance Graph Analysis and Modeling
 
Cycle’s topological optimizations and the iterative decoding problem on gener...
Cycle’s topological optimizations and the iterative decoding problem on gener...Cycle’s topological optimizations and the iterative decoding problem on gener...
Cycle’s topological optimizations and the iterative decoding problem on gener...
 
Breaking the Nonsmooth Barrier: A Scalable Parallel Method for Composite Opti...
Breaking the Nonsmooth Barrier: A Scalable Parallel Method for Composite Opti...Breaking the Nonsmooth Barrier: A Scalable Parallel Method for Composite Opti...
Breaking the Nonsmooth Barrier: A Scalable Parallel Method for Composite Opti...
 
User biglm
User biglmUser biglm
User biglm
 
Data Compression Project Presentation
Data Compression Project PresentationData Compression Project Presentation
Data Compression Project Presentation
 
Relaxation methods for the matrix exponential on large networks
Relaxation methods for the matrix exponential on large networksRelaxation methods for the matrix exponential on large networks
Relaxation methods for the matrix exponential on large networks
 
MapReduce: A useful parallel tool that still has room for improvement
MapReduce: A useful parallel tool that still has room for improvementMapReduce: A useful parallel tool that still has room for improvement
MapReduce: A useful parallel tool that still has room for improvement
 
4-SequenceTimeSeries02.pdf
4-SequenceTimeSeries02.pdf4-SequenceTimeSeries02.pdf
4-SequenceTimeSeries02.pdf
 
P9 addressing signal_integrity_ in_ew_2015_final
P9 addressing signal_integrity_ in_ew_2015_finalP9 addressing signal_integrity_ in_ew_2015_final
P9 addressing signal_integrity_ in_ew_2015_final
 
9553477.ppt
9553477.ppt9553477.ppt
9553477.ppt
 
H2O.ai's Distributed Deep Learning by Arno Candel 04/03/14
H2O.ai's Distributed Deep Learning by Arno Candel 04/03/14H2O.ai's Distributed Deep Learning by Arno Candel 04/03/14
H2O.ai's Distributed Deep Learning by Arno Candel 04/03/14
 
Session 1 - Silva, Singh, Richardson at MLconf NYC
Session 1 - Silva, Singh, Richardson at MLconf NYCSession 1 - Silva, Singh, Richardson at MLconf NYC
Session 1 - Silva, Singh, Richardson at MLconf NYC
 
Improving Hardware Efficiency for DNN Applications
Improving Hardware Efficiency for DNN ApplicationsImproving Hardware Efficiency for DNN Applications
Improving Hardware Efficiency for DNN Applications
 
Intro to Machine Learning for GPUs
Intro to Machine Learning for GPUsIntro to Machine Learning for GPUs
Intro to Machine Learning for GPUs
 

Dernier

IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilV3cube
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 

Dernier (20)

IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 

Defense

  • 1. Approximation Algorithms for Problems on Networks and Streams of Data Luca Foschini - Ph.D. Defense Committee: Subhash Suri (chair), John Gilbert, Teofilo Gonzalez
  • 3. Why Approximation Algorithms? Exact algorithms require many resources
  • 4. Why Approximation Algorithms? Hardware Exact algorithms require many resources Apps Data
  • 5. Why Approximation Algorithms? Hardware Exact algorithms require many resources Apps Problems solvable exactly Data
  • 6. A Long History, and Work in Progress © Original Artist
  • 7. A Long History, and Work in Progress ✤ Early ‘70s - many combinatorial problems found to be NP-hard ✤ Recently - more restricting computation models proposed e.g., data stream © Original Artist
  • 8. A Long History, and Work in Progress ✤ Early ‘70s - many combinatorial problems found to be NP-hard ✤ Recently - more restricting computation models proposed e.g., data stream © Original Artist Heuristics not sufficient, provable guarantees needed
  • 9. Content of the Dissertation
  • 10. Content of the Dissertation "
  • 11. Content of the Dissertation Networks " Data Streams
  • 12. Content of the Dissertation STACS12 + Partitioning Algorithmica Networks SODA11 + Shortest Paths Algorithmica " Time Series ICDE10 Data Streams Burst Detection NSDI11
  • 13. Content of the Dissertation STACS12 + ICISS08 Partitioning Algorithmica Networks ICIP11 SODA11 + Shortest Paths ALENEX10 Algorithmica " ESA11 Time Series ICDE10 Data Streams WOOT11 Burst Detection NSDI11 WAW09
  • 14. Roadmap STACS12 + Partitioning Algorithmica Networks SODA11 + Shortest Paths Algorithmica " Time Series ICDE10 Data Streams Burst Detection NSDI11
  • 15. k-Balanced Partitioning Problem Given: an unweighted graph G on n vertices; an integer k Find: a partition of the vertices of G into k sets Vi s.t. ✤ |Vi |  dn/ke ✤ Cut size (number of edges connecting vertices in different Vi) is minimized joint work with Andi Feldmann (ETHz) (appeared in STACS12, submitted to Algorithmica)
  • 16. Motivation & Complexity ✤ Divide-and-conquer algorithms ✤ VLSI design ✤ Parallel computing ✤ NP-hard to approximate cut size within any finite value alpha [Andreev and Räcke 2006]
  • 18. General Graphs & Trees ✤ Algorithm is !-approximation if finds a cut at most ! times optimal ✤ NP-hard to approximate cut size within any finite ! [Andreev and Räcke 2006]
  • 19. General Graphs & Trees ✤ Algorithm is !-approximation if finds a cut at most ! times optimal ✤ NP-hard to approximate cut size within any finite ! [Andreev and Räcke 2006] Trees - simple instances?
  • 20. General Graphs & Trees ✤ Algorithm is !-approximation if finds a cut at most ! times optimal ✤ NP-hard to approximate cut size n=31, k=8 cut size = 10 within any finite ! [Andreev and Räcke 2006] Trees - simple instances? n=31, k=9 cut size = 8
  • 22. Trees Are Hard ✤ NP-hard to approx. cut size for !=nc (for any c<1) even if constant diameter
  • 23. Trees Are Hard ✤ NP-hard to approx. cut size for !=nc (for any c<1) even if constant diameter ✤ APX-hard to approx. cut-size even if constant degree
  • 24. Trees Are Hard ✤ NP-hard to approx. cut size for !=nc (for any c<1) even if constant diameter ✤ APX-hard to approx. cut-size even if constant degree Most NP-hard problems become trivial on trees
  • 26. Relax! Balance constraint relaxed: |Vi |  (1 + ")dn/ke
  • 27. Relax! Balance constraint relaxed: |Vi |  (1 + ")dn/ke Balance relaxed Perfect balance Optimal cut size Cut size approximated !
  • 28. Relax! Balance constraint relaxed: Bicriteria Approximation: cut size approximation ! measured |Vi |  (1 + ")dn/ke w.r.t perfectly balanced optimum Balance relaxed Perfect balance Optimal cut size Cut size approximated !
  • 29. 0<eps<1 on general graphs ✤ eps>1 -- alpha in .... spreading metric techniques ✤ 0<eps < 1 not much improvement. 1/epsˆ2 log ^1.5 n ✤ What about trees?
  • 30. Summary of PTAS for Trees ✤ Compute optimal cut size for each coarse signature using DP ✤ Pack each coarse signatures into bins of size (1 + ")dn/ke ✤ Pick solution with smallest cut size among those fitting into k bins 4 1+3d 1 log( 1 )e ✤ Total time complexity O(n (k/") " " )
  • 31. Summary of PTAS for Trees ✤ Compute optimal cut size for each coarse signature using DP ✤ Pack each coarse signatures into bins of size (1 + ")dn/ke ✤ Pick solution with smallest cut size among those fitting into k bins 4 1+3d 1 log( 1 )e ✤ Total time complexity O(n (k/") " " ) Show that ! =1
  • 32. Extension to General Graphs ✤ Decomposition of graph into collection of trees [Räcke, Madry], cut size worsen by at most O(log n) for at least 1 tree ✤ Apply PTAS for trees to each instance ✤ Return partition for tree with minimum cut ✤ alpha = O(log n) improves
  • 35. Extensions & Open Problems ✤ Tree embedding techniques allow the !=1 tree PTAS to translate to a !=O(log n) approx for general weighted graphs ✤ Improves on previous best != O(log 1.5 n/"2 )
  • 36. Extensions & Open Problems ✤ Tree embedding techniques allow the !=1 tree PTAS to translate to a !=O(log n) approx for general weighted graphs ✤ Improves on previous best != O(log 1.5 n/"2 )    Graphs Trees
  • 37. Roadmap STACS12 + Partitioning Algorithmica Networks SODA11 + Shortest Paths Algorithmica " Time Series ICDE10 Data Streams Burst Detection NSDI11
  • 38. Approximating Time Series ✤ Represent a time series with B linear segments ✤ New value arrives to the time series, need to reallocate segments
  • 39. Approximating Time Series ✤ Represent a time series with B linear segments ✤ New value arrives to the time series, need to reallocate segments
  • 40. Approximating Time Series ✤ Represent a time series with B linear segments ✤ New value arrives to the time series, need to reallocate segments
  • 42. Old Algorithms, New Proofs ✤ We prove that a popular greedy merge scheme gives constant (bicriteria) approx. for many L_p norms. (ICDE10; joint with Gandhi, Suri)
  • 43. Old Algorithms, New Proofs ✤ We prove that a popular greedy merge scheme gives constant (bicriteria) approx. for many L_p norms. (ICDE10; joint with Gandhi, Suri) ✤ Results implemented in Linux Kernel and used to detect traffic bursts in networks (NSDI11, joint with Uyeda, Suri, Varghese, Baker)
  • 44. Old Algorithms, New Proofs ✤ We prove that a popular greedy merge scheme gives constant (bicriteria) approx. for many L_p norms. (ICDE10; joint with Gandhi, Suri) ✤ Results implemented in Linux Kernel and used to detect traffic bursts in networks (NSDI11, joint with Uyeda, Suri, Varghese, Baker) Next steps: Extend results in ICDE10 to other norms
  • 45. Conclusion ✤ Approximation is necessary to reduce resource utilization ✤ Presented approximation algorithms for problems from different domains that we cannot afford to solve exactly ✤ Presented basic building blocks that can be used across the board to design approximation algorithms

Notes de l'éditeur

  1. \n
  2. \n
  3. \n
  4. \n
  5. \n
  6. \n
  7. \n
  8. \n
  9. \n
  10. \n
  11. \n
  12. \n
  13. \n
  14. \n
  15. \n
  16. \n
  17. \n
  18. \n
  19. \n
  20. \n
  21. \n
  22. \n
  23. \n
  24. \n
  25. \n
  26. \n
  27. \n
  28. \n
  29. \n
  30. \n
  31. \n
  32. \n
  33. \n
  34. \n
  35. \n
  36. \n
  37. \n
  38. \n
  39. \n
  40. \n
  41. \n
  42. \n
  43. \n
  44. \n
  45. \n
  46. \n
  47. \n
  48. \n
  49. \n
  50. \n
  51. \n
  52. \n
  53. \n
  54. \n
  55. \n
  56. \n
  57. \n
  58. \n
  59. \n
  60. \n
  61. \n
  62. \n
  63. \n
  64. \n
  65. \n