SlideShare une entreprise Scribd logo
1  sur  18
*



Submitted to :            Submitted by:
Ms Simarpreet             Kiran, A06
Lec : Computer            Kh. Loyanganba Meitei, 42
System and Architecture   Bca Hons, D1111
*
*Types of shared memory

  *Physically shared memory
  *Virtual (or distributed) shared memory

*Scalability issues

  *Organisation of memory
  *Design of interconnection network
  *Cache coherence protocols                2
Multi-processor:
           Structure of Shared Memory MIMD
Architectures
Design space of shared memory computers

                                                  Shared memory
                                                    computers




                                                    Interconnection                    Cache coherency
          Single address space                          scheme
             memory access



                                                                                  Hardware
                       Virtual shared       Shared path                                          Software based
Physical shared                                                                    based
                          memory                                  Switching
 memory UMA                                                        network
                                   Singled bus
                                      based

                                   Multiple bus                       Crossbar     Multistage network
                   NUMA              based

                                                Bus
                                           multiplication
                  CC-NUMA
                                           Grid of buses                  Omega        Banyan           Benes


                                            Hierarchical
                   COMA
                                              system
*
*
*Also called distributed shared memory architecture
*The local memories of multi-computer are
 components of global address space:
  * any processor can access the local memory of any other
   processor
*Three approaches:
  * Non-uniform memory access (NUMA) machines
  * Cache-only memory access (COMA) machines
  * Cache-coherent non-uniform memory access
   (CC-NUMA) machines
*
*
*Logically shared memory is physically
 distributed
*Different access of local and remote memory
 blocks. Remote access takes much more time
 – latency
*Sensitive to data and program distribution
*Close to distributed memory systems, yet the
 programming paradigm is different
*Example: Cray T3D
*
*
*
*Each block of the shared memory works as
 local cache of a processor
*Continuous, dynamic migration of data
*Hit-rate decreases the traffic on the
 Interconnection Network
*Solutions for data-consistency increase the
 same traffic (see cache coherency problem
 later)
*Examples: KSR-1, DDM
*
*
*A combination of NUMA and COMA
*Initially static data distribution, then
 dynamic data migration
*Cache coherency problem is to be solved
*COMA and CC-NUMA are used in newer
 generation of parallel computers
*Examples: Convex SPP1000, Stanford DASH,
 MIT Alewife
*


+ No need to partition data or program, uniprocessor
 programming techniques can be adapted
+ Communication between processor is efficient
+ Minor modifications of tool chain and compiler


- Synchronized access to share data in memory needed.
  Synchronizing constructs (semaphores, conditional critical
  regions, monitors) result in nondeterministic behavior which
  can lead programming errors that are difficult to discover
- Lack of scalability due to (memory) contention problem
*

*Memory Access Time
  * can be a bottleneck even in a single-processor system
*Contention for Memory
  * two or more processors want to access a location in the same
   block at the same time (hot spot problem).
*Contention for Communication
  * processors should share and use exclusively elements of the
   Interconnection Network


*Result: long latency-time, idle processors,
 nonscalable system
*
*Problems of scalable computers
  1.   Tolerate and hide latency of remote loads
  2.   Tolerate and hide idling due to synchronization
*Solutions
  1.   Cache memory
       *   problem of cache coherence
  2.   Prefetching
  3.   Threads and fast context switching
*
Than
ks...

Contenu connexe

Tendances

Parallel programming model
Parallel programming modelParallel programming model
Parallel programming modeleasy notes
 
Cache coherence problem and its solutions
Cache coherence problem and its solutionsCache coherence problem and its solutions
Cache coherence problem and its solutionsMajid Saleem
 
Multi Processors And Multi Computers
 Multi Processors And Multi Computers Multi Processors And Multi Computers
Multi Processors And Multi ComputersNemwos
 
Computer architecture multi processor
Computer architecture multi processorComputer architecture multi processor
Computer architecture multi processorMazin Alwaaly
 
Non-Uniform Memory Access ( NUMA)
Non-Uniform Memory Access ( NUMA)Non-Uniform Memory Access ( NUMA)
Non-Uniform Memory Access ( NUMA)Nakul Manchanda
 
Distributed & parallel system
Distributed & parallel systemDistributed & parallel system
Distributed & parallel systemManish Singh
 
Introduction to Parallel Computing
Introduction to Parallel ComputingIntroduction to Parallel Computing
Introduction to Parallel ComputingRoshan Karunarathna
 
Multivector and multiprocessor
Multivector and multiprocessorMultivector and multiprocessor
Multivector and multiprocessorKishan Panara
 
System models in distributed system
System models in distributed systemSystem models in distributed system
System models in distributed systemishapadhy
 
Multi core-architecture
Multi core-architectureMulti core-architecture
Multi core-architecturePiyush Mittal
 
Applications of paralleL processing
Applications of paralleL processingApplications of paralleL processing
Applications of paralleL processingPage Maker
 

Tendances (20)

Parallel programming model
Parallel programming modelParallel programming model
Parallel programming model
 
Parallel processing
Parallel processingParallel processing
Parallel processing
 
Course outline of parallel and distributed computing
Course outline of parallel and distributed computingCourse outline of parallel and distributed computing
Course outline of parallel and distributed computing
 
Memory management
Memory managementMemory management
Memory management
 
Parallel programming model
Parallel programming modelParallel programming model
Parallel programming model
 
Cache coherence problem and its solutions
Cache coherence problem and its solutionsCache coherence problem and its solutions
Cache coherence problem and its solutions
 
Multiprocessor system
Multiprocessor system Multiprocessor system
Multiprocessor system
 
Multi Processors And Multi Computers
 Multi Processors And Multi Computers Multi Processors And Multi Computers
Multi Processors And Multi Computers
 
Computer architecture multi processor
Computer architecture multi processorComputer architecture multi processor
Computer architecture multi processor
 
Non-Uniform Memory Access ( NUMA)
Non-Uniform Memory Access ( NUMA)Non-Uniform Memory Access ( NUMA)
Non-Uniform Memory Access ( NUMA)
 
Chapter 1
Chapter 1Chapter 1
Chapter 1
 
Bus interconnection
Bus interconnectionBus interconnection
Bus interconnection
 
Distributed & parallel system
Distributed & parallel systemDistributed & parallel system
Distributed & parallel system
 
Cache memory
Cache memoryCache memory
Cache memory
 
Multiprocessor
MultiprocessorMultiprocessor
Multiprocessor
 
Introduction to Parallel Computing
Introduction to Parallel ComputingIntroduction to Parallel Computing
Introduction to Parallel Computing
 
Multivector and multiprocessor
Multivector and multiprocessorMultivector and multiprocessor
Multivector and multiprocessor
 
System models in distributed system
System models in distributed systemSystem models in distributed system
System models in distributed system
 
Multi core-architecture
Multi core-architectureMulti core-architecture
Multi core-architecture
 
Applications of paralleL processing
Applications of paralleL processingApplications of paralleL processing
Applications of paralleL processing
 

En vedette

Lecture 2
Lecture 2Lecture 2
Lecture 2Mr SMAK
 
The Flynn Effect
The Flynn EffectThe Flynn Effect
The Flynn EffectBhatt83
 
Lecture 6
Lecture  6Lecture  6
Lecture 6Mr SMAK
 
Computer architecture
Computer architecture Computer architecture
Computer architecture Ashish Kumar
 
Parallel computing
Parallel computingParallel computing
Parallel computingVinay Gupta
 
Flynns classification
Flynns classificationFlynns classification
Flynns classificationYasir Khan
 
Feng’s classification
Feng’s classificationFeng’s classification
Feng’s classificationNarayan Kandel
 

En vedette (8)

Lecture 2
Lecture 2Lecture 2
Lecture 2
 
Aca 2
Aca 2Aca 2
Aca 2
 
The Flynn Effect
The Flynn EffectThe Flynn Effect
The Flynn Effect
 
Lecture 6
Lecture  6Lecture  6
Lecture 6
 
Computer architecture
Computer architecture Computer architecture
Computer architecture
 
Parallel computing
Parallel computingParallel computing
Parallel computing
 
Flynns classification
Flynns classificationFlynns classification
Flynns classification
 
Feng’s classification
Feng’s classificationFeng’s classification
Feng’s classification
 

Similaire à Mimd

Ceg4131 models
Ceg4131 modelsCeg4131 models
Ceg4131 modelsanandme07
 
Communication model of parallel platforms
Communication model of parallel platformsCommunication model of parallel platforms
Communication model of parallel platformsSyed Zaid Irshad
 
Lecture 6
Lecture  6Lecture  6
Lecture 6Mr SMAK
 
Lecture 6
Lecture  6Lecture  6
Lecture 6Mr SMAK
 
Lecture 6.1
Lecture  6.1Lecture  6.1
Lecture 6.1Mr SMAK
 
Lecture 6.1
Lecture  6.1Lecture  6.1
Lecture 6.1Mr SMAK
 
Lecture 6.1
Lecture  6.1Lecture  6.1
Lecture 6.1Mr SMAK
 
network ram parallel computing
network ram parallel computingnetwork ram parallel computing
network ram parallel computingNiranjana Ambadi
 
Distributed Shared Memory
Distributed Shared MemoryDistributed Shared Memory
Distributed Shared MemoryPrakhar Rastogi
 
Federal VMUG - March - Virtual machine sizing considerations in a numa enviro...
Federal VMUG - March - Virtual machine sizing considerations in a numa enviro...Federal VMUG - March - Virtual machine sizing considerations in a numa enviro...
Federal VMUG - March - Virtual machine sizing considerations in a numa enviro...langonej
 
Emulex OneConnect Universal CNA (Short Overview)
Emulex OneConnect Universal CNA (Short Overview)Emulex OneConnect Universal CNA (Short Overview)
Emulex OneConnect Universal CNA (Short Overview)Emulex Corporation
 
Overview of Distributed Systems
Overview of Distributed SystemsOverview of Distributed Systems
Overview of Distributed Systemsvampugani
 
Roeder posterismb2010
Roeder posterismb2010Roeder posterismb2010
Roeder posterismb2010Chris Roeder
 
Oracle rac 10g best practices
Oracle rac 10g best practicesOracle rac 10g best practices
Oracle rac 10g best practicesHaseeb Alam
 
multiprocessor _system _presentation.ppt
multiprocessor _system _presentation.pptmultiprocessor _system _presentation.ppt
multiprocessor _system _presentation.pptnaghamallella
 
Controlling Memory Footprint at All Layers: Linux Kernel, Applications, Libra...
Controlling Memory Footprint at All Layers: Linux Kernel, Applications, Libra...Controlling Memory Footprint at All Layers: Linux Kernel, Applications, Libra...
Controlling Memory Footprint at All Layers: Linux Kernel, Applications, Libra...peknap
 
T03160010220104036 multipleproc week11-1-pert 21
T03160010220104036 multipleproc week11-1-pert 21T03160010220104036 multipleproc week11-1-pert 21
T03160010220104036 multipleproc week11-1-pert 21Dandi Aulia
 
Multiple processor systems
Multiple processor systemsMultiple processor systems
Multiple processor systemsjeetesh036
 

Similaire à Mimd (20)

Ceg4131 models
Ceg4131 modelsCeg4131 models
Ceg4131 models
 
Communication model of parallel platforms
Communication model of parallel platformsCommunication model of parallel platforms
Communication model of parallel platforms
 
Lecture 6
Lecture  6Lecture  6
Lecture 6
 
Lecture 6
Lecture  6Lecture  6
Lecture 6
 
Lecture 6.1
Lecture  6.1Lecture  6.1
Lecture 6.1
 
Lecture 6.1
Lecture  6.1Lecture  6.1
Lecture 6.1
 
Lecture 6.1
Lecture  6.1Lecture  6.1
Lecture 6.1
 
Overview on NUMA
Overview on NUMAOverview on NUMA
Overview on NUMA
 
network ram parallel computing
network ram parallel computingnetwork ram parallel computing
network ram parallel computing
 
Distributed Shared Memory
Distributed Shared MemoryDistributed Shared Memory
Distributed Shared Memory
 
Intro_ppt.pptx
Intro_ppt.pptxIntro_ppt.pptx
Intro_ppt.pptx
 
Federal VMUG - March - Virtual machine sizing considerations in a numa enviro...
Federal VMUG - March - Virtual machine sizing considerations in a numa enviro...Federal VMUG - March - Virtual machine sizing considerations in a numa enviro...
Federal VMUG - March - Virtual machine sizing considerations in a numa enviro...
 
Emulex OneConnect Universal CNA (Short Overview)
Emulex OneConnect Universal CNA (Short Overview)Emulex OneConnect Universal CNA (Short Overview)
Emulex OneConnect Universal CNA (Short Overview)
 
Overview of Distributed Systems
Overview of Distributed SystemsOverview of Distributed Systems
Overview of Distributed Systems
 
Roeder posterismb2010
Roeder posterismb2010Roeder posterismb2010
Roeder posterismb2010
 
Oracle rac 10g best practices
Oracle rac 10g best practicesOracle rac 10g best practices
Oracle rac 10g best practices
 
multiprocessor _system _presentation.ppt
multiprocessor _system _presentation.pptmultiprocessor _system _presentation.ppt
multiprocessor _system _presentation.ppt
 
Controlling Memory Footprint at All Layers: Linux Kernel, Applications, Libra...
Controlling Memory Footprint at All Layers: Linux Kernel, Applications, Libra...Controlling Memory Footprint at All Layers: Linux Kernel, Applications, Libra...
Controlling Memory Footprint at All Layers: Linux Kernel, Applications, Libra...
 
T03160010220104036 multipleproc week11-1-pert 21
T03160010220104036 multipleproc week11-1-pert 21T03160010220104036 multipleproc week11-1-pert 21
T03160010220104036 multipleproc week11-1-pert 21
 
Multiple processor systems
Multiple processor systemsMultiple processor systems
Multiple processor systems
 

Mimd

  • 1. * Submitted to : Submitted by: Ms Simarpreet Kiran, A06 Lec : Computer Kh. Loyanganba Meitei, 42 System and Architecture Bca Hons, D1111
  • 2. * *Types of shared memory *Physically shared memory *Virtual (or distributed) shared memory *Scalability issues *Organisation of memory *Design of interconnection network *Cache coherence protocols 2
  • 3. Multi-processor: Structure of Shared Memory MIMD Architectures
  • 4. Design space of shared memory computers Shared memory computers Interconnection Cache coherency Single address space scheme memory access Hardware Virtual shared Shared path Software based Physical shared based memory Switching memory UMA network Singled bus based Multiple bus Crossbar Multistage network NUMA based Bus multiplication CC-NUMA Grid of buses Omega Banyan Benes Hierarchical COMA system
  • 5. *
  • 6. * *Also called distributed shared memory architecture *The local memories of multi-computer are components of global address space: * any processor can access the local memory of any other processor *Three approaches: * Non-uniform memory access (NUMA) machines * Cache-only memory access (COMA) machines * Cache-coherent non-uniform memory access (CC-NUMA) machines
  • 7. *
  • 8. * *Logically shared memory is physically distributed *Different access of local and remote memory blocks. Remote access takes much more time – latency *Sensitive to data and program distribution *Close to distributed memory systems, yet the programming paradigm is different *Example: Cray T3D
  • 9. *
  • 10. *
  • 11. * *Each block of the shared memory works as local cache of a processor *Continuous, dynamic migration of data *Hit-rate decreases the traffic on the Interconnection Network *Solutions for data-consistency increase the same traffic (see cache coherency problem later) *Examples: KSR-1, DDM
  • 12. *
  • 13. * *A combination of NUMA and COMA *Initially static data distribution, then dynamic data migration *Cache coherency problem is to be solved *COMA and CC-NUMA are used in newer generation of parallel computers *Examples: Convex SPP1000, Stanford DASH, MIT Alewife
  • 14. * + No need to partition data or program, uniprocessor programming techniques can be adapted + Communication between processor is efficient + Minor modifications of tool chain and compiler - Synchronized access to share data in memory needed. Synchronizing constructs (semaphores, conditional critical regions, monitors) result in nondeterministic behavior which can lead programming errors that are difficult to discover - Lack of scalability due to (memory) contention problem
  • 15. * *Memory Access Time * can be a bottleneck even in a single-processor system *Contention for Memory * two or more processors want to access a location in the same block at the same time (hot spot problem). *Contention for Communication * processors should share and use exclusively elements of the Interconnection Network *Result: long latency-time, idle processors, nonscalable system
  • 16. * *Problems of scalable computers 1. Tolerate and hide latency of remote loads 2. Tolerate and hide idling due to synchronization *Solutions 1. Cache memory * problem of cache coherence 2. Prefetching 3. Threads and fast context switching
  • 17. *