SlideShare une entreprise Scribd logo
1  sur  26
DISTRIBUTED DATA BASE :
Database:-
Logical interrelated collection of shared data, along with description of
data, physically distributed over a computer network
1
A distributed database (DDB) is a collection of
multiple, logically interrelated databases
distributed over a computer network.
A distributed database management system
(DDBMS) is the software that manages the DDB
and provides an access mechanism that makes this
distribution transparent to the users
2
3
A DDBMS mainly classified into two types:
◦ Homogeneous Distributed database management
systems
◦ Heterogeneous Distributed database management
systems
4
⚫ All sites are interconnected.
⚫ Fragments can be replicated.
⚫ Logically related shared data can be collected.
⚫ Data at each and every site is controlled by the DBMS.
⚫ Each Distributed Database Management System takes part
in at least one global application.
CHARACTERISTICS
5
⚫ Security
⚫ Keeping track of data
⚫ Replicated data management
⚫ System catalog management
⚫ Distributed transaction management
⚫ Distributed database recovery
FUNCTIONALITY:
6
⚫ In a homogeneous distributed database all sites
have identical software and are aware of each
other and agree to cooperate in processing user
requests.
⚫ The homogeneous system is much easier to
design and manage
⚫ The operating system used, at each
location must be same or compatible.
⚫ The database application (or DBMS) used at
each location must be same or compatible.
HOMOGENEOUS DDMS::
7
⚫ In a heterogeneous distributed database
different sites may use different schema
and software.
⚫ In heterogeneous systems, different nodes
may have different hardware & software
and data structures at various nodes or
locations are also incompatible.
⚫ Different computers and operating
systems, database applications or data
models may be used at each of the
locations
HETEROGENEOUS DDBMS:
8
⚫ On heterogeneous system, translations are
required to allow communication between
different sites (or DBMS).
⚫ The heterogeneous system is often not
technically or economically feasible. In
this system, a user at one location may be
able to read but not update the data at
another location.
9
⚫ Less danger of a single-point failure.
When one of the computers fails, the
workload is picked up by other
workstations.
⚫ Data are also distributed at multiple
sites.
⚫ The end user is able to access any
available copy of the data, and an end
user's request is processed by any
processor at the data location
ADVANTAGES:
10
ADVANTAGES(Cont..,)
⚫ Less danger of a single-point failure. When
one of the computers fails, the workload is
picked up by other workstations.
⚫ Data are also distributed at multiple sites.
⚫ The end user is able to access any available
copy of the data, and an end user's request is
processed by any processor at the data
location
11
⚫ Complexity of management and control.
⚫ Applications must recognize data location,
and they must be able to stitch together data
from various sites.
⚫ Security.
DISADVANTAGES
12
⚫ Increased storage and infrastructure
requirements.
⚫ Multiple copies of data has to be at different
sites, thus an additional disk
storage space will be required.
⚫ The probability of security lapses
increases when data are located at
multiple sites.
DISADVANTAGES(Cont..,)
13
LOCKS CONCEPTS:
⚫ A lock is used when multiple users need to access a
database concurrently. This prevents data from being
corrupted or invalidated when multiple users try to write to
the database.
⚫ Any single user can only modify those database records
(that is, items in the database) to which they have applied a
lock that gives them exclusive access to the record until the
lock is released. Locking not only provides exclusivity to
write but also prevents (or controls) reading of unfinished
modifications.
14
⚫ A parallel database system is to improve performance through
parallelization of various operations, such as loading data,
building indexes and evaluating queries.
⚫ The distribution is solely done on the bases of performance.
⚫ Parallel databases improve processing and input/output
speeds by using multiple CPUs and disks in parallel.
⚫ Many operations are performed
simultaneously
⚫ Data may be stored in a distributed fashion.
PARALLEL DATABASE
15
Change B/W Distributed
Database and Parallel Database
Characteristics Parallel Database Distributed database
Definition It is a software system
where multiple
processors or machines are
used to
execute and run queries in
parallel.
It is a software system that
manages multiple logically
interrelated databases
distributed over a computer
network.
Geographical
Location
The nodes are located at
geographically
same location.
The nodes are usually
located at geographically
different locations.
Execution
Speed
Quicker Slower
Overhead Less More
16
Node types Compulsorily Homogeneous Need not be homogeneous
Performance Lower reliability &
availability.
Higher reliability &
availability.
Scope of
Expansion
Difficult to expand Easier to expand
Backup Backup at one site only Backup at multiple sites
Consistency Maintaining consistency is
easier
Maintaining consistency is
difficult.
17
⚫ Fragmentation is a process of division or the
mapping of the tables based on the columns and
rows of data into the smallest unit of data.
⚫ Data that has broken down is still possible to be
combined again with the intention to complete the
data collection using fragmentation.
⚫ Fragmentation is a database server feature that allows
you to control where data is stored at the table level.
⚫ Fragmentation enables you to define groups of rows
or index keys within a table.
Data Fragmentation
18
⚫ Replication is that we store several copies of a relation
or relation fragment. An entire relation can be
replicated at one or more sites.
⚫ Similarly, one or more fragments of a relation can
be replicated at other sites.
⚫ For example, if a relation R is fragmented into R1,R2,
and R3, there might be just one copy of R1, whereas
R2 is replicated at two other sites and R3 is replicated
at all sites.
REPLICATION:
19
The motivation for replication is twofold:
1. Increased Availability of Data: If a site that
contains a replica goes down, we can find
the same data at other sites. Similarly, if
local copies of remote relations are
available, we are less vulnerable to failure
of communication links.
2. Faster Query Evaluation: Queries can
execute faster by using a local copy of a
relation instead of going to a remote site
TWO FOLD REPLICATION
20
⚫ In a distributed DBMS,a given transaction is
submitted at some one site, but it can access data
at other sites as well.
⚫ When a transaction is submitted at some site, the
transaction manager at that site breaks it up into
a collection of one or more sub-transactions that
execute at different sites, submits them to
transaction managers at the other sites, and
coordinates their activity.
DISTRIBUTED TRANSACTIONS:
21
⚫ Distributed Concurrency Control: Procedure in
which required for controlling concurrent execution
of operation that takes
⚫ Distributed Recovery: Transaction atomicity must
be ensured when a transaction commits, all its
actions, across all the sites at which it executes,
must persist. Similarly, when a transaction aborts,
none of its actions must be allowed to persist
22
⚫ The choice of technique determines which objects
are to be locked.
⚫ When locks are obtained and released is determined
by the concurrency control protocol.
⚫ We now consider how lock and unlock requests are
implemented in a distributed environment. Lock
management can be distributed across sites in many
ways:
23
⚫ Centralized : A single site is in charge of handling lock
and unlock requests for all objects.
⚫ Primary Copy: One copy of each object is designated the
primary copy.All requests to lock or unlock a copy of this
object are handled by the lock manager at the site where the
primary copy is stored, regardless of where the copy itself is
stored.
⚫ Fully Distributed : Requests to lock or unlock a copy of an
object stored at a site are handled by the lock manager at the
site where the copy is stored.
24
⚫ Recovery in a distributed DBMS is more complicated than in a
centralized DBMS for the following reasons:
◦ New kinds of failure can arise : Failure of communication links and
failure of a remote site at which a sub-transaction is executing.
◦ Either all sub-transactions of a given transaction must commit or none
must commit, and this property must be guaranteed despite any
combination of site and link failures. This guarantee is achieved using a
commit protocol.
DISTRIBUTED RECOVERY
25
26

Contenu connexe

Tendances

Tendances (20)

Dma unit 1
Dma unit   1Dma unit   1
Dma unit 1
 
Change Point Analysis
Change Point AnalysisChange Point Analysis
Change Point Analysis
 
data mining
data miningdata mining
data mining
 
BIG DATA.pdf
BIG DATA.pdfBIG DATA.pdf
BIG DATA.pdf
 
Data mining
Data mining Data mining
Data mining
 
Anomaly Detection and Spark Implementation - Meetup Presentation.pptx
Anomaly Detection and Spark Implementation - Meetup Presentation.pptxAnomaly Detection and Spark Implementation - Meetup Presentation.pptx
Anomaly Detection and Spark Implementation - Meetup Presentation.pptx
 
Data mining :Concepts and Techniques Chapter 2, data
Data mining :Concepts and Techniques Chapter 2, dataData mining :Concepts and Techniques Chapter 2, data
Data mining :Concepts and Techniques Chapter 2, data
 
Text Analysis with Machine Learning
Text Analysis with Machine LearningText Analysis with Machine Learning
Text Analysis with Machine Learning
 
Chapter - 6 Data Mining Concepts and Techniques 2nd Ed slides Han & Kamber
Chapter - 6 Data Mining Concepts and Techniques 2nd Ed slides Han & KamberChapter - 6 Data Mining Concepts and Techniques 2nd Ed slides Han & Kamber
Chapter - 6 Data Mining Concepts and Techniques 2nd Ed slides Han & Kamber
 
Logistic Regression in Case-Control Study
Logistic Regression in Case-Control StudyLogistic Regression in Case-Control Study
Logistic Regression in Case-Control Study
 
Digital forensic tools
Digital forensic toolsDigital forensic tools
Digital forensic tools
 
Association rule mining and Apriori algorithm
Association rule mining and Apriori algorithmAssociation rule mining and Apriori algorithm
Association rule mining and Apriori algorithm
 
Data Mining Technique - SEMMA
Data Mining Technique - SEMMAData Mining Technique - SEMMA
Data Mining Technique - SEMMA
 
SOWMYA SAS Resume
SOWMYA SAS ResumeSOWMYA SAS Resume
SOWMYA SAS Resume
 
CRISP-DM - Agile Approach To Data Mining Projects
CRISP-DM - Agile Approach To Data Mining ProjectsCRISP-DM - Agile Approach To Data Mining Projects
CRISP-DM - Agile Approach To Data Mining Projects
 
Data mining: Concepts and Techniques, Chapter12 outlier Analysis
Data mining: Concepts and Techniques, Chapter12 outlier Analysis Data mining: Concepts and Techniques, Chapter12 outlier Analysis
Data mining: Concepts and Techniques, Chapter12 outlier Analysis
 
Data Science
Data ScienceData Science
Data Science
 
Anomaly detection Full Article
Anomaly detection Full ArticleAnomaly detection Full Article
Anomaly detection Full Article
 
Chapter 12 outlier
Chapter 12 outlierChapter 12 outlier
Chapter 12 outlier
 
Exploratory data analysis
Exploratory data analysisExploratory data analysis
Exploratory data analysis
 

Similaire à nnnn.pptx

Chapter-6 Distribute Database system (3).ppt
Chapter-6 Distribute Database system (3).pptChapter-6 Distribute Database system (3).ppt
Chapter-6 Distribute Database system (3).ppt
latigudata
 
Database , 1 Introduction
 Database , 1 Introduction Database , 1 Introduction
Database , 1 Introduction
Ali Usman
 
Distributed databases
Distributed databasesDistributed databases
Distributed databases
Suneel Dogra
 
Csld phan tan va song song
Csld phan tan va song songCsld phan tan va song song
Csld phan tan va song song
Lê Anh Trung
 

Similaire à nnnn.pptx (20)

Distributed Database
Distributed DatabaseDistributed Database
Distributed Database
 
Distributed database
Distributed databaseDistributed database
Distributed database
 
Distributed database
Distributed databaseDistributed database
Distributed database
 
Distributed database
Distributed databaseDistributed database
Distributed database
 
DDBMS.pptx
DDBMS.pptxDDBMS.pptx
DDBMS.pptx
 
lecture-13.pptx
lecture-13.pptxlecture-13.pptx
lecture-13.pptx
 
Distributed database management system
Distributed database management  systemDistributed database management  system
Distributed database management system
 
Chapter-6 Distribute Database system (3).ppt
Chapter-6 Distribute Database system (3).pptChapter-6 Distribute Database system (3).ppt
Chapter-6 Distribute Database system (3).ppt
 
Introduction to distributed database
Introduction to distributed databaseIntroduction to distributed database
Introduction to distributed database
 
Advance DBMS
Advance DBMSAdvance DBMS
Advance DBMS
 
Santosh Kumar Meher(2105040008) DISTRIBUTED DATABASE.pptx
Santosh Kumar Meher(2105040008) DISTRIBUTED DATABASE.pptxSantosh Kumar Meher(2105040008) DISTRIBUTED DATABASE.pptx
Santosh Kumar Meher(2105040008) DISTRIBUTED DATABASE.pptx
 
Distribution transparency and Distributed transaction
Distribution transparency and Distributed transactionDistribution transparency and Distributed transaction
Distribution transparency and Distributed transaction
 
1 ddbms jan 2011_u
1 ddbms jan 2011_u1 ddbms jan 2011_u
1 ddbms jan 2011_u
 
Distributed dbms (ddbms)
Distributed dbms (ddbms)Distributed dbms (ddbms)
Distributed dbms (ddbms)
 
1 introduction DDBS
1 introduction DDBS1 introduction DDBS
1 introduction DDBS
 
Database , 1 Introduction
 Database , 1 Introduction Database , 1 Introduction
Database , 1 Introduction
 
Distributed databases
Distributed databasesDistributed databases
Distributed databases
 
Database 2 ddbms,homogeneous & heterognus adv & disadvan
Database 2 ddbms,homogeneous & heterognus adv & disadvanDatabase 2 ddbms,homogeneous & heterognus adv & disadvan
Database 2 ddbms,homogeneous & heterognus adv & disadvan
 
Unit-1 Introduction to Big Data.pptx
Unit-1 Introduction to Big Data.pptxUnit-1 Introduction to Big Data.pptx
Unit-1 Introduction to Big Data.pptx
 
Csld phan tan va song song
Csld phan tan va song songCsld phan tan va song song
Csld phan tan va song song
 

Plus de NANDHINIS900805

Plus de NANDHINIS900805 (9)

3b318431-df9f-4a2c-9909-61ecb6af8444.pptx
3b318431-df9f-4a2c-9909-61ecb6af8444.pptx3b318431-df9f-4a2c-9909-61ecb6af8444.pptx
3b318431-df9f-4a2c-9909-61ecb6af8444.pptx
 
wepik-breaking-down-spam-detection-a-deep-learning-approach-with-tensorflow-a...
wepik-breaking-down-spam-detection-a-deep-learning-approach-with-tensorflow-a...wepik-breaking-down-spam-detection-a-deep-learning-approach-with-tensorflow-a...
wepik-breaking-down-spam-detection-a-deep-learning-approach-with-tensorflow-a...
 
Alligation OR mixture.pptx
Alligation OR mixture.pptxAlligation OR mixture.pptx
Alligation OR mixture.pptx
 
AP&GP.pptx
AP&GP.pptxAP&GP.pptx
AP&GP.pptx
 
PERMUTATION AND COMBINATION.pptx
PERMUTATION AND COMBINATION.pptxPERMUTATION AND COMBINATION.pptx
PERMUTATION AND COMBINATION.pptx
 
ARCHITECTURE.pptx
ARCHITECTURE.pptxARCHITECTURE.pptx
ARCHITECTURE.pptx
 
after 10th (1).pptx
after 10th (1).pptxafter 10th (1).pptx
after 10th (1).pptx
 
DBMS.pptx
DBMS.pptxDBMS.pptx
DBMS.pptx
 
.n.pptx
.n.pptx.n.pptx
.n.pptx
 

Dernier

The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
heathfieldcps1
 
Salient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functionsSalient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functions
KarakKing
 

Dernier (20)

Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
 
Google Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptxGoogle Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptx
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan Fellows
 
Fostering Friendships - Enhancing Social Bonds in the Classroom
Fostering Friendships - Enhancing Social Bonds  in the ClassroomFostering Friendships - Enhancing Social Bonds  in the Classroom
Fostering Friendships - Enhancing Social Bonds in the Classroom
 
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptxOn_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.ppt
 
Interdisciplinary_Insights_Data_Collection_Methods.pptx
Interdisciplinary_Insights_Data_Collection_Methods.pptxInterdisciplinary_Insights_Data_Collection_Methods.pptx
Interdisciplinary_Insights_Data_Collection_Methods.pptx
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
 
How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17
 
Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)
 
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...
 
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
 
Graduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - EnglishGraduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - English
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptx
 
Salient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functionsSalient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functions
 
Towards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxTowards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptx
 

nnnn.pptx

  • 1. DISTRIBUTED DATA BASE : Database:- Logical interrelated collection of shared data, along with description of data, physically distributed over a computer network 1
  • 2. A distributed database (DDB) is a collection of multiple, logically interrelated databases distributed over a computer network. A distributed database management system (DDBMS) is the software that manages the DDB and provides an access mechanism that makes this distribution transparent to the users 2
  • 3. 3
  • 4. A DDBMS mainly classified into two types: ◦ Homogeneous Distributed database management systems ◦ Heterogeneous Distributed database management systems 4
  • 5. ⚫ All sites are interconnected. ⚫ Fragments can be replicated. ⚫ Logically related shared data can be collected. ⚫ Data at each and every site is controlled by the DBMS. ⚫ Each Distributed Database Management System takes part in at least one global application. CHARACTERISTICS 5
  • 6. ⚫ Security ⚫ Keeping track of data ⚫ Replicated data management ⚫ System catalog management ⚫ Distributed transaction management ⚫ Distributed database recovery FUNCTIONALITY: 6
  • 7. ⚫ In a homogeneous distributed database all sites have identical software and are aware of each other and agree to cooperate in processing user requests. ⚫ The homogeneous system is much easier to design and manage ⚫ The operating system used, at each location must be same or compatible. ⚫ The database application (or DBMS) used at each location must be same or compatible. HOMOGENEOUS DDMS:: 7
  • 8. ⚫ In a heterogeneous distributed database different sites may use different schema and software. ⚫ In heterogeneous systems, different nodes may have different hardware & software and data structures at various nodes or locations are also incompatible. ⚫ Different computers and operating systems, database applications or data models may be used at each of the locations HETEROGENEOUS DDBMS: 8
  • 9. ⚫ On heterogeneous system, translations are required to allow communication between different sites (or DBMS). ⚫ The heterogeneous system is often not technically or economically feasible. In this system, a user at one location may be able to read but not update the data at another location. 9
  • 10. ⚫ Less danger of a single-point failure. When one of the computers fails, the workload is picked up by other workstations. ⚫ Data are also distributed at multiple sites. ⚫ The end user is able to access any available copy of the data, and an end user's request is processed by any processor at the data location ADVANTAGES: 10
  • 11. ADVANTAGES(Cont..,) ⚫ Less danger of a single-point failure. When one of the computers fails, the workload is picked up by other workstations. ⚫ Data are also distributed at multiple sites. ⚫ The end user is able to access any available copy of the data, and an end user's request is processed by any processor at the data location 11
  • 12. ⚫ Complexity of management and control. ⚫ Applications must recognize data location, and they must be able to stitch together data from various sites. ⚫ Security. DISADVANTAGES 12
  • 13. ⚫ Increased storage and infrastructure requirements. ⚫ Multiple copies of data has to be at different sites, thus an additional disk storage space will be required. ⚫ The probability of security lapses increases when data are located at multiple sites. DISADVANTAGES(Cont..,) 13
  • 14. LOCKS CONCEPTS: ⚫ A lock is used when multiple users need to access a database concurrently. This prevents data from being corrupted or invalidated when multiple users try to write to the database. ⚫ Any single user can only modify those database records (that is, items in the database) to which they have applied a lock that gives them exclusive access to the record until the lock is released. Locking not only provides exclusivity to write but also prevents (or controls) reading of unfinished modifications. 14
  • 15. ⚫ A parallel database system is to improve performance through parallelization of various operations, such as loading data, building indexes and evaluating queries. ⚫ The distribution is solely done on the bases of performance. ⚫ Parallel databases improve processing and input/output speeds by using multiple CPUs and disks in parallel. ⚫ Many operations are performed simultaneously ⚫ Data may be stored in a distributed fashion. PARALLEL DATABASE 15
  • 16. Change B/W Distributed Database and Parallel Database Characteristics Parallel Database Distributed database Definition It is a software system where multiple processors or machines are used to execute and run queries in parallel. It is a software system that manages multiple logically interrelated databases distributed over a computer network. Geographical Location The nodes are located at geographically same location. The nodes are usually located at geographically different locations. Execution Speed Quicker Slower Overhead Less More 16
  • 17. Node types Compulsorily Homogeneous Need not be homogeneous Performance Lower reliability & availability. Higher reliability & availability. Scope of Expansion Difficult to expand Easier to expand Backup Backup at one site only Backup at multiple sites Consistency Maintaining consistency is easier Maintaining consistency is difficult. 17
  • 18. ⚫ Fragmentation is a process of division or the mapping of the tables based on the columns and rows of data into the smallest unit of data. ⚫ Data that has broken down is still possible to be combined again with the intention to complete the data collection using fragmentation. ⚫ Fragmentation is a database server feature that allows you to control where data is stored at the table level. ⚫ Fragmentation enables you to define groups of rows or index keys within a table. Data Fragmentation 18
  • 19. ⚫ Replication is that we store several copies of a relation or relation fragment. An entire relation can be replicated at one or more sites. ⚫ Similarly, one or more fragments of a relation can be replicated at other sites. ⚫ For example, if a relation R is fragmented into R1,R2, and R3, there might be just one copy of R1, whereas R2 is replicated at two other sites and R3 is replicated at all sites. REPLICATION: 19
  • 20. The motivation for replication is twofold: 1. Increased Availability of Data: If a site that contains a replica goes down, we can find the same data at other sites. Similarly, if local copies of remote relations are available, we are less vulnerable to failure of communication links. 2. Faster Query Evaluation: Queries can execute faster by using a local copy of a relation instead of going to a remote site TWO FOLD REPLICATION 20
  • 21. ⚫ In a distributed DBMS,a given transaction is submitted at some one site, but it can access data at other sites as well. ⚫ When a transaction is submitted at some site, the transaction manager at that site breaks it up into a collection of one or more sub-transactions that execute at different sites, submits them to transaction managers at the other sites, and coordinates their activity. DISTRIBUTED TRANSACTIONS: 21
  • 22. ⚫ Distributed Concurrency Control: Procedure in which required for controlling concurrent execution of operation that takes ⚫ Distributed Recovery: Transaction atomicity must be ensured when a transaction commits, all its actions, across all the sites at which it executes, must persist. Similarly, when a transaction aborts, none of its actions must be allowed to persist 22
  • 23. ⚫ The choice of technique determines which objects are to be locked. ⚫ When locks are obtained and released is determined by the concurrency control protocol. ⚫ We now consider how lock and unlock requests are implemented in a distributed environment. Lock management can be distributed across sites in many ways: 23
  • 24. ⚫ Centralized : A single site is in charge of handling lock and unlock requests for all objects. ⚫ Primary Copy: One copy of each object is designated the primary copy.All requests to lock or unlock a copy of this object are handled by the lock manager at the site where the primary copy is stored, regardless of where the copy itself is stored. ⚫ Fully Distributed : Requests to lock or unlock a copy of an object stored at a site are handled by the lock manager at the site where the copy is stored. 24
  • 25. ⚫ Recovery in a distributed DBMS is more complicated than in a centralized DBMS for the following reasons: ◦ New kinds of failure can arise : Failure of communication links and failure of a remote site at which a sub-transaction is executing. ◦ Either all sub-transactions of a given transaction must commit or none must commit, and this property must be guaranteed despite any combination of site and link failures. This guarantee is achieved using a commit protocol. DISTRIBUTED RECOVERY 25
  • 26. 26