SlideShare une entreprise Scribd logo
1  sur  13
1
Introduction to the Design and
Specification of File Structures
2
Outline
• What are File Structures?
• Why Study File Structure Design
• Overview of File Structure Design
3
Definition
• A File Structure is a combination of
representations for data in files and of
operations for accessing the data.
• A File Structure allows applications to read,
write and modify data. It might also support
finding the data that matches some search
criteria or reading through the data in some
particular order.
4
Why Study File Structure Design?
I. Data Storage
• Computer Data can be stored in three kinds
of locations:
– Primary Storage ==> Memory
[Computer Memory]
– Secondary Storage [Online Disk/ Tape/
CDRom that can be accessed by the computer]
– Tertiary Storage ==> Archival Data
[Offline Disk/Tape/ CDRom not directly
available to the computer.]
Our
Focus
5
Why Study File Structure Design?
II. Memory versus Secondary Storage
• Secondary storage such as disks can pack thousands
of megabytes in a small physical location.
• Computer Memory (RAM) is limited.
• However, relative to Memory, access to secondary
storage is extremely slow [E.g., getting information
from slow RAM takes 120. 10-9
seconds(= 120
nanoseconds) while getting information from Disk takes
30. 10-3
seconds (= 30 milliseconds)]
6
Why Study File Structure Design?
III. How Can Secondary Storage Access Time be
Improved?
By improving the File Structure.
Since the details of the representation of the
data and the implementation of the
operations determine the efficiency of the
file structure for particular applications,
improving these details can help improve
secondary storage access time.
7
Overview of File Structure Design
I. General Goals
• Get the information we need with one
access to the disk.
• If that’s not possible, then get the
information with as few accesses as
possible.
• Group information so that we are likely to
get everything we need with only one trip to
the disk.
8
Overview of File Structure Design
II. Fixed versus Dynamic Files
• It is relatively easy to come up with file
structure designs that meet the general goals
when the files never change.
• When files grow or shrink when
information is added and deleted, it is much
more difficult.
9
History of File Structures
I. Early Work
• Early Work assumed that files were on tape.
• Access was sequential and the cost of acces
grew in direct proportion to the size of the
file.
10
History of File Structures
II. The emergence of Disks and Indexes
• As files grew very large, unaided sequential
access was not a good solution.
• Disks allowed for direct access.
• Indexes made it possible to keep a list of
keys and pointers in a small file that could
be searched very quickly.
• With the key and pointer, the user had
direct access to the large, primary file.
11
History of File Structures
III. The emergence of Tree Structures
• As indexes also have a sequential flavour,
when they grew too much, they also
became difficult to manage.
• The idea of using tree structures to manage
the index emerged in the early 60’s.
• However, trees can grow very unevenly as
records are added and deleted, resulting in
long searches requiring many disk accesses
to find a record.
12
History of File Structures
IV. Balanced Trees
• In 1963, researchers came up with the idea
of AVL trees for data in memory.
• AVL trees, however, did not apply to files because they
work well when tree nodes are composed of single records
rather than dozens or hundreds of them.
• In the 1970’s came the idea of B-Trees which require an
O(logk N) access time where N is the number of entries in
the file and k, th number of entries indexed in a single block
of the B-Tree structure --> B-Trees can guarantee that one
can find one file entry among millions of others with only 3
or 4 trips to the disk.
13
History of File Structures
V. Hash Tables
• Retrieving entries in 3 or 4 accesses is good,
but it does not reach the goal of accessing
data with a single request.
• From early on, Hashing was a good way to
reach this goal with files that do not change
size greatly over time.
• Recently, Extendible Dynamic Hashing
guarantees one or at most two disk accesses
no matter how big a file becomes.

Contenu connexe

Tendances

Chapter09 Implementing And Using Group Policy
Chapter09      Implementing And  Using  Group  PolicyChapter09      Implementing And  Using  Group  Policy
Chapter09 Implementing And Using Group Policy
Raja Waseem Akhtar
 
NETWORK FILE SYSTEM
NETWORK FILE SYSTEMNETWORK FILE SYSTEM
NETWORK FILE SYSTEM
Roshan Kumar
 

Tendances (20)

file management
 file management file management
file management
 
Chapter09 Implementing And Using Group Policy
Chapter09      Implementing And  Using  Group  PolicyChapter09      Implementing And  Using  Group  Policy
Chapter09 Implementing And Using Group Policy
 
File System Implementation - Part1
File System Implementation - Part1File System Implementation - Part1
File System Implementation - Part1
 
File transfer protocol (ftp)
File transfer protocol (ftp)File transfer protocol (ftp)
File transfer protocol (ftp)
 
Storage Virtualization
Storage VirtualizationStorage Virtualization
Storage Virtualization
 
VM for cloud infrastructure
VM for cloud infrastructureVM for cloud infrastructure
VM for cloud infrastructure
 
Windows Server 2019.pptx
Windows Server 2019.pptxWindows Server 2019.pptx
Windows Server 2019.pptx
 
Storage Basics
Storage BasicsStorage Basics
Storage Basics
 
Mass Storage Structure
Mass Storage StructureMass Storage Structure
Mass Storage Structure
 
The Next Generation of Hyperconverged Infrastructure - Cisco
The Next Generation of Hyperconverged Infrastructure - CiscoThe Next Generation of Hyperconverged Infrastructure - Cisco
The Next Generation of Hyperconverged Infrastructure - Cisco
 
NETWORK FILE SYSTEM
NETWORK FILE SYSTEMNETWORK FILE SYSTEM
NETWORK FILE SYSTEM
 
Basics of storage Technology
Basics of storage TechnologyBasics of storage Technology
Basics of storage Technology
 
Nfs
NfsNfs
Nfs
 
Wsus best practices
Wsus best practicesWsus best practices
Wsus best practices
 
Unix Memory Management - Operating Systems
Unix Memory Management - Operating SystemsUnix Memory Management - Operating Systems
Unix Memory Management - Operating Systems
 
File structures
File structuresFile structures
File structures
 
RAID and LVM
RAID and LVMRAID and LVM
RAID and LVM
 
Cloud computing
Cloud computing Cloud computing
Cloud computing
 
Memory management
Memory managementMemory management
Memory management
 
Memory management
Memory managementMemory management
Memory management
 

Similaire à Introduction to the design and specification of file structures

fileorganizationandintroductionofdbms-210313163900.pdf
fileorganizationandintroductionofdbms-210313163900.pdffileorganizationandintroductionofdbms-210313163900.pdf
fileorganizationandintroductionofdbms-210313163900.pdf
FraolUmeta
 
Data Indexing Presentation-My.pptppt.ppt
Data Indexing Presentation-My.pptppt.pptData Indexing Presentation-My.pptppt.ppt
Data Indexing Presentation-My.pptppt.ppt
sdsm2
 

Similaire à Introduction to the design and specification of file structures (20)

Chapter 1_ Introduction to File Structures.pdf
Chapter 1_ Introduction to File Structures.pdfChapter 1_ Introduction to File Structures.pdf
Chapter 1_ Introduction to File Structures.pdf
 
Fundamental file structure concepts & managing files of records
Fundamental file structure concepts & managing files of recordsFundamental file structure concepts & managing files of records
Fundamental file structure concepts & managing files of records
 
storage techniques_overview-1.pptx
storage techniques_overview-1.pptxstorage techniques_overview-1.pptx
storage techniques_overview-1.pptx
 
File System.pptx
File System.pptxFile System.pptx
File System.pptx
 
File organization and introduction of DBMS
File organization and introduction of DBMSFile organization and introduction of DBMS
File organization and introduction of DBMS
 
fileorganizationandintroductionofdbms-210313163900.pdf
fileorganizationandintroductionofdbms-210313163900.pdffileorganizationandintroductionofdbms-210313163900.pdf
fileorganizationandintroductionofdbms-210313163900.pdf
 
File system in operating system e learning
File system in operating system e learningFile system in operating system e learning
File system in operating system e learning
 
File Allocation Methods.ppt
File Allocation Methods.pptFile Allocation Methods.ppt
File Allocation Methods.ppt
 
DBMS_UNIT 5 Notes.pptx
DBMS_UNIT 5 Notes.pptxDBMS_UNIT 5 Notes.pptx
DBMS_UNIT 5 Notes.pptx
 
Data Analytics: HDFS with Big Data : Issues and Application
Data Analytics:  HDFS  with  Big Data :  Issues and ApplicationData Analytics:  HDFS  with  Big Data :  Issues and Application
Data Analytics: HDFS with Big Data : Issues and Application
 
File organisation
File organisationFile organisation
File organisation
 
Chapter 3
Chapter 3Chapter 3
Chapter 3
 
File and directory
File and directoryFile and directory
File and directory
 
File System operating system operating system
File System  operating system operating systemFile System  operating system operating system
File System operating system operating system
 
Data Indexing Presentation-My.pptppt.ppt
Data Indexing Presentation-My.pptppt.pptData Indexing Presentation-My.pptppt.ppt
Data Indexing Presentation-My.pptppt.ppt
 
Backup Data (cloud storage).pptx
Backup Data (cloud storage).pptxBackup Data (cloud storage).pptx
Backup Data (cloud storage).pptx
 
De-Anonymizing Live CDs through Physical Memory Analysis
De-Anonymizing Live CDs through Physical Memory AnalysisDe-Anonymizing Live CDs through Physical Memory Analysis
De-Anonymizing Live CDs through Physical Memory Analysis
 
DFS-Lecture-6 (3).ppt
DFS-Lecture-6 (3).pptDFS-Lecture-6 (3).ppt
DFS-Lecture-6 (3).ppt
 
UNIT7-FileMgmt.pptx
UNIT7-FileMgmt.pptxUNIT7-FileMgmt.pptx
UNIT7-FileMgmt.pptx
 
UNIT 5- Other Databases.pdf
UNIT 5- Other Databases.pdfUNIT 5- Other Databases.pdf
UNIT 5- Other Databases.pdf
 

Plus de Devyani Vaidya

Plus de Devyani Vaidya (20)

Hashing
HashingHashing
Hashing
 
Cosequential processing and the sorting of large files
Cosequential processing and the sorting of large filesCosequential processing and the sorting of large files
Cosequential processing and the sorting of large files
 
Mobile Phone Cloning
 Mobile Phone Cloning Mobile Phone Cloning
Mobile Phone Cloning
 
Data warehousing
Data warehousingData warehousing
Data warehousing
 
secued cloud
 secued cloud secued cloud
secued cloud
 
Cloud Cmputing Security
Cloud Cmputing SecurityCloud Cmputing Security
Cloud Cmputing Security
 
Cloud Security
Cloud SecurityCloud Security
Cloud Security
 
Wireless network
Wireless networkWireless network
Wireless network
 
Environmental law
Environmental lawEnvironmental law
Environmental law
 
Wireless mobile charging using microwaves
Wireless mobile charging using microwavesWireless mobile charging using microwaves
Wireless mobile charging using microwaves
 
Secure Cloud Issues
Secure Cloud IssuesSecure Cloud Issues
Secure Cloud Issues
 
Energy Harvesing Through Reverse Electrowetting
Energy Harvesing Through Reverse Electrowetting Energy Harvesing Through Reverse Electrowetting
Energy Harvesing Through Reverse Electrowetting
 
Wireless Charging Of Mobile
Wireless Charging Of Mobile  Wireless Charging Of Mobile
Wireless Charging Of Mobile
 
Applet programming
Applet programming Applet programming
Applet programming
 
Seminar on telephone directory
Seminar on telephone directorySeminar on telephone directory
Seminar on telephone directory
 
History of Laptop
History of LaptopHistory of Laptop
History of Laptop
 
Ppt on open and close door using Applet
Ppt on open and close door using Applet Ppt on open and close door using Applet
Ppt on open and close door using Applet
 
Resource management
Resource managementResource management
Resource management
 
Ppt on use of biomatrix in secure e trasaction
Ppt on use of biomatrix in secure e trasactionPpt on use of biomatrix in secure e trasaction
Ppt on use of biomatrix in secure e trasaction
 
Secued Cloud
 Secued  Cloud Secued  Cloud
Secued Cloud
 

Dernier

The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
heathfieldcps1
 
Making and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdfMaking and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdf
Chris Hunter
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
heathfieldcps1
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
ciinovamais
 

Dernier (20)

The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 
Making and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdfMaking and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdf
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SD
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdf
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan Fellows
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docx
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdf
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.ppt
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptx
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdf
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...
 

Introduction to the design and specification of file structures

  • 1. 1 Introduction to the Design and Specification of File Structures
  • 2. 2 Outline • What are File Structures? • Why Study File Structure Design • Overview of File Structure Design
  • 3. 3 Definition • A File Structure is a combination of representations for data in files and of operations for accessing the data. • A File Structure allows applications to read, write and modify data. It might also support finding the data that matches some search criteria or reading through the data in some particular order.
  • 4. 4 Why Study File Structure Design? I. Data Storage • Computer Data can be stored in three kinds of locations: – Primary Storage ==> Memory [Computer Memory] – Secondary Storage [Online Disk/ Tape/ CDRom that can be accessed by the computer] – Tertiary Storage ==> Archival Data [Offline Disk/Tape/ CDRom not directly available to the computer.] Our Focus
  • 5. 5 Why Study File Structure Design? II. Memory versus Secondary Storage • Secondary storage such as disks can pack thousands of megabytes in a small physical location. • Computer Memory (RAM) is limited. • However, relative to Memory, access to secondary storage is extremely slow [E.g., getting information from slow RAM takes 120. 10-9 seconds(= 120 nanoseconds) while getting information from Disk takes 30. 10-3 seconds (= 30 milliseconds)]
  • 6. 6 Why Study File Structure Design? III. How Can Secondary Storage Access Time be Improved? By improving the File Structure. Since the details of the representation of the data and the implementation of the operations determine the efficiency of the file structure for particular applications, improving these details can help improve secondary storage access time.
  • 7. 7 Overview of File Structure Design I. General Goals • Get the information we need with one access to the disk. • If that’s not possible, then get the information with as few accesses as possible. • Group information so that we are likely to get everything we need with only one trip to the disk.
  • 8. 8 Overview of File Structure Design II. Fixed versus Dynamic Files • It is relatively easy to come up with file structure designs that meet the general goals when the files never change. • When files grow or shrink when information is added and deleted, it is much more difficult.
  • 9. 9 History of File Structures I. Early Work • Early Work assumed that files were on tape. • Access was sequential and the cost of acces grew in direct proportion to the size of the file.
  • 10. 10 History of File Structures II. The emergence of Disks and Indexes • As files grew very large, unaided sequential access was not a good solution. • Disks allowed for direct access. • Indexes made it possible to keep a list of keys and pointers in a small file that could be searched very quickly. • With the key and pointer, the user had direct access to the large, primary file.
  • 11. 11 History of File Structures III. The emergence of Tree Structures • As indexes also have a sequential flavour, when they grew too much, they also became difficult to manage. • The idea of using tree structures to manage the index emerged in the early 60’s. • However, trees can grow very unevenly as records are added and deleted, resulting in long searches requiring many disk accesses to find a record.
  • 12. 12 History of File Structures IV. Balanced Trees • In 1963, researchers came up with the idea of AVL trees for data in memory. • AVL trees, however, did not apply to files because they work well when tree nodes are composed of single records rather than dozens or hundreds of them. • In the 1970’s came the idea of B-Trees which require an O(logk N) access time where N is the number of entries in the file and k, th number of entries indexed in a single block of the B-Tree structure --> B-Trees can guarantee that one can find one file entry among millions of others with only 3 or 4 trips to the disk.
  • 13. 13 History of File Structures V. Hash Tables • Retrieving entries in 3 or 4 accesses is good, but it does not reach the goal of accessing data with a single request. • From early on, Hashing was a good way to reach this goal with files that do not change size greatly over time. • Recently, Extendible Dynamic Hashing guarantees one or at most two disk accesses no matter how big a file becomes.