Presented by: Eng. Juman Ghazi
Director: Dr. Eng. Rasha Masood
What is XML?
 XML stands for eXtensible Markup Language.
 XML is a markup language much like HTML.
 XML was designed to describe data and to focus on what
the data is.
eXtensible Markup Language
 Helps information systems share structured data.
 A meta-language that gives meaning to data that other
applications can use.
 Application and platform independent.
 Allows various types of data.
 Extensible to accommodate new tags and processing methods.
 Allows user-defined tags.
Advantages of using XML
 Simpler version of Standard Generalized Markup
Language (SGML).
 Easy to understand and read.
 Supported by a large number of platforms.
 Used across open standards.
Components of an XML Document
1. Elements: <hello>
2. Attributes: <item id="33905">
3. Entities: &lt; (<)
4. Advanced Components
   1. CDATA Sections
   2. Processing Instructions
Example in HTML
<html>
<head>
<title>Menu</title>
</head>
<body>
<h1>Soup</h1>
<h4>4.99</h4>
</body>
</html>
[Figure: HTML in web browser]
Example in XML
<?xml version="1.0" ?>
<menu>
<item>
<itemname>soup</itemname>
<cost>4.99</cost>
</item>
</menu>
[Figure: XML in web browser]
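Declaration:
 Must be the first line in the document when present.
 Provides information to the parser.
 Recommended but optional.
 Can contain up to three name-value pairs:
- version (required whenever the declaration is present).
- encoding (optional; UTF-8 is assumed when it is omitted).
- standalone (optional and rarely used).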
Tags:
 Text in between < and >.
 Come in pairs: a start tag and an end tag.
 Tags and data are stored together.
 Data is self-descriptive and easy to understand.
Elements:
 Basic building blocks of an XML file.
 Text between a start tag and an end
tag is considered the value of the
element.
 Documents contain exactly one root
element.
 Elements can contain nested elements.
[Figure: tree diagram of a root element with child elements and text nodes]
Attributes:
 Provide additional information about
the elements.
 Name-value pairs:
- Values are enclosed in single or double
quotes.
- Attribute names are unique within
the same element.
Comments:
 Can appear almost anywhere in the document, but not inside a tag.
- Start with <!--
- End with -->
 Contents inside a comment are not parsed.
More in XML:
1. Schemas
2. Parsers
3. Editors
4. Standards
1. Schemas:
 Describe the structure and content of an XML
document.
 Define a shared vocabulary for applications.
 Can be expressed using XML schema languages
such as:
- Document Type Definition (DTD).
- XML Schema (W3C).
[Figure: Industry standards and data exchange]
2. Parsers:
 Read and process the content of an XML
document.
 Include push and pull parsers:
- Pull parsers: the application asks the parser for the next event when it is ready for it.
- Push parsers: the parser controls the flow and pushes events to the application's handlers.
 Free XML parsers are available, including tools from
IBM.
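The difference between push and pull parsing can be sketched with Python's standard library. This is only an illustration, not something taken from the slides: it assumes xml.sax as the push parser and xml.etree.ElementTree.XMLPullParser as the pull parser, and the names MENU, TagCollector, push_tags and pull_tags are made up for the example.

```python
import xml.sax
from xml.etree.ElementTree import XMLPullParser

# The menu document from the earlier slide, on one line.
MENU = "<menu><item><itemname>soup</itemname><cost>4.99</cost></item></menu>"


class TagCollector(xml.sax.ContentHandler):
    """Push style: the SAX parser calls startElement for every start tag."""

    def __init__(self):
        super().__init__()
        self.tags = []

    def startElement(self, name, attrs):
        self.tags.append(name)


def push_tags(text):
    # The parser drives the loop and pushes events into the handler.
    handler = TagCollector()
    xml.sax.parseString(text.encode("utf-8"), handler)
    return handler.tags


def pull_tags(text):
    # The application feeds data and then asks for the events itself.
    parser = XMLPullParser(events=("start",))
    parser.feed(text)
    parser.close()
    return [element.tag for _event, element in parser.read_events()]


print(push_tags(MENU))
print(pull_tags(MENU))
```

Both functions visit the same start tags in document order; what differs is which side, parser or application, decides when the next event is handled.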
3. Editors:
 Text and graphical editors facilitate the editing
of XML code.
 Benefits of using editors:
- [...] coding effort.
- Provide [...] to perform tasks.
4. Standards:
 Various types of standards:
- Core standards form the basis of what is expressed
in an XML document.
- Processing standards relate to XML processing by
developers.
- Key vocabularies (applications).
 XML standards influencers include the W3C, ISO and
OASIS.
XML Rules:
1. All Elements Must Have a Closing Tag.
In HTML, some elements do not have to have a closing tag:
<p>This is a paragraph
<p>This is another paragraph
In XML, it is illegal to omit the closing tag:
<p>This is a paragraph</p>
<p>This is another paragraph</p>
2. XML Tags are Case Sensitive.
The tag <Letter> is different from the tag <letter>, so opening and closing tags must be written with the same case:
<Message>This is incorrect</message>
<message>This is correct</message>
"Opening and closing tags" are often referred to as "start and end tags". Use whichever you prefer; they mean exactly the same thing.
3. Elements Must be Properly Nested.
In HTML, you might see improperly nested elements:
<b><i>This text is bold and italic</b></i>
In XML, all elements must be properly nested within each other:
<b><i>This text is bold and italic</i></b>
4. XML Documents Must Have a Root Element.
XML documents must contain one element that is the parent of all other elements. This element is called the root element.
<root>
<child>
<subchild>.....</subchild>
</child>
</root>
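The four rules above can be tried out with any XML parser. The following is a minimal sketch using Python's xml.etree.ElementTree; the helper is_well_formed and the samples dictionary are hypothetical names chosen for this illustration, and the sample strings are shortened versions of the slide examples.

```python
import xml.etree.ElementTree as ET


def is_well_formed(text):
    """Return True if the parser accepts text, False if it reports a syntax error."""
    try:
        ET.fromstring(text)
        return True
    except ET.ParseError:
        return False


# One sample per rule, plus a document that follows all of them.
samples = {
    "missing closing tag": "<p>This is a paragraph",
    "mismatched case": "<Message>This is incorrect</message>",
    "improper nesting": "<b><i>bold and italic</b></i>",
    "two root elements": "<p>one</p><p>two</p>",
    "well formed": "<root><child><subchild>text</subchild></child></root>",
}

for label, text in samples.items():
    print(label, "->", is_well_formed(text))
```

Only the last sample is accepted; the parser rejects each of the others with a ParseError.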
5. XML Attribute Values Must be Quoted.
XML elements can have attributes in name/value pairs, and the value must always be quoted.
Wrong:
<note date=12/11/2007>
<to>Tove</to>
<from>Jani</from>
</note>
Right:
<note date="12/11/2007">
<to>Tove</to>
<from>Jani</from>
</note>
6. Entity References.
Some characters have a special meaning in XML. A character like "<" inside an XML element will generate an error because the parser interprets it as the start of a new element.
Wrong:
<message>if salary < 1000 then</message>
Right:
<message>if salary &lt; 1000 then</message>
Characters that have a special meaning in XML:
Character   Entity reference   Meaning
<           &lt;               less than
>           &gt;               greater than
&           &amp;              ampersand
'           &apos;             apostrophe
"           &quot;             quotation mark
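In practice, special characters are usually escaped by a library rather than by hand. The sketch below assumes Python's xml.sax.saxutils helpers (escape and quoteattr) together with xml.etree.ElementTree; the variable names are illustrative only.

```python
import xml.etree.ElementTree as ET
from xml.sax.saxutils import escape, quoteattr

condition = "if salary < 1000 then"

# escape() replaces &, < and > with their entity references.
escaped = escape(condition)
document = "<message>" + escaped + "</message>"

# The parser turns the entity reference back into the original character.
parsed = ET.fromstring(document)

print(escaped)       # if salary &lt; 1000 then
print(parsed.text)   # if salary < 1000 then

# quoteattr() escapes a value and wraps it in quotes for use as an attribute.
print("<note date=" + quoteattr("12/11/2007") + "/>")
```

The round trip shows that the entity reference exists only in the serialized text: the application sees the plain "<" again once the document is parsed.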
XML Elements are eXtensible
XML elements can be extended to carry more information.
<note>
<to>Tove</to>
<from>Jani</from>
<body>Don't forget me this weekend!</body>
</note>
Suppose the author of the XML document later adds some extra information to it:
<note>
<date>2008-01-10</date>
<to>Tove</to>
<from>Jani</from>
<heading>Reminder</heading>
<body>Don't forget me this weekend!</body>
</note>
Should the application break or crash?
No. The application can still find the <to>, <from> and <body> elements
in the extended document. One of the beauties of XML is that it can be
extended without breaking applications.
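A small sketch of why the extended note does not break a reader, assuming an application that looks elements up by name. The function summarize is a hypothetical stand-in for such an application, written with Python's xml.etree.ElementTree.

```python
import xml.etree.ElementTree as ET

ORIGINAL = """<note>
<to>Tove</to>
<from>Jani</from>
<body>Don't forget me this weekend!</body>
</note>"""

EXTENDED = """<note>
<date>2008-01-10</date>
<to>Tove</to>
<from>Jani</from>
<heading>Reminder</heading>
<body>Don't forget me this weekend!</body>
</note>"""


def summarize(note_xml):
    """Hypothetical application: reads three elements by name, ignores the rest."""
    note = ET.fromstring(note_xml)
    return {name: note.findtext(name) for name in ("to", "from", "body")}


# The added <date> and <heading> elements do not change what the reader extracts.
print(summarize(ORIGINAL) == summarize(EXTENDED))
```

An application that depended on element position (for example "the first child is <to>") would not be this robust, so the benefit holds only when elements are addressed by name.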
Examples: 1- book store
<bookstore>
<book category="CHILDREN">
<title>Harry Potter</title>
<author>J. K. Rowling</author>
<year>2005</year>
<price>29.99</price>
</book>
<book category="WEB">
<title>Learning XML</title>
<author>Erik T. Ray</author>
<year>2003</year>
<price>39.95</price>
</book>
</bookstore>
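As an illustration of how an application might read the bookstore document, here is a minimal sketch with Python's xml.etree.ElementTree. The variable names are assumptions made for the example; the attribute predicate in findall relies on the limited XPath subset that ElementTree supports.

```python
import xml.etree.ElementTree as ET

BOOKSTORE = """<bookstore>
<book category="CHILDREN">
<title>Harry Potter</title>
<author>J. K. Rowling</author>
<year>2005</year>
<price>29.99</price>
</book>
<book category="WEB">
<title>Learning XML</title>
<author>Erik T. Ray</author>
<year>2003</year>
<price>39.95</price>
</book>
</bookstore>"""

root = ET.fromstring(BOOKSTORE)

# Attributes are read with get(); the text of a child element with findtext().
books = [
    (book.get("category"), book.findtext("title"), float(book.findtext("price")))
    for book in root.findall("book")
]

# Select only the books whose category attribute equals "WEB".
web_titles = [b.findtext("title") for b in root.findall("book[@category='WEB']")]

for category, title, price in books:
    print(category, title, price)
print(web_titles)
```

The category is stored as an attribute while title, author, year and price are child elements, which is why the two are read with different calls.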
Why XML for Bioinformatics?
 Biology is a complex discipline.
 Wide variety of data resources and repositories.
 Biological data is represented in multiple formats (FASTA, AGP, GFF, ...).
 No standard protocol:
1- to interrogate biological data stores.
2- for genomics, proteomics and cheminformatics.
3- to exchange biological data.
 Difficulties in using and exchanging data.
XML in Bioinformatics
1- BSML (Visual Genomics).
2- BioML (ProteoMetrics).
3- [...] (chemical information: atomic and crystallographic
information, structures, ...).
4- Gene Ontology Consortium.
The Bioinformatic Sequence Markup Language
(BSML)
(http://www.visualgenomics.com/products/index.html)
- The DTD is aimed at representing DNA, RNA and protein
sequences and their graphic properties.
- The structure of the information is similar to
the one used in the sequence databases:
EMBL (http://www.ebi.ac.uk/embl.html)
NCBI (http://www.ncbi.nlm.nih.gov)
DDBJ (http://www.ddbj.nig.ac.jp)
Gene Ontology Consortium
Controlled description for:
1- Molecular function.
2- Biological processes.
3- Cellular locations of gene products.
The BIOpolymer Markup Language
(BioML)
- Takes a different approach from BSML.
- The goal of BioML (Fenyo, 1999): "BioML was designed to mimic the
hierarchical structure of a living organism."
- Data integration, e.g. of nucleotide and protein sequences.

Xml in bio medical field
