The sad saga of XHTML; or, what happens when markup geeks get arrogant
LIS 652
Dorothea Salo
HTML’s early days
•Tim Berners-Lee: great humanitarian, LOUSY
document analyst/content modeler.
•HTML was supposedly designed for journal articles in physics.
•Based on your experience modeling articles... was it EVER gonna work for that?

•Early HTML was extremely crude markup.
•Crude in “structure.”
•Crude in appearance (as implemented in web browsers).
•Practically nonexistent interactivity. Documents just sat there; you couldn’t DO
anything on the web except read documents and (every once in a while) click
buttons.
Result: tag soup!
•“Tag soup”: markup geeks’ disrespectful
term for lousy markup
•“Tag abuse”: markup geeks’ disrespectful
term for using a tag for a reason other
than its structural appropriateness
•Early HTML pages: lots and LOTS of both!
•“Who put <font> tags in my nice neat structural markup?!”
•“What the heck are you doing with table markup?! Stop that!”
•“OMG VALIDATE YOUR HTML, WILL YOU?” “Validation? What’s that?”
Funny, but...
•Absolutely terrible for accessibility
•Even tag-abusing HTML won’t make
pretty web pages. Or interactive ones.
•Browser incompatibilities/differences
•partly due to browsers needing to consume near-completely incompatible,
bizarre, or just plain WRONG markup
•partly due to some browser implementors (MICROSOFT) trying to take
over the brand-new Web
So the W3C said “Stop.”
•And the W3C said “Use the CSS which we have made
for thee to separate structure from presentation.”
•And there was much (well, some) rejoicing!

•And the W3C said “Make not tag soup, but use the
stricter XHTML syntax, and validate thy documents.”
•And web designers said “NOPE.”
•(Adoption of XHTML on the web was essentially zero, except for those who were starting from XML
to begin with.)

•And after a long time, the W3C said “... okay, FINE.
Here’s HTML5, then. You want to be sloppy with tags?
Be sloppy.”
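
To make the strict-versus-sloppy contrast concrete, here is a minimal sketch using only the Python standard library. The sample markup and the helper names (`parses_as_xml`, `lenient_start_tags`, `TagCollector`) are made up for illustration. Python's `html.parser` is only a forgiving tokenizer, not a full HTML5 parser, so treat this as an approximation of how lenient consumers behave rather than a model of any browser.

```python
import xml.etree.ElementTree as ET
from html.parser import HTMLParser

# Hypothetical sample markup: an unclosed <b> and an unclosed <br>.
TAG_SOUP = "<p>Hello <b>world<br></p>"
# The same content written with XML-style strictness.
WELL_FORMED = "<p>Hello <b>world</b><br/></p>"


class TagCollector(HTMLParser):
    """Records every start tag the lenient tokenizer sees."""

    def __init__(self):
        super().__init__()
        self.start_tags = []

    def handle_starttag(self, tag, attrs):
        self.start_tags.append(tag)


def parses_as_xml(markup):
    """Return True only if a strict XML parser accepts the markup."""
    try:
        ET.fromstring(markup)
        return True
    except ET.ParseError:
        return False


def lenient_start_tags(markup):
    """Tokenize the markup leniently and return the start tags found."""
    collector = TagCollector()
    collector.feed(markup)
    collector.close()
    return collector.start_tags


if __name__ == "__main__":
    print(parses_as_xml(TAG_SOUP))       # strict parser rejects the soup
    print(parses_as_xml(WELL_FORMED))    # strict parser accepts the clean version
    print(lenient_start_tags(TAG_SOUP))  # lenient tokenizer carries on regardless
```

The strict parser fails the whole document over one missing close tag, which is roughly the bargain XHTML offered to people hand-writing web pages.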
Life lessons
•Human beings are very bad at:
•Precision
•Consistency
•Following instructions
•Checking their work

•This means that human beings are VERY
BAD AT MAKING XML.
•We have a whole course on this at SLIS for a reason! It’s pretty hard for
most people to learn on their own!

•Demanding XML from most human
beings is a loser’s game!!!!!!!!!!!!!
Why am I telling you this?
•Because Very Smart People keep making
the Very Stupid Mistake of demanding
XML from human beings.
•W3C: XHTML
•National Science Digital Library: OAI-PMH
•Many, many server-based software packages
•Library supply chain/Impelsys: ONIX from indie/self-publishers (http://www.americanlibrariesmagazine.org/blog/acquisition-dashboardimpelsys-results-experiment)

•This never, ever works out well!
•GO YE AND DO NOT DO LIKEWISE.
Postel’s law: a better way
•Be conservative in what you do, be liberal in
what you accept from others.
•Often reworded as “Be conservative in what you send, be liberal in what you
accept.”

•If you want XML:
•Get the data in a way the other people are comfortable with.
•Plan on having to clean it up. (Automate that as best you can!)
•Turn it into XML yourself.
•Seems like a hassle because it is... but it’s the ONLY THING THAT ALWAYS WORKS.
Trust me.
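
The three steps above can be sketched roughly as follows, assuming contributors are comfortable handing over a spreadsheet exported as CSV. Everything specific here is a hypothetical choice for illustration: the `title` and `author` columns, the `records`/`record` element names, and the cleanup rules. Real cleanup will be messier and specific to your data.

```python
import csv
import io
import xml.etree.ElementTree as ET

# Hypothetical fields we want in the output XML.
FIELDS = ("title", "author")


def clean(value):
    """Trim the value and collapse runs of whitespace to single spaces."""
    return " ".join(value.split())


def rows_to_xml(csv_text):
    """Accept sloppy CSV, clean it up, and emit well-formed XML ourselves."""
    reader = csv.DictReader(io.StringIO(csv_text))
    root = ET.Element("records")
    for row in reader:
        # Be liberal in what we accept: tolerate padded or oddly cased
        # headers, missing cells (None), and surplus cells (key None).
        cleaned = {
            clean(key).lower(): clean(value or "")
            for key, value in row.items()
            if key
        }
        if not any(cleaned.values()):
            continue  # skip rows that contain nothing but separators
        record = ET.SubElement(root, "record")
        for field in FIELDS:
            ET.SubElement(record, field).text = cleaned.get(field, "")
    # Be conservative in what we send: the serializer handles escaping,
    # so characters like & cannot break well-formedness.
    return ET.tostring(root, encoding="unicode")


if __name__ == "__main__":
    messy = " Title ,Author\n  Moby   Dick , Herman Melville \n,\nTom & Jerry,Anon\n"
    print(rows_to_xml(messy))
```

The point of the design is that nobody upstream ever types an angle bracket: people supply data in a form they already know, and the only thing producing XML is a program that cannot forget a close tag.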
Thank you!
This presentation is available under a
Creative Commons Attribution 4.0
United States license.
