SlideShare une entreprise Scribd logo
1  sur  13
Télécharger pour lire hors ligne
AUTOMATIC AND
SEMI-AUTOMATIC 2-TIER
CHECK SYSTEM
FOR EPUB ACCESSIBILITY
2017.03.02 Hyun-Young Kim SookMyungWomen’s University
WHAT IS EPUB
• One of eBook File Format
• De Facto Standard published by the International Digital Publishing Forum (IDPF) Since 2007
• De Jour International Standards Organization as ISO/IECTS 30135 (parts 1-7) in 2014
• EPUB 2.0 in October 2007,
Maintenance update (2.0.1) in September 2010.
EPUB 3.0 in October 2011
Maintenance update (3.0.1) in June 2014
The current version of EPUB 3.1 in January 2017
EPUB & WEB RELATION
• EPUB production needs web technologies
• W3C'sWeb Accessibility Initiative
Web Content Accessibility Guidelines (WCAG) 2.0
Accessible Rich Internet Applications (WAI-ARIA) 1.0
• Also EPUB needs book metaphor and structure information
Semantic Markup Features
Navigation Features
EXISTED ACCESSIBILITY DOCUMENTS
• IDPF EPUB3 Accessibility Guidelines
• Semantics, Navigation, Metadata
• XHTML Content Documents, MathML, SVG, EPUB Style Sheets, Media Overlay
• IDPF EPUB Accessibility 1.0
• Developed as part of EPUB 3.1 to provide guidance on making EPUB publications accessible
• BISG (Book Industry Study Group) Quick Start GuideTo Accessible Publishing
• Essential Check Points from EPUB3 Accessibility Guidelines
• DAISY member, DIAGRAM Image Description Guidelines
• Description guidelines that apply to any type of image.
• Guidelines for describing images within specific types of categories, such as maps.
EPUB PRODUCTION STATUS IN KOREA
• Only Conversion, No Accessibility
• National Library should reproduce DAISY or Accessible EPUB
• The library defined e-book accessibility certification criteria
and designated that as an industry standard in Korea
• Proposed Accessibility Checker is based on e-book accessibility certification criteria
PROPOSED CHECKER
• 156 Check Points from Previous Guidelines
• Some Check Points can be decided automatically
• Language Definition, Existence of LOI and LOT, Existence of LOA and LOV, and etc
• Others can be decided manually
• epub:type attribute is meaningful enough
• whether the page number accurately is the same as the number at paper book, and etc.
• 2-tier Checker
• Automatic Check for 39 Points, PC Standalone version
• Semi-Automatic Check for 117 Points,Web version linked with editor
• Web Checker indicates points where problems may occur
• HTML Editor that opens XHTML and CSS documents after decomposing EPUB
AUTOMATIC CHECKER
SEMI-AUTOMATIC CHECKER
PROPOSED CHECKER VS. EPUBCHECK
• EpubCheck
• Tool to validate EPUB files, developed by IDPF and DAISY
• Detecting many types of errors in EPUB structure such as OCF container structure, OPF and OPS mark-up, internal
reference consistency
• Do not Support Any Accessibility Issues
• Proposed Checker
• Tool to investigate the accessibility of EPUB
• Some modules are same as those of EpubCheck
parsing in the EPUB Package and checking the OCF Related Content
WORKFLOW OF PROPOSED CHECKER
EPUB
XHTML
CSS
SVG
SMIL
Decomposition
Navigation
Inspection
Lang / Audio Clip /Video Clip / Alt Text …
CSS separation / em / strong / Formatting / justified …
SVG lang / description
media-type / list
TOC / LOI / LOV / LOT…
OPF Metadata / lang …
VERIFICATION OF CHECKER
• 50 EPUB files which has deposited into the national library of Korea
• 148 accessibility defects per each file on average
• Accessibility errors focus on 8 points
• The Korean e-book market has EPUB2x 90% and EPUB3x 10%
• Rare percentage of e-books available for Multimedia, MathML, and Media Overlay support
• 8 error points occurs at parts which are irrelevant to EPUB3 specifications
MAJOR DEFECTS
• To define the default language for an XHTML document, the lang and xml:lang language attributes need to be attached to the root
html element. It occupies 41% over all defects.
• In the case of multilingual publications, best practice is to always specify the language in each content document to ensure proper
rendering. It occupies 21% over all defects.
• When using the epub:type attribute in a content document, the epub namespace must be declared on the element containing the
attribute, or on one of its ancestors. It occupies 13% over all defects.
• Images that are central to the understanding of a publication must always include a text alternative in their alt attribute. It occupies
7% over all defects.
• When creating hyperlinks, the text inside of the link can provide the full context of what is being linked to or the link can have
alternate text. It occupies 7% over all defects.
• Separating style from markup is consequently not just about keeping CSS in a separate file from your markup, but recognizing that
markup must convey meaning to be useful to all readers. It occupies 7% over all defects.
• When using bolding and italics, EPUB follow the rules of HTML5 and CSS standard. It occupies 2% over all defects.
• Avoid justifying text, as the uneven spacing that occurs between words can reduce the readability for some people. It occupies 1%
over all defects.
FUTURE WORKS
• 1st tier automatic system could pick up problematic items which defined as 39 check points
• It is responsible for 25% of all 156 check points
• 2nd tier semi-automatic system handle 75% of check points
• It should be changed into automatic detection through Machine Learning algorithm

Contenu connexe

Tendances

Workflow & Benefits of OCLC Knowledge Base
Workflow & Benefits of OCLC Knowledge BaseWorkflow & Benefits of OCLC Knowledge Base
Workflow & Benefits of OCLC Knowledge Base
alisonjohnson53
 
Hands Free Articles: Implementing and Maximizing OCLC Knowledge Base in ILLiad
Hands Free Articles: Implementing and Maximizing OCLC Knowledge Base in ILLiadHands Free Articles: Implementing and Maximizing OCLC Knowledge Base in ILLiad
Hands Free Articles: Implementing and Maximizing OCLC Knowledge Base in ILLiad
alisonjohnson53
 
Making Articles Easier: Implementing OCLC Knowledge Base for Direct Requestin...
Making Articles Easier: Implementing OCLC Knowledge Base for Direct Requestin...Making Articles Easier: Implementing OCLC Knowledge Base for Direct Requestin...
Making Articles Easier: Implementing OCLC Knowledge Base for Direct Requestin...
alisonjohnson53
 

Tendances (14)

eNotate Preso 3
eNotate Preso 3eNotate Preso 3
eNotate Preso 3
 
Workflow & Benefits of OCLC Knowledge Base
Workflow & Benefits of OCLC Knowledge BaseWorkflow & Benefits of OCLC Knowledge Base
Workflow & Benefits of OCLC Knowledge Base
 
Spring Batch Introduction (and Bitbucket Project)
Spring Batch Introduction (and Bitbucket Project)Spring Batch Introduction (and Bitbucket Project)
Spring Batch Introduction (and Bitbucket Project)
 
Its2 ontology-localization
Its2 ontology-localizationIts2 ontology-localization
Its2 ontology-localization
 
Web Development Today
Web Development TodayWeb Development Today
Web Development Today
 
PHPMaker - The Best PHP Code Generator Ever !
PHPMaker - The Best PHP Code Generator Ever !PHPMaker - The Best PHP Code Generator Ever !
PHPMaker - The Best PHP Code Generator Ever !
 
Multilingual WordPress With Polylang
Multilingual WordPress With PolylangMultilingual WordPress With Polylang
Multilingual WordPress With Polylang
 
Hands Free Articles: Implementing and Maximizing OCLC Knowledge Base in ILLiad
Hands Free Articles: Implementing and Maximizing OCLC Knowledge Base in ILLiadHands Free Articles: Implementing and Maximizing OCLC Knowledge Base in ILLiad
Hands Free Articles: Implementing and Maximizing OCLC Knowledge Base in ILLiad
 
WebServices
WebServicesWebServices
WebServices
 
Making Articles Easier: Implementing OCLC Knowledge Base for Direct Requestin...
Making Articles Easier: Implementing OCLC Knowledge Base for Direct Requestin...Making Articles Easier: Implementing OCLC Knowledge Base for Direct Requestin...
Making Articles Easier: Implementing OCLC Knowledge Base for Direct Requestin...
 
Oracle Forms : Reusable Components
Oracle Forms : Reusable ComponentsOracle Forms : Reusable Components
Oracle Forms : Reusable Components
 
Web forms and server side scripting
Web forms and server side scriptingWeb forms and server side scripting
Web forms and server side scripting
 
Learn VB.NET at ASIT
Learn VB.NET at ASITLearn VB.NET at ASIT
Learn VB.NET at ASIT
 
Jsp abes new
Jsp abes newJsp abes new
Jsp abes new
 

En vedette

Cognitive theory of multimedia learning, krista greear, csun 2017
Cognitive theory of multimedia learning, krista greear, csun 2017Cognitive theory of multimedia learning, krista greear, csun 2017
Cognitive theory of multimedia learning, krista greear, csun 2017
Krista Greear
 

En vedette (20)

2017 CSUN Color Contrast
2017 CSUN Color Contrast2017 CSUN Color Contrast
2017 CSUN Color Contrast
 
Cognitive theory of multimedia learning, krista greear, csun 2017
Cognitive theory of multimedia learning, krista greear, csun 2017Cognitive theory of multimedia learning, krista greear, csun 2017
Cognitive theory of multimedia learning, krista greear, csun 2017
 
Accessibility and Design: Where Productivity and Philosophy Meet
Accessibility and Design:  Where Productivity and Philosophy MeetAccessibility and Design:  Where Productivity and Philosophy Meet
Accessibility and Design: Where Productivity and Philosophy Meet
 
Rethinking Accessibility: Role-Based Analysis of WCAG 2.0 - CSUN 2017
Rethinking Accessibility: Role-Based Analysis of WCAG 2.0 - CSUN 2017Rethinking Accessibility: Role-Based Analysis of WCAG 2.0 - CSUN 2017
Rethinking Accessibility: Role-Based Analysis of WCAG 2.0 - CSUN 2017
 
2017 CSUN The Art of Language in Accessibility
2017 CSUN The Art of Language in Accessibility2017 CSUN The Art of Language in Accessibility
2017 CSUN The Art of Language in Accessibility
 
CSUN 2017 Success Criteria: Dependencies and Prioritization
CSUN 2017 Success Criteria: Dependencies and PrioritizationCSUN 2017 Success Criteria: Dependencies and Prioritization
CSUN 2017 Success Criteria: Dependencies and Prioritization
 
Mystery Meat 2.0 – Making hidden mobile interactions accessible
Mystery Meat 2.0 – Making hidden mobile interactions accessibleMystery Meat 2.0 – Making hidden mobile interactions accessible
Mystery Meat 2.0 – Making hidden mobile interactions accessible
 
Reusable acceptance criteria and test cases for accessibility
Reusable acceptance criteria and test cases for accessibilityReusable acceptance criteria and test cases for accessibility
Reusable acceptance criteria and test cases for accessibility
 
A Multidisciplinary Approach to Universal Design
A Multidisciplinary Approach to Universal DesignA Multidisciplinary Approach to Universal Design
A Multidisciplinary Approach to Universal Design
 
Mind your lang (for role=drinks at CSUN 2017)
Mind your lang (for role=drinks at CSUN 2017)Mind your lang (for role=drinks at CSUN 2017)
Mind your lang (for role=drinks at CSUN 2017)
 
Preparing the enterprise for 508 refresh, ibm csun2017
Preparing the enterprise for 508 refresh, ibm csun2017Preparing the enterprise for 508 refresh, ibm csun2017
Preparing the enterprise for 508 refresh, ibm csun2017
 
Accessibility metrics Accessibility Data Metrics and Reporting – Industry Bes...
Accessibility metrics Accessibility Data Metrics and Reporting – Industry Bes...Accessibility metrics Accessibility Data Metrics and Reporting – Industry Bes...
Accessibility metrics Accessibility Data Metrics and Reporting – Industry Bes...
 
CSUN 2017 VPATs For Business or Measure
CSUN 2017 VPATs For Business or MeasureCSUN 2017 VPATs For Business or Measure
CSUN 2017 VPATs For Business or Measure
 
Resources for transition 2017 csun presntation
Resources for transition 2017 csun presntationResources for transition 2017 csun presntation
Resources for transition 2017 csun presntation
 
CSUN2017
CSUN2017CSUN2017
CSUN2017
 
CSUN 2017: USAA - Voice Guided Deposits
CSUN 2017: USAA - Voice Guided DepositsCSUN 2017: USAA - Voice Guided Deposits
CSUN 2017: USAA - Voice Guided Deposits
 
The 7 minute accessibility assessment and app rating system
The 7 minute accessibility assessment and app rating systemThe 7 minute accessibility assessment and app rating system
The 7 minute accessibility assessment and app rating system
 
Assistive Wearables: Emerging Trends and Design Considerations
Assistive Wearables: Emerging Trends and Design ConsiderationsAssistive Wearables: Emerging Trends and Design Considerations
Assistive Wearables: Emerging Trends and Design Considerations
 
Accessibility microinteractions: better user experience, happier developers
Accessibility microinteractions: better user experience, happier developersAccessibility microinteractions: better user experience, happier developers
Accessibility microinteractions: better user experience, happier developers
 
Digital Accessibility Legal Update - CSUNATC 2017 (CSUN)
Digital Accessibility Legal Update - CSUNATC 2017 (CSUN)Digital Accessibility Legal Update - CSUNATC 2017 (CSUN)
Digital Accessibility Legal Update - CSUNATC 2017 (CSUN)
 

Similaire à Csun presentation-170302-hykim

Bill Kasdorf - Apex Content Solutions - Agile processes, agile publications (...
Bill Kasdorf - Apex Content Solutions - Agile processes, agile publications (...Bill Kasdorf - Apex Content Solutions - Agile processes, agile publications (...
Bill Kasdorf - Apex Content Solutions - Agile processes, agile publications (...
Erich van Rijn
 
CSS Adnaved with HTML abd complete Stylesheet
CSS Adnaved with HTML abd complete StylesheetCSS Adnaved with HTML abd complete Stylesheet
CSS Adnaved with HTML abd complete Stylesheet
PraveenHegde20
 
Hakkarinen.OR2016.Enhancing_Citation-only_Repositories.Final
Hakkarinen.OR2016.Enhancing_Citation-only_Repositories.FinalHakkarinen.OR2016.Enhancing_Citation-only_Repositories.Final
Hakkarinen.OR2016.Enhancing_Citation-only_Repositories.Final
Mark Hakkarinen
 

Similaire à Csun presentation-170302-hykim (20)

Epub IDPF update & futures-Garth Conboy
Epub IDPF update & futures-Garth ConboyEpub IDPF update & futures-Garth Conboy
Epub IDPF update & futures-Garth Conboy
 
Interactive E-Books
Interactive E-BooksInteractive E-Books
Interactive E-Books
 
Day3 edupub tokyo_idpf
Day3 edupub tokyo_idpfDay3 edupub tokyo_idpf
Day3 edupub tokyo_idpf
 
Down and Dirty EPUB 3
Down and Dirty EPUB 3Down and Dirty EPUB 3
Down and Dirty EPUB 3
 
UNC Chapel Hill 2014 CTC Retreat - Creating epub e books
UNC Chapel Hill 2014 CTC Retreat - Creating epub e booksUNC Chapel Hill 2014 CTC Retreat - Creating epub e books
UNC Chapel Hill 2014 CTC Retreat - Creating epub e books
 
EPUB for Website Producers
EPUB for Website ProducersEPUB for Website Producers
EPUB for Website Producers
 
Getting Started With ePubNow! XML Workflow for Publishers
Getting Started With ePubNow! XML Workflow for PublishersGetting Started With ePubNow! XML Workflow for Publishers
Getting Started With ePubNow! XML Workflow for Publishers
 
E-publishing
E-publishingE-publishing
E-publishing
 
The Evolution of EPUB
The Evolution of EPUBThe Evolution of EPUB
The Evolution of EPUB
 
HTML Start Up - Introduction to HTML
HTML Start Up - Introduction to HTMLHTML Start Up - Introduction to HTML
HTML Start Up - Introduction to HTML
 
Bill Kasdorf - Apex Content Solutions - Agile processes, agile publications (...
Bill Kasdorf - Apex Content Solutions - Agile processes, agile publications (...Bill Kasdorf - Apex Content Solutions - Agile processes, agile publications (...
Bill Kasdorf - Apex Content Solutions - Agile processes, agile publications (...
 
The year of testing dangerously: The EPUB 3.3 story - Tech Forum 2022
The year of testing dangerously: The EPUB 3.3 story - Tech Forum 2022The year of testing dangerously: The EPUB 3.3 story - Tech Forum 2022
The year of testing dangerously: The EPUB 3.3 story - Tech Forum 2022
 
Kerscher "Accessibility in a Nutshell: What Every Publisher, Educator, and Li...
Kerscher "Accessibility in a Nutshell: What Every Publisher, Educator, and Li...Kerscher "Accessibility in a Nutshell: What Every Publisher, Educator, and Li...
Kerscher "Accessibility in a Nutshell: What Every Publisher, Educator, and Li...
 
Kerscher, Gunderson, and Wise "Unprecedented Access: Improving the User Expe...
Kerscher, Gunderson, and Wise "Unprecedented Access:  Improving the User Expe...Kerscher, Gunderson, and Wise "Unprecedented Access:  Improving the User Expe...
Kerscher, Gunderson, and Wise "Unprecedented Access: Improving the User Expe...
 
B01 markus gylling-epub
B01 markus gylling-epubB01 markus gylling-epub
B01 markus gylling-epub
 
B01 markus gylling-epub
B01 markus gylling-epubB01 markus gylling-epub
B01 markus gylling-epub
 
Introduction to web page
Introduction to web pageIntroduction to web page
Introduction to web page
 
CSS Adnaved with HTML abd complete Stylesheet
CSS Adnaved with HTML abd complete StylesheetCSS Adnaved with HTML abd complete Stylesheet
CSS Adnaved with HTML abd complete Stylesheet
 
Ushering in the Born Accessible Era
Ushering in the Born Accessible EraUshering in the Born Accessible Era
Ushering in the Born Accessible Era
 
Hakkarinen.OR2016.Enhancing_Citation-only_Repositories.Final
Hakkarinen.OR2016.Enhancing_Citation-only_Repositories.FinalHakkarinen.OR2016.Enhancing_Citation-only_Repositories.Final
Hakkarinen.OR2016.Enhancing_Citation-only_Repositories.Final
 

Dernier

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

Dernier (20)

Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 

Csun presentation-170302-hykim

  • 1. AUTOMATIC AND SEMI-AUTOMATIC 2-TIER CHECK SYSTEM FOR EPUB ACCESSIBILITY 2017.03.02 Hyun-Young Kim SookMyungWomen’s University
  • 2. WHAT IS EPUB • One of eBook File Format • De Facto Standard published by the International Digital Publishing Forum (IDPF) Since 2007 • De Jour International Standards Organization as ISO/IECTS 30135 (parts 1-7) in 2014 • EPUB 2.0 in October 2007, Maintenance update (2.0.1) in September 2010. EPUB 3.0 in October 2011 Maintenance update (3.0.1) in June 2014 The current version of EPUB 3.1 in January 2017
  • 3. EPUB & WEB RELATION • EPUB production needs web technologies • W3C'sWeb Accessibility Initiative Web Content Accessibility Guidelines (WCAG) 2.0 Accessible Rich Internet Applications (WAI-ARIA) 1.0 • Also EPUB needs book metaphor and structure information Semantic Markup Features Navigation Features
  • 4. EXISTED ACCESSIBILITY DOCUMENTS • IDPF EPUB3 Accessibility Guidelines • Semantics, Navigation, Metadata • XHTML Content Documents, MathML, SVG, EPUB Style Sheets, Media Overlay • IDPF EPUB Accessibility 1.0 • Developed as part of EPUB 3.1 to provide guidance on making EPUB publications accessible • BISG (Book Industry Study Group) Quick Start GuideTo Accessible Publishing • Essential Check Points from EPUB3 Accessibility Guidelines • DAISY member, DIAGRAM Image Description Guidelines • Description guidelines that apply to any type of image. • Guidelines for describing images within specific types of categories, such as maps.
  • 5. EPUB PRODUCTION STATUS IN KOREA • Only Conversion, No Accessibility • National Library should reproduce DAISY or Accessible EPUB • The library defined e-book accessibility certification criteria and designated that as an industry standard in Korea • Proposed Accessibility Checker is based on e-book accessibility certification criteria
  • 6. PROPOSED CHECKER • 156 Check Points from Previous Guidelines • Some Check Points can be decided automatically • Language Definition, Existence of LOI and LOT, Existence of LOA and LOV, and etc • Others can be decided manually • epub:type attribute is meaningful enough • whether the page number accurately is the same as the number at paper book, and etc. • 2-tier Checker • Automatic Check for 39 Points, PC Standalone version • Semi-Automatic Check for 117 Points,Web version linked with editor • Web Checker indicates points where problems may occur • HTML Editor that opens XHTML and CSS documents after decomposing EPUB
  • 9. PROPOSED CHECKER VS. EPUBCHECK • EpubCheck • Tool to validate EPUB files, developed by IDPF and DAISY • Detecting many types of errors in EPUB structure such as OCF container structure, OPF and OPS mark-up, internal reference consistency • Do not Support Any Accessibility Issues • Proposed Checker • Tool to investigate the accessibility of EPUB • Some modules are same as those of EpubCheck parsing in the EPUB Package and checking the OCF Related Content
  • 10. WORKFLOW OF PROPOSED CHECKER EPUB XHTML CSS SVG SMIL Decomposition Navigation Inspection Lang / Audio Clip /Video Clip / Alt Text … CSS separation / em / strong / Formatting / justified … SVG lang / description media-type / list TOC / LOI / LOV / LOT… OPF Metadata / lang …
  • 11. VERIFICATION OF CHECKER • 50 EPUB files which has deposited into the national library of Korea • 148 accessibility defects per each file on average • Accessibility errors focus on 8 points • The Korean e-book market has EPUB2x 90% and EPUB3x 10% • Rare percentage of e-books available for Multimedia, MathML, and Media Overlay support • 8 error points occurs at parts which are irrelevant to EPUB3 specifications
  • 12. MAJOR DEFECTS • To define the default language for an XHTML document, the lang and xml:lang language attributes need to be attached to the root html element. It occupies 41% over all defects. • In the case of multilingual publications, best practice is to always specify the language in each content document to ensure proper rendering. It occupies 21% over all defects. • When using the epub:type attribute in a content document, the epub namespace must be declared on the element containing the attribute, or on one of its ancestors. It occupies 13% over all defects. • Images that are central to the understanding of a publication must always include a text alternative in their alt attribute. It occupies 7% over all defects. • When creating hyperlinks, the text inside of the link can provide the full context of what is being linked to or the link can have alternate text. It occupies 7% over all defects. • Separating style from markup is consequently not just about keeping CSS in a separate file from your markup, but recognizing that markup must convey meaning to be useful to all readers. It occupies 7% over all defects. • When using bolding and italics, EPUB follow the rules of HTML5 and CSS standard. It occupies 2% over all defects. • Avoid justifying text, as the uneven spacing that occurs between words can reduce the readability for some people. It occupies 1% over all defects.
  • 13. FUTURE WORKS • 1st tier automatic system could pick up problematic items which defined as 39 check points • It is responsible for 25% of all 156 check points • 2nd tier semi-automatic system handle 75% of check points • It should be changed into automatic detection through Machine Learning algorithm