1. SSP Seminar: Implementing a Full-XML
Workflow: Why Isn’t Everyone Doing It?
A Perspective from AIP
Lisa McLaughlin
Director of Informatics and Production
June 2, 2010
Implementing a Full-XML Workflow; Lisa McLaughlin
June 2010
2. Agenda
AIP: An Introduction
History of Electronic Publishing at AIP
AIP Content Flow Diagram
XML Current Uses; Scitation C3
What the Future Looks like: Beyond the
“SuperArticle”
2
Implementing a Full-XML Workflow; Lisa McLaughlin
June 2010
3. AIP: An Introduction
Definition
• A global scientific information provider
• Represents 135,000 scientists, engineers, and educators
• Dedicated to the advancement of physics
Founded by original five member societies for the “advancement and diffusion of
knowledge of the science of physics and its application to human welfare.”
As a "society of societies," AIP supports ten member societies and provides a
spectrum of services and programs devoted to advancing the science and profession
of physics. A pioneer in digital publishing, AIP is also one of the world’s largest
publishers of physics journals and produces the publications of more than 25
scientific and engineering societies.
Through the years, AIP journals have become world renowned
for their conveyance of cutting-edge results.
3
Implementing a Full-XML Workflow; Lisa McLaughlin
June 2010
4. AIP: An Introduction
• Scitation C3
http://scitation.aip.org/
– Online publishing platform
– C3 is content, context, connections
– C3 includes many new features that utilize AIP’s XML
• AIP UniPHY
http://www.aipuniphy.org/Portal/Portal.aspx
– World's first literature-based professional social networking site for physical scientists
– Includes publications in the SPIN (Searchable Physics Information Notices) database
– Provides intuitive interfaces for researchers to network and share data
Goal: to advance physics research worldwide
4
Implementing a Full-XML Workflow; Lisa McLaughlin
June 2010
5. AIP: An Introduction
• Mobile Delivery – iResearch
http://scitation.aip.org/iphone
– Offline e-reader
– Enables users to save PDF files locally to their device and view them offline
– Works without a Wifi or cellular connection
– Users navigate through the journals to select an article
• Article Rental Pilot Program
– Working with DeepDyve
– Testing feasibility of article rental - single articles for a 24-hour period for just $0.99.
– Allows users to read, but not download or print, the full text of an article - the service gives corporations and
institutions with more specialized collections affordable access to seminal AIP journal content.
– AIP will be offering their new article rental service on an experimental basis until the end of September,
2010.
“Netflix for academic journals”
5
Implementing a Full-XML Workflow; Lisa McLaughlin
June 2010
6. AIP: An Introduction
AIP currently publish (for AIP journals and for our societies):
• 111 journals
• 285,000 pages
• 41,400 articles
• “Special publications” e.g. conference proceedings and Physics
Today, a member’s magazine
6
Implementing a Full-XML Workflow; Lisa McLaughlin
June 2010
7. History of Electronic Publishing at AIP
7
Implementing a Full-XML Workflow; Lisa McLaughlin
June 2010
8. History of Electronic Publishing at AIP
8
Implementing a Full-XML Workflow; Lisa McLaughlin
June 2010
9. History of Electronic Publishing at AIP
Timeline Highlights:
• 1995: Release of Applied Physics Letters Online, starting with 1994 content in full-text
SGML
• 1996: Internal linking of content within APL Online
• 1997: Launched OJS (Online Journal Service)
• Makes transition from the SGML to HTML and content linking within OJS journals and to IOP
content
• 1999: AIP journal abstracts available to everyone online
• 2003: OJS relauched as Scitation
• 2005: XML becomes part of the production workflow via XML-in methodology
9
Implementing a Full-XML Workflow; Lisa McLaughlin
June 2010
10. History of Electronic Publishing at AIP
• Approx 1995: AIP normalized all input (tex and word) upfront to xyascii
(proprietary Xyvision typesetting codes) for production and did a back end
conversion to SGML in-house when the article was complete.
• 2005: AIP transitioned to an end-to-end XML process that facilitates re-
use and re-purposing, archival preservation, and automation of many
content validation processes. Until recently, though, AIP have managed all
pre-press production in-house.
• 2009: transitioned copyediting and XML conversion offshore to a supplier
in the Philippines.
• On-going project to transition all production pre-press work offshore to
new partners.
10
Implementing a Full-XML Workflow; Lisa McLaughlin
June 2010
11. AIP Content Flow Diagram
External non Journal Production
External non Journal Production
feeds (full text SML and PDF, abstracts
feeds (full text SML and PDF, abstracts
and PDF, abstracts only)
and PDF, abstracts only)
VQC
VQC
PXP
PXP
DBO (AGU, IOP, Nature…)
OAI (APS) IOP, Nature…)
DBO (AGU,
OAI (APS)
Accept Manuscript postings
Accept Manuscript postings
ASCE
ASCE
ASME
ASME
Publication Archive MarkLogic Polopy Content OJA, eRights,
Full electronic feeds from societies Publication Archive MarkLogic
Full electronic feeds from societies Library File System Archive”XML Phoenix 14 Management MACS, &
and peer-review systems Library File System
and peer-review systems Journal XML database XML database System RRE databases
Journal /ft xml
Production
ScholarOne
ScholarOne Production /spinxml
/ft sgml
/pdf sgml
Editorial Manager /spin
Editorial Manager
ASME /pdf Media server
ASME /highres images Media server
APS /highres images File System
APS
OSA MarkLogic MarkLogic
File System
OSA
PXP
PXP Archive”XML Phoenix 14 /figures
database database /figures
/images
/images
/mm
Other submissions: /mm
Other submissions:
Electronic files only (no metadata) /free (covers)
Mets SPIN /free (covers)
Hardcopy files only (no metadata)
Electronic
database
Hardcopy database
Data Dissemination
Data Dissemination
SPIN Customers
Print
Print EBSCO Customers
SPIN
Vendors EBSCO UniPHY
Vendors ISI UniPHY
ISI
Portico… DeepDyve
Portico… DeepDyve
APS journal delivery
APS journal delivery 11
Implementing a Full-XML Workflow; Lisa McLaughlin
June 2010
12. XML Current Uses
Scitation C3. Functionality of Scitation-hosted journal pages greatly enhanced and
enabled by an XML infrastructure that can turn granular markup into
meaningful content services, reduce discovery time, and dynamically connect
similar concepts and expose related content without user interaction.
• Realize the full potential of XML.
• Meet user demands with more efficient and actionable discovery.
• Create new modes for exploring content.
• Provide smart interfaces for mobile devices.
• Overall, give publishers more control over their content!
12
Implementing a Full-XML Workflow; Lisa McLaughlin
June 2010
13. XML Current Uses
New C3 features include:
• Full-text HTML rendering, directly from XML.
• Inline reference links and the ability to enlarge tables and
figures by clicking on them.
• AIP-coined "SuperArticle" virtually lifts off the page in a
highly interactive presentation that transforms the
traditional scholarly article, helping to excelerate
authoring, reading, and research.
“AIP was recognized at the MarkLogic User Conference for
demonstrating outstanding leadership in their use of MarkLogic Server
to build the next-generation of information applications. AIP's
collaboration with MarkLogic was instrumental in launching both the
groundbreaking C3 platform and iResearch, a cutting-edge mobile e-
Reader application.” 13
Implementing a Full-XML Workflow; Lisa McLaughlin
June 2010
14. XML Current Uses
New C3 features include:
• Enhanced search functions, with more options
and better controls to explore returned content
with faceted results.
• Faceted search helps researchers find information
quickly by presenting them with a set of "filters" to
narrow down search results.
• Real-time institutional mapping interface that
reveals institutional connections to research,
providing geo-locational context to where the
author who performed the research is based.
14
Implementing a Full-XML Workflow; Lisa McLaughlin
June 2010
15. XML Current Uses
New C3 features include:
• Upgraded Tables of Contents service - bringing
more information to the forefront and creating
functionality that allows users to work with
journal data in new ways.
• "Smart ToCs" enable the user to harvest citations,
preview abstracts with a mouse-click, and hide content
that isn't of interest. In this same vein of facilitating
discoverability, a new table browser allows subscribers
to view all tables and figures in an article directly from
the abstract.
15
Implementing a Full-XML Workflow; Lisa McLaughlin
June 2010
16. What the Future Looks Like
Beyond the “SuperArticle”:
• Continued development of the SuperArticle: AIP are
becoming a more knowledgecentric publisher by
continuing to move beyond the PDF by adding more
and more value to our XML.
• Chemical Abstracts; visualization tools
• Sectioned Abstracts, exposing components of a journal
article
• Create a semantic world for Physics and Physics
Publishing.
16
Implementing a Full-XML Workflow; Lisa McLaughlin
June 2010
Agenda AIP: An Introduction History of Electronic Publishing at AIP XML Current Uses What the Future Looks like: Informatics; the “Article of the Future”