SlideShare une entreprise Scribd logo
1  sur  47
Why is Scholarly Communication
Broken and What Can Be Done?
In Celebration of Open Access Week
Philip E. Bourne
University of California San Diego
pbourne@ucsd.edu
UCSD LibrariesOct. 18, 2010
Disclaimer
• I am a domain (life) scientist not a computer or
information scientist
• I am fortunate enough to have a major biological
resource (the Protein Data Bank) and a major biological
journal (PLoS Computational Biology) as my playground
• I am part of the long tail
• I am naïve, but I am the majority
Oct. 18, 2010 UCSD Libraries
Agenda
• Motivation
• What needs to be done?
• A few examples
• The role of the institution
Oct. 18, 2010 UCSD Libraries
The Scientific Process is Too Slow to
Respond to a Crisis – Either Global or
Personal
Oct. 18, 2010 UCSD Libraries
Motivation
http://knol.google.com/k/plos-currents-influenza#
By the time the paper is published
we could all be dead
* http://www.cdc.gov/h1n1flu/estimates/April_March_13.htm
Jan. 2008 Jan. 2009 Jan. 2010Jul. 2009Jul. 2008 Jul. 2010
1RUZ: 1918 H1 Hemagglutinin
Structure Summary page activity for
H1N1 Influenza related structures
3B7E: Neuraminidase of A/Brevig Mission/1/1918
H1N1 strain in complex with zanamivir
In a time of crisis the need for fast access
to accurate data and any knowledge of
that data are paramount
Motivation
Oct. 18, 2010 UCSD Libraries
If that is not enough…
For some people the scientific
process may be too slow to save
their life
Oct. 18, 2010 UCSD Libraries
Motivation
Josh Sommer – A Remarkable Young Man
Co-founder & Executive Director the Chordoma Foundation
Oct. 18, 2010 UCSD Libraries
http://sagecongress.org/Presentations/Sommer.pdf
Motivation
Chordoma
• A rare form of brain
cancer
• No known drugs
• Treatment – surgical
resection followed by
intense radiation
therapy
Oct. 18, 2010 UCSD Libraries
Motivation
http://upload.wikimedia.org/wikipedia/commons/2/2b/Chordoma.JPG
Oct. 18, 2010 UCSD Libraries
http://sagecongress.org/Presentations/Sommer.pdf
Motivation
Oct. 18, 2010 UCSD Libraries
http://sagecongress.org/Presentations/Sommer.pdf
Motivation
Oct. 18, 2010 UCSD Libraries
http://sagecongress.org/Presentations/Sommer.pdf
Motivation
Oct. 18, 2010 UCSD Libraries
Adapted: http://sagecongress.org/Presentations/Sommer.pdf
Motivation
Isaac
If I have seen further it is only by
standing on the shoulders of giants
Isaac Newton
From Josh’s point of view the climb
up just takes too long
> 15 years and > $850M to be
more precise
Oct. 18, 2010 UCSD Libraries
http://sagecongress.org/Presentations/Sommer.pdf
Motivation
Oct. 18, 2010 UCSD Libraries
Motivation
http://sagecongress.org/Presentations/Sommer.pdf
Oct. 18, 2010 UCSD Libraries
http://fora.tv/2010/04/23/Sage_Commons_Josh_Sommer_Chordoma_Foundation
Motivation
Now we are all hopefully motivated
let us break this down to what
actually needs to be done in my
opinion
Here are a few big things …
Oct. 18, 2010 UCSD Libraries
What Needs to be Done?
A Few Things to Accelerate the Rate of
Scientific Discovery
• Better communication, data and knowledge access,
and new modes of discovery, which means:
– We need data and knowledge about that data to
interoperate i.e. we need new kinds of fast, versatile
publications and data archives
– We need to be more open with both
– We need to think more about the tools that analyze,
visualize and annotate data to maximize knowledge
discovery
– Reward systems need to change
– We need scientist management tools
– We need to be less fixated on the big data problems
– We need to unleash the full power of the Internet
Oct. 18, 2010 UCSD Libraries Easy Hard
1. A link brings up figures
from the paper
0. Full text of PLoS papers stored
in a database
2. Clicking the paper figure retrieves
data from the PDB which is
analyzed
3. A composite view of
journal and database
content results
We Need Data and
Knowledge About That
Data to Interoperate
1. User clicks on content
2. Metadata and
webservices to data
provide an interactive
view that can be
annotated
3. Selecting features
provides a
data/knowledge
mashup
4. Analysis leads to new
content I can share
4. The composite view has
links to pertinent blocks
of literature text and back to the PDB
1.
2.
3.
4.
The Knowledge and Data Cycle
PLoS Comp. Biol. 2005 1(3) e34
We Need Data and Knowledge About That
Data to Interoperate – What is Stopping US?
• Governance – publishers vs. database
providers
• Reward
• Metadata standards for provenance, privacy
etc.
• Exemplars
• ….
Oct. 18, 2010 UCSD Libraries
Caveat: Each discipline is different – I speak very much from a biomedical
sciences perspective
Certainly the Argument for Interoperability
in the Biomedical Sciences is Strong
• PubMed contains
18,792,257 entries
• ~100,000 papers indexed
per month
• In Feb 2009:
– 67,406,898 interactive
searches were done
– 92,216,786 entries were
viewed
• 1078 databases
reported in NAR 2008
• MetaBase
http://biodatabase.org
reports 2,651 entries
edited 12,587 times
Data as of April 14, 2009
PLoS Comp. Biol. 2005 1(3) e34
What Needs to be Done?
www.rcsb.org/pdb/explore/literature.do?structureId=1TIM
Example Interoperability: The Database View
BMC Bioinformatics 2010 11:220
Oct. 18, 2010 UCSD Libraries
What Needs to be Done?
Example Interoperability: The Literature View
http://biolit.ucsd.edu
Nucleic Acids Research 2008 36(S2) W385-389
Oct. 18, 2010 UCSD Libraries
What Needs to be Done?
ICTP Trieste, December 10, 2007
Oct. 18, 2010 UCSD Libraries
Semantic Tagging & Widgets are a
Powerful Tool to Integrate Data and
Knowledge of that Data, But as Yet
Not Used Much
Oct. 18, 2010 UCSD Libraries
Will Widgets and Semantic Tagging Change Computational Biology?
PLoS Comp. Biol. 6(2) e1000673
What Needs to be Done?
Semantic Tagging of Database Content
in The Literature or Elsewhere
http://www.rcsb.org/pdb/static.do?p=widgets/widgetShowcase.jsp
PLoS Comp. Biol. 6(2) e1000673Semantic Tagging
Oct. 18, 2010 UCSD Libraries
What Needs to be Done?
The Publishers are Starting to Do It
Oct. 18, 2010 UCSD Libraries
From Anita de Waard, Elsevier
What Needs to be Done?
This is Literature Post-processing
Better to Get the Authors Involved
• Authors are the absolute experts on the
content
• More effective distribution of labor
• Add metadata before the article enters the
publishing process
Oct. 18, 2010 UCSD Libraries
What Needs to be Done?
Word 2007 Add-in for authors
• Allows authors to add metadata as they write, before they
submit the manuscript
• Authors are assisted by automated term recognition
– OBO ontologies
– Database IDs
• Metadata are embedded directly into the manuscript
document via XML tags, OOXML format
– Open
– Machine-readable
• Open source, Microsoft Public License
http://www.codeplex.com/ucsdbiolit
Oct. 18, 2010 UCSD Libraries
What Needs to be Done?
Challenges
• Authors
– Carrot IF one or more publishers fast tracked a
paper that had semantic markup it might catch on
• Publishers
– Carrot Competitive advantage
Oct. 18, 2010 UCSD Libraries
What Needs to be Done?
A Few Things to Accelerate the Rate of
Scientific Discovery
• Better communication, data and knowledge access,
and new modes of discovery, which means:
– We need data and knowledge about that data to
interoperate i.e. we need new kinds of fast, versatile
publications and data archives
– We need to be more open with both
– We need to think more about the tools that analyze,
visualize and annotate data to maximize knowledge
discovery
– Reward systems need to change
– We need scientist management tools
– We need to be less fixated on the big data problems
– We need to unleash the full power of the Internet
Oct. 18, 2010 UCSD Libraries Easy Hard
Reward Systems Need to Change
What is Needed?
• Author disambiguation
• Auditing (identification and metrics) of all
scholarship - means new tools
• Seniors need to promote alternative forms of
scholarship
• Juniors need to respond
Oct. 18, 2010 UCSD Libraries
Reward Systems Need to Change
Ten Simple Rules for Getting Promoted as a Computational Biologist in Academia
PLoS Comp Biol to appear
Example Tools
Oct. 18, 2010 UCSD Libraries
http://pubnet.gersteinlab.org/
http://www.researcherid.com/
http://www.biomedexperts.com
What Are these Alternative Forms of
Scholarship?
Research
[Grants]
Journal
Article
Conference
Paper
Poster
Session
Reviews
Blogs
Community Service/Data
Curation
Reward Systems Need to Change
Oct. 18, 2010 UCSD Libraries
Ideally the ID will be Tagged to Every
Piece of Scholarly Communication
I an Not a Scientist I am a Number
PLoS Comp. Biol. 2008 4(12) e1000247
Reward Systems Need to Change
Oct. 18, 2010 UCSD Libraries
A Few Things to Accelerate the Rate of
Scientific Discovery
• Better communication, data and knowledge access,
and new modes of discovery, which means:
– We need data and knowledge about that data to
interoperate i.e. we need new kinds of fast, versatile
publications and data archives
– We need to be more open with both
– We need to think more about the tools that analyze,
visualize and annotate data to maximize knowledge
discovery
– Reward systems need to change
– We need scientist management tools
– We need to be less fixated on the big data problems
– We need to unleash the full power of the Internet
Oct. 18, 2010 UCSD Libraries Easy Hard
The Truth About My Laboratory
• I have ?? mail folders!
• The intellectual
memory of my
laboratory is in those
folders
• This is an unhealthy hub
and spoke mentality
We Need Scientist Management Tools
Oct. 18, 2010 UCSD Libraries
The Truth About My Laboratory
• I generate way more negative that
positive data, but where is it?
• Content management is a mess
– Slides, posters…..
– Data, lab notebooks ….
– Collaborations, Journal clubs …
• Software is open but where is it?
• Farewell is for the data too
Computational Biology Resources Lack Persistence and Usability. PLoS
Comp. Biol. 2008 4(7): e1000136 We Need Scientist Management Tools
http://artbyvida.com/portfolio.php
Many Great Tools Out There
Oct. 18, 2010 UCSD Libraries
We Need Scientist Management Tools
Taverna
Where I See the Problems
• The long tail is confused
• Lack of interoperability between the options
• The reward (publishing) is still removed from
the available tools
Oct. 18, 2010 UCSD Libraries We Need Scientist Management Tools
Science is Increasingly a Digital Workflow
Scientist
Idea
Experiment
Data
Conclusions
PublishThe Role of the Institution
Laboratory
Publisher
Maybe The Line is Somewhere Else?
Scientist
Idea
Experiment
Data
Conclusions
Publish
Laboratory
Publisher
Institution
Lab Notebook
The Role of the Institution
This Amounts to Publishing Workflows
But That Has its Problems
• Workflows are not linear
• Workflow : paper is not 1:1
• Confidentiality
• Peer review
• Infrastructure
• Community acceptance
• Reward system
The Role of the Institution
Solutions to Publishing Workflows?
• New organizations (university as publisher?)
• Appropriate reward system
• Shared governance
– author, institution, publisher
• Crowd sourcing the electronic printing press
The Role of the Institution
Crowd Sourcing the Electronic Printing Press
(aka Workshop: Beyond the PDF)
• Funded by DDCF, Microsoft, NCI, Sage
Bionetworks:
• Aims:
– Define user requirements
– Establish a specification document
– Open source the development effort
– Have a commitment from a publisher to publish a
research object using the system
– Act as an exemplar for what can be done
The Role of the Institution
Logistics
• UC San Diego
• Jan 19-21, 2010
• Under the auspices of
W3C
• FoRC will have a follow
on meeting
The Role of the Institution
Questions?
pbourne@ucsd.edu
Oct. 18, 2010 UCSD Libraries

Contenu connexe

Tendances

Introduction to linked data
Introduction to linked dataIntroduction to linked data
Introduction to linked dataLaura Po
 
Research Data Management: a gentle introduction for admin staff
Research Data Management: a gentle introduction for admin staffResearch Data Management: a gentle introduction for admin staff
Research Data Management: a gentle introduction for admin staffMartin Donnelly
 
Finding and managing process engineering information
Finding and managing process engineering informationFinding and managing process engineering information
Finding and managing process engineering informationThomas Hapke
 
Web serachning tools & techniques
Web serachning tools & techniquesWeb serachning tools & techniques
Web serachning tools & techniquesSanath Pushpakumara
 
Bridging Digital Humanities Research and Big Data Repositories of Digital Text
Bridging Digital Humanities Research and Big Data Repositories of Digital TextBridging Digital Humanities Research and Big Data Repositories of Digital Text
Bridging Digital Humanities Research and Big Data Repositories of Digital TextBeth Plale
 
Readings in Database Systems
Readings in Database SystemsReadings in Database Systems
Readings in Database Systemsmustafa sarac
 
Plale HathiTrust El Colegio de Mexico May2014
Plale HathiTrust El Colegio de Mexico May2014Plale HathiTrust El Colegio de Mexico May2014
Plale HathiTrust El Colegio de Mexico May2014Beth Plale
 
The HathiTrust Research Center: Big Data Analytics in a Secure Data Framework
The HathiTrust Research Center: Big Data Analytics in a Secure Data FrameworkThe HathiTrust Research Center: Big Data Analytics in a Secure Data Framework
The HathiTrust Research Center: Big Data Analytics in a Secure Data FrameworkRobert H. McDonald
 
Maass mass-omaha
Maass mass-omahaMaass mass-omaha
Maass mass-omahaBMaass97
 
Automatic Extraction of Science and Medicine from the scholarly literature
Automatic Extraction of Science and Medicine from the scholarly literatureAutomatic Extraction of Science and Medicine from the scholarly literature
Automatic Extraction of Science and Medicine from the scholarly literatureTheContentMine
 
Everyday digital scholarship: Using web-based tools for research
Everyday digital scholarship: Using web-based tools for researchEveryday digital scholarship: Using web-based tools for research
Everyday digital scholarship: Using web-based tools for researchFrancesca Di Donato
 
Linked Open Data in Libraries Archives & Museums
Linked Open Data in Libraries Archives & MuseumsLinked Open Data in Libraries Archives & Museums
Linked Open Data in Libraries Archives & MuseumsJon Voss
 
Keystone summer school 2015 paolo-missier-provenance
Keystone summer school 2015 paolo-missier-provenanceKeystone summer school 2015 paolo-missier-provenance
Keystone summer school 2015 paolo-missier-provenancePaolo Missier
 
Informatics UG4 2006-7
Informatics UG4 2006-7Informatics UG4 2006-7
Informatics UG4 2006-7skelly
 
CUA Humanities Lecture on Scholarly Communications LSC634 Fall2014
CUA Humanities Lecture on Scholarly Communications LSC634 Fall2014CUA Humanities Lecture on Scholarly Communications LSC634 Fall2014
CUA Humanities Lecture on Scholarly Communications LSC634 Fall2014Kimberly Hoffman
 
Data curation issues for repositories
Data curation issues for repositoriesData curation issues for repositories
Data curation issues for repositoriesChris Rusbridge
 
Af finding academic resources for your fyp oct 16
Af finding academic resources for your fyp oct 16Af finding academic resources for your fyp oct 16
Af finding academic resources for your fyp oct 16CityUniLibrary
 

Tendances (20)

Introduction to linked data
Introduction to linked dataIntroduction to linked data
Introduction to linked data
 
Research Data Management: a gentle introduction for admin staff
Research Data Management: a gentle introduction for admin staffResearch Data Management: a gentle introduction for admin staff
Research Data Management: a gentle introduction for admin staff
 
Finding and managing process engineering information
Finding and managing process engineering informationFinding and managing process engineering information
Finding and managing process engineering information
 
Web serachning tools & techniques
Web serachning tools & techniquesWeb serachning tools & techniques
Web serachning tools & techniques
 
Bridging Digital Humanities Research and Big Data Repositories of Digital Text
Bridging Digital Humanities Research and Big Data Repositories of Digital TextBridging Digital Humanities Research and Big Data Repositories of Digital Text
Bridging Digital Humanities Research and Big Data Repositories of Digital Text
 
Readings in Database Systems
Readings in Database SystemsReadings in Database Systems
Readings in Database Systems
 
Plale HathiTrust El Colegio de Mexico May2014
Plale HathiTrust El Colegio de Mexico May2014Plale HathiTrust El Colegio de Mexico May2014
Plale HathiTrust El Colegio de Mexico May2014
 
The HathiTrust Research Center: Big Data Analytics in a Secure Data Framework
The HathiTrust Research Center: Big Data Analytics in a Secure Data FrameworkThe HathiTrust Research Center: Big Data Analytics in a Secure Data Framework
The HathiTrust Research Center: Big Data Analytics in a Secure Data Framework
 
Transitive credit
Transitive creditTransitive credit
Transitive credit
 
Maass mass-omaha
Maass mass-omahaMaass mass-omaha
Maass mass-omaha
 
Curation is for cytomics
Curation is for cytomicsCuration is for cytomics
Curation is for cytomics
 
Automatic Extraction of Science and Medicine from the scholarly literature
Automatic Extraction of Science and Medicine from the scholarly literatureAutomatic Extraction of Science and Medicine from the scholarly literature
Automatic Extraction of Science and Medicine from the scholarly literature
 
Ziegler Open Data in Special Collections Libraries
Ziegler Open Data in Special Collections LibrariesZiegler Open Data in Special Collections Libraries
Ziegler Open Data in Special Collections Libraries
 
Everyday digital scholarship: Using web-based tools for research
Everyday digital scholarship: Using web-based tools for researchEveryday digital scholarship: Using web-based tools for research
Everyday digital scholarship: Using web-based tools for research
 
Linked Open Data in Libraries Archives & Museums
Linked Open Data in Libraries Archives & MuseumsLinked Open Data in Libraries Archives & Museums
Linked Open Data in Libraries Archives & Museums
 
Keystone summer school 2015 paolo-missier-provenance
Keystone summer school 2015 paolo-missier-provenanceKeystone summer school 2015 paolo-missier-provenance
Keystone summer school 2015 paolo-missier-provenance
 
Informatics UG4 2006-7
Informatics UG4 2006-7Informatics UG4 2006-7
Informatics UG4 2006-7
 
CUA Humanities Lecture on Scholarly Communications LSC634 Fall2014
CUA Humanities Lecture on Scholarly Communications LSC634 Fall2014CUA Humanities Lecture on Scholarly Communications LSC634 Fall2014
CUA Humanities Lecture on Scholarly Communications LSC634 Fall2014
 
Data curation issues for repositories
Data curation issues for repositoriesData curation issues for repositories
Data curation issues for repositories
 
Af finding academic resources for your fyp oct 16
Af finding academic resources for your fyp oct 16Af finding academic resources for your fyp oct 16
Af finding academic resources for your fyp oct 16
 

Similaire à Ucsd library10182010

Is a Biological Database Really Different than a Biological Journal?
Is a Biological Database Really Different than a Biological Journal?Is a Biological Database Really Different than a Biological Journal?
Is a Biological Database Really Different than a Biological Journal?Philip Bourne
 
Open Access and Research Communication: The Perspective of Force11
Open Access and Research Communication: The Perspective of Force11Open Access and Research Communication: The Perspective of Force11
Open Access and Research Communication: The Perspective of Force11Maryann Martone
 
FORCE11: Future of Research Communications and e-Scholarship
FORCE11:  Future of Research Communications and e-ScholarshipFORCE11:  Future of Research Communications and e-Scholarship
FORCE11: Future of Research Communications and e-ScholarshipMaryann Martone
 
Legal scholarship and OA publishing: developing radical pathways to free, op...
 Legal scholarship and OA publishing: developing radical pathways to free, op... Legal scholarship and OA publishing: developing radical pathways to free, op...
Legal scholarship and OA publishing: developing radical pathways to free, op...York University - Osgoode Hall Law School
 
One Scientist’s Wish List for Scientific Publishers
One Scientist’s Wish List for Scientific PublishersOne Scientist’s Wish List for Scientific Publishers
One Scientist’s Wish List for Scientific PublishersPhilip Bourne
 
Data Literacy: Creating and Managing Reserach Data
Data Literacy: Creating and Managing Reserach DataData Literacy: Creating and Managing Reserach Data
Data Literacy: Creating and Managing Reserach Datacunera
 
Teaching Data Science to Undergraduate Students
Teaching Data Science to Undergraduate StudentsTeaching Data Science to Undergraduate Students
Teaching Data Science to Undergraduate StudentsNicole Vasilevsky
 
Why does research data matter to libraries
Why does research data matter to librariesWhy does research data matter to libraries
Why does research data matter to librariesJisc RDM
 
Open Access to Research Data: Challenges and Solutions
Open Access to Research Data: Challenges and SolutionsOpen Access to Research Data: Challenges and Solutions
Open Access to Research Data: Challenges and SolutionsMartin Donnelly
 
Digital Data Sharing: Opportunities and Challenges of Opening Research
Digital Data Sharing: Opportunities and Challenges of Opening ResearchDigital Data Sharing: Opportunities and Challenges of Opening Research
Digital Data Sharing: Opportunities and Challenges of Opening ResearchMartin Donnelly
 
How to Execute A Research Paper
How to Execute A Research PaperHow to Execute A Research Paper
How to Execute A Research PaperAnita de Waard
 

Similaire à Ucsd library10182010 (20)

Is a Biological Database Really Different than a Biological Journal?
Is a Biological Database Really Different than a Biological Journal?Is a Biological Database Really Different than a Biological Journal?
Is a Biological Database Really Different than a Biological Journal?
 
Murpha11
Murpha11Murpha11
Murpha11
 
Open Access and Research Communication: The Perspective of Force11
Open Access and Research Communication: The Perspective of Force11Open Access and Research Communication: The Perspective of Force11
Open Access and Research Communication: The Perspective of Force11
 
The Era of Open
The Era of OpenThe Era of Open
The Era of Open
 
Alpsp final martone
Alpsp final martoneAlpsp final martone
Alpsp final martone
 
FORCE11: Future of Research Communications and e-Scholarship
FORCE11:  Future of Research Communications and e-ScholarshipFORCE11:  Future of Research Communications and e-Scholarship
FORCE11: Future of Research Communications and e-Scholarship
 
Data 101: A Gentle Introduction
Data 101: A Gentle IntroductionData 101: A Gentle Introduction
Data 101: A Gentle Introduction
 
Legal scholarship and OA publishing: developing radical pathways to free, op...
 Legal scholarship and OA publishing: developing radical pathways to free, op... Legal scholarship and OA publishing: developing radical pathways to free, op...
Legal scholarship and OA publishing: developing radical pathways to free, op...
 
One Scientist’s Wish List for Scientific Publishers
One Scientist’s Wish List for Scientific PublishersOne Scientist’s Wish List for Scientific Publishers
One Scientist’s Wish List for Scientific Publishers
 
Data 101: A Gentle Introduction
Data 101: A Gentle IntroductionData 101: A Gentle Introduction
Data 101: A Gentle Introduction
 
Data Literacy: Creating and Managing Reserach Data
Data Literacy: Creating and Managing Reserach DataData Literacy: Creating and Managing Reserach Data
Data Literacy: Creating and Managing Reserach Data
 
Teaching Data Science to Undergraduate Students
Teaching Data Science to Undergraduate StudentsTeaching Data Science to Undergraduate Students
Teaching Data Science to Undergraduate Students
 
Why does research data matter to libraries
Why does research data matter to librariesWhy does research data matter to libraries
Why does research data matter to libraries
 
World ctc2013scoopitcytomics
World ctc2013scoopitcytomicsWorld ctc2013scoopitcytomics
World ctc2013scoopitcytomics
 
The Future of Research Communications and e-Scholarship: Are we there yet?
The Future of Research Communications and e-Scholarship: Are we there yet?The Future of Research Communications and e-Scholarship: Are we there yet?
The Future of Research Communications and e-Scholarship: Are we there yet?
 
Open Access to Research Data: Challenges and Solutions
Open Access to Research Data: Challenges and SolutionsOpen Access to Research Data: Challenges and Solutions
Open Access to Research Data: Challenges and Solutions
 
World CTCUS2012 Scoopit Cytomics
World CTCUS2012 Scoopit CytomicsWorld CTCUS2012 Scoopit Cytomics
World CTCUS2012 Scoopit Cytomics
 
2015 NISO Forum: The Future of Library Resource Discovery
2015 NISO Forum: The Future of Library Resource Discovery2015 NISO Forum: The Future of Library Resource Discovery
2015 NISO Forum: The Future of Library Resource Discovery
 
Digital Data Sharing: Opportunities and Challenges of Opening Research
Digital Data Sharing: Opportunities and Challenges of Opening ResearchDigital Data Sharing: Opportunities and Challenges of Opening Research
Digital Data Sharing: Opportunities and Challenges of Opening Research
 
How to Execute A Research Paper
How to Execute A Research PaperHow to Execute A Research Paper
How to Execute A Research Paper
 

Plus de Philip Bourne

Data Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has ChangedData Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has ChangedPhilip Bourne
 
Data Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has ChangedData Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has ChangedPhilip Bourne
 
AI in Medical Education A Meta View to Start a Conversation
AI in Medical Education A Meta View to Start a ConversationAI in Medical Education A Meta View to Start a Conversation
AI in Medical Education A Meta View to Start a ConversationPhilip Bourne
 
AI+ Now and Then How Did We Get Here And Where Are We Going
AI+ Now and Then How Did We Get Here And Where Are We GoingAI+ Now and Then How Did We Get Here And Where Are We Going
AI+ Now and Then How Did We Get Here And Where Are We GoingPhilip Bourne
 
Thoughts on Biological Data Sustainability
Thoughts on Biological Data SustainabilityThoughts on Biological Data Sustainability
Thoughts on Biological Data SustainabilityPhilip Bourne
 
What is FAIR Data and Who Needs It?
What is FAIR Data and Who Needs It?What is FAIR Data and Who Needs It?
What is FAIR Data and Who Needs It?Philip Bourne
 
Data Science Meets Biomedicine, Does Anything Change
Data Science Meets Biomedicine, Does Anything ChangeData Science Meets Biomedicine, Does Anything Change
Data Science Meets Biomedicine, Does Anything ChangePhilip Bourne
 
Data Science Meets Drug Discovery
Data Science Meets Drug DiscoveryData Science Meets Drug Discovery
Data Science Meets Drug DiscoveryPhilip Bourne
 
Biomedical Data Science: We Are Not Alone
Biomedical Data Science: We Are Not AloneBiomedical Data Science: We Are Not Alone
Biomedical Data Science: We Are Not AlonePhilip Bourne
 
BIMS7100-2023. Social Responsibility in Research
BIMS7100-2023. Social Responsibility in ResearchBIMS7100-2023. Social Responsibility in Research
BIMS7100-2023. Social Responsibility in ResearchPhilip Bourne
 
AI from the Perspective of a School of Data Science
AI from the Perspective of a School of Data ScienceAI from the Perspective of a School of Data Science
AI from the Perspective of a School of Data SciencePhilip Bourne
 
What Data Science Will Mean to You - One Person's View
What Data Science Will Mean to You - One Person's ViewWhat Data Science Will Mean to You - One Person's View
What Data Science Will Mean to You - One Person's ViewPhilip Bourne
 
Novo Nordisk 080522.pptx
Novo Nordisk 080522.pptxNovo Nordisk 080522.pptx
Novo Nordisk 080522.pptxPhilip Bourne
 
Towards a US Open research Commons (ORC)
Towards a US Open research Commons (ORC)Towards a US Open research Commons (ORC)
Towards a US Open research Commons (ORC)Philip Bourne
 
COVID and Precision Education
COVID and Precision EducationCOVID and Precision Education
COVID and Precision EducationPhilip Bourne
 
One View of Data Science
One View of Data ScienceOne View of Data Science
One View of Data SciencePhilip Bourne
 
Cancer Research Meets Data Science — What Can We Do Together?
Cancer Research Meets Data Science — What Can We Do Together?Cancer Research Meets Data Science — What Can We Do Together?
Cancer Research Meets Data Science — What Can We Do Together?Philip Bourne
 
Data Science Meets Open Scholarship – What Comes Next?
Data Science Meets Open Scholarship – What Comes Next?Data Science Meets Open Scholarship – What Comes Next?
Data Science Meets Open Scholarship – What Comes Next?Philip Bourne
 
Data to Advance Sustainability
Data to Advance SustainabilityData to Advance Sustainability
Data to Advance SustainabilityPhilip Bourne
 
Frontiers of Computing at the Cellular and Molecular Scales
Frontiers of Computing at the Cellular and Molecular ScalesFrontiers of Computing at the Cellular and Molecular Scales
Frontiers of Computing at the Cellular and Molecular ScalesPhilip Bourne
 

Plus de Philip Bourne (20)

Data Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has ChangedData Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has Changed
 
Data Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has ChangedData Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has Changed
 
AI in Medical Education A Meta View to Start a Conversation
AI in Medical Education A Meta View to Start a ConversationAI in Medical Education A Meta View to Start a Conversation
AI in Medical Education A Meta View to Start a Conversation
 
AI+ Now and Then How Did We Get Here And Where Are We Going
AI+ Now and Then How Did We Get Here And Where Are We GoingAI+ Now and Then How Did We Get Here And Where Are We Going
AI+ Now and Then How Did We Get Here And Where Are We Going
 
Thoughts on Biological Data Sustainability
Thoughts on Biological Data SustainabilityThoughts on Biological Data Sustainability
Thoughts on Biological Data Sustainability
 
What is FAIR Data and Who Needs It?
What is FAIR Data and Who Needs It?What is FAIR Data and Who Needs It?
What is FAIR Data and Who Needs It?
 
Data Science Meets Biomedicine, Does Anything Change
Data Science Meets Biomedicine, Does Anything ChangeData Science Meets Biomedicine, Does Anything Change
Data Science Meets Biomedicine, Does Anything Change
 
Data Science Meets Drug Discovery
Data Science Meets Drug DiscoveryData Science Meets Drug Discovery
Data Science Meets Drug Discovery
 
Biomedical Data Science: We Are Not Alone
Biomedical Data Science: We Are Not AloneBiomedical Data Science: We Are Not Alone
Biomedical Data Science: We Are Not Alone
 
BIMS7100-2023. Social Responsibility in Research
BIMS7100-2023. Social Responsibility in ResearchBIMS7100-2023. Social Responsibility in Research
BIMS7100-2023. Social Responsibility in Research
 
AI from the Perspective of a School of Data Science
AI from the Perspective of a School of Data ScienceAI from the Perspective of a School of Data Science
AI from the Perspective of a School of Data Science
 
What Data Science Will Mean to You - One Person's View
What Data Science Will Mean to You - One Person's ViewWhat Data Science Will Mean to You - One Person's View
What Data Science Will Mean to You - One Person's View
 
Novo Nordisk 080522.pptx
Novo Nordisk 080522.pptxNovo Nordisk 080522.pptx
Novo Nordisk 080522.pptx
 
Towards a US Open research Commons (ORC)
Towards a US Open research Commons (ORC)Towards a US Open research Commons (ORC)
Towards a US Open research Commons (ORC)
 
COVID and Precision Education
COVID and Precision EducationCOVID and Precision Education
COVID and Precision Education
 
One View of Data Science
One View of Data ScienceOne View of Data Science
One View of Data Science
 
Cancer Research Meets Data Science — What Can We Do Together?
Cancer Research Meets Data Science — What Can We Do Together?Cancer Research Meets Data Science — What Can We Do Together?
Cancer Research Meets Data Science — What Can We Do Together?
 
Data Science Meets Open Scholarship – What Comes Next?
Data Science Meets Open Scholarship – What Comes Next?Data Science Meets Open Scholarship – What Comes Next?
Data Science Meets Open Scholarship – What Comes Next?
 
Data to Advance Sustainability
Data to Advance SustainabilityData to Advance Sustainability
Data to Advance Sustainability
 
Frontiers of Computing at the Cellular and Molecular Scales
Frontiers of Computing at the Cellular and Molecular ScalesFrontiers of Computing at the Cellular and Molecular Scales
Frontiers of Computing at the Cellular and Molecular Scales
 

Dernier

Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104misteraugie
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...EduSkills OECD
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...christianmathematics
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdfQucHHunhnh
 
Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Disha Kariya
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfAyushMahapatra5
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3JemimahLaneBuaron
 
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...fonyou31
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfchloefrazer622
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAssociation for Project Management
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Krashi Coaching
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
Disha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfDisha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfchloefrazer622
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Sapana Sha
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDThiyagu K
 
9548086042 for call girls in Indira Nagar with room service
9548086042  for call girls in Indira Nagar  with room service9548086042  for call girls in Indira Nagar  with room service
9548086042 for call girls in Indira Nagar with room servicediscovermytutordmt
 

Dernier (20)

Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdf
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3
 
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdf
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across Sectors
 
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
Disha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfDisha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdf
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SD
 
9548086042 for call girls in Indira Nagar with room service
9548086042  for call girls in Indira Nagar  with room service9548086042  for call girls in Indira Nagar  with room service
9548086042 for call girls in Indira Nagar with room service
 

Ucsd library10182010

  • 1. Why is Scholarly Communication Broken and What Can Be Done? In Celebration of Open Access Week Philip E. Bourne University of California San Diego pbourne@ucsd.edu UCSD LibrariesOct. 18, 2010
  • 2. Disclaimer • I am a domain (life) scientist not a computer or information scientist • I am fortunate enough to have a major biological resource (the Protein Data Bank) and a major biological journal (PLoS Computational Biology) as my playground • I am part of the long tail • I am naïve, but I am the majority Oct. 18, 2010 UCSD Libraries
  • 3. Agenda • Motivation • What needs to be done? • A few examples • The role of the institution Oct. 18, 2010 UCSD Libraries
  • 4. The Scientific Process is Too Slow to Respond to a Crisis – Either Global or Personal Oct. 18, 2010 UCSD Libraries Motivation http://knol.google.com/k/plos-currents-influenza# By the time the paper is published we could all be dead
  • 5. * http://www.cdc.gov/h1n1flu/estimates/April_March_13.htm Jan. 2008 Jan. 2009 Jan. 2010Jul. 2009Jul. 2008 Jul. 2010 1RUZ: 1918 H1 Hemagglutinin Structure Summary page activity for H1N1 Influenza related structures 3B7E: Neuraminidase of A/Brevig Mission/1/1918 H1N1 strain in complex with zanamivir In a time of crisis the need for fast access to accurate data and any knowledge of that data are paramount Motivation Oct. 18, 2010 UCSD Libraries
  • 6. If that is not enough… For some people the scientific process may be too slow to save their life Oct. 18, 2010 UCSD Libraries Motivation
  • 7. Josh Sommer – A Remarkable Young Man Co-founder & Executive Director the Chordoma Foundation Oct. 18, 2010 UCSD Libraries http://sagecongress.org/Presentations/Sommer.pdf Motivation
  • 8. Chordoma • A rare form of brain cancer • No known drugs • Treatment – surgical resection followed by intense radiation therapy Oct. 18, 2010 UCSD Libraries Motivation http://upload.wikimedia.org/wikipedia/commons/2/2b/Chordoma.JPG
  • 9. Oct. 18, 2010 UCSD Libraries http://sagecongress.org/Presentations/Sommer.pdf Motivation
  • 10. Oct. 18, 2010 UCSD Libraries http://sagecongress.org/Presentations/Sommer.pdf Motivation
  • 11. Oct. 18, 2010 UCSD Libraries http://sagecongress.org/Presentations/Sommer.pdf Motivation
  • 12. Oct. 18, 2010 UCSD Libraries Adapted: http://sagecongress.org/Presentations/Sommer.pdf Motivation Isaac If I have seen further it is only by standing on the shoulders of giants Isaac Newton From Josh’s point of view the climb up just takes too long > 15 years and > $850M to be more precise
  • 13. Oct. 18, 2010 UCSD Libraries http://sagecongress.org/Presentations/Sommer.pdf Motivation
  • 14. Oct. 18, 2010 UCSD Libraries Motivation http://sagecongress.org/Presentations/Sommer.pdf
  • 15. Oct. 18, 2010 UCSD Libraries http://fora.tv/2010/04/23/Sage_Commons_Josh_Sommer_Chordoma_Foundation Motivation
  • 16. Now we are all hopefully motivated let us break this down to what actually needs to be done in my opinion Here are a few big things … Oct. 18, 2010 UCSD Libraries What Needs to be Done?
  • 17. A Few Things to Accelerate the Rate of Scientific Discovery • Better communication, data and knowledge access, and new modes of discovery, which means: – We need data and knowledge about that data to interoperate i.e. we need new kinds of fast, versatile publications and data archives – We need to be more open with both – We need to think more about the tools that analyze, visualize and annotate data to maximize knowledge discovery – Reward systems need to change – We need scientist management tools – We need to be less fixated on the big data problems – We need to unleash the full power of the Internet Oct. 18, 2010 UCSD Libraries Easy Hard
  • 18. 1. A link brings up figures from the paper 0. Full text of PLoS papers stored in a database 2. Clicking the paper figure retrieves data from the PDB which is analyzed 3. A composite view of journal and database content results We Need Data and Knowledge About That Data to Interoperate 1. User clicks on content 2. Metadata and webservices to data provide an interactive view that can be annotated 3. Selecting features provides a data/knowledge mashup 4. Analysis leads to new content I can share 4. The composite view has links to pertinent blocks of literature text and back to the PDB 1. 2. 3. 4. The Knowledge and Data Cycle PLoS Comp. Biol. 2005 1(3) e34
  • 19. We Need Data and Knowledge About That Data to Interoperate – What is Stopping US? • Governance – publishers vs. database providers • Reward • Metadata standards for provenance, privacy etc. • Exemplars • …. Oct. 18, 2010 UCSD Libraries Caveat: Each discipline is different – I speak very much from a biomedical sciences perspective
  • 20. Certainly the Argument for Interoperability in the Biomedical Sciences is Strong • PubMed contains 18,792,257 entries • ~100,000 papers indexed per month • In Feb 2009: – 67,406,898 interactive searches were done – 92,216,786 entries were viewed • 1078 databases reported in NAR 2008 • MetaBase http://biodatabase.org reports 2,651 entries edited 12,587 times Data as of April 14, 2009 PLoS Comp. Biol. 2005 1(3) e34 What Needs to be Done?
  • 21. www.rcsb.org/pdb/explore/literature.do?structureId=1TIM Example Interoperability: The Database View BMC Bioinformatics 2010 11:220 Oct. 18, 2010 UCSD Libraries What Needs to be Done?
  • 22. Example Interoperability: The Literature View http://biolit.ucsd.edu Nucleic Acids Research 2008 36(S2) W385-389 Oct. 18, 2010 UCSD Libraries What Needs to be Done?
  • 23. ICTP Trieste, December 10, 2007 Oct. 18, 2010 UCSD Libraries
  • 24. Semantic Tagging & Widgets are a Powerful Tool to Integrate Data and Knowledge of that Data, But as Yet Not Used Much Oct. 18, 2010 UCSD Libraries Will Widgets and Semantic Tagging Change Computational Biology? PLoS Comp. Biol. 6(2) e1000673 What Needs to be Done?
  • 25. Semantic Tagging of Database Content in The Literature or Elsewhere http://www.rcsb.org/pdb/static.do?p=widgets/widgetShowcase.jsp PLoS Comp. Biol. 6(2) e1000673Semantic Tagging
  • 26. Oct. 18, 2010 UCSD Libraries What Needs to be Done?
  • 27. The Publishers are Starting to Do It Oct. 18, 2010 UCSD Libraries From Anita de Waard, Elsevier What Needs to be Done?
  • 28. This is Literature Post-processing Better to Get the Authors Involved • Authors are the absolute experts on the content • More effective distribution of labor • Add metadata before the article enters the publishing process Oct. 18, 2010 UCSD Libraries What Needs to be Done?
  • 29. Word 2007 Add-in for authors • Allows authors to add metadata as they write, before they submit the manuscript • Authors are assisted by automated term recognition – OBO ontologies – Database IDs • Metadata are embedded directly into the manuscript document via XML tags, OOXML format – Open – Machine-readable • Open source, Microsoft Public License http://www.codeplex.com/ucsdbiolit Oct. 18, 2010 UCSD Libraries What Needs to be Done?
  • 30. Challenges • Authors – Carrot IF one or more publishers fast tracked a paper that had semantic markup it might catch on • Publishers – Carrot Competitive advantage Oct. 18, 2010 UCSD Libraries What Needs to be Done?
  • 31. A Few Things to Accelerate the Rate of Scientific Discovery • Better communication, data and knowledge access, and new modes of discovery, which means: – We need data and knowledge about that data to interoperate i.e. we need new kinds of fast, versatile publications and data archives – We need to be more open with both – We need to think more about the tools that analyze, visualize and annotate data to maximize knowledge discovery – Reward systems need to change – We need scientist management tools – We need to be less fixated on the big data problems – We need to unleash the full power of the Internet Oct. 18, 2010 UCSD Libraries Easy Hard
  • 32. Reward Systems Need to Change What is Needed? • Author disambiguation • Auditing (identification and metrics) of all scholarship - means new tools • Seniors need to promote alternative forms of scholarship • Juniors need to respond Oct. 18, 2010 UCSD Libraries Reward Systems Need to Change Ten Simple Rules for Getting Promoted as a Computational Biologist in Academia PLoS Comp Biol to appear
  • 33. Example Tools Oct. 18, 2010 UCSD Libraries http://pubnet.gersteinlab.org/ http://www.researcherid.com/ http://www.biomedexperts.com
  • 34. What Are these Alternative Forms of Scholarship? Research [Grants] Journal Article Conference Paper Poster Session Reviews Blogs Community Service/Data Curation Reward Systems Need to Change Oct. 18, 2010 UCSD Libraries
  • 35. Ideally the ID will be Tagged to Every Piece of Scholarly Communication I an Not a Scientist I am a Number PLoS Comp. Biol. 2008 4(12) e1000247 Reward Systems Need to Change Oct. 18, 2010 UCSD Libraries
  • 36. A Few Things to Accelerate the Rate of Scientific Discovery • Better communication, data and knowledge access, and new modes of discovery, which means: – We need data and knowledge about that data to interoperate i.e. we need new kinds of fast, versatile publications and data archives – We need to be more open with both – We need to think more about the tools that analyze, visualize and annotate data to maximize knowledge discovery – Reward systems need to change – We need scientist management tools – We need to be less fixated on the big data problems – We need to unleash the full power of the Internet Oct. 18, 2010 UCSD Libraries Easy Hard
  • 37. The Truth About My Laboratory • I have ?? mail folders! • The intellectual memory of my laboratory is in those folders • This is an unhealthy hub and spoke mentality We Need Scientist Management Tools Oct. 18, 2010 UCSD Libraries
  • 38. The Truth About My Laboratory • I generate way more negative that positive data, but where is it? • Content management is a mess – Slides, posters….. – Data, lab notebooks …. – Collaborations, Journal clubs … • Software is open but where is it? • Farewell is for the data too Computational Biology Resources Lack Persistence and Usability. PLoS Comp. Biol. 2008 4(7): e1000136 We Need Scientist Management Tools http://artbyvida.com/portfolio.php
  • 39. Many Great Tools Out There Oct. 18, 2010 UCSD Libraries We Need Scientist Management Tools Taverna
  • 40. Where I See the Problems • The long tail is confused • Lack of interoperability between the options • The reward (publishing) is still removed from the available tools Oct. 18, 2010 UCSD Libraries We Need Scientist Management Tools
  • 41. Science is Increasingly a Digital Workflow Scientist Idea Experiment Data Conclusions PublishThe Role of the Institution Laboratory Publisher
  • 42. Maybe The Line is Somewhere Else? Scientist Idea Experiment Data Conclusions Publish Laboratory Publisher Institution Lab Notebook The Role of the Institution
  • 43. This Amounts to Publishing Workflows But That Has its Problems • Workflows are not linear • Workflow : paper is not 1:1 • Confidentiality • Peer review • Infrastructure • Community acceptance • Reward system The Role of the Institution
  • 44. Solutions to Publishing Workflows? • New organizations (university as publisher?) • Appropriate reward system • Shared governance – author, institution, publisher • Crowd sourcing the electronic printing press The Role of the Institution
  • 45. Crowd Sourcing the Electronic Printing Press (aka Workshop: Beyond the PDF) • Funded by DDCF, Microsoft, NCI, Sage Bionetworks: • Aims: – Define user requirements – Establish a specification document – Open source the development effort – Have a commitment from a publisher to publish a research object using the system – Act as an exemplar for what can be done The Role of the Institution
  • 46. Logistics • UC San Diego • Jan 19-21, 2010 • Under the auspices of W3C • FoRC will have a follow on meeting The Role of the Institution