Big Journal Literature
Big Usage
Jan Velterop – SSP – Arlington, May 28, 2015
Number of abstracts in PubMed: 11,135,542
More than 2 added every minute of 2014
Information overload!
Is that overload?
Or rapidly increasing
knowledge…
…making a world of
difference that can
change the course
of scientific thought?
Dissemination
of knowledge
Optimal dissemination for
Lamp post research
Lamp post research:
Looking merely at the literature that
one can read – which is not
necessarily all the literature that is
potentially important to one’s research
Big Usage
But not in the way we’re used to
So, what to do?
Every problem has its solution
Possible strategies:
1. Publish a smaller number of papers
   Maybe, but if it means less information, it’s ludicrous
2. Accept that an ever smaller proportion of the available papers is actually being read
   How to choose, though?
   In any event: l’embarras du choix (an embarrassment of choice)
3. Capture the knowledge contained in all papers and map it in such a way that you can navigate that knowledge
   Yes! Helps to see trends and what to choose!
First
create an overview…
…only then
start digging
How might we create overviews?
“As the rate of publishing accelerates,
the need for computational support to
work out which articles to read, and
how to interpret, reproduce and validate
the claims they contain is growing.”
Quote from ‘Lazarus’:
http://www.bbsrc.ac.uk/pa/grants/AwardDetails.aspx?FundingReference=BB/L005298/1
Extract Key Insights
Imagine you had a paper that concluded:
“On hot days, it turns out that aspirin
decreases the chances of blood clots, but
increases the chances of heart attack in
humans; the effect wasn't observed in rats
at all; simulations of dogs seem to
suggest that the effect is present but
independent of temperature unless the dog
is accompanied by a human”
Terms picked out in that conclusion: hot days, aspirin, decreases, blood clots, increases, heart attack, humans, rats, dogs, temperature, dog, human
Significant concepts:
[CHEMBL25] (aspirin)
[EFO_0001702] ('temperature' from the
experimental factors ontology)
[Canis lupus familiaris]
[Homo sapiens]
[Rattus norvegicus]
Headline Interactions (in the form of Triples):
[ASPIRIN] [DECREASES] [THROMBOSIS]
[ASPIRIN] [INCREASES] [MYOCARDIAL INFARCTION]
Add this to the article’s abstract
(after it has been validated by the author)
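The concepts and triples listed above could travel with an abstract as a small machine-readable record. What follows is only a minimal Python sketch of that idea: the `Triple` class, the `build_annotation` function and all field names are hypothetical, not a format defined by the talk or by any publisher, and the concept list is a subset of the one on the slide.

```python
from dataclasses import dataclass, asdict
import json


@dataclass(frozen=True)
class Triple:
    """One headline interaction: subject, predicate, object."""
    subject: str
    predicate: str
    obj: str


def build_annotation(concepts, triples, validated_by_author=False):
    """Bundle concepts and triples into a JSON-serialisable record.

    The key names are illustrative assumptions, not an existing standard.
    """
    return {
        "significant_concepts": list(concepts),
        "headline_interactions": [asdict(t) for t in triples],
        "validated_by_author": validated_by_author,
    }


# A subset of the identifiers shown on the slide.
concepts = ["CHEMBL25", "EFO_0001702", "Homo sapiens"]
triples = [
    Triple("ASPIRIN", "DECREASES", "THROMBOSIS"),
    Triple("ASPIRIN", "INCREASES", "MYOCARDIAL INFARCTION"),
]

annotation = build_annotation(concepts, triples, validated_by_author=True)
print(json.dumps(annotation, indent=2))
```

Keeping the author-validation flag inside the record is one way to make explicit whether the author has signed off on the extracted statements.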
Most efficient:
If publishers were to do this
(doesn’t cost much, and makes articles far more useful)
In case publishers don’t, alternative
ways are being developed outside
publishers’ control
Currently:
publishing data in articles equals burying data
R.I.P.
Via Utopia Documents, LAZARUS ‘resurrects’
knowledge from being buried in articles:
• entities (‘concepts’, incl. synonyms, e.g. proteins)
• phrases, statements, assertions (e.g. triples)
• molecules (incl. Markush structure groups)
• graphs
• tables
http://utopiadocs.com
These are captured – with their provenance, e.g.
DOI – in a ‘Knowledge Graph’ of their relationships
When assertions are captured, they are compared
to the Knowledge Graph and labelled as ‘new’ (to
the Graph) or ‘already found earlier’
should be interesting for the peer reviewer of a newly submitted article
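The capture-and-compare step described above can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the LAZARUS implementation: the `KnowledgeGraph` class and its methods are hypothetical, an assertion is reduced to an exact-match triple, and the DOIs are made-up placeholders.

```python
from collections import defaultdict


class KnowledgeGraph:
    """Toy store of assertions (triples) with their provenance (DOIs)."""

    def __init__(self):
        # Maps (subject, predicate, object) to the list of DOIs asserting it.
        self._sources = defaultdict(list)

    def add_assertion(self, subject, predicate, obj, doi):
        """Record an assertion and label it relative to the graph so far."""
        key = (subject, predicate, obj)
        label = "already found earlier" if key in self._sources else "new"
        if doi not in self._sources[key]:
            self._sources[key].append(doi)
        return label

    def provenance(self, subject, predicate, obj):
        """Return the DOIs recorded for an assertion (empty if unknown)."""
        return list(self._sources.get((subject, predicate, obj), []))


kg = KnowledgeGraph()
# Placeholder DOIs, invented for this example only.
print(kg.add_assertion("ASPIRIN", "DECREASES", "THROMBOSIS", "10.1234/example.1"))
print(kg.add_assertion("ASPIRIN", "DECREASES", "THROMBOSIS", "10.1234/example.2"))
print(kg.provenance("ASPIRIN", "DECREASES", "THROMBOSIS"))
```

A real system would also have to reconcile synonyms and paraphrases before comparing; exact matching of normalised triples is the simplifying assumption here.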
“Lazarus to harness the crowd reading life-
science articles to resurrect the swathes of
legacy data buried in charts, tables, diagrams
and free-text, to liberate processable data into a
shared resource that benefits the community.”
“…activities currently carried out anyway by
individuals for their own purposes (annotating,
cross-referencing articles with databases,
organising collections of articles).”
Works on any PDF, from paywalled and open sources alike
VHL protein binds to HIF-α which is ubiquitinated and tagged for degradation in the proteasome.
‘Assertions’ and ‘significant concepts’ extracted
from articles (either by the publisher or by others,
like Utopia’s LAZARUS), are added to a growing
‘knowledge graph’ which can be analysed for
trends, clusters, areas of intensive activity, etc.
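As a rough illustration of what analysing such a graph for trends or areas of intensive activity might involve, the Python sketch below counts how often each concept appears in captured assertions, overall and per year. The function names and the sample data (including the years) are invented for this example; they are not taken from the talk.

```python
from collections import Counter


def activity_by_concept(assertions):
    """Count how many captured assertions mention each concept."""
    counts = Counter()
    for subject, _predicate, obj, _year in assertions:
        counts[subject] += 1
        counts[obj] += 1
    return counts


def yearly_trend(assertions, concept):
    """Number of assertions per year that mention the given concept."""
    trend = Counter()
    for subject, _predicate, obj, year in assertions:
        if concept in (subject, obj):
            trend[year] += 1
    return dict(sorted(trend.items()))


# Toy data: (subject, predicate, object, publication year); years are made up.
sample = [
    ("ASPIRIN", "DECREASES", "THROMBOSIS", 2013),
    ("ASPIRIN", "INCREASES", "MYOCARDIAL INFARCTION", 2014),
    ("VHL", "BINDS", "HIF-ALPHA", 2014),
    ("ASPIRIN", "DECREASES", "THROMBOSIS", 2014),
]

print(activity_by_concept(sample).most_common(3))
print(yearly_trend(sample, "ASPIRIN"))
```

Simple counts like these only hint at activity; finding clusters would need a proper graph analysis on top of the same data.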
Getting the picture from a large amount of data
What we need is information
extracted from as many articles as
possible
The more we have, the ‘sharper’
the knowledge picture
Getting a better picture from even more assertions
Homing in
i.e. making the
choice what to
read in detail
BRAIN — Bio Relations And Intelligence Network
“Recombinant Knowledge”
Once researchers have identified the
articles they really need to read,
it should be made very easy to do so
Ergo, what publishers should do, too,
is to make all articles available in
all formats: HTML, XML, PDF and
ePub – even print, on demand.
Also on mobile devices
For instance:
Easier than you might think
(www.researchpad.co)
Build collection of favourites
Read full text
Inspect metrics
share with others
sales@newgen.co technical inquiries: patrick@newgen.co
In their words:
ResearchPad Launch Process
• Project Definition
• Branding
• Publishing
• Go Live
Turnaround time: 8 weeks
Slide borrowed from:
What ResearchPad can do for publishers who
want it, at no extra cost*, is to integrate a
publisher’s content with anything from
elsewhere that’s freely available with open
access, so that this open access material can
be accessed from within the publisher’s platform
* personal communication
Thank you
Jan Velterop – 28 May 2015
velterop@me.com

Velterop 2 a ssp arlington may 2015

  • 1. Big Journal Literature Big Usage Jan Velterop – SSP – Arlington, May 28, 2015
  • 2. 11,135,542 More than 2 added every minute of 2014 Number of abstracts in PubMed
  • 4. that Overload? Or rapidly increasing knowledge… …making a world of difference that can change the course of scientific thought?
  • 7.
  • 9. Looking merely at the literature that one can read – which is not necessarily all the literature that is potentially important to one’s research Lamp post research:
  • 10.
  • 11.
  • 12.
  • 13.
  • 14.
  • 15.
  • 16.
  • 17.
  • 18.
  • 19.
  • 20.
  • 21.
  • 22. Big Usage But not in the way we’re used to
  • 23. So, what to do?
  • 25. Possible strategies: 1.Publish a smaller number of papers 2.Accept that an ever smaller proportion of the available papers is actually being read 3.Capture the knowledge contained in all papers and map it in such a way that you can navigate that knowledge
  • 26. Possible strategies: 1.Publish a smaller number of papers Maybe, but if it means less information, it’s ludicrous 2.Accept that an ever smaller proportion of the available papers is actually being read 3.Capture the knowledge contained in all papers and map it in such a way that you can navigate that knowledge
  • 27. Possible strategies: 1.Publish a smaller number of papers 2.Accept that an ever smaller proportion of the available papers is actually being read How to choose, though? 3.Capture the knowledge contained in all papers and map it in such a way that you can navigate that knowledge
  • 28.
  • 29.
  • 31. Possible strategies: 1.Publish a smaller number of papers 2.Accept that an ever smaller proportion of the available papers is actually being read 3.Capture the knowledge contained in all papers and map it in such a way that you can navigate that knowledge Yes! Helps to see trends and what to choose!
  • 34. How might we create overviews?
  • 35. “As the rate of publishing accelerates, the need for computational support to work out which articles to read, and how to interpret, reproduce and validate the claims they contain is growing.” Quote from ‘Lazarus’: http://www.bbsrc.ac.uk/pa/grants/AwardDetails.aspx?FundingReference=BB/L005298/1
  • 37. Imagine you had a paper that concluded: “On hot days, it turns out that aspirin decreases the chances of blot clots, but increases the chances of heart attack in humans; the effect wasn't observed in rats at all; simulations of dogs seem to suggest that the effect is present but independent of temperature unless the dog is accompanied by a human”
  • 38. Imagine you had a paper that concluded: “On hot dayshot days, it turns out that aspirinaspirin decreasesdecreases the chances of blot clotsblot clots, but increasesincreases the chances of heart attackheart attack in humanshumans; the effect wasn't observed in ratsrats at all; simulations of dogsdogs seem to suggest that the effect is present but independent of temperaturetemperature unless the dogdog is accompanied by a humanhuman”
  • 39. Significant concepts: [CHEMBL25] (aspirin) [EFO_0001702] ('temperature' from the experimental factors ontology) [Canis lupus familiaris] [Homo sapiens] [Mus musculus] Headline Interactions (in the form of Triples): [ASPIRIN] [DECREASES] [THROMBOSIS] [ASPIRIN] [INCREASES] [MYOCARDIAL INFARCTION] Significant concepts: [CHEMBL25] (aspirin) [EFO_0001702] ('temperature' from the experimental factors ontology) [Canis lupus familiaris] [Homo sapiens] [Mus musculus] Headline Interactions (in the form of Triples): [ASPIRIN] [DECREASES] [THROMBOSIS] [ASPIRIN] [INCREASES] [MYOCARDIAL INFARCTION] Add this to the article’s abstract (after it’s been validated by the author):
  • 40. Most efficient: If publishers were to do this (doesn’t cost much, and makes articles far more useful) In case publishers don’t, alternative ways are being developed outside publishers’ control
  • 41. publishing data in articles Currently: equals burying data R.I.P.R.I.P.
  • 42. ocuments Via Utopia Documents, LAZARUS ‘resurrects’ knowledge from being buried in articles: • entities (‘concepts’, incl. synonyms, e.g. proteins) • phrases, statements, assertions (e.g. triples) • molecules (incl. Markush structure groups) • graphs • tables http://utopiadocs.com
  • 43. • entities (‘concepts’, incl. synonyms, e.g. proteins) • phrases, statements, assertions (e.g. triples) • molecules (incl. Markush structure groups) • graphs • tables These are captured – with their provenance, e.g. DOI – in a ‘Knowledge Graph’ of their relationships When assertions are captured, they are compared to the Knowledge Graph and labelled as ‘new’ (to the Graph) or ‘already found earlier’ should beshould be interesting forinteresting for the peerthe peer reviewer of areviewer of a newlynewly submittedsubmitted articlearticle
  • 44. “Lazarus to harness the crowd reading life-science articles to resurrect the swathes of legacy data buried in charts, tables, diagrams and free-text, to liberate processable data into a shared resource that benefits the community.”
  • 45. “…activities currently carried out anyway by individuals for their own purposes (annotating, cross-referencing articles with databases, organising collections of articles).”
  • 46. Works on any PDF, from paywalled and open sources alike
  • 48. VHL protein binds to HIF-α which is ubiquitinated and tagged for degradation in the proteasome.
  • 51. ‘Assertions’ and ‘significant concepts’ extracted from articles (either by the publisher or by others, like Utopia’s LAZARUS) are added to a growing ‘knowledge graph’, which can be analysed for trends, clusters, areas of intensive activity, etc.
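As a rough illustration of what analysing the knowledge graph for 'areas of intensive activity' might involve, the sketch below counts, per concept and year, how many distinct articles assert something about that concept. The function name, the tuple layout and the sample records (including the `doi-a` / `doi-b` placeholders and the years) are all made up for the example.

```python
from collections import Counter

# Hypothetical records: (subject, predicate, object, article id, year).
sample_assertions = [
    ("ASPIRIN", "DECREASES", "THROMBOSIS", "doi-a", 2013),
    ("ASPIRIN", "INCREASES", "MYOCARDIAL INFARCTION", "doi-a", 2013),
    ("ASPIRIN", "DECREASES", "THROMBOSIS", "doi-b", 2014),
]


def concept_activity(assertions):
    """Count distinct articles per (concept, year) across all assertions."""
    seen = set()  # (concept, article id) pairs already counted
    counts = Counter()
    for subject, _predicate, obj, doi, year in assertions:
        for concept in (subject, obj):
            if (concept, doi) not in seen:
                seen.add((concept, doi))
                counts[(concept, year)] += 1
    return counts
```

Comparing the counts for one concept across years gives a simple trend line; a real analysis of clusters would need the relationships between concepts as well, which this sketch leaves out.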
  • 52. Getting the picture from a large amount of data
  • 53. What we need is information extracted from as many articles as possible. The more we have, the ‘sharper’ the knowledge picture.
  • 54. Getting a better picture from even more assertions
  • 56. Homing in, i.e. choosing what to read in detail
  • 58. BRAIN — Bio Relations And Intelligence Network
  • 66. Once researchers have identified the articles they really need to read, it should be made very easy to do so
  • 67. Ergo, what publishers should do, too, is to make all articles available in all formats: HTML, XML, PDF and ePub – even print, on demand.
  • 68. Also on mobile devices
  • 69. For instance: Easier than you might think
  • 73. Build collection of favourites
  • 79. sales@newgen.co technical inquiries: patrick@newgen.co In their words:
  • 80. ResearchPad Launch Process: Project Definition, Branding, Publishing, Go Live. Turnaround time: 8 weeks. Slide borrowed from:
  • 81. What ResearchPad can do for publishers who want it, at no extra cost*, is to integrate a publisher’s content with anything from elsewhere that’s freely available with open access, so that this open access material can be accessed from within the publisher’s platform * personal communication
  • 83. Thank you Jan Velterop – 28 May 2015 velterop@me.com

Editor's notes

  1. Project Definition: collect sample article files; understand the publisher’s business needs; costing / negotiation; sample article evaluation. Branding: demonstrate the app with publisher content; develop a customer branding / skin / features both for the app and EPUB; develop a specific version of ResearchPad using the branding assets; create specific cloud repositories and web service accounts / developer accounts. Publishing: develop a process that ties into front list development to publish simultaneously; understand and integrate with your user / subscription management system; estimate journal catalog archive size and establish a plan and cost to convert to EPUB. Go Live: push to the stores / launch web app; align with marketing programs to be run from the publisher.