Linked Science
Building a Web of Research Data
Rinke Hoekstra
VU University Amsterdam/University of Amsterdam
rinke.hoekstra@vu.nl
Linked Science - Building a Web of Research Data by Rinke Hoekstra
Licensed under a Creative Commons Attribution-ShareAlike 3.0 Unported License.
Linked Libraries
Building a Web of Library Data
Linked Archives
Building a Web of Archived Data
Linked Government
Building a Web of Government Data
[Figure: overlapping Dutch newspaper clippings about the research-fraud case of Diederik Stapel, professor of cognitive social psychology at Tilburg. Legible headline fragments: "‘Golden boy’ van de ... psychologie blijkt een onderzoeksfraudeur" (‘Golden boy’ of ... psychology turns out to be a research fraudster). The article text is not recoverable from the extraction.]
www.nature.com/nature Vol 461 | Issue no. 7261 | 10 September 2009
Data’s shameful neglect
Research cannot flourish if data are not preserved and made accessible. All concerned must act accordingly.
More and more often these days, a research project’s success is measured not just by the publications it produces, but also by the data it makes available to the wider community. Pioneering archives such as GenBank have demonstrated just how powerful such legacy data sets can be for generating new discoveries, especially when data are combined from many laboratories and analysed in ways that the original researchers could not have anticipated.
All but a handful of disciplines still lack the technical, institutional and cultural frameworks required to support such open data access (see pages 168 and 171), leading to a scandalous shortfall in the sharing of data by researchers (see page 160). This deficiency urgently needs to be addressed by funders, universities and the researchers themselves.
Research funding agencies need to recognize that preservation of and access to digital data are central to their mission, and need to be supported accordingly. Organizations in the United Kingdom, for instance, have made a good start. The Joint Information Systems Committee, established by the seven UK research councils in 1993, has made data-sharing a priority, and has helped to establish a Digital Curation Centre, headquartered at the University of Edinburgh, to be a national focus for research and development into data issues. Other European agencies have also pursued initiatives.
The United States, by contrast, is playing catch-up. Since 2005, a 29-member Interagency Working Group on Digital Data has been trying to get US funding agencies to develop plans for how they will support data archiving and, just as importantly, to develop policies on what data should and should not be preserved, and what exceptions should be made for reasons such as patient privacy. Some agencies have taken the lead in doing so; many more are hanging back. They should all be moving forwards vigorously.
What is more, funding agencies and researchers alike must ensure that they support not only the hardware needed to store the data, but also the software that will help investigators to do this. One important facet is metadata management software: tools that streamline the tedious process of annotating data with a description of what the bits mean, which instrument collected them, which algorithms have been used to process them and so on. This information is essential if other scientists are to reuse the data effectively.
Also necessary, especially in an era when data can be mixed and combined in unanticipated ways, is software that can keep track of which pieces of data came from whom. Such systems are essential if tenure and promotion committees are ever to give credit, as they should, to candidates’ track-record of data contribution.
Who should host these data? Agencies and the research community together need to create the digital equivalent of libraries: institutions that can take responsibility for preserving digital data and making them accessible over the long term. The university research libraries themselves are obvious candidates to assume this role. But whoever takes it on, data preservation will require robust, long-term funding. One potentially helpful initiative is the US National Science Foundation’s DataNet programme, in which researchers are exploring financial mechanisms such as subscription services and membership fees.
Finally, universities and individual disciplines need to undertake a vigorous programme of education and outreach about data. Consider, for example, that most university science students get a reasonably good grounding in statistics. But their studies rarely include anything about information management, a discipline that encompasses the entire life cycle of data, from how they are acquired and stored to how they are organized, retrieved and maintained over time. That needs to change: data management should be woven into every course in science, as one of the foundations of knowledge.
Fighting a Running Battle
• Too much data to make sense of
• Manual curation won’t cut it
• Advanced search not sufficient
[Chart: SciVerse ScienceDirect, Articles and Articles per Day, 1993 to 2011]
We need a better hold on research data
“Dweilen met de kraan open” (Dutch saying: “mopping the floor with the tap running”)
Linked Science is an approach to interconnect scientific
assets to enable transparent, reproducible and
transdisciplinary research. LinkedScience.org is a
community-driven effort to show what this means in practice.
What does it take to make this work?
Credits to Tomi Kauppinen and Willem van Hage
http://linkedscience.org
Interconnect
Scientific Asset = Published Article
• Connections are citations
• They are implicit free text (≠ machine readable)
• Expensive to make explicit
• ... and hidden in silos
Interconnect
Scientific Asset = Any part of Anything Published
(Articles, Datasets, Paragraphs, Spreadsheet Cells, Database Cells, Annotations, Social Media)
• Connections are typed links
• They are explicit (= machine readable)
• ... still expensive to make explicit
• But exposed to the outside
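To make "typed links" concrete, here is a minimal sketch in plain Python. It assumes a link is simply a (source, type, target) record; the `example.org` identifiers and the link type names (`cites`, `usesDataFrom`, `annotatedBy`) are hypothetical and not taken from any published vocabulary.

```python
from collections import namedtuple

# A typed link between two scientific assets.
# All identifiers and link types below are hypothetical examples.
Link = namedtuple("Link", ["source", "link_type", "target"])

links = [
    Link("http://example.org/article/1", "cites",
         "http://example.org/article/2"),
    Link("http://example.org/article/1#paragraph-3", "usesDataFrom",
         "http://example.org/dataset/7"),
    Link("http://example.org/dataset/7#cell-B4", "annotatedBy",
         "http://example.org/annotation/12"),
]


def links_of_type(all_links, link_type):
    """Select links by their type, something a free-text citation cannot offer."""
    return [link for link in all_links if link.link_type == link_type]


def targets_of(all_links, source):
    """Return every asset that a given source asset points to."""
    return {link.target for link in all_links if link.source == source}
```

Because the type is part of the link, a paragraph or a spreadsheet cell can be a source just as easily as a whole article, and a machine can filter on the kind of connection without parsing any text.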
Transparent
• They are implicit free text (≠ machine readable)
• They have Dublin Core metadata
• No underlying data for publications
• Data published without context
Pacific Barreleye, http://imgur.com/gallery/Mzyb5
(can rotate its eyes forwards or upwards to look through the transparent head to prey above)
Transparent
• Publications have explicit links to methods, experiments, data
• They have rich metadata: content, hypothesis, evidence, conclusion
• Data published as Linked Data
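As a rough illustration of a publication with explicit links and rich metadata, the sketch below writes a description out as N-Triples lines using only the standard library. The Dublin Core terms namespace is real; the `example.org` vocabulary (`usesMethod`, `basedOnData`, `hypothesis`) and all resource URIs are assumptions made up for this example, and the literal escaping is simplified.

```python
DCTERMS = "http://purl.org/dc/terms/"   # Dublin Core terms namespace
EX = "http://example.org/vocab/"        # hypothetical vocabulary for this sketch


def to_ntriples(subject, properties):
    """Serialise (predicate, value) pairs about one subject as N-Triples lines.

    Values that look like HTTP(S) URIs become resources, anything else a
    plain string literal. Escaping covers only backslash, quote and newline.
    """
    lines = []
    for predicate, value in properties:
        if value.startswith(("http://", "https://")):
            obj = f"<{value}>"
        else:
            escaped = (value.replace("\\", "\\\\")
                            .replace('"', '\\"')
                            .replace("\n", "\\n"))
            obj = f'"{escaped}"'
        lines.append(f"<{subject}> <{predicate}> {obj} .")
    return lines


paper = "http://example.org/paper/42"   # hypothetical publication URI
triples = to_ntriples(paper, [
    (DCTERMS + "title", "An example paper"),
    (EX + "usesMethod", "http://example.org/method/survey"),
    (EX + "basedOnData", "http://example.org/dataset/7"),
    (EX + "hypothesis", "Linked data improves reuse"),
])
```

Printing `"\n".join(triples)` yields one statement per line, so the link from the paper to its method and data is visible to any tool that reads N-Triples.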
Reproducible
• Publications abstract from research...
• Making it almost impossible to reproduce results
http://on.wsj.com/XCajtB
Reproducible
• Track and publish explicit provenance information
• Papers explicitly link to the underlying provenance
• Capture the processes by which data is manipulated
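One way to capture the processes by which data is manipulated is to wrap each processing step so that it records what went in and what came out. The following is a minimal sketch under that assumption; the `tracked` decorator, the `fingerprint` helper and the two example steps are hypothetical, and the record keys (`activity`, `used`, `generated`) only loosely echo the W3C PROV vocabulary.

```python
import hashlib
import json

provenance_log = []  # one record per executed processing step


def fingerprint(data):
    """Short, stable hash of JSON-serialisable data, used as an identifier."""
    encoded = json.dumps(data, sort_keys=True).encode("utf-8")
    return hashlib.sha256(encoded).hexdigest()[:12]


def tracked(step):
    """Wrap a processing step so every call is written to the provenance log."""
    def wrapper(data, **params):
        result = step(data, **params)
        provenance_log.append({
            "activity": step.__name__,
            "parameters": params,
            "used": fingerprint(data),
            "generated": fingerprint(result),
        })
        return result
    return wrapper


@tracked
def drop_missing(rows):
    return [r for r in rows if r is not None]


@tracked
def scale(rows, factor=1.0):
    return [r * factor for r in rows]


raw = [1.0, None, 3.0]
clean = drop_missing(raw)
scaled = scale(clean, factor=2.0)
```

After this run the log holds two records, and the `generated` fingerprint of the first step equals the `used` fingerprint of the second, which is the chain a paper could link to.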
Transdisciplinary
Applicability of results across domains: unknown
Börner K, Klavans R, Patek M, Zoss AM, Biberstine JR, et al. (2012) Design and Update of a Classification System:
The UCSD Map of Science. PLoS ONE 7(7): e39464. doi:10.1371/journal.pone.0039464
Transdisciplinary
• Unique identification
• Explicit links
• Rich metadata
• Research data
• Provenance information
Things to consider...
• Types of connections between scientific assets
• Portability of connections and metadata
• Granularity of connections and metadata
• Consistency of connections and metadata
• Ownership and control of connections and metadata
• Quality of connections and metadata
Linked Data
• Everything gets a URI
• Everything is connected as much as possible
• Everything is assigned to a category (including connections)
• When we know two things are the same, we say so
• We might even publish our data on the Web (but don’t have to)
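The "same thing" principle is usually expressed with `owl:sameAs` statements between URIs. As a sketch of what a consumer can do with such statements, the function below groups URIs that have been declared identical, directly or through a chain, using a small union-find structure. The function name and the example URIs are hypothetical.

```python
def merge_same_as(same_as_pairs):
    """Group URIs connected by sameAs statements into equivalence classes."""
    parent = {}

    def find(uri):
        # Follow parent pointers to the representative, shortening the path.
        parent.setdefault(uri, uri)
        while parent[uri] != uri:
            parent[uri] = parent[parent[uri]]
            uri = parent[uri]
        return uri

    for left, right in same_as_pairs:
        root_left, root_right = find(left), find(right)
        if root_left != root_right:
            parent[root_right] = root_left

    groups = {}
    for uri in parent:
        groups.setdefault(find(uri), set()).add(uri)
    return list(groups.values())


# Hypothetical identifiers for the same author in three different datasets,
# plus an unrelated pair for a dataset.
statements = [
    ("http://example.org/library/person/17", "http://example.org/archive/agent/rh"),
    ("http://example.org/archive/agent/rh", "http://example.org/science/author/9"),
    ("http://example.org/dataset/7", "http://example.org/repo/ds-7"),
]
groups = merge_same_as(statements)
```

Here the three person URIs end up in one group even though the first and third were never linked directly, which is what lets separately published datasets be queried as one.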