SlideShare une entreprise Scribd logo
1  sur  57
Télécharger pour lire hors ligne
 
Sandra	
  Gesing	
  
Center	
  for	
  Research	
  Compu6ng	
  
sandra.gesing@nd.edu	
  
	
  
12	
  February	
  2016	
  
Usability,	
  Reusability	
  and	
  
Reproducibility	
  of	
  Bioinforma6c	
  
Applica6ons	
  
University	
  of	
  Notre	
  Dame	
  
Sandra	
  Gesing 	
   	
   	
  	
  	
  	
  	
   	
   	
   	
   	
   	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  2	
  
hHp://chartsbin.com/view/1124	
  hHp://chartsbin.com/view/1124	
  
• 	
  In	
  the	
  middle	
  of	
  nowhere	
  of	
  northern	
  Indiana	
  	
  
	
  (1.5	
  h	
  from	
  here)	
  
• 	
  4	
  undergraduate	
  colleges	
  	
  
• 	
  ~35	
  research	
  ins6tutes	
  
	
  and	
  centers	
  
• 	
  ~12,000	
  students	
  
Center	
  for	
  Research	
  Compu6ng	
  
Sandra	
  Gesing 	
   	
   	
  	
   	
   	
   	
   	
   	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  3	
  
• 	
  SoSware	
  development	
  and	
  profiling	
  
• 	
  Cyberinfrastructure/science	
  gateway	
  development	
  
• 	
  Geographical	
  Informa6on	
  Systems	
  
• 	
  Visualiza6on	
  Support	
  
• 	
  Computa6onal	
  Scien6st	
  support	
  
• 	
  Collabora6ve	
  research/	
  
	
  grant	
  development	
  	
  	
  	
  
• 	
  System	
  administra6on/	
  
	
  design	
  and	
  acquisi6on	
  
• 	
  ~40	
  researchers,	
  	
  
	
  research	
  programmers,	
  	
  
	
  HPC	
  specialists	
  
CRC	
  and	
  OIT	
  building	
  hHp://crc.nd.edu	
  
Center	
  for	
  Research	
  Compu6ng	
  
Sandra	
  Gesing 	
   	
   	
  	
  	
  	
  	
   	
   	
   	
   	
   	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  4	
  
• 	
  Computa6onal	
  resources:	
  25,000	
  cores+	
  
• 	
  Storage	
  resources:	
  3	
  PB	
  
• 	
  Visualiza6on	
  systems	
  	
  
• 	
  Systems	
  for	
  virtual	
  hos6ng	
  	
  
• 	
  Prototype	
  architectures	
  
	
  e.g.,	
  Docker,	
  OpenStack	
  
• 	
  Access	
  and	
  interface	
  to	
  
•  XSEDE	
  	
  
•  Open	
  Science	
  Grid	
  	
  
•  Blue	
  Waters	
  	
  	
  
CRC	
  HPC	
  Center	
  (old	
  Union	
  Sta6on)	
  
Bioinforma6cs	
  
Sandra	
  Gesing 	
   	
   	
  	
  	
  	
  	
   	
   	
   	
   	
   	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  5	
  
•  Genomics	
  
•  Proteomics	
  
•  Metabolomics	
  
•  Immunomics	
  
•  System	
  biology	
  
•  Molecular	
  simula6ons	
  
•  Docking	
  
•  Epidemiology	
  
•  …	
  
Black	
  Swallowtail	
  –	
  	
  
larvae	
  and	
  buHerfly	
  
The	
  Genomics	
  Boom	
  
Sandra	
  Gesing 	
   	
   	
  	
  	
  	
  	
   	
   	
   	
   	
   	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  6	
  
February	
  16,	
  2001	
  
	
  biotech	
  company	
  Celera	
  	
  
February	
  15,	
  2001	
  
The	
  Human	
  Genome	
  Project	
  	
  
The	
  Genomics	
  Boom	
  
Sandra	
  Gesing 	
   	
   	
  	
  	
  	
  	
   	
   	
   	
   	
   	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  7	
  
Craig	
  Venter	
  (leS)	
  and	
  Francis	
  Collins	
  (right)	
  
Big	
  Data	
  
Sandra	
  Gesing 	
   	
  	
  	
  	
  	
  	
  	
  	
   	
   	
   	
  	
  	
  	
  	
   	
  	
   	
   	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  8	
  
• 	
  Explosion	
  in	
  the	
  quan6ty,	
  variety	
  and	
  complexity	
  of	
  
	
  data	
  	
  
• 	
  Ques6ons	
  can	
  be	
  answered	
  impossible	
  to	
  even	
  ask	
  
	
  about	
  10	
  years	
  ago	
  
• 	
  Costs	
  far	
  reduced	
  (e.g.,	
  Human	
  Genome	
  project,	
  15	
  
	
  years,	
  ~$2	
  billion;	
  today	
  ~3	
  days,	
  $1000)	
  
Big	
  Data	
  
Sandra	
  Gesing 	
   	
  	
  	
  	
  	
  	
  	
  	
   	
   	
   	
  	
  	
  	
  	
   	
  	
   	
   	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  9	
  
hHp://www.genome.gov/images/content/cost_per_genome_oct2015.jpg	
  
State	
  of	
  the	
  Art	
  	
  	
  
Sandra	
  Gesing 	
   	
   	
  	
  	
  	
  	
   	
  	
   	
   	
   	
   	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  
Data	
  and	
  compute-­‐	
  
intensive	
  problems	
  
High-­‐speed	
  networks	
  
Users	
  generally	
  not	
  
IT	
  specialists	
  Tools	
  and	
  workflow	
  
engines	
  
Web-­‐based	
  	
  
agile	
  frameworks	
   Distributed	
  data	
  and	
  	
  
compu6ng	
  infrastructures	
  
Challenge	
  for	
  Developers	
  	
  
Sandra	
  Gesing 	
   	
   	
  	
  	
  	
  	
   	
   	
   	
   	
   	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  11	
  
Data	
  and	
  compute-­‐	
  
intensive	
  problems	
  
High-­‐speed	
  networks	
  Tools	
  and	
  workflow	
  
engines	
  
Web-­‐based	
  	
  
agile	
  frameworks	
   Distributed	
  data	
  and	
  	
  
compu6ng	
  infrastructures	
  
Users	
  generally	
  not	
  
IT	
  specialists	
  
Need	
  for	
  intui6ve	
  and	
  self-­‐explanatory	
  user	
  
interfaces!	
  
Challenge	
  for	
  Developers	
  	
  
Sandra	
  Gesing 	
   	
   	
  	
  	
  	
  	
   	
   	
   	
   	
   	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  12	
  
Data	
  and	
  compute-­‐	
  
intensive	
  problems	
  
High-­‐speed	
  networks	
  Tools	
  and	
  workflow	
  
engines	
  
Web-­‐based	
  	
  
agile	
  frameworks	
   Distributed	
  data	
  and	
  	
  
compu6ng	
  infrastructures	
  
Users	
  generally	
  not	
  
IT	
  specialists	
  
Challenge	
  for	
  Developers	
  	
  
Sandra	
  Gesing 	
   	
   	
  	
  	
  	
  	
   	
   	
   	
   	
   	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  13	
  
Usability	
  
Sandra	
  Gesing 	
   	
   	
  	
  	
  	
  	
   	
   	
   	
   	
   	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  14	
  
	
  
	
  
“ASer	
  all,	
  usability	
  really	
  just	
  means	
  that	
  making	
  sure	
  
that	
  something	
  works	
  well:	
  that	
  a	
  person	
  …	
  can	
  use	
  the	
  
thing	
  -­‐	
  whether	
  it's	
  a	
  Web	
  site,	
  a	
  fighter	
  jet,	
  or	
  a	
  
revolving	
  door	
  -­‐	
  for	
  its	
  intended	
  purpose	
  without	
  gerng	
  
hopelessly	
  frustrated.”	
  	
  
(Steve	
  Krug	
  in	
  “Don't	
  make	
  me	
  	
  
think!:	
  A	
  Common	
  Sense	
  Approach	
  
to	
  Web	
  Usability”,	
  2005)	
  
Reusability	
  
Sandra	
  Gesing 	
   	
  	
  	
  	
  	
  	
  	
  	
   	
   	
  	
  	
  	
  	
   	
  	
   	
   	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  15	
  
“The	
  key	
  to	
  produc6vity	
  is	
  reusability.	
  The	
  easiest	
  way	
  to	
  	
  
produce	
  code	
  is	
  obviously	
  to	
  have	
  it	
  already!"	
  	
  
(John	
  R.	
  Bourne	
  in	
  “Object-­‐oriented	
  Engineering:	
  Building	
  Engineering	
  	
  
Systems	
  Using	
  Smalltalk-­‐80”,	
  1992)	
  
Reproducibility	
  
Sandra	
  Gesing 	
   	
  	
  	
  	
  	
  	
  	
  	
   	
   	
   	
  	
  	
  	
  	
   	
  	
   	
   	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  16	
  
“The	
  closeness	
  of	
  agreement	
  between	
  independent	
  
results	
  obtained	
  with	
  the	
  same	
  method	
  on	
  iden6cal	
  
test	
  material	
  but	
  under	
  different	
  condi6ons	
  
(different	
  operators,	
  different	
  apparatus,	
  different	
  
laboratories	
  and/or	
  aSer	
  different	
  intervals	
  of	
  6me)
…”	
  
(IUPAC	
  (Interna6onal	
  Union	
  of	
  Pure	
  and	
  Applied	
  Chemistry	
  iupac.org)	
  GoldBook)	
  
Reproducibility	
  
Sandra	
  Gesing 	
   	
  	
  	
  	
  	
  	
  	
  	
   	
   	
   	
  	
  	
  	
  	
   	
  	
   	
   	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  17	
  
“The	
  closeness	
  of	
  agreement	
  between	
  independent	
  
results	
  obtained	
  with	
  the	
  same	
  method	
  on	
  iden6cal	
  
test	
  material	
  but	
  under	
  different	
  condi6ons	
  
(different	
  operators,	
  different	
  apparatus,	
  different	
  
laboratories	
  and/or	
  aSer	
  different	
  intervals	
  of	
  6me)
…”	
  
(IUPAC	
  (Interna6onal	
  Union	
  of	
  Pure	
  and	
  Applied	
  Chemistry	
  iupac.org)	
  GoldBook)	
  
Science	
  Gateways	
  
Sandra	
  Gesing 	
   	
   	
  	
  	
  	
  	
  Science	
  Gateways 	
   	
   	
   	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  18	
  
“A	
  Science	
  Gateway	
  is	
  a	
  community-­‐developed	
  set	
  of	
  
tools,	
  applica6ons,	
  and	
  data	
  that	
  is	
  integrated	
  via	
  a	
  
portal	
  or	
  a	
  suite	
  of	
  applica6ons,	
  usually	
  in	
  a	
  graphical	
  
user	
  interface,	
  that	
  is	
  further	
  customized	
  to	
  meet	
  the	
  
needs	
  of	
  a	
  specific	
  community.”	
  	
  
TeraGrid/XSEDE	
  
Science	
  Gateways	
  
Sandra	
  Gesing 	
   	
   	
  	
  	
  	
  	
  Science	
  Gateways 	
   	
   	
   	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  19	
  
Science	
  Gateways	
  
Sandra	
  Gesing 	
   	
   	
  	
  	
  	
  	
  Science	
  Gateways 	
   	
   	
   	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  20	
  
It’s	
  a	
  
Science	
  
Gateway	
  
It’s	
  a	
  
Research	
  
Portal	
  
It’s	
  a	
  
Collaboratory	
  
It’s	
  a	
  
Cyberinfrastructure	
  
It’s	
  
e-­‐Science	
  
eResearch	
  
It’s	
  a	
  
Virtual	
  	
  
Lab	
  
Frameworks	
  and	
  APIs	
  
Sandra	
  Gesing 	
   	
   	
  	
  	
   	
   	
   	
   	
   	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  21	
  
Re-­‐inven6ng	
  is	
  not	
  always	
  necessary..	
  
Frameworks	
  and	
  APIs	
  
Sandra	
  Gesing 	
   	
   	
  	
  	
  	
  	
   	
   	
   	
   	
   	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  22	
  
...	
  and	
  users	
  should	
  get	
  more	
  features	
  easily...	
  
Frameworks	
  and	
  APIs	
  
Sandra	
  Gesing 	
   	
   	
  	
  	
  	
  	
   	
   	
   	
   	
   	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  23	
  
...	
  but	
  the	
  model	
  should	
  fit	
  to	
  the	
  demands	
  of	
  the	
  
community	
  
Science	
  Gateway	
  Technologies	
  
Sandra	
  Gesing 	
   	
   	
  	
  	
  	
  	
   	
   	
   	
   	
   	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  24	
  
• 	
  Agile	
  web	
  frameworks	
  (AngularJS,	
  Seman6c	
  UI)	
  
• 	
  Content	
  management	
  systems	
  (Drupal)	
  
• 	
  Libraries	
  for	
  implementa6on	
  (Django)	
  
• 	
  Science	
  gateway	
  frameworks	
  (Galaxy,	
  WS-­‐PGRADE,	
  
Catania	
  Science	
  Gateway	
  Framework,	
  HubZero)	
  
• 	
  Sta6c	
  layout	
  
• 	
  Layout	
  extendable	
  
• 	
  Workflow-­‐enabled	
  
• 	
  APIs	
  for	
  implementa6on	
  (Apache	
  Airavata,	
  Agave,	
  	
  
Vine	
  Toolkit)	
  
	
  
	
  
Science	
  Gateway	
  Technologies	
  
Sandra	
  Gesing 	
   	
  	
  	
  	
  	
  	
  	
   	
   	
   	
   	
  	
   	
   	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  25	
  
Development	
  of	
  Science	
  Gateways	
  	
  	
  
Sandra	
  Gesing 	
   	
   	
  	
  	
  	
  	
   	
   	
   	
   	
   	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  26	
  
Crucial	
  Topics	
  
•  Close	
  collabora6on	
  with	
  user	
  communi6es	
  
•  Knowledge	
  about	
  available	
  technical	
  solu6ons	
  
Sounds	
  easy	
  but…	
  
•  Requirements	
  of	
  user	
  communi6es	
  oSen	
  not	
  so	
  
clear	
  
•  Technologies	
  some6mes	
  s6ll	
  under	
  development	
  
for	
  certain	
  building	
  blocks	
  
è Slow	
  uptake	
  of	
  solu6ons	
  	
  
è Larger	
  effort	
  for	
  crea6ng	
  science	
  gateways	
  
New	
  Science	
  Gateways	
  -­‐	
  Checklist	
  	
  	
  
Sandra	
  Gesing 	
   	
   	
  	
  	
  	
  	
   	
   	
   	
   	
   	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  27	
  
	
  
	
  
	
  
	
  
	
  
	
  
	
  
	
  
	
  
	
  
	
  
Organiza6onal	
  
Aspects	
  
Technical	
  
Aspects	
  
Domain-­‐Specific	
  
Aspects	
  
Developers	
   Domain	
  Experts	
  
New	
  Science	
  Gateways	
  -­‐	
  Checklist	
  	
  	
  
Sandra	
  Gesing 	
   	
   	
  	
  	
  	
  	
  Science	
  Gateways 	
   	
   	
   	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  28	
  
Domain-­‐specific	
  aspects:	
  
•  Goal,	
  target	
  area	
  and	
  target	
  users	
  	
  
•  Visions/demands	
  on	
  the	
  layout	
  
•  Priori6es	
  of	
  features	
  and	
  op6ons,	
  e.g.,	
  a	
  list	
  
from	
  must-­‐have	
  to	
  great-­‐to-­‐have	
  op6ons	
  
•  Integra6on	
  of	
  exis6ng	
  applica6ons	
  or	
  
development	
  of	
  applica6ons	
  
•  Technologies	
  of	
  the	
  applica6ons	
  
•  Visualiza6on	
  
•  Security	
  demands	
  
•  Workflows	
  
New	
  Science	
  Gateways	
  -­‐	
  Checklist	
  	
  	
  
Sandra	
  Gesing 	
   	
   	
  	
  	
  	
  	
  Science	
  Gateways 	
   	
   	
   	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  29	
  
Organiza6onal	
  aspects:	
  
•  Time	
  constraints	
  for	
  the	
  development,	
  
agreement	
  on	
  a	
  (maybe	
  even	
  rough)	
  project	
  
plan	
  with	
  milestones	
  	
  
•  Agreement	
  on	
  alpha-­‐	
  or	
  beta-­‐tester	
  
•  Regular	
  mee6ngs	
  	
  
New	
  Science	
  Gateways	
  -­‐	
  Checklist	
  	
  	
  
Sandra	
  Gesing 	
   	
   	
  	
  	
  	
  	
  Science	
  Gateways 	
   	
   	
   	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  30	
  
Technical	
  aspects:	
  
•  Experience	
  with	
  exis6ng	
  frameworks	
  and	
  
programming	
  languages	
  
•  Available	
  infrastructure	
  including	
  security	
  
infrastructure	
  and	
  resources	
  
•  Available	
  support	
  of	
  suitable	
  technologies	
  
•  Scalability	
  of	
  suitable	
  technologies	
  
•  Effort	
  for	
  extending	
  exis6ng	
  technologies	
  
compared	
  to	
  novel	
  developments	
  	
  
•  Synergy	
  effects	
  with	
  other	
  science	
  gateway	
  
projects	
  
Science	
  Gateways	
  	
  	
  
Sandra	
  Gesing 	
   	
   	
  	
  	
  	
  	
  Science	
  Gateways 	
   	
   	
   	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  31	
  
A	
  new	
  era…	
  
•  Novel	
  developments	
  of	
  web-­‐based	
  agile	
  
frameworks	
  
•  Infrastructure	
  providers	
  report	
  that	
  science	
  
gateways	
  are	
  more	
  used	
  than	
  commandlines	
  
	
  
hHp://www.iplantcollabora6ve.org	
  
Science	
  Gateways	
  	
  	
  
Sandra	
  Gesing 	
   	
   	
  	
  	
  	
  	
  Science	
  Gateways 	
   	
   	
   	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  32	
  
A	
  new	
  era…	
  
•  Novel	
  developments	
  of	
  web-­‐based	
  agile	
  
frameworks	
  
•  Infrastructure	
  providers	
  report	
  that	
  science	
  
gateways	
  are	
  more	
  used	
  than	
  commandlines	
  
But	
  also	
  always	
  new	
  challenges…	
  
•  Novel	
  infrastructures	
  
•  Novel	
  data	
  sources	
  such	
  as	
  the	
  next	
  Next-­‐Gen	
  
Sequencing	
  
è Support	
  of	
  developers	
  necessary	
  	
  
Science	
  Gateway	
  Ins6tute	
  
Sandra	
  Gesing 	
   	
   	
  	
  	
  	
  	
  Science	
  Gateways 	
   	
   	
   	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  33	
  
2012	
  NSF	
  SoSware	
  Ins6tute	
  conceptualiza6on	
  award	
  
2015	
  NSF	
  SoSware	
  Ins6tute	
  implementa6on	
  proposal	
  
($15M)	
  
	
  
Services	
  
•  Incubator	
  
•  Developer	
  support	
  team	
  
•  Gateway	
  framework	
  directory	
  
•  Workforce	
  development	
  
	
  
hHp://sciencegateways.org	
  
Science	
  Gateway	
  Survey	
  2014	
  	
  
Sandra	
  Gesing 	
   	
   	
  	
  	
  	
  	
  Science	
  Gateways 	
   	
   	
   	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  34	
  
•  29,000-­‐person	
  survey	
  	
  
•  4957	
  responses	
  from	
  across	
  domains	
  
Science	
  Gateway	
  Survey	
  2014	
  	
  
Sandra	
  Gesing 	
   	
   	
  	
  	
  	
  	
  Science	
  Gateways 	
   	
   	
   	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  35	
  
What	
  services	
  	
  
would	
  be	
  helpful?	
  
Bioinforma6c	
  Infrastructure	
  Survey	
  
Sandra	
  Gesing 	
   	
  	
  	
  	
  	
  	
  	
  	
   	
   	
   	
  	
  	
  	
  	
   	
  	
   	
   	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  36	
  
• 	
  Nick	
  Loman	
  
	
  (Birmingham,	
  UK)	
  	
  
•  Thomas	
  Connor	
  	
  
	
  (Cardiff,	
  UK)	
  
	
  
• 	
  October	
  2015	
  
• 	
  272	
  answers	
  
	
  
hHps://drive.google.com/drive/folders/0B7KZv1TRi06fLUJCU1BYM3JScjg	
  
	
  
Bioinforma6c	
  Infrastructure	
  Survey	
  
Sandra	
  Gesing 	
   	
  	
  	
  	
  	
  	
  	
  	
   	
   	
   	
   	
  	
   	
   	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  37	
  
Bioinforma6c	
  Infrastructure	
  Survey	
  
Sandra	
  Gesing 	
   	
  	
  	
  	
  	
  	
  	
  	
   	
   	
   	
  	
  	
  	
  	
   	
  	
   	
   	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  38	
  
0" 20" 40" 60" 80" 100" 120"
Cloud"
Ins0tu0on2wide"resource"
Local"resource"
Personal"computer"
Where	
  do	
  bioinforma6cians	
  do	
  most	
  of	
  their	
  work	
  
Bioinforma6c	
  Infrastructure	
  Survey	
  
Sandra	
  Gesing 	
   	
  	
  	
  	
  	
  	
  	
  	
   	
   	
   	
  	
  	
  	
  	
   	
  	
   	
   	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  39	
  
0" 20" 40" 60" 80" 100" 120"
Cloud"
Ins0tu0on2wide"resource"
Local"resource"
Personal"computer"
0.00%$ 10.00%$20.00%$30.00%$40.00%$50.00%$60.00%$70.00%$80.00%$90.00%$
Best$for$job$
Good$documenta>on$
Word$of$mouth$recommenda>on$
Used$in$similar$analysis$
Quickest$
Already$installed$on$server$
Other$
Graphical$interface$
Where	
  do	
  bioinforma6cians	
  do	
  most	
  of	
  their	
  work	
  
Why	
  do	
  bioinforma6cians	
  use	
  the	
  soSware	
  they	
  use	
  
Bioinforma6c	
  Infrastructure	
  Survey	
  
Sandra	
  Gesing 	
   	
  	
  	
  	
  	
  	
  	
  	
   	
   	
   	
  	
  	
  	
  	
   	
  	
   	
   	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  40	
  
0" 20" 40" 60" 80" 100" 120"
Cloud"
Ins0tu0on2wide"resource"
Local"resource"
Personal"computer"
0.00%$ 10.00%$20.00%$30.00%$40.00%$50.00%$60.00%$70.00%$80.00%$90.00%$
Best$for$job$
Good$documenta>on$
Word$of$mouth$recommenda>on$
Used$in$similar$analysis$
Quickest$
Already$installed$on$server$
Other$
Graphical$interface$
Where	
  do	
  bioinforma6cians	
  do	
  most	
  of	
  their	
  work	
  
Why	
  do	
  bioinforma6cians	
  use	
  the	
  soSware	
  they	
  use	
  
Bioinforma6c	
  Infrastructure	
  Survey	
  
Sandra	
  Gesing 	
   	
  	
  	
  	
  	
  	
  	
  	
   	
   	
   	
  	
  	
  	
  	
   	
  	
   	
   	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  41	
  
Ques6ons	
  around	
  frustra6on	
  and	
  limita6ons	
  of	
  using	
  
• 	
  Bioinforma6c	
  soSware	
  
• 	
  Bioinforma6c	
  resources	
  
• 	
  HPC	
  and	
  Cloud	
  infrastructures	
  
and	
  about	
  challenges	
  to	
  train	
  students	
  in	
  bioinforma6cs	
  
	
  
Answers	
  oSen	
  address	
  
•  Hurdles	
  to	
  use	
  bioinforma6c	
  resources	
  because	
  of	
  
commandline	
  access	
  or	
  not	
  available	
  soSware	
  
•  Quality	
  of	
  documenta6on	
  of	
  soSware	
  
•  Need	
  for	
  parsers	
  and	
  converters	
  for	
  diverse	
  data	
  formats	
  
•  Long	
  wai6ng	
  6me	
  for	
  support	
  or	
  even	
  lack	
  of	
  support	
  
	
  
Challenges	
  	
  
Sandra	
  Gesing 	
   	
  	
  	
  	
  	
  	
  	
  	
   	
   	
   	
  	
  	
  	
  	
   	
  	
   	
   	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  42	
  
A	
  world-­‐wide	
  research	
  compu6ng	
  infrastructure	
  
•  Transparent	
  service	
  selec6on	
  
•  e.g.,	
  Docker	
  could	
  be	
  part	
  of	
  the	
  solu6on	
  
•  Access	
  to	
  data	
  irrespec6ve	
  of	
  loca6on	
  
•  Op6ons	
  to	
  share	
  data	
  efficiently	
  
•  Appropriate	
  privacy	
  and	
  security	
  measures	
  
•  Op6mized	
  usage	
  of	
  resources	
  
•  e.g.,	
  op6mized	
  usage	
  of	
  cloud	
  compu6ng	
  and	
  their	
  
business	
  models	
  
Researchers	
  
Sandra	
  Gesing 	
   	
  	
  	
  	
  	
  	
  	
  	
   	
   	
   	
  	
  	
  	
  	
   	
  	
   	
   	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  43	
  
~7	
  million	
  researchers	
  world	
  wide	
  
	
  
hHp://chartsbin.com/view/1124	
  
High-­‐Speed	
  Network	
  
Sandra	
  Gesing 	
   	
  	
  	
  	
  	
  	
  	
  	
   	
   	
   	
  	
  	
  	
  	
   	
  	
   	
   	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  44	
  
Challenges	
  	
  
Sandra	
  Gesing 	
   	
  	
  	
  	
  	
  	
  	
  	
   	
   	
   	
  	
  	
  	
  	
   	
  	
   	
   	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  45	
  
Integra6on	
  of	
  data	
  sources	
  and	
  instruments	
  
•  Different	
  data	
  formats	
  
•  Different	
  interfaces	
  
•  Different	
  hardwares	
  and	
  technologies	
  
	
  
…	
  from	
  small	
  ones	
  to	
  the	
  big	
  ones…	
  
Challenges	
  	
  
Sandra	
  Gesing 	
   	
  	
  	
  	
  	
  	
  	
  	
   	
   	
   	
  	
  	
  	
   	
  	
   	
   	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  46	
  
SoSware	
  searchability,	
  reproducibility	
  and	
  reusability	
  
•  Science	
  gateways	
  step	
  in	
  the	
  right	
  direc6on	
  but	
  …	
  
much	
  more	
  work	
  necessary	
  on	
  searchibility…	
  Not	
  only	
  
finding	
  any	
  data	
  for	
  a	
  research	
  area	
  but	
  finding	
  the	
  right	
  
data	
  
•  Metadata	
  approaches	
  
•  Dic6onaries	
  
•  More	
  involvement	
  of	
  	
  
librarians	
  
	
  
	
  
Challenges	
  	
  
Sandra	
  Gesing 	
   	
  	
  	
  	
  	
  	
  	
  	
   	
   	
   	
  	
  	
  	
   	
  	
   	
   	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  47	
  
SoSware	
  searchability,	
  reproducibility	
  and	
  reusability	
  
•  Science	
  gateways	
  step	
  in	
  the	
  right	
  direc6on	
  but	
  …	
  
much	
  more	
  work	
  necessary	
  on	
  reproducibility	
  and	
  
reusability…	
  	
  
•  studies	
  in	
  medicine	
  and	
  pharmacology:	
  11%	
  or	
  6%	
  of	
  the	
  
analysed	
  research	
  was	
  reproducible	
  	
  
•  myExperiment:	
  only	
  20%	
  of	
  workflows	
  reusable	
  because	
  
of	
  dependencies	
  on	
  hardware,	
  local	
  or	
  distributed	
  data,	
  
soSware	
  versions	
  	
  
	
  
	
  	
  
	
  
Challenges	
  	
  
Sandra	
  Gesing 	
   	
  	
  	
  	
  	
  	
  	
  	
   	
   	
   	
  	
  	
  	
   	
  	
   	
   	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  48	
  
SoSware	
  searchability,	
  reproducibility	
  and	
  reusability	
  
•  Science	
  gateways	
  and	
  workflow	
  systems	
  step	
  in	
  the	
  
right	
  direc6on	
  but	
  …	
  
much	
  more	
  work	
  necessary	
  on	
  reproducibility	
  and	
  
reusability…	
  	
  
	
  
•  Containeriza6on	
  approaches	
  
•  Migra6on	
  approaches	
  
•  Combina6on	
  of	
  both	
  
	
  	
  
	
  
Challenges	
  –	
  Novel	
  and	
  Old...	
  	
  	
  	
  
Sandra	
  Gesing 	
   	
   	
  	
  	
  	
  	
  Science	
  Gateways 	
   	
   	
   	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  49	
  
…	
  require	
  novel	
  solu6ons!	
  
Projects	
  -­‐	
  OSF	
  
Sandra	
  Gesing 	
   	
   	
  	
  	
  	
  	
  Science	
  Gateways 	
   	
   	
   	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  50	
  
	
  
•  Big	
  Data	
  
•  Reproducibility	
  
	
  
Open	
  Access	
  to	
  Data	
  and	
  Projects	
  could	
  solve	
  
parts	
  of	
  the	
  problems…	
  
Projects	
  -­‐	
  WSSSPE	
  
Sandra	
  Gesing 	
   	
   	
  	
  	
  	
  	
  Science	
  Gateways 	
   	
   	
   	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  51	
  
	
  
	
  
	
  
Need	
  of	
  founda6onal	
  
building	
  blocks	
  and	
  
a	
  reward	
  system	
  for	
  
soSware	
  engineering!	
  
	
  
	
  
	
  
	
  
hHps://github.com/wssspe	
  
Early	
  
adopters	
  
Publicity	
  
Wider	
  
adop3on	
  
Funding	
  
ends	
  
Scien3sts	
  
disillusioned	
  
New	
  
project	
  
prototype	
  
Projects	
  –	
  B3	
  Book	
  
Sandra	
  Gesing 	
   	
   	
  	
  	
  	
  	
  Science	
  Gateways 	
   	
   	
   	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  52	
  
Biology,	
  Bioinforma6cs	
  and	
  Big	
  Data	
  
	
  
	
  	
  
	
  
	
  
arXiv:1511.02689	
  [cs.DC]	
  
EU	
  COST	
  Ac6on	
  cHiPSet	
  (IC1406)	
  
Sandra	
  Gesing 	
   	
   	
  	
  	
  	
  	
  Science	
  Gateways 	
   	
   	
   	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  53	
  
cHiPSet	
  –	
  High	
  Performance	
  Modeling	
  and	
  Simula6on	
  	
  
for	
  Big	
  Data	
  Applica6ons	
  	
  
•  April	
  2015	
  –	
  April	
  2019	
  
•  15	
  countries	
  -­‐	
  12	
  COST,	
  3	
  non-­‐COST	
  (US,	
  China,	
  Australia)	
  
•  37	
  reseach	
  organiza6ons/companies	
  (31	
  COST,	
  6	
  non-­‐
COST)	
  
	
  
	
  	
  
	
  
	
  
hHp://www.cost.eu/COST_Ac6ons/ict/Ac6ons/IC1406	
  
EU	
  COST	
  Ac6on	
  cHiPSet	
  
Sandra	
  Gesing 	
   	
   	
  	
  	
  	
  	
  Science	
  Gateways 	
   	
   	
   	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  54	
  
cHiPSet	
  -­‐	
  Collabora6ons	
  
Sandra	
  Gesing 	
   	
   	
  	
  	
  	
  	
  Science	
  Gateways 	
   	
   	
   	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  55	
  
Projects	
  declared	
  interest	
  for	
  collabora6on	
  
•  NESUS	
  (Network	
  for	
  Sustainable	
  Ultrascale	
  
Compu6ng)	
  	
  
hHp://www.nesus.eu/	
  
•  KEYSTONE	
  (Seman6c	
  keyword-­‐based	
  search	
  on	
  
structured	
  data	
  sources)	
  
hHp://www.keystone-­‐cost.eu/	
  
•  AAPELE	
  (Algorithms,	
  Architectures	
  and	
  Pla•orms	
  for	
  
Enhanced	
  Living	
  Environment)	
  
hHp://aapele.eu/	
  
	
  
	
  And	
  maybe	
  YOU?	
  
	
  
	
  
Informa6on	
  on	
  Science	
  Gateways	
  
Sandra	
  Gesing 	
   	
   	
  	
  	
  	
  	
  Science	
  Gateways 	
   	
   	
   	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  56	
  
• 	
  Science	
  Gateway	
  Workshops	
  	
  
	
  Europe:	
  IWSG	
  -­‐	
  hHp://iwsg.info	
  
USA:	
  GCE	
  -­‐	
  hHp://sciencegateways.org	
  
Australasia:	
  IWSG-­‐A	
  -­‐	
  hHp://iwsg.info	
  
• 	
  Science	
  Gateway	
  Ins6tute	
  
	
  hHp://sciencegateways.org	
  
• 	
  IEEE	
  Technical	
  Area	
  on	
  Science	
  Gateways	
  	
  
	
   	
  hHp://ieeesciencegateways.org	
  
• 	
  XSEDE	
  Science	
  Gateways	
  
	
  hHps://www.xsede.org/gateways-­‐overview	
  
• 	
  CRC	
  Science	
  Gateways	
  
	
  hHps://crc.nd.edu/index.php/research/gateways	
  
	
  
Sandra	
  Gesing 	
   	
   	
  	
  	
  	
  	
  Science	
  Gateways 	
   	
   	
   	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  57	
  
sandra.gesing@nd.edu	
  

Contenu connexe

Tendances

2015 04 bio it world
2015 04 bio it world2015 04 bio it world
2015 04 bio it worldChris Dwan
 
2017 bio it world
2017 bio it world2017 bio it world
2017 bio it worldChris Dwan
 
The Paris-Saclay Center for Data Science
The Paris-Saclay Center for Data ScienceThe Paris-Saclay Center for Data Science
The Paris-Saclay Center for Data ScienceBalázs Kégl
 
2015 09 emc lsug
2015 09 emc lsug2015 09 emc lsug
2015 09 emc lsugChris Dwan
 
Love for science or 'Academic Prostitution' - DFD2014 version
Love for science or 'Academic Prostitution' - DFD2014 versionLove for science or 'Academic Prostitution' - DFD2014 version
Love for science or 'Academic Prostitution' - DFD2014 versionLourdes Verdes-Montenegro
 
2016 05 sanger
2016 05 sanger2016 05 sanger
2016 05 sangerChris Dwan
 
Deep learning and the systemic challenges of data science initiatives
Deep learning and the systemic challenges of data science initiativesDeep learning and the systemic challenges of data science initiatives
Deep learning and the systemic challenges of data science initiativesBalázs Kégl
 

Tendances (7)

2015 04 bio it world
2015 04 bio it world2015 04 bio it world
2015 04 bio it world
 
2017 bio it world
2017 bio it world2017 bio it world
2017 bio it world
 
The Paris-Saclay Center for Data Science
The Paris-Saclay Center for Data ScienceThe Paris-Saclay Center for Data Science
The Paris-Saclay Center for Data Science
 
2015 09 emc lsug
2015 09 emc lsug2015 09 emc lsug
2015 09 emc lsug
 
Love for science or 'Academic Prostitution' - DFD2014 version
Love for science or 'Academic Prostitution' - DFD2014 versionLove for science or 'Academic Prostitution' - DFD2014 version
Love for science or 'Academic Prostitution' - DFD2014 version
 
2016 05 sanger
2016 05 sanger2016 05 sanger
2016 05 sanger
 
Deep learning and the systemic challenges of data science initiatives
Deep learning and the systemic challenges of data science initiativesDeep learning and the systemic challenges of data science initiatives
Deep learning and the systemic challenges of data science initiatives
 

En vedette

Planificación y control de proyectos
Planificación y control de proyectosPlanificación y control de proyectos
Planificación y control de proyectosDavixe
 
Termos associados ao nome
Termos associados ao nomeTermos associados ao nome
Termos associados ao nomeAna Thátila
 
Creación de ambientes personais de aprendizaxe. modulo 1
Creación de ambientes personais de aprendizaxe. modulo 1Creación de ambientes personais de aprendizaxe. modulo 1
Creación de ambientes personais de aprendizaxe. modulo 1Nacho Valverde
 
Protocolos wan tema4_ciclo_i_2016
Protocolos wan tema4_ciclo_i_2016Protocolos wan tema4_ciclo_i_2016
Protocolos wan tema4_ciclo_i_2016luisantonio222
 
Modelo modelo plan de emergencia servicio de prevenci-n
Modelo modelo plan de emergencia servicio de prevenci-nModelo modelo plan de emergencia servicio de prevenci-n
Modelo modelo plan de emergencia servicio de prevenci-nJULIO VIVAR GONZALEZ
 
10 MUST HAVE METRICS FOR BUSINESS DEVELOPMENT
10 MUST HAVE METRICS FOR BUSINESS DEVELOPMENT10 MUST HAVE METRICS FOR BUSINESS DEVELOPMENT
10 MUST HAVE METRICS FOR BUSINESS DEVELOPMENTRobert Meza
 
Unidad 1 fuego y características
Unidad 1 fuego y característicasUnidad 1 fuego y características
Unidad 1 fuego y característicasAide Rodriguez
 
Selección, instalación y configuración del software de servidor de mensajería...
Selección, instalación y configuración del software de servidor de mensajería...Selección, instalación y configuración del software de servidor de mensajería...
Selección, instalación y configuración del software de servidor de mensajería...Editorial CEP
 
Thesis on the Hart-Fuller debate Should Law and Morality be intertwined- Dyla...
Thesis on the Hart-Fuller debate Should Law and Morality be intertwined- Dyla...Thesis on the Hart-Fuller debate Should Law and Morality be intertwined- Dyla...
Thesis on the Hart-Fuller debate Should Law and Morality be intertwined- Dyla...Dylan Carty
 
10 factores humanos que causan accidentes 14/07/16
10 factores humanos que causan accidentes 14/07/1610 factores humanos que causan accidentes 14/07/16
10 factores humanos que causan accidentes 14/07/16TVPerú
 
1. Interpretación ley n° 29783
1.  Interpretación ley n° 297831.  Interpretación ley n° 29783
1. Interpretación ley n° 29783TVPerú
 
Foro temático accidente laboral relacionado con trabajo en alturas
Foro temático accidente laboral relacionado con trabajo en alturasForo temático accidente laboral relacionado con trabajo en alturas
Foro temático accidente laboral relacionado con trabajo en alturas.. ..
 

En vedette (17)

Aparatos nutricion3º
Aparatos nutricion3ºAparatos nutricion3º
Aparatos nutricion3º
 
Planificación y control de proyectos
Planificación y control de proyectosPlanificación y control de proyectos
Planificación y control de proyectos
 
Sgci galaxy-06-29-16
Sgci galaxy-06-29-16Sgci galaxy-06-29-16
Sgci galaxy-06-29-16
 
Lunes 30 del2012
Lunes 30 del2012Lunes 30 del2012
Lunes 30 del2012
 
Termos associados ao nome
Termos associados ao nomeTermos associados ao nome
Termos associados ao nome
 
Creación de ambientes personais de aprendizaxe. modulo 1
Creación de ambientes personais de aprendizaxe. modulo 1Creación de ambientes personais de aprendizaxe. modulo 1
Creación de ambientes personais de aprendizaxe. modulo 1
 
Protocolos wan tema4_ciclo_i_2016
Protocolos wan tema4_ciclo_i_2016Protocolos wan tema4_ciclo_i_2016
Protocolos wan tema4_ciclo_i_2016
 
Modelo modelo plan de emergencia servicio de prevenci-n
Modelo modelo plan de emergencia servicio de prevenci-nModelo modelo plan de emergencia servicio de prevenci-n
Modelo modelo plan de emergencia servicio de prevenci-n
 
10 MUST HAVE METRICS FOR BUSINESS DEVELOPMENT
10 MUST HAVE METRICS FOR BUSINESS DEVELOPMENT10 MUST HAVE METRICS FOR BUSINESS DEVELOPMENT
10 MUST HAVE METRICS FOR BUSINESS DEVELOPMENT
 
Proteínas
ProteínasProteínas
Proteínas
 
Unidad 1 fuego y características
Unidad 1 fuego y característicasUnidad 1 fuego y características
Unidad 1 fuego y características
 
Selección, instalación y configuración del software de servidor de mensajería...
Selección, instalación y configuración del software de servidor de mensajería...Selección, instalación y configuración del software de servidor de mensajería...
Selección, instalación y configuración del software de servidor de mensajería...
 
Thesis on the Hart-Fuller debate Should Law and Morality be intertwined- Dyla...
Thesis on the Hart-Fuller debate Should Law and Morality be intertwined- Dyla...Thesis on the Hart-Fuller debate Should Law and Morality be intertwined- Dyla...
Thesis on the Hart-Fuller debate Should Law and Morality be intertwined- Dyla...
 
10 factores humanos que causan accidentes 14/07/16
10 factores humanos que causan accidentes 14/07/1610 factores humanos que causan accidentes 14/07/16
10 factores humanos que causan accidentes 14/07/16
 
1. Interpretación ley n° 29783
1.  Interpretación ley n° 297831.  Interpretación ley n° 29783
1. Interpretación ley n° 29783
 
Foro temático accidente laboral relacionado con trabajo en alturas
Foro temático accidente laboral relacionado con trabajo en alturasForo temático accidente laboral relacionado con trabajo en alturas
Foro temático accidente laboral relacionado con trabajo en alturas
 
ERD-Salesforce
ERD-SalesforceERD-Salesforce
ERD-Salesforce
 

Similaire à Usability, Reusability and Reproducibility of Bioinformatic Applications

Science Gateways for Life Sciences – Balancing Usability and Re-Usability
Science Gateways for Life Sciences – Balancing Usability and Re-Usability Science Gateways for Life Sciences – Balancing Usability and Re-Usability
Science Gateways for Life Sciences – Balancing Usability and Re-Usability Sandra Gesing
 
SGCI - The Science Gateways Community Institute: Going Beyond Borders
SGCI - The Science Gateways Community Institute: Going Beyond BordersSGCI - The Science Gateways Community Institute: Going Beyond Borders
SGCI - The Science Gateways Community Institute: Going Beyond BordersSandra Gesing
 
Data-intensive bioinformatics on HPC and Cloud
Data-intensive bioinformatics on HPC and CloudData-intensive bioinformatics on HPC and Cloud
Data-intensive bioinformatics on HPC and CloudOla Spjuth
 
SGCI - Science Gateways: An Overview
SGCI - Science Gateways: An OverviewSGCI - Science Gateways: An Overview
SGCI - Science Gateways: An OverviewSandra Gesing
 
Software Sustainability: Better Software Better Science
Software Sustainability: Better Software Better ScienceSoftware Sustainability: Better Software Better Science
Software Sustainability: Better Software Better ScienceCarole Goble
 
g-Social - Enhancing e-Science Tools with Social Networking Functionality
g-Social - Enhancing e-Science Tools with Social Networking Functionalityg-Social - Enhancing e-Science Tools with Social Networking Functionality
g-Social - Enhancing e-Science Tools with Social Networking FunctionalityNicholas Loulloudes
 
SGCI-URSSI-Sustainability in Research Computing
SGCI-URSSI-Sustainability in Research ComputingSGCI-URSSI-Sustainability in Research Computing
SGCI-URSSI-Sustainability in Research ComputingSandra Gesing
 
DataFest 2019 Science Gateways
DataFest 2019 Science GatewaysDataFest 2019 Science Gateways
DataFest 2019 Science GatewaysRaminder Singh
 
OntoSoft: A Distributed Semantic Registry for Scientific Software
OntoSoft: A Distributed Semantic Registry for Scientific SoftwareOntoSoft: A Distributed Semantic Registry for Scientific Software
OntoSoft: A Distributed Semantic Registry for Scientific Softwaredgarijo
 
Oscon 2011 Practicing Open Science
Oscon 2011 Practicing Open ScienceOscon 2011 Practicing Open Science
Oscon 2011 Practicing Open ScienceMarcus Hanwell
 
SGCI - The Science Gateways Community Institute: International Collaboration ...
SGCI - The Science Gateways Community Institute: International Collaboration ...SGCI - The Science Gateways Community Institute: International Collaboration ...
SGCI - The Science Gateways Community Institute: International Collaboration ...Sandra Gesing
 
Gridforum David De Roure Newe Science 20080402
Gridforum David De Roure Newe Science 20080402Gridforum David De Roure Newe Science 20080402
Gridforum David De Roure Newe Science 20080402vrij
 
SGCI - Science Gateways - Technology-Enhanced Research Under Consideration of...
SGCI - Science Gateways - Technology-Enhanced Research Under Consideration of...SGCI - Science Gateways - Technology-Enhanced Research Under Consideration of...
SGCI - Science Gateways - Technology-Enhanced Research Under Consideration of...Sandra Gesing
 
SGCI - Science Gateways Bootcamp: Strategies for Developing, Operating and Su...
SGCI - Science Gateways Bootcamp: Strategies for Developing, Operating and Su...SGCI - Science Gateways Bootcamp: Strategies for Developing, Operating and Su...
SGCI - Science Gateways Bootcamp: Strategies for Developing, Operating and Su...Sandra Gesing
 
Enabling Research without Geographical Boundaries via Collaborative Research ...
Enabling Research without Geographical Boundaries via Collaborative Research ...Enabling Research without Geographical Boundaries via Collaborative Research ...
Enabling Research without Geographical Boundaries via Collaborative Research ...Sandra Gesing
 
Showcasing research data tools - Jisc Digifest 2016
Showcasing research data tools - Jisc Digifest 2016Showcasing research data tools - Jisc Digifest 2016
Showcasing research data tools - Jisc Digifest 2016Jisc
 

Similaire à Usability, Reusability and Reproducibility of Bioinformatic Applications (20)

Science Gateways for Life Sciences – Balancing Usability and Re-Usability
Science Gateways for Life Sciences – Balancing Usability and Re-Usability Science Gateways for Life Sciences – Balancing Usability and Re-Usability
Science Gateways for Life Sciences – Balancing Usability and Re-Usability
 
SGCI - The Science Gateways Community Institute: Going Beyond Borders
SGCI - The Science Gateways Community Institute: Going Beyond BordersSGCI - The Science Gateways Community Institute: Going Beyond Borders
SGCI - The Science Gateways Community Institute: Going Beyond Borders
 
Data-intensive bioinformatics on HPC and Cloud
Data-intensive bioinformatics on HPC and CloudData-intensive bioinformatics on HPC and Cloud
Data-intensive bioinformatics on HPC and Cloud
 
Ucsd research-it-09-11-18
Ucsd research-it-09-11-18Ucsd research-it-09-11-18
Ucsd research-it-09-11-18
 
SGCI - Science Gateways: An Overview
SGCI - Science Gateways: An OverviewSGCI - Science Gateways: An Overview
SGCI - Science Gateways: An Overview
 
Final Johnson Research Libraries and Computational Research
Final Johnson Research Libraries and Computational ResearchFinal Johnson Research Libraries and Computational Research
Final Johnson Research Libraries and Computational Research
 
Software Sustainability: Better Software Better Science
Software Sustainability: Better Software Better ScienceSoftware Sustainability: Better Software Better Science
Software Sustainability: Better Software Better Science
 
g-Social - Enhancing e-Science Tools with Social Networking Functionality
g-Social - Enhancing e-Science Tools with Social Networking Functionalityg-Social - Enhancing e-Science Tools with Social Networking Functionality
g-Social - Enhancing e-Science Tools with Social Networking Functionality
 
SGCI-URSSI-Sustainability in Research Computing
SGCI-URSSI-Sustainability in Research ComputingSGCI-URSSI-Sustainability in Research Computing
SGCI-URSSI-Sustainability in Research Computing
 
DataFest 2019 Science Gateways
DataFest 2019 Science GatewaysDataFest 2019 Science Gateways
DataFest 2019 Science Gateways
 
OntoSoft: A Distributed Semantic Registry for Scientific Software
OntoSoft: A Distributed Semantic Registry for Scientific SoftwareOntoSoft: A Distributed Semantic Registry for Scientific Software
OntoSoft: A Distributed Semantic Registry for Scientific Software
 
Oscon 2011 Practicing Open Science
Oscon 2011 Practicing Open ScienceOscon 2011 Practicing Open Science
Oscon 2011 Practicing Open Science
 
Sgci esip-7-20-18
Sgci esip-7-20-18Sgci esip-7-20-18
Sgci esip-7-20-18
 
SGCI - The Science Gateways Community Institute: International Collaboration ...
SGCI - The Science Gateways Community Institute: International Collaboration ...SGCI - The Science Gateways Community Institute: International Collaboration ...
SGCI - The Science Gateways Community Institute: International Collaboration ...
 
Gridforum David De Roure Newe Science 20080402
Gridforum David De Roure Newe Science 20080402Gridforum David De Roure Newe Science 20080402
Gridforum David De Roure Newe Science 20080402
 
SGCI - Science Gateways - Technology-Enhanced Research Under Consideration of...
SGCI - Science Gateways - Technology-Enhanced Research Under Consideration of...SGCI - Science Gateways - Technology-Enhanced Research Under Consideration of...
SGCI - Science Gateways - Technology-Enhanced Research Under Consideration of...
 
SGCI - Science Gateways Bootcamp: Strategies for Developing, Operating and Su...
SGCI - Science Gateways Bootcamp: Strategies for Developing, Operating and Su...SGCI - Science Gateways Bootcamp: Strategies for Developing, Operating and Su...
SGCI - Science Gateways Bootcamp: Strategies for Developing, Operating and Su...
 
Enabling Research without Geographical Boundaries via Collaborative Research ...
Enabling Research without Geographical Boundaries via Collaborative Research ...Enabling Research without Geographical Boundaries via Collaborative Research ...
Enabling Research without Geographical Boundaries via Collaborative Research ...
 
Showcasing research data tools - Jisc Digifest 2016
Showcasing research data tools - Jisc Digifest 2016Showcasing research data tools - Jisc Digifest 2016
Showcasing research data tools - Jisc Digifest 2016
 
SGCI OAC webinar 4 18-19
SGCI OAC webinar 4 18-19SGCI OAC webinar 4 18-19
SGCI OAC webinar 4 18-19
 

Plus de Sandra Gesing

The Reasons Why the Science Gateways Community Needs an Institute
The Reasons Why the Science Gateways Community Needs an InstituteThe Reasons Why the Science Gateways Community Needs an Institute
The Reasons Why the Science Gateways Community Needs an InstituteSandra Gesing
 
Bridging Gaps and Broadening Participation in Today's and Future Research Com...
Bridging Gaps and Broadening Participation inToday's and Future Research Com...Bridging Gaps and Broadening Participation inToday's and Future Research Com...
Bridging Gaps and Broadening Participation in Today's and Future Research Com...Sandra Gesing
 
SGCI and URSSI: Xpert Network – Exchanging Best Practices in Supporting Compu...
SGCI and URSSI: Xpert Network – Exchanging Best Practices in Supporting Compu...SGCI and URSSI: Xpert Network – Exchanging Best Practices in Supporting Compu...
SGCI and URSSI: Xpert Network – Exchanging Best Practices in Supporting Compu...Sandra Gesing
 
Sustainability of HPC Research Computing: Fostering career paths for facilit...
Sustainability of HPC Research Computing:  Fostering career paths for facilit...Sustainability of HPC Research Computing:  Fostering career paths for facilit...
Sustainability of HPC Research Computing: Fostering career paths for facilit...Sandra Gesing
 
URSSI - SGCI - PresQT: Research Software and Science Gateways: Addressing Su...
URSSI - SGCI - PresQT: Research Software and Science Gateways:  Addressing Su...URSSI - SGCI - PresQT: Research Software and Science Gateways:  Addressing Su...
URSSI - SGCI - PresQT: Research Software and Science Gateways: Addressing Su...Sandra Gesing
 
SGCI - URSSI - Science Gateways for Electronics, Photonics and Magnetics: Ach...
SGCI - URSSI - Science Gateways for Electronics, Photonics and Magnetics: Ach...SGCI - URSSI - Science Gateways for Electronics, Photonics and Magnetics: Ach...
SGCI - URSSI - Science Gateways for Electronics, Photonics and Magnetics: Ach...Sandra Gesing
 
The Conceptualization of URSSI - How You Can Engage
The Conceptualization of URSSI - How You Can EngageThe Conceptualization of URSSI - How You Can Engage
The Conceptualization of URSSI - How You Can EngageSandra Gesing
 
SGCI - URSSI - Research Software Engineers, Science Gateway Developers and Cy...
SGCI - URSSI - Research Software Engineers, Science Gateway Developers and Cy...SGCI - URSSI - Research Software Engineers, Science Gateway Developers and Cy...
SGCI - URSSI - Research Software Engineers, Science Gateway Developers and Cy...Sandra Gesing
 
SGCI - Science Gateways Community Institute: Software Registry
SGCI - Science Gateways Community Institute: Software RegistrySGCI - Science Gateways Community Institute: Software Registry
SGCI - Science Gateways Community Institute: Software RegistrySandra Gesing
 
SGCI - RDA - Sustainability of Collaborative Platforms
SGCI - RDA - Sustainability of Collaborative PlatformsSGCI - RDA - Sustainability of Collaborative Platforms
SGCI - RDA - Sustainability of Collaborative PlatformsSandra Gesing
 
SGCI - Science Gateways Community Institute: Subsidized Services and Consulta...
SGCI - Science Gateways Community Institute: Subsidized Services and Consulta...SGCI - Science Gateways Community Institute: Subsidized Services and Consulta...
SGCI - Science Gateways Community Institute: Subsidized Services and Consulta...Sandra Gesing
 
SGCI Science Gateways: Ushering in a New Era of Sustainability
SGCI Science Gateways: Ushering in a New Era of Sustainability SGCI Science Gateways: Ushering in a New Era of Sustainability
SGCI Science Gateways: Ushering in a New Era of Sustainability Sandra Gesing
 
SGCI Science Gateways: Software sustainability via on-campus teams - Webinar ...
SGCI Science Gateways: Software sustainability via on-campus teams - Webinar ...SGCI Science Gateways: Software sustainability via on-campus teams - Webinar ...
SGCI Science Gateways: Software sustainability via on-campus teams - Webinar ...Sandra Gesing
 
SGCI Science Gateways: Addressing Data Management Challenges
SGCI Science Gateways: Addressing Data Management ChallengesSGCI Science Gateways: Addressing Data Management Challenges
SGCI Science Gateways: Addressing Data Management ChallengesSandra Gesing
 
SGCI Science Gateways Landscape in North America
SGCI Science Gateways Landscape in North AmericaSGCI Science Gateways Landscape in North America
SGCI Science Gateways Landscape in North AmericaSandra Gesing
 
SGCI Science Gateways: Harnessing Big Data and Open Data 03-19-2017
SGCI Science Gateways: Harnessing Big Data and Open Data 03-19-2017SGCI Science Gateways: Harnessing Big Data and Open Data 03-19-2017
SGCI Science Gateways: Harnessing Big Data and Open Data 03-19-2017Sandra Gesing
 
SGCI - Science Gateways: Sustainability via On-Campus Teams
SGCI - Science Gateways: Sustainability via On-Campus TeamsSGCI - Science Gateways: Sustainability via On-Campus Teams
SGCI - Science Gateways: Sustainability via On-Campus TeamsSandra Gesing
 
SGCI - The Science Gateways Community Institute - Supporting Communities to A...
SGCI - The Science Gateways Community Institute - Supporting Communities to A...SGCI - The Science Gateways Community Institute - Supporting Communities to A...
SGCI - The Science Gateways Community Institute - Supporting Communities to A...Sandra Gesing
 
Workflows for Biological Research at Notre Dame
Workflows for Biological Research at Notre Dame Workflows for Biological Research at Notre Dame
Workflows for Biological Research at Notre Dame Sandra Gesing
 
User-­friendly Metaworkflows in Quantum Chemistry
User-­friendly Metaworkflows in Quantum ChemistryUser-­friendly Metaworkflows in Quantum Chemistry
User-­friendly Metaworkflows in Quantum ChemistrySandra Gesing
 

Plus de Sandra Gesing (20)

The Reasons Why the Science Gateways Community Needs an Institute
The Reasons Why the Science Gateways Community Needs an InstituteThe Reasons Why the Science Gateways Community Needs an Institute
The Reasons Why the Science Gateways Community Needs an Institute
 
Bridging Gaps and Broadening Participation in Today's and Future Research Com...
Bridging Gaps and Broadening Participation inToday's and Future Research Com...Bridging Gaps and Broadening Participation inToday's and Future Research Com...
Bridging Gaps and Broadening Participation in Today's and Future Research Com...
 
SGCI and URSSI: Xpert Network – Exchanging Best Practices in Supporting Compu...
SGCI and URSSI: Xpert Network – Exchanging Best Practices in Supporting Compu...SGCI and URSSI: Xpert Network – Exchanging Best Practices in Supporting Compu...
SGCI and URSSI: Xpert Network – Exchanging Best Practices in Supporting Compu...
 
Sustainability of HPC Research Computing: Fostering career paths for facilit...
Sustainability of HPC Research Computing:  Fostering career paths for facilit...Sustainability of HPC Research Computing:  Fostering career paths for facilit...
Sustainability of HPC Research Computing: Fostering career paths for facilit...
 
URSSI - SGCI - PresQT: Research Software and Science Gateways: Addressing Su...
URSSI - SGCI - PresQT: Research Software and Science Gateways:  Addressing Su...URSSI - SGCI - PresQT: Research Software and Science Gateways:  Addressing Su...
URSSI - SGCI - PresQT: Research Software and Science Gateways: Addressing Su...
 
SGCI - URSSI - Science Gateways for Electronics, Photonics and Magnetics: Ach...
SGCI - URSSI - Science Gateways for Electronics, Photonics and Magnetics: Ach...SGCI - URSSI - Science Gateways for Electronics, Photonics and Magnetics: Ach...
SGCI - URSSI - Science Gateways for Electronics, Photonics and Magnetics: Ach...
 
The Conceptualization of URSSI - How You Can Engage
The Conceptualization of URSSI - How You Can EngageThe Conceptualization of URSSI - How You Can Engage
The Conceptualization of URSSI - How You Can Engage
 
SGCI - URSSI - Research Software Engineers, Science Gateway Developers and Cy...
SGCI - URSSI - Research Software Engineers, Science Gateway Developers and Cy...SGCI - URSSI - Research Software Engineers, Science Gateway Developers and Cy...
SGCI - URSSI - Research Software Engineers, Science Gateway Developers and Cy...
 
SGCI - Science Gateways Community Institute: Software Registry
SGCI - Science Gateways Community Institute: Software RegistrySGCI - Science Gateways Community Institute: Software Registry
SGCI - Science Gateways Community Institute: Software Registry
 
SGCI - RDA - Sustainability of Collaborative Platforms
SGCI - RDA - Sustainability of Collaborative PlatformsSGCI - RDA - Sustainability of Collaborative Platforms
SGCI - RDA - Sustainability of Collaborative Platforms
 
SGCI - Science Gateways Community Institute: Subsidized Services and Consulta...
SGCI - Science Gateways Community Institute: Subsidized Services and Consulta...SGCI - Science Gateways Community Institute: Subsidized Services and Consulta...
SGCI - Science Gateways Community Institute: Subsidized Services and Consulta...
 
SGCI Science Gateways: Ushering in a New Era of Sustainability
SGCI Science Gateways: Ushering in a New Era of Sustainability SGCI Science Gateways: Ushering in a New Era of Sustainability
SGCI Science Gateways: Ushering in a New Era of Sustainability
 
SGCI Science Gateways: Software sustainability via on-campus teams - Webinar ...
SGCI Science Gateways: Software sustainability via on-campus teams - Webinar ...SGCI Science Gateways: Software sustainability via on-campus teams - Webinar ...
SGCI Science Gateways: Software sustainability via on-campus teams - Webinar ...
 
SGCI Science Gateways: Addressing Data Management Challenges
SGCI Science Gateways: Addressing Data Management ChallengesSGCI Science Gateways: Addressing Data Management Challenges
SGCI Science Gateways: Addressing Data Management Challenges
 
SGCI Science Gateways Landscape in North America
SGCI Science Gateways Landscape in North AmericaSGCI Science Gateways Landscape in North America
SGCI Science Gateways Landscape in North America
 
SGCI Science Gateways: Harnessing Big Data and Open Data 03-19-2017
SGCI Science Gateways: Harnessing Big Data and Open Data 03-19-2017SGCI Science Gateways: Harnessing Big Data and Open Data 03-19-2017
SGCI Science Gateways: Harnessing Big Data and Open Data 03-19-2017
 
SGCI - Science Gateways: Sustainability via On-Campus Teams
SGCI - Science Gateways: Sustainability via On-Campus TeamsSGCI - Science Gateways: Sustainability via On-Campus Teams
SGCI - Science Gateways: Sustainability via On-Campus Teams
 
SGCI - The Science Gateways Community Institute - Supporting Communities to A...
SGCI - The Science Gateways Community Institute - Supporting Communities to A...SGCI - The Science Gateways Community Institute - Supporting Communities to A...
SGCI - The Science Gateways Community Institute - Supporting Communities to A...
 
Workflows for Biological Research at Notre Dame
Workflows for Biological Research at Notre Dame Workflows for Biological Research at Notre Dame
Workflows for Biological Research at Notre Dame
 
User-­friendly Metaworkflows in Quantum Chemistry
User-­friendly Metaworkflows in Quantum ChemistryUser-­friendly Metaworkflows in Quantum Chemistry
User-­friendly Metaworkflows in Quantum Chemistry
 

Dernier

Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 

Dernier (20)

Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 

Usability, Reusability and Reproducibility of Bioinformatic Applications

  • 1.   Sandra  Gesing   Center  for  Research  Compu6ng   sandra.gesing@nd.edu     12  February  2016   Usability,  Reusability  and   Reproducibility  of  Bioinforma6c   Applica6ons  
  • 2. University  of  Notre  Dame   Sandra  Gesing                                                  2   hHp://chartsbin.com/view/1124  hHp://chartsbin.com/view/1124   •   In  the  middle  of  nowhere  of  northern  Indiana      (1.5  h  from  here)   •   4  undergraduate  colleges     •   ~35  research  ins6tutes    and  centers   •   ~12,000  students  
  • 3. Center  for  Research  Compu6ng   Sandra  Gesing                                            3   •   SoSware  development  and  profiling   •   Cyberinfrastructure/science  gateway  development   •   Geographical  Informa6on  Systems   •   Visualiza6on  Support   •   Computa6onal  Scien6st  support   •   Collabora6ve  research/    grant  development         •   System  administra6on/    design  and  acquisi6on   •   ~40  researchers,      research  programmers,      HPC  specialists   CRC  and  OIT  building  hHp://crc.nd.edu  
  • 4. Center  for  Research  Compu6ng   Sandra  Gesing                                                  4   •   Computa6onal  resources:  25,000  cores+   •   Storage  resources:  3  PB   •   Visualiza6on  systems     •   Systems  for  virtual  hos6ng     •   Prototype  architectures    e.g.,  Docker,  OpenStack   •   Access  and  interface  to   •  XSEDE     •  Open  Science  Grid     •  Blue  Waters       CRC  HPC  Center  (old  Union  Sta6on)  
  • 5. Bioinforma6cs   Sandra  Gesing                                                  5   •  Genomics   •  Proteomics   •  Metabolomics   •  Immunomics   •  System  biology   •  Molecular  simula6ons   •  Docking   •  Epidemiology   •  …   Black  Swallowtail  –     larvae  and  buHerfly  
  • 6. The  Genomics  Boom   Sandra  Gesing                                                  6   February  16,  2001    biotech  company  Celera     February  15,  2001   The  Human  Genome  Project    
  • 7. The  Genomics  Boom   Sandra  Gesing                                                  7   Craig  Venter  (leS)  and  Francis  Collins  (right)  
  • 8. Big  Data   Sandra  Gesing                                                                  8   •   Explosion  in  the  quan6ty,  variety  and  complexity  of    data     •   Ques6ons  can  be  answered  impossible  to  even  ask    about  10  years  ago   •   Costs  far  reduced  (e.g.,  Human  Genome  project,  15    years,  ~$2  billion;  today  ~3  days,  $1000)  
  • 9. Big  Data   Sandra  Gesing                                                                  9   hHp://www.genome.gov/images/content/cost_per_genome_oct2015.jpg  
  • 10. State  of  the  Art       Sandra  Gesing                                                     Data  and  compute-­‐   intensive  problems   High-­‐speed  networks   Users  generally  not   IT  specialists  Tools  and  workflow   engines   Web-­‐based     agile  frameworks   Distributed  data  and     compu6ng  infrastructures  
  • 11. Challenge  for  Developers     Sandra  Gesing                                              11   Data  and  compute-­‐   intensive  problems   High-­‐speed  networks  Tools  and  workflow   engines   Web-­‐based     agile  frameworks   Distributed  data  and     compu6ng  infrastructures   Users  generally  not   IT  specialists   Need  for  intui6ve  and  self-­‐explanatory  user   interfaces!  
  • 12. Challenge  for  Developers     Sandra  Gesing                                              12   Data  and  compute-­‐   intensive  problems   High-­‐speed  networks  Tools  and  workflow   engines   Web-­‐based     agile  frameworks   Distributed  data  and     compu6ng  infrastructures   Users  generally  not   IT  specialists  
  • 13. Challenge  for  Developers     Sandra  Gesing                                              13  
  • 14. Usability   Sandra  Gesing                                              14       “ASer  all,  usability  really  just  means  that  making  sure   that  something  works  well:  that  a  person  …  can  use  the   thing  -­‐  whether  it's  a  Web  site,  a  fighter  jet,  or  a   revolving  door  -­‐  for  its  intended  purpose  without  gerng   hopelessly  frustrated.”     (Steve  Krug  in  “Don't  make  me     think!:  A  Common  Sense  Approach   to  Web  Usability”,  2005)  
  • 15. Reusability   Sandra  Gesing                                                                                                    15   “The  key  to  produc6vity  is  reusability.  The  easiest  way  to     produce  code  is  obviously  to  have  it  already!"     (John  R.  Bourne  in  “Object-­‐oriented  Engineering:  Building  Engineering     Systems  Using  Smalltalk-­‐80”,  1992)  
  • 16. Reproducibility   Sandra  Gesing                                                              16   “The  closeness  of  agreement  between  independent   results  obtained  with  the  same  method  on  iden6cal   test  material  but  under  different  condi6ons   (different  operators,  different  apparatus,  different   laboratories  and/or  aSer  different  intervals  of  6me) …”   (IUPAC  (Interna6onal  Union  of  Pure  and  Applied  Chemistry  iupac.org)  GoldBook)  
  • 17. Reproducibility   Sandra  Gesing                                                              17   “The  closeness  of  agreement  between  independent   results  obtained  with  the  same  method  on  iden6cal   test  material  but  under  different  condi6ons   (different  operators,  different  apparatus,  different   laboratories  and/or  aSer  different  intervals  of  6me) …”   (IUPAC  (Interna6onal  Union  of  Pure  and  Applied  Chemistry  iupac.org)  GoldBook)  
  • 18. Science  Gateways   Sandra  Gesing              Science  Gateways                              18   “A  Science  Gateway  is  a  community-­‐developed  set  of   tools,  applica6ons,  and  data  that  is  integrated  via  a   portal  or  a  suite  of  applica6ons,  usually  in  a  graphical   user  interface,  that  is  further  customized  to  meet  the   needs  of  a  specific  community.”     TeraGrid/XSEDE  
  • 19. Science  Gateways   Sandra  Gesing              Science  Gateways                              19  
  • 20. Science  Gateways   Sandra  Gesing              Science  Gateways                              20   It’s  a   Science   Gateway   It’s  a   Research   Portal   It’s  a   Collaboratory   It’s  a   Cyberinfrastructure   It’s   e-­‐Science   eResearch   It’s  a   Virtual     Lab  
  • 21. Frameworks  and  APIs   Sandra  Gesing                                          21   Re-­‐inven6ng  is  not  always  necessary..  
  • 22. Frameworks  and  APIs   Sandra  Gesing                                              22   ...  and  users  should  get  more  features  easily...  
  • 23. Frameworks  and  APIs   Sandra  Gesing                                              23   ...  but  the  model  should  fit  to  the  demands  of  the   community  
  • 24. Science  Gateway  Technologies   Sandra  Gesing                                              24   •   Agile  web  frameworks  (AngularJS,  Seman6c  UI)   •   Content  management  systems  (Drupal)   •   Libraries  for  implementa6on  (Django)   •   Science  gateway  frameworks  (Galaxy,  WS-­‐PGRADE,   Catania  Science  Gateway  Framework,  HubZero)   •   Sta6c  layout   •   Layout  extendable   •   Workflow-­‐enabled   •   APIs  for  implementa6on  (Apache  Airavata,  Agave,     Vine  Toolkit)      
  • 25. Science  Gateway  Technologies   Sandra  Gesing                                                    25  
  • 26. Development  of  Science  Gateways       Sandra  Gesing                                              26   Crucial  Topics   •  Close  collabora6on  with  user  communi6es   •  Knowledge  about  available  technical  solu6ons   Sounds  easy  but…   •  Requirements  of  user  communi6es  oSen  not  so   clear   •  Technologies  some6mes  s6ll  under  development   for  certain  building  blocks   è Slow  uptake  of  solu6ons     è Larger  effort  for  crea6ng  science  gateways  
  • 27. New  Science  Gateways  -­‐  Checklist       Sandra  Gesing                                              27                         Organiza6onal   Aspects   Technical   Aspects   Domain-­‐Specific   Aspects   Developers   Domain  Experts  
  • 28. New  Science  Gateways  -­‐  Checklist       Sandra  Gesing              Science  Gateways                              28   Domain-­‐specific  aspects:   •  Goal,  target  area  and  target  users     •  Visions/demands  on  the  layout   •  Priori6es  of  features  and  op6ons,  e.g.,  a  list   from  must-­‐have  to  great-­‐to-­‐have  op6ons   •  Integra6on  of  exis6ng  applica6ons  or   development  of  applica6ons   •  Technologies  of  the  applica6ons   •  Visualiza6on   •  Security  demands   •  Workflows  
  • 29. New  Science  Gateways  -­‐  Checklist       Sandra  Gesing              Science  Gateways                              29   Organiza6onal  aspects:   •  Time  constraints  for  the  development,   agreement  on  a  (maybe  even  rough)  project   plan  with  milestones     •  Agreement  on  alpha-­‐  or  beta-­‐tester   •  Regular  mee6ngs    
  • 30. New  Science  Gateways  -­‐  Checklist       Sandra  Gesing              Science  Gateways                              30   Technical  aspects:   •  Experience  with  exis6ng  frameworks  and   programming  languages   •  Available  infrastructure  including  security   infrastructure  and  resources   •  Available  support  of  suitable  technologies   •  Scalability  of  suitable  technologies   •  Effort  for  extending  exis6ng  technologies   compared  to  novel  developments     •  Synergy  effects  with  other  science  gateway   projects  
  • 31. Science  Gateways       Sandra  Gesing              Science  Gateways                              31   A  new  era…   •  Novel  developments  of  web-­‐based  agile   frameworks   •  Infrastructure  providers  report  that  science   gateways  are  more  used  than  commandlines     hHp://www.iplantcollabora6ve.org  
  • 32. Science  Gateways       Sandra  Gesing              Science  Gateways                              32   A  new  era…   •  Novel  developments  of  web-­‐based  agile   frameworks   •  Infrastructure  providers  report  that  science   gateways  are  more  used  than  commandlines   But  also  always  new  challenges…   •  Novel  infrastructures   •  Novel  data  sources  such  as  the  next  Next-­‐Gen   Sequencing   è Support  of  developers  necessary    
  • 33. Science  Gateway  Ins6tute   Sandra  Gesing              Science  Gateways                              33   2012  NSF  SoSware  Ins6tute  conceptualiza6on  award   2015  NSF  SoSware  Ins6tute  implementa6on  proposal   ($15M)     Services   •  Incubator   •  Developer  support  team   •  Gateway  framework  directory   •  Workforce  development     hHp://sciencegateways.org  
  • 34. Science  Gateway  Survey  2014     Sandra  Gesing              Science  Gateways                              34   •  29,000-­‐person  survey     •  4957  responses  from  across  domains  
  • 35. Science  Gateway  Survey  2014     Sandra  Gesing              Science  Gateways                              35   What  services     would  be  helpful?  
  • 36. Bioinforma6c  Infrastructure  Survey   Sandra  Gesing                                                              36   •   Nick  Loman    (Birmingham,  UK)     •  Thomas  Connor      (Cardiff,  UK)     •   October  2015   •   272  answers     hHps://drive.google.com/drive/folders/0B7KZv1TRi06fLUJCU1BYM3JScjg    
  • 37. Bioinforma6c  Infrastructure  Survey   Sandra  Gesing                                                      37  
  • 38. Bioinforma6c  Infrastructure  Survey   Sandra  Gesing                                                              38   0" 20" 40" 60" 80" 100" 120" Cloud" Ins0tu0on2wide"resource" Local"resource" Personal"computer" Where  do  bioinforma6cians  do  most  of  their  work  
  • 39. Bioinforma6c  Infrastructure  Survey   Sandra  Gesing                                                              39   0" 20" 40" 60" 80" 100" 120" Cloud" Ins0tu0on2wide"resource" Local"resource" Personal"computer" 0.00%$ 10.00%$20.00%$30.00%$40.00%$50.00%$60.00%$70.00%$80.00%$90.00%$ Best$for$job$ Good$documenta>on$ Word$of$mouth$recommenda>on$ Used$in$similar$analysis$ Quickest$ Already$installed$on$server$ Other$ Graphical$interface$ Where  do  bioinforma6cians  do  most  of  their  work   Why  do  bioinforma6cians  use  the  soSware  they  use  
  • 40. Bioinforma6c  Infrastructure  Survey   Sandra  Gesing                                                              40   0" 20" 40" 60" 80" 100" 120" Cloud" Ins0tu0on2wide"resource" Local"resource" Personal"computer" 0.00%$ 10.00%$20.00%$30.00%$40.00%$50.00%$60.00%$70.00%$80.00%$90.00%$ Best$for$job$ Good$documenta>on$ Word$of$mouth$recommenda>on$ Used$in$similar$analysis$ Quickest$ Already$installed$on$server$ Other$ Graphical$interface$ Where  do  bioinforma6cians  do  most  of  their  work   Why  do  bioinforma6cians  use  the  soSware  they  use  
  • 41. Bioinforma6c  Infrastructure  Survey   Sandra  Gesing                                                              41   Ques6ons  around  frustra6on  and  limita6ons  of  using   •   Bioinforma6c  soSware   •   Bioinforma6c  resources   •   HPC  and  Cloud  infrastructures   and  about  challenges  to  train  students  in  bioinforma6cs     Answers  oSen  address   •  Hurdles  to  use  bioinforma6c  resources  because  of   commandline  access  or  not  available  soSware   •  Quality  of  documenta6on  of  soSware   •  Need  for  parsers  and  converters  for  diverse  data  formats   •  Long  wai6ng  6me  for  support  or  even  lack  of  support    
  • 42. Challenges     Sandra  Gesing                                                              42   A  world-­‐wide  research  compu6ng  infrastructure   •  Transparent  service  selec6on   •  e.g.,  Docker  could  be  part  of  the  solu6on   •  Access  to  data  irrespec6ve  of  loca6on   •  Op6ons  to  share  data  efficiently   •  Appropriate  privacy  and  security  measures   •  Op6mized  usage  of  resources   •  e.g.,  op6mized  usage  of  cloud  compu6ng  and  their   business  models  
  • 43. Researchers   Sandra  Gesing                                                              43   ~7  million  researchers  world  wide     hHp://chartsbin.com/view/1124  
  • 44. High-­‐Speed  Network   Sandra  Gesing                                                              44  
  • 45. Challenges     Sandra  Gesing                                                              45   Integra6on  of  data  sources  and  instruments   •  Different  data  formats   •  Different  interfaces   •  Different  hardwares  and  technologies     …  from  small  ones  to  the  big  ones…  
  • 46. Challenges     Sandra  Gesing                                                            46   SoSware  searchability,  reproducibility  and  reusability   •  Science  gateways  step  in  the  right  direc6on  but  …   much  more  work  necessary  on  searchibility…  Not  only   finding  any  data  for  a  research  area  but  finding  the  right   data   •  Metadata  approaches   •  Dic6onaries   •  More  involvement  of     librarians      
  • 47. Challenges     Sandra  Gesing                                                            47   SoSware  searchability,  reproducibility  and  reusability   •  Science  gateways  step  in  the  right  direc6on  but  …   much  more  work  necessary  on  reproducibility  and   reusability…     •  studies  in  medicine  and  pharmacology:  11%  or  6%  of  the   analysed  research  was  reproducible     •  myExperiment:  only  20%  of  workflows  reusable  because   of  dependencies  on  hardware,  local  or  distributed  data,   soSware  versions            
  • 48. Challenges     Sandra  Gesing                                                            48   SoSware  searchability,  reproducibility  and  reusability   •  Science  gateways  and  workflow  systems  step  in  the   right  direc6on  but  …   much  more  work  necessary  on  reproducibility  and   reusability…       •  Containeriza6on  approaches   •  Migra6on  approaches   •  Combina6on  of  both        
  • 49. Challenges  –  Novel  and  Old...         Sandra  Gesing              Science  Gateways                              49   …  require  novel  solu6ons!  
  • 50. Projects  -­‐  OSF   Sandra  Gesing              Science  Gateways                              50     •  Big  Data   •  Reproducibility     Open  Access  to  Data  and  Projects  could  solve   parts  of  the  problems…  
  • 51. Projects  -­‐  WSSSPE   Sandra  Gesing              Science  Gateways                              51         Need  of  founda6onal   building  blocks  and   a  reward  system  for   soSware  engineering!           hHps://github.com/wssspe   Early   adopters   Publicity   Wider   adop3on   Funding   ends   Scien3sts   disillusioned   New   project   prototype  
  • 52. Projects  –  B3  Book   Sandra  Gesing              Science  Gateways                              52   Biology,  Bioinforma6cs  and  Big  Data             arXiv:1511.02689  [cs.DC]  
  • 53. EU  COST  Ac6on  cHiPSet  (IC1406)   Sandra  Gesing              Science  Gateways                              53   cHiPSet  –  High  Performance  Modeling  and  Simula6on     for  Big  Data  Applica6ons     •  April  2015  –  April  2019   •  15  countries  -­‐  12  COST,  3  non-­‐COST  (US,  China,  Australia)   •  37  reseach  organiza6ons/companies  (31  COST,  6  non-­‐ COST)             hHp://www.cost.eu/COST_Ac6ons/ict/Ac6ons/IC1406  
  • 54. EU  COST  Ac6on  cHiPSet   Sandra  Gesing              Science  Gateways                              54  
  • 55. cHiPSet  -­‐  Collabora6ons   Sandra  Gesing              Science  Gateways                              55   Projects  declared  interest  for  collabora6on   •  NESUS  (Network  for  Sustainable  Ultrascale   Compu6ng)     hHp://www.nesus.eu/   •  KEYSTONE  (Seman6c  keyword-­‐based  search  on   structured  data  sources)   hHp://www.keystone-­‐cost.eu/   •  AAPELE  (Algorithms,  Architectures  and  Pla•orms  for   Enhanced  Living  Environment)   hHp://aapele.eu/      And  maybe  YOU?      
  • 56. Informa6on  on  Science  Gateways   Sandra  Gesing              Science  Gateways                              56   •   Science  Gateway  Workshops      Europe:  IWSG  -­‐  hHp://iwsg.info   USA:  GCE  -­‐  hHp://sciencegateways.org   Australasia:  IWSG-­‐A  -­‐  hHp://iwsg.info   •   Science  Gateway  Ins6tute    hHp://sciencegateways.org   •   IEEE  Technical  Area  on  Science  Gateways        hHp://ieeesciencegateways.org   •   XSEDE  Science  Gateways    hHps://www.xsede.org/gateways-­‐overview   •   CRC  Science  Gateways    hHps://crc.nd.edu/index.php/research/gateways    
  • 57. Sandra  Gesing              Science  Gateways                              57   sandra.gesing@nd.edu