SlideShare une entreprise Scribd logo
1  sur  41
Télécharger pour lire hors ligne
DataShare  for  the  UCs  

6  February  2014  
  
Where  we’re  going  
Background  
Demo  of  UCSF  DataShare  
Technical  details  
Other  details  
Future  plans  
Q&A  

From  Flickr  by  Leo  Hidalgo  
Goal  
  
  
How  

Catalyze  widespread  research  data  
sharing  
Develop  a  system  that  lowers  data  
sharing  barriers  and  builds  an  engaged  
user  community  
Survey  of  users  by  Angela  Rizk-­‐Jackson  
Has  your  research  
group  provided  public  
access  to  data?  

Why?  

Yes  

No  

How?  
Other  

Other  
Journal  
required  
Funder  
required  

Repository  
Website  

n  =  114  
Repository  choices…  
Repository  choices…  
Repositories    
for  data  

Discipline-­‐specific  

General  content  

Institutional  

Non-­‐institutional  

Publishers/for-­‐profits  
Short-­‐term  projects  
Repository  choices…  
Which  is  more  
important?  

Depends  

Institutional  
•  All  data  associated  with  
a  paper  
•  Tells  a  story  
•  Clearinghouse  for  
researcher’s  works  

?  

Which  should  a  
researcher  use?  

Both  
Discipline-­‐specific  
•  Some  of  data  for  a  
given  paper  
•  Discoverable  
•  Integrated  systems  
•  Collection  policies  
Institutional  
•  All  data  associated  with  
a  paper  
•  Tells  a  story  
•  Clearinghouse  for  
researcher’s  works  
IR’s  are  SO  
2002.  

From  Flickr  by  Colin  ZHU  

From  Flickr  by    johnsons531  
  

From  Flickr  by    Ludie  Cochrane  

From  Flickr  by    Kapil  Karekar  
Last  
year…  

…  “Federal  agencies  investing  in  research  and  
development  (more  than  $100  million  in  annual  
expenditures)  must  have  clear  and  coordinated  
policies  for  increasing  public  access  to  research  
products.”  
From  Flickr  by  wiccked  

IR  
But…  

From  Flickr  by  jackcheng  

Not  always  self-­‐service  
Sometimes  complicated  
Data?  
“Old”  user  interfaces  
Simplify  data  deposit  for  UC  
researchers  
  
Simple  metadata  
Self-­‐service  upload  and  download  
Branded  for  campus  

Most  Important:    
Institutional  Control  Over  Data  
Background  
Demo  of  UCSF  DataShare  
Technical  details  
Other  details  
Future  plans  
Q&A  

From  Flickr  by  Leo  Hidalgo  
Background  
Demo  of  UCSF  DataShare  
Technical  details  
Other  details  
Future  plans  
Q&A  

From  Flickr  by  Leo  Hidalgo  
Technical  goals  
•  Easy  submission  
•  Persistent  citation  
•  Preservation  assurance  
•  Effective  discovery  
From  www.dimensionsinfo.com  

•  Control  over  terms  of  use  
•  All  the  benefits  of  a  centrally  
hosted  service,  while  
maintaining  campus  branding  
and  identity  
From  Flickr  by  Eric  Peacock  
System  components  
•  Easy  submission  

UCSF  drag-­‐n-­‐drop  client  

•  Persistent  citation  
•  Preservation  assurance  
•  Effective  discovery  
•  Control  over  terms  of  use  

Data  use  agreements  (DUAs)  

•  All  the  benefits  of  a  centrally   DNS,  Apache,  CSS,  and  
campus  Shibboleth  IdPs  
hosted  service,  while  
maintaining  campus  branding   datashare.berkeley.edu  
datashare.ucdavis.edu  
and  identity  
datashare.uci.edu  
datashare.ucla.edu  
…  
Deposit  interactions  
Researcher  
(data  producer)  
datashare.campus.edu  

DataShare  portal  
Campus  
IdP  
Authenticate  
with  campus  
credentials  

Shib  

Drag-­‐n-­‐drop  
client  
Assemble  dataset  
Add  metadata  
Submit  to  Merritt  

SDSC  cloud  
Preservation  storage  

Merritt  

CSS  

Atom  

Discovery  

Populate  XTF  index  

(XTF)  

Request  DOI  
Register  metadata  
Assign  DOI

  

Data  use  
agreement  

EZID  
Request  DOI  
Register  metadata  
Assign  DOI

  

Primo  
Harvest  for  A&I  discovery  

DataCite  

Data  Citation  
Index  
Harvest  for  A&I  discovery  
Download  interactions  
Researcher  
Synchronous  for  
small  datasets;  
asynchronous  for  
large  (>  500  MB)  

Campus  
IdP  

Download  data  

(data  consumer)  

datashare.campus.edu  

DataShare  portal  
Drag-­‐n-­‐drop  
client  

Merritt  

CSS  

Discovery  
(XTF)  
Faceted  search  /  browse  

SDSC  cloud  

EZID  

Retrieve  data  

Primo  
Faceted  search  /  browse  

Data  use  
agreement  
Accept  DUA  terms  

DataCite  

Data  Citation  
Index  
Faceted  search  /  browse  
Background  
Demo  of  UCSF  DataShare  
Technical  details  
Other  details  
Future  plans  
Q&A  

From  Flickr  by  Leo  Hidalgo  
Campus  Library  
Delivers  service  to  community  
Shapes  user  interface,  URL,  branding  
Customizes  key  components  
Develops  help,  training  

Roles  
UC3  /  CDL  
Guides  the  campus  
Preserves  content  in  Merritt  
Connects  to  EZID  
Deploys  XTF  for  discovery  
Works  with  vendors  

SDSC  
Maintains  production  storage  
infrastructure  
Holds  three  independent  
copies  of  content  
Branding  &  
Customization  

From  Flickr  by    Diorama  Sky  

• 
• 
• 
• 

Logo  
URL  
Contact  information  
Other…?  
Cost  
•  EZID  accounts  

From  Flickr  by  Maura  Teague  

–  Existing  campus  memberships  provide  unlimited  
DOIs    

•  Merritt  recharge  proposal  (awaiting  UCOP  approval)  
–  Pay-­‐as-­‐you-­‐go   $0.40/GB/year  
–  Paid-­‐up  (for  10  years)   $2.93/GB  
–  Threshold  pricing   100,  200,  500  GBs  
  

  1,  2,  5,  10,  20,  50,  100  TBs    
Cost  
Anticipated  cost  of  providing  all  campus  ladder-­‐track  
faculty  with  5  GBs  for  10  years  
Campus  

Faculty  

Threshold  

Paid-­‐up  cost  

Berkeley  

1,260  

10  TB  

$  29,300  

Davis  

1,240  

10  TB  

$  29,300  

Irvine  

1,051  

10  TB  

$  29,300  

Los  Angeles  

1,701  

10  TB  

$  29,300  

Merced  

      159  

      1  TB  

$      2,930  

Riverside  

      561  

    5  TB  

$  14,650  

San  Diego  

1,109  

10  TB  

$  29,300  

San  Francisco  

      366  

    2  TB  

$      5,860  

Santa  Barbara  

      746  

    5  TB  

$  14,650  

Santa  Cruz  

      485  

    5  TB  

$  14,650  

Source:  http://legacy-­‐its.ucop.edu/uwnews/stat/headcount_fte/oct2013/welcome.html    
Governance        
&  Agreements  

Goal:    
Simplify  &  Scale  Data  Use  &  
Deposit  Agreements  
Governance        
&  Agreements  

Data    
User  

ODL  or  
similar  

CDL  

Terms  of  
service  

UC  Campus  

ODL  or  similar  
  

Terms  of  
service  

Data  
Depositor  
Background  
Demo  of  UCSF  DataShare  
Technical  details  
Other  details  
Next  steps  &  future  plans  
Q&A  

From  Flickr  by  Leo  Hidalgo  
Who  
Decides?  
•  CDL  to  work  with  each  campus  to  
implement  &  shape  service  
•  Campus-­‐to-­‐campus  interaction  
•  Group  meetings  as  needed  
•  SAG1  check-­‐ins  
•  Communication  (…)  
From  Flickr  by  Mischievous  One  

This  is  a  group  project  
From  Flickr  by  Alice  Bartlett  

Two  heads  are  
better  than  
one!  
From  Flickr  by  Emil  Nordén  

• 
• 
• 
• 
• 
• 
• 
• 
• 
• 
• 
• 

eScholarship  connection  
ORCID  
Altmetrics  
Solr/Blacklight  for  discovery  
Expand  metadata  options  
Embargoes  
Restricted  access  for  peer  review  
Annotations  
Export  to  citation  managers  
Staging  area  
Private  storage  
Mapping  metadata/GIS  support  
Communication  
Google  Groups  Web  Forum  
Communication  
UC3  confluence  site    
confluence.ucop.edu/display/Curation/DataShare+for+UCs  
Communication  

From  Flickr  by  gsagos/nho  

•  Listserv?  
•  Twitter  @DataShareOrg  
•  …?  
Communication  
github.com/CDLUC3/datashare  
DASH:    
Helping  Community  
T Repositories  

ob
eR
evi
seD
What  Makes  DASH  Unique:  

•  Modern,  intuitive  user  interface  for  superior  user  experience  
•  Freely  available  code  for  download  and  use  by  anyone  
•  User-­‐friendly  API(s)  to  ensure  interoperability  with  existing  
repositories  (e.g.,  SWORD  for  deposit;  Atom,  OAI-­‐PMH,  
ResourceSync  for  populating  the  discovery  index).  
•  Customizable  interfaces  that  can  be  altered  easily  to  reflect  service  
provider  branding  
•  Authentication  via  institutional  Identity  Management  Systems  
Next  Steps  –    
Next  2  Weeks  
•  details  to  be  established  
–  who’s  interested  
–  tech  contact  for  interested  
campuses  
–  communication  lines  

From  Flickr  by  Themactep  
Next  Steps  –    
Next  2  Months  
•  get  DataShare  up  and  running  
–  Shibboleth  configuration  &  
other  authentication  
–  Domains/URLs  established  
–  Customizations  –  logos  etc.  

From  Flickr  by  Themactep  
Next  Steps  –    
Longer  term  
•  in-­‐person  meeting?  
•  CDL  camp?  
•  communication/outreach?  

From  Flickr  by  Themactep  
Acknowledgements  
• 
• 
• 
• 

Stephen  Abrams  
Trisha  Cruse  
Carly  Strasser  
Perry  Willett  

•  Geoffrey  Boushey  
•  Julia  Kochi  
•  Megan  Laurence  

•  Anirvan  Chatterjee  
•  Angela  Rizk-­‐Jackson  
•  Maninder  Kahlon  

Contenu connexe

Tendances

Re tooling for data management-support
Re tooling for data management-supportRe tooling for data management-support
Re tooling for data management-support
Sherry Lake
 
The Data Management Ecosystem
The Data Management EcosystemThe Data Management Ecosystem
The Data Management Ecosystem
John Kunze
 

Tendances (20)

DMPTool Webinar 6: Health Sciences and the DMPTool (presented by Lisa Federer)
DMPTool Webinar 6: Health Sciences and the DMPTool (presented by Lisa Federer)DMPTool Webinar 6: Health Sciences and the DMPTool (presented by Lisa Federer)
DMPTool Webinar 6: Health Sciences and the DMPTool (presented by Lisa Federer)
 
Data Publishing Models by Sünje Dallmeier-Tiessen
Data Publishing Models by Sünje Dallmeier-TiessenData Publishing Models by Sünje Dallmeier-Tiessen
Data Publishing Models by Sünje Dallmeier-Tiessen
 
Dataverse in the Universe of Data by Christine L. Borgman
Dataverse in the Universe of Data by Christine L. BorgmanDataverse in the Universe of Data by Christine L. Borgman
Dataverse in the Universe of Data by Christine L. Borgman
 
Metadata & Data Curation Services by Thu-Mai Christian
Metadata & Data Curation Services by Thu-Mai ChristianMetadata & Data Curation Services by Thu-Mai Christian
Metadata & Data Curation Services by Thu-Mai Christian
 
Re tooling for data management-support
Re tooling for data management-supportRe tooling for data management-support
Re tooling for data management-support
 
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
 
Center for Open Science and the Open Science Framework: Dataverse Add-on by S...
Center for Open Science and the Open Science Framework: Dataverse Add-on by S...Center for Open Science and the Open Science Framework: Dataverse Add-on by S...
Center for Open Science and the Open Science Framework: Dataverse Add-on by S...
 
Meadows apr28-1
Meadows apr28-1Meadows apr28-1
Meadows apr28-1
 
No more waiting! Tools that work Today to reveal dataset use
No more waiting!  Tools that work Today to reveal dataset useNo more waiting!  Tools that work Today to reveal dataset use
No more waiting! Tools that work Today to reveal dataset use
 
The Data Management Ecosystem
The Data Management EcosystemThe Data Management Ecosystem
The Data Management Ecosystem
 
Dataverse in China: Internationalization, Curation and Promotion by Yin Shenqin
Dataverse in China: Internationalization, Curation and Promotion by Yin ShenqinDataverse in China: Internationalization, Curation and Promotion by Yin Shenqin
Dataverse in China: Internationalization, Curation and Promotion by Yin Shenqin
 
Caldrone - Specific Needs and Concerns Associated with Data Repositories
Caldrone - Specific Needs and Concerns Associated with Data RepositoriesCaldrone - Specific Needs and Concerns Associated with Data Repositories
Caldrone - Specific Needs and Concerns Associated with Data Repositories
 
RDAP13 Elizabeth Moss: The impact of data reuse
RDAP13 Elizabeth Moss: The impact of data reuseRDAP13 Elizabeth Moss: The impact of data reuse
RDAP13 Elizabeth Moss: The impact of data reuse
 
Second Thoughts about Metadata Standards for Data
Second Thoughts about Metadata Standards for DataSecond Thoughts about Metadata Standards for Data
Second Thoughts about Metadata Standards for Data
 
NISO Webinar on data curation services at the CDL
NISO Webinar on data curation services at the CDLNISO Webinar on data curation services at the CDL
NISO Webinar on data curation services at the CDL
 
DMPTool webinar 2011-10-19
DMPTool webinar 2011-10-19DMPTool webinar 2011-10-19
DMPTool webinar 2011-10-19
 
"Data in Context" IG sessions @ RDA 3rd Plenary
"Data in Context" IG sessions @  RDA 3rd Plenary"Data in Context" IG sessions @  RDA 3rd Plenary
"Data in Context" IG sessions @ RDA 3rd Plenary
 
Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...
Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...
Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...
 
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
 
DMPTool Webinar 3: Customizing the DMPTool
DMPTool Webinar 3: Customizing the DMPToolDMPTool Webinar 3: Customizing the DMPTool
DMPTool Webinar 3: Customizing the DMPTool
 

En vedette

2.4.1 forman wesley ignite slides (keynote)
2.4.1 forman wesley ignite slides (keynote) 2.4.1 forman wesley ignite slides (keynote)
2.4.1 forman wesley ignite slides (keynote)
Wesley Forman
 
Dash Merritt Zeus
Dash Merritt ZeusDash Merritt Zeus
Dash Merritt Zeus
S Gilder
 
Dataset Metadata Publication Through EZID
Dataset Metadata Publication Through EZIDDataset Metadata Publication Through EZID
Dataset Metadata Publication Through EZID
University of California Curation Center
 

En vedette (6)

2.4.1 forman wesley ignite slides (keynote)
2.4.1 forman wesley ignite slides (keynote) 2.4.1 forman wesley ignite slides (keynote)
2.4.1 forman wesley ignite slides (keynote)
 
OTCA & MSU Advertising Association
OTCA & MSU Advertising AssociationOTCA & MSU Advertising Association
OTCA & MSU Advertising Association
 
Dash Merritt Zeus
Dash Merritt ZeusDash Merritt Zeus
Dash Merritt Zeus
 
Executives In Transition
Executives In TransitionExecutives In Transition
Executives In Transition
 
Dataset Metadata Publication Through EZID
Dataset Metadata Publication Through EZIDDataset Metadata Publication Through EZID
Dataset Metadata Publication Through EZID
 
EZID Summer 2012 Webinar
EZID Summer 2012 WebinarEZID Summer 2012 Webinar
EZID Summer 2012 Webinar
 

Similaire à DataShare for UC Campuses

Uc3 pasig-asis&t-2013-08-20-support-of-data-intensive-research
Uc3 pasig-asis&t-2013-08-20-support-of-data-intensive-researchUc3 pasig-asis&t-2013-08-20-support-of-data-intensive-research
Uc3 pasig-asis&t-2013-08-20-support-of-data-intensive-research
University of California Curation Center
 
Data in Context Interest Group Sessions @ RDA 3rd Plenary, Dublin (March 26-2...
Data in Context Interest Group Sessions @ RDA 3rd Plenary, Dublin (March 26-2...Data in Context Interest Group Sessions @ RDA 3rd Plenary, Dublin (March 26-2...
Data in Context Interest Group Sessions @ RDA 3rd Plenary, Dublin (March 26-2...
Brigitte Jörg
 
#ALAAC15 Linked Data Love
#ALAAC15 Linked Data Love #ALAAC15 Linked Data Love
#ALAAC15 Linked Data Love
Kristi Holmes
 
How Cyverse.org enables scalable data discoverability and re-use
How Cyverse.org enables scalable data discoverability and re-useHow Cyverse.org enables scalable data discoverability and re-use
How Cyverse.org enables scalable data discoverability and re-use
Matthew Vaughn
 
Policy-based Data Management
Policy-based Data Management Policy-based Data Management
Policy-based Data Management
Gary Wilhelm
 
Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...
Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...
Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...
SEAD
 

Similaire à DataShare for UC Campuses (20)

10-1-13 “Research Data Curation at UC San Diego: An Overview” Presentation Sl...
10-1-13 “Research Data Curation at UC San Diego: An Overview” Presentation Sl...10-1-13 “Research Data Curation at UC San Diego: An Overview” Presentation Sl...
10-1-13 “Research Data Curation at UC San Diego: An Overview” Presentation Sl...
 
Alamw15 VIVO
Alamw15 VIVOAlamw15 VIVO
Alamw15 VIVO
 
Datashare cni spring2013
Datashare cni spring2013Datashare cni spring2013
Datashare cni spring2013
 
Research Data Management at the University of Salford
Research Data Management at the University of SalfordResearch Data Management at the University of Salford
Research Data Management at the University of Salford
 
Uc3 pasig-asis&t-2013-08-20-support-of-data-intensive-research
Uc3 pasig-asis&t-2013-08-20-support-of-data-intensive-researchUc3 pasig-asis&t-2013-08-20-support-of-data-intensive-research
Uc3 pasig-asis&t-2013-08-20-support-of-data-intensive-research
 
Building and Extensible Storage Ecosystem with WOS
Building and Extensible Storage Ecosystem with WOSBuilding and Extensible Storage Ecosystem with WOS
Building and Extensible Storage Ecosystem with WOS
 
The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...
 
Data in Context Interest Group Sessions @ RDA 3rd Plenary, Dublin (March 26-2...
Data in Context Interest Group Sessions @ RDA 3rd Plenary, Dublin (March 26-2...Data in Context Interest Group Sessions @ RDA 3rd Plenary, Dublin (March 26-2...
Data in Context Interest Group Sessions @ RDA 3rd Plenary, Dublin (March 26-2...
 
Or 2013-abrams-sharing-data-rich-research
Or 2013-abrams-sharing-data-rich-researchOr 2013-abrams-sharing-data-rich-research
Or 2013-abrams-sharing-data-rich-research
 
Overview of XSEDE Systems Engineering
Overview of XSEDE Systems EngineeringOverview of XSEDE Systems Engineering
Overview of XSEDE Systems Engineering
 
RDAP 16 Lightning: An Open Science Framework for Solving Institutional Challe...
RDAP 16 Lightning: An Open Science Framework for Solving Institutional Challe...RDAP 16 Lightning: An Open Science Framework for Solving Institutional Challe...
RDAP 16 Lightning: An Open Science Framework for Solving Institutional Challe...
 
Making Data Dynamic: Views from UC3, CDL
Making Data Dynamic: Views from UC3, CDLMaking Data Dynamic: Views from UC3, CDL
Making Data Dynamic: Views from UC3, CDL
 
Sgci esip-7-20-18
Sgci esip-7-20-18Sgci esip-7-20-18
Sgci esip-7-20-18
 
#ALAAC15 Linked Data Love
#ALAAC15 Linked Data Love #ALAAC15 Linked Data Love
#ALAAC15 Linked Data Love
 
10-15-13 “Metadata and Repository Services for Research Data Curation” Presen...
10-15-13 “Metadata and Repository Services for Research Data Curation” Presen...10-15-13 “Metadata and Repository Services for Research Data Curation” Presen...
10-15-13 “Metadata and Repository Services for Research Data Curation” Presen...
 
How Cyverse.org enables scalable data discoverability and re-use
How Cyverse.org enables scalable data discoverability and re-useHow Cyverse.org enables scalable data discoverability and re-use
How Cyverse.org enables scalable data discoverability and re-use
 
Policy-based Data Management
Policy-based Data Management Policy-based Data Management
Policy-based Data Management
 
Improving user engagement in a data repository with web analytics
Improving user engagement in a data repository with web analyticsImproving user engagement in a data repository with web analytics
Improving user engagement in a data repository with web analytics
 
Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...
Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...
Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...
 
Identity and User Access Management.pptx
Identity and User Access Management.pptxIdentity and User Access Management.pptx
Identity and User Access Management.pptx
 

Plus de University of California Curation Center

DMPTool Webinar 8: Data Curation Profiles and the DMPTool (presented by Jake ...
DMPTool Webinar 8: Data Curation Profiles and the DMPTool (presented by Jake ...DMPTool Webinar 8: Data Curation Profiles and the DMPTool (presented by Jake ...
DMPTool Webinar 8: Data Curation Profiles and the DMPTool (presented by Jake ...
University of California Curation Center
 
Ndsa 2013-abrams-integrating-repositories-for-data-sharing
Ndsa 2013-abrams-integrating-repositories-for-data-sharingNdsa 2013-abrams-integrating-repositories-for-data-sharing
Ndsa 2013-abrams-integrating-repositories-for-data-sharing
University of California Curation Center
 

Plus de University of California Curation Center (20)

ETDs: Electronic Thesis and Dissertation Service at the University of California
ETDs: Electronic Thesis and Dissertation Service at the University of CaliforniaETDs: Electronic Thesis and Dissertation Service at the University of California
ETDs: Electronic Thesis and Dissertation Service at the University of California
 
What does "data publication" mean to researchers?
What does "data publication" mean to researchers?What does "data publication" mean to researchers?
What does "data publication" mean to researchers?
 
Researcher perspectives on publication and peer review of data.
Researcher perspectives on publication and peer review of data.Researcher perspectives on publication and peer review of data.
Researcher perspectives on publication and peer review of data.
 
Enhancing DMPTool: Further Streamlineing Data Mangement Planning Process
Enhancing DMPTool: Further Streamlineing Data Mangement Planning ProcessEnhancing DMPTool: Further Streamlineing Data Mangement Planning Process
Enhancing DMPTool: Further Streamlineing Data Mangement Planning Process
 
DataShare: Empowering Researcher Data Curation
DataShare: Empowering Researcher Data CurationDataShare: Empowering Researcher Data Curation
DataShare: Empowering Researcher Data Curation
 
Future of web archiving
Future of web archivingFuture of web archiving
Future of web archiving
 
Data preservation 101
Data preservation 101Data preservation 101
Data preservation 101
 
Creating superior data management plans with the DMPTool
Creating superior data management plans with the DMPToolCreating superior data management plans with the DMPTool
Creating superior data management plans with the DMPTool
 
ESA Ignite talk on the DMPTool by S Abrams
ESA Ignite talk on the DMPTool by S AbramsESA Ignite talk on the DMPTool by S Abrams
ESA Ignite talk on the DMPTool by S Abrams
 
DMPTool2 Webinar #1 for Administrators
DMPTool2 Webinar #1 for AdministratorsDMPTool2 Webinar #1 for Administrators
DMPTool2 Webinar #1 for Administrators
 
DMPTool2 Administrator Webinar #2
DMPTool2 Administrator Webinar #2DMPTool2 Administrator Webinar #2
DMPTool2 Administrator Webinar #2
 
Helping librarians use the DMPTool as a centerpiece for data management
Helping librarians use the DMPTool as a centerpiece for data managementHelping librarians use the DMPTool as a centerpiece for data management
Helping librarians use the DMPTool as a centerpiece for data management
 
DMPTool2: Improvements and Outreach
DMPTool2: Improvements and Outreach DMPTool2: Improvements and Outreach
DMPTool2: Improvements and Outreach
 
DMPTool Webinar 11: Complementary Tools
DMPTool Webinar 11: Complementary ToolsDMPTool Webinar 11: Complementary Tools
DMPTool Webinar 11: Complementary Tools
 
DMPTool Webinar 10: More Extensive DMPs
DMPTool Webinar 10: More Extensive DMPsDMPTool Webinar 10: More Extensive DMPs
DMPTool Webinar 10: More Extensive DMPs
 
DMPTool Webinar 9: Talking Points for Institutional Stakeholders
DMPTool Webinar 9: Talking Points for Institutional StakeholdersDMPTool Webinar 9: Talking Points for Institutional Stakeholders
DMPTool Webinar 9: Talking Points for Institutional Stakeholders
 
DMPTool Webinar 8: Data Curation Profiles and the DMPTool (presented by Jake ...
DMPTool Webinar 8: Data Curation Profiles and the DMPTool (presented by Jake ...DMPTool Webinar 8: Data Curation Profiles and the DMPTool (presented by Jake ...
DMPTool Webinar 8: Data Curation Profiles and the DMPTool (presented by Jake ...
 
DMPTool Webinar 7: Digital Humanities and the DMPTool by Miriam Posner
DMPTool Webinar 7: Digital Humanities and the DMPTool by Miriam PosnerDMPTool Webinar 7: Digital Humanities and the DMPTool by Miriam Posner
DMPTool Webinar 7: Digital Humanities and the DMPTool by Miriam Posner
 
Ndsa 2013-abrams-integrating-repositories-for-data-sharing
Ndsa 2013-abrams-integrating-repositories-for-data-sharingNdsa 2013-abrams-integrating-repositories-for-data-sharing
Ndsa 2013-abrams-integrating-repositories-for-data-sharing
 
DMPTool Webinar 5: Promoting institutional services with the DMPTool; EZID as...
DMPTool Webinar 5: Promoting institutional services with the DMPTool; EZID as...DMPTool Webinar 5: Promoting institutional services with the DMPTool; EZID as...
DMPTool Webinar 5: Promoting institutional services with the DMPTool; EZID as...
 

Dernier

Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Victor Rentea
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

Dernier (20)

Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptx
 

DataShare for UC Campuses

  • 1. DataShare  for  the  UCs   6  February  2014    
  • 2. Where  we’re  going   Background   Demo  of  UCSF  DataShare   Technical  details   Other  details   Future  plans   Q&A   From  Flickr  by  Leo  Hidalgo  
  • 3.
  • 4. Goal       How   Catalyze  widespread  research  data   sharing   Develop  a  system  that  lowers  data   sharing  barriers  and  builds  an  engaged   user  community  
  • 5. Survey  of  users  by  Angela  Rizk-­‐Jackson   Has  your  research   group  provided  public   access  to  data?   Why?   Yes   No   How?   Other   Other   Journal   required   Funder   required   Repository   Website   n  =  114  
  • 7. Repository  choices…   Repositories     for  data   Discipline-­‐specific   General  content   Institutional   Non-­‐institutional   Publishers/for-­‐profits   Short-­‐term  projects  
  • 8. Repository  choices…   Which  is  more   important?   Depends   Institutional   •  All  data  associated  with   a  paper   •  Tells  a  story   •  Clearinghouse  for   researcher’s  works   ?   Which  should  a   researcher  use?   Both   Discipline-­‐specific   •  Some  of  data  for  a   given  paper   •  Discoverable   •  Integrated  systems   •  Collection  policies  
  • 9. Institutional   •  All  data  associated  with   a  paper   •  Tells  a  story   •  Clearinghouse  for   researcher’s  works  
  • 10. IR’s  are  SO   2002.   From  Flickr  by  Colin  ZHU   From  Flickr  by    johnsons531     From  Flickr  by    Ludie  Cochrane   From  Flickr  by    Kapil  Karekar  
  • 11. Last   year…   …  “Federal  agencies  investing  in  research  and   development  (more  than  $100  million  in  annual   expenditures)  must  have  clear  and  coordinated   policies  for  increasing  public  access  to  research   products.”  
  • 12. From  Flickr  by  wiccked   IR  
  • 13. But…   From  Flickr  by  jackcheng   Not  always  self-­‐service   Sometimes  complicated   Data?   “Old”  user  interfaces  
  • 14. Simplify  data  deposit  for  UC   researchers     Simple  metadata   Self-­‐service  upload  and  download   Branded  for  campus   Most  Important:     Institutional  Control  Over  Data  
  • 15. Background   Demo  of  UCSF  DataShare   Technical  details   Other  details   Future  plans   Q&A   From  Flickr  by  Leo  Hidalgo  
  • 16. Background   Demo  of  UCSF  DataShare   Technical  details   Other  details   Future  plans   Q&A   From  Flickr  by  Leo  Hidalgo  
  • 17. Technical  goals   •  Easy  submission   •  Persistent  citation   •  Preservation  assurance   •  Effective  discovery   From  www.dimensionsinfo.com   •  Control  over  terms  of  use   •  All  the  benefits  of  a  centrally   hosted  service,  while   maintaining  campus  branding   and  identity   From  Flickr  by  Eric  Peacock  
  • 18. System  components   •  Easy  submission   UCSF  drag-­‐n-­‐drop  client   •  Persistent  citation   •  Preservation  assurance   •  Effective  discovery   •  Control  over  terms  of  use   Data  use  agreements  (DUAs)   •  All  the  benefits  of  a  centrally   DNS,  Apache,  CSS,  and   campus  Shibboleth  IdPs   hosted  service,  while   maintaining  campus  branding   datashare.berkeley.edu   datashare.ucdavis.edu   and  identity   datashare.uci.edu   datashare.ucla.edu   …  
  • 19. Deposit  interactions   Researcher   (data  producer)   datashare.campus.edu   DataShare  portal   Campus   IdP   Authenticate   with  campus   credentials   Shib   Drag-­‐n-­‐drop   client   Assemble  dataset   Add  metadata   Submit  to  Merritt   SDSC  cloud   Preservation  storage   Merritt   CSS   Atom   Discovery   Populate  XTF  index   (XTF)   Request  DOI   Register  metadata   Assign  DOI   Data  use   agreement   EZID   Request  DOI   Register  metadata   Assign  DOI   Primo   Harvest  for  A&I  discovery   DataCite   Data  Citation   Index   Harvest  for  A&I  discovery  
  • 20. Download  interactions   Researcher   Synchronous  for   small  datasets;   asynchronous  for   large  (>  500  MB)   Campus   IdP   Download  data   (data  consumer)   datashare.campus.edu   DataShare  portal   Drag-­‐n-­‐drop   client   Merritt   CSS   Discovery   (XTF)   Faceted  search  /  browse   SDSC  cloud   EZID   Retrieve  data   Primo   Faceted  search  /  browse   Data  use   agreement   Accept  DUA  terms   DataCite   Data  Citation   Index   Faceted  search  /  browse  
  • 21. Background   Demo  of  UCSF  DataShare   Technical  details   Other  details   Future  plans   Q&A   From  Flickr  by  Leo  Hidalgo  
  • 22. Campus  Library   Delivers  service  to  community   Shapes  user  interface,  URL,  branding   Customizes  key  components   Develops  help,  training   Roles   UC3  /  CDL   Guides  the  campus   Preserves  content  in  Merritt   Connects  to  EZID   Deploys  XTF  for  discovery   Works  with  vendors   SDSC   Maintains  production  storage   infrastructure   Holds  three  independent   copies  of  content  
  • 23. Branding  &   Customization   From  Flickr  by    Diorama  Sky   •  •  •  •  Logo   URL   Contact  information   Other…?  
  • 24. Cost   •  EZID  accounts   From  Flickr  by  Maura  Teague   –  Existing  campus  memberships  provide  unlimited   DOIs     •  Merritt  recharge  proposal  (awaiting  UCOP  approval)   –  Pay-­‐as-­‐you-­‐go  $0.40/GB/year   –  Paid-­‐up  (for  10  years)  $2.93/GB   –  Threshold  pricing  100,  200,  500  GBs      1,  2,  5,  10,  20,  50,  100  TBs    
  • 25. Cost   Anticipated  cost  of  providing  all  campus  ladder-­‐track   faculty  with  5  GBs  for  10  years   Campus   Faculty   Threshold   Paid-­‐up  cost   Berkeley   1,260   10  TB   $  29,300   Davis   1,240   10  TB   $  29,300   Irvine   1,051   10  TB   $  29,300   Los  Angeles   1,701   10  TB   $  29,300   Merced        159        1  TB   $      2,930   Riverside        561      5  TB   $  14,650   San  Diego   1,109   10  TB   $  29,300   San  Francisco        366      2  TB   $      5,860   Santa  Barbara        746      5  TB   $  14,650   Santa  Cruz        485      5  TB   $  14,650   Source:  http://legacy-­‐its.ucop.edu/uwnews/stat/headcount_fte/oct2013/welcome.html    
  • 26. Governance         &  Agreements   Goal:     Simplify  &  Scale  Data  Use  &   Deposit  Agreements  
  • 27. Governance         &  Agreements   Data     User   ODL  or   similar   CDL   Terms  of   service   UC  Campus   ODL  or  similar     Terms  of   service   Data   Depositor  
  • 28. Background   Demo  of  UCSF  DataShare   Technical  details   Other  details   Next  steps  &  future  plans   Q&A   From  Flickr  by  Leo  Hidalgo  
  • 29. Who   Decides?   •  CDL  to  work  with  each  campus  to   implement  &  shape  service   •  Campus-­‐to-­‐campus  interaction   •  Group  meetings  as  needed   •  SAG1  check-­‐ins   •  Communication  (…)  
  • 30. From  Flickr  by  Mischievous  One   This  is  a  group  project  
  • 31. From  Flickr  by  Alice  Bartlett   Two  heads  are   better  than   one!  
  • 32. From  Flickr  by  Emil  Nordén   •  •  •  •  •  •  •  •  •  •  •  •  eScholarship  connection   ORCID   Altmetrics   Solr/Blacklight  for  discovery   Expand  metadata  options   Embargoes   Restricted  access  for  peer  review   Annotations   Export  to  citation  managers   Staging  area   Private  storage   Mapping  metadata/GIS  support  
  • 34. Communication   UC3  confluence  site     confluence.ucop.edu/display/Curation/DataShare+for+UCs  
  • 35. Communication   From  Flickr  by  gsagos/nho   •  Listserv?   •  Twitter  @DataShareOrg   •  …?  
  • 37. DASH:     Helping  Community   T Repositories   ob eR evi seD What  Makes  DASH  Unique:   •  Modern,  intuitive  user  interface  for  superior  user  experience   •  Freely  available  code  for  download  and  use  by  anyone   •  User-­‐friendly  API(s)  to  ensure  interoperability  with  existing   repositories  (e.g.,  SWORD  for  deposit;  Atom,  OAI-­‐PMH,   ResourceSync  for  populating  the  discovery  index).   •  Customizable  interfaces  that  can  be  altered  easily  to  reflect  service   provider  branding   •  Authentication  via  institutional  Identity  Management  Systems  
  • 38. Next  Steps  –     Next  2  Weeks   •  details  to  be  established   –  who’s  interested   –  tech  contact  for  interested   campuses   –  communication  lines   From  Flickr  by  Themactep  
  • 39. Next  Steps  –     Next  2  Months   •  get  DataShare  up  and  running   –  Shibboleth  configuration  &   other  authentication   –  Domains/URLs  established   –  Customizations  –  logos  etc.   From  Flickr  by  Themactep  
  • 40. Next  Steps  –     Longer  term   •  in-­‐person  meeting?   •  CDL  camp?   •  communication/outreach?   From  Flickr  by  Themactep  
  • 41. Acknowledgements   •  •  •  •  Stephen  Abrams   Trisha  Cruse   Carly  Strasser   Perry  Willett   •  Geoffrey  Boushey   •  Julia  Kochi   •  Megan  Laurence   •  Anirvan  Chatterjee   •  Angela  Rizk-­‐Jackson   •  Maninder  Kahlon