1. OTAGO VRE
VIRTUAL RESEARCH ENVIRONMENT
RESEARCHER PRODUCTIVITY
Open-source platform built on top of SharePoint 2010 features.
2. Otago VRE
Data Management
Authoring
Data Deposit
Data Analysis
Data Sharing
Data Visualization
3. Project Trident
Otago-VRE
DATA MANAGEMENT
Integrated Workflow Experience
4. Project Trident - a Scientific Workflow Workbench
Share workflows via
Author, Execute and Monitor Workflows
Compose and modify workflows
via drag & drop canvas
View data products, performance metrics, and provenance data
Version 1.2 (Open Source under Apache 2.0 License):
http://tridentworkflow.codeplex.com/
5. Integrated Workflow Experience
researchers can…
• View/execute/monitor workflows within the
context of project collaboration site
• Receive alerts (email, SMS) when workflows
complete
• Browse workflow execution history and
provenance information
• Review/store/manage data files that are written
back into SharePoint by Trident
6. Project Trident
Otago-VRE
Word 2010
AUTHORING
Include Data and Data artifacts
in compound documents
7. Article Authoring Add-in for Word
Read, convert, and author NLM XML documents
Repository deposit via SWORD
ORE Resource Map creation
v.2 beta 3:
http://research.microsoft.com/authoring/
8. Zentity
Project Trident
Otago-VRE
Word 2010
DATA DEPOSIT
SWORD client
9. Zentity - a Research Output Repository Platform
A semantic computing platform to store and expose relationships between digital assets
Native support for RSS, OAI-PMH, OAI-ORE, AtomPub and SWORD
Flexible data model enables many scenarios and can be easily extended over time
v.1 (v.2 Tech Preview available soon!):
http://research.microsoft.com/zentity/
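As an illustration of what OAI-PMH support means for a harvester, the sketch below builds a request URL from a base URL, a protocol verb, and its arguments. The endpoint address is hypothetical (the slides do not give Zentity's OAI-PMH address), and the helper name `oai_pmh_url` is invented for this example; only the six verbs and the `metadataPrefix` argument come from the OAI-PMH protocol itself.

```python
from urllib.parse import urlencode

# Hypothetical endpoint: the slides do not state where Zentity exposes OAI-PMH.
BASE_URL = "http://repository.example.org/oai"

# The six request verbs defined by the OAI-PMH protocol.
OAI_VERBS = {
    "Identify", "ListMetadataFormats", "ListSets",
    "ListIdentifiers", "ListRecords", "GetRecord",
}


def oai_pmh_url(base_url, verb, **arguments):
    """Build an OAI-PMH request URL: base URL plus a verb and its arguments."""
    if verb not in OAI_VERBS:
        raise ValueError(f"unknown OAI-PMH verb: {verb}")
    query = {"verb": verb}
    query.update(arguments)
    return base_url + "?" + urlencode(query)


# Ask for all records in Dublin Core (oai_dc), the format every
# OAI-PMH repository is required to support.
list_url = oai_pmh_url(BASE_URL, "ListRecords", metadataPrefix="oai_dc")
print(list_url)
```

A harvester would fetch this URL with a plain HTTP GET and parse the XML response; that step is left out because it depends on a live repository.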
10. SWORD client
researchers can…
• Select any file:
• Document
• Presentation
• Image
• Data files
• and publish it to any repository (via SWORD)
• SWORD endpoints are managed as a custom list,
so new locations are easily added
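The deposit step described above can be sketched as an HTTP POST of the file's bytes to a repository collection URL. This is a minimal illustration, not the VRE's actual client: the endpoint dictionary stands in for the SharePoint custom list, its URL is hypothetical, and the function name `build_deposit` is invented here. A real repository may also require authentication and packaging headers that are not shown.

```python
import mimetypes
from urllib.request import Request

# Hypothetical stand-in for the custom list of SWORD endpoints.
# Adding a new deposit location is just adding an entry.
SWORD_ENDPOINTS = {
    "Example repository": "http://repository.example.org/sword/collection",
}


def build_deposit(endpoint_name, filename, payload):
    """Prepare (but do not send) a deposit request for one file."""
    url = SWORD_ENDPOINTS[endpoint_name]
    # Guess the media type from the file name; fall back to a generic type.
    content_type = mimetypes.guess_type(filename)[0] or "application/octet-stream"
    headers = {
        "Content-Type": content_type,
        "Content-Disposition": f"attachment; filename={filename}",
    }
    return Request(url, data=payload, headers=headers, method="POST")


request = build_deposit("Example repository", "sample.unknownext", b"raw file bytes")
print(request.get_method(), request.full_url)
```

Passing the prepared request to `urllib.request.urlopen` would perform the deposit; it is omitted because the endpoint above does not exist.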
12. Zentity
Project Trident
Otago-VRE
Word 2010
Excel 2010
DATA ANALYSIS
through Excel 2010
13. Metagenomics
• Using Azure cloud to host standard sequence data
• Run BLAST in parallel “map reduce” style for a large collection input
• Use Excel-based tool to search databases.
[Diagram components: user selects DBs and input sequence; Web Role; Input Splitter Worker Role; BLAST Execution Worker Role #1 … Worker Role #n; Combiner Worker Role; Azure Blob Storage holding Genome DB 1 … DB K; BLAST DB Configuration]
• Performance: speedup
• 45x with 50 roles; 94x with 100 roles;
• with 300 roles computation time is reduced to 4 hours!
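The split, execute, and combine stages on this slide follow a map-reduce pattern, which can be sketched on a single machine as below. Everything here is illustrative: `toy_search` is a placeholder that ranks database entries by shared 3-mers and is not BLAST, threads stand in for Azure worker roles, and the sequences and database names are made up.

```python
from concurrent.futures import ThreadPoolExecutor


def split_input(sequences, n_parts):
    """Input splitter: deal the sequences into n_parts partitions."""
    return [sequences[i::n_parts] for i in range(n_parts)]


def kmers(seq, k=3):
    """Set of all length-k substrings of a sequence."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def toy_search(partition, database):
    """Placeholder for one BLAST worker: best hit by shared 3-mer count."""
    results = {}
    for query in partition:
        results[query] = max(
            database,
            key=lambda name: len(kmers(query) & kmers(database[name])),
        )
    return results


def combine(partial_results):
    """Combiner: merge the per-worker result dictionaries."""
    merged = {}
    for part in partial_results:
        merged.update(part)
    return merged


def run(sequences, database, n_workers):
    """Split the input, search each partition in parallel, combine the hits."""
    partitions = split_input(sequences, n_workers)
    with ThreadPoolExecutor(max_workers=n_workers) as pool:
        partials = list(pool.map(lambda p: toy_search(p, database), partitions))
    return combine(partials)


# Made-up data for illustration only.
database = {"db_a": "ACGTACGTAC", "db_b": "TTTTGGGGCC"}
queries = ["ACGTAC", "TTGGGG", "GTACGT"]
hits = run(queries, database, n_workers=2)
print(hits)
```

Because each partition is searched independently, the result does not depend on the number of workers; only the elapsed time does, which is what the speedup figures on the slide measure.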
14. Zentity
Project Trident
Otago-VRE
Word 2010
DATA SHARING
OData Protocol
Excel 2010 OData
15. A RESTful Interface for Data
Just HTTP
• Items as resources, HTTP methods (GET, PUT, …) to act
• Leverage proxies, authentication, ETags, …
Uniform URL convention
• Every piece of information is addressable
• Predictable and flexible URL syntax
Multiple representations
• Use regular HTTP content-type negotiation
• JSON and Atom (full AtomPub support)
http://www.odata.org
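The three ideas on this slide (plain HTTP, a uniform URL convention, and content negotiation) can be sketched by composing an OData query as an ordinary GET request. The service root and the `Publications` entity set are hypothetical, and `odata_request` is a helper invented for this example; `$filter` and `$top` are standard OData system query options.

```python
from urllib.parse import quote
from urllib.request import Request

# Hypothetical service root; any OData producer exposes one.
SERVICE_ROOT = "http://data.example.org/service.svc"

# Representations named on the slide, selected via the Accept header.
MEDIA_TYPES = {"json": "application/json", "atom": "application/atom+xml"}


def odata_request(service_root, entity_set, key=None, options=None, fmt="json"):
    """Prepare (but do not send) a GET for an entity set or a single entity."""
    url = f"{service_root}/{entity_set}"
    if key is not None:
        url += f"({key})"  # address one item, e.g. Publications(42)
    if options:
        url += "?" + "&".join(
            f"{name}={quote(str(value), safe='')}" for name, value in options.items()
        )
    return Request(url, headers={"Accept": MEDIA_TYPES[fmt]}, method="GET")


# Five publications after 2009, requested as JSON.
query = odata_request(
    SERVICE_ROOT, "Publications",
    options={"$filter": "Year gt 2009", "$top": 5},
)
print(query.full_url)

# The same resource convention addresses a single item, here as Atom.
single = odata_request(SERVICE_ROOT, "Publications", key=42, fmt="atom")
print(single.full_url, single.get_header("Accept"))
```

The same URL works from a browser, Excel, or any HTTP client library, which is why the slide stresses that every piece of information is addressable.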
16. OData Producers and OData Consumers
OData Producers
• SharePoint 2010
• IBM WebSphere
• Windows Azure Table Storage & SQL Azure
• Zentity 2.0
• Services:
– Facebook Insights
– Netflix
– Open Government Data Initiative
– Open Science Data Initiative
– DBpedia
OData Consumers
• Web Browsers
• Excel 2010
• LINQPad
• Client libraries for
– JavaScript
– PHP
– Java
– Windows Phone 7
– iPhone (Objective C)
– .NET
http://www.odata.org
17. Zentity
Project Trident
Otago VRE
VISUALIZING DATA
Pivot
Word 2010
Excel 2010 OData