SlideShare a Scribd company logo
1 of 31
Outlining some ideas...
      Darren Oakley - WTSI
        do2@sanger.ac.uk
Ideas for what?


WP4 - Web Portal
How we can all work together
WP4
Web Portal
WP4 objectives
Create a site to display current repository
information
Create DAS-tracks, to display this information in
its genomic context
Create a Biomart. The Biomart will ser ve DAS-
tracks, provide query web-ser vices, and link to
other Biomarts (including EnsMart), greatly
enhancing the search capability and future utility
of the repository
The idea...
De-centralize the data, everyone who
wants in on the portal: use Biomart!
 Standardized
 Web services and DAS out of the box
This makes the data open to all
We promise not to take over the world
The idea...

2 Interfaces:
                             Damian
 New MartView interface
 (advanced search)
                         Us
 Google-like search
 (simple search - “MartSearch”)
The idea...
Turn the portal into a Biomart mashup!
  “In web development, a mashup is a Web
  application that combines data from one or more
  sources into a single integrated tool. The term
  Mashup implies easy, fast integration,
  frequently done by access to open APIs and data
  sources to produce results that were not the
  original reason for producing the raw source
  data” - Wikipedia
Implementation
100% Javascript driven user interface
User goes to the portal enters a search
term, this gets fired against a cloud of
biomarts and returns a coherent
response
No complex controller logic (it shouldn’t
need any)
Javascript?!? Aaargh!!
 The old days...
   Browser incompatibilities, clunky performance
 Now...
   Javascript is fast!
      Chrome, Firefox 3.1, Safari 4, IE 8
   Libraries take care of the cross-browser issues
Obligatory
Architecture
 Drawings
Plan A

                HTTP request

         MartSearch

                Martservice XML query



                Biomart based federation
Plan A

                         HTTP request

            MartSearch

         You Can only federate
         across 2 marts
                         Martservice XML query
         Search times can vary
         greatly with federation


                         Biomart based federation
Plan B


                HTTP request

         MartSearch


                Martser vice XML query to
                each mart, perform
                federation on the fly
Plan B


                                  HTTP request

                    MartSearchattribute
         Searching on more than one
         requires many XML requests per mart
         No way to page results
                                 Martser vice XML query to
         No way of doing OR queries
                                 each mart, perform
         No way of doing loose text queries
                                 federation on the fly
Plan C

                                         HTTP request
                          1
                                  MartSearch

Send query to Lucene based
search index and retrieve paged
list of genes and linking IDs           2
                                               Martservice XML
                                               query to each mart




  0    Index the searchable fields from the biomarts
Plan C

                           FAST search results     HTTP request
                           Can do loose text and OR queries
                          1
                           Pagination
                                      MartSearch
                           Solr takes care of the federation for you
Send query to Lucene based
search index and retrieve paged
list of genes and linking IDs                     2
                                                         Martservice XML
                                                         query to each mart

                           One more software stack to accommodate
                           Need to re-build index after mart rebuild




  0    Index the searchable fields from the biomarts
Demo
http://www.i-dcc.org/dev/martsearch/
Home
Search
Refined searches
Fast, flexible searching
Customizable
  Add and remove data source from display
  Restrict the data coming back from source
Extensible
  Adding in new data sources should be easy
  Custom templates for every data source
Open
  Anyone can access the data and index (via ser vices)
  Anyone can get the code
How it works...
Apache Solr
(http://lucene.apache.org/solr)
 Enterprise grade search ser ver built
 upon lucene
 Web service driven
 Represents each search object as
 a document
Document XML
How it works...
jQuery (http://jquery.com)
jQuery UI (http://jqueryui.com)
EJS (http://embeddedjs.com)
ActiveRecord.js
(http://activerecordjs.org)
Jamal (http://jamal-mvc.com)
Moving for ward...
Make (and/or integrate) more marts
  MGI, Komp-DCC, Eurexpress, GXD, EuroPhenome
Portal branding, design, colour, layout
How to represent the data
  Dictated by the type of user...
     Who are our users and what do
     they want from us?!?!?
Get the code!
http://github.com/dazoakley/martsearch/
Working together
Typical scenario
Each group says...
 I’ll take this task - will send you the
 results when it’s ready


If we’re (very) lucky, we get something
sort of coherent in the end
We can be better
   than this!
What we should do...
Open source code on a public repository
  Github, Google Code, Sourceforge
  Or even one of our own - as long as its public
Shared bug tracking / support and wiki
  Github (wiki) + Lighthouse (bug tracking)
  Google Code / Sourceforge
  Host an instance of Redmine or Trac
Get the code!
http://github.com/dazoakley/martsearch/

More Related Content

What's hot

CrossMark How To
CrossMark How ToCrossMark How To
CrossMark How ToCrossref
 
Finding your way with Crossref: Getting Started & Additional Services
Finding your way with Crossref: Getting Started & Additional ServicesFinding your way with Crossref: Getting Started & Additional Services
Finding your way with Crossref: Getting Started & Additional ServicesCrossref
 
CrossCheck iThenticate Admin Webinar
CrossCheck iThenticate Admin WebinarCrossCheck iThenticate Admin Webinar
CrossCheck iThenticate Admin WebinarCrossref
 
Maintaining your metadata
Maintaining your metadataMaintaining your metadata
Maintaining your metadataCrossref
 
Search Engines After The Semanatic Web
Search Engines After The Semanatic WebSearch Engines After The Semanatic Web
Search Engines After The Semanatic Websamar_slideshare
 
Crossref webinar: Stephanie Dawson - SciencOpen Metadata 091118
Crossref webinar: Stephanie Dawson - SciencOpen Metadata 091118Crossref webinar: Stephanie Dawson - SciencOpen Metadata 091118
Crossref webinar: Stephanie Dawson - SciencOpen Metadata 091118Crossref
 
Crossref Crossmark Webinar 102716
Crossref Crossmark Webinar 102716Crossref Crossmark Webinar 102716
Crossref Crossmark Webinar 102716Crossref
 
introduction-to-crossmark-lastest
introduction-to-crossmark-lastestintroduction-to-crossmark-lastest
introduction-to-crossmark-lastestCrossref
 
Getting started with looking up metadata
Getting started with looking up metadata Getting started with looking up metadata
Getting started with looking up metadata Crossref
 
Getting started with Content Registration 012617
Getting started with Content Registration 012617Getting started with Content Registration 012617
Getting started with Content Registration 012617Crossref
 
Similarity check webinar
Similarity check webinar Similarity check webinar
Similarity check webinar Crossref
 
Talis Platform: A Linked Data Engine
Talis Platform: A Linked Data EngineTalis Platform: A Linked Data Engine
Talis Platform: A Linked Data EngineLeigh Dodds
 
Crossmark how-to, webinar held on 11-20-17
Crossmark how-to, webinar held on 11-20-17Crossmark how-to, webinar held on 11-20-17
Crossmark how-to, webinar held on 11-20-17Crossref
 
crossmark update
crossmark updatecrossmark update
crossmark updateCrossref
 
Search engine and web crawler
Search engine and web crawlerSearch engine and web crawler
Search engine and web crawlerishmecse13
 
Cross mark webinar how to
Cross mark webinar how toCross mark webinar how to
Cross mark webinar how toCrossref
 
From Web 2.0 to the Semantic Web: Bridging the Gap in the Newsmedia Industry
From Web 2.0 to the Semantic Web: Bridging the Gap in the Newsmedia IndustryFrom Web 2.0 to the Semantic Web: Bridging the Gap in the Newsmedia Industry
From Web 2.0 to the Semantic Web: Bridging the Gap in the Newsmedia IndustryJoel Amoussou
 
Crossref LIVE UK Online
Crossref LIVE UK OnlineCrossref LIVE UK Online
Crossref LIVE UK OnlineCrossref
 
Getting started with looking up metadata
Getting started with looking up metadataGetting started with looking up metadata
Getting started with looking up metadataCrossref
 
An Extensible Framework to Validate and Build Dataset Profiles
An Extensible Framework to Validate and Build Dataset ProfilesAn Extensible Framework to Validate and Build Dataset Profiles
An Extensible Framework to Validate and Build Dataset ProfilesAhmad Assaf
 

What's hot (20)

CrossMark How To
CrossMark How ToCrossMark How To
CrossMark How To
 
Finding your way with Crossref: Getting Started & Additional Services
Finding your way with Crossref: Getting Started & Additional ServicesFinding your way with Crossref: Getting Started & Additional Services
Finding your way with Crossref: Getting Started & Additional Services
 
CrossCheck iThenticate Admin Webinar
CrossCheck iThenticate Admin WebinarCrossCheck iThenticate Admin Webinar
CrossCheck iThenticate Admin Webinar
 
Maintaining your metadata
Maintaining your metadataMaintaining your metadata
Maintaining your metadata
 
Search Engines After The Semanatic Web
Search Engines After The Semanatic WebSearch Engines After The Semanatic Web
Search Engines After The Semanatic Web
 
Crossref webinar: Stephanie Dawson - SciencOpen Metadata 091118
Crossref webinar: Stephanie Dawson - SciencOpen Metadata 091118Crossref webinar: Stephanie Dawson - SciencOpen Metadata 091118
Crossref webinar: Stephanie Dawson - SciencOpen Metadata 091118
 
Crossref Crossmark Webinar 102716
Crossref Crossmark Webinar 102716Crossref Crossmark Webinar 102716
Crossref Crossmark Webinar 102716
 
introduction-to-crossmark-lastest
introduction-to-crossmark-lastestintroduction-to-crossmark-lastest
introduction-to-crossmark-lastest
 
Getting started with looking up metadata
Getting started with looking up metadata Getting started with looking up metadata
Getting started with looking up metadata
 
Getting started with Content Registration 012617
Getting started with Content Registration 012617Getting started with Content Registration 012617
Getting started with Content Registration 012617
 
Similarity check webinar
Similarity check webinar Similarity check webinar
Similarity check webinar
 
Talis Platform: A Linked Data Engine
Talis Platform: A Linked Data EngineTalis Platform: A Linked Data Engine
Talis Platform: A Linked Data Engine
 
Crossmark how-to, webinar held on 11-20-17
Crossmark how-to, webinar held on 11-20-17Crossmark how-to, webinar held on 11-20-17
Crossmark how-to, webinar held on 11-20-17
 
crossmark update
crossmark updatecrossmark update
crossmark update
 
Search engine and web crawler
Search engine and web crawlerSearch engine and web crawler
Search engine and web crawler
 
Cross mark webinar how to
Cross mark webinar how toCross mark webinar how to
Cross mark webinar how to
 
From Web 2.0 to the Semantic Web: Bridging the Gap in the Newsmedia Industry
From Web 2.0 to the Semantic Web: Bridging the Gap in the Newsmedia IndustryFrom Web 2.0 to the Semantic Web: Bridging the Gap in the Newsmedia Industry
From Web 2.0 to the Semantic Web: Bridging the Gap in the Newsmedia Industry
 
Crossref LIVE UK Online
Crossref LIVE UK OnlineCrossref LIVE UK Online
Crossref LIVE UK Online
 
Getting started with looking up metadata
Getting started with looking up metadataGetting started with looking up metadata
Getting started with looking up metadata
 
An Extensible Framework to Validate and Build Dataset Profiles
An Extensible Framework to Validate and Build Dataset ProfilesAn Extensible Framework to Validate and Build Dataset Profiles
An Extensible Framework to Validate and Build Dataset Profiles
 

Similar to Outlining some ideas... - I-DCC Kick Off Metting

HIGWGET-A Model for Crawling Secure Hidden WebPages
HIGWGET-A Model for Crawling Secure Hidden WebPagesHIGWGET-A Model for Crawling Secure Hidden WebPages
HIGWGET-A Model for Crawling Secure Hidden WebPagesijdkp
 
3 Understanding Search
3 Understanding Search3 Understanding Search
3 Understanding Searchmasiclat
 
An Intelligent Meta Search Engine for Efficient Web Document Retrieval
An Intelligent Meta Search Engine for Efficient Web Document RetrievalAn Intelligent Meta Search Engine for Efficient Web Document Retrieval
An Intelligent Meta Search Engine for Efficient Web Document Retrievaliosrjce
 
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...ijceronline
 
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...ijceronline
 
IRJET - Review on Search Engine Optimization
IRJET - Review on Search Engine OptimizationIRJET - Review on Search Engine Optimization
IRJET - Review on Search Engine OptimizationIRJET Journal
 
Smart Crawler: A Two Stage Crawler for Concept Based Semantic Search Engine.
Smart Crawler: A Two Stage Crawler for Concept Based Semantic Search Engine.Smart Crawler: A Two Stage Crawler for Concept Based Semantic Search Engine.
Smart Crawler: A Two Stage Crawler for Concept Based Semantic Search Engine.iosrjce
 
IRJET- A Two-Way Smart Web Spider
IRJET- A Two-Way Smart Web SpiderIRJET- A Two-Way Smart Web Spider
IRJET- A Two-Way Smart Web SpiderIRJET Journal
 
Role of Text Mining in Search Engine
Role of Text Mining in Search EngineRole of Text Mining in Search Engine
Role of Text Mining in Search EngineJay R Modi
 
What are the different types of web scraping approaches
What are the different types of web scraping approachesWhat are the different types of web scraping approaches
What are the different types of web scraping approachesAparna Sharma
 
Web Search Engine, Web Crawler, and Semantics Web
Web Search Engine, Web Crawler, and Semantics WebWeb Search Engine, Web Crawler, and Semantics Web
Web Search Engine, Web Crawler, and Semantics WebAatif19921
 
Consuming Linked Data 4/5 Semtech2011
Consuming Linked Data 4/5 Semtech2011Consuming Linked Data 4/5 Semtech2011
Consuming Linked Data 4/5 Semtech2011Juan Sequeda
 
Vision Based Deep Web data Extraction on Nested Query Result Records
Vision Based Deep Web data Extraction on Nested Query Result RecordsVision Based Deep Web data Extraction on Nested Query Result Records
Vision Based Deep Web data Extraction on Nested Query Result RecordsIJMER
 
Beyond simple search – adding business value in the enterprise
Beyond simple search – adding business value in the enterpriseBeyond simple search – adding business value in the enterprise
Beyond simple search – adding business value in the enterpriselucenerevolution
 
Working of search engines(rohit sahu cs 17) 5th sem
Working of search engines(rohit sahu cs 17) 5th semWorking of search engines(rohit sahu cs 17) 5th sem
Working of search engines(rohit sahu cs 17) 5th semROHIT SAHU
 

Similar to Outlining some ideas... - I-DCC Kick Off Metting (20)

HIGWGET-A Model for Crawling Secure Hidden WebPages
HIGWGET-A Model for Crawling Secure Hidden WebPagesHIGWGET-A Model for Crawling Secure Hidden WebPages
HIGWGET-A Model for Crawling Secure Hidden WebPages
 
3 Understanding Search
3 Understanding Search3 Understanding Search
3 Understanding Search
 
G017254554
G017254554G017254554
G017254554
 
An Intelligent Meta Search Engine for Efficient Web Document Retrieval
An Intelligent Meta Search Engine for Efficient Web Document RetrievalAn Intelligent Meta Search Engine for Efficient Web Document Retrieval
An Intelligent Meta Search Engine for Efficient Web Document Retrieval
 
Applications of semantic web
Applications of semantic webApplications of semantic web
Applications of semantic web
 
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
 
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
 
IRJET - Review on Search Engine Optimization
IRJET - Review on Search Engine OptimizationIRJET - Review on Search Engine Optimization
IRJET - Review on Search Engine Optimization
 
E017624043
E017624043E017624043
E017624043
 
Smart Crawler: A Two Stage Crawler for Concept Based Semantic Search Engine.
Smart Crawler: A Two Stage Crawler for Concept Based Semantic Search Engine.Smart Crawler: A Two Stage Crawler for Concept Based Semantic Search Engine.
Smart Crawler: A Two Stage Crawler for Concept Based Semantic Search Engine.
 
IRJET- A Two-Way Smart Web Spider
IRJET- A Two-Way Smart Web SpiderIRJET- A Two-Way Smart Web Spider
IRJET- A Two-Way Smart Web Spider
 
Role of Text Mining in Search Engine
Role of Text Mining in Search EngineRole of Text Mining in Search Engine
Role of Text Mining in Search Engine
 
What are the different types of web scraping approaches
What are the different types of web scraping approachesWhat are the different types of web scraping approaches
What are the different types of web scraping approaches
 
Web Search Engine, Web Crawler, and Semantics Web
Web Search Engine, Web Crawler, and Semantics WebWeb Search Engine, Web Crawler, and Semantics Web
Web Search Engine, Web Crawler, and Semantics Web
 
Consuming Linked Data 4/5 Semtech2011
Consuming Linked Data 4/5 Semtech2011Consuming Linked Data 4/5 Semtech2011
Consuming Linked Data 4/5 Semtech2011
 
Vision Based Deep Web data Extraction on Nested Query Result Records
Vision Based Deep Web data Extraction on Nested Query Result RecordsVision Based Deep Web data Extraction on Nested Query Result Records
Vision Based Deep Web data Extraction on Nested Query Result Records
 
sunny-slides
sunny-slidessunny-slides
sunny-slides
 
Beyond simple search – adding business value in the enterprise
Beyond simple search – adding business value in the enterpriseBeyond simple search – adding business value in the enterprise
Beyond simple search – adding business value in the enterprise
 
Chachra, "Improving Discovery Systems Through Post Processing of Harvested Data"
Chachra, "Improving Discovery Systems Through Post Processing of Harvested Data"Chachra, "Improving Discovery Systems Through Post Processing of Harvested Data"
Chachra, "Improving Discovery Systems Through Post Processing of Harvested Data"
 
Working of search engines(rohit sahu cs 17) 5th sem
Working of search engines(rohit sahu cs 17) 5th semWorking of search engines(rohit sahu cs 17) 5th sem
Working of search engines(rohit sahu cs 17) 5th sem
 

Recently uploaded

Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Jeffrey Haguewood
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...apidays
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024The Digital Insurer
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businesspanagenda
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Zilliz
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxRustici Software
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Victor Rentea
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxRemote DBA Services
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...Zilliz
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamUiPathCommunity
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusZilliz
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdfSandro Moreira
 

Recently uploaded (20)

Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptx
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 

Outlining some ideas... - I-DCC Kick Off Metting

  • 1. Outlining some ideas... Darren Oakley - WTSI do2@sanger.ac.uk
  • 2. Ideas for what? WP4 - Web Portal How we can all work together
  • 4. WP4 objectives Create a site to display current repository information Create DAS-tracks, to display this information in its genomic context Create a Biomart. The Biomart will ser ve DAS- tracks, provide query web-ser vices, and link to other Biomarts (including EnsMart), greatly enhancing the search capability and future utility of the repository
  • 5. The idea... De-centralize the data, everyone who wants in on the portal: use Biomart! Standardized Web services and DAS out of the box This makes the data open to all We promise not to take over the world
  • 6. The idea... 2 Interfaces: Damian New MartView interface (advanced search) Us Google-like search (simple search - “MartSearch”)
  • 7. The idea... Turn the portal into a Biomart mashup! “In web development, a mashup is a Web application that combines data from one or more sources into a single integrated tool. The term Mashup implies easy, fast integration, frequently done by access to open APIs and data sources to produce results that were not the original reason for producing the raw source data” - Wikipedia
  • 8. Implementation 100% Javascript driven user interface User goes to the portal enters a search term, this gets fired against a cloud of biomarts and returns a coherent response No complex controller logic (it shouldn’t need any)
  • 9. Javascript?!? Aaargh!! The old days... Browser incompatibilities, clunky performance Now... Javascript is fast! Chrome, Firefox 3.1, Safari 4, IE 8 Libraries take care of the cross-browser issues
  • 11. Plan A HTTP request MartSearch Martservice XML query Biomart based federation
  • 12. Plan A HTTP request MartSearch You Can only federate across 2 marts Martservice XML query Search times can vary greatly with federation Biomart based federation
  • 13. Plan B HTTP request MartSearch Martser vice XML query to each mart, perform federation on the fly
  • 14. Plan B HTTP request MartSearchattribute Searching on more than one requires many XML requests per mart No way to page results Martser vice XML query to No way of doing OR queries each mart, perform No way of doing loose text queries federation on the fly
  • 15. Plan C HTTP request 1 MartSearch Send query to Lucene based search index and retrieve paged list of genes and linking IDs 2 Martservice XML query to each mart 0 Index the searchable fields from the biomarts
  • 16. Plan C FAST search results HTTP request Can do loose text and OR queries 1 Pagination MartSearch Solr takes care of the federation for you Send query to Lucene based search index and retrieve paged list of genes and linking IDs 2 Martservice XML query to each mart One more software stack to accommodate Need to re-build index after mart rebuild 0 Index the searchable fields from the biomarts
  • 18. Home
  • 21. Fast, flexible searching Customizable Add and remove data source from display Restrict the data coming back from source Extensible Adding in new data sources should be easy Custom templates for every data source Open Anyone can access the data and index (via ser vices) Anyone can get the code
  • 22. How it works... Apache Solr (http://lucene.apache.org/solr) Enterprise grade search ser ver built upon lucene Web service driven Represents each search object as a document
  • 24. How it works... jQuery (http://jquery.com) jQuery UI (http://jqueryui.com) EJS (http://embeddedjs.com) ActiveRecord.js (http://activerecordjs.org) Jamal (http://jamal-mvc.com)
  • 25. Moving for ward... Make (and/or integrate) more marts MGI, Komp-DCC, Eurexpress, GXD, EuroPhenome Portal branding, design, colour, layout How to represent the data Dictated by the type of user... Who are our users and what do they want from us?!?!?
  • 28. Typical scenario Each group says... I’ll take this task - will send you the results when it’s ready If we’re (very) lucky, we get something sort of coherent in the end
  • 29. We can be better than this!
  • 30. What we should do... Open source code on a public repository Github, Google Code, Sourceforge Or even one of our own - as long as its public Shared bug tracking / support and wiki Github (wiki) + Lighthouse (bug tracking) Google Code / Sourceforge Host an instance of Redmine or Trac