SlideShare a Scribd company logo
1 of 52
Sanger Mouse
 Resources Portal
A Testbed for Collaborative Data Integration


         Darren Oakley, Vivek Iyer, Bill Skarnes
Making a
Collaborative Data
     Portal...
‘Borg’ Approach

         •   Single group becomes
             sole owner/curator of
             portal and its data

         •   Other groups feed
             their data into portal
             group
burp
Why This Works


•   Clearly defined centre

•   It provides central curation for all data
Mouse Informatics

    •   Genes

        •   Mutants (ES Cells, Mice)

            •   Phenotypes

•   In mouse informatics, the traditional
    Borg is MGI - this has worked nicely
    for many years: http://informatics.jax.org
Mouse Informatics

•   Times are changing...

    •   Other informatics groups are providing
        high volume data and want in on the
        portal game
“Hand over your data,
prepare to be assimilated”




                                                                “No, YOU hand over your data and
                                                                   prepare to be assimilated”




                         “Ahem, both of you, prepare to be assimilated!”
“Hand over your data,
prepare to be assimilated”




                                                                                 “No, YOU hand over your data and
                                                                                    prepare to be assimilated”




                                              ?
                                       lB org
                                 he rea
                             t
                   yo u is
          c h of
  …   whi
                                          “Ahem, both of you, prepare to be assimilated!”
‘Federation’ Approach
            •   Each group hosts
                their own data and
                exposes it via defined
                services

            •   Make a ‘clever’ portal
                that pulls of these
                resources together

            •   No single group is
                totally in charge
The Sanger Mouse
 Resources Portal
 http://www.sanger.ac.uk/mouseportal

(Our Attempt at the Federation Approach...)
Distributed Data
•   Currently 5 distinct, but related sets of
    mouse data:

    •   Gene Information

    •   Phenotyping

    •   Mutant Mouse Breeding

    •   Mutant ES Cell / Vector Production

    •   Other DNA Resources
Screenshot Tour
Technologies
Search Engine
                Portal Interface



Data Services
index searchable
     terms
index searchable
     terms
MartSearch / Portal




index searchable
     terms
MartSearch / Portal


send users search term to Solr




index searchable
     terms
MartSearch / Portal


send users search term to Solr

     Solr returns groups of terms
    to query Biomarts with




index searchable
     terms
MartSearch / Portal


send users search term to Solr

     Solr returns groups of terms
    to query Biomarts with
                                     send asynchronous requests to each of the
                                    Biomarts for the data the user is interested in




index searchable
     terms
User searches for ‘Cbx7’
User searches for ‘Cbx7’


     Search for ‘Cbx7’
User searches for ‘Cbx7’


     Search for ‘Cbx7’
     JSON data containing information on
     what to search each biomart by...
User searches for ‘Cbx7’


     Search for ‘Cbx7’
     JSON data containing information on
     what to search each biomart by...


     Search using query parameters
     defined by Solr response
User searches for ‘Cbx7’


     Search for ‘Cbx7’
     JSON data containing information on
     what to search each biomart by...


     Search using query parameters
     defined by Solr response

     Render search results using templates
Extending The Portal

•   Put new data into a Biomart

•   Write JSON config file for MartSearch
    (defining filters to index and use)

•   Rebuild the index
Advantages


•   Easily extensible

•   Data responsibility shared
Disadvantages

•   Hard to avoid redundancy

    •   Sometimes needed for data linking

•   Un-curated

    •   Each group can curate its own data

    •   No curation as a whole
Disclaimer
•   Windows users...

    •   If you use IE - it will eat your browser

    •   Use Firefox/Chrome/Safari/Opera for
        a more pleasant internet experience

    •   We are working on it - IE 8 gives an ok
        experience...
Acknowledgments
•   Funding: I-DCC grant (EU FP7)

    •   Coordination of informatic resources
        from high-throughput mouse ES cell
        mutagensis programs

•   Wellcome Trust Sanger Institute

    •   T87 - ES Cell Mutagenesis

    •   MIG - Mouse Informatics Group
http://www.sanger.ac.uk/mouseportal

http://github.com/dazoakley/martsearch

do2@sanger.ac.uk

dazoakley

More Related Content

Similar to The Sanger Mouse Resources Portal - A Testbed for Collaborative Data Integration

Seasr Overview Ws April 2009
Seasr Overview Ws April 2009Seasr Overview Ws April 2009
Seasr Overview Ws April 2009Loretta Auvil
 
Predictive modeling DBs
Predictive modeling DBsPredictive modeling DBs
Predictive modeling DBsDataVita
 
Data encoding and Metadata for Streams
Data encoding and Metadata for StreamsData encoding and Metadata for Streams
Data encoding and Metadata for Streamsunivalence
 
i5k Workspace Workshop - AGS2017
i5k Workspace Workshop - AGS2017i5k Workspace Workshop - AGS2017
i5k Workspace Workshop - AGS2017Monica Poelchau
 
Hello Open World - The Web of Data for the Pragmatic Developer
Hello Open World - The Web of Data for the Pragmatic DeveloperHello Open World - The Web of Data for the Pragmatic Developer
Hello Open World - The Web of Data for the Pragmatic DeveloperAlexandre Passant
 
Toward a Mobile Data Commons
Toward a Mobile Data CommonsToward a Mobile Data Commons
Toward a Mobile Data CommonskingsBSD
 
20070404: UK federation and Shibboleth: Nuts And Bolts
20070404: UK federation and Shibboleth: Nuts And Bolts20070404: UK federation and Shibboleth: Nuts And Bolts
20070404: UK federation and Shibboleth: Nuts And Boltsiay
 
The life changing magic of tidying up your data: The art and science of makin...
The life changing magic of tidying up your data: The art and science of makin...The life changing magic of tidying up your data: The art and science of makin...
The life changing magic of tidying up your data: The art and science of makin...MEASURE Evaluation
 
Tim O'Reilly Mashup Camp 2008
Tim O'Reilly Mashup Camp 2008Tim O'Reilly Mashup Camp 2008
Tim O'Reilly Mashup Camp 2008Tim O'Reilly
 
DataONE_cobb_hubbub2012_20120924_v05
DataONE_cobb_hubbub2012_20120924_v05DataONE_cobb_hubbub2012_20120924_v05
DataONE_cobb_hubbub2012_20120924_v05John Cobb
 
Understanding Research 2.0 from a Socio-technical Perspective
Understanding Research 2.0 from a Socio-technical PerspectiveUnderstanding Research 2.0 from a Socio-technical Perspective
Understanding Research 2.0 from a Socio-technical PerspectiveYuwei Lin
 
Správa vzdáleného přístupu k elektronickým informačním zdrojům
Správa vzdáleného přístupu k elektronickým informačním zdrojůmSpráva vzdáleného přístupu k elektronickým informačním zdrojům
Správa vzdáleného přístupu k elektronickým informačním zdrojůmJiri Pavlik
 
tranSMART Community Meeting 5-7 Nov 13 - Session 1: Chilly-Mazarin Meeting Ob...
tranSMART Community Meeting 5-7 Nov 13 - Session 1: Chilly-Mazarin Meeting Ob...tranSMART Community Meeting 5-7 Nov 13 - Session 1: Chilly-Mazarin Meeting Ob...
tranSMART Community Meeting 5-7 Nov 13 - Session 1: Chilly-Mazarin Meeting Ob...David Peyruc
 
A Framework for Dynamic Data Source Identification and Orchestration on the Web
A Framework for Dynamic Data Source Identification and Orchestration on the WebA Framework for Dynamic Data Source Identification and Orchestration on the Web
A Framework for Dynamic Data Source Identification and Orchestration on the Webmashups
 
API's, Freebase, and the Collaborative Semantic web
API's, Freebase, and the Collaborative Semantic webAPI's, Freebase, and the Collaborative Semantic web
API's, Freebase, and the Collaborative Semantic webDan Delany
 

Similar to The Sanger Mouse Resources Portal - A Testbed for Collaborative Data Integration (20)

Seasr Overview Ws April 2009
Seasr Overview Ws April 2009Seasr Overview Ws April 2009
Seasr Overview Ws April 2009
 
Predictive modeling DBs
Predictive modeling DBsPredictive modeling DBs
Predictive modeling DBs
 
SEASR eScience 2008
SEASR eScience 2008SEASR eScience 2008
SEASR eScience 2008
 
Data encoding and Metadata for Streams
Data encoding and Metadata for StreamsData encoding and Metadata for Streams
Data encoding and Metadata for Streams
 
i5k Workspace Workshop - AGS2017
i5k Workspace Workshop - AGS2017i5k Workspace Workshop - AGS2017
i5k Workspace Workshop - AGS2017
 
Hello Open World - The Web of Data for the Pragmatic Developer
Hello Open World - The Web of Data for the Pragmatic DeveloperHello Open World - The Web of Data for the Pragmatic Developer
Hello Open World - The Web of Data for the Pragmatic Developer
 
Toward a Mobile Data Commons
Toward a Mobile Data CommonsToward a Mobile Data Commons
Toward a Mobile Data Commons
 
20070404: UK federation and Shibboleth: Nuts And Bolts
20070404: UK federation and Shibboleth: Nuts And Bolts20070404: UK federation and Shibboleth: Nuts And Bolts
20070404: UK federation and Shibboleth: Nuts And Bolts
 
The life changing magic of tidying up your data: The art and science of makin...
The life changing magic of tidying up your data: The art and science of makin...The life changing magic of tidying up your data: The art and science of makin...
The life changing magic of tidying up your data: The art and science of makin...
 
Tim O'Reilly Mashup Camp 2008
Tim O'Reilly Mashup Camp 2008Tim O'Reilly Mashup Camp 2008
Tim O'Reilly Mashup Camp 2008
 
SEASR Overview
SEASR OverviewSEASR Overview
SEASR Overview
 
DataONE_cobb_hubbub2012_20120924_v05
DataONE_cobb_hubbub2012_20120924_v05DataONE_cobb_hubbub2012_20120924_v05
DataONE_cobb_hubbub2012_20120924_v05
 
Understanding Research 2.0 from a Socio-technical Perspective
Understanding Research 2.0 from a Socio-technical PerspectiveUnderstanding Research 2.0 from a Socio-technical Perspective
Understanding Research 2.0 from a Socio-technical Perspective
 
Správa vzdáleného přístupu k elektronickým informačním zdrojům
Správa vzdáleného přístupu k elektronickým informačním zdrojůmSpráva vzdáleného přístupu k elektronickým informačním zdrojům
Správa vzdáleného přístupu k elektronickým informačním zdrojům
 
A View on eScience
A View on eScienceA View on eScience
A View on eScience
 
DataHub
DataHubDataHub
DataHub
 
tranSMART Community Meeting 5-7 Nov 13 - Session 1: Chilly-Mazarin Meeting Ob...
tranSMART Community Meeting 5-7 Nov 13 - Session 1: Chilly-Mazarin Meeting Ob...tranSMART Community Meeting 5-7 Nov 13 - Session 1: Chilly-Mazarin Meeting Ob...
tranSMART Community Meeting 5-7 Nov 13 - Session 1: Chilly-Mazarin Meeting Ob...
 
SEASR Overview
SEASR OverviewSEASR Overview
SEASR Overview
 
A Framework for Dynamic Data Source Identification and Orchestration on the Web
A Framework for Dynamic Data Source Identification and Orchestration on the WebA Framework for Dynamic Data Source Identification and Orchestration on the Web
A Framework for Dynamic Data Source Identification and Orchestration on the Web
 
API's, Freebase, and the Collaborative Semantic web
API's, Freebase, and the Collaborative Semantic webAPI's, Freebase, and the Collaborative Semantic web
API's, Freebase, and the Collaborative Semantic web
 

Recently uploaded

"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...AliaaTarek5
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfMounikaPolabathina
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersNicole Novielli
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterMydbops
 

Recently uploaded (20)

"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdf
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software Developers
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL Router
 

The Sanger Mouse Resources Portal - A Testbed for Collaborative Data Integration

  • 1. Sanger Mouse Resources Portal A Testbed for Collaborative Data Integration Darren Oakley, Vivek Iyer, Bill Skarnes
  • 3. ‘Borg’ Approach • Single group becomes sole owner/curator of portal and its data • Other groups feed their data into portal group
  • 4.
  • 5.
  • 7. Why This Works • Clearly defined centre • It provides central curation for all data
  • 8. Mouse Informatics • Genes • Mutants (ES Cells, Mice) • Phenotypes • In mouse informatics, the traditional Borg is MGI - this has worked nicely for many years: http://informatics.jax.org
  • 9. Mouse Informatics • Times are changing... • Other informatics groups are providing high volume data and want in on the portal game
  • 10.
  • 11. “Hand over your data, prepare to be assimilated” “No, YOU hand over your data and prepare to be assimilated” “Ahem, both of you, prepare to be assimilated!”
  • 12. “Hand over your data, prepare to be assimilated” “No, YOU hand over your data and prepare to be assimilated” ? lB org he rea t yo u is c h of … whi “Ahem, both of you, prepare to be assimilated!”
  • 13. ‘Federation’ Approach • Each group hosts their own data and exposes it via defined services • Make a ‘clever’ portal that pulls of these resources together • No single group is totally in charge
  • 14.
  • 15.
  • 16.
  • 17.
  • 18.
  • 19.
  • 20.
  • 21. The Sanger Mouse Resources Portal http://www.sanger.ac.uk/mouseportal (Our Attempt at the Federation Approach...)
  • 22.
  • 23. Distributed Data • Currently 5 distinct, but related sets of mouse data: • Gene Information • Phenotyping • Mutant Mouse Breeding • Mutant ES Cell / Vector Production • Other DNA Resources
  • 25.
  • 26.
  • 27.
  • 28.
  • 29.
  • 30.
  • 31.
  • 32. Technologies Search Engine Portal Interface Data Services
  • 33.
  • 34.
  • 37. MartSearch / Portal index searchable terms
  • 38. MartSearch / Portal send users search term to Solr index searchable terms
  • 39. MartSearch / Portal send users search term to Solr Solr returns groups of terms to query Biomarts with index searchable terms
  • 40. MartSearch / Portal send users search term to Solr Solr returns groups of terms to query Biomarts with send asynchronous requests to each of the Biomarts for the data the user is interested in index searchable terms
  • 41.
  • 42. User searches for ‘Cbx7’
  • 43. User searches for ‘Cbx7’ Search for ‘Cbx7’
  • 44. User searches for ‘Cbx7’ Search for ‘Cbx7’ JSON data containing information on what to search each biomart by...
  • 45. User searches for ‘Cbx7’ Search for ‘Cbx7’ JSON data containing information on what to search each biomart by... Search using query parameters defined by Solr response
  • 46. User searches for ‘Cbx7’ Search for ‘Cbx7’ JSON data containing information on what to search each biomart by... Search using query parameters defined by Solr response Render search results using templates
  • 47. Extending The Portal • Put new data into a Biomart • Write JSON config file for MartSearch (defining filters to index and use) • Rebuild the index
  • 48. Advantages • Easily extensible • Data responsibility shared
  • 49. Disadvantages • Hard to avoid redundancy • Sometimes needed for data linking • Un-curated • Each group can curate its own data • No curation as a whole
  • 50. Disclaimer • Windows users... • If you use IE - it will eat your browser • Use Firefox/Chrome/Safari/Opera for a more pleasant internet experience • We are working on it - IE 8 gives an ok experience...
  • 51. Acknowledgments • Funding: I-DCC grant (EU FP7) • Coordination of informatic resources from high-throughput mouse ES cell mutagensis programs • Wellcome Trust Sanger Institute • T87 - ES Cell Mutagenesis • MIG - Mouse Informatics Group