SlideShare a Scribd company logo
1 of 29
Download to read offline
Crowdsourcing Genome
                           23.01.12, Bastian Greshake
Wide Association Studies
some words about me

• BSc in Life Sciences (2010)

• Working at Biodiversity & Climate
  Research Center (since 2010)

• MSc studies at the Goethe University in
  Frankfurt/Main (since 2011)

• Not exactly a biologist with much
  professional background in human
  genetics, but...
some words about me

• some background in data mining (mainly
  transcriptomics)

• some experience with web applications

• interest in social media & crowd-sourcing

• customer of DTC genetic testing myself
finding DTC results up to now
mining DTC genetic
tests

• results are hidden somewhere
  on the web

• often no phenotypic annotation

• not easily re-usable
let’s code it:

• wants to be a central repository for sharing DTC results

• enables users to share phenotypes as well

• lowers barrier to participate

• motivation to share through benefits for users

• can we take it a step further and provide data for GWAS?
mining DTC genetic tests

• lots of potential for open data (100k+ customers)

• cheap data source for scientists
                                 Would you share DTC test results? (n=226)



                                                 6 %


                                        26 %



                                                            68 %
                     Yes
                     Only with DTC company
                     No
the front
technical implementation

     • framework: Ruby on Rails

     • database: PostgreSQL

     • task management via resque (known of GitHub)

     • basic API via JSON-queries
other resources

• Personal Genome Project

     • data is open

     • participation not
Personal Genome
          Project
other resources

• Personal Genome Project

      • data is open

      • participation not

      • no easy way to download data, no API etc.

• genomera

      • participation will be open (currently invited beta)

      • focus on small scale studies/experiments
genomera
problems & potential of patient driven/crowd-
sourced research

• problems

    • sample sizes

    • bias in participants

    • motivation of participants

    • accuracy of data

• potential

    • possible sample sizes

    • low costs

    • "warm fuzzy feeling inside" for patients
positive examples: PatientsLikeMe

• around since ~2006

• published a dozen studies since then

• famous example: ALS research on lithium carbonate
  intake (149 patients, 447 controls)




                          Paul Wicks et al. (2011) Accelerated clinical discovery using self-reported patient data
                          collected online and a patient-matching algorithm, Nature Biotechnology 29, 411–414
positive examples: 23andMe

• published some studies in 2010/2011

• done with self-reported data

• studies include 10.000+ to 30.000+ participants
positive examples: 23andMe – general traits




“
Replications of associations [...] for hair color, eye color,
and freckling validate the Web-based, self-reporting
paradigm. The identification of novel associations for hair
morphology [...], freckling [...], the ability to smell the
methanethiol produced after eating asparagus [...], and
photic sneeze reflex [...] illustrates the power of the
approach.



                           Nicolas Eriksson et al. (2010) Web-Based, Participant-Driven Studies Yield Novel
                           Genetic Associations for Common Traits. PLoS Genet 6(6): e1000993. doi:10.1371/
                           journal.pgen.1000993
positive examples: 23andMe – Parkinson’s Disease




“
We discovered two novel, genome-wide significant
associations with [Parkinson’s Disease]—both replicated
in an independent cohort. We also replicated 20
previously discovered genetic associations (including
LRRK2, GBA, SNCA, MAPT, GAK, and the HLA region),
providing support for our novel study design.




                        Chuong B. Do et al. (2011) Web-Based Genome-Wide Association Study Identifies
                        Two Novel Loci and a Substantial Genetic Component for Parkinson's Disease. PLoS
                        Genet 7(6): e1002141. doi:10.1371/journal.pgen.1002141
Quantified Self and Science
Quantified Self Movement
QS projects

• tracking health in response to work-outs (minimizing
  impacts of disease/genetic predisposition)

• track response to different drugs

• tracking well-being in response to eating habits (butter vs
  arithmetics)
butter vs arithmetics




                        source: Seth Roberts - quantifiedself.com
my conclusions

• technology enables new kinds of research

• DTC results and patient driven research can lead to new
  scientific knowledge

• can be a valuable addition to traditional research
openSNP: now & future

• won the Mendeley/PLoS Binary Battle in 2011

• got some funding of the German WikiMedia foundation to
  get more people genotyped

• collaborating with consent to research to get IRB
  approved consent-process

• working on implementing the Distributed Annotation
  System
thanks for your attention
                            source: xkcd.com
                            CC-BY-NC

More Related Content

What's hot

Ai in drug design webinar 26 feb 2019
Ai in drug design webinar 26 feb 2019Ai in drug design webinar 26 feb 2019
Ai in drug design webinar 26 feb 2019Pistoia Alliance
 
Why study Data Sharing? (+ why share your data)
Why study Data Sharing?  (+ why share your data)Why study Data Sharing?  (+ why share your data)
Why study Data Sharing? (+ why share your data)Heather Piwowar
 
ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...
ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...
ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...Carole Goble
 
THOR Workshop - Introduction
THOR Workshop - IntroductionTHOR Workshop - Introduction
THOR Workshop - IntroductionMaaike Duine
 
Semantic Web & Web 3.0 empowering real world outcomes in biomedical research ...
Semantic Web & Web 3.0 empowering real world outcomes in biomedical research ...Semantic Web & Web 3.0 empowering real world outcomes in biomedical research ...
Semantic Web & Web 3.0 empowering real world outcomes in biomedical research ...Amit Sheth
 
Bioinformatics in the Era of Open Science and Big Data
Bioinformatics in the Era of Open Science and Big DataBioinformatics in the Era of Open Science and Big Data
Bioinformatics in the Era of Open Science and Big DataPhilip Bourne
 
Combining Explicit and Latent Web Semantics for Maintaining Knowledge Graphs
Combining Explicit and Latent Web Semantics for Maintaining Knowledge GraphsCombining Explicit and Latent Web Semantics for Maintaining Knowledge Graphs
Combining Explicit and Latent Web Semantics for Maintaining Knowledge GraphsPaul Groth
 
THOR Workshop - Data Publishing Elsevier
THOR Workshop - Data Publishing ElsevierTHOR Workshop - Data Publishing Elsevier
THOR Workshop - Data Publishing ElsevierMaaike Duine
 
Scott Edmunds: GigaScience Datacite meeting Rapid Fire Talk
Scott Edmunds: GigaScience Datacite meeting Rapid Fire TalkScott Edmunds: GigaScience Datacite meeting Rapid Fire Talk
Scott Edmunds: GigaScience Datacite meeting Rapid Fire TalkGigaScience, BGI Hong Kong
 
Mining 'Bigger' Datasets to Create, Validate and Share Machine Learning Models
Mining 'Bigger' Datasets to Create, Validate and Share Machine Learning ModelsMining 'Bigger' Datasets to Create, Validate and Share Machine Learning Models
Mining 'Bigger' Datasets to Create, Validate and Share Machine Learning ModelsSean Ekins
 
Globus Genomics: Democratizing NGS Analysis
Globus Genomics: Democratizing NGS AnalysisGlobus Genomics: Democratizing NGS Analysis
Globus Genomics: Democratizing NGS AnalysisRavi Madduri
 
SEEK for Science: A Data and Model Management Platform to support Open and Re...
SEEK for Science: A Data and Model Management Platform to support Open and Re...SEEK for Science: A Data and Model Management Platform to support Open and Re...
SEEK for Science: A Data and Model Management Platform to support Open and Re...Carole Goble
 
RARE and FAIR Science: Reproducibility and Research Objects
RARE and FAIR Science: Reproducibility and Research ObjectsRARE and FAIR Science: Reproducibility and Research Objects
RARE and FAIR Science: Reproducibility and Research ObjectsCarole Goble
 
OpenTox - an open community and framework supporting predictive toxicology an...
OpenTox - an open community and framework supporting predictive toxicology an...OpenTox - an open community and framework supporting predictive toxicology an...
OpenTox - an open community and framework supporting predictive toxicology an...Barry Hardy
 
THOR Workshop - Services PANGAEA
THOR Workshop - Services PANGAEATHOR Workshop - Services PANGAEA
THOR Workshop - Services PANGAEAMaaike Duine
 
P4 c2011 slides ekins
P4 c2011 slides ekinsP4 c2011 slides ekins
P4 c2011 slides ekinsSean Ekins
 
Being Reproducible: SSBSS Summer School 2017
Being Reproducible: SSBSS Summer School 2017Being Reproducible: SSBSS Summer School 2017
Being Reproducible: SSBSS Summer School 2017Carole Goble
 
Enhancing Data Integration with Text Analysis to Find Genes Implicated in Pla...
Enhancing Data Integration with Text Analysis to Find Genes Implicated in Pla...Enhancing Data Integration with Text Analysis to Find Genes Implicated in Pla...
Enhancing Data Integration with Text Analysis to Find Genes Implicated in Pla...Catherine Canevet
 
Research Objects: more than the sum of the parts
Research Objects: more than the sum of the partsResearch Objects: more than the sum of the parts
Research Objects: more than the sum of the partsCarole Goble
 

What's hot (20)

Ai in drug design webinar 26 feb 2019
Ai in drug design webinar 26 feb 2019Ai in drug design webinar 26 feb 2019
Ai in drug design webinar 26 feb 2019
 
Why study Data Sharing? (+ why share your data)
Why study Data Sharing?  (+ why share your data)Why study Data Sharing?  (+ why share your data)
Why study Data Sharing? (+ why share your data)
 
ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...
ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...
ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...
 
THOR Workshop - Introduction
THOR Workshop - IntroductionTHOR Workshop - Introduction
THOR Workshop - Introduction
 
Semantic Web & Web 3.0 empowering real world outcomes in biomedical research ...
Semantic Web & Web 3.0 empowering real world outcomes in biomedical research ...Semantic Web & Web 3.0 empowering real world outcomes in biomedical research ...
Semantic Web & Web 3.0 empowering real world outcomes in biomedical research ...
 
Bioinformatics in the Era of Open Science and Big Data
Bioinformatics in the Era of Open Science and Big DataBioinformatics in the Era of Open Science and Big Data
Bioinformatics in the Era of Open Science and Big Data
 
Combining Explicit and Latent Web Semantics for Maintaining Knowledge Graphs
Combining Explicit and Latent Web Semantics for Maintaining Knowledge GraphsCombining Explicit and Latent Web Semantics for Maintaining Knowledge Graphs
Combining Explicit and Latent Web Semantics for Maintaining Knowledge Graphs
 
THOR Workshop - Data Publishing Elsevier
THOR Workshop - Data Publishing ElsevierTHOR Workshop - Data Publishing Elsevier
THOR Workshop - Data Publishing Elsevier
 
Scott Edmunds: GigaScience Datacite meeting Rapid Fire Talk
Scott Edmunds: GigaScience Datacite meeting Rapid Fire TalkScott Edmunds: GigaScience Datacite meeting Rapid Fire Talk
Scott Edmunds: GigaScience Datacite meeting Rapid Fire Talk
 
Mining 'Bigger' Datasets to Create, Validate and Share Machine Learning Models
Mining 'Bigger' Datasets to Create, Validate and Share Machine Learning ModelsMining 'Bigger' Datasets to Create, Validate and Share Machine Learning Models
Mining 'Bigger' Datasets to Create, Validate and Share Machine Learning Models
 
Working with Quertle
Working with QuertleWorking with Quertle
Working with Quertle
 
Globus Genomics: Democratizing NGS Analysis
Globus Genomics: Democratizing NGS AnalysisGlobus Genomics: Democratizing NGS Analysis
Globus Genomics: Democratizing NGS Analysis
 
SEEK for Science: A Data and Model Management Platform to support Open and Re...
SEEK for Science: A Data and Model Management Platform to support Open and Re...SEEK for Science: A Data and Model Management Platform to support Open and Re...
SEEK for Science: A Data and Model Management Platform to support Open and Re...
 
RARE and FAIR Science: Reproducibility and Research Objects
RARE and FAIR Science: Reproducibility and Research ObjectsRARE and FAIR Science: Reproducibility and Research Objects
RARE and FAIR Science: Reproducibility and Research Objects
 
OpenTox - an open community and framework supporting predictive toxicology an...
OpenTox - an open community and framework supporting predictive toxicology an...OpenTox - an open community and framework supporting predictive toxicology an...
OpenTox - an open community and framework supporting predictive toxicology an...
 
THOR Workshop - Services PANGAEA
THOR Workshop - Services PANGAEATHOR Workshop - Services PANGAEA
THOR Workshop - Services PANGAEA
 
P4 c2011 slides ekins
P4 c2011 slides ekinsP4 c2011 slides ekins
P4 c2011 slides ekins
 
Being Reproducible: SSBSS Summer School 2017
Being Reproducible: SSBSS Summer School 2017Being Reproducible: SSBSS Summer School 2017
Being Reproducible: SSBSS Summer School 2017
 
Enhancing Data Integration with Text Analysis to Find Genes Implicated in Pla...
Enhancing Data Integration with Text Analysis to Find Genes Implicated in Pla...Enhancing Data Integration with Text Analysis to Find Genes Implicated in Pla...
Enhancing Data Integration with Text Analysis to Find Genes Implicated in Pla...
 
Research Objects: more than the sum of the parts
Research Objects: more than the sum of the partsResearch Objects: more than the sum of the parts
Research Objects: more than the sum of the parts
 

Viewers also liked

Borang transit mt thn 3
Borang transit mt thn 3Borang transit mt thn 3
Borang transit mt thn 3syafik20148
 
Borang transit mt thn 2
Borang transit mt thn 2Borang transit mt thn 2
Borang transit mt thn 2syafik20148
 
New Opt Rules Presentation 2009
New Opt Rules Presentation 2009New Opt Rules Presentation 2009
New Opt Rules Presentation 2009PolymerPhD Elif
 
Biref Introduction to OpenMP
Biref Introduction to OpenMPBiref Introduction to OpenMP
Biref Introduction to OpenMPJerryHe
 
openSNP @ Geekend Darmstadt
openSNP @ Geekend DarmstadtopenSNP @ Geekend Darmstadt
openSNP @ Geekend DarmstadtBastian Greshake
 

Viewers also liked (6)

Borang transit mt thn 3
Borang transit mt thn 3Borang transit mt thn 3
Borang transit mt thn 3
 
Borang transit mt thn 2
Borang transit mt thn 2Borang transit mt thn 2
Borang transit mt thn 2
 
New Opt Rules Presentation 2009
New Opt Rules Presentation 2009New Opt Rules Presentation 2009
New Opt Rules Presentation 2009
 
Biref Introduction to OpenMP
Biref Introduction to OpenMPBiref Introduction to OpenMP
Biref Introduction to OpenMP
 
Donatos Final
Donatos FinalDonatos Final
Donatos Final
 
openSNP @ Geekend Darmstadt
openSNP @ Geekend DarmstadtopenSNP @ Geekend Darmstadt
openSNP @ Geekend Darmstadt
 

Similar to openSNP - Crowdsourcing Genome Wide Association Studies

The Big Picture: The Industrial Revolutiona talk in berlin, 2008, about indus...
The Big Picture: The Industrial Revolutiona talk in berlin, 2008, about indus...The Big Picture: The Industrial Revolutiona talk in berlin, 2008, about indus...
The Big Picture: The Industrial Revolutiona talk in berlin, 2008, about indus...robertstevens65
 
Genome sharing projects around the world nijmegen oct 29 - 2015
Genome sharing projects around the world   nijmegen oct 29 - 2015Genome sharing projects around the world   nijmegen oct 29 - 2015
Genome sharing projects around the world nijmegen oct 29 - 2015Fiona Nielsen
 
Amia tb-review-08
Amia tb-review-08Amia tb-review-08
Amia tb-review-08Russ Altman
 
Docker in Open Science Data Analysis Challenges by Bruce Hoff
Docker in Open Science Data Analysis Challenges by Bruce HoffDocker in Open Science Data Analysis Challenges by Bruce Hoff
Docker in Open Science Data Analysis Challenges by Bruce HoffDocker, Inc.
 
2016 09 cxo forum
2016 09 cxo forum2016 09 cxo forum
2016 09 cxo forumChris Dwan
 
Clinical trial data wants to be free: Lessons from the ImmPort Immunology Dat...
Clinical trial data wants to be free: Lessons from the ImmPort Immunology Dat...Clinical trial data wants to be free: Lessons from the ImmPort Immunology Dat...
Clinical trial data wants to be free: Lessons from the ImmPort Immunology Dat...Barry Smith
 
Workshop finding and accessing data - fiona - lunteren april 18 2016
Workshop   finding and accessing data - fiona - lunteren april 18 2016Workshop   finding and accessing data - fiona - lunteren april 18 2016
Workshop finding and accessing data - fiona - lunteren april 18 2016Fiona Nielsen
 
The Uneven Future of Evidence-Based Medicine
The Uneven Future of Evidence-Based MedicineThe Uneven Future of Evidence-Based Medicine
The Uneven Future of Evidence-Based MedicineIda Sim
 
Advanced Bioinformatics for Genomics and BioData Driven Research
Advanced Bioinformatics for Genomics and BioData Driven ResearchAdvanced Bioinformatics for Genomics and BioData Driven Research
Advanced Bioinformatics for Genomics and BioData Driven ResearchEuropean Bioinformatics Institute
 
The Human Variome Database in Australia in 2014 - Graham Taylor
The Human Variome Database in Australia in 2014 - Graham TaylorThe Human Variome Database in Australia in 2014 - Graham Taylor
The Human Variome Database in Australia in 2014 - Graham TaylorHuman Variome Project
 
Towards automated phenotypic cell profiling with high-content imaging
Towards automated phenotypic cell profiling with high-content imagingTowards automated phenotypic cell profiling with high-content imaging
Towards automated phenotypic cell profiling with high-content imagingOla Spjuth
 
Will Biomedical Research Fundamentally Change in the Era of Big Data?
Will Biomedical Research Fundamentally Change in the Era of Big Data?Will Biomedical Research Fundamentally Change in the Era of Big Data?
Will Biomedical Research Fundamentally Change in the Era of Big Data?Philip Bourne
 
Jillian ms defense-4-14-14-ja
Jillian ms defense-4-14-14-jaJillian ms defense-4-14-14-ja
Jillian ms defense-4-14-14-jaJillian Aurisano
 
Jonathan Izant AAAS Annual Meeting 2012-02-18
Jonathan Izant AAAS Annual Meeting 2012-02-18Jonathan Izant AAAS Annual Meeting 2012-02-18
Jonathan Izant AAAS Annual Meeting 2012-02-18Sage Base
 
Pathway studio into webinar 052715v1
Pathway studio into webinar 052715v1Pathway studio into webinar 052715v1
Pathway studio into webinar 052715v1Ann-Marie Roche
 
Directions in Open Science
Directions in Open ScienceDirections in Open Science
Directions in Open ScienceMike Travers
 
Data Harmonization for a Molecularly Driven Health System
Data Harmonization for a Molecularly Driven Health SystemData Harmonization for a Molecularly Driven Health System
Data Harmonization for a Molecularly Driven Health SystemWarren Kibbe
 

Similar to openSNP - Crowdsourcing Genome Wide Association Studies (20)

The Big Picture: The Industrial Revolutiona talk in berlin, 2008, about indus...
The Big Picture: The Industrial Revolutiona talk in berlin, 2008, about indus...The Big Picture: The Industrial Revolutiona talk in berlin, 2008, about indus...
The Big Picture: The Industrial Revolutiona talk in berlin, 2008, about indus...
 
Genome sharing projects around the world nijmegen oct 29 - 2015
Genome sharing projects around the world   nijmegen oct 29 - 2015Genome sharing projects around the world   nijmegen oct 29 - 2015
Genome sharing projects around the world nijmegen oct 29 - 2015
 
Amia tb-review-08
Amia tb-review-08Amia tb-review-08
Amia tb-review-08
 
Docker in Open Science Data Analysis Challenges by Bruce Hoff
Docker in Open Science Data Analysis Challenges by Bruce HoffDocker in Open Science Data Analysis Challenges by Bruce Hoff
Docker in Open Science Data Analysis Challenges by Bruce Hoff
 
Dr. Eliot Siegel: Watson and Deep QA Software in Pursuit of Personalized Medi...
Dr. Eliot Siegel: Watson and Deep QA Software in Pursuit of Personalized Medi...Dr. Eliot Siegel: Watson and Deep QA Software in Pursuit of Personalized Medi...
Dr. Eliot Siegel: Watson and Deep QA Software in Pursuit of Personalized Medi...
 
2016 09 cxo forum
2016 09 cxo forum2016 09 cxo forum
2016 09 cxo forum
 
Clinical trial data wants to be free: Lessons from the ImmPort Immunology Dat...
Clinical trial data wants to be free: Lessons from the ImmPort Immunology Dat...Clinical trial data wants to be free: Lessons from the ImmPort Immunology Dat...
Clinical trial data wants to be free: Lessons from the ImmPort Immunology Dat...
 
Workshop finding and accessing data - fiona - lunteren april 18 2016
Workshop   finding and accessing data - fiona - lunteren april 18 2016Workshop   finding and accessing data - fiona - lunteren april 18 2016
Workshop finding and accessing data - fiona - lunteren april 18 2016
 
The Uneven Future of Evidence-Based Medicine
The Uneven Future of Evidence-Based MedicineThe Uneven Future of Evidence-Based Medicine
The Uneven Future of Evidence-Based Medicine
 
Open data genomics_palermo_2017_ver03
Open data genomics_palermo_2017_ver03Open data genomics_palermo_2017_ver03
Open data genomics_palermo_2017_ver03
 
Advanced Bioinformatics for Genomics and BioData Driven Research
Advanced Bioinformatics for Genomics and BioData Driven ResearchAdvanced Bioinformatics for Genomics and BioData Driven Research
Advanced Bioinformatics for Genomics and BioData Driven Research
 
The Human Variome Database in Australia in 2014 - Graham Taylor
The Human Variome Database in Australia in 2014 - Graham TaylorThe Human Variome Database in Australia in 2014 - Graham Taylor
The Human Variome Database in Australia in 2014 - Graham Taylor
 
Towards automated phenotypic cell profiling with high-content imaging
Towards automated phenotypic cell profiling with high-content imagingTowards automated phenotypic cell profiling with high-content imaging
Towards automated phenotypic cell profiling with high-content imaging
 
Will Biomedical Research Fundamentally Change in the Era of Big Data?
Will Biomedical Research Fundamentally Change in the Era of Big Data?Will Biomedical Research Fundamentally Change in the Era of Big Data?
Will Biomedical Research Fundamentally Change in the Era of Big Data?
 
Jillian ms defense-4-14-14-ja
Jillian ms defense-4-14-14-jaJillian ms defense-4-14-14-ja
Jillian ms defense-4-14-14-ja
 
Jonathan Izant AAAS Annual Meeting 2012-02-18
Jonathan Izant AAAS Annual Meeting 2012-02-18Jonathan Izant AAAS Annual Meeting 2012-02-18
Jonathan Izant AAAS Annual Meeting 2012-02-18
 
Pathway studio into webinar 052715v1
Pathway studio into webinar 052715v1Pathway studio into webinar 052715v1
Pathway studio into webinar 052715v1
 
Directions in Open Science
Directions in Open ScienceDirections in Open Science
Directions in Open Science
 
SLAS Screen Design and Assay Technology SIG: SLAS2013 Presentation
SLAS Screen Design and Assay Technology SIG: SLAS2013 PresentationSLAS Screen Design and Assay Technology SIG: SLAS2013 Presentation
SLAS Screen Design and Assay Technology SIG: SLAS2013 Presentation
 
Data Harmonization for a Molecularly Driven Health System
Data Harmonization for a Molecularly Driven Health SystemData Harmonization for a Molecularly Driven Health System
Data Harmonization for a Molecularly Driven Health System
 

More from Bastian Greshake

2020 03-11-open-life-sciences
2020 03-11-open-life-sciences2020 03-11-open-life-sciences
2020 03-11-open-life-sciencesBastian Greshake
 
Crowdsourcing the Analysis of Genomes
Crowdsourcing the Analysis of GenomesCrowdsourcing the Analysis of Genomes
Crowdsourcing the Analysis of GenomesBastian Greshake
 
openSNP - QS Cologne Meetup
openSNP - QS Cologne MeetupopenSNP - QS Cologne Meetup
openSNP - QS Cologne MeetupBastian Greshake
 
On your Libido, Eye Colo(u)r and Broccoli Preferences: Problems with Self-Rep...
On your Libido, Eye Colo(u)r and Broccoli Preferences: Problems with Self-Rep...On your Libido, Eye Colo(u)r and Broccoli Preferences: Problems with Self-Rep...
On your Libido, Eye Colo(u)r and Broccoli Preferences: Problems with Self-Rep...Bastian Greshake
 
Was die Post-Genomics-Ära für die Privatssphäre bedeutet
Was die Post-Genomics-Ära für die Privatssphäre bedeutetWas die Post-Genomics-Ära für die Privatssphäre bedeutet
Was die Post-Genomics-Ära für die Privatssphäre bedeutetBastian Greshake
 
PiratenMS - Google Street View
PiratenMS - Google Street ViewPiratenMS - Google Street View
PiratenMS - Google Street ViewBastian Greshake
 
Next Generation Sequencing & Transcriptome Analysis
Next Generation Sequencing & Transcriptome AnalysisNext Generation Sequencing & Transcriptome Analysis
Next Generation Sequencing & Transcriptome AnalysisBastian Greshake
 
Medienkompetenz in Sozialen Netzwerken
Medienkompetenz in Sozialen NetzwerkenMedienkompetenz in Sozialen Netzwerken
Medienkompetenz in Sozialen NetzwerkenBastian Greshake
 
Denkt denn keiner an die Kernthemen?
Denkt denn keiner an die Kernthemen?Denkt denn keiner an die Kernthemen?
Denkt denn keiner an die Kernthemen?Bastian Greshake
 

More from Bastian Greshake (20)

My Life in Lockdown
My Life in LockdownMy Life in Lockdown
My Life in Lockdown
 
2020 03-11-open-life-sciences
2020 03-11-open-life-sciences2020 03-11-open-life-sciences
2020 03-11-open-life-sciences
 
Bended Realities 2015
Bended Realities 2015Bended Realities 2015
Bended Realities 2015
 
Crowdsourcing the Analysis of Genomes
Crowdsourcing the Analysis of GenomesCrowdsourcing the Analysis of Genomes
Crowdsourcing the Analysis of Genomes
 
openSNP - QS Cologne Meetup
openSNP - QS Cologne MeetupopenSNP - QS Cologne Meetup
openSNP - QS Cologne Meetup
 
On your Libido, Eye Colo(u)r and Broccoli Preferences: Problems with Self-Rep...
On your Libido, Eye Colo(u)r and Broccoli Preferences: Problems with Self-Rep...On your Libido, Eye Colo(u)r and Broccoli Preferences: Problems with Self-Rep...
On your Libido, Eye Colo(u)r and Broccoli Preferences: Problems with Self-Rep...
 
Power to the Patient
Power to the PatientPower to the Patient
Power to the Patient
 
The Future of Genetics
The Future of GeneticsThe Future of Genetics
The Future of Genetics
 
Was die Post-Genomics-Ära für die Privatssphäre bedeutet
Was die Post-Genomics-Ära für die Privatssphäre bedeutetWas die Post-Genomics-Ära für die Privatssphäre bedeutet
Was die Post-Genomics-Ära für die Privatssphäre bedeutet
 
Crowdsourcing GWAS
Crowdsourcing GWASCrowdsourcing GWAS
Crowdsourcing GWAS
 
Gentechnik
GentechnikGentechnik
Gentechnik
 
Lernen durch Lehren
Lernen durch LehrenLernen durch Lehren
Lernen durch Lehren
 
Haushalt 2011 Münster
Haushalt 2011 MünsterHaushalt 2011 Münster
Haushalt 2011 Münster
 
Defense of my BSc-Thesis
Defense of my BSc-ThesisDefense of my BSc-Thesis
Defense of my BSc-Thesis
 
LiquidFeedback Workshop
LiquidFeedback WorkshopLiquidFeedback Workshop
LiquidFeedback Workshop
 
PiratenMS - Google Street View
PiratenMS - Google Street ViewPiratenMS - Google Street View
PiratenMS - Google Street View
 
SNPDigger
SNPDiggerSNPDigger
SNPDigger
 
Next Generation Sequencing & Transcriptome Analysis
Next Generation Sequencing & Transcriptome AnalysisNext Generation Sequencing & Transcriptome Analysis
Next Generation Sequencing & Transcriptome Analysis
 
Medienkompetenz in Sozialen Netzwerken
Medienkompetenz in Sozialen NetzwerkenMedienkompetenz in Sozialen Netzwerken
Medienkompetenz in Sozialen Netzwerken
 
Denkt denn keiner an die Kernthemen?
Denkt denn keiner an die Kernthemen?Denkt denn keiner an die Kernthemen?
Denkt denn keiner an die Kernthemen?
 

Recently uploaded

BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdfSoniaTolstoy
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationnomboosow
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdfssuser54595a
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Sapana Sha
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docxPoojaSen20
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3JemimahLaneBuaron
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Educationpboyjonauth
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application ) Sakshi Ghasle
 
URLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppURLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppCeline George
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdfQucHHunhnh
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxSayali Powar
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfsanyamsingh5019
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfchloefrazer622
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introductionMaksud Ahmed
 
Separation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesSeparation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesFatimaKhan178732
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptxVS Mahajan Coaching Centre
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpinRaunakKeshri1
 

Recently uploaded (20)

Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communication
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docx
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Education
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application )
 
URLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppURLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website App
 
Staff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSDStaff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSD
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdf
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
Separation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesSeparation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and Actinides
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpin
 

openSNP - Crowdsourcing Genome Wide Association Studies

  • 1. Crowdsourcing Genome 23.01.12, Bastian Greshake Wide Association Studies
  • 2. some words about me • BSc in Life Sciences (2010) • Working at Biodiversity & Climate Research Center (since 2010) • MSc studies at the Goethe University in Frankfurt/Main (since 2011) • Not exactly a biologist with much professional background in human genetics, but...
  • 3. some words about me • some background in data mining (mainly transcriptomics) • some experience with web applications • interest in social media & crowd-sourcing • customer of DTC genetic testing myself
  • 5. mining DTC genetic tests • results are hidden somewhere on the web • often no phenotypic annotation • not easily re-usable
  • 6. let’s code it: • wants to be a central repository for sharing DTC results • enables users to share phenotypes as well • lowers barrier to participate • motivation to share through benefits for users • can we take it a step further and provide data for GWAS?
  • 7. mining DTC genetic tests • lots of potential for open data (100k+ customers) • cheap data source for scientists Would you share DTC test results? (n=226) 6 % 26 % 68 % Yes Only with DTC company No
  • 9.
  • 10.
  • 11.
  • 12. technical implementation • framework: Ruby on Rails • database: PostgreSQL • task management via resque (known of GitHub) • basic API via JSON-queries
  • 13.
  • 14. other resources • Personal Genome Project • data is open • participation not
  • 15. Personal Genome Project
  • 16. other resources • Personal Genome Project • data is open • participation not • no easy way to download data, no API etc. • genomera • participation will be open (currently invited beta) • focus on small scale studies/experiments
  • 18. problems & potential of patient driven/crowd- sourced research • problems • sample sizes • bias in participants • motivation of participants • accuracy of data • potential • possible sample sizes • low costs • "warm fuzzy feeling inside" for patients
  • 19. positive examples: PatientsLikeMe • around since ~2006 • published a dozen studies since then • famous example: ALS research on lithium carbonate intake (149 patients, 447 controls) Paul Wicks et al. (2011) Accelerated clinical discovery using self-reported patient data collected online and a patient-matching algorithm, Nature Biotechnology 29, 411–414
  • 20. positive examples: 23andMe • published some studies in 2010/2011 • done with self-reported data • studies include 10.000+ to 30.000+ participants
  • 21. positive examples: 23andMe – general traits “ Replications of associations [...] for hair color, eye color, and freckling validate the Web-based, self-reporting paradigm. The identification of novel associations for hair morphology [...], freckling [...], the ability to smell the methanethiol produced after eating asparagus [...], and photic sneeze reflex [...] illustrates the power of the approach. Nicolas Eriksson et al. (2010) Web-Based, Participant-Driven Studies Yield Novel Genetic Associations for Common Traits. PLoS Genet 6(6): e1000993. doi:10.1371/ journal.pgen.1000993
  • 22. positive examples: 23andMe – Parkinson’s Disease “ We discovered two novel, genome-wide significant associations with [Parkinson’s Disease]—both replicated in an independent cohort. We also replicated 20 previously discovered genetic associations (including LRRK2, GBA, SNCA, MAPT, GAK, and the HLA region), providing support for our novel study design. Chuong B. Do et al. (2011) Web-Based Genome-Wide Association Study Identifies Two Novel Loci and a Substantial Genetic Component for Parkinson's Disease. PLoS Genet 7(6): e1002141. doi:10.1371/journal.pgen.1002141
  • 25. QS projects • tracking health in response to work-outs (minimizing impacts of disease/genetic predisposition) • track response to different drugs • tracking well-being in response to eating habits (butter vs arithmetics)
  • 26. butter vs arithmetics source: Seth Roberts - quantifiedself.com
  • 27. my conclusions • technology enables new kinds of research • DTC results and patient driven research can lead to new scientific knowledge • can be a valuable addition to traditional research
  • 28. openSNP: now & future • won the Mendeley/PLoS Binary Battle in 2011 • got some funding of the German WikiMedia foundation to get more people genotyped • collaborating with consent to research to get IRB approved consent-process • working on implementing the Distributed Annotation System
  • 29. thanks for your attention source: xkcd.com CC-BY-NC