SlideShare a Scribd company logo
1 of 12
Download to read offline
EPFL/EDCB Ph.D. Candidate
Presentation
Jérémie KALFON,
ECE paris,
University of Kent
jkobject.com, linkedin.com/jkobject, github.com/jkobject, @jkobject
CaImAn: Calcium Imaging Analysis
1. A Giovannucci, J Friedrich, P Gunn, J Kalfon, et. al. “CaImAn: An open source tool for scalable Calcium Imaging data
Analysis”, eLife
CaImAn: Calcium Imaging Analysis
1. A Giovannucci, J Friedrich, P Gunn, J Kalfon, et. al. “CaImAn: An open source tool for scalable Calcium Imaging data
Analysis”, eLife
PyCUB: Hidden Patterns of the Codon Usage Bias
1. J Kalfon, “PyCUB: A machine exploration of the Codon Usage Bias”, University of Kent.
2. Y Deng, J Kalfon, et. al., “Hidden pattersn of the Codon Usage Bias”, Nature Communication, in review
● GC content
● tRNA pool
● replication speed
● environment temperature
● nitrogen availability
● biased random mutations
PyCUB: Hidden Patterns of the Codon Usage Bias
1. J Kalfon, “PyCUB: A machine exploration of the Codon Usage Bias”, University of Kent.
2. Y Deng, J Kalfon, et. al., “Hidden pattersn of the Codon Usage Bias”, Nature Communication, in review
● frequency measures
● deviation-to-reference
measures
● entropy measures
PyCUB: Methods
● 500 species from
ensembl, python
pipeline, scikit learn...
● Vector comparison
● Preprocessing (wide
range of measures ~20)
PyCUB: Methods
● Entropy → Force driving the CUB
● DBscan to cluster with outliers
● t-SNE & PCA to represent the data
● modelisation of the process
Results
❖ Specific distribution by
species groups
❖ Importance sequence’s
age
❖ Correlation to
sequence’s position.
❖ multiplicity of latent
factors
❖ Most Species have
specific CUBs
1. J Kalfon, “PyCUB: A machine exploration of the Codon Usage Bias”, University of Kent.
2. Y Deng, J Kalfon, et. al., “Hidden pattersn of the Codon Usage Bias”, Nature Communication, in review
Results
❖ Consistent results
❖ A python package to
analyse the CUB across
species
❖ A new measure of the
CUB with a fast
computation time.
1. J Kalfon, “PyCUB: A machine exploration of the Codon Usage Bias”, University of Kent.
2. Y Deng, J Kalfon, et. al., “Hidden pattersn of the Codon Usage Bias”, Nature Communication, in review
Conclusion
❖ Not one determinant of the
CUB
❖ The entropy measure is a
suitable one
❖ There is specific distribution
across genes (SLS).
Future research and ideas:
Using more big data specific
approach to analyze other/richer
kingdoms.
Remarks:
➔ The data was displaying a lot of
improbable sequences,
homologies, etc…
➔ t-SNE allowed to see clearly
driving mechanisms
1. J Kalfon, “PyCUB: A machine exploration of the Codon Usage Bias”, University of Kent.
2. Y Deng, J Kalfon, et. al., “Hidden pattersn of the Codon Usage Bias”, Nature Communication, in review
The things I loved
● Machine Learning / Data Science
● genomics / multi-omics & visual
data
● understand and model how cells
work.
● translational applications in
biomedicine
● working with teams, freedom to
explore and create
Computer Science + Biology = <3
1. see: statement of research objectives (jkobject.com)
2. VCF2ancestry, github/jkobject
Thank you!
goals > topics
🎉 reproducible
research

More Related Content

Similar to Epfl edcb ph.d. candidate presentation

JulieKlein_Bosc2012
JulieKlein_Bosc2012JulieKlein_Bosc2012
JulieKlein_Bosc2012
KUPKB_Team
 
Biophysics 2016_Kayla Washenberger
Biophysics 2016_Kayla WashenbergerBiophysics 2016_Kayla Washenberger
Biophysics 2016_Kayla Washenberger
Kayla Washenberger
 
Hail: SCALING GENETIC DATA ANALYSIS WITH APACHE SPARK: Keynote by Cotton Seed
Hail: SCALING GENETIC DATA ANALYSIS WITH APACHE SPARK: Keynote by Cotton SeedHail: SCALING GENETIC DATA ANALYSIS WITH APACHE SPARK: Keynote by Cotton Seed
Hail: SCALING GENETIC DATA ANALYSIS WITH APACHE SPARK: Keynote by Cotton Seed
Spark Summit
 

Similar to Epfl edcb ph.d. candidate presentation (12)

J Klein - KUPKB: sharing, connecting and exposing kidney and urinary knowledg...
J Klein - KUPKB: sharing, connecting and exposing kidney and urinary knowledg...J Klein - KUPKB: sharing, connecting and exposing kidney and urinary knowledg...
J Klein - KUPKB: sharing, connecting and exposing kidney and urinary knowledg...
 
JulieKlein_Bosc2012
JulieKlein_Bosc2012JulieKlein_Bosc2012
JulieKlein_Bosc2012
 
ICSB 2013 - Visits Abroad Report
ICSB 2013 - Visits Abroad ReportICSB 2013 - Visits Abroad Report
ICSB 2013 - Visits Abroad Report
 
The Emerging Global Collaboratory for Microbial Metagenomics Researchers
The Emerging Global Collaboratory for Microbial Metagenomics ResearchersThe Emerging Global Collaboratory for Microbial Metagenomics Researchers
The Emerging Global Collaboratory for Microbial Metagenomics Researchers
 
Curriculum Vitae
Curriculum VitaeCurriculum Vitae
Curriculum Vitae
 
Building a Community Cyberinfrastructure to Support Marine Microbial Ecology ...
Building a Community Cyberinfrastructure to Support Marine Microbial Ecology ...Building a Community Cyberinfrastructure to Support Marine Microbial Ecology ...
Building a Community Cyberinfrastructure to Support Marine Microbial Ecology ...
 
DPHEP_BLUETWO_001
DPHEP_BLUETWO_001DPHEP_BLUETWO_001
DPHEP_BLUETWO_001
 
Biophysics 2016_Kayla Washenberger
Biophysics 2016_Kayla WashenbergerBiophysics 2016_Kayla Washenberger
Biophysics 2016_Kayla Washenberger
 
Jack Gilbert: Welcome to the 1st International EMP Meeting: the first 10,000 ...
Jack Gilbert: Welcome to the 1st International EMP Meeting: the first 10,000 ...Jack Gilbert: Welcome to the 1st International EMP Meeting: the first 10,000 ...
Jack Gilbert: Welcome to the 1st International EMP Meeting: the first 10,000 ...
 
The Emerging Global Community of Microbial Metagenomics Researchers
The Emerging Global Community of Microbial Metagenomics ResearchersThe Emerging Global Community of Microbial Metagenomics Researchers
The Emerging Global Community of Microbial Metagenomics Researchers
 
Hail: SCALING GENETIC DATA ANALYSIS WITH APACHE SPARK: Keynote by Cotton Seed
Hail: SCALING GENETIC DATA ANALYSIS WITH APACHE SPARK: Keynote by Cotton SeedHail: SCALING GENETIC DATA ANALYSIS WITH APACHE SPARK: Keynote by Cotton Seed
Hail: SCALING GENETIC DATA ANALYSIS WITH APACHE SPARK: Keynote by Cotton Seed
 
Statistics for K-mer Based Splicing Analysis
Statistics for K-mer Based Splicing AnalysisStatistics for K-mer Based Splicing Analysis
Statistics for K-mer Based Splicing Analysis
 

Recently uploaded

LUNULARIA -features, morphology, anatomy ,reproduction etc.
LUNULARIA -features, morphology, anatomy ,reproduction etc.LUNULARIA -features, morphology, anatomy ,reproduction etc.
LUNULARIA -features, morphology, anatomy ,reproduction etc.
Silpa
 
Human genetics..........................pptx
Human genetics..........................pptxHuman genetics..........................pptx
Human genetics..........................pptx
Silpa
 
The Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptxThe Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptx
seri bangash
 
Porella : features, morphology, anatomy, reproduction etc.
Porella : features, morphology, anatomy, reproduction etc.Porella : features, morphology, anatomy, reproduction etc.
Porella : features, morphology, anatomy, reproduction etc.
Silpa
 
Phenolics: types, biosynthesis and functions.
Phenolics: types, biosynthesis and functions.Phenolics: types, biosynthesis and functions.
Phenolics: types, biosynthesis and functions.
Silpa
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
Scintica Instrumentation
 
Cyathodium bryophyte: morphology, anatomy, reproduction etc.
Cyathodium bryophyte: morphology, anatomy, reproduction etc.Cyathodium bryophyte: morphology, anatomy, reproduction etc.
Cyathodium bryophyte: morphology, anatomy, reproduction etc.
Silpa
 
development of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virusdevelopment of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virus
NazaninKarimi6
 
POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.
Silpa
 
THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptx
THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptxTHE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptx
THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptx
ANSARKHAN96
 

Recently uploaded (20)

PSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptxPSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptx
 
LUNULARIA -features, morphology, anatomy ,reproduction etc.
LUNULARIA -features, morphology, anatomy ,reproduction etc.LUNULARIA -features, morphology, anatomy ,reproduction etc.
LUNULARIA -features, morphology, anatomy ,reproduction etc.
 
Genetics and epigenetics of ADHD and comorbid conditions
Genetics and epigenetics of ADHD and comorbid conditionsGenetics and epigenetics of ADHD and comorbid conditions
Genetics and epigenetics of ADHD and comorbid conditions
 
Human genetics..........................pptx
Human genetics..........................pptxHuman genetics..........................pptx
Human genetics..........................pptx
 
The Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptxThe Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptx
 
Porella : features, morphology, anatomy, reproduction etc.
Porella : features, morphology, anatomy, reproduction etc.Porella : features, morphology, anatomy, reproduction etc.
Porella : features, morphology, anatomy, reproduction etc.
 
Dr. E. Muralinath_ Blood indices_clinical aspects
Dr. E. Muralinath_ Blood indices_clinical  aspectsDr. E. Muralinath_ Blood indices_clinical  aspects
Dr. E. Muralinath_ Blood indices_clinical aspects
 
Phenolics: types, biosynthesis and functions.
Phenolics: types, biosynthesis and functions.Phenolics: types, biosynthesis and functions.
Phenolics: types, biosynthesis and functions.
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Use of mutants in understanding seedling development.pptx
Use of mutants in understanding seedling development.pptxUse of mutants in understanding seedling development.pptx
Use of mutants in understanding seedling development.pptx
 
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
 
Atp synthase , Atp synthase complex 1 to 4.
Atp synthase , Atp synthase complex 1 to 4.Atp synthase , Atp synthase complex 1 to 4.
Atp synthase , Atp synthase complex 1 to 4.
 
Selaginella: features, morphology ,anatomy and reproduction.
Selaginella: features, morphology ,anatomy and reproduction.Selaginella: features, morphology ,anatomy and reproduction.
Selaginella: features, morphology ,anatomy and reproduction.
 
Cyathodium bryophyte: morphology, anatomy, reproduction etc.
Cyathodium bryophyte: morphology, anatomy, reproduction etc.Cyathodium bryophyte: morphology, anatomy, reproduction etc.
Cyathodium bryophyte: morphology, anatomy, reproduction etc.
 
Molecular markers- RFLP, RAPD, AFLP, SNP etc.
Molecular markers- RFLP, RAPD, AFLP, SNP etc.Molecular markers- RFLP, RAPD, AFLP, SNP etc.
Molecular markers- RFLP, RAPD, AFLP, SNP etc.
 
development of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virusdevelopment of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virus
 
Factory Acceptance Test( FAT).pptx .
Factory Acceptance Test( FAT).pptx       .Factory Acceptance Test( FAT).pptx       .
Factory Acceptance Test( FAT).pptx .
 
POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.
 
Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.
 
THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptx
THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptxTHE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptx
THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptx
 

Epfl edcb ph.d. candidate presentation

  • 1. EPFL/EDCB Ph.D. Candidate Presentation Jérémie KALFON, ECE paris, University of Kent jkobject.com, linkedin.com/jkobject, github.com/jkobject, @jkobject
  • 2. CaImAn: Calcium Imaging Analysis 1. A Giovannucci, J Friedrich, P Gunn, J Kalfon, et. al. “CaImAn: An open source tool for scalable Calcium Imaging data Analysis”, eLife
  • 3. CaImAn: Calcium Imaging Analysis 1. A Giovannucci, J Friedrich, P Gunn, J Kalfon, et. al. “CaImAn: An open source tool for scalable Calcium Imaging data Analysis”, eLife
  • 4. PyCUB: Hidden Patterns of the Codon Usage Bias 1. J Kalfon, “PyCUB: A machine exploration of the Codon Usage Bias”, University of Kent. 2. Y Deng, J Kalfon, et. al., “Hidden pattersn of the Codon Usage Bias”, Nature Communication, in review ● GC content ● tRNA pool ● replication speed ● environment temperature ● nitrogen availability ● biased random mutations
  • 5. PyCUB: Hidden Patterns of the Codon Usage Bias 1. J Kalfon, “PyCUB: A machine exploration of the Codon Usage Bias”, University of Kent. 2. Y Deng, J Kalfon, et. al., “Hidden pattersn of the Codon Usage Bias”, Nature Communication, in review ● frequency measures ● deviation-to-reference measures ● entropy measures
  • 6. PyCUB: Methods ● 500 species from ensembl, python pipeline, scikit learn... ● Vector comparison ● Preprocessing (wide range of measures ~20)
  • 7. PyCUB: Methods ● Entropy → Force driving the CUB ● DBscan to cluster with outliers ● t-SNE & PCA to represent the data ● modelisation of the process
  • 8. Results ❖ Specific distribution by species groups ❖ Importance sequence’s age ❖ Correlation to sequence’s position. ❖ multiplicity of latent factors ❖ Most Species have specific CUBs 1. J Kalfon, “PyCUB: A machine exploration of the Codon Usage Bias”, University of Kent. 2. Y Deng, J Kalfon, et. al., “Hidden pattersn of the Codon Usage Bias”, Nature Communication, in review
  • 9. Results ❖ Consistent results ❖ A python package to analyse the CUB across species ❖ A new measure of the CUB with a fast computation time. 1. J Kalfon, “PyCUB: A machine exploration of the Codon Usage Bias”, University of Kent. 2. Y Deng, J Kalfon, et. al., “Hidden pattersn of the Codon Usage Bias”, Nature Communication, in review
  • 10. Conclusion ❖ Not one determinant of the CUB ❖ The entropy measure is a suitable one ❖ There is specific distribution across genes (SLS). Future research and ideas: Using more big data specific approach to analyze other/richer kingdoms. Remarks: ➔ The data was displaying a lot of improbable sequences, homologies, etc… ➔ t-SNE allowed to see clearly driving mechanisms 1. J Kalfon, “PyCUB: A machine exploration of the Codon Usage Bias”, University of Kent. 2. Y Deng, J Kalfon, et. al., “Hidden pattersn of the Codon Usage Bias”, Nature Communication, in review
  • 11. The things I loved ● Machine Learning / Data Science ● genomics / multi-omics & visual data ● understand and model how cells work. ● translational applications in biomedicine ● working with teams, freedom to explore and create
  • 12. Computer Science + Biology = <3 1. see: statement of research objectives (jkobject.com) 2. VCF2ancestry, github/jkobject Thank you! goals > topics 🎉 reproducible research