SlideShare une entreprise Scribd logo
1  sur  17
Ratio of harmful amino acid
substitutions
Mauno Vihinen
Protein Structure and Bioinformatics Group
Department of Experimental Medical Science
Lund University, Sweden
Focus on variation
Data collection
Method performance
assessment
Systematics
Prediction method
development
Predictors
Mechanisms
How many variations are disease
related?
Current databases don’t provide the answer
Depends on gene/protein/domain/region
Bruton tyrosine kinase (BTK) in X-linked agammaglobulinemia
Amino acid substitutions
Hydrophilic
Hydrophobic Acidic Basic Polar Special
-> A F I L M V W Y D E H K R N Q S T C G P X total
A 0 1 0.6 0.4 0 0 0 0.8 0 2.8
F 0 0 0.4 0.1 0.3 1 0.1 0.1 2.1
I 0.1 0 0 0.1 0 0 0 1 0 0.1 0.6 0 2
L 1.3 0.3 0 0 0.3 0.1 0 0.6 0.1 0.6 3.8 0.6 7.6
M 1.1 0.1 0 0.4 0.4 0.1 2 0 4.2
V 0.4 0.7 0 0 0 0.1 0.4 0.1 0.3 0 0.1 2.2
W 0.1 0.8 0.3 0.3 0 4.1 5.6
Y 0 0 0.7 0.6 0.6 0.8 1.3 4.8 8.7
D 0 0.4 0.1 0 0.1 0.1 0.3 0.3 0 1.4
E 0 0 0.7 0 0.3 0 0.4 1.8 3.2
H 0 0.1 0.1 0 0.4 0 0.1 0.3 0 1.1
K 0.1 0 1.1 0 0.4 0.3 0 0 1.4 3.4
R 0 0.3 0 6.3 4.1 0.6 0.1 4.5 1 0.3 2.5 2 1.1 8.3 31
N 0 0.1 0 0 0.1 0 0 0 0.1 0.4
Q 0 0 0 0.3 0 0.1 0 0.1 6.2 6.7
S 0 0.7 0.1 0.1 0 0.4 0.1 0 0 0 0 0 0.7 1 3.2
T 0.1 0.4 0 0 0 0 0 0 1.1 0 1.7
C 0.7 0.3 1.7 0.4 0.3 0 0.4 0.7 4.5
G 0.1 0.1 0.1 1.1 2.1 1.5 0 0 0.1 0.3 5.6
P 0.3 0.6 0 0 0.3 0 0.7 0.7 0 0 2.5
total 1 3.5 2.1 1.7 0.1 2.5 6.9 2.8 3.6 3.9 5 1.4 5 2.1 4.8 4.8 3.5 4.2 3.5 8 29.5 100
Approach
Predictions
Methods of highest possible performance
Machine learning methods
trained with experimentally verified cases, careful selection, representativeness
Benchmarking
PON-BTK
More than 500 kinases, BTK contains the largest number of disease-
causing variants among them
Dedicated predictor for kinase domain of Bruton
tyrosine kinase
SNAVs, single-nucleotide substitution caused
amino acid variations
Analysis of all amino acid substitutions caused by
single nucleotide change
67% of variations harmful
Väliaho et al. Hum. Mutat. 2015
Experimental studies
PON-MMR
Dedicated predictor for a protein complex, mismatch repair (MMR) system
Lynch syndrome and other gastrointestinal cancers
Collection of data from literature, InSiGHT database
785 amino acid substitutions in InSiGHT database in 5 proteins
Clear functional information for disease relevance only for 168
INSiGHT variation interpretation committee rapport (Thompson et al. Nat. Genet 2014)
Classified 1370 variants out of 2360 investigated
46 of those we had predicted to be either benign or pathogenic
44/46 were correct (96%)
Ali et al. Hum. Mutat. 2012
PON-MMR, structural verification
Structure of MSH2-MSH6 dimer
Structural explanations match for 105 out of 109 variants
PON-MMR2
Novel predictor
for MLH1, MSH2, MSH6, PMS2
Trained with InSiGHT and PON-MMR data
Extensive feature selection and method
training
5 useful features from altogether 624
Accuracy and MCC: 0.84 and 0.70, cross-
validation 0.85 and 0.67 for an independent
test dataset
Niroula and Vihinen, Hum. Mutat. 2015
Ratios of harmful variants
Mitochondrial tRNA
Human cells have two sets of tRNAs - nuclear and mitochondrial
Almost all pathogenic variations in tRNAs are found in mitochondrial
tRNAs
Evidence-based methods have been developed to classify mitochondrial
tRNA variations
Conservation, Biochemical test, histochemical test, heteroplasmy,
segregation, single fibre, trans-mitochondrial cybrid, etc
About 200 variants were classified by Yarham et al.
Niroula and Vihinen, NAR 2016
PON-mt-tRNA
Classification of all possible single nucleotide
substitutions in all 22 human mt-tRNAs
Pathogenic variations are concentrated in the stems
Anticodon loop has the highest frequency of pathogenic
variations among loops
51.0% of all variants predicted to be harmful
42.0% of all variants in yeast arginine tRNACCU (Li et al.
Science 2016)
PON-P2
PON-P2 classifies amino acid substitutions into three classes
benign
pathogenic
unclassified variants (UVs)
Machine learning -based method (random forest)
Features
Physical and biochemical properties of amino acids
Gene Ontology annotations
Evolutionary features
Functional annotation
Niroula et al PLoS ONE 2015
Variants in cancer
Somatic variations in 7,042 cancer genomes/exomes
Alexandrov et al. (Nature 2013)
30 cancer types
~5 million variants
2.63 million in coding region
824,001 amino acid substitutions
Predicted the impact of AASs using PON-P2
14.24% of all variations were predicted harmful
14.71% of COSMIC variation (total 647,872)
39.88% variations in Cancer Gene Census (CGC) genes
Niroula and Vihinen, BMC Med Genet 2015
Summary
Estimates for the ratio of harmful variants
- BTK kinase domain, SNAVs
- MMR system proteins, all variants
- Mitochondrial tRNAs
- Cancer
- 30 cancer types
- COSMIC
- Cancer Gene Census proteins
Highly variable ratios between proteins and domains, and within proteins
Thanks!
Abhishek Gabriel Gerard Jelena Siddhaling Yang
Niroula Teku Schaafsma Calyseva Urolagin Yang
Heidi Preethy Jouni
Ali Nair Väliaho
http://structure.bmc.lu.se
mauno.vihinen@med.lu.se

Contenu connexe

En vedette (7)

Tumor suppressorgenes
Tumor suppressorgenesTumor suppressorgenes
Tumor suppressorgenes
 
Cell signalling
Cell signallingCell signalling
Cell signalling
 
Cell Signalling
Cell SignallingCell Signalling
Cell Signalling
 
Cell signaling
Cell signalingCell signaling
Cell signaling
 
Introduction of cell signaling
Introduction of cell signalingIntroduction of cell signaling
Introduction of cell signaling
 
Second messenger system
Second messenger systemSecond messenger system
Second messenger system
 
Cell signaling
Cell signaling Cell signaling
Cell signaling
 

Plus de Human Variome Project

ClinVar: Aggregating Data to Improve Variant Interpretation - Melissa Landrum
ClinVar: Aggregating Data to Improve Variant Interpretation - Melissa LandrumClinVar: Aggregating Data to Improve Variant Interpretation - Melissa Landrum
ClinVar: Aggregating Data to Improve Variant Interpretation - Melissa Landrum
Human Variome Project
 
Legal and regulatory challenges to data sharing for clinical genetics and ge...
Legal and regulatory challenges to  data sharing for clinical genetics and ge...Legal and regulatory challenges to  data sharing for clinical genetics and ge...
Legal and regulatory challenges to data sharing for clinical genetics and ge...
Human Variome Project
 
Human variome project quality assessment criteria for variation databases - M...
Human variome project quality assessment criteria for variation databases - M...Human variome project quality assessment criteria for variation databases - M...
Human variome project quality assessment criteria for variation databases - M...
Human Variome Project
 
GENETIC HETEROGENEITY OF MITOCHONDRIAL DISORDERS - Agnès Rötig
GENETIC HETEROGENEITY OF MITOCHONDRIAL DISORDERS - Agnès RötigGENETIC HETEROGENEITY OF MITOCHONDRIAL DISORDERS - Agnès Rötig
GENETIC HETEROGENEITY OF MITOCHONDRIAL DISORDERS - Agnès Rötig
Human Variome Project
 
Use of open, curated variant databases: ethics? Liability? - Bartha Knoppers
Use of open, curated variant databases: ethics? Liability? - Bartha KnoppersUse of open, curated variant databases: ethics? Liability? - Bartha Knoppers
Use of open, curated variant databases: ethics? Liability? - Bartha Knoppers
Human Variome Project
 

Plus de Human Variome Project (20)

ClinVar: Aggregating Data to Improve Variant Interpretation - Melissa Landrum
ClinVar: Aggregating Data to Improve Variant Interpretation - Melissa LandrumClinVar: Aggregating Data to Improve Variant Interpretation - Melissa Landrum
ClinVar: Aggregating Data to Improve Variant Interpretation - Melissa Landrum
 
The BRCA Share(TM) Consortium - Christophe Beroud
The BRCA Share(TM) Consortium - Christophe BeroudThe BRCA Share(TM) Consortium - Christophe Beroud
The BRCA Share(TM) Consortium - Christophe Beroud
 
Establishing validity, reproducibility, and utility of highly scalable geneti...
Establishing validity, reproducibility, and utility of highly scalable geneti...Establishing validity, reproducibility, and utility of highly scalable geneti...
Establishing validity, reproducibility, and utility of highly scalable geneti...
 
The PhenX Toolkit: Standard Measures for Collaborative Research - Wayne Huggins
The PhenX Toolkit: Standard Measures for  Collaborative Research - Wayne HugginsThe PhenX Toolkit: Standard Measures for  Collaborative Research - Wayne Huggins
The PhenX Toolkit: Standard Measures for Collaborative Research - Wayne Huggins
 
Legal and regulatory challenges to data sharing for clinical genetics and ge...
Legal and regulatory challenges to  data sharing for clinical genetics and ge...Legal and regulatory challenges to  data sharing for clinical genetics and ge...
Legal and regulatory challenges to data sharing for clinical genetics and ge...
 
Report from the International Confederation of Countries Advisory Council - M...
Report from the International Confederation of Countries Advisory Council - M...Report from the International Confederation of Countries Advisory Council - M...
Report from the International Confederation of Countries Advisory Council - M...
 
Human variome project quality assessment criteria for variation databases - M...
Human variome project quality assessment criteria for variation databases - M...Human variome project quality assessment criteria for variation databases - M...
Human variome project quality assessment criteria for variation databases - M...
 
HVP Country Node: Venezuela - Aida Falcon de Vargas
HVP Country Node: Venezuela - Aida Falcon de VargasHVP Country Node: Venezuela - Aida Falcon de Vargas
HVP Country Node: Venezuela - Aida Falcon de Vargas
 
Human Genetics of Infectious Diseases - Laurent Abel
Human Genetics of Infectious Diseases - Laurent AbelHuman Genetics of Infectious Diseases - Laurent Abel
Human Genetics of Infectious Diseases - Laurent Abel
 
HVP Country Node: Malaysia - Zilfalil bin Alwi
HVP Country Node: Malaysia - Zilfalil bin AlwiHVP Country Node: Malaysia - Zilfalil bin Alwi
HVP Country Node: Malaysia - Zilfalil bin Alwi
 
GENETIC HETEROGENEITY OF MITOCHONDRIAL DISORDERS - Agnès Rötig
GENETIC HETEROGENEITY OF MITOCHONDRIAL DISORDERS - Agnès RötigGENETIC HETEROGENEITY OF MITOCHONDRIAL DISORDERS - Agnès Rötig
GENETIC HETEROGENEITY OF MITOCHONDRIAL DISORDERS - Agnès Rötig
 
The BRCA Challenge & Exchange: Progress and Plans - Gunnar Rätsch
The BRCA Challenge & Exchange: Progress and Plans - Gunnar RätschThe BRCA Challenge & Exchange: Progress and Plans - Gunnar Rätsch
The BRCA Challenge & Exchange: Progress and Plans - Gunnar Rätsch
 
Richard GH Cotton: He may have been a bit before his time - Michael Watson
Richard GH Cotton: He may have been a bit before his time - Michael WatsonRichard GH Cotton: He may have been a bit before his time - Michael Watson
Richard GH Cotton: He may have been a bit before his time - Michael Watson
 
Professor Richard Cotton - Finlay Macrae
Professor Richard Cotton - Finlay MacraeProfessor Richard Cotton - Finlay Macrae
Professor Richard Cotton - Finlay Macrae
 
HVP Country Node: Canada - Matthew Lebo
HVP Country Node: Canada - Matthew LeboHVP Country Node: Canada - Matthew Lebo
HVP Country Node: Canada - Matthew Lebo
 
Use of open, curated variant databases: ethics? Liability? - Bartha Knoppers
Use of open, curated variant databases: ethics? Liability? - Bartha KnoppersUse of open, curated variant databases: ethics? Liability? - Bartha Knoppers
Use of open, curated variant databases: ethics? Liability? - Bartha Knoppers
 
HVP6: Final Thoughts - John Burn & Raj Ramesar
HVP6: Final Thoughts - John Burn & Raj RamesarHVP6: Final Thoughts - John Burn & Raj Ramesar
HVP6: Final Thoughts - John Burn & Raj Ramesar
 
Report from the International Scientific Advisory Committee - John Burn
Report from the International Scientific Advisory Committee - John BurnReport from the International Scientific Advisory Committee - John Burn
Report from the International Scientific Advisory Committee - John Burn
 
HVP Country Node: Italy - Domenico Coviello
HVP Country Node: Italy - Domenico CovielloHVP Country Node: Italy - Domenico Coviello
HVP Country Node: Italy - Domenico Coviello
 
Rare and common variants contribute to the complex inheritance of Hirschsprun...
Rare and common variants contribute to the complex inheritance of Hirschsprun...Rare and common variants contribute to the complex inheritance of Hirschsprun...
Rare and common variants contribute to the complex inheritance of Hirschsprun...
 

Dernier

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Sérgio Sacani
 
Human genetics..........................pptx
Human genetics..........................pptxHuman genetics..........................pptx
Human genetics..........................pptx
Silpa
 
Bacterial Identification and Classifications
Bacterial Identification and ClassificationsBacterial Identification and Classifications
Bacterial Identification and Classifications
Areesha Ahmad
 
Module for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learningModule for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learning
levieagacer
 
development of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virusdevelopment of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virus
NazaninKarimi6
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Sérgio Sacani
 
The Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptxThe Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptx
seri bangash
 

Dernier (20)

Grade 7 - Lesson 1 - Microscope and Its Functions
Grade 7 - Lesson 1 - Microscope and Its FunctionsGrade 7 - Lesson 1 - Microscope and Its Functions
Grade 7 - Lesson 1 - Microscope and Its Functions
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
 
PATNA CALL GIRLS 8617370543 LOW PRICE ESCORT SERVICE
PATNA CALL GIRLS 8617370543 LOW PRICE ESCORT SERVICEPATNA CALL GIRLS 8617370543 LOW PRICE ESCORT SERVICE
PATNA CALL GIRLS 8617370543 LOW PRICE ESCORT SERVICE
 
Human genetics..........................pptx
Human genetics..........................pptxHuman genetics..........................pptx
Human genetics..........................pptx
 
COMPUTING ANTI-DERIVATIVES (Integration by SUBSTITUTION)
COMPUTING ANTI-DERIVATIVES(Integration by SUBSTITUTION)COMPUTING ANTI-DERIVATIVES(Integration by SUBSTITUTION)
COMPUTING ANTI-DERIVATIVES (Integration by SUBSTITUTION)
 
Use of mutants in understanding seedling development.pptx
Use of mutants in understanding seedling development.pptxUse of mutants in understanding seedling development.pptx
Use of mutants in understanding seedling development.pptx
 
Bacterial Identification and Classifications
Bacterial Identification and ClassificationsBacterial Identification and Classifications
Bacterial Identification and Classifications
 
Module for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learningModule for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learning
 
PSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptxPSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptx
 
development of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virusdevelopment of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virus
 
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryFAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
 
Chemistry 5th semester paper 1st Notes.pdf
Chemistry 5th semester paper 1st Notes.pdfChemistry 5th semester paper 1st Notes.pdf
Chemistry 5th semester paper 1st Notes.pdf
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
 
The Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptxThe Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptx
 
Call Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort ServiceCall Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort Service
 
Factory Acceptance Test( FAT).pptx .
Factory Acceptance Test( FAT).pptx       .Factory Acceptance Test( FAT).pptx       .
Factory Acceptance Test( FAT).pptx .
 
Thyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate ProfessorThyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate Professor
 
GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)
 
Exploring Criminology and Criminal Behaviour.pdf
Exploring Criminology and Criminal Behaviour.pdfExploring Criminology and Criminal Behaviour.pdf
Exploring Criminology and Criminal Behaviour.pdf
 

Ratio of harmful amino acid substitutions: Case studies of BTK and MMR system - Mauno Vihinen

  • 1. Ratio of harmful amino acid substitutions Mauno Vihinen Protein Structure and Bioinformatics Group Department of Experimental Medical Science Lund University, Sweden
  • 2. Focus on variation Data collection Method performance assessment Systematics Prediction method development Predictors Mechanisms
  • 3. How many variations are disease related? Current databases don’t provide the answer Depends on gene/protein/domain/region Bruton tyrosine kinase (BTK) in X-linked agammaglobulinemia
  • 4. Amino acid substitutions Hydrophilic Hydrophobic Acidic Basic Polar Special -> A F I L M V W Y D E H K R N Q S T C G P X total A 0 1 0.6 0.4 0 0 0 0.8 0 2.8 F 0 0 0.4 0.1 0.3 1 0.1 0.1 2.1 I 0.1 0 0 0.1 0 0 0 1 0 0.1 0.6 0 2 L 1.3 0.3 0 0 0.3 0.1 0 0.6 0.1 0.6 3.8 0.6 7.6 M 1.1 0.1 0 0.4 0.4 0.1 2 0 4.2 V 0.4 0.7 0 0 0 0.1 0.4 0.1 0.3 0 0.1 2.2 W 0.1 0.8 0.3 0.3 0 4.1 5.6 Y 0 0 0.7 0.6 0.6 0.8 1.3 4.8 8.7 D 0 0.4 0.1 0 0.1 0.1 0.3 0.3 0 1.4 E 0 0 0.7 0 0.3 0 0.4 1.8 3.2 H 0 0.1 0.1 0 0.4 0 0.1 0.3 0 1.1 K 0.1 0 1.1 0 0.4 0.3 0 0 1.4 3.4 R 0 0.3 0 6.3 4.1 0.6 0.1 4.5 1 0.3 2.5 2 1.1 8.3 31 N 0 0.1 0 0 0.1 0 0 0 0.1 0.4 Q 0 0 0 0.3 0 0.1 0 0.1 6.2 6.7 S 0 0.7 0.1 0.1 0 0.4 0.1 0 0 0 0 0 0.7 1 3.2 T 0.1 0.4 0 0 0 0 0 0 1.1 0 1.7 C 0.7 0.3 1.7 0.4 0.3 0 0.4 0.7 4.5 G 0.1 0.1 0.1 1.1 2.1 1.5 0 0 0.1 0.3 5.6 P 0.3 0.6 0 0 0.3 0 0.7 0.7 0 0 2.5 total 1 3.5 2.1 1.7 0.1 2.5 6.9 2.8 3.6 3.9 5 1.4 5 2.1 4.8 4.8 3.5 4.2 3.5 8 29.5 100
  • 5. Approach Predictions Methods of highest possible performance Machine learning methods trained with experimentally verified cases, careful selection, representativeness Benchmarking
  • 6. PON-BTK More than 500 kinases, BTK contains the largest number of disease- causing variants among them Dedicated predictor for kinase domain of Bruton tyrosine kinase SNAVs, single-nucleotide substitution caused amino acid variations Analysis of all amino acid substitutions caused by single nucleotide change 67% of variations harmful Väliaho et al. Hum. Mutat. 2015
  • 8. PON-MMR Dedicated predictor for a protein complex, mismatch repair (MMR) system Lynch syndrome and other gastrointestinal cancers Collection of data from literature, InSiGHT database 785 amino acid substitutions in InSiGHT database in 5 proteins Clear functional information for disease relevance only for 168 INSiGHT variation interpretation committee rapport (Thompson et al. Nat. Genet 2014) Classified 1370 variants out of 2360 investigated 46 of those we had predicted to be either benign or pathogenic 44/46 were correct (96%) Ali et al. Hum. Mutat. 2012
  • 9. PON-MMR, structural verification Structure of MSH2-MSH6 dimer Structural explanations match for 105 out of 109 variants
  • 10. PON-MMR2 Novel predictor for MLH1, MSH2, MSH6, PMS2 Trained with InSiGHT and PON-MMR data Extensive feature selection and method training 5 useful features from altogether 624 Accuracy and MCC: 0.84 and 0.70, cross- validation 0.85 and 0.67 for an independent test dataset Niroula and Vihinen, Hum. Mutat. 2015
  • 11. Ratios of harmful variants
  • 12. Mitochondrial tRNA Human cells have two sets of tRNAs - nuclear and mitochondrial Almost all pathogenic variations in tRNAs are found in mitochondrial tRNAs Evidence-based methods have been developed to classify mitochondrial tRNA variations Conservation, Biochemical test, histochemical test, heteroplasmy, segregation, single fibre, trans-mitochondrial cybrid, etc About 200 variants were classified by Yarham et al. Niroula and Vihinen, NAR 2016
  • 13. PON-mt-tRNA Classification of all possible single nucleotide substitutions in all 22 human mt-tRNAs Pathogenic variations are concentrated in the stems Anticodon loop has the highest frequency of pathogenic variations among loops 51.0% of all variants predicted to be harmful 42.0% of all variants in yeast arginine tRNACCU (Li et al. Science 2016)
  • 14. PON-P2 PON-P2 classifies amino acid substitutions into three classes benign pathogenic unclassified variants (UVs) Machine learning -based method (random forest) Features Physical and biochemical properties of amino acids Gene Ontology annotations Evolutionary features Functional annotation Niroula et al PLoS ONE 2015
  • 15. Variants in cancer Somatic variations in 7,042 cancer genomes/exomes Alexandrov et al. (Nature 2013) 30 cancer types ~5 million variants 2.63 million in coding region 824,001 amino acid substitutions Predicted the impact of AASs using PON-P2 14.24% of all variations were predicted harmful 14.71% of COSMIC variation (total 647,872) 39.88% variations in Cancer Gene Census (CGC) genes Niroula and Vihinen, BMC Med Genet 2015
  • 16. Summary Estimates for the ratio of harmful variants - BTK kinase domain, SNAVs - MMR system proteins, all variants - Mitochondrial tRNAs - Cancer - 30 cancer types - COSMIC - Cancer Gene Census proteins Highly variable ratios between proteins and domains, and within proteins
  • 17. Thanks! Abhishek Gabriel Gerard Jelena Siddhaling Yang Niroula Teku Schaafsma Calyseva Urolagin Yang Heidi Preethy Jouni Ali Nair Väliaho http://structure.bmc.lu.se mauno.vihinen@med.lu.se

Notes de l'éditeur

  1. The figures for Lund University in the presentation are based on the 2011 financial year.
  2. The figures for Lund University in the presentation are based on the 2011 financial year.