SlideShare a Scribd company logo
1 of 18
Introduction to different measures of linkage disequilibrium (LD) and their calculation Awais Khan, University of Illinois, Urbana-Champaign By Dr. M. Awais Khan University of Illinois, Urbana-Champaign
Awais Khan, University of Illinois, Urbana-Champaign To understand the calculation of linkage disequilibrium consider following example Suppose there are two genes on Chromosome 5 of apple, each with two alleles ACT G GTAT………….GATCA A CCAG ACT C GTAT………….GATCA A CCAG ACT C GTAT………….GATCA T CCAG SNP1 SNP2 Showing only alleles for both SNPs  Calculation of linkage disequilibrium Alleles SNP1 SNP2 Allele 1 G A Allele 2 C T
Awais Khan, University of Illinois, Urbana-Champaign For better understanding of LD calculation, it is divided into five steps If p1 and p2 =frequency of the alleles at SNP1 and q1 and q2 =frequency of the alleles at SNP2 then in tabular form it could be written as follows Steps in LD calculation Step 1) Calculate allele frequencies SNP1 SNP2 Allele Frequency Allele Frequency G p1 A q1 C p2 T q2
Awais Khan, University of Illinois, Urbana-Champaign Suppose haplotype frequencies are as follows From our example of two SNPs each with two alleles all possible haplotypes are Step 2) Calculate haplotype frequencies SNP2 Allele A T SNP1 G GA GT C CA CT Haplotype Frequency Haplotype Frequency GA p11 GT p12 CA p21 CT q22
Awais Khan, University of Illinois, Urbana-Champaign When haplotype frequencies are equal to the product of their corresponding allele frequencies, it means the loci are in linkage equilibrium  Step 3) Linkage equilibrium Haplotype frequency Product of allelic frequency p11 = p1q1 p12 = p1q2 p21 = p2q1 p22 = p2q2
Awais Khan, University of Illinois, Urbana-Champaign D11=p11-p1q1 D12=p12-p1q2 D21=p21-p2q1 D22=p22-p2q2 We can deduce linkage disequilibrium for each haplotype as the deviation of observed haplotype frequency from its corresponding allelic frequencies expected under equilibrium Step 4) Linkage disequilibrium After solving above for D, we get as follows: SNP2 1 2 SNP1 1 p1q1+D p1q2-D p1 2 p2q1-D p2q2+D p2 q1 q2 1
Awais Khan, University of Illinois, Urbana-Champaign p11 p22  =(p1 q1 + D )(p2 q2 + D ) =  p1q1p2q2+ p1q1D + p2q2D + D 2   p12 p21  = (p1 q2 − D )(p2 q1 − D )  =  p1q1p2q2− p2q1D − p1q2D + D 2 Subtracting  (p1q1p2q2+ p1q1D + p2q2D + D 2 )  –   (p1q1p2q2− p2q1D − p1q2D + D 2 ) p11 p22 − p12 p21 = D (p1 q1 + p2 q1 + p2 q2 + p1 q2 )  = D × (1) = D Step 5) Calculation of Linkage disequilibrium measure D a) b) c)
Awais Khan, University of Illinois, Urbana-Champaign P11 = p1q1= 0.5  x  0.5 =  0.25  P22 = p2q2= 0.5  x  0.5 =  0.25 P12 = p1q2= 0.5  x  0.5 =  0.25 P21 = p2q1= 0.5  x  0.5 =  0.25  D = (P11)(P22) -(P12)(P21) D  =  (0.25) (0.25)  - (0.25) (0.25)  =  0  Estimate of D in case of Linkage Equilibrium If allele frequencies of p1 and q1 are both 0.5 and equilibrium occurs (only Ab and aB exist in the population)
Awais Khan, University of Illinois, Urbana-Champaign P11  =  p1q1+ D  =  0.25 + D  =  0.5 P22  =  p2q2+ D  =  0.25 + D  =  0.5 P12  =  p1q2-D  =  0.25 -D  =  0 P21  =  p2q1-D  =  0.25 -D  =  0 D = (P11)(P22) -(P12)(P21) D  =  (0.5) (0.5)  - (0) (0)  =  0.25 Estimate of D in case of Linkage Disequilibrium If allele frequency of p1 and  q1 are  both  0.5  and there is complete non-random association (only AB and ab exist in the population) with equal allele frequencies at all loci
Awais Khan, University of Illinois, Urbana-Champaign Sometimes, depending on allele frequency of two loci, the value of D can be negative, but actual gametic frequencies cannot be negative To overcome this issue, standardization methods have been proposed Standardization of D
Awais Khan, University of Illinois, Urbana-Champaign In a common standardization method, a relative measure of  disequilibrium (D) compared to its maximum is used: D’ = D / Dmax When D is positive Dmax=  min  [ (p1q2)  or  (p2q1) ] When  D  is  negative  Dmax=  min  [ (p1q1)  or  (p2q2) ] This standardization makes D-values range between 0 and 1 Standardization of D
Awais Khan, University of Illinois, Urbana-Champaign Another commonly used measure to calculate LD between loci is Pearson coefficient of correlation (r) r = D/ (p1p2q1q2) 1/2 However, squared coefficient of correlation (r 2 ) is often used to remove the arbitrary sign introduced Correlation coefficient as a measure of LD
Awais Khan, University of Illinois, Urbana-Champaign However, r can be conveniently used for chi-test, as χ 2  = r  2  N where N is the number of chromosomes in the sample To test if LD is statistically significant we can do a χ 2  test χ 2 =Σ(obs-exp) 2 / exp  expected is random associations  between alleles Testing significance of LD
Awais Khan, University of Illinois, Urbana-Champaign Genotypic data GA = 474  GT = 611  CA = 142  CT = 773  Total  =  2000 Haplotype Frequencies GA  =  474 / 2000  =  .2370 GT  =  611 / 2000  =  .3055 CA  =  142 / 2000  =  .0710 CT  =  773 / 2000  =  .3865  Example Let’s assume that we have genotypic data for the two SNPs with two alleles each (same example used to deduce the equations for different LD measures) Calculation of haplotype and allele frequencies Allele  frequencies G =  0.542 C =  0.457 A =  0.308 T =  0.692
Awais Khan, University of Illinois, Urbana-Champaign D  =  (P11 P22)  - (P12 P21) D  =  (0.2370  x  0.3865)  - (0.3055 x 0.0710)  =  0.0699 Input values in the equation for D to calculate linkage disequilibrium To estimate Dmax input allelic frequencies and value for D in the following equation Dmax = min [ (p1q2) or  (p2q1)  ] Dmax =  (0.5425 x  0.692)  =  0.375 or  =  (0.4575 x  0.308)  =  0.141
Awais Khan, University of Illinois, Urbana-Champaign Now calculate D’ input value of  D and Dmax calculated in previous step in the following equation D’ = D / Dmax D’ =  0.0699 / 0.141 =  0.496  =  0.5 To calculate coefficient of correlation (r), input value of  D and allele frequencies calculated in previous steps in the following equation r  =  D / (p1p2q1q2)  ½ r  = 0.0699 /(0.5425 x 0.4575 x 0.308 x 0.692) 1/2 r  = 0.0699 / 0.23  =  0.304 r 2  = (0.304) 2   = 0 .092
Awais Khan, University of Illinois, Urbana-Champaign χ 2  = r  2  N χ 2  = 0.092  x  2000  =  184.8  (1 df) At 184.8 and df of 1, P-value is 0.0001 So, we can conclude based on our calculations that there is a significant LD between loci and it is 50% of the theoretical maximum Also note that two SNPs are in complete LD (not separated by recombination) when D ' = 1 or r 2 =1 To check the significance of LD between loci use following equation
Awais Khan, University of Illinois, Urbana-Champaign Reference: Lewontin R.C. 1988. On Measures of Gametic Disequilibrium. Genetics, 120(3):849-852. Devlin B., Risch N. 1995. A Comparison of Linkage Disequilibrium Measures for Fine-Scale Mapping. Genomics 29 (2): 311-322.

More Related Content

What's hot

Quantitative trait loci (QTL) analysis and its applications in plant breeding
Quantitative trait loci (QTL) analysis and its applications in plant breedingQuantitative trait loci (QTL) analysis and its applications in plant breeding
Quantitative trait loci (QTL) analysis and its applications in plant breedingPGS
 
Advanced biometrical and quantitative genetics akshay
Advanced biometrical and quantitative genetics akshayAdvanced biometrical and quantitative genetics akshay
Advanced biometrical and quantitative genetics akshayAkshay Deshmukh
 
GBS: Genotyping by sequencing
GBS: Genotyping by sequencingGBS: Genotyping by sequencing
GBS: Genotyping by sequencingsampath perumal
 
mapping population
mapping populationmapping population
mapping populationHarsh Mishra
 
Genome wide association studies seminar
Genome wide association studies seminarGenome wide association studies seminar
Genome wide association studies seminarVarsha Gayatonde
 
Whole Genome Selection
Whole Genome SelectionWhole Genome Selection
Whole Genome SelectionRaghav N.R
 
Epigenetic regulation in higher plants
Epigenetic regulation in higher plantsEpigenetic regulation in higher plants
Epigenetic regulation in higher plantsFOODCROPS
 
Association mapping, GWAS, Mapping, natural population mapping
Association mapping, GWAS, Mapping, natural population mappingAssociation mapping, GWAS, Mapping, natural population mapping
Association mapping, GWAS, Mapping, natural population mappingMahesh Biradar
 
Association mapping
Association mappingAssociation mapping
Association mappingNivethitha T
 
Association mapping
Association mapping Association mapping
Association mapping Preeti Kapoor
 
heritabilitygeneticadvance.pptx
heritabilitygeneticadvance.pptxheritabilitygeneticadvance.pptx
heritabilitygeneticadvance.pptxPudhuvai Baveesh
 
Molecular markers and Functional molecular markers
Molecular markers and Functional molecular markersMolecular markers and Functional molecular markers
Molecular markers and Functional molecular markersChandana B.R.
 
Candidate Gene Approach in Crop Improvement
Candidate Gene Approach in Crop ImprovementCandidate Gene Approach in Crop Improvement
Candidate Gene Approach in Crop ImprovementBonipasAntony2
 
Genetic diversity analysis
Genetic diversity analysisGenetic diversity analysis
Genetic diversity analysisAKHISHA P. A.
 
Statistical analysis of genetics parameter
Statistical analysis of genetics parameterStatistical analysis of genetics parameter
Statistical analysis of genetics parameterPawan Nagar
 

What's hot (20)

Quantitative trait loci (QTL) analysis and its applications in plant breeding
Quantitative trait loci (QTL) analysis and its applications in plant breedingQuantitative trait loci (QTL) analysis and its applications in plant breeding
Quantitative trait loci (QTL) analysis and its applications in plant breeding
 
Advanced biometrical and quantitative genetics akshay
Advanced biometrical and quantitative genetics akshayAdvanced biometrical and quantitative genetics akshay
Advanced biometrical and quantitative genetics akshay
 
GBS: Genotyping by sequencing
GBS: Genotyping by sequencingGBS: Genotyping by sequencing
GBS: Genotyping by sequencing
 
mapping population
mapping populationmapping population
mapping population
 
Genome wide association studies seminar
Genome wide association studies seminarGenome wide association studies seminar
Genome wide association studies seminar
 
Application of Genome-Wide Association Study (GWAS) and transcriptomics to st...
Application of Genome-Wide Association Study (GWAS) and transcriptomics to st...Application of Genome-Wide Association Study (GWAS) and transcriptomics to st...
Application of Genome-Wide Association Study (GWAS) and transcriptomics to st...
 
Whole Genome Selection
Whole Genome SelectionWhole Genome Selection
Whole Genome Selection
 
Epigenetic regulation in higher plants
Epigenetic regulation in higher plantsEpigenetic regulation in higher plants
Epigenetic regulation in higher plants
 
Association mapping, GWAS, Mapping, natural population mapping
Association mapping, GWAS, Mapping, natural population mappingAssociation mapping, GWAS, Mapping, natural population mapping
Association mapping, GWAS, Mapping, natural population mapping
 
Association mapping
Association mappingAssociation mapping
Association mapping
 
Association mapping
Association mapping Association mapping
Association mapping
 
heritabilitygeneticadvance.pptx
heritabilitygeneticadvance.pptxheritabilitygeneticadvance.pptx
heritabilitygeneticadvance.pptx
 
Molecular markers and Functional molecular markers
Molecular markers and Functional molecular markersMolecular markers and Functional molecular markers
Molecular markers and Functional molecular markers
 
GWAS
GWASGWAS
GWAS
 
Candidate Gene Approach in Crop Improvement
Candidate Gene Approach in Crop ImprovementCandidate Gene Approach in Crop Improvement
Candidate Gene Approach in Crop Improvement
 
QTL
QTLQTL
QTL
 
Mapping population ppt
Mapping population pptMapping population ppt
Mapping population ppt
 
Genetic diversity analysis
Genetic diversity analysisGenetic diversity analysis
Genetic diversity analysis
 
Statistical analysis of genetics parameter
Statistical analysis of genetics parameterStatistical analysis of genetics parameter
Statistical analysis of genetics parameter
 
Genomic selection
Genomic  selectionGenomic  selection
Genomic selection
 

Viewers also liked

Estimation of Linkage Disequilibrium using GGT2 Software
Estimation of Linkage Disequilibrium using GGT2 SoftwareEstimation of Linkage Disequilibrium using GGT2 Software
Estimation of Linkage Disequilibrium using GGT2 SoftwareAwais Khan
 
Single nucleotide polymorphisms (sn ps), haplotypes,
Single nucleotide polymorphisms (sn ps), haplotypes,Single nucleotide polymorphisms (sn ps), haplotypes,
Single nucleotide polymorphisms (sn ps), haplotypes,Karan Veer Singh
 
Sperm DNA Fragmentation
Sperm DNA FragmentationSperm DNA Fragmentation
Sperm DNA Fragmentationsasicrea
 
Lecture 3 l dand_haplotypes_full
Lecture 3 l dand_haplotypes_fullLecture 3 l dand_haplotypes_full
Lecture 3 l dand_haplotypes_fullLekki Frazier-Wood
 
Haplotype resolved structural variation assembly with long reads
Haplotype resolved structural variation assembly with long readsHaplotype resolved structural variation assembly with long reads
Haplotype resolved structural variation assembly with long readsGenome Reference Consortium
 
Tetrad analysis, positive and negative interference, mapping through somatic ...
Tetrad analysis, positive and negative interference, mapping through somatic ...Tetrad analysis, positive and negative interference, mapping through somatic ...
Tetrad analysis, positive and negative interference, mapping through somatic ...Promila Sheoran
 

Viewers also liked (11)

Estimation of Linkage Disequilibrium using GGT2 Software
Estimation of Linkage Disequilibrium using GGT2 SoftwareEstimation of Linkage Disequilibrium using GGT2 Software
Estimation of Linkage Disequilibrium using GGT2 Software
 
Single nucleotide polymorphisms (sn ps), haplotypes,
Single nucleotide polymorphisms (sn ps), haplotypes,Single nucleotide polymorphisms (sn ps), haplotypes,
Single nucleotide polymorphisms (sn ps), haplotypes,
 
Sperm DNA Fragmentation
Sperm DNA FragmentationSperm DNA Fragmentation
Sperm DNA Fragmentation
 
Lecture 3 l dand_haplotypes_full
Lecture 3 l dand_haplotypes_fullLecture 3 l dand_haplotypes_full
Lecture 3 l dand_haplotypes_full
 
Tetrad analysis
Tetrad analysisTetrad analysis
Tetrad analysis
 
linkage
linkagelinkage
linkage
 
Lytic cycle
Lytic cycleLytic cycle
Lytic cycle
 
Haplotype resolved structural variation assembly with long reads
Haplotype resolved structural variation assembly with long readsHaplotype resolved structural variation assembly with long reads
Haplotype resolved structural variation assembly with long reads
 
Recombination
RecombinationRecombination
Recombination
 
Tetrad analysis, positive and negative interference, mapping through somatic ...
Tetrad analysis, positive and negative interference, mapping through somatic ...Tetrad analysis, positive and negative interference, mapping through somatic ...
Tetrad analysis, positive and negative interference, mapping through somatic ...
 
Snp
SnpSnp
Snp
 

Similar to Measures of Linkage Disequilibrium

path coeeficient
path coeeficientpath coeeficient
path coeeficientkanaka K K
 
Categorical data analysis full lecture note PPT.pptx
Categorical data analysis full lecture note  PPT.pptxCategorical data analysis full lecture note  PPT.pptx
Categorical data analysis full lecture note PPT.pptxMinilikDerseh1
 
Ning_Mei.ASSIGN01
Ning_Mei.ASSIGN01Ning_Mei.ASSIGN01
Ning_Mei.ASSIGN01宁 梅
 
Reading Seminar (140515) Spectral Learning of L-PCFGs
Reading Seminar (140515) Spectral Learning of L-PCFGsReading Seminar (140515) Spectral Learning of L-PCFGs
Reading Seminar (140515) Spectral Learning of L-PCFGsKeisuke OTAKI
 
Values and Means - Part 2
Values and Means - Part 2Values and Means - Part 2
Values and Means - Part 2SyedShaanz
 
Chapter-4 combined.pptx
Chapter-4 combined.pptxChapter-4 combined.pptx
Chapter-4 combined.pptxHamzaHaji6
 
Chapter 9 Two-Sample Inference 265 Chapter 9 Two-Sam.docx
Chapter 9 Two-Sample Inference 265 Chapter 9 Two-Sam.docxChapter 9 Two-Sample Inference 265 Chapter 9 Two-Sam.docx
Chapter 9 Two-Sample Inference 265 Chapter 9 Two-Sam.docxtiffanyd4
 
Tables and Formulas for Sullivan, Statistics Informed Decisio.docx
Tables and Formulas for Sullivan, Statistics Informed Decisio.docxTables and Formulas for Sullivan, Statistics Informed Decisio.docx
Tables and Formulas for Sullivan, Statistics Informed Decisio.docxmattinsonjanel
 
When is undersampling effective in unbalanced classification tasks?
When is undersampling effective in unbalanced classification tasks?When is undersampling effective in unbalanced classification tasks?
When is undersampling effective in unbalanced classification tasks?Andrea Dal Pozzolo
 
Waveform_codingUNIT-II_DC_-PPT.pptx
Waveform_codingUNIT-II_DC_-PPT.pptxWaveform_codingUNIT-II_DC_-PPT.pptx
Waveform_codingUNIT-II_DC_-PPT.pptxKIRUTHIKAAR2
 
InnerSoft STATS - Methods and formulas help
InnerSoft STATS - Methods and formulas helpInnerSoft STATS - Methods and formulas help
InnerSoft STATS - Methods and formulas helpInnerSoft
 
Waveform_codingUNIT-II_DC_-PPT.pptx
Waveform_codingUNIT-II_DC_-PPT.pptxWaveform_codingUNIT-II_DC_-PPT.pptx
Waveform_codingUNIT-II_DC_-PPT.pptxKIRUTHIKAAR2
 
OPTIMIZED RATE ALLOCATION OF HYPERSPECTRAL IMAGES IN COMPRESSED DOMAIN USING ...
OPTIMIZED RATE ALLOCATION OF HYPERSPECTRAL IMAGES IN COMPRESSED DOMAIN USING ...OPTIMIZED RATE ALLOCATION OF HYPERSPECTRAL IMAGES IN COMPRESSED DOMAIN USING ...
OPTIMIZED RATE ALLOCATION OF HYPERSPECTRAL IMAGES IN COMPRESSED DOMAIN USING ...Pioneer Natural Resources
 

Similar to Measures of Linkage Disequilibrium (20)

path coeeficient
path coeeficientpath coeeficient
path coeeficient
 
Categorical data analysis full lecture note PPT.pptx
Categorical data analysis full lecture note  PPT.pptxCategorical data analysis full lecture note  PPT.pptx
Categorical data analysis full lecture note PPT.pptx
 
Ning_Mei.ASSIGN01
Ning_Mei.ASSIGN01Ning_Mei.ASSIGN01
Ning_Mei.ASSIGN01
 
Reading Seminar (140515) Spectral Learning of L-PCFGs
Reading Seminar (140515) Spectral Learning of L-PCFGsReading Seminar (140515) Spectral Learning of L-PCFGs
Reading Seminar (140515) Spectral Learning of L-PCFGs
 
Values and Means - Part 2
Values and Means - Part 2Values and Means - Part 2
Values and Means - Part 2
 
Microeconomics-Help-Experts.pptx
Microeconomics-Help-Experts.pptxMicroeconomics-Help-Experts.pptx
Microeconomics-Help-Experts.pptx
 
Sample size estimation
Sample size estimationSample size estimation
Sample size estimation
 
Chapter-4 combined.pptx
Chapter-4 combined.pptxChapter-4 combined.pptx
Chapter-4 combined.pptx
 
Chapter 9 Two-Sample Inference 265 Chapter 9 Two-Sam.docx
Chapter 9 Two-Sample Inference 265 Chapter 9 Two-Sam.docxChapter 9 Two-Sample Inference 265 Chapter 9 Two-Sam.docx
Chapter 9 Two-Sample Inference 265 Chapter 9 Two-Sam.docx
 
Statistics
StatisticsStatistics
Statistics
 
Tables and Formulas for Sullivan, Statistics Informed Decisio.docx
Tables and Formulas for Sullivan, Statistics Informed Decisio.docxTables and Formulas for Sullivan, Statistics Informed Decisio.docx
Tables and Formulas for Sullivan, Statistics Informed Decisio.docx
 
Quantitative genetics
Quantitative geneticsQuantitative genetics
Quantitative genetics
 
Chap04
Chap04Chap04
Chap04
 
When is undersampling effective in unbalanced classification tasks?
When is undersampling effective in unbalanced classification tasks?When is undersampling effective in unbalanced classification tasks?
When is undersampling effective in unbalanced classification tasks?
 
Datacompression1
Datacompression1Datacompression1
Datacompression1
 
Waveform_codingUNIT-II_DC_-PPT.pptx
Waveform_codingUNIT-II_DC_-PPT.pptxWaveform_codingUNIT-II_DC_-PPT.pptx
Waveform_codingUNIT-II_DC_-PPT.pptx
 
InnerSoft STATS - Methods and formulas help
InnerSoft STATS - Methods and formulas helpInnerSoft STATS - Methods and formulas help
InnerSoft STATS - Methods and formulas help
 
Waveform_codingUNIT-II_DC_-PPT.pptx
Waveform_codingUNIT-II_DC_-PPT.pptxWaveform_codingUNIT-II_DC_-PPT.pptx
Waveform_codingUNIT-II_DC_-PPT.pptx
 
Chapt 2
Chapt 2Chapt 2
Chapt 2
 
OPTIMIZED RATE ALLOCATION OF HYPERSPECTRAL IMAGES IN COMPRESSED DOMAIN USING ...
OPTIMIZED RATE ALLOCATION OF HYPERSPECTRAL IMAGES IN COMPRESSED DOMAIN USING ...OPTIMIZED RATE ALLOCATION OF HYPERSPECTRAL IMAGES IN COMPRESSED DOMAIN USING ...
OPTIMIZED RATE ALLOCATION OF HYPERSPECTRAL IMAGES IN COMPRESSED DOMAIN USING ...
 

Recently uploaded

Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfJayanti Pande
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104misteraugie
 
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...Shubhangi Sonawane
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingTechSoup
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfciinovamais
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhikauryashika82
 
psychiatric nursing HISTORY COLLECTION .docx
psychiatric  nursing HISTORY  COLLECTION  .docxpsychiatric  nursing HISTORY  COLLECTION  .docx
psychiatric nursing HISTORY COLLECTION .docxPoojaSen20
 
Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxnegromaestrong
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin ClassesCeline George
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAssociation for Project Management
 
Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Disha Kariya
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactPECB
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeThiyagu K
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...christianmathematics
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsTechSoup
 
Gardella_PRCampaignConclusion Pitch Letter
Gardella_PRCampaignConclusion Pitch LetterGardella_PRCampaignConclusion Pitch Letter
Gardella_PRCampaignConclusion Pitch LetterMateoGardella
 

Recently uploaded (20)

Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdf
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104
 
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
 
psychiatric nursing HISTORY COLLECTION .docx
psychiatric  nursing HISTORY  COLLECTION  .docxpsychiatric  nursing HISTORY  COLLECTION  .docx
psychiatric nursing HISTORY COLLECTION .docx
 
Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptx
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across Sectors
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 
Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
 
Gardella_PRCampaignConclusion Pitch Letter
Gardella_PRCampaignConclusion Pitch LetterGardella_PRCampaignConclusion Pitch Letter
Gardella_PRCampaignConclusion Pitch Letter
 

Measures of Linkage Disequilibrium

  • 1. Introduction to different measures of linkage disequilibrium (LD) and their calculation Awais Khan, University of Illinois, Urbana-Champaign By Dr. M. Awais Khan University of Illinois, Urbana-Champaign
  • 2. Awais Khan, University of Illinois, Urbana-Champaign To understand the calculation of linkage disequilibrium consider following example Suppose there are two genes on Chromosome 5 of apple, each with two alleles ACT G GTAT………….GATCA A CCAG ACT C GTAT………….GATCA A CCAG ACT C GTAT………….GATCA T CCAG SNP1 SNP2 Showing only alleles for both SNPs Calculation of linkage disequilibrium Alleles SNP1 SNP2 Allele 1 G A Allele 2 C T
  • 3. Awais Khan, University of Illinois, Urbana-Champaign For better understanding of LD calculation, it is divided into five steps If p1 and p2 =frequency of the alleles at SNP1 and q1 and q2 =frequency of the alleles at SNP2 then in tabular form it could be written as follows Steps in LD calculation Step 1) Calculate allele frequencies SNP1 SNP2 Allele Frequency Allele Frequency G p1 A q1 C p2 T q2
  • 4. Awais Khan, University of Illinois, Urbana-Champaign Suppose haplotype frequencies are as follows From our example of two SNPs each with two alleles all possible haplotypes are Step 2) Calculate haplotype frequencies SNP2 Allele A T SNP1 G GA GT C CA CT Haplotype Frequency Haplotype Frequency GA p11 GT p12 CA p21 CT q22
  • 5. Awais Khan, University of Illinois, Urbana-Champaign When haplotype frequencies are equal to the product of their corresponding allele frequencies, it means the loci are in linkage equilibrium Step 3) Linkage equilibrium Haplotype frequency Product of allelic frequency p11 = p1q1 p12 = p1q2 p21 = p2q1 p22 = p2q2
  • 6. Awais Khan, University of Illinois, Urbana-Champaign D11=p11-p1q1 D12=p12-p1q2 D21=p21-p2q1 D22=p22-p2q2 We can deduce linkage disequilibrium for each haplotype as the deviation of observed haplotype frequency from its corresponding allelic frequencies expected under equilibrium Step 4) Linkage disequilibrium After solving above for D, we get as follows: SNP2 1 2 SNP1 1 p1q1+D p1q2-D p1 2 p2q1-D p2q2+D p2 q1 q2 1
  • 7. Awais Khan, University of Illinois, Urbana-Champaign p11 p22 =(p1 q1 + D )(p2 q2 + D ) = p1q1p2q2+ p1q1D + p2q2D + D 2 p12 p21 = (p1 q2 − D )(p2 q1 − D ) = p1q1p2q2− p2q1D − p1q2D + D 2 Subtracting (p1q1p2q2+ p1q1D + p2q2D + D 2 ) – (p1q1p2q2− p2q1D − p1q2D + D 2 ) p11 p22 − p12 p21 = D (p1 q1 + p2 q1 + p2 q2 + p1 q2 ) = D × (1) = D Step 5) Calculation of Linkage disequilibrium measure D a) b) c)
  • 8. Awais Khan, University of Illinois, Urbana-Champaign P11 = p1q1= 0.5 x 0.5 = 0.25 P22 = p2q2= 0.5 x 0.5 = 0.25 P12 = p1q2= 0.5 x 0.5 = 0.25 P21 = p2q1= 0.5 x 0.5 = 0.25 D = (P11)(P22) -(P12)(P21) D = (0.25) (0.25) - (0.25) (0.25) = 0 Estimate of D in case of Linkage Equilibrium If allele frequencies of p1 and q1 are both 0.5 and equilibrium occurs (only Ab and aB exist in the population)
  • 9. Awais Khan, University of Illinois, Urbana-Champaign P11 = p1q1+ D = 0.25 + D = 0.5 P22 = p2q2+ D = 0.25 + D = 0.5 P12 = p1q2-D = 0.25 -D = 0 P21 = p2q1-D = 0.25 -D = 0 D = (P11)(P22) -(P12)(P21) D = (0.5) (0.5) - (0) (0) = 0.25 Estimate of D in case of Linkage Disequilibrium If allele frequency of p1 and q1 are both 0.5 and there is complete non-random association (only AB and ab exist in the population) with equal allele frequencies at all loci
  • 10. Awais Khan, University of Illinois, Urbana-Champaign Sometimes, depending on allele frequency of two loci, the value of D can be negative, but actual gametic frequencies cannot be negative To overcome this issue, standardization methods have been proposed Standardization of D
  • 11. Awais Khan, University of Illinois, Urbana-Champaign In a common standardization method, a relative measure of disequilibrium (D) compared to its maximum is used: D’ = D / Dmax When D is positive Dmax= min [ (p1q2) or (p2q1) ] When D is negative Dmax= min [ (p1q1) or (p2q2) ] This standardization makes D-values range between 0 and 1 Standardization of D
  • 12. Awais Khan, University of Illinois, Urbana-Champaign Another commonly used measure to calculate LD between loci is Pearson coefficient of correlation (r) r = D/ (p1p2q1q2) 1/2 However, squared coefficient of correlation (r 2 ) is often used to remove the arbitrary sign introduced Correlation coefficient as a measure of LD
  • 13. Awais Khan, University of Illinois, Urbana-Champaign However, r can be conveniently used for chi-test, as χ 2 = r 2 N where N is the number of chromosomes in the sample To test if LD is statistically significant we can do a χ 2 test χ 2 =Σ(obs-exp) 2 / exp expected is random associations between alleles Testing significance of LD
  • 14. Awais Khan, University of Illinois, Urbana-Champaign Genotypic data GA = 474 GT = 611 CA = 142 CT = 773 Total = 2000 Haplotype Frequencies GA = 474 / 2000 = .2370 GT = 611 / 2000 = .3055 CA = 142 / 2000 = .0710 CT = 773 / 2000 = .3865 Example Let’s assume that we have genotypic data for the two SNPs with two alleles each (same example used to deduce the equations for different LD measures) Calculation of haplotype and allele frequencies Allele frequencies G = 0.542 C = 0.457 A = 0.308 T = 0.692
  • 15. Awais Khan, University of Illinois, Urbana-Champaign D = (P11 P22) - (P12 P21) D = (0.2370 x 0.3865) - (0.3055 x 0.0710) = 0.0699 Input values in the equation for D to calculate linkage disequilibrium To estimate Dmax input allelic frequencies and value for D in the following equation Dmax = min [ (p1q2) or (p2q1) ] Dmax = (0.5425 x 0.692) = 0.375 or = (0.4575 x 0.308) = 0.141
  • 16. Awais Khan, University of Illinois, Urbana-Champaign Now calculate D’ input value of D and Dmax calculated in previous step in the following equation D’ = D / Dmax D’ = 0.0699 / 0.141 = 0.496 = 0.5 To calculate coefficient of correlation (r), input value of D and allele frequencies calculated in previous steps in the following equation r = D / (p1p2q1q2) ½ r = 0.0699 /(0.5425 x 0.4575 x 0.308 x 0.692) 1/2 r = 0.0699 / 0.23 = 0.304 r 2 = (0.304) 2 = 0 .092
  • 17. Awais Khan, University of Illinois, Urbana-Champaign χ 2 = r 2 N χ 2 = 0.092 x 2000 = 184.8 (1 df) At 184.8 and df of 1, P-value is 0.0001 So, we can conclude based on our calculations that there is a significant LD between loci and it is 50% of the theoretical maximum Also note that two SNPs are in complete LD (not separated by recombination) when D ' = 1 or r 2 =1 To check the significance of LD between loci use following equation
  • 18. Awais Khan, University of Illinois, Urbana-Champaign Reference: Lewontin R.C. 1988. On Measures of Gametic Disequilibrium. Genetics, 120(3):849-852. Devlin B., Risch N. 1995. A Comparison of Linkage Disequilibrium Measures for Fine-Scale Mapping. Genomics 29 (2): 311-322.