SlideShare une entreprise Scribd logo
1  sur  33
Creating a Kinship Matrix using
 Microsatellite Analyzer (MSA)

                        Zhifen Zhang
            The Ohio State University
Why create a kinship matrix?


• Visualize family relatedness
• Use in association mapping


The Objective of this Tutorial:
To introduce one approach to generate a kinship matrix
for a population
Background
    Types of Populations for Association Mapping (Yu, et al. 2006):
Population Type              Pros                               Cons
“Ideal population”           Minimal population                 Difficult to collect,
                             structure and familial             small size
                             relatedness                        Narrow genetic basis
                             Gives the greatest statistical
                             power
Family-based population      Minimal population                 Limited sample size and
                             structure                          allelic diversity
Population with              Larger size and broader            False positives or loss in
population structure         allelic diversity                  power due to familial
                                                                relatedness.
Population with familial     Larger size and broader            Inadequate control for
relationships within         allelic diversity                  false positives due to
structured population                                           population structure
     The highlighted population needs a kinship matrix for analysis.
Scheme of Mapping
                   Germplasm


Phenotyping (Y)                      Genotyping

                  Genome-wide scan           Background
                                              markers

              Genome-wide              Population structure
            polymorphisms (G)            (Q), kinship (K)


Association analysis (General Mixed Model): Y=G+Q+K+e
                                                  Zhu et al. 2006
Coefficient of Kinship

Coefficient of kinship is used to measure relatedness

DEFINITION: Coefficient of kinship is the probability that
  the alleles of a particular locus chosen randomly from
  two individuals are identical by descent (Lange, 2002)


* Identical by descent: the identical alleles from two
individuals arise from the same allele in an earlier
generation
Kinship Matrix Calculation


The kinship coefficient is the probability that an allele (a) taken
randomly from population i (at a given locus) will be identical by
descent to an allele taken randomly from population j at the same
locus

Kf= ∑k∑a (fai * faj)/D

Where ∑k∑a (fai * faj)/D is the sum of all loci and all alleles;
fai is the frequency of allele a in population i, faj is the frequency of allele a in
population j, D is the number of loci.

                                                     Cavalli-Sforza and Bodmer, 1971.
                                                                 MSA manual page 23
An Example
Population           Locus A              Locus B               Locus C
611R2                A                    b                     c
                     A                    b                     c
611R3                A                    B                     c
                     A                    B                     c

Coefficient of kinship for populations 611R2 and 611R3:
Kf= ∑k∑a (fai * faj)/D
  = (1x1+1x0 + 0x1+1x1)/3=2/3
The frequencies of alleles A and c in populations 611R2 and 611R3 are 1; the
frequency of allele b in population 611R2 is 1 and 0 while in population 611R3 is 0
Kinship Matrix


In the unified mixed model (Yu et al. 2006),
marker-based kinship coefficient matrices are included



A marker-based kinship coefficient matrix can be
generated using Microsatellite Analyzer (MSA)
Pipeline for Matrix Generation

   Genotyping with                    Data Entry to Create a
   Markers                            Spreadsheet


                  Arrange Data in a Format
                  Recognized by MSA



Import the .dat                    Run the Kinship
File into MSA                      Coefficient Analysis
A Case Study
Bacterial spot data provided by Dr. David Francis,
            The Ohio State University
Data Preparation
For genotyping, molecular markers can be:
  Microsatellite (Simple Sequence Repeat, SSR)
  SNP (Single Nucleotide Polymorphism)
  RFLP (Restriction Fragment Length Polymorphism)
  AFLP (Amplified Fragment Length Polymorphism)
  Other sequencing markers

MSA was designed for SSR analysis

For other marker types, a value is designated for each
  allele, e.g. For SNPs, A can be 11, C can be 12, G can be 13, and T
   can be 14.
Data Entry
After genotyping, a spreadsheet of genotypic data can
   be generated as shown below.
                                     Marker
Population                                                          Genotype Score




Genotype Score: 0 is homozygous for allele a, 1 is homozygous for allele b, 2 is
heterozygous.
Data Formatting
Population #
                                          Give a new value to each allele
       Mating scheme                      for each marker
                  Group name: default “1”




                             These 3 cells should
                                 be empty.




  Row 1 can be left empty except for columns a, b, and c, which have to be filled. In cell
  A1, enter ‘1’. Row 2 is marker name. Genotype data start in row 3.
Data Formatting
Two-row entry for each individual (we are treating individuals as
populations) to specify heterozygotes and homozygotes
                       Homozygotes have the    Heterozygotes have
                       same value in both      different values in each
                       rows.                   row.
After Formatting
Save the file in a “Tab Delimited” text format, change the extension
name “.txt” into “.dat”.
MSA (Microsatellite Analyzer) is available for free,
provided by Daniel Dieringer.
MSA is compatible with different operation systems




                                    Choose the
                                    operating system
                                    you use and install it
                                    as other software
Install MSA
For Mac, a folder for MSA will be formed after you extract the
downloaded file
Use MSA




                 Manual for MSA is
                 available in the folder

Sample data is
available
Use MSA




                            Menu of
                            Commands




Enter command “i” to import file
Use MSA




Import your data file to MSA by dragging it to the MSA window
Use MSA
After the data file is imported,




                          Enter command “d” to access submenu of
                          “Distance setting”
Use MSA




Enter “4” to turn on kinship coefficient
module.
Use MSA
Each line will be treated as a population in this study




                      Input command “c” to turn on “Pair-wise
                      populations distances calc”
Use MSA




                              NOTE: Dkf in the
                              matrix will be 1-kf in
                              default.




After everything is set, input “!” command to
run the analysis.
Use MSA




After the analysis, a folder of results will be created
automatically in the same folder as your data file
Use MSA




The kinship matrix is in the folder of “Distance _data”
Use MSA
The Kinship matrix is saved as a file named “KSC_Pop.txt”




Open the file with Excel to read the the matrix
Number of
individuals                       Kinship Matrix
     The population (line) code
Number of
individuals                         Kinship Matrix
     The population (line) code   Please remember that the value in the
                                  matrix is “1-kinship coefficient”.
Final Matrix
Use “1-X” (X is the value of each cell in the matrix) to generate a new
matrix that will be the kinship matrix in Excel.
The Mixed Model
Phenotype         Marker                Residual or
score             effect                Error



   Y= αX+ γM+ βQ + υK + e
                           Population
        Fixed Effect       structure
        rather than
        markers

 Yu et al. 2006
References Cited & External Link
References Cited
• Cavalli-Sforza, L. L., and W. F. Bodmer. 1971. The genetics of human
   populations, W.H. Freeman and Company, NY.
• Lange, K. 2002. Mathematical and statistical methods for genetic analysis, 2nd
   edition. Springer-Verlag, NY.
• Yu, J. G. Pressoir, W. H. Briggs, I. V. Bi, M. Yamasaki, J. F. Doebley, M. D.
   McMullen, B. S. Gaut, D. M. Nielsen, J. B. Holland, S. Kresovich, and E. S.
   Buckler. 2006. A unified mixed-model method for association mapping that
   accounts for multiple levels of relatedness. Nature Genetics 38: 203-208.
   (Available online at: http://dx.doi.org/10.1038/ng1702) (verified 8 July 2011).
• Zhu, C. M. Gore, E. S. Buckler, and J. Yu. 2008. Status and prospects of
   association mapping in plants. The Plant Genome 1: 5-20. (Available online at:
   http://dx.doi.org/10.3835/plantgenome2008.02.0089) (verified 8 July 2011).
External Link
• Dieringer, D. Microsatellite analyzer (MSA) [Online]. Available at:
   http://i122server.vu-wien.ac.at/MSA/MSA_download.html (verified 8 July
   2011).

Contenu connexe

Tendances

Molecular mechanism of Ion uptake, Ion transporters for Nitrate
Molecular mechanism of Ion uptake,  Ion transporters for NitrateMolecular mechanism of Ion uptake,  Ion transporters for Nitrate
Molecular mechanism of Ion uptake, Ion transporters for NitrateAgronomist Wasim
 
KISAN CALL CENTRE
KISAN CALL CENTREKISAN CALL CENTRE
KISAN CALL CENTREjivheshsali
 
Apple Flowering and Fruit Quality
Apple Flowering and Fruit QualityApple Flowering and Fruit Quality
Apple Flowering and Fruit QualityMatthew Whiting
 
Extension teaching methods
Extension teaching methodsExtension teaching methods
Extension teaching methodsjivheshsali
 
Nutrient management
Nutrient managementNutrient management
Nutrient managementRomeu Leite
 
National Mission on Agriculture Extension
National Mission on Agriculture ExtensionNational Mission on Agriculture Extension
National Mission on Agriculture ExtensionNishu Kanwar
 
Fundamental of Extension Methods: Tools and Techniques of PRA
Fundamental of Extension Methods:  Tools and Techniques of PRAFundamental of Extension Methods:  Tools and Techniques of PRA
Fundamental of Extension Methods: Tools and Techniques of PRABrajendra Singh Meena
 
Fruit breeding method- introduction(IN INDIA)- Panchaal B.
Fruit breeding method- introduction(IN INDIA)- Panchaal B.Fruit breeding method- introduction(IN INDIA)- Panchaal B.
Fruit breeding method- introduction(IN INDIA)- Panchaal B.Panchaal Bhattacharjee
 
Breeding for resistance to biotic stresses
Breeding for resistance to biotic stressesBreeding for resistance to biotic stresses
Breeding for resistance to biotic stressesMANORANJAN ROUT
 
LINKAGE AND ITS ESTIMATION.pdf
LINKAGE AND ITS ESTIMATION.pdfLINKAGE AND ITS ESTIMATION.pdf
LINKAGE AND ITS ESTIMATION.pdfAthiraRajan46
 
Mechanism of wilt (ralstonia solanacearum) development
Mechanism of wilt (ralstonia solanacearum) developmentMechanism of wilt (ralstonia solanacearum) development
Mechanism of wilt (ralstonia solanacearum) developmentAkankshaShukla85
 
Reasons of less water requirement in pulses crops than cereal crops
Reasons of less water requirement in pulses crops than cereal cropsReasons of less water requirement in pulses crops than cereal crops
Reasons of less water requirement in pulses crops than cereal cropsSuman Dey
 
Water efficient
Water efficientWater efficient
Water efficientsugarmills
 
Water Management in Fruit Crops
Water Management in Fruit CropsWater Management in Fruit Crops
Water Management in Fruit Cropssuraj vishwakarma
 

Tendances (20)

Privatization of extension
Privatization of  extensionPrivatization of  extension
Privatization of extension
 
Molecular mechanism of Ion uptake, Ion transporters for Nitrate
Molecular mechanism of Ion uptake,  Ion transporters for NitrateMolecular mechanism of Ion uptake,  Ion transporters for Nitrate
Molecular mechanism of Ion uptake, Ion transporters for Nitrate
 
Association mapping
Association mappingAssociation mapping
Association mapping
 
KISAN CALL CENTRE
KISAN CALL CENTREKISAN CALL CENTRE
KISAN CALL CENTRE
 
Passive theories of ion absorption
Passive theories of ion absorptionPassive theories of ion absorption
Passive theories of ion absorption
 
Apple Flowering and Fruit Quality
Apple Flowering and Fruit QualityApple Flowering and Fruit Quality
Apple Flowering and Fruit Quality
 
Sukhjinder singh
Sukhjinder singhSukhjinder singh
Sukhjinder singh
 
Extension teaching methods
Extension teaching methodsExtension teaching methods
Extension teaching methods
 
Nutrient management
Nutrient managementNutrient management
Nutrient management
 
National Mission on Agriculture Extension
National Mission on Agriculture ExtensionNational Mission on Agriculture Extension
National Mission on Agriculture Extension
 
Fundamental of Extension Methods: Tools and Techniques of PRA
Fundamental of Extension Methods:  Tools and Techniques of PRAFundamental of Extension Methods:  Tools and Techniques of PRA
Fundamental of Extension Methods: Tools and Techniques of PRA
 
Fruit breeding method- introduction(IN INDIA)- Panchaal B.
Fruit breeding method- introduction(IN INDIA)- Panchaal B.Fruit breeding method- introduction(IN INDIA)- Panchaal B.
Fruit breeding method- introduction(IN INDIA)- Panchaal B.
 
Breeding for resistance to biotic stresses
Breeding for resistance to biotic stressesBreeding for resistance to biotic stresses
Breeding for resistance to biotic stresses
 
LINKAGE AND ITS ESTIMATION.pdf
LINKAGE AND ITS ESTIMATION.pdfLINKAGE AND ITS ESTIMATION.pdf
LINKAGE AND ITS ESTIMATION.pdf
 
Association mapping
Association mappingAssociation mapping
Association mapping
 
Layout of Agricultural field experiments
Layout of Agricultural field experimentsLayout of Agricultural field experiments
Layout of Agricultural field experiments
 
Mechanism of wilt (ralstonia solanacearum) development
Mechanism of wilt (ralstonia solanacearum) developmentMechanism of wilt (ralstonia solanacearum) development
Mechanism of wilt (ralstonia solanacearum) development
 
Reasons of less water requirement in pulses crops than cereal crops
Reasons of less water requirement in pulses crops than cereal cropsReasons of less water requirement in pulses crops than cereal crops
Reasons of less water requirement in pulses crops than cereal crops
 
Water efficient
Water efficientWater efficient
Water efficient
 
Water Management in Fruit Crops
Water Management in Fruit CropsWater Management in Fruit Crops
Water Management in Fruit Crops
 

Similaire à Creating a Kinship Matrix Using MSA

Association mapping approaches for tagging quality traits in maize
Association mapping approaches for tagging quality traits in maizeAssociation mapping approaches for tagging quality traits in maize
Association mapping approaches for tagging quality traits in maizeSenthil Natesan
 
Sparse inverse covariance estimation using skggm
Sparse inverse covariance estimation using skggmSparse inverse covariance estimation using skggm
Sparse inverse covariance estimation using skggmManjari Narayan
 
Back to Basics: Using GWAS to Drive Discovery for Complex Diseases
Back to Basics: Using GWAS to Drive Discovery for Complex DiseasesBack to Basics: Using GWAS to Drive Discovery for Complex Diseases
Back to Basics: Using GWAS to Drive Discovery for Complex DiseasesGolden Helix Inc
 
BES/SfE talk 2014
BES/SfE talk 2014BES/SfE talk 2014
BES/SfE talk 2014Bob O'Hara
 
Basic BLAST (BLASTn)
Basic BLAST (BLASTn)Basic BLAST (BLASTn)
Basic BLAST (BLASTn)Syed Lokman
 
Integrating phylogenetic inference and metadata visualization for NGS data
Integrating phylogenetic inference and metadata visualization for NGS dataIntegrating phylogenetic inference and metadata visualization for NGS data
Integrating phylogenetic inference and metadata visualization for NGS dataJoão André Carriço
 
Introduction to 16S rRNA gene multivariate analysis
Introduction to 16S rRNA gene multivariate analysisIntroduction to 16S rRNA gene multivariate analysis
Introduction to 16S rRNA gene multivariate analysisJosh Neufeld
 
Session ii g1 lab genomics and gene expression mmc-corr
Session ii g1 lab genomics and gene expression mmc-corrSession ii g1 lab genomics and gene expression mmc-corr
Session ii g1 lab genomics and gene expression mmc-corrUSD Bioinformatics
 
Stephen Friend Molecular Imaging Program at Stanford (MIPS) 2011-08-15
Stephen Friend Molecular Imaging Program at Stanford (MIPS) 2011-08-15Stephen Friend Molecular Imaging Program at Stanford (MIPS) 2011-08-15
Stephen Friend Molecular Imaging Program at Stanford (MIPS) 2011-08-15Sage Base
 
GWAS "GENOME WIDE ASSOCIATION STUDIES" A STEP AHEAD
GWAS "GENOME WIDE ASSOCIATION STUDIES" A STEP AHEADGWAS "GENOME WIDE ASSOCIATION STUDIES" A STEP AHEAD
GWAS "GENOME WIDE ASSOCIATION STUDIES" A STEP AHEADvv628048
 
Stephen Friend Institute for Cancer Research 2011-11-01
Stephen Friend Institute for Cancer Research 2011-11-01Stephen Friend Institute for Cancer Research 2011-11-01
Stephen Friend Institute for Cancer Research 2011-11-01Sage Base
 
NetBioSIG2013-Talk Gang Su
NetBioSIG2013-Talk Gang SuNetBioSIG2013-Talk Gang Su
NetBioSIG2013-Talk Gang SuAlexander Pico
 
MCQs on DNA MicroArray.pdf
MCQs on DNA MicroArray.pdfMCQs on DNA MicroArray.pdf
MCQs on DNA MicroArray.pdfRajendraChavhan3
 
Stephen Friend SciLife 2011-09-20
Stephen Friend SciLife 2011-09-20Stephen Friend SciLife 2011-09-20
Stephen Friend SciLife 2011-09-20Sage Base
 
Visual Exploration of Clinical and Genomic Data for Patient Stratification
Visual Exploration of Clinical and Genomic Data for Patient StratificationVisual Exploration of Clinical and Genomic Data for Patient Stratification
Visual Exploration of Clinical and Genomic Data for Patient StratificationNils Gehlenborg
 
BLAST (Basic local alignment search Tool)
BLAST (Basic local alignment search Tool)BLAST (Basic local alignment search Tool)
BLAST (Basic local alignment search Tool)Ariful Islam Sagar
 

Similaire à Creating a Kinship Matrix Using MSA (20)

Gf o2014talk
Gf o2014talkGf o2014talk
Gf o2014talk
 
Association mapping approaches for tagging quality traits in maize
Association mapping approaches for tagging quality traits in maizeAssociation mapping approaches for tagging quality traits in maize
Association mapping approaches for tagging quality traits in maize
 
Sparse inverse covariance estimation using skggm
Sparse inverse covariance estimation using skggmSparse inverse covariance estimation using skggm
Sparse inverse covariance estimation using skggm
 
Back to Basics: Using GWAS to Drive Discovery for Complex Diseases
Back to Basics: Using GWAS to Drive Discovery for Complex DiseasesBack to Basics: Using GWAS to Drive Discovery for Complex Diseases
Back to Basics: Using GWAS to Drive Discovery for Complex Diseases
 
BES/SfE talk 2014
BES/SfE talk 2014BES/SfE talk 2014
BES/SfE talk 2014
 
Populations
PopulationsPopulations
Populations
 
Basic BLAST (BLASTn)
Basic BLAST (BLASTn)Basic BLAST (BLASTn)
Basic BLAST (BLASTn)
 
BLAST
BLASTBLAST
BLAST
 
Integrating phylogenetic inference and metadata visualization for NGS data
Integrating phylogenetic inference and metadata visualization for NGS dataIntegrating phylogenetic inference and metadata visualization for NGS data
Integrating phylogenetic inference and metadata visualization for NGS data
 
Introduction to 16S rRNA gene multivariate analysis
Introduction to 16S rRNA gene multivariate analysisIntroduction to 16S rRNA gene multivariate analysis
Introduction to 16S rRNA gene multivariate analysis
 
Session ii g1 lab genomics and gene expression mmc-corr
Session ii g1 lab genomics and gene expression mmc-corrSession ii g1 lab genomics and gene expression mmc-corr
Session ii g1 lab genomics and gene expression mmc-corr
 
Stephen Friend Molecular Imaging Program at Stanford (MIPS) 2011-08-15
Stephen Friend Molecular Imaging Program at Stanford (MIPS) 2011-08-15Stephen Friend Molecular Imaging Program at Stanford (MIPS) 2011-08-15
Stephen Friend Molecular Imaging Program at Stanford (MIPS) 2011-08-15
 
GWAS "GENOME WIDE ASSOCIATION STUDIES" A STEP AHEAD
GWAS "GENOME WIDE ASSOCIATION STUDIES" A STEP AHEADGWAS "GENOME WIDE ASSOCIATION STUDIES" A STEP AHEAD
GWAS "GENOME WIDE ASSOCIATION STUDIES" A STEP AHEAD
 
Stephen Friend Institute for Cancer Research 2011-11-01
Stephen Friend Institute for Cancer Research 2011-11-01Stephen Friend Institute for Cancer Research 2011-11-01
Stephen Friend Institute for Cancer Research 2011-11-01
 
Blast
BlastBlast
Blast
 
NetBioSIG2013-Talk Gang Su
NetBioSIG2013-Talk Gang SuNetBioSIG2013-Talk Gang Su
NetBioSIG2013-Talk Gang Su
 
MCQs on DNA MicroArray.pdf
MCQs on DNA MicroArray.pdfMCQs on DNA MicroArray.pdf
MCQs on DNA MicroArray.pdf
 
Stephen Friend SciLife 2011-09-20
Stephen Friend SciLife 2011-09-20Stephen Friend SciLife 2011-09-20
Stephen Friend SciLife 2011-09-20
 
Visual Exploration of Clinical and Genomic Data for Patient Stratification
Visual Exploration of Clinical and Genomic Data for Patient StratificationVisual Exploration of Clinical and Genomic Data for Patient Stratification
Visual Exploration of Clinical and Genomic Data for Patient Stratification
 
BLAST (Basic local alignment search Tool)
BLAST (Basic local alignment search Tool)BLAST (Basic local alignment search Tool)
BLAST (Basic local alignment search Tool)
 

Dernier

ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesrafiqahmad00786416
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxRemote DBA Services
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxRustici Software
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...apidays
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Angeliki Cooney
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsNanddeep Nachan
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Zilliz
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Jeffrey Haguewood
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...apidays
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusZilliz
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native ApplicationsWSO2
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontologyjohnbeverley2021
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Bhuvaneswari Subramani
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelDeepika Singh
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businesspanagenda
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistandanishmna97
 

Dernier (20)

ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptx
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 

Creating a Kinship Matrix Using MSA

  • 1. Creating a Kinship Matrix using Microsatellite Analyzer (MSA) Zhifen Zhang The Ohio State University
  • 2. Why create a kinship matrix? • Visualize family relatedness • Use in association mapping The Objective of this Tutorial: To introduce one approach to generate a kinship matrix for a population
  • 3. Background Types of Populations for Association Mapping (Yu, et al. 2006): Population Type Pros Cons “Ideal population” Minimal population Difficult to collect, structure and familial small size relatedness Narrow genetic basis Gives the greatest statistical power Family-based population Minimal population Limited sample size and structure allelic diversity Population with Larger size and broader False positives or loss in population structure allelic diversity power due to familial relatedness. Population with familial Larger size and broader Inadequate control for relationships within allelic diversity false positives due to structured population population structure The highlighted population needs a kinship matrix for analysis.
  • 4. Scheme of Mapping Germplasm Phenotyping (Y) Genotyping Genome-wide scan Background markers Genome-wide Population structure polymorphisms (G) (Q), kinship (K) Association analysis (General Mixed Model): Y=G+Q+K+e Zhu et al. 2006
  • 5. Coefficient of Kinship Coefficient of kinship is used to measure relatedness DEFINITION: Coefficient of kinship is the probability that the alleles of a particular locus chosen randomly from two individuals are identical by descent (Lange, 2002) * Identical by descent: the identical alleles from two individuals arise from the same allele in an earlier generation
  • 6. Kinship Matrix Calculation The kinship coefficient is the probability that an allele (a) taken randomly from population i (at a given locus) will be identical by descent to an allele taken randomly from population j at the same locus Kf= ∑k∑a (fai * faj)/D Where ∑k∑a (fai * faj)/D is the sum of all loci and all alleles; fai is the frequency of allele a in population i, faj is the frequency of allele a in population j, D is the number of loci. Cavalli-Sforza and Bodmer, 1971. MSA manual page 23
  • 7. An Example Population Locus A Locus B Locus C 611R2 A b c A b c 611R3 A B c A B c Coefficient of kinship for populations 611R2 and 611R3: Kf= ∑k∑a (fai * faj)/D = (1x1+1x0 + 0x1+1x1)/3=2/3 The frequencies of alleles A and c in populations 611R2 and 611R3 are 1; the frequency of allele b in population 611R2 is 1 and 0 while in population 611R3 is 0
  • 8. Kinship Matrix In the unified mixed model (Yu et al. 2006), marker-based kinship coefficient matrices are included A marker-based kinship coefficient matrix can be generated using Microsatellite Analyzer (MSA)
  • 9. Pipeline for Matrix Generation Genotyping with Data Entry to Create a Markers Spreadsheet Arrange Data in a Format Recognized by MSA Import the .dat Run the Kinship File into MSA Coefficient Analysis
  • 10. A Case Study Bacterial spot data provided by Dr. David Francis, The Ohio State University
  • 11. Data Preparation For genotyping, molecular markers can be: Microsatellite (Simple Sequence Repeat, SSR) SNP (Single Nucleotide Polymorphism) RFLP (Restriction Fragment Length Polymorphism) AFLP (Amplified Fragment Length Polymorphism) Other sequencing markers MSA was designed for SSR analysis For other marker types, a value is designated for each allele, e.g. For SNPs, A can be 11, C can be 12, G can be 13, and T can be 14.
  • 12. Data Entry After genotyping, a spreadsheet of genotypic data can be generated as shown below. Marker Population Genotype Score Genotype Score: 0 is homozygous for allele a, 1 is homozygous for allele b, 2 is heterozygous.
  • 13. Data Formatting Population # Give a new value to each allele Mating scheme for each marker Group name: default “1” These 3 cells should be empty. Row 1 can be left empty except for columns a, b, and c, which have to be filled. In cell A1, enter ‘1’. Row 2 is marker name. Genotype data start in row 3.
  • 14. Data Formatting Two-row entry for each individual (we are treating individuals as populations) to specify heterozygotes and homozygotes Homozygotes have the Heterozygotes have same value in both different values in each rows. row.
  • 15. After Formatting Save the file in a “Tab Delimited” text format, change the extension name “.txt” into “.dat”.
  • 16. MSA (Microsatellite Analyzer) is available for free, provided by Daniel Dieringer.
  • 17. MSA is compatible with different operation systems Choose the operating system you use and install it as other software
  • 18. Install MSA For Mac, a folder for MSA will be formed after you extract the downloaded file
  • 19. Use MSA Manual for MSA is available in the folder Sample data is available
  • 20. Use MSA Menu of Commands Enter command “i” to import file
  • 21. Use MSA Import your data file to MSA by dragging it to the MSA window
  • 22. Use MSA After the data file is imported, Enter command “d” to access submenu of “Distance setting”
  • 23. Use MSA Enter “4” to turn on kinship coefficient module.
  • 24. Use MSA Each line will be treated as a population in this study Input command “c” to turn on “Pair-wise populations distances calc”
  • 25. Use MSA NOTE: Dkf in the matrix will be 1-kf in default. After everything is set, input “!” command to run the analysis.
  • 26. Use MSA After the analysis, a folder of results will be created automatically in the same folder as your data file
  • 27. Use MSA The kinship matrix is in the folder of “Distance _data”
  • 28. Use MSA The Kinship matrix is saved as a file named “KSC_Pop.txt” Open the file with Excel to read the the matrix
  • 29. Number of individuals Kinship Matrix The population (line) code
  • 30. Number of individuals Kinship Matrix The population (line) code Please remember that the value in the matrix is “1-kinship coefficient”.
  • 31. Final Matrix Use “1-X” (X is the value of each cell in the matrix) to generate a new matrix that will be the kinship matrix in Excel.
  • 32. The Mixed Model Phenotype Marker Residual or score effect Error Y= αX+ γM+ βQ + υK + e Population Fixed Effect structure rather than markers Yu et al. 2006
  • 33. References Cited & External Link References Cited • Cavalli-Sforza, L. L., and W. F. Bodmer. 1971. The genetics of human populations, W.H. Freeman and Company, NY. • Lange, K. 2002. Mathematical and statistical methods for genetic analysis, 2nd edition. Springer-Verlag, NY. • Yu, J. G. Pressoir, W. H. Briggs, I. V. Bi, M. Yamasaki, J. F. Doebley, M. D. McMullen, B. S. Gaut, D. M. Nielsen, J. B. Holland, S. Kresovich, and E. S. Buckler. 2006. A unified mixed-model method for association mapping that accounts for multiple levels of relatedness. Nature Genetics 38: 203-208. (Available online at: http://dx.doi.org/10.1038/ng1702) (verified 8 July 2011). • Zhu, C. M. Gore, E. S. Buckler, and J. Yu. 2008. Status and prospects of association mapping in plants. The Plant Genome 1: 5-20. (Available online at: http://dx.doi.org/10.3835/plantgenome2008.02.0089) (verified 8 July 2011). External Link • Dieringer, D. Microsatellite analyzer (MSA) [Online]. Available at: http://i122server.vu-wien.ac.at/MSA/MSA_download.html (verified 8 July 2011).