Summarize, annotate and visualize genomic intervals 
with R/BioC package genomation
	
Genomic intervals
Genomic intervals are the basis of genome annotation. Intervals can contain only location information (e.g. TFBS locations) or a variety of scores on different scales, such as percent methylation, ChIP-seq enrichment or read coverage.

Windows/regions of interest over the genome
Summarizing genomic intervals over different sub-genomic contexts or windows of interest is a necessary first step in making inferences about the biological importance of the data. The windows can be of equal length (e.g. pre-defined regions around TSSs) or not (e.g. exons, transcripts or CpG islands).

Extract genomic intervals for windows of interest
Genomic intervals can be extracted for windows of interest and stored in matrix format. If the windows are not of equal length, binning strategies can be used to create an equal number of bins over those windows, so that a matrix can still be used to store the information.

Visualize the summary of genomic intervals
The matrices described above can be visualized using heatmaps or meta-region plots. Meta-region plots show the average value of the signal from genomic intervals over the windows of interest; this can be a line plot or a heatmap where colors indicate average values. Another type of heatmap shows the full matrix of signal values for genomic intervals over the windows. In these heatmaps, every row represents a window and every column is a base position or a bin.

Read genomic intervals into R
Genomic intervals of any kind can be read into R using the functions below, all of which return GRanges or GRangesList objects:
readGeneric(file, ...) can read generic text files with genomic interval information.
readBed(file, ...) can read BED files.
readTranscriptFeatures(file, ...) can read BED12 files with exon/intron structure.
gffToGRanges can read a GFF file.
readBroadPeak, readNarrowPeak and readFeatureFlank are other convenience functions to read BED-like files.
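
As a minimal sketch of the reading functions above, the snippet below loads one BED12 file twice: once as plain intervals and once split into transcript features. It assumes genomation is installed and that the example file chr21.refseq.hg19.bed ships in the package's extdata directory, as the package examples suggest; substitute the path to your own file otherwise.

```r
library(genomation)

# Assumption: this example BED12 file is bundled with genomation.
# Replace it with the path to your own file if it is not found.
bed12 <- system.file("extdata", "chr21.refseq.hg19.bed", package = "genomation")
stopifnot(nzchar(bed12))

# Plain intervals: one GRanges entry per BED line.
refseq <- readBed(bed12)

# Transcript structure: a GRangesList with one element per feature type.
feats <- readTranscriptFeatures(bed12)

class(refseq)
names(feats)
```

The same pattern should apply to the other readers; only the expected file format changes.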

Extract genomic intervals for windows of interest in R
Genomic intervals can be extracted for windows of interest using the ScoreMatrix(target, windows) and ScoreMatrixBin(target, windows) functions. These functions can handle BAM files, BigWig files and GRanges objects as inputs.

patternMatrix() returns the relative locations or scores of k-mers or motifs, which is useful for analyzing ChIP-seq data.

All of these return ScoreMatrix or ScoreMatrixList objects. These objects can be manipulated further using the orderBy, binMatrix, scaleScoreMatrix, scaleScoreMatrixList, intersectScoreMatrixList and [] functions.
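
The two extraction functions can be sketched as follows, using the example datasets cage, promoters and cpgi that are assumed to ship with genomation. The weight.col, strand.aware, bin.num and bin.op arguments are taken from the package help pages as I recall them, so check ?ScoreMatrix and ?ScoreMatrixBin for your installed version.

```r
library(genomation)

data(cage)       # CAGE tag clusters, with a "tpm" score column
data(promoters)  # promoter windows of equal width
data(cpgi)       # CpG islands, windows of unequal width

# Equal-width windows: one column per base position.
sm <- ScoreMatrix(target = cage, windows = promoters,
                  strand.aware = TRUE, weight.col = "tpm")

# Unequal-width windows: each window is summarized into 10 bins.
smb <- ScoreMatrixBin(target = cage, windows = cpgi,
                      bin.num = 10, bin.op = "mean", weight.col = "tpm")

dim(sm)
dim(smb)

# ScoreMatrix objects can be subset like ordinary matrices.
first_rows <- sm[1:10, ]
```

Binning is what lets windows of different widths share a single matrix: every CpG island contributes exactly bin.num columns regardless of its length.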

Visualize the summary of genomic intervals in R
heatMatrix(scoreMatrixObj) and multiHeatMatrix(scoreMatrixListObj) make heatmaps for the windows of interest.
plotMeta() makes meta-region plots and heatMeta() makes meta-region heatmaps. These functions return values invisibly; see the respective help pages for details.
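
A short sketch of the plotting calls, again assuming the bundled cage and promoters example data are available. The plots are written to a temporary PDF (a choice made here so the snippet also runs in non-interactive sessions); drop the pdf() and dev.off() lines to draw on the active device instead.

```r
library(genomation)

data(cage)
data(promoters)

sm <- ScoreMatrix(target = cage, windows = promoters,
                  strand.aware = TRUE, weight.col = "tpm")

# Send both plots to a temporary PDF file, one plot per page.
out_pdf <- file.path(tempdir(), "genomation_plots.pdf")
pdf(out_pdf)
heatMatrix(sm)  # heatmap: one row per window, one column per base
plotMeta(sm)    # line plot: average signal across all windows
dev.off()

out_pdf
```

multiHeatMatrix() and heatMeta() follow the same pattern but expect a ScoreMatrixList rather than a single ScoreMatrix.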

[Figure: example outputs. Panels: meta-region plots (x-axis: base-pairs around anchor, y-axis: read per million, one line each for TF1 to TF4); meta-region heatmaps (rows TF1 to TF4, x-axis: base-pairs around anchor); heatmaps for genomic interval sets (one panel each for TF 1 to TF 4); pie charts for annotation (Intergenic, Intron, Exon, Promoter).]

Annotation for genomic intervals in R
Annotation summaries for target genomic intervals can be obtained with annotateWithFeatures(). The resulting object can be visualized with plotTargetAnnotation() for a pie chart and heatTargetAnnotation() for a heatmap of annotation overlap percentages.
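
The annotation step might look like the sketch below. It assumes the bundled cage dataset and the example BED12 file chr21.refseq.hg19.bed are available in the installed package, and that annotateWithFeatures() accepts a single GRanges target together with a GRangesList of features; consult ?annotateWithFeatures if your version behaves differently.

```r
library(genomation)

data(cage)  # target intervals to annotate

# Assumption: this example BED12 file is bundled with genomation.
bed12 <- system.file("extdata", "chr21.refseq.hg19.bed", package = "genomation")
stopifnot(nzchar(bed12))
gene_parts <- readTranscriptFeatures(bed12)

# intersect.chr = TRUE keeps only chromosomes present in both sets.
annot <- annotateWithFeatures(cage, gene_parts, intersect.chr = TRUE)

# Pie chart of the annotation summary, written to a temporary PDF.
out_pdf <- file.path(tempdir(), "genomation_annotation.pdf")
pdf(out_pdf)
plotTargetAnnotation(annot)
dev.off()
```

heatTargetAnnotation() is meant for comparing several target sets at once, so it would be called on the result of annotating a GRangesList of targets.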

Contributors: Altuna Akalin [aut, cre], Vedran Franke [aut, cre], Katarzyna Wreczycka [aut], Alexander Gosdschan [ctb], Liz Ing-Simmons [ctb]
Citation: Akalin A, Franke V, Vlahovicek K, Mason CE, Schubeler D. (2015). Bioinformatics. doi: 10.1093/bioinformatics/btu775
[Figure panel labels: genomic intervals with different kinds of information; extracted subset of genomic intervals for windows of interest; meta-region line plots; meta-region heatmap; heatmaps for genomic intervals; annotation pie charts; annotation heatmaps.]

Annotation for genomic intervals
Genomic intervals such as ChIP-seq peaks or differentially methylated regions need to be annotated further with known genomic annotations, such as the promoter, intron and exon structures of genes. This gives additional information for functional characterization of the genomic intervals. The result is usually represented as the percentage of genomic features overlapping with the annotation.

URLs: http://bioinformatics.mdc-berlin.de/genomation
      http://www.bioconductor.org/release/genomation

Install genomation from Bioconductor:
BiocManager::install("genomation")
OR install the development version from GitHub:
devtools::install_github("BIMSBbioinfo/genomation", build_vignettes = FALSE)

Visualize summaries and annotation for genomic intervals

Misc. functions
getRandomEnrichment() calculates the significance of association between two sets of genomic intervals.

Valid for package version >= 1.5.6

Poster/cheatsheet for R/BioC package genomation
