Difference between revisions of "What does TopHat do?"

From GcatWiki
Jump to: navigation, search
m
 
Line 1: Line 1:
 
'''TopHat''' aligns RNA-seq reads to mammalian-sized genomes (in our case, to the mm10 mouse genome build). Because we are using paired-end datasets, forward and reverse, we need to set a mean inner distance or the average distance in basepairs between reads.
 
'''TopHat''' aligns RNA-seq reads to mammalian-sized genomes (in our case, to the mm10 mouse genome build). Because we are using paired-end datasets, forward and reverse, we need to set a mean inner distance or the average distance in basepairs between reads.
Because we are mapping mRNA with no introns to full genomes, TopHat then uses its parent program '''Bowtie''' to analyze the alignment results to identify splice junctions between exons and correctly aligns the reads.  
+
Because we are mapping mRNA with no introns to full genomes, TopHat then uses its parent program '''Bowtie''' to analyze the alignment results to identify splice junctions between exons and correctly align the reads.  
 
The mapped reads from TopHat saves as a '''BAM/SAM''' dataset.
 
The mapped reads from TopHat saves as a '''BAM/SAM''' dataset.

Latest revision as of 13:30, 6 February 2018

TopHat aligns RNA-seq reads to mammalian-sized genomes (in our case, to the mm10 mouse genome build). Because we are using paired-end datasets, forward and reverse, we need to set a mean inner distance or the average distance in basepairs between reads. Because we are mapping mRNA with no introns to full genomes, TopHat then uses its parent program Bowtie to analyze the alignment results to identify splice junctions between exons and correctly align the reads. The mapped reads from TopHat saves as a BAM/SAM dataset.