Difference between revisions of "What does TopHat do?"

From GcatWiki
Jump to: navigation, search
Line 1: Line 1:
 
'''TopHat''' aligns RNA-seq reads to mammalian-sized genomes (in our case, to the mm10 mouse genome build). Because we are using paired-end datasets, forward and reverse, we need to set a mean inner distance or the average distance in basepairs between reads.
 
'''TopHat''' aligns RNA-seq reads to mammalian-sized genomes (in our case, to the mm10 mouse genome build). Because we are using paired-end datasets, forward and reverse, we need to set a mean inner distance or the average distance in basepairs between reads.
Because we are mapping mRNA with no introns to full genomes, TopHat then uses its parent program '''Bowtie''' to analyze the alignment results to identify splice junctions between exons.
+
Because we are mapping mRNA with no introns to full genomes, TopHat then uses its parent program '''Bowtie''' to analyze the alignment results to identify splice junctions between exons and correctly aligns the reads.
 +
The mapped reads from TopHat saves as a BAM/SAM dataset.

Revision as of 15:22, 6 February 2018

TopHat aligns RNA-seq reads to mammalian-sized genomes (in our case, to the mm10 mouse genome build). Because we are using paired-end datasets, forward and reverse, we need to set a mean inner distance or the average distance in basepairs between reads. Because we are mapping mRNA with no introns to full genomes, TopHat then uses its parent program Bowtie to analyze the alignment results to identify splice junctions between exons and correctly aligns the reads. The mapped reads from TopHat saves as a BAM/SAM dataset.