What does TopHat do?

From GcatWiki
Jump to: navigation, search

TopHat aligns RNA-seq reads to mammalian-sized genomes (in our case, to the mm10 mouse genome build). Because we are using paired-end datasets, forward and reverse, we need to set a mean inner distance or the average distance in basepairs between reads. Because we are mapping mRNA with no introns to full genomes, TopHat then uses its parent program Bowtie to analyze the alignment results to identify splice junctions between exons and correctly align the reads. The mapped reads from TopHat saves as a BAM/SAM dataset.