Potential Gene Across-Species Phylogenetic Analysis with Mr. Bayes
From GcatWiki
Contents
- 1 Potential Gene Across-Species Phylogenetic Analysis with Mr. Bayes
- 1.1 Make databases of species genome FASTA files
- 1.2 Search for desired gene from Genbank
- 1.3 Download FASTA file of nucleotides
- 1.4 BLASTn FASTA file of gene against databases of species sequences
- 1.5 Copy and paste top BLASTn hit for each species with FASTA headings into MUSCLE window
- 1.6 Copy alignments into simple NEXUS text file
- 1.7 Download MrBayes
- 1.8 Open MrBayes executable
Potential Gene Across-Species Phylogenetic Analysis with Mr. Bayes
Make databases of species genome FASTA files
w10120:Desktop jamimms$ /usr/local/ncbi/blast/bin/makeblastdb -title testdb -dbtype nucl -in bb_latest_assembly.fasta
w10120:Desktop jamimms$ /usr/local/ncbi/blast/bin/makeblastdb -title testdb -dbtype nucl -in VVgenome
w10120:Desktop jamimms$ /usr/local/ncbi/blast/bin/makeblastdb -title testdb -dbtype nucl -in rice_communisTIGR_castorWGS_release_0.1.assembly.fsa
w10120:Desktop jamimms$ /usr/local/ncbi/blast/bin/makeblastdb -title testdb -dbtype nucl -in fragaria vescachr.fna
Search for desired gene from Genbank
From species most closely related to target studied species (Vinis vinifera- Vaccinium corymbosum)
VINST1
Download FASTA file of nucleotides
BLASTn FASTA file of gene against databases of species sequences
w10120:Desktop jamimms$ /usr/local/ncbi/blast/bin/blastn -query VINST1.fasta -db bb_latest_assembly.fasta -outfmt "7 qacc sacc evalue qstart qend sstart send"
w10120:Desktop jamimms$ /usr/local/ncbi/blast/bin/blastn -query VINST1.fasta -db VVgenome -outfmt "7 qacc sacc evalue qstart qend sstart send"
w10120:Desktop jamimms$ /usr/local/ncbi/blast/bin/blastn -query VINST1.fasta -db rice_communisTIGR_castorWGS_release_0.1.assembly.fsa -outfmt "7 qacc sacc evalue qstart qend sstart send"
w10120:Desktop jamimms$ /usr/local/ncbi/blast/bin/blastn -query VINST1.fasta -db fragaria vescachr.fna -outfmt "7 qacc sacc evalue qstart qend sstart send"
Copy and paste top BLASTn hit for each species with FASTA headings into MUSCLE window
Run MUSCLE alignment
Copy alignments into simple NEXUS text file
Formatted below with suffix .nexus
ntax is the number of taxa (sequences aligned)
nchar is the number of characters of each alignment
#NEXUS begin data; dimensions ntax=4 nchar=1200; format datatype=dna interleave=nogap=-; matrix <br> Vaccinium_corybosum AGTCGCGCTAGCGCTG…ATGCTCGGTAGATCG Vinis_vinifera AGTCGCGCTCGCGCTG…ATGCTCGATAGATCG Fragaria_vesca AGTCGCGCTCGCGCTG…ATGCCCGATTGATCG Ricinus_communis AGTCGCGCTCGCGCTG…ATGCTCGATAGATCG ; end; (shortened for demonstration, named VINST1.nexus)
Download MrBayes
Unpack .sit file
Place .nexus file created above in same directory as MrBayes executable
Open MrBayes executable
Execute .nexus file in MrBayes executable window:
execute VINST1.nexus
Set evolutionary model:
lset nst=6 rates=invgamma
Set number of samples:
Mcmc ngen=10000 sample freq=10
Summarize the parameter values (burnin=25% of your samples):
Sump burnin=250
Summarize and visualize the trees:
Sumt burnin=250