Difference between revisions of "Blasting Spacers for Our Genome"
Karicheson (talk | contribs) (→Blasting Spacers) |
Karicheson (talk | contribs) (→Blasting Spacers) |
||
(70 intermediate revisions by the same user not shown) | |||
Line 1: | Line 1: | ||
== Blasting Spacers == | == Blasting Spacers == | ||
+ | |||
+ | |||
+ | [[EXPLORATION OF THE IMPORTANT FINDINGS]] | ||
+ | |||
CRISPR one results from CRISPR finder are shown below. I blasted all of the spacers using the nr/nt database and the results are shown below. | CRISPR one results from CRISPR finder are shown below. I blasted all of the spacers using the nr/nt database and the results are shown below. | ||
Line 8: | Line 12: | ||
Spacer One: TGCGTCGTCCGGTGGCCGTCAATAAATGTCGCAAGGG | Spacer One: TGCGTCGTCCGGTGGCCGTCAATAAATGTCGCAAGGG | ||
− | No significant hits. . . lowest E-value is | + | No significant hits . . . lowest E-value is 5.9 |
− | + | '''Spacer Two''': TCCTACGACCTCGTCGGCGTCAACGGCTGGCCCGA | |
+ | |||
+ | [[Image:spacer2.jpg]] | ||
+ | |||
+ | Archaeal BJ1 virus complete genome: [http://www.biomedcentral.com/content/pdf/1471-2164-8-410.pdf Good Article] | ||
+ | |||
+ | [[Image:spacer2222.jpg]] | ||
+ | |||
+ | This is the protein sequence location that this match comes from in the virus: | ||
+ | |||
+ | [[Image:archaealvirus.jpg]] | ||
+ | |||
+ | Blasted this little sequence alignment section. . . did not get any significant viral alignments except its own | ||
+ | |||
+ | Did blastp to determine if hypothetical protein in archaeal BJ1 virus had hits in other viruses: | ||
+ | |||
+ | [[Image:blastp4.jpg]] | ||
+ | |||
+ | |||
+ | The rest of the hits are bacteria and are in coding protein sequences such as ABC transporter and ATP-binding protein. | ||
− | |||
− | + | Spacer Three: CACCCTACAACAGGTGAAATCTACCAGACAAAAGA | |
− | + | [[Image:spacer3.jpg]] | |
− | + | All of these hits are bacteria and the bacteria hits are part of coding proteins such as conserved hypothetical protein; putative membrane protein, which I did a blastp for and found no viral matches | |
Spacer Four: TCACCCAAGCGCAAGCAACAGCTGATCGAGGACCTG | Spacer Four: TCACCCAAGCGCAAGCAACAGCTGATCGAGGACCTG | ||
− | + | [[Image:spacer4.jpg]] | |
+ | |||
+ | Hits come from prokaryotes and is from a coding segment | ||
Spacer Five: GCGACGGCGGCCAGTTCCGCGAGGGCGGGAAGGTCC | Spacer Five: GCGACGGCGGCCAGTTCCGCGAGGGCGGGAAGGTCC | ||
− | [[Image: | + | [[Image:spacer5.jpg]] |
− | + | The following hits are the only significant hits that come from a prokaryote but are n a coding segment of DNA | |
+ | |||
+ | '''Spacer Six''': TGCGAGTGTTGCGGGGAACCGACTCGGGTAGGCCAG | ||
+ | |||
+ | [[Image:spacer6.jpg]] | ||
+ | |||
+ | Aligns with something in our genome. . . NOT another spacer but a segment of DNA that flanks two genes and is non-coding. . . | ||
− | + | Spacer Seven: ACGGTTTCGCTACCACCATCGCCACCAGCAACTGCCG | |
− | |||
− | |||
− | |||
− | Spacer | ||
− | |||
− | |||
− | + | [[Image:spacer8.jpg]] | |
− | |||
− | |||
− | + | Hit from a prokaryote that is in its coding region | |
Spacer Eight: GACGAGTATGCCAACCGGCTCGTGAGCGGGCGC | Spacer Eight: GACGAGTATGCCAACCGGCTCGTGAGCGGGCGC | ||
− | No significant hits. . . lowest E-value is | + | No significant hits . . . lowest E-value is 1.2 |
− | + | Spacer Nine: TGGTAGGCGTCGTAGGTGTTCGTGGCGAGCGTGTC | |
− | + | [[Image:spacer9.jpg]] | |
− | + | Hit from prokaryote that is in its coding region | |
Spacer Ten: ACTCACTGGATTATGACCCCTACAACGAGGGCGTCA | Spacer Ten: ACTCACTGGATTATGACCCCTACAACGAGGGCGTCA | ||
− | No significant hits. . . lowest E-value is 1. | + | No significant hits . . . lowest E-value is 1.4 |
− | Spacer Eleven: GGATCTCGATCGTTGTAGTATCCATAGCTGCTATACC | + | '''Spacer Eleven''': GGATCTCGATCGTTGTAGTATCCATAGCTGCTATACC |
− | + | ||
− | [[Image: | + | [[Image:spacer11.jpg]] |
+ | |||
+ | Hits from gene segment in prokaryote or promoter sequence because flanking two genes | ||
Spacer Twelve: GAAGTAACGCAACTCCAGTGAGCGCTACTGAGAGCCC | Spacer Twelve: GAAGTAACGCAACTCCAGTGAGCGCTACTGAGAGCCC | ||
− | No significant hits. . . lowest E-value is 1. | + | No significant hits . . . lowest E-value is 1.5 |
Spacer Thirteen:CCGATCACGCCCTGCCGATACTGGTAGTTCGCGATA | Spacer Thirteen:CCGATCACGCCCTGCCGATACTGGTAGTTCGCGATA | ||
− | [[Image: | + | [[Image:spacer132.jpg]] |
+ | |||
+ | Hit from prokaryote in gene segment | ||
− | Spacer Fourteen: TCGTCGGCCGGCTCGTCGGCCGACGTGGACTTGC | + | '''Spacer Fourteen''': TCGTCGGCCGGCTCGTCGGCCGACGTGGACTTGC |
+ | |||
+ | [[Image:spacer142.jpg]] | ||
+ | |||
+ | first hit from prokaryote gene region | ||
+ | |||
+ | 2nd hit: from [http://www.ncbi.nlm.nih.gov/sites/entrez?cmd=Retrieve&db=nucleotide&dopt=GenBank&RID=FSHC5VSU01S&log%24=nuclalign&blast_rank=3&list_uids=163716581 virus] | ||
+ | |||
+ | [[Image:spacer14more.jpg]] | ||
− | + | this virus when you blast it. . . it infects pathogens, bacteria and plant. . .maybe archaea too???? | |
Spacer Fifteen: AGTAGGTCTAATGTCTCTCTGTCGTCTATCAGCCCCG | Spacer Fifteen: AGTAGGTCTAATGTCTCTCTGTCGTCTATCAGCCCCG | ||
− | No significant hits. . . lowest E-value is | + | No significant hits . . . lowest E-value is 23 |
Spacer Sixteen: GCTCTCCGGTGTCACAGGTCAGGTCACGGTCTCCGC | Spacer Sixteen: GCTCTCCGGTGTCACAGGTCAGGTCACGGTCTCCGC | ||
− | No significant hits . . . lowest E-value is | + | No significant hits . . . lowest E-value is 5.6 |
Spacer Seventeen: ACGGACAAGTCATCCACCCGCCAGTATCTCCCGGT | Spacer Seventeen: ACGGACAAGTCATCCACCCGCCAGTATCTCCCGGT | ||
− | + | [[Image:spacer17.jpg]] | |
+ | |||
+ | Bad hit from prokaryote coding sequence | ||
Spacer Eighteen: AAATACGATCCTGCGGTGACGCTACGTCCGGGGCAGC | Spacer Eighteen: AAATACGATCCTGCGGTGACGCTACGTCCGGGGCAGC | ||
− | No significant hits . . . lowest E-value is | + | No significant hits . . . lowest E-value is 1.5 |
Spacer Nineteen: CAGATGTGGGGTCTGTGGCCACAGTCTAACATCTCT | Spacer Nineteen: CAGATGTGGGGTCTGTGGCCACAGTCTAACATCTCT | ||
− | + | No significant hits . . . lowest E-value is 1.4 | |
Spacer Twenty: CCTGATAACGGACTCTTGTAGGTCCGTTAGGTCGT | Spacer Twenty: CCTGATAACGGACTCTTGTAGGTCCGTTAGGTCGT | ||
− | No significant hits . . . lowest E-value is | + | No significant hits . . . lowest E-value is 5.2 |
Spacer Twenty-One: AAAAATGAGTGACGTAGACATTCGGCAAAATGCCGG | Spacer Twenty-One: AAAAATGAGTGACGTAGACATTCGGCAAAATGCCGG | ||
− | No significant hits . . . lowest E-value is | + | No significant hits . . . lowest E-value is 1.4 |
Spacer Twenty-Two: CAGCAGCGAAACGAGCCGTCCGTCCTTTTGAGACA | Spacer Twenty-Two: CAGCAGCGAAACGAGCCGTCCGTCCTTTTGAGACA | ||
− | + | [[Image:spacer22.jpg]] | |
+ | |||
+ | Hit from prokaryote in gene segment | ||
Spacer Twenty-Three: GTCTAGCCCAGTCTGGTCGGGGTGGTCGGCAGGATCGG | Spacer Twenty-Three: GTCTAGCCCAGTCTGGTCGGGGTGGTCGGCAGGATCGG | ||
− | + | [[Image:spacer23.jpg]] | |
+ | |||
+ | Bad -values= throw out | ||
Spacer Twenty-Four: CACTCCTCATATGTCTGTTCGAGCAGCGGGACGTG | Spacer Twenty-Four: CACTCCTCATATGTCTGTTCGAGCAGCGGGACGTG | ||
− | + | [[Image:spacer24.jpg]] | |
− | + | Hit from prokaryote in gene segment | |
− | [[Image: | + | '''Spacer Twenty-Five''': ACCGTTGCCGCCGATCGGCAGCGAGCCGGTGATGTGT |
+ | |||
+ | [[Image:spacer252.jpg]] | ||
+ | |||
+ | Hits from prokaryotes in flanking sequence or in gene sequence. . . check to see if this is in spacer!! | ||
+ | |||
+ | [[Image:spacer26more.jpg]] | ||
Spacer Twenty-Six: TCAAAGCGAGCCTCGAACGCGACGACGAAGATATG | Spacer Twenty-Six: TCAAAGCGAGCCTCGAACGCGACGACGAAGATATG | ||
− | [[Image: | + | [[Image:spacer262.jpg]] |
+ | |||
+ | Hits from prokaryotes in gene segment | ||
− | Spacer Twenty | + | Spacer Twenty-Seven: TCCTCCTTGTACCCACGGTCTTGCCGATCCATCCCG |
− | No significant hits . . . lowest E-value is | + | No significant hits . . . lowest E-value is 1.4 |
Spacer Twenty-Eight: GACTGGCGTGTTGCCGTTCAGGCCGGCGTTGATCCCG | Spacer Twenty-Eight: GACTGGCGTGTTGCCGTTCAGGCCGGCGTTGATCCCG | ||
− | [[Image: | + | [[Image:spacer28.jpg]] |
+ | |||
+ | Hits from prokaryotes in gene segment | ||
Spacer Twenty-Nine: CTCAGCAGCAGTCAACGGCATTTTATACACCTTGT | Spacer Twenty-Nine: CTCAGCAGCAGTCAACGGCATTTTATACACCTTGT | ||
− | No significant hits . . . lowest E-value is | + | No significant hits . . . lowest E-value is 1.3 |
Spacer Thirty: CACCCCTTCCGGGGAGACGAGGAAACCCCGGACGA | Spacer Thirty: CACCCCTTCCGGGGAGACGAGGAAACCCCGGACGA | ||
− | + | [[Image:spacer30.jpg]] | |
+ | |||
+ | Bad e-value and top hit is a bunch of gene from bacteria | ||
Spacer Thirty-One: GTCACGCTGTCTGACGATATGGCTGACCAGGTGC | Spacer Thirty-One: GTCACGCTGTCTGACGATATGGCTGACCAGGTGC | ||
− | + | [[Image:spacer31.jpg]] | |
+ | |||
+ | Hit from prokaryotes in gene segment | ||
Spacer Thirty-Two: CACTCCTGGGCGGCCTCATCGGCGGCCATCGTC | Spacer Thirty-Two: CACTCCTGGGCGGCCTCATCGGCGGCCATCGTC | ||
− | [[Image: | + | [[Image:spacer322.jpg]] |
− | + | Hit from prokaryotes in gene segment | |
− | No significant hits . . . lowest E-value is | + | Spacer Thirty-Three: GTTGTGTGAGGTATGCGATGGACACCACCGATCACG |
+ | |||
+ | No significant hits . . . lowest E-value is 1.4 | ||
Line 160: | Line 216: | ||
Spacer One: TCCGAGACGTGTTCCCTCTCTAGCTGTGCATCTTCC | Spacer One: TCCGAGACGTGTTCCCTCTCTAGCTGTGCATCTTCC | ||
− | + | Hits from prokaryote in gene segment | |
Spacer Two: CAGATCTAAAACAATGTCATACGGAAAAATCGACATC | Spacer Two: CAGATCTAAAACAATGTCATACGGAAAAATCGACATC | ||
Line 169: | Line 225: | ||
[[Image:crispr2spacer3.jpg]] | [[Image:crispr2spacer3.jpg]] | ||
+ | |||
+ | E-values not good and prokaryotes | ||
Spacer Four: TCGACGAGATCGGCGCGAACTCGTTCGCTGATACT | Spacer Four: TCGACGAGATCGGCGCGAACTCGTTCGCTGATACT | ||
Line 174: | Line 232: | ||
[[Image:crispr2spacer4.jpg]] | [[Image:crispr2spacer4.jpg]] | ||
− | + | Hits from prokaryote in gene segment | |
− | [[Image:crispr2spacer5.jpg]] | + | '''Spacer Five''': TCGGGGACCGAGACGACGGGGCCGGGTGCTGTCT |
+ | |||
+ | [[Image:crispr2spacer5.jpg]] | ||
+ | |||
+ | Hit from a [http://www.ncbi.nlm.nih.gov/sites/entrez?cmd=Retrieve&db=nucleotide&dopt=GenBank&RID=FW14AEYZ01S&log%24=nucltop&blast_rank=3&list_uids=255927524 virus] (same virus even though | ||
+ | called different names): | ||
+ | |||
+ | [[Image:crispr2spacer5more.jpg]] | ||
Spacer Six: CCGGAGGGGCCGCTGCGTGGGTGATCTGGAGAGAAGA | Spacer Six: CCGGAGGGGCCGCTGCGTGGGTGATCTGGAGAGAAGA | ||
[[Image:crispr2spacer6.jpg]] | [[Image:crispr2spacer6.jpg]] | ||
+ | |||
+ | E-values not good and from prokaryotes | ||
Spacer Seven: GTTGCGTGAGCTAGCGAAACACCGAGTCCGTGTGAT | Spacer Seven: GTTGCGTGAGCTAGCGAAACACCGAGTCCGTGTGAT | ||
Line 189: | Line 256: | ||
[[Image:crispr2spacer8.jpg]] | [[Image:crispr2spacer8.jpg]] | ||
+ | |||
+ | Hit from prokaryote in gene segment | ||
Spacer Nine: CGGACATTCAGAAGCGCCTGACTAACCGCATGGCT | Spacer Nine: CGGACATTCAGAAGCGCCTGACTAACCGCATGGCT | ||
− | Spacer Ten: CGGGAAGACCACGACCGCCCGCGCCCTCCAGTTCGA | + | No significant hits . . . lowest E-value is 5.2 |
+ | |||
+ | '''Spacer Ten''': CGGGAAGACCACGACCGCCCGCGCCCTCCAGTTCGA | ||
+ | |||
+ | [[Image:crispr2spacer10.jpg]] | ||
+ | |||
+ | First hit from bacteria in gene segment. . . | ||
+ | |||
+ | Second hit is from another [http://www.ncbi.nlm.nih.gov/sites/entrez?cmd=Retrieve&db=nucleotide&dopt=GenBank&RID=FW1UZWFK01S&log%24=nuclalign&blast_rank=3&list_uids=125860746 archaea] in a conserved hypothetical protein. . . is this a true protein. . .?? | ||
+ | |||
+ | [[Image:crispr2spacer10more.jpg]] | ||
Spacer Eleven: GGCTTCTACGTCGGCAACCGGACCGAGGACGGCGATG | Spacer Eleven: GGCTTCTACGTCGGCAACCGGACCGAGGACGGCGATG | ||
+ | |||
+ | [[Image:crispr2spacer11.jpg]] | ||
+ | |||
+ | Hit with significant e-value is from a bacteria in a gene segment | ||
Spacer Twelve: AAGTACGCCTCGATCATCAACGGCGTCCGGGCTGT | Spacer Twelve: AAGTACGCCTCGATCATCAACGGCGTCCGGGCTGT | ||
+ | |||
+ | [[Image:crispr2spacer12.jpg]] | ||
+ | |||
+ | E-values no good and from a prokaryote in a gene | ||
Spacer Thirteen: GATGCTGTTGAGCTCGTAGCGATCCCACTCGGCGT | Spacer Thirteen: GATGCTGTTGAGCTCGTAGCGATCCCACTCGGCGT | ||
+ | |||
+ | No significant hits . . . lowest E-value is 1.3 | ||
Spacer Fourteen: AGCCCTTGTGCAATGATCGGGAGTGCAATCCGACC | Spacer Fourteen: AGCCCTTGTGCAATGATCGGGAGTGCAATCCGACC | ||
+ | |||
+ | No significant hits . . . lowest E-value is 5.2 | ||
Spacer Fifteen: TCAGGCGAGTTGTCGGACGAACAGCTTGAAGCGTGT | Spacer Fifteen: TCAGGCGAGTTGTCGGACGAACAGCTTGAAGCGTGT | ||
+ | |||
+ | No significant hits . . . lowest E-value is 1.4 | ||
Spacer Sixteen: TCGTCGAGCGGCAGGCCGCCGACCGCTACGGGCTG | Spacer Sixteen: TCGTCGAGCGGCAGGCCGCCGACCGCTACGGGCTG | ||
− | Spacer Seventeen: | + | [[Image:crispr2spacer16.jpg]] |
+ | |||
+ | Hits from prokaryotes in gene segment | ||
+ | |||
+ | Spacer Seventeen: CGACACGGTCGAAGATGCTGGCAGCTCGCGAATCGC | ||
+ | |||
+ | [[Image:crispr2spacer17.jpg]] | ||
+ | |||
+ | Bad e-value | ||
+ | |||
+ | Spacer Eighteen: GGATCTCGACGGCCGCGTGGCCGTGCATCTCGGGGTC | ||
+ | |||
+ | [[Image:crispr2spacer18.jpg]] | ||
+ | |||
+ | Hit from prokaryote in gene segment | ||
+ | |||
+ | Spacer Nineteen: AACGCTTCAACGCGCTCTATTGACCGAGCGTATCG | ||
+ | |||
+ | No significant hits . . . lowest E-value is 1.3 | ||
+ | |||
+ | Spacer Twenty: AGATCTCGCGGATAAGCTGCCCCCGCCCTCCCATGAG | ||
+ | |||
+ | No significant hits . . . lowest E-value is 5.9 | ||
+ | |||
+ | Spacer Twenty-One: CACTTCGAGCGGACTTTGGGCCACCCCGGAAAGTCAG | ||
+ | |||
+ | No significant hits . . . lowest E-value is 1.5 | ||
+ | |||
+ | Spacer Twenty-Two: CGATACGTCCGGGACGCCCGTGACGACCACCACTGC | ||
+ | |||
+ | [[Image:crispr2spacer21.jpg]] | ||
− | + | Bad e-value | |
− | Spacer | + | Spacer Twenty-Three: GGAGGAGCGGATGGACATGAGCGACACGACGATCCG |
− | + | No significant hits . . . lowest E-value is 1.4 | |
− | Spacer Twenty- | + | Spacer Twenty-Four: GAGCATCTCTCCAATCAGCGGTCATCAACCGCGA |
− | + | [[Image:crispr2spacer24.jpg]] | |
− | + | Hits from prokaryotes in gene segment | |
− | Spacer Twenty- | + | Spacer Twenty-Five: AGTATCTGTCTACGCGATACCGCACCGTCAGAGTG |
− | + | No significant hits . . . lowest E-value is 1.3 | |
− | Spacer Twenty-Six: | + | Spacer Twenty-Six: TACGACCTTCCCACTGAGGGTCTTGAGCTAACGATT |
− | + | No significant hits . . . lowest E-value is 5.6 | |
− | Spacer Twenty- | + | Spacer Twenty-Seven: GCGACGTGCTTACCTCTGACGCCAATATTGACCTT |
− | + | No significant hits . . . lowest E-value is 5.2 | |
− | Spacer | + | Spacer Twenty-Eight: TTCATCAAGAGGCACACAAGCATGGTGCGTCCAAA |
− | + | No significant hits . . . lowest E-value is 5.2 | |
− | Spacer | + | '''Spacer Twenty-Nine:''' CTGGCCTACCCGAGTCGGTTCCCCGCAGCACTCGCA |
+ | |||
+ | [[Image:crispr2spacer29.jpg]] | ||
− | Spacer Thirty | + | Spacer Thirty: ACGATCGTCACCGACACCCTCGGGGCCGGCGGCGC |
− | + | [[Image:crispr2spacer30.jpg]] | |
− | + | Hits from prokaryotes in gene segment | |
− | Spacer Thirty- | + | Spacer Thirty-One: GAAACGACCGAGACGCAAAGCGAGTTCACACAACTC |
− | + | No significant hits . . . lowest E-value is 5.6 | |
− | Spacer Thirty- | + | Spacer Thirty-Two: TTAGATGATCAGGTAGCCTGCTACCAGTGCAGCTGC |
− | + | [[Image:crispr2spacer32.jpg]] | |
− | + | Poor e-values | |
− | Spacer | + | Spacer Thirty-Three: GAGACCCAGCTTTGCCTTCCAGGTGATCAGCTCGTA |
− | + | [[Image:crispr2spacer33.jpg]] | |
− | + | Hits from prokaryotes in gene segment | |
− | Spacer | + | '''Spacer Thirty-Four:''' TCGAAGCGCTCGGTCGCGACGGAGACCAGCGACCAGCTG |
+ | |||
+ | [[Image:crispr2spacer34.jpg]] | ||
+ | |||
+ | weird because two hits from our species and these 2 hits are in a conserved protein sequence and not the spacer?????????? | ||
+ | |||
+ | [[Image:crispr2spacer34more.jpg]] | ||
+ | Spacer Thirty-Five: TGACGACCACACACACGAGGCCGTGCGTGTGCTTGTA | ||
+ | [[Image:crispr2spacer35.jpg]] | ||
+ | Hits from prokaryotes in gene segment | ||
+ | Spacer Thirty-Six: CCACGTCCCGGTGACACGCAGCTCGGTGAGATCGC | ||
+ | |||
+ | [[Image:crispr2spacer36.jpg]] | ||
+ | |||
+ | Hits from prokaryotes in gene segment | ||
+ | |||
+ | Spacer Thirty-Seven: CATGGAGTCTTCAACATTTCATGGGCTGGGCTTGGCC | ||
+ | |||
+ | No significant hits . . . lowest E-value is 5.9 | ||
+ | |||
+ | Spacer Thirty-Eight: CGCAACCCGACGATCGAGGACGGGCCGTCCCTGGA | ||
+ | |||
+ | [[Image:crispr2spacer38.jpg]] | ||
+ | |||
+ | Bad e-values | ||
+ | |||
+ | Spacer Thirty-Nine: GCGTCGGACTGCGTCGATAGTGTTCGTGCTCATGTT | ||
+ | |||
+ | No significant hits . . . lowest E-value is 1.4 | ||
+ | |||
+ | Spacer Forty: CAGACTTCTACTGGAAGGCGAAAACTGAGAAGGCA | ||
+ | |||
+ | No significant hits . . . lowest E-value is 5.2 | ||
+ | |||
+ | Spacer Forty-One: TACGCTCGACGACCTCCGTCGTGCGCTCCAGAAGTCA | ||
+ | |||
+ | [[Image:crispr2spacer41.jpg]] | ||
+ | |||
+ | hit from prokaryote in gene segment | ||
+ | |||
+ | Spacer Forty-Two: GCGCTGCGGACGTGGTGTCAGAGGGGTTACCAGTAACT | ||
+ | |||
+ | No significant hits . . . lowest E-value is 1.6 | ||
+ | |||
+ | Spacer Forty-Three: ACTCCGGGTACACTGGTGGCGATGCTCTACTCGCC | ||
+ | |||
+ | No significant hits . . . lowest E-value is 1.3 | ||
+ | |||
+ | Spacer Forty-Four: ACACGTTTCTTTTTTTCAGGAGCCATCACTCACTC | ||
+ | |||
+ | No significant hits . . . lowest E-value is 1.3 | ||
Line 282: | Line 457: | ||
look to see if these blast results show hits to any other halophiles | look to see if these blast results show hits to any other halophiles | ||
look at spacer length and number between our species and others?? | look at spacer length and number between our species and others?? | ||
+ | any known viruses that infect halophiles?? | ||
+ | |||
+ | blast the known viral genome against the nr/nt database and it comes up with matches: |
Latest revision as of 01:56, 14 November 2009
Blasting Spacers
EXPLORATION OF THE IMPORTANT FINDINGS
CRISPR One results from CRISPRFinder are shown below. I blasted all of the spacers against the nr/nt database, and the result for each spacer is listed under its sequence.
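A minimal sketch of one way to prepare the spacers for a batch search: write them out as FASTA so the whole set can be submitted to BLAST at once. The function name `spacers_to_fasta` and the `CRISPR1_spacer_N` record names are made up for illustration and are not CRISPRFinder output; the two sequences are the first two CRISPR One spacers from this page.

```python
def spacers_to_fasta(spacers, prefix="CRISPR1_spacer"):
    """Return a FASTA-formatted string, one record per spacer.

    `spacers` maps a spacer number to its nucleotide sequence.
    The record names are hypothetical labels chosen for this sketch.
    """
    records = []
    for number in sorted(spacers):
        sequence = spacers[number].strip().upper()
        records.append(f">{prefix}_{number}\n{sequence}")
    return "\n".join(records) + "\n"


# First two CRISPR One spacers from this page.
crispr_one = {
    1: "TGCGTCGTCCGGTGGCCGTCAATAAATGTCGCAAGGG",
    2: "TCCTACGACCTCGTCGGCGTCAACGGCTGGCCCGA",
}

fasta_text = spacers_to_fasta(crispr_one)
print(fasta_text)
```

The resulting text can be pasted into a BLAST query box or saved to a file; the search itself (database, organism filters, scoring) is still set on the BLAST side.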
Spacer One: TGCGTCGTCCGGTGGCCGTCAATAAATGTCGCAAGGG
No significant hits . . . lowest E-value is 5.9
Spacer Two: TCCTACGACCTCGTCGGCGTCAACGGCTGGCCCGA
Archaeal BJ1 virus complete genome: Good Article
This is the protein sequence location that this match comes from in the virus:
Blasted this short aligned section . . . did not get any significant viral alignments except to the virus itself
Did blastp to determine whether the hypothetical protein in archaeal BJ1 virus had hits in other viruses:
The rest of the hits are from bacteria and fall in protein-coding sequences such as an ABC transporter and an ATP-binding protein.
Spacer Three: CACCCTACAACAGGTGAAATCTACCAGACAAAAGA
All of these hits are from bacteria and fall in protein-coding regions such as "conserved hypothetical protein; putative membrane protein", which I ran through blastp and found no viral matches
Spacer Four: TCACCCAAGCGCAAGCAACAGCTGATCGAGGACCTG
Hits come from prokaryotes and are from a coding segment
Spacer Five: GCGACGGCGGCCAGTTCCGCGAGGGCGGGAAGGTCC
The following hits are the only significant hits that come from a prokaryote, but they are in a coding segment of DNA
Spacer Six: TGCGAGTGTTGCGGGGAACCGACTCGGGTAGGCCAG
Aligns with something in our genome. . . NOT another spacer but a segment of DNA that flanks two genes and is non-coding. . .
Spacer Seven: ACGGTTTCGCTACCACCATCGCCACCAGCAACTGCCG
Hit from a prokaryote that is in its coding region
Spacer Eight: GACGAGTATGCCAACCGGCTCGTGAGCGGGCGC
No significant hits . . . lowest E-value is 1.2
Spacer Nine: TGGTAGGCGTCGTAGGTGTTCGTGGCGAGCGTGTC
Hit from prokaryote that is in its coding region
Spacer Ten: ACTCACTGGATTATGACCCCTACAACGAGGGCGTCA
No significant hits . . . lowest E-value is 1.4
Spacer Eleven: GGATCTCGATCGTTGTAGTATCCATAGCTGCTATACC
Hits are from a gene segment in a prokaryote, or possibly a promoter sequence because the region flanks two genes
Spacer Twelve: GAAGTAACGCAACTCCAGTGAGCGCTACTGAGAGCCC
No significant hits . . . lowest E-value is 1.5
Spacer Thirteen: CCGATCACGCCCTGCCGATACTGGTAGTTCGCGATA
Hit from prokaryote in gene segment
Spacer Fourteen: TCGTCGGCCGGCTCGTCGGCCGACGTGGACTTGC
First hit is from a prokaryote gene region
Second hit is from a virus
When you blast this virus, it appears to infect pathogens, bacteria and plants . . . maybe archaea too?
Spacer Fifteen: AGTAGGTCTAATGTCTCTCTGTCGTCTATCAGCCCCG
No significant hits . . . lowest E-value is 23
Spacer Sixteen: GCTCTCCGGTGTCACAGGTCAGGTCACGGTCTCCGC
No significant hits . . . lowest E-value is 5.6
Spacer Seventeen: ACGGACAAGTCATCCACCCGCCAGTATCTCCCGGT
Bad hit from prokaryote coding sequence
Spacer Eighteen: AAATACGATCCTGCGGTGACGCTACGTCCGGGGCAGC
No significant hits . . . lowest E-value is 1.5
Spacer Nineteen: CAGATGTGGGGTCTGTGGCCACAGTCTAACATCTCT
No significant hits . . . lowest E-value is 1.4
Spacer Twenty: CCTGATAACGGACTCTTGTAGGTCCGTTAGGTCGT
No significant hits . . . lowest E-value is 5.2
Spacer Twenty-One: AAAAATGAGTGACGTAGACATTCGGCAAAATGCCGG
No significant hits . . . lowest E-value is 1.4
Spacer Twenty-Two: CAGCAGCGAAACGAGCCGTCCGTCCTTTTGAGACA
Hit from prokaryote in gene segment
Spacer Twenty-Three: GTCTAGCCCAGTCTGGTCGGGGTGGTCGGCAGGATCGG
Bad E-values = throw out
Spacer Twenty-Four: CACTCCTCATATGTCTGTTCGAGCAGCGGGACGTG
Hit from prokaryote in gene segment
Spacer Twenty-Five: ACCGTTGCCGCCGATCGGCAGCGAGCCGGTGATGTGT
Hits from prokaryotes in flanking sequence or in gene sequence. . . check to see if this is in spacer!!
Spacer Twenty-Six: TCAAAGCGAGCCTCGAACGCGACGACGAAGATATG
Hits from prokaryotes in gene segment
Spacer Twenty-Seven: TCCTCCTTGTACCCACGGTCTTGCCGATCCATCCCG
No significant hits . . . lowest E-value is 1.4
Spacer Twenty-Eight: GACTGGCGTGTTGCCGTTCAGGCCGGCGTTGATCCCG
Hits from prokaryotes in gene segment
Spacer Twenty-Nine: CTCAGCAGCAGTCAACGGCATTTTATACACCTTGT
No significant hits . . . lowest E-value is 1.3
Spacer Thirty: CACCCCTTCCGGGGAGACGAGGAAACCCCGGACGA
Bad E-value, and the top hit is a bunch of genes from bacteria
Spacer Thirty-One: GTCACGCTGTCTGACGATATGGCTGACCAGGTGC
Hit from prokaryotes in gene segment
Spacer Thirty-Two: CACTCCTGGGCGGCCTCATCGGCGGCCATCGTC
Hit from prokaryotes in gene segment
Spacer Thirty-Three: GTTGTGTGAGGTATGCGATGGACACCACCGATCACG
No significant hits . . . lowest E-value is 1.4
CRISPR Two Spacers: CHANGED the match/mismatch parameters to 1,-1 to make the search less stringent . . . also excluded eukaryotes NOW
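To illustrate what the match/mismatch setting changes, here is a small sketch that scores an ungapped alignment under two reward/penalty pairs. Only the 1,-1 setting comes from this page; the 1,-2 pair used for comparison is an assumption (the page does not record the earlier setting), and the two toy sequences and the function name `ungapped_score` are invented.

```python
def ungapped_score(query, subject, match=1, mismatch=-1):
    """Raw score of two equal-length sequences aligned without gaps."""
    if len(query) != len(subject):
        raise ValueError("sequences must have the same length")
    score = 0
    for q, s in zip(query.upper(), subject.upper()):
        score += match if q == s else mismatch
    return score


# Invented 12-nt example with 3 mismatches (positions 3, 7 and 11).
query = "ACGTACGTACGT"
subject = "ACCTACCTACTT"

relaxed = ungapped_score(query, subject, match=1, mismatch=-1)   # setting used on this page
stricter = ungapped_score(query, subject, match=1, mismatch=-2)  # assumed comparison setting
print(relaxed, stricter)
```

With a milder mismatch penalty, an imperfect alignment keeps more of its score, so hits that diverge from the spacer are less likely to be cut off.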
Spacer One: TCCGAGACGTGTTCCCTCTCTAGCTGTGCATCTTCC
Hits from prokaryote in gene segment
Spacer Two: CAGATCTAAAACAATGTCATACGGAAAAATCGACATC
No significant hits . . . lowest E-value is 1.5
Spacer Three: GATCCGGAATATGAAGTGACGAACGATCCGGATACGG
E-values not good, and hits are from prokaryotes
Spacer Four: TCGACGAGATCGGCGCGAACTCGTTCGCTGATACT
Hits from prokaryote in gene segment
Spacer Five: TCGGGGACCGAGACGACGGGGCCGGGTGCTGTCT
Hit from a virus (the same virus even though it is listed under different names):
Spacer Six: CCGGAGGGGCCGCTGCGTGGGTGATCTGGAGAGAAGA
E-values not good and from prokaryotes
Spacer Seven: GTTGCGTGAGCTAGCGAAACACCGAGTCCGTGTGAT
No significant hits . . . lowest E-value is 5.5
Spacer Eight: ACGGAAATCCAGCCGATCACCCTCCGAGAGGAGAGG
Hit from prokaryote in gene segment
Spacer Nine: CGGACATTCAGAAGCGCCTGACTAACCGCATGGCT
No significant hits . . . lowest E-value is 5.2
Spacer Ten: CGGGAAGACCACGACCGCCCGCGCCCTCCAGTTCGA
First hit is from bacteria in a gene segment . . .
Second hit is from another archaeon in a conserved hypothetical protein . . . is this a true protein?
Spacer Eleven: GGCTTCTACGTCGGCAACCGGACCGAGGACGGCGATG
Hit with significant E-value is from a bacterium in a gene segment
Spacer Twelve: AAGTACGCCTCGATCATCAACGGCGTCCGGGCTGT
E-values not good, and the hit is from a prokaryote in a gene
Spacer Thirteen: GATGCTGTTGAGCTCGTAGCGATCCCACTCGGCGT
No significant hits . . . lowest E-value is 1.3
Spacer Fourteen: AGCCCTTGTGCAATGATCGGGAGTGCAATCCGACC
No significant hits . . . lowest E-value is 5.2
Spacer Fifteen: TCAGGCGAGTTGTCGGACGAACAGCTTGAAGCGTGT
No significant hits . . . lowest E-value is 1.4
Spacer Sixteen: TCGTCGAGCGGCAGGCCGCCGACCGCTACGGGCTG
Hits from prokaryotes in gene segment
Spacer Seventeen: CGACACGGTCGAAGATGCTGGCAGCTCGCGAATCGC
Bad e-value
Spacer Eighteen: GGATCTCGACGGCCGCGTGGCCGTGCATCTCGGGGTC
Hit from prokaryote in gene segment
Spacer Nineteen: AACGCTTCAACGCGCTCTATTGACCGAGCGTATCG
No significant hits . . . lowest E-value is 1.3
Spacer Twenty: AGATCTCGCGGATAAGCTGCCCCCGCCCTCCCATGAG
No significant hits . . . lowest E-value is 5.9
Spacer Twenty-One: CACTTCGAGCGGACTTTGGGCCACCCCGGAAAGTCAG
No significant hits . . . lowest E-value is 1.5
Spacer Twenty-Two: CGATACGTCCGGGACGCCCGTGACGACCACCACTGC
Bad e-value
Spacer Twenty-Three: GGAGGAGCGGATGGACATGAGCGACACGACGATCCG
No significant hits . . . lowest E-value is 1.4
Spacer Twenty-Four: GAGCATCTCTCCAATCAGCGGTCATCAACCGCGA
Hits from prokaryotes in gene segment
Spacer Twenty-Five: AGTATCTGTCTACGCGATACCGCACCGTCAGAGTG
No significant hits . . . lowest E-value is 1.3
Spacer Twenty-Six: TACGACCTTCCCACTGAGGGTCTTGAGCTAACGATT
No significant hits . . . lowest E-value is 5.6
Spacer Twenty-Seven: GCGACGTGCTTACCTCTGACGCCAATATTGACCTT
No significant hits . . . lowest E-value is 5.2
Spacer Twenty-Eight: TTCATCAAGAGGCACACAAGCATGGTGCGTCCAAA
No significant hits . . . lowest E-value is 5.2
Spacer Twenty-Nine: CTGGCCTACCCGAGTCGGTTCCCCGCAGCACTCGCA
Spacer Thirty: ACGATCGTCACCGACACCCTCGGGGCCGGCGGCGC
Hits from prokaryotes in gene segment
Spacer Thirty-One: GAAACGACCGAGACGCAAAGCGAGTTCACACAACTC
No significant hits . . . lowest E-value is 5.6
Spacer Thirty-Two: TTAGATGATCAGGTAGCCTGCTACCAGTGCAGCTGC
Poor e-values
Spacer Thirty-Three: GAGACCCAGCTTTGCCTTCCAGGTGATCAGCTCGTA
Hits from prokaryotes in gene segment
Spacer Thirty-Four: TCGAAGCGCTCGGTCGCGACGGAGACCAGCGACCAGCTG
Weird, because there are two hits from our species, and these two hits are in a conserved protein sequence and not the spacer?
Spacer Thirty-Five: TGACGACCACACACACGAGGCCGTGCGTGTGCTTGTA
Hits from prokaryotes in gene segment
Spacer Thirty-Six: CCACGTCCCGGTGACACGCAGCTCGGTGAGATCGC
Hits from prokaryotes in gene segment
Spacer Thirty-Seven: CATGGAGTCTTCAACATTTCATGGGCTGGGCTTGGCC
No significant hits . . . lowest E-value is 5.9
Spacer Thirty-Eight: CGCAACCCGACGATCGAGGACGGGCCGTCCCTGGA
Bad e-values
Spacer Thirty-Nine: GCGTCGGACTGCGTCGATAGTGTTCGTGCTCATGTT
No significant hits . . . lowest E-value is 1.4
Spacer Forty: CAGACTTCTACTGGAAGGCGAAAACTGAGAAGGCA
No significant hits . . . lowest E-value is 5.2
Spacer Forty-One: TACGCTCGACGACCTCCGTCGTGCGCTCCAGAAGTCA
Hit from prokaryote in gene segment
Spacer Forty-Two: GCGCTGCGGACGTGGTGTCAGAGGGGTTACCAGTAACT
No significant hits . . . lowest E-value is 1.6
Spacer Forty-Three: ACTCCGGGTACACTGGTGGCGATGCTCTACTCGCC
No significant hits . . . lowest E-value is 1.3
Spacer Forty-Four: ACACGTTTCTTTTTTTCAGGAGCCATCACTCACTC
No significant hits . . . lowest E-value is 1.3
Other things I want to do:
compare our species' spacers with themselves
look to see if these blast results show hits to any other halophiles
look at spacer length and number between our species and others??
any known viruses that infect halophiles??
blast the known viral genome against the nr/nt database and it comes up with matches:
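For the first item in the to-do list above (comparing our species' spacers with themselves), a rough sketch of one possible approach: compare every pair of spacers on both strands and report pairs that differ at only a few positions. This is a simplification, since it only compares spacers of equal length without gaps, and the names `near_matches`, `CRISPR1_6`, and so on, as well as the cutoff of 2 mismatches, are assumptions made for illustration. The three sequences are copied from this page.

```python
def reverse_complement(seq):
    """Reverse complement of a DNA sequence (A, C, G, T only)."""
    pairs = {"A": "T", "C": "G", "G": "C", "T": "A"}
    return "".join(pairs[base] for base in reversed(seq.upper()))


def mismatches(a, b):
    """Number of differing positions, or None if the lengths differ."""
    if len(a) != len(b):
        return None
    return sum(1 for x, y in zip(a, b) if x != y)


def near_matches(spacers, max_mismatches=2):
    """Yield (name_a, name_b, strand, n_mismatches) for similar spacer pairs."""
    names = sorted(spacers)
    for i, name_a in enumerate(names):
        for name_b in names[i + 1:]:
            seq_a, seq_b = spacers[name_a], spacers[name_b]
            # Check the second spacer as written and as its reverse complement.
            for strand, candidate in (("forward", seq_b),
                                      ("reverse", reverse_complement(seq_b))):
                n = mismatches(seq_a, candidate)
                if n is not None and n <= max_mismatches:
                    yield name_a, name_b, strand, n


# Hypothetical labels: CRISPR One spacers Six and Eight, CRISPR Two spacer Twenty-Nine.
spacers = {
    "CRISPR1_6": "TGCGAGTGTTGCGGGGAACCGACTCGGGTAGGCCAG",
    "CRISPR1_8": "GACGAGTATGCCAACCGGCTCGTGAGCGGGCGC",
    "CRISPR2_29": "CTGGCCTACCCGAGTCGGTTCCCCGCAGCACTCGCA",
}

pairs = list(near_matches(spacers))
for pair in pairs:
    print(pair)
```

Running this over the full spacer lists from both CRISPRs would flag any pairs worth a closer look; a proper alignment (for example, blasting the spacers against each other) would still be needed to catch matches between spacers of different lengths.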