Difference between revisions of "Cas-Protein aa sequences"
(→Halomicrobium mukohataei:) |
|||
Line 1: | Line 1: | ||
== Halomicrobium mukohataei: == | == Halomicrobium mukohataei: == | ||
− | >CRISPR-associated protein Cas2 at complement(1004454..1004717)<br> | + | >2muk CRISPR-associated protein Cas2 at complement(1004454..1004717)<br> |
MVYVVVVYDMEADRTHKMLKFLRRYLTHVQNSVLEGDVTEGDLEKIRSGVDDLLKPGESTIIYQISSEKLVDRSVFGDDPA<br>ADDQFL | MVYVVVVYDMEADRTHKMLKFLRRYLTHVQNSVLEGDVTEGDLEKIRSGVDDLLKPGESTIIYQISSEKLVDRSVFGDDPA<br>ADDQFL | ||
− | > CRISPR-associated protein Cas1 at complement(1004721..1005719)<br> | + | > 1muk CRISPR-associated protein Cas1 at complement(1004721..1005719)<br> |
MIMNDNYHVFSDGRIERHDDTVRVITDDGEKKYLPVENAEAIFLHGQIEYNTRFVSFLNQEGVAVHVFGWHDHYAGSIMP<br>KRGQTSGQTLVDQVRAYDDPAHRLELAQAFVDGSIHNMRANVTYYDGRGHDFEDVLAELTEARSSLDRMETIDET<br>MGVEARARKAYYSTFDEILPDEFVFGGRQYDPPNNEVNSLISFGNSLVYANVVSAIRATALDPTVSFLHEPGERRYSLA<br>LDIADLFKPLLADRVIFRLVNRGQLTSDDFEAEMNACLLNEHGRKTYSKAYEETLDETIEHPDLGKKVSYQYLLRVEV<br>YKLKKHLLTGEEYVPFQRWW | MIMNDNYHVFSDGRIERHDDTVRVITDDGEKKYLPVENAEAIFLHGQIEYNTRFVSFLNQEGVAVHVFGWHDHYAGSIMP<br>KRGQTSGQTLVDQVRAYDDPAHRLELAQAFVDGSIHNMRANVTYYDGRGHDFEDVLAELTEARSSLDRMETIDET<br>MGVEARARKAYYSTFDEILPDEFVFGGRQYDPPNNEVNSLISFGNSLVYANVVSAIRATALDPTVSFLHEPGERRYSLA<br>LDIADLFKPLLADRVIFRLVNRGQLTSDDFEAEMNACLLNEHGRKTYSKAYEETLDETIEHPDLGKKVSYQYLLRVEV<br>YKLKKHLLTGEEYVPFQRWW | ||
− | > CRISPR-associated RecB family exonuclease at CAYHDFCWSC"<br> | + | > 4muk CRISPR-associated RecB family exonuclease at CAYHDFCWSC"<br> |
MTDSSGDPVDRFLAAARDESAELPFRLTGVMFQYYVVCERELWFLSRDVEIDRDTPAIVRGSDVDDSAYADKRRDVRVDGI<br>IAIDVLDSGEILEVKPSSSMTEPARLQLLFYLWYLDRVTGVEKTGVLAHPAEKRRETVELTPETSAEVESAIEGIRAVV<br> | MTDSSGDPVDRFLAAARDESAELPFRLTGVMFQYYVVCERELWFLSRDVEIDRDTPAIVRGSDVDDSAYADKRRDVRVDGI<br>IAIDVLDSGEILEVKPSSSMTEPARLQLLFYLWYLDRVTGVEKTGVLAHPAEKRRETVELTPETSAEVESAIEGIRAVV<br> | ||
− | > CRISPR-associated helicase Cas3 at complement(1006273..1008843)<br> | + | > 3muk CRISPR-associated helicase Cas3 at complement(1006273..1008843)<br> |
MGTEPISHPATDGDEATRLLDHLDDVAGRAESVVSADATTIGGDPLPEMTAVVARCHDFGKATTWFQAYVVGERDASDR<br>TNHSLLGAYLGYYVLDRLGYDSEDCLAALVALAKHHGQLPDVEEYVDGVSRFENTSEASNERQRILIEQVGNVDDH<br>RRQFAQSFVADATDGNGSWEEFAQSIEDESLFDTVHEHVSLFGFGSDRSAPSEEFYGLLLQLWSGLVFADKTVAAGIE<br>NGDLDGSEPDARLLSEYIADLGGDTDEDGQAAALDTLRGEARDDVLGGVERFRDSEVSIATLTLPTGMGKTLTGLS<br>AALALRDEGERVVYALPFTSVIDQVADELADVFDTDARDDLLTIHHHLAETVTKLGDPDEDPDEYARLAEMVAESW<br>RSGMTLTTFVQLFESLVGPSNTQSMKLPALYDSVIVLDEPQALPMQWWKLVRRIATILTEEYDATIVAMTATQPRAT<br>DEETASQALFDDAFELVDDVDRYFGHFERVEYDVHDSVLAFDDTDAIVDYETAGRTILDETGRSESTLAICNTIDSAT<br>ALTDAIEEREAVVDVGRCLELELDDGADVDALVERVESTLSSNERALVHLSTRLRPRDRLALVEATKRLTERDVPVLA<br>VSTQLIEAGVDISFDRVYRDIAPIDSVVQAAGRCNRSFESDLGTVTVWWLAPPAGTTTTPAQAVYDSEGVSTISLTAR<br>TLDAIGADDGTVAEQTMTRDAVEHYYGLVADRNPGDPEYVKWVDEANADALGGLSLIGQRESVDVVVCRTDGDR<br>ELADAMVAALDEFDYDAFGDYREAAKDITVSLPIYSRDSTEAETVRNLEPLGDADLRVLRRARGTSYFDETKGLAVD<br>EPCVDDRFL | MGTEPISHPATDGDEATRLLDHLDDVAGRAESVVSADATTIGGDPLPEMTAVVARCHDFGKATTWFQAYVVGERDASDR<br>TNHSLLGAYLGYYVLDRLGYDSEDCLAALVALAKHHGQLPDVEEYVDGVSRFENTSEASNERQRILIEQVGNVDDH<br>RRQFAQSFVADATDGNGSWEEFAQSIEDESLFDTVHEHVSLFGFGSDRSAPSEEFYGLLLQLWSGLVFADKTVAAGIE<br>NGDLDGSEPDARLLSEYIADLGGDTDEDGQAAALDTLRGEARDDVLGGVERFRDSEVSIATLTLPTGMGKTLTGLS<br>AALALRDEGERVVYALPFTSVIDQVADELADVFDTDARDDLLTIHHHLAETVTKLGDPDEDPDEYARLAEMVAESW<br>RSGMTLTTFVQLFESLVGPSNTQSMKLPALYDSVIVLDEPQALPMQWWKLVRRIATILTEEYDATIVAMTATQPRAT<br>DEETASQALFDDAFELVDDVDRYFGHFERVEYDVHDSVLAFDDTDAIVDYETAGRTILDETGRSESTLAICNTIDSAT<br>ALTDAIEEREAVVDVGRCLELELDDGADVDALVERVESTLSSNERALVHLSTRLRPRDRLALVEATKRLTERDVPVLA<br>VSTQLIEAGVDISFDRVYRDIAPIDSVVQAAGRCNRSFESDLGTVTVWWLAPPAGTTTTPAQAVYDSEGVSTISLTAR<br>TLDAIGADDGTVAEQTMTRDAVEHYYGLVADRNPGDPEYVKWVDEANADALGGLSLIGQRESVDVVVCRTDGDR<br>ELADAMVAALDEFDYDAFGDYREAAKDITVSLPIYSRDSTEAETVRNLEPLGDADLRVLRRARGTSYFDETKGLAVD<br>EPCVDDRFL | ||
− | > CRISPR-associated protein, TM1800 family at complement(1008843..1009520)<br> | + | > 00muk CRISPR-associated protein, TM1800 family at complement(1008843..1009520)<br> |
MKQTYRVIPRTTVAGLIAAMLGIERDGYYDLFAPGESLVAIEPTSELRTMKLPMNTLSTADEHMASLNPRGKLSIKLPDPSKP<br>RQQHNYEVLVEPAYRIDVWLADDERYDRLRSLLESGESYYVPSLGLSEHLATIDYHGEFPVEHGPDGETVAIDSTVPE<br>AVDSIVPDPETRYQIEQTPAFMERDDGGRTTSAFVSYAYNPDGGSLRVADVSTYSVDDRAVVFT<br> | MKQTYRVIPRTTVAGLIAAMLGIERDGYYDLFAPGESLVAIEPTSELRTMKLPMNTLSTADEHMASLNPRGKLSIKLPDPSKP<br>RQQHNYEVLVEPAYRIDVWLADDERYDRLRSLLESGESYYVPSLGLSEHLATIDYHGEFPVEHGPDGETVAIDSTVPE<br>AVDSIVPDPETRYQIEQTPAFMERDDGGRTTSAFVSYAYNPDGGSLRVADVSTYSVDDRAVVFT<br> | ||
− | > CRISPR-associated protein, TM1801 family at complement(1009635..1010690)<br> | + | > 01muk CRISPR-associated protein, TM1801 family at complement(1009635..1010690)<br> |
MSEHYPTVSNRSEIVFAYDAVDANPNGNPLSGANRPRIDPHTDQAIVTDVRLKRYLRDQLQDDGHGVYIRNVKEDDGDQ<br>ATREDLLEDRLKDIDLDDVDEADIENAVFGQFLENSADVRYFGATMSIDMDDEKVDHLPDHFTGPVQFSPGKSLHR<br>VMENEEYNSLTSVIATGDDKAQGGFDLDDHRIQYAFIGFHGLVDEHGAEGTLLTDGDVRRLDTLCWRALKNQTISR<br>SKVGQEPRLYLRVEYADESFHLGGLDQDIDLDSSESAPVEEIRNVRDICVDVSALLERLDAASDRIDTVHVVASDVLEL<br>SVDGETGGPEFLYDALESRVGSESVREIDVYEDAKATMPEE | MSEHYPTVSNRSEIVFAYDAVDANPNGNPLSGANRPRIDPHTDQAIVTDVRLKRYLRDQLQDDGHGVYIRNVKEDDGDQ<br>ATREDLLEDRLKDIDLDDVDEADIENAVFGQFLENSADVRYFGATMSIDMDDEKVDHLPDHFTGPVQFSPGKSLHR<br>VMENEEYNSLTSVIATGDDKAQGGFDLDDHRIQYAFIGFHGLVDEHGAEGTLLTDGDVRRLDTLCWRALKNQTISR<br>SKVGQEPRLYLRVEYADESFHLGGLDQDIDLDSSESAPVEEIRNVRDICVDVSALLERLDAASDRIDTVHVVASDVLEL<br>SVDGETGGPEFLYDALESRVGSESVREIDVYEDAKATMPEE | ||
Line 23: | Line 23: | ||
'''Need to find Cas1 and Cas2''' | '''Need to find Cas1 and Cas2''' | ||
− | > CRISPR-associated helicase Cas3 at complement(1844..4447)<br> | + | > 3cal CRISPR-associated helicase Cas3 at complement(1844..4447)<br> |
MTFEQYISHPAKTDDGEPTLLIGEGGQFDQDGHLQTVANRMVEACRGQTLADGTPAEPVAEVIGLTHDFAKLTHWAQKH<br>LRCQPFQHSDEYRYHAFPGALVTLYCLLNRRDGTGPLKDDHAAEVATLVVAGHHDIQSPPEPSKLAKNYGRDTLEV<br>QETYKRITEQFEDIGDRVPERADQIIHKATDGEGSWEDFREWHADRTAPIDGPHHHLIYFAQIGDRDTRDGYYSDVV<br>RLWTALKFADQTDASGLKNEDIGGTLLDRSELTRHIEDLDKGENVLAELNDLRNKARQEVTENVETLVKSNDVGLIT<br>LPTGFGKTYAGISAGLRAANINDSRLVYALPYTSILDQTASEIQSVFGVSPYSRAFTLHHHLSNTYTGLGDHYTDADI<br>GRSPGALHAESWLSGVTLTTTVQLFESLAAPTARQATRVPALHEAVVVIDEPQAIPEDWWQIIPELVELLVDSYNATVI<br>LMTATQPGLVKYGSNTLNTRELTDATDKYTDFLADHPRVRYRLHDTVRTDVGDEYATLDYATAGSRVSGAAEGG<br>RDVLAVCNTRASAEELYRSVTATVNTKEKVPVELGHLLHDYVEETGELPSPVELRRFAIDAVAERDVATLYAFLSGDV<br>RPDDRKLIIDTLYDDEIGDEDEPEPLLDSDWSVILVSTSVVEAGVDVSFDTVFRDYAPIPNIVQSGGRCNRSFDGETGD<br>VVVWRLAEPENGSAIPSLVIHGADGGDQLPLLLATGNVLRRHAARDGTIDESTMVSDTVSEFYESLFEGPLNPGNERL<br>ADAISSASMSELEGEHMIDEIEDYEDVVACLTDEERDDLLSSDPEAVSIRGHPGAQVSTDLEAWTKKVTIGNSQYLLV<br>DALSGSYHPVFGVR | MTFEQYISHPAKTDDGEPTLLIGEGGQFDQDGHLQTVANRMVEACRGQTLADGTPAEPVAEVIGLTHDFAKLTHWAQKH<br>LRCQPFQHSDEYRYHAFPGALVTLYCLLNRRDGTGPLKDDHAAEVATLVVAGHHDIQSPPEPSKLAKNYGRDTLEV<br>QETYKRITEQFEDIGDRVPERADQIIHKATDGEGSWEDFREWHADRTAPIDGPHHHLIYFAQIGDRDTRDGYYSDVV<br>RLWTALKFADQTDASGLKNEDIGGTLLDRSELTRHIEDLDKGENVLAELNDLRNKARQEVTENVETLVKSNDVGLIT<br>LPTGFGKTYAGISAGLRAANINDSRLVYALPYTSILDQTASEIQSVFGVSPYSRAFTLHHHLSNTYTGLGDHYTDADI<br>GRSPGALHAESWLSGVTLTTTVQLFESLAAPTARQATRVPALHEAVVVIDEPQAIPEDWWQIIPELVELLVDSYNATVI<br>LMTATQPGLVKYGSNTLNTRELTDATDKYTDFLADHPRVRYRLHDTVRTDVGDEYATLDYATAGSRVSGAAEGG<br>RDVLAVCNTRASAEELYRSVTATVNTKEKVPVELGHLLHDYVEETGELPSPVELRRFAIDAVAERDVATLYAFLSGDV<br>RPDDRKLIIDTLYDDEIGDEDEPEPLLDSDWSVILVSTSVVEAGVDVSFDTVFRDYAPIPNIVQSGGRCNRSFDGETGD<br>VVVWRLAEPENGSAIPSLVIHGADGGDQLPLLLATGNVLRRHAARDGTIDESTMVSDTVSEFYESLFEGPLNPGNERL<br>ADAISSASMSELEGEHMIDEIEDYEDVVACLTDEERDDLLSSDPEAVSIRGHPGAQVSTDLEAWTKKVTIGNSQYLLV<br>DALSGSYHPVFGVR | ||
− | > CRISPR-associated protein, TM1800 family at complement(4458..5279)<br> | + | > 00cal CRISPR-associated protein, TM1800 family at complement(4458..5279)<br> |
MTQQDLTDYASEGGDSSSSRYIPDTCIGFDVTADFAHFRKVGNNSAKPSYRIPPRTTVAGLLAGILGMPRDSYYDLFSPARS<br>AIAIVPKGLPHTYTMGITTVNTKADDAIQYLPQEKHYTKSAEMLTPESYVKYDRQRDTYEMLVDPEYRVYVALSDQN<br>SYNELRERLETSRYHYSPALGLSECIADIRNVEIHTVGPGIEDAVDSAAYDDSEVVPKPGVTIKRERAPLYMESTDGGR<br>RTTEFGNITYAAGDDRLPVDESRTHTVGEHQVAFY | MTQQDLTDYASEGGDSSSSRYIPDTCIGFDVTADFAHFRKVGNNSAKPSYRIPPRTTVAGLLAGILGMPRDSYYDLFSPARS<br>AIAIVPKGLPHTYTMGITTVNTKADDAIQYLPQEKHYTKSAEMLTPESYVKYDRQRDTYEMLVDPEYRVYVALSDQN<br>SYNELRERLETSRYHYSPALGLSECIADIRNVEIHTVGPGIEDAVDSAAYDDSEVVPKPGVTIKRERAPLYMESTDGGR<br>RTTEFGNITYAAGDDRLPVDESRTHTVGEHQVAFY | ||
− | > CRISPR-associated protein, TM1801 family at complement(5304..6332)<br> | + | > 01cal CRISPR-associated protein, TM1801 family at complement(5304..6332)<br> |
MTDTNDATIQNRSEIAFVIDAKDTNPNGDPLTADNEPRIDPVTGQCVVTDVRLKRYLRDQLVEDDHVVLIANPNDEVLTR<br>KEMYDAVESEMGVSTDEAEPEELLEAFVKTAADVRYFGATISLDTDLAEDLPNQFEGPVQFNHGRSYHEVARNTESK<br>QLATVIANEDDDGGKKDQGTFATDNRISYGVIGFGGRINDNAAKDTHLTEDDVERLDTLCWRALKNQTVTRSKAG<br>QQPRLYIRVEYKQDGFEIGRLNDRIGVDSDLPEDEIRGTDDFNLDVSELVTTLADNDARIDTVHITADSAVTFALPDG<br>ETGDREALYTVLNDILGAEAVDAYDVYERYVN | MTDTNDATIQNRSEIAFVIDAKDTNPNGDPLTADNEPRIDPVTGQCVVTDVRLKRYLRDQLVEDDHVVLIANPNDEVLTR<br>KEMYDAVESEMGVSTDEAEPEELLEAFVKTAADVRYFGATISLDTDLAEDLPNQFEGPVQFNHGRSYHEVARNTESK<br>QLATVIANEDDDGGKKDQGTFATDNRISYGVIGFGGRINDNAAKDTHLTEDDVERLDTLCWRALKNQTVTRSKAG<br>QQPRLYIRVEYKQDGFEIGRLNDRIGVDSDLPEDEIRGTDDFNLDVSELVTTLADNDARIDTVHITADSAVTFALPDG<br>ETGDREALYTVLNDILGAEAVDAYDVYERYVN | ||
== Haloarcula sinaiiensis == | == Haloarcula sinaiiensis == | ||
− | > CRISPR-associated protein, TM1801 family at 481071..482093<br> | + | > 01sin CRISPR-associated protein, TM1801 family at 481071..482093<br> |
MTTLNRSELLFVYDAQDCNPNGNPIGDNRPRRDPDTGKGIITDVRLKRYLRDQLQDDGFDIYVKKIAGESRTRTTLIKDVL<br>GGVSDAEDLEDIEDIGESFLEAATDVRYFGATLSFEASDDEEDEAFREALNSAFPNHYQGPVQFLPAKSLNEVEENEEY<br>DSLTSVISTGEGNRQGGFDLDDKRIKYGIFPFYGLVDNHGAETTNLSAADVERLDTLCWRALKNQTTSRSKLGQEPR<br>LYLRVEYAEDDYHIGGLQNLLDLDGGDNLLRSISDVVLDVSDLLSTLDKNRDRIETIHLIADDRMTLDTGDEAISGDQ<br>LATELDSRGLDVHEIDVVDERDLAR | MTTLNRSELLFVYDAQDCNPNGNPIGDNRPRRDPDTGKGIITDVRLKRYLRDQLQDDGFDIYVKKIAGESRTRTTLIKDVL<br>GGVSDAEDLEDIEDIGESFLEAATDVRYFGATLSFEASDDEEDEAFREALNSAFPNHYQGPVQFLPAKSLNEVEENEEY<br>DSLTSVISTGEGNRQGGFDLDDKRIKYGIFPFYGLVDNHGAETTNLSAADVERLDTLCWRALKNQTTSRSKLGQEPR<br>LYLRVEYAEDDYHIGGLQNLLDLDGGDNLLRSISDVVLDVSDLLSTLDKNRDRIETIHLIADDRMTLDTGDEAISGDQ<br>LATELDSRGLDVHEIDVVDERDLAR | ||
− | > CRISPR-associated protein, TM1800 family at 482111..482908<br> | + | > 00sin CRISPR-associated protein, TM1800 family at 482111..482908<br> |
MSPQIDADGIPDRCLSFTVSSTWGHFKRVGRTVTKQTYRIPPRTTVAGMLAAIVGAERNSYYETFGEDNAAIAITPESDLRTI<br>NIPTTGLGTDPDQDVTTTAKKRRNYSLTYQETTGDRQLHAYEVLADPSYRIDVALEDEEFYQKLHDHLEAGTSVYPP<br>SLGKSEYLATIENVQAGQEPEPASSSGPYDIDSIVPIELADAIPQGGVAYESERSPAVMERHQGGRRTTRFDDYVFTRR<br>SDGTVKTDAGTDVKPVSVGNRIVVFR | MSPQIDADGIPDRCLSFTVSSTWGHFKRVGRTVTKQTYRIPPRTTVAGMLAAIVGAERNSYYETFGEDNAAIAITPESDLRTI<br>NIPTTGLGTDPDQDVTTTAKKRRNYSLTYQETTGDRQLHAYEVLADPSYRIDVALEDEEFYQKLHDHLEAGTSVYPP<br>SLGKSEYLATIENVQAGQEPEPASSSGPYDIDSIVPIELADAIPQGGVAYESERSPAVMERHQGGRRTTRFDDYVFTRR<br>SDGTVKTDAGTDVKPVSVGNRIVVFR | ||
− | > CRISPR-associated helicase Cas3 at 482948..485755<br> | + | > 3sin CRISPR-associated helicase Cas3 at 482948..485755<br> |
MDLPLISHPDVDENDAYPSSQLTDDGALRLDAHNRTVGDRAVRLFGPDDDRTQYLRIAASLHDFGKVTPQFQAHVRPTE<br>NYDGPEDEKVHARLGALATWYVLEETDAPPRDQLAATLAVARHHQALPNAAQYTGETLARAVEASADVLQAQINR<br>IDETWPEAADDLFRCTGSDGSSWAEFAEWARSGAATAALQDCSVRETLSGVEPTPSRLPDFLYDRTIHYWAALTLAD<br>KSHAMGLSEERVFDFDTLDLETLERHINTLRQQEASSLHEAQLNDERERARRQALRGTHEWLNQEQTDIATLTLPTG<br>LGKTFTGLSAAFEARDILDETDTEHPDNPRPVIYALPYTSIIEQTRALFEDPELWGADPKKSALTVHHYLSETVVYGDE<br>YDAADVDESDAGEAAQFLGETWRDGTILTTFVQLFESLTGPSNRQGLKLPSLDSALIILDEPQALPKDWWDGIERLLQ<br>LLTGEYGAKVIAMTATQPSLFREMGTSSLLELGAAHAQTDCSHCRRQPAYETELPPISQESYFNEADRVRYTIDESALS<br>HRLETEEEFVEYDSAASRIHETAAQADSVLAICNTIESSRQLTQAVSQHSDAVHLGPVLESILTAPDANVAESEMNPGE<br>IVSEVLETVGIEDQCSDEPTARNQEVPSPQGPFVLTLNSRYRPFDRQVIIQLAEQLSTGPVPFILISTQAIEAGVDISFEM<br>VYRDIAPLDSIVQAAGRCNRAYEWGKNGGQVTIWTLAPTGPDAANPPAYWVYERGSTDAGMPDHLRLISDVLNKV<br>PGQRDIADIHLSKHAVDRYFEQLSRRSLDDGSIRDHIDHAEGRWLSQQSLIGGYETADVLVAVSESESQTLDRITQMF<br>TDGNPRAYDRLDDLSHLRVSLPAKIIDENPKLTRIDGQGRKDDGVNVFRFNGTGGLTYTLEDGGLRATEESIQDRFTI<br> | MDLPLISHPDVDENDAYPSSQLTDDGALRLDAHNRTVGDRAVRLFGPDDDRTQYLRIAASLHDFGKVTPQFQAHVRPTE<br>NYDGPEDEKVHARLGALATWYVLEETDAPPRDQLAATLAVARHHQALPNAAQYTGETLARAVEASADVLQAQINR<br>IDETWPEAADDLFRCTGSDGSSWAEFAEWARSGAATAALQDCSVRETLSGVEPTPSRLPDFLYDRTIHYWAALTLAD<br>KSHAMGLSEERVFDFDTLDLETLERHINTLRQQEASSLHEAQLNDERERARRQALRGTHEWLNQEQTDIATLTLPTG<br>LGKTFTGLSAAFEARDILDETDTEHPDNPRPVIYALPYTSIIEQTRALFEDPELWGADPKKSALTVHHYLSETVVYGDE<br>YDAADVDESDAGEAAQFLGETWRDGTILTTFVQLFESLTGPSNRQGLKLPSLDSALIILDEPQALPKDWWDGIERLLQ<br>LLTGEYGAKVIAMTATQPSLFREMGTSSLLELGAAHAQTDCSHCRRQPAYETELPPISQESYFNEADRVRYTIDESALS<br>HRLETEEEFVEYDSAASRIHETAAQADSVLAICNTIESSRQLTQAVSQHSDAVHLGPVLESILTAPDANVAESEMNPGE<br>IVSEVLETVGIEDQCSDEPTARNQEVPSPQGPFVLTLNSRYRPFDRQVIIQLAEQLSTGPVPFILISTQAIEAGVDISFEM<br>VYRDIAPLDSIVQAAGRCNRAYEWGKNGGQVTIWTLAPTGPDAANPPAYWVYERGSTDAGMPDHLRLISDVLNKV<br>PGQRDIADIHLSKHAVDRYFEQLSRRSLDDGSIRDHIDHAEGRWLSQQSLIGGYETADVLVAVSESESQTLDRITQMF<br>TDGNPRAYDRLDDLSHLRVSLPAKIIDENPKLTRIDGQGRKDDGVNVFRFNGTGGLTYTLEDGGLRATEESIQDRFTI<br> | ||
− | > CRISPR-associated RecB family exonuclease at CLYQDICWM"<br> | + | > 4sin CRISPR-associated RecB family exonuclease at CLYQDICWM"<br> |
MTELSTVDRYIRDEREPGREPETRITGLMIQYYHVCQRELWFMAHGIDIDRETTNIQRGTHVDETSYQDSRQSFMIDNRIQL<br>DVLESGDIMEVKVSSTLEKPARMQLVFYLWFLDNIYDVDKDGVLAYPTERKRETIQLDAANIEAVENTIRGILDVVNR<br> | MTELSTVDRYIRDEREPGREPETRITGLMIQYYHVCQRELWFMAHGIDIDRETTNIQRGTHVDETSYQDSRQSFMIDNRIQL<br>DVLESGDIMEVKVSSTLEKPARMQLVFYLWFLDNIYDVDKDGVLAYPTERKRETIQLDAANIEAVENTIRGILDVVNR<br> | ||
− | > CRISPR-associated protein Cas1 at 486393..487385 | + | > 1sin CRISPR-associated protein Cas1 at 486393..487385 |
MTKPNHHIFADGELSRSESTLRIDTLEGDTEYLPVESVDSLYLHGQIDFNTRTLGLLNEHGVPLHIFGWKDYYKGSYLPKRGQ<br>VSGNTVVEQVRAYDDRRRLNIGQKMIRASIHNMRRNLVYYDGRRGDFSDAIASLDEFKDETADTDDINQLRAVEGN<br>ARSTYYDCFDQILRDPFELSKREYNPPTNEANALVSFLNGMVYTTSVSAIRKTALDPTIGFVHEPGERRFTLSLDIADIF<br>KPILADRLIFRLVNRQQLSLSDFESELEGCLLTESGRMTVLEAFEETLDKTIEHPRLERKVSFKTLVQTDVYSLKKHILTG<br>ESYHPTERWW | MTKPNHHIFADGELSRSESTLRIDTLEGDTEYLPVESVDSLYLHGQIDFNTRTLGLLNEHGVPLHIFGWKDYYKGSYLPKRGQ<br>VSGNTVVEQVRAYDDRRRLNIGQKMIRASIHNMRRNLVYYDGRRGDFSDAIASLDEFKDETADTDDINQLRAVEGN<br>ARSTYYDCFDQILRDPFELSKREYNPPTNEANALVSFLNGMVYTTSVSAIRKTALDPTIGFVHEPGERRFTLSLDIADIF<br>KPILADRLIFRLVNRQQLSLSDFESELEGCLLTESGRMTVLEAFEETLDKTIEHPRLERKVSFKTLVQTDVYSLKKHILTG<br>ESYHPTERWW | ||
Line 56: | Line 56: | ||
== Haloferax denitrificans == | == Haloferax denitrificans == | ||
− | > CRISPR-associated protein, TM1801 family at 3381..4460<br> | + | > 01den CRISPR-associated protein, TM1801 family at 3381..4460<br> |
MSDTNDAVTNRSEIVFLYDAVDANPNGNPLSGSNRPRIDPQTQQAIVTDVRLKRYLRDQLDDDGHGVYIRNVQREGNQY<br>TRGELLEDRLKDVEPDEYDLDDGEESERFRNDVFGEFLDNSVDVRYFGATMSVDTDDVYAKHLPDHFTGPVQFSPG<br>KSVHAVNENEEYDSLTSVIATQENKQQGGFDLDDHRIQYGLIRFHGLVDEHGAADTNLTTGDVERLDTLCWRAIKN<br>QTISRSKIGQEPRLYCRVEYADESFHLGGLDRDLVLDDDRSKPDKELRTVRDLTLEIDGFVDRLAAASARINRIRIVAS<br>DVLDISYDGDVGSSELLYGALREAVGDDAVEVVDVYEEHAETLPAGDAA<br> | MSDTNDAVTNRSEIVFLYDAVDANPNGNPLSGSNRPRIDPQTQQAIVTDVRLKRYLRDQLDDDGHGVYIRNVQREGNQY<br>TRGELLEDRLKDVEPDEYDLDDGEESERFRNDVFGEFLDNSVDVRYFGATMSVDTDDVYAKHLPDHFTGPVQFSPG<br>KSVHAVNENEEYDSLTSVIATQENKQQGGFDLDDHRIQYGLIRFHGLVDEHGAADTNLTTGDVERLDTLCWRAIKN<br>QTISRSKIGQEPRLYCRVEYADESFHLGGLDRDLVLDDDRSKPDKELRTVRDLTLEIDGFVDRLAAASARINRIRIVAS<br>DVLDISYDGDVGSSELLYGALREAVGDDAVEVVDVYEEHAETLPAGDAA<br> | ||
− | > CRISPR-associated protein, TM1800 family at 4462..5259 | + | > 00den CRISPR-associated protein, TM1800 family at 4462..5259 |
MEQESLDEWVDGGDDGRPERCLSFTVSGPWGHFRRVEGNVVKQTYRIIPRTTVAGLLAAVLGIERDGYYDLFAPGSSGIAIE<br>PVRAVRTLNMPMNTLSTASGNLQSLNGRGKISVKLPNPTALRQQHNYEVLVEPAYRVDVWLAETARYRELREMLEA<br>GKSHYVPSLGLSEHLAEIDYHGEFDVESASGAGRVEVDSAVPNAVDDVVLREGTRCQVEESPAFMRVDAGGRTTTEF<br>TTYAYNPDAEPLVVDGVDSVEVDGRNVVFV<br> | MEQESLDEWVDGGDDGRPERCLSFTVSGPWGHFRRVEGNVVKQTYRIIPRTTVAGLLAAVLGIERDGYYDLFAPGSSGIAIE<br>PVRAVRTLNMPMNTLSTASGNLQSLNGRGKISVKLPNPTALRQQHNYEVLVEPAYRVDVWLAETARYRELREMLEA<br>GKSHYVPSLGLSEHLAEIDYHGEFDVESASGAGRVEVDSAVPNAVDDVVLREGTRCQVEESPAFMRVDAGGRTTTEF<br>TTYAYNPDAEPLVVDGVDSVEVDGRNVVFV<br> | ||
− | > CRISPR-associated helicase Cas3 at 5259..7904<br> | + | > 3den CRISPR-associated helicase Cas3 at 5259..7904<br> |
MTERYSHPPNDVHDGVPLVDHLGDVAERVGYVVPADAKTPAGEPLRAVVETLAYVHDFGKATTYFQDYLLRSVEPRYEQY<br>RYHAPLGSFAAYYALSAQNFDPETCLAGFVAVAKHHGRLPDVAAYVFSRANRRENVSRGNQSTAEMQQVAIAKQL<br>KDIDEHAPELAREVFQSATDGEGSWSSFRGSFRELLTEVKKSTGSSATAITRETLSERCYGLVLECWGSLVLADKTSAA<br>AAASGSNASAGTYDAEKPTFERLEEYIESIERTADADRDGNRSERLNFHRAKARASVLSNVESFADADGGVATVTLPT<br>GMGKTLTGLSAALSVRDQLGGGRVVYALPFTSIIDQVVAEVEEIYQVDTTGRLLTAHHHLSEATIVDESDESADEADA<br>NDDVAGMLAESWRAGMTVTTFVQLFESLAGPANRQSMKLPALRDSIIVLDEPQSLPLDWWKLVPRLVRMLTEQYNA<br>TVIAMTATQPQLFDDATELVSAPETYFEATERVQYELDDSTTRYIDSREEPKSYAEAASAIVDETTDANGTNSDEAGE<br>TESVLAICNTIDSAQALTTHVTETLSDAISVGSVYADVLENADRNSTDIEVATVAKRVADAGGRPVLHLSTRLRPVDR<br>LRLIETAKLLTERGCSLVVVSTQLVEAGVDISFDRVYRDLAPIDSIVQAAGRCNRSFERDRGHVVVWWLEAPDEQTKT<br>PSEAVYNRGTALLPVATETLDDIGGTEGTIPEATVSKTAVEEYYRRLHDEKNIGKDAYVEYVNEARADELGELSLIEQR<br>RSVEVVVCRTTEERERVEAVRAAWQDYEFETVRRLMDSLKEASVSVPIYRGDSKEAKALSGLTRIYEDTETRWIDTRD<br>ARHGSYFDSTTGLAAESSVDNRIL<br> | MTERYSHPPNDVHDGVPLVDHLGDVAERVGYVVPADAKTPAGEPLRAVVETLAYVHDFGKATTYFQDYLLRSVEPRYEQY<br>RYHAPLGSFAAYYALSAQNFDPETCLAGFVAVAKHHGRLPDVAAYVFSRANRRENVSRGNQSTAEMQQVAIAKQL<br>KDIDEHAPELAREVFQSATDGEGSWSSFRGSFRELLTEVKKSTGSSATAITRETLSERCYGLVLECWGSLVLADKTSAA<br>AAASGSNASAGTYDAEKPTFERLEEYIESIERTADADRDGNRSERLNFHRAKARASVLSNVESFADADGGVATVTLPT<br>GMGKTLTGLSAALSVRDQLGGGRVVYALPFTSIIDQVVAEVEEIYQVDTTGRLLTAHHHLSEATIVDESDESADEADA<br>NDDVAGMLAESWRAGMTVTTFVQLFESLAGPANRQSMKLPALRDSIIVLDEPQSLPLDWWKLVPRLVRMLTEQYNA<br>TVIAMTATQPQLFDDATELVSAPETYFEATERVQYELDDSTTRYIDSREEPKSYAEAASAIVDETTDANGTNSDEAGE<br>TESVLAICNTIDSAQALTTHVTETLSDAISVGSVYADVLENADRNSTDIEVATVAKRVADAGGRPVLHLSTRLRPVDR<br>LRLIETAKLLTERGCSLVVVSTQLVEAGVDISFDRVYRDLAPIDSIVQAAGRCNRSFERDRGHVVVWWLEAPDEQTKT<br>PSEAVYNRGTALLPVATETLDDIGGTEGTIPEATVSKTAVEEYYRRLHDEKNIGKDAYVEYVNEARADELGELSLIEQR<br>RSVEVVVCRTTEERERVEAVRAAWQDYEFETVRRLMDSLKEASVSVPIYRGDSKEAKALSGLTRIYEDTETRWIDTRD<br>ARHGSYFDSTTGLAAESSVDNRIL<br> | ||
− | > CRISPR-associated RecB family exonuclease at 7901..8449<br> | + | > 4den CRISPR-associated RecB family exonuclease at 7901..8449<br> |
MTTKDPVDRLLATARGTPVDEPFRVTGVMMQYYYVCERELWFESRSLEIDRENATVVRGMRVDETAYDEKRESLRLGMISL<br>DLLDDGRVVEVKPSSALTEPAEMQLSYYLWYLDRVAGVRRDGVLAHPRERRRESVELTEERAKKVESSIRRIHELVRRS<br>SPPPAERKPFCESCAYHDFCWC<br> | MTTKDPVDRLLATARGTPVDEPFRVTGVMMQYYYVCERELWFESRSLEIDRENATVVRGMRVDETAYDEKRESLRLGMISL<br>DLLDDGRVVEVKPSSALTEPAEMQLSYYLWYLDRVAGVRRDGVLAHPRERRRESVELTEERAKKVESSIRRIHELVRRS<br>SPPPAERKPFCESCAYHDFCWC<br> | ||
− | > CRISPR-associated protein Cas1 at 8440..9444<br> | + | > 1den CRISPR-associated protein Cas1 at 8440..9444<br> |
MVLTMDRNYHVFSDGRLERNDDTLRLVTEAGDKKYVPIENAEAFFLHGQIDFNTRLMSFLNQRTVALHVFGWEDYYAGSV<br>MPKRGQTSGRTVVEQVRAYEDSDHRRRLAAAMVSASIHNMRTNVVYYNNRDRELETEIDDLDAASARVDQTRPID<br>ELMGVEATARKAYYRSFNQILPDEFRLERREYNPPPNEVNSLISFGNALVYANCVSAIRATALDPSISYLHEPGERRYSL<br>SVDLADLFKPVLADRVLFRLINRKQITPSDFETDLGSCLLDEDGRRTYTKAFEETLERTVEHPKLNRKVSYQYLMRLEA<br>YKLKKHLLTGEEYEPFERWW<br> | MVLTMDRNYHVFSDGRLERNDDTLRLVTEAGDKKYVPIENAEAFFLHGQIDFNTRLMSFLNQRTVALHVFGWEDYYAGSV<br>MPKRGQTSGRTVVEQVRAYEDSDHRRRLAAAMVSASIHNMRTNVVYYNNRDRELETEIDDLDAASARVDQTRPID<br>ELMGVEATARKAYYRSFNQILPDEFRLERREYNPPPNEVNSLISFGNALVYANCVSAIRATALDPSISYLHEPGERRYSL<br>SVDLADLFKPVLADRVLFRLINRKQITPSDFETDLGSCLLDEDGRRTYTKAFEETLERTVEHPKLNRKVSYQYLMRLEA<br>YKLKKHLLTGEEYEPFERWW<br> | ||
− | > CRISPR-associated protein Cas2 at 9446..9706<br> | + | > 2den CRISPR-associated protein Cas2 at 9446..9706<br> |
MYVVMVYDLEAERTYKALKLGRRYLTHVQNSVLEGEISEGDLATLRNEVEDLLKSGESVIIYELSSDALLNRSVYGNDPTDEK<br>RFL | MYVVMVYDLEAERTYKALKLGRRYLTHVQNSVLEGEISEGDLATLRNEVEDLLKSGESVIIYELSSDALLNRSVYGNDPTDEK<br>RFL | ||
== Haloferax mediteranei == | == Haloferax mediteranei == | ||
− | > CRISPR-associated protein Cas2 at complement(183157..183420)<br> | + | > 2med CRISPR-associated protein Cas2 at complement(183157..183420)<br> |
MVYIIVVYDMRADRTRLMLNFLRKYLTHVQNSVFEGEVTEGDLETIRNHTQTLLNPDESTIIYRIGSEKYVDRTVIGEDPTDE<br>SRFL | MVYIIVVYDMRADRTRLMLNFLRKYLTHVQNSVFEGEVTEGDLETIRNHTQTLLNPDESTIIYRIGSEKYVDRTVIGEDPTDE<br>SRFL | ||
− | > CRISPR-associated protein Cas1 at complement(183424..184416)<br> | + | > 1med CRISPR-associated protein Cas1 at complement(183424..184416)<br> |
MDRNYHIFSDGCLERHNDTVRLVTLDDEKKYLPIEKAEAIYLHGQIDYNTRLISFLNKHGTALHIFGWKDYYAGSVMPKRG<br>QTSGRTLVEQVRAYDSPAQRTDIARKFVDGSIHNMRANVSYYNSRGHDFDSELASLDAAGARLTETTAVEEIMGVE<br>ATARRAYYSTFDSILPDGFVFNGRRYNPPTNEVNSLISFGNSLVYANVVSGIRATALDPAVSFLHEPGERRYSLALDIA<br>DLFKPLLADRVTFRLLNRQQLTPADFETDLNSCLLTEHGRKTFSKAFEETLEQTVEHPRLNRKVSYQYLLRIEAYKLKK<br>HLLTGEEYVPFKRWW | MDRNYHIFSDGCLERHNDTVRLVTLDDEKKYLPIEKAEAIYLHGQIDYNTRLISFLNKHGTALHIFGWKDYYAGSVMPKRG<br>QTSGRTLVEQVRAYDSPAQRTDIARKFVDGSIHNMRANVSYYNSRGHDFDSELASLDAAGARLTETTAVEEIMGVE<br>ATARRAYYSTFDSILPDGFVFNGRRYNPPTNEVNSLISFGNSLVYANVVSGIRATALDPAVSFLHEPGERRYSLALDIA<br>DLFKPLLADRVTFRLLNRQQLTPADFETDLNSCLLTEHGRKTFSKAFEETLEQTVEHPRLNRKVSYQYLLRIEAYKLKK<br>HLLTGEEYVPFKRWW | ||
− | > CRISPR-associated RecB family exonuclease at CAYHDFCWSC"<br> | + | > 4den CRISPR-associated RecB family exonuclease at CAYHDFCWSC"<br> |
MTDVDPVQRLLRTARDDARDESFRVTGVMMQYYVVCKRELWFHSRHIEIDRGNSAIVRGTHIDETAYSDKRRHVSIDSTIAI<br>DVLDDGRVMEVKPSSALVEPAKLQLLYYLWYLKHVVGVEKSGVLAHPTERKREDVELTDETEQKVEDAIRGVHEIIAR<br> | MTDVDPVQRLLRTARDDARDESFRVTGVMMQYYVVCKRELWFHSRHIEIDRGNSAIVRGTHIDETAYSDKRRHVSIDSTIAI<br>DVLDDGRVMEVKPSSALVEPAKLQLLYYLWYLKHVVGVEKSGVLAHPTERKREDVELTDETEQKVEDAIRGVHEIIAR<br> | ||
− | > CRISPR-associated helicase Cas3 at complement(184969..187563)<br> | + | > 3den CRISPR-associated helicase Cas3 at complement(184969..187563)<br> |
MTAEYERRYSHPAEDGRPAVLLFEHLRDVRDRVDMVIPEGATTPEGKPLCGVIRRLALVHDFGKATTFFQQYIGAQLGQPT<br>HDKLRYHAPLGSLAAYYVLRETGHSTATCLAGFVAVAKHHGRLPNVVEYVFKRMASPDPEKWMADKKQVENIHKN<br>APRLATAIFEEATGDDDAWLDFAQSCVNDESLFTEIADHVTRNGERPITEPTFLTDEFYGLMLECWGTLVLADKTSA<br>AGAPQASSVYDATNPRTADLTQYIDNLGDGNTDPDGSRTEQLNYYRSRARQDVLDSVTEFVESESDVATITLPTGM<br>GKTLTGLNAALEIRDQTGGDRIVYALPFTSIIDQVGAEVQDIFDTDGSDGIVALHHHLSDTRFGYSDGDDDASDLND<br>DIAGMLGESWRAGLTVTTFVQLFESLAGPRNTQSMKIPALRGNVIVLDEPQSLPLDWWKLVPRLVDVLTEQYGATVIS<br>MTATQPELFPAPMSLVSDAERYFTVAERVQYHLHDSVERFLRGEEQPLEYNDAANELVEVAQSGDSLLAICNTIDSA<br>RVLANAVTERIQAVNLAEQYFESLRNGSSDPVAETVQLVRQSSKQAFVHLSTRLRPTDRLALIRIIKQLRASGSPVIAVT<br>TQLVEAGVDISFENVYRDLAPVDSIVQAAGRCNRSFERELGAVTVWWLTQPAEQRHTPAVAVYDTQGPSLTPVTAS<br>ALDSVRDGQTKLAGQSVARAAVQEYYGTLHKEKNVGREEYHEYVDDADAESLGRLSLITQTKTVDVLICVTEADQT<br>LVESLEGAYENYDFQEVKRLLNATKPLRVSIPIYRDDSPEANVVTELRPLAGRKDESSIRVLDAGTRDFEKYFDHTTGF<br>VVADSTVEDRFL | MTAEYERRYSHPAEDGRPAVLLFEHLRDVRDRVDMVIPEGATTPEGKPLCGVIRRLALVHDFGKATTFFQQYIGAQLGQPT<br>HDKLRYHAPLGSLAAYYVLRETGHSTATCLAGFVAVAKHHGRLPNVVEYVFKRMASPDPEKWMADKKQVENIHKN<br>APRLATAIFEEATGDDDAWLDFAQSCVNDESLFTEIADHVTRNGERPITEPTFLTDEFYGLMLECWGTLVLADKTSA<br>AGAPQASSVYDATNPRTADLTQYIDNLGDGNTDPDGSRTEQLNYYRSRARQDVLDSVTEFVESESDVATITLPTGM<br>GKTLTGLNAALEIRDQTGGDRIVYALPFTSIIDQVGAEVQDIFDTDGSDGIVALHHHLSDTRFGYSDGDDDASDLND<br>DIAGMLGESWRAGLTVTTFVQLFESLAGPRNTQSMKIPALRGNVIVLDEPQSLPLDWWKLVPRLVDVLTEQYGATVIS<br>MTATQPELFPAPMSLVSDAERYFTVAERVQYHLHDSVERFLRGEEQPLEYNDAANELVEVAQSGDSLLAICNTIDSA<br>RVLANAVTERIQAVNLAEQYFESLRNGSSDPVAETVQLVRQSSKQAFVHLSTRLRPTDRLALIRIIKQLRASGSPVIAVT<br>TQLVEAGVDISFENVYRDLAPVDSIVQAAGRCNRSFERELGAVTVWWLTQPAEQRHTPAVAVYDTQGPSLTPVTAS<br>ALDSVRDGQTKLAGQSVARAAVQEYYGTLHKEKNVGREEYHEYVDDADAESLGRLSLITQTKTVDVLICVTEADQT<br>LVESLEGAYENYDFQEVKRLLNATKPLRVSIPIYRDDSPEANVVTELRPLAGRKDESSIRVLDAGTRDFEKYFDHTTGF<br>VVADSTVEDRFL | ||
− | > CRISPR-associated protein, TM1800 family at complement(187565..188371)<br> | + | > 00den CRISPR-associated protein, TM1800 family at complement(187565..188371)<br> |
MTQQTLTEWNDTQNESGTAGPPRCLSLTVRGPWGHFRRVEGNIVKQTYRIIPRTTVAGLLAAVLGIERDGYYELFARGRSAI<br>AIEPVAPLRTVNMPVNTLSTADESMKSLNPRGKISIKLPDPTKPRQQHNYEVLVDPAYRLYVWMSDSHWFETLHETL<br>DEGKSHYVPSLGLSEYLAEITYHGRFEVESGPTDTAVAVDSAVPNAVDHVVPDAESRCQIEESPAFMTVDGGGRTTT<br>DFTSYTYNPDAGPVRVRNPDTAIVDGNTVMFV<br> | MTQQTLTEWNDTQNESGTAGPPRCLSLTVRGPWGHFRRVEGNIVKQTYRIIPRTTVAGLLAAVLGIERDGYYELFARGRSAI<br>AIEPVAPLRTVNMPVNTLSTADESMKSLNPRGKISIKLPDPTKPRQQHNYEVLVDPAYRLYVWMSDSHWFETLHETL<br>DEGKSHYVPSLGLSEYLAEITYHGRFEVESGPTDTAVAVDSAVPNAVDHVVPDAESRCQIEESPAFMTVDGGGRTTT<br>DFTSYTYNPDAGPVRVRNPDTAIVDGNTVMFV<br> | ||
− | > CRISPR-associated protein, TM1801 family at complement(188374..189429)<br> | + | > 01den CRISPR-associated protein, TM1801 family at complement(188374..189429)<br> |
MTVHTDPVENRSEIVFLYDAVDANPNGNPLSGANRPRIDPQTQQAIVTDVRLKRYLRDQLDADGHGVYIRNVQEEGEQYT<br>REKLLEDRLKEVDPDEYDDDGELAGVVFQAFLEESTDVRYFGATMSVDLDGKYGSLPDHFTGPVQFSPGKSMHAVN<br>ENEEYDSLTSVIATQTGKEQGGFGLDDHRIQYGLIRFHGLVDEHAAEDTALTAEDVERLDTLCWRAIKNQTISRSKIG<br>QEPRFYLRVEYATESFHLGGLDKDLELDRTDGRTKSDDELRNVRDLTLSVDSLVDRLERSTNRIERVHVTASDVLSVS<br>HGDEVGGPEVLYEALEDRLGTDAVHVIDVYDEHVETLPN | MTVHTDPVENRSEIVFLYDAVDANPNGNPLSGANRPRIDPQTQQAIVTDVRLKRYLRDQLDADGHGVYIRNVQEEGEQYT<br>REKLLEDRLKEVDPDEYDDDGELAGVVFQAFLEESTDVRYFGATMSVDLDGKYGSLPDHFTGPVQFSPGKSMHAVN<br>ENEEYDSLTSVIATQTGKEQGGFGLDDHRIQYGLIRFHGLVDEHAAEDTALTAEDVERLDTLCWRAIKNQTISRSKIG<br>QEPRFYLRVEYATESFHLGGLDKDLELDRTDGRTKSDDELRNVRDLTLSVDSLVDRLERSTNRIERVHVTASDVLSVS<br>HGDEVGGPEVLYEALEDRLGTDAVHVIDVYDEHVETLPN | ||
== Haloferax mucosum == | == Haloferax mucosum == | ||
− | > CRISPR-associated protein, TM1801 family at 3755..4810<br> | + | > 01muc CRISPR-associated protein, TM1801 family at 3755..4810<br> |
MSEHSDHVENRSEIVFLYDAVDANPNGNPLSGANRPRIDPQTNQAIVTDVRLKRYLRDQLDADGHGVYIRNVQEEGNRYT<br>REKLLENRLKEVDPDEYDDDEELSEAVYRTFLEESVDVRYFGATMSVDLDDRYGSLPDHFTGPVQFSPGKSMHSVNE<br>NEEYDSLTSVIATQDDKQQGGFDLDDHRIQYGLIRFHGLVDEHAAADTNLTTEDVERLDTLCWRAIKNQTISRSKV<br>GQEPRLYLRVEYATDSFHLGGLDKDLDLDRKDGRTKPDDQLRTVRDLTLSVDSLVARLEKSANRIERVHVAASDVLS<br>VSHDDDVGGPEVLYDALADRLGTDAVHTIDVYDEHTTALPN<br> | MSEHSDHVENRSEIVFLYDAVDANPNGNPLSGANRPRIDPQTNQAIVTDVRLKRYLRDQLDADGHGVYIRNVQEEGNRYT<br>REKLLENRLKEVDPDEYDDDEELSEAVYRTFLEESVDVRYFGATMSVDLDDRYGSLPDHFTGPVQFSPGKSMHSVNE<br>NEEYDSLTSVIATQDDKQQGGFDLDDHRIQYGLIRFHGLVDEHAAADTNLTTEDVERLDTLCWRAIKNQTISRSKV<br>GQEPRLYLRVEYATDSFHLGGLDKDLDLDRKDGRTKPDDQLRTVRDLTLSVDSLVARLEKSANRIERVHVAASDVLS<br>VSHDDDVGGPEVLYDALADRLGTDAVHTIDVYDEHTTALPN<br> | ||
− | > CRISPR-associated protein, TM1800 family at 4813..5619<br> | + | > 00muc CRISPR-associated protein, TM1800 family at 4813..5619<br> |
MDQQTLSKWDDSQGESETADPPRCLSLTIRGPWGHFRRVEGNIVKQTYRIIPRTTVAGLLAAVLGIERDGYYELFGPGHSAV<br>AIEPVEPLRTLNLPVNTLSTANESMKSMNAKGKISIKIPDPTKPRQQHNYEVLVDPAYRLYVWLRDSDWFDMLHEML<br>DEGKSHYVPSLGLSEYLAEIEYHGQFTVETGPADSVVAVDSAVPNAIDRIVPDAETRCQIEESPGFMTSDTGGRTTTG<br>FTSYAYNPDAGPVNVRNPDTHIIDGNTVMFV<br> | MDQQTLSKWDDSQGESETADPPRCLSLTIRGPWGHFRRVEGNIVKQTYRIIPRTTVAGLLAAVLGIERDGYYELFGPGHSAV<br>AIEPVEPLRTLNLPVNTLSTANESMKSMNAKGKISIKIPDPTKPRQQHNYEVLVDPAYRLYVWLRDSDWFDMLHEML<br>DEGKSHYVPSLGLSEYLAEIEYHGQFTVETGPADSVVAVDSAVPNAIDRIVPDAETRCQIEESPGFMTSDTGGRTTTG<br>FTSYAYNPDAGPVNVRNPDTHIIDGNTVMFV<br> | ||
− | > CRISPR-associated helicase Cas3 at 5678..8215<br> | + | > 3muc CRISPR-associated helicase Cas3 at 5678..8215<br> |
MLLFAHLKDVRDRIDMVVPEDTKTPDGMLLSGVIQRLALVHDVGKATTFFQQYIGESPGKPRYEKLRYHAPLGSLAAYYVL<br>DETGHSTATCLAGFVAVAKHHGRLPNVVEYLFNRTARPDPEKWDPVKAQISDIHEHAPKLVTAIFQEATGDADAWQ<br>NFAKACVNDESLFTEIADHITTNGERPITDPSFLTDEFYGLLLECWGTLVLADKTSAAGAPQSSTVYDGTTPKTTTLA<br>EYIDEIEDPNANPDGSQTERLNYFRSRARKDVLDSVSVFIESESNVATITLPTGMGKTLTGLNAALEIRDRTDRDRIVY<br>ALPFTSIIDQVGAEVQDIFDTDGTDGLLALHHHLSDTRFGYRDSDDDVSDLNDDIAGMLGESWRAGIMVTTFVQLF<br>ESLAGPRNTQSMKIPALRESVVVLDEPQSLPLDWWKLVPRLVAVLTEQYDATVISMTATQPELFSDSVSLVSDPEQYFSAAERVRYHLHDSVERFLQGDEQALEYDVAAAEIADVAASGESLLAICNTIDSARKLAEAVTDRIQSVSVAEQYFASLKSGS<br>ADPVEDTVRQIIQSPKQAFVHLSTRLRPTDRLALIRIIKKLRSAGYAVTAVTTQLVEAGVDISFENVYRDLAPVDSIVQA<br>AGRCNRSFEHERGDVTVWWLAQPAAQTHTPAVAVYDLQGPSLTPVTAKALDSVRQGQTTLPGKSVARTAVQNYY<br>DRLHTEKNVGKEAYSAYVDTADAESLGRLSLITQTKTVDVLICVTEADHELVSSLETAYEKYEFEEVKRLLTATKPLRV<br>SIPIYRDDSPEADAVTSLRPLADREEESQIRVLSAETRAFENYFDRSTGFVVTDRTVEDRFL | MLLFAHLKDVRDRIDMVVPEDTKTPDGMLLSGVIQRLALVHDVGKATTFFQQYIGESPGKPRYEKLRYHAPLGSLAAYYVL<br>DETGHSTATCLAGFVAVAKHHGRLPNVVEYLFNRTARPDPEKWDPVKAQISDIHEHAPKLVTAIFQEATGDADAWQ<br>NFAKACVNDESLFTEIADHITTNGERPITDPSFLTDEFYGLLLECWGTLVLADKTSAAGAPQSSTVYDGTTPKTTTLA<br>EYIDEIEDPNANPDGSQTERLNYFRSRARKDVLDSVSVFIESESNVATITLPTGMGKTLTGLNAALEIRDRTDRDRIVY<br>ALPFTSIIDQVGAEVQDIFDTDGTDGLLALHHHLSDTRFGYRDSDDDVSDLNDDIAGMLGESWRAGIMVTTFVQLF<br>ESLAGPRNTQSMKIPALRESVVVLDEPQSLPLDWWKLVPRLVAVLTEQYDATVISMTATQPELFSDSVSLVSDPEQYFSAAERVRYHLHDSVERFLQGDEQALEYDVAAAEIADVAASGESLLAICNTIDSARKLAEAVTDRIQSVSVAEQYFASLKSGS<br>ADPVEDTVRQIIQSPKQAFVHLSTRLRPTDRLALIRIIKKLRSAGYAVTAVTTQLVEAGVDISFENVYRDLAPVDSIVQA<br>AGRCNRSFEHERGDVTVWWLAQPAAQTHTPAVAVYDLQGPSLTPVTAKALDSVRQGQTTLPGKSVARTAVQNYY<br>DRLHTEKNVGKEAYSAYVDTADAESLGRLSLITQTKTVDVLICVTEADHELVSSLETAYEKYEFEEVKRLLTATKPLRV<br>SIPIYRDDSPEADAVTSLRPLADREEESQIRVLSAETRAFENYFDRSTGFVVTDRTVEDRFL | ||
− | > CRISPR-associated RecB family exonuclease at 8212..8766<br> | + | > 4muc CRISPR-associated RecB family exonuclease at 8212..8766<br> |
MTELDPVERLLQTARDETRDDSFRVTGVMMQYYVVCKRELWFHSRHIEIDRGNEAIVRGTHVDETAYSEKRRHVSIDSTIAI<br>DVLDDGRVLEVKPSSSLVEPAKLQLLYYLWYLKHVVGVEKSGVLAHPTERKREDVELTSAAEQRVEDAIRGVYEIITTE<br>SPPPAVQKPVCGSCAYHDFCWSC<br> | MTELDPVERLLQTARDETRDDSFRVTGVMMQYYVVCKRELWFHSRHIEIDRGNEAIVRGTHVDETAYSEKRRHVSIDSTIAI<br>DVLDDGRVLEVKPSSSLVEPAKLQLLYYLWYLKHVVGVEKSGVLAHPTERKREDVELTSAAEQRVEDAIRGVYEIITTE<br>SPPPAVQKPVCGSCAYHDFCWSC<br> | ||
− | > CRISPR-associated protein Cas1 at 8726..9760<br> | + | > 1muc CRISPR-associated protein Cas1 at 8726..9760<br> |
MARVRITTSAGVANMDRNYHIFSDGCLERHNDTVRLVTLDDEKKYLPIEKAEAIFLHGQIDYNTRLVSFLNQHGTAIHVFG<br>WKDYYAGSIMPKRGQTSGRTLVEQVRAYDSTARRTDIARKFVEGSIHNMRANVSYYNSRGHEFDTELASLDAAADR<br>LTEADGVQESMGVEATARRAYYSTFDSILPDGFVFNGRQYNPPTNEVNSLISFGNSLVYANVVSAIRATALDPAVSYL<br>HEPGERRYSLALDIADLFKPLLADRVLFRLLNRKQLTPADFETDLNSCLLTEDGRKTFSKAFEETLEQTIEHPELNRKVS<br>YQYLLRIEAYKLKKHLLTGEEYVPFKRWW<br> | MARVRITTSAGVANMDRNYHIFSDGCLERHNDTVRLVTLDDEKKYLPIEKAEAIFLHGQIDYNTRLVSFLNQHGTAIHVFG<br>WKDYYAGSIMPKRGQTSGRTLVEQVRAYDSTARRTDIARKFVEGSIHNMRANVSYYNSRGHEFDTELASLDAAADR<br>LTEADGVQESMGVEATARRAYYSTFDSILPDGFVFNGRQYNPPTNEVNSLISFGNSLVYANVVSAIRATALDPAVSYL<br>HEPGERRYSLALDIADLFKPLLADRVLFRLLNRKQLTPADFETDLNSCLLTEDGRKTFSKAFEETLEQTIEHPELNRKVS<br>YQYLLRIEAYKLKKHLLTGEEYVPFKRWW<br> | ||
− | > CRISPR-associated protein Cas2 at 9764..10027<br> | + | > 2muc CRISPR-associated protein Cas2 at 9764..10027<br> |
MVYIIVVYDMRADRTRLMLNFLRKYLTHVQNSVFEGAVTEGDLETIRNHTQSLLKPDESAIIYRIGSDKYVERTVIGDDPTDD<br>SQFL<br> | MVYIIVVYDMRADRTRLMLNFLRKYLTHVQNSVFEGAVTEGDLETIRNHTQSLLKPDESAIIYRIGSDKYVERTVIGDDPTDD<br>SQFL<br> | ||
== Haloferax sulfurifontis == | == Haloferax sulfurifontis == | ||
− | > CRISPR-associated RecB family exonuclease at 8856..9479<br> | + | > 4sul CRISPR-associated RecB family exonuclease at 8856..9479<br> |
MTGSSFKDELVYVSALNEFVYCPRRYYYQRYYDEIGEPYELVDGRSKHKHASRRGSWVTERYFRSDDLGMHGKIDLVDTET<br>GTPTPVERKRSESGSYYSSDEVQLAGYCMLLEDAIDESVNVGYIYLYSTDTRHSVRITQDHREAVREILRRIRSMSVDSI<br>PPLTQNQSKCEACSARTYCMPGETAVLSPSEAEGTGWEGTTPGDLI<br> | MTGSSFKDELVYVSALNEFVYCPRRYYYQRYYDEIGEPYELVDGRSKHKHASRRGSWVTERYFRSDDLGMHGKIDLVDTET<br>GTPTPVERKRSESGSYYSSDEVQLAGYCMLLEDAIDESVNVGYIYLYSTDTRHSVRITQDHREAVREILRRIRSMSVDSI<br>PPLTQNQSKCEACSARTYCMPGETAVLSPSEAEGTGWEGTTPGDLI<br> | ||
− | > CRISPR-associated protein Cas1 at 9446..10471<br> | + | > 1sul CRISPR-associated protein Cas1 at 9446..10471<br> |
MGGNDTRRFDMKESQGIFEDSVVYVTTQGSQVRIDGGQVVIYDVEGDDGELGAFPVEKLDTINVFGGVNFSTPFVATANE<br>HGIVLNYFTQNGKYRGSFVPERNTIAEVRRAQYALTGRDELAISKAMIAAKIRNARTILSRKGVRGTTVLKDLGERVSS<br>ASNKDDLRGIEGEAAERYFSRLDETLVDNWTFDTRSKRPPEDHINSLLSLTYVMMKNEVLSALRCYNLDPFLGVLHA<br>DRHGRPSLALDLQEEFRPLFCDAFVTRLVNRGTITHDQFGNDNRLTDDGFQAYLDKFDGYMNEELTHPYFEYRVSR<br>RKAIRQQVILLRKAITGELDDYHALEVSR<br> | MGGNDTRRFDMKESQGIFEDSVVYVTTQGSQVRIDGGQVVIYDVEGDDGELGAFPVEKLDTINVFGGVNFSTPFVATANE<br>HGIVLNYFTQNGKYRGSFVPERNTIAEVRRAQYALTGRDELAISKAMIAAKIRNARTILSRKGVRGTTVLKDLGERVSS<br>ASNKDDLRGIEGEAAERYFSRLDETLVDNWTFDTRSKRPPEDHINSLLSLTYVMMKNEVLSALRCYNLDPFLGVLHA<br>DRHGRPSLALDLQEEFRPLFCDAFVTRLVNRGTITHDQFGNDNRLTDDGFQAYLDKFDGYMNEELTHPYFEYRVSR<br>RKAIRQQVILLRKAITGELDDYHALEVSR<br> | ||
− | > CRISPR-associated protein Cas2 at 10480..10740<br> | + | > 2sul CRISPR-associated protein Cas2 at 10480..10740<br> |
MTYDVSDDTNRRRVYRTLERYGAWRQYSVFELDVSKSERVELEDELESHIEPADGDRVRLYRLCEACQEATTDLGNEPPDE<br>QSNVI | MTYDVSDDTNRRRVYRTLERYGAWRQYSVFELDVSKSERVELEDELESHIEPADGDRVRLYRLCEACQEATTDLGNEPPDE<br>QSNVI | ||
Line 130: | Line 130: | ||
== Halorhabdus utahensis == | == Halorhabdus utahensis == | ||
− | > CRISPR-associated protein Cas2 at complement(2149888..2150151)<br> | + | > 2uta CRISPR-associated protein Cas2 at complement(2149888..2150151)<br> |
MVYVVAVYDVEADRTYLFLNFLRRYLTHVQNSVFEGEITEGDLEEVKGKLDSMLEPGESVIVYRMSSEQYVSRTVYGEDPTE<br>DSQFL | MVYVVAVYDVEADRTYLFLNFLRRYLTHVQNSVFEGEITEGDLEEVKGKLDSMLEPGESVIVYRMSSEQYVSRTVYGEDPTE<br>DSQFL | ||
− | > CRISPR-associated protein Cas1 at complement(2150153..2151145)<br> | + | > 1 uta CRISPR-associated protein Cas1 at complement(2150153..2151145)<br> |
MNDNYHIFSDGRVERHNDTVRLVTEDDEKKYLPIENAEALYLHGQIDFNTRVISFLDDHGVAMHVFGWNDYYSGSIMPER<br>GQTSGQTVVEQVRAYDDEAHRGNIAREIVAGSIHNMRANVTYYDNRDYDLSATLESLDRRRDEIKSVASVEEAMGV<br>EASARRAYYAIFDQILPDAFVFGGRKYNPPNNKVNSLISFGNSLVYANIVSAIRATALDPTISYLHEPGERRYSLALDLA<br>DLFKPVLTDRVVFRLVNRGQLSDDDFDSEMNACLLTESGRETFSKEFEQTLDRTIEHPNLNRKVSYQYLLRVEAYKLK<br>KHLLTGESYESFERWW | MNDNYHIFSDGRVERHNDTVRLVTEDDEKKYLPIENAEALYLHGQIDFNTRVISFLDDHGVAMHVFGWNDYYSGSIMPER<br>GQTSGQTVVEQVRAYDDEAHRGNIAREIVAGSIHNMRANVTYYDNRDYDLSATLESLDRRRDEIKSVASVEEAMGV<br>EASARRAYYAIFDQILPDAFVFGGRKYNPPNNKVNSLISFGNSLVYANIVSAIRATALDPTISYLHEPGERRYSLALDLA<br>DLFKPVLTDRVVFRLVNRGQLSDDDFDSEMNACLLTESGRETFSKEFEQTLDRTIEHPNLNRKVSYQYLLRVEAYKLK<br>KHLLTGESYESFERWW | ||
− | > CRISPR-associated RecB family exonuclease at complement(2151149..2151727)<br> | + | > 4uta CRISPR-associated RecB family exonuclease at complement(2151149..2151727)<br> |
MSGHTDERTGETDPVDLLLESARDEAVESSFHVTGVMMQYYEVCERELWFESRNIEIDRENPNVVRGTHVDETAYDEKRRN<br>LSIDGRIAPDLLDDGRVVEVKPSSTLVEPARLQLLYYLWYLDRVVGVEKEGVLAHPTERKRESVELTDETVQQVEDAIR<br>GIYDVVRSETPPPATEKPFCESCAYYDFCWSC | MSGHTDERTGETDPVDLLLESARDEAVESSFHVTGVMMQYYEVCERELWFESRNIEIDRENPNVVRGTHVDETAYDEKRRN<br>LSIDGRIAPDLLDDGRVVEVKPSSTLVEPARLQLLYYLWYLDRVVGVEKEGVLAHPTERKRESVELTDETVQQVEDAIR<br>GIYDVVRSETPPPATEKPFCESCAYYDFCWSC | ||
− | > CRISPR-associated helicase Cas3 at complement(2151724..2154318)<br> | + | > 3uta CRISPR-associated helicase Cas3 at complement(2151724..2154318)<br> |
MAERYSHPPEGGREGVALEVHLADVADRVAHVVPDDATTPTDGSLRSVVETLAWVHDFGKATTYFQEYLLEDSEADPPML<br>RHHAPIGSFAAYHALDTQGFDTETCLAGFVAVAKHHGRLPNVAEYVVDRTHRRDGDSRQNSVEKRQTVVLKQIGD<br>IHDTVPDLAETVFENATGGSGGWESFVRSYQSGSLLSEIEETVGTQTAGRGVDPDALSSSCYGLVLQCWSALVLADK<br>TSAAGAANESETYAPSQPGFETISDYIEDLEAGVDADKAGTKTERLNYHRSDARKNVLDNVASFAESGGGVATLTLP<br>TGMGKTLTGLSAAFALRDALGGNRVVYGLPFTSIIDQVVDEIQEIYETDTAGRLLTAHHHLSETTIRDTDDQSADDA<br>DRNDDVAGMLGESWRAGVTVTTFVQLFESLAGPQNRQSMKLPALRDAVVVLDEPQSLPLDWWKLVPRLVDVLTEQ<br>YNATVIAMTATQPRLFEDEFELVDDPDRYFEVVRRVSYELDDSTERYIESQSEPKSYAAAANELRAAVESGQTTLAVC<br>NTIDSARELTEQVGDGSFVDVGRLYDDELQEAGSADDVDPVELAKRVAATDDNALLHLSTRLRPADRLTLIETAKAL<br>TERGHPTLAISTQLVEAGVDISFDRVFRDLAPIDSIVQAAGRCNRSFEREQGVVTVWWLDVPDEQSKTPAEAVYNRG<br>TTLLPTVADTLRQIRDESGSLSETDVARRGVEWYFERLREDKDVGKQTYADWVDDAKAKELGTISLIDEQLSAEIVVT<br>RTPAERERAEAIRNAQRNFEFETLGQLVDETKPLRISVPYYSEDSETADAITDLPPLVEDEGIYELDVQQNPSHFDRTT<br>GFVVPEASVDHQFL | MAERYSHPPEGGREGVALEVHLADVADRVAHVVPDDATTPTDGSLRSVVETLAWVHDFGKATTYFQEYLLEDSEADPPML<br>RHHAPIGSFAAYHALDTQGFDTETCLAGFVAVAKHHGRLPNVAEYVVDRTHRRDGDSRQNSVEKRQTVVLKQIGD<br>IHDTVPDLAETVFENATGGSGGWESFVRSYQSGSLLSEIEETVGTQTAGRGVDPDALSSSCYGLVLQCWSALVLADK<br>TSAAGAANESETYAPSQPGFETISDYIEDLEAGVDADKAGTKTERLNYHRSDARKNVLDNVASFAESGGGVATLTLP<br>TGMGKTLTGLSAAFALRDALGGNRVVYGLPFTSIIDQVVDEIQEIYETDTAGRLLTAHHHLSETTIRDTDDQSADDA<br>DRNDDVAGMLGESWRAGVTVTTFVQLFESLAGPQNRQSMKLPALRDAVVVLDEPQSLPLDWWKLVPRLVDVLTEQ<br>YNATVIAMTATQPRLFEDEFELVDDPDRYFEVVRRVSYELDDSTERYIESQSEPKSYAAAANELRAAVESGQTTLAVC<br>NTIDSARELTEQVGDGSFVDVGRLYDDELQEAGSADDVDPVELAKRVAATDDNALLHLSTRLRPADRLTLIETAKAL<br>TERGHPTLAISTQLVEAGVDISFDRVFRDLAPIDSIVQAAGRCNRSFEREQGVVTVWWLDVPDEQSKTPAEAVYNRG<br>TTLLPTVADTLRQIRDESGSLSETDVARRGVEWYFERLREDKDVGKQTYADWVDDAKAKELGTISLIDEQLSAEIVVT<br>RTPAERERAEAIRNAQRNFEFETLGQLVDETKPLRISVPYYSEDSETADAITDLPPLVEDEGIYELDVQQNPSHFDRTT<br>GFVVPEASVDHQFL | ||
− | > CRISPR-associated protein, TM1800 family at complement(2154319..2155119)<br> | + | > 00uta CRISPR-associated protein, TM1800 family at complement(2154319..2155119)<br> |
MTMDQESSNGRTGTDGSDLDRCLSFEIRGPWGHFRRVEGNVVKQTYRIVPRTTVAGLIAAVLGIDRDGYYDLFGPEVSAIAI<br>QPVEELRTVNMPMNTLSTAAGDLTSLNPRGKISIKLPNPTKLRQQHNYEVLVDPAYRIDVALADDERYEQLRETLAA<br>GKSHYVPSLGLSEYLAEIDYLGEFDVKPGPASGTIAVDSAVPDAMDDVVLDPETRCQIEQSPAFMASDGSGRTTTEYT<br>TYTYNPDAEPLQVRDVPTSRVDDRTVVFV | MTMDQESSNGRTGTDGSDLDRCLSFEIRGPWGHFRRVEGNVVKQTYRIVPRTTVAGLIAAVLGIDRDGYYDLFGPEVSAIAI<br>QPVEELRTVNMPMNTLSTAAGDLTSLNPRGKISIKLPNPTKLRQQHNYEVLVDPAYRIDVALADDERYEQLRETLAA<br>GKSHYVPSLGLSEYLAEIDYLGEFDVKPGPASGTIAVDSAVPDAMDDVVLDPETRCQIEQSPAFMASDGSGRTTTEYT<br>TYTYNPDAEPLQVRDVPTSRVDDRTVVFV | ||
− | > CRISPR-associated protein, TM1801 family at complement(2155116..2156195)<br> | + | > 01uta CRISPR-associated protein, TM1801 family at complement(2155116..2156195)<br> |
MSEPTQTVENRSEIVFLYDAVDANPNGNPLSGSNRPRIDPQTQQAIVTDVRLKRYLRDQLDDDGHGVYIRNVQEEGTQYT<br>RAELLEDRLKAVDPDDYDLDDDEAAAQFRDDVFGEYLEESADVRYFGATMSVDTDNAYAKHLPDHFTGPVQFSPG<br>KSIHAVNENEEYDSLTSVIATQEGKEQGGFDLDDHRIQYGLIRFHGLVDEHGAADTNLTRADVERLDTLCWRAIKNQ<br>TISRSKVGQEPRLYCRVEYGEESYHLGGLDKDLTLDDEASKDHDELRNIRDLTLEIDDFVDRISNASDQIERIRVVASD<br>VLELSHGTDSGGPDLLYDALRTAIGPDRVDVVDVYDEYPETLPQSTGE<br> | MSEPTQTVENRSEIVFLYDAVDANPNGNPLSGSNRPRIDPQTQQAIVTDVRLKRYLRDQLDDDGHGVYIRNVQEEGTQYT<br>RAELLEDRLKAVDPDDYDLDDDEAAAQFRDDVFGEYLEESADVRYFGATMSVDTDNAYAKHLPDHFTGPVQFSPG<br>KSIHAVNENEEYDSLTSVIATQEGKEQGGFDLDDHRIQYGLIRFHGLVDEHGAADTNLTRADVERLDTLCWRAIKNQ<br>TISRSKVGQEPRLYCRVEYGEESYHLGGLDKDLTLDDEASKDHDELRNIRDLTLEIDDFVDRISNASDQIERIRVVASD<br>VLELSHGTDSGGPDLLYDALRTAIGPDRVDVVDVYDEYPETLPQSTGE<br> |
Revision as of 16:52, 12 November 2009
Contents
Halomicrobium mukohataei:
>2muk CRISPR-associated protein Cas2 at complement(1004454..1004717)
MVYVVVVYDMEADRTHKMLKFLRRYLTHVQNSVLEGDVTEGDLEKIRSGVDDLLKPGESTIIYQISSEKLVDRSVFGDDPA
ADDQFL
> 1muk CRISPR-associated protein Cas1 at complement(1004721..1005719)
MIMNDNYHVFSDGRIERHDDTVRVITDDGEKKYLPVENAEAIFLHGQIEYNTRFVSFLNQEGVAVHVFGWHDHYAGSIMP
KRGQTSGQTLVDQVRAYDDPAHRLELAQAFVDGSIHNMRANVTYYDGRGHDFEDVLAELTEARSSLDRMETIDET
MGVEARARKAYYSTFDEILPDEFVFGGRQYDPPNNEVNSLISFGNSLVYANVVSAIRATALDPTVSFLHEPGERRYSLA
LDIADLFKPLLADRVIFRLVNRGQLTSDDFEAEMNACLLNEHGRKTYSKAYEETLDETIEHPDLGKKVSYQYLLRVEV
YKLKKHLLTGEEYVPFQRWW
> 4muk CRISPR-associated RecB family exonuclease at CAYHDFCWSC"
MTDSSGDPVDRFLAAARDESAELPFRLTGVMFQYYVVCERELWFLSRDVEIDRDTPAIVRGSDVDDSAYADKRRDVRVDGI
IAIDVLDSGEILEVKPSSSMTEPARLQLLFYLWYLDRVTGVEKTGVLAHPAEKRRETVELTPETSAEVESAIEGIRAVV
> 3muk CRISPR-associated helicase Cas3 at complement(1006273..1008843)
MGTEPISHPATDGDEATRLLDHLDDVAGRAESVVSADATTIGGDPLPEMTAVVARCHDFGKATTWFQAYVVGERDASDR
TNHSLLGAYLGYYVLDRLGYDSEDCLAALVALAKHHGQLPDVEEYVDGVSRFENTSEASNERQRILIEQVGNVDDH
RRQFAQSFVADATDGNGSWEEFAQSIEDESLFDTVHEHVSLFGFGSDRSAPSEEFYGLLLQLWSGLVFADKTVAAGIE
NGDLDGSEPDARLLSEYIADLGGDTDEDGQAAALDTLRGEARDDVLGGVERFRDSEVSIATLTLPTGMGKTLTGLS
AALALRDEGERVVYALPFTSVIDQVADELADVFDTDARDDLLTIHHHLAETVTKLGDPDEDPDEYARLAEMVAESW
RSGMTLTTFVQLFESLVGPSNTQSMKLPALYDSVIVLDEPQALPMQWWKLVRRIATILTEEYDATIVAMTATQPRAT
DEETASQALFDDAFELVDDVDRYFGHFERVEYDVHDSVLAFDDTDAIVDYETAGRTILDETGRSESTLAICNTIDSAT
ALTDAIEEREAVVDVGRCLELELDDGADVDALVERVESTLSSNERALVHLSTRLRPRDRLALVEATKRLTERDVPVLA
VSTQLIEAGVDISFDRVYRDIAPIDSVVQAAGRCNRSFESDLGTVTVWWLAPPAGTTTTPAQAVYDSEGVSTISLTAR
TLDAIGADDGTVAEQTMTRDAVEHYYGLVADRNPGDPEYVKWVDEANADALGGLSLIGQRESVDVVVCRTDGDR
ELADAMVAALDEFDYDAFGDYREAAKDITVSLPIYSRDSTEAETVRNLEPLGDADLRVLRRARGTSYFDETKGLAVD
EPCVDDRFL
> 00muk CRISPR-associated protein, TM1800 family at complement(1008843..1009520)
MKQTYRVIPRTTVAGLIAAMLGIERDGYYDLFAPGESLVAIEPTSELRTMKLPMNTLSTADEHMASLNPRGKLSIKLPDPSKP
RQQHNYEVLVEPAYRIDVWLADDERYDRLRSLLESGESYYVPSLGLSEHLATIDYHGEFPVEHGPDGETVAIDSTVPE
AVDSIVPDPETRYQIEQTPAFMERDDGGRTTSAFVSYAYNPDGGSLRVADVSTYSVDDRAVVFT
> 01muk CRISPR-associated protein, TM1801 family at complement(1009635..1010690)
MSEHYPTVSNRSEIVFAYDAVDANPNGNPLSGANRPRIDPHTDQAIVTDVRLKRYLRDQLQDDGHGVYIRNVKEDDGDQ
ATREDLLEDRLKDIDLDDVDEADIENAVFGQFLENSADVRYFGATMSIDMDDEKVDHLPDHFTGPVQFSPGKSLHR
VMENEEYNSLTSVIATGDDKAQGGFDLDDHRIQYAFIGFHGLVDEHGAEGTLLTDGDVRRLDTLCWRALKNQTISR
SKVGQEPRLYLRVEYADESFHLGGLDQDIDLDSSESAPVEEIRNVRDICVDVSALLERLDAASDRIDTVHVVASDVLEL
SVDGETGGPEFLYDALESRVGSESVREIDVYEDAKATMPEE
Haloarcula californiae:
Need to find Cas1 and Cas2
> 3cal CRISPR-associated helicase Cas3 at complement(1844..4447)
MTFEQYISHPAKTDDGEPTLLIGEGGQFDQDGHLQTVANRMVEACRGQTLADGTPAEPVAEVIGLTHDFAKLTHWAQKH
LRCQPFQHSDEYRYHAFPGALVTLYCLLNRRDGTGPLKDDHAAEVATLVVAGHHDIQSPPEPSKLAKNYGRDTLEV
QETYKRITEQFEDIGDRVPERADQIIHKATDGEGSWEDFREWHADRTAPIDGPHHHLIYFAQIGDRDTRDGYYSDVV
RLWTALKFADQTDASGLKNEDIGGTLLDRSELTRHIEDLDKGENVLAELNDLRNKARQEVTENVETLVKSNDVGLIT
LPTGFGKTYAGISAGLRAANINDSRLVYALPYTSILDQTASEIQSVFGVSPYSRAFTLHHHLSNTYTGLGDHYTDADI
GRSPGALHAESWLSGVTLTTTVQLFESLAAPTARQATRVPALHEAVVVIDEPQAIPEDWWQIIPELVELLVDSYNATVI
LMTATQPGLVKYGSNTLNTRELTDATDKYTDFLADHPRVRYRLHDTVRTDVGDEYATLDYATAGSRVSGAAEGG
RDVLAVCNTRASAEELYRSVTATVNTKEKVPVELGHLLHDYVEETGELPSPVELRRFAIDAVAERDVATLYAFLSGDV
RPDDRKLIIDTLYDDEIGDEDEPEPLLDSDWSVILVSTSVVEAGVDVSFDTVFRDYAPIPNIVQSGGRCNRSFDGETGD
VVVWRLAEPENGSAIPSLVIHGADGGDQLPLLLATGNVLRRHAARDGTIDESTMVSDTVSEFYESLFEGPLNPGNERL
ADAISSASMSELEGEHMIDEIEDYEDVVACLTDEERDDLLSSDPEAVSIRGHPGAQVSTDLEAWTKKVTIGNSQYLLV
DALSGSYHPVFGVR
> 00cal CRISPR-associated protein, TM1800 family at complement(4458..5279)
MTQQDLTDYASEGGDSSSSRYIPDTCIGFDVTADFAHFRKVGNNSAKPSYRIPPRTTVAGLLAGILGMPRDSYYDLFSPARS
AIAIVPKGLPHTYTMGITTVNTKADDAIQYLPQEKHYTKSAEMLTPESYVKYDRQRDTYEMLVDPEYRVYVALSDQN
SYNELRERLETSRYHYSPALGLSECIADIRNVEIHTVGPGIEDAVDSAAYDDSEVVPKPGVTIKRERAPLYMESTDGGR
RTTEFGNITYAAGDDRLPVDESRTHTVGEHQVAFY
> 01cal CRISPR-associated protein, TM1801 family at complement(5304..6332)
MTDTNDATIQNRSEIAFVIDAKDTNPNGDPLTADNEPRIDPVTGQCVVTDVRLKRYLRDQLVEDDHVVLIANPNDEVLTR
KEMYDAVESEMGVSTDEAEPEELLEAFVKTAADVRYFGATISLDTDLAEDLPNQFEGPVQFNHGRSYHEVARNTESK
QLATVIANEDDDGGKKDQGTFATDNRISYGVIGFGGRINDNAAKDTHLTEDDVERLDTLCWRALKNQTVTRSKAG
QQPRLYIRVEYKQDGFEIGRLNDRIGVDSDLPEDEIRGTDDFNLDVSELVTTLADNDARIDTVHITADSAVTFALPDG
ETGDREALYTVLNDILGAEAVDAYDVYERYVN
Haloarcula sinaiiensis
> 01sin CRISPR-associated protein, TM1801 family at 481071..482093
MTTLNRSELLFVYDAQDCNPNGNPIGDNRPRRDPDTGKGIITDVRLKRYLRDQLQDDGFDIYVKKIAGESRTRTTLIKDVL
GGVSDAEDLEDIEDIGESFLEAATDVRYFGATLSFEASDDEEDEAFREALNSAFPNHYQGPVQFLPAKSLNEVEENEEY
DSLTSVISTGEGNRQGGFDLDDKRIKYGIFPFYGLVDNHGAETTNLSAADVERLDTLCWRALKNQTTSRSKLGQEPR
LYLRVEYAEDDYHIGGLQNLLDLDGGDNLLRSISDVVLDVSDLLSTLDKNRDRIETIHLIADDRMTLDTGDEAISGDQ
LATELDSRGLDVHEIDVVDERDLAR
> 00sin CRISPR-associated protein, TM1800 family at 482111..482908
MSPQIDADGIPDRCLSFTVSSTWGHFKRVGRTVTKQTYRIPPRTTVAGMLAAIVGAERNSYYETFGEDNAAIAITPESDLRTI
NIPTTGLGTDPDQDVTTTAKKRRNYSLTYQETTGDRQLHAYEVLADPSYRIDVALEDEEFYQKLHDHLEAGTSVYPP
SLGKSEYLATIENVQAGQEPEPASSSGPYDIDSIVPIELADAIPQGGVAYESERSPAVMERHQGGRRTTRFDDYVFTRR
SDGTVKTDAGTDVKPVSVGNRIVVFR
> 3sin CRISPR-associated helicase Cas3 at 482948..485755
MDLPLISHPDVDENDAYPSSQLTDDGALRLDAHNRTVGDRAVRLFGPDDDRTQYLRIAASLHDFGKVTPQFQAHVRPTE
NYDGPEDEKVHARLGALATWYVLEETDAPPRDQLAATLAVARHHQALPNAAQYTGETLARAVEASADVLQAQINR
IDETWPEAADDLFRCTGSDGSSWAEFAEWARSGAATAALQDCSVRETLSGVEPTPSRLPDFLYDRTIHYWAALTLAD
KSHAMGLSEERVFDFDTLDLETLERHINTLRQQEASSLHEAQLNDERERARRQALRGTHEWLNQEQTDIATLTLPTG
LGKTFTGLSAAFEARDILDETDTEHPDNPRPVIYALPYTSIIEQTRALFEDPELWGADPKKSALTVHHYLSETVVYGDE
YDAADVDESDAGEAAQFLGETWRDGTILTTFVQLFESLTGPSNRQGLKLPSLDSALIILDEPQALPKDWWDGIERLLQ
LLTGEYGAKVIAMTATQPSLFREMGTSSLLELGAAHAQTDCSHCRRQPAYETELPPISQESYFNEADRVRYTIDESALS
HRLETEEEFVEYDSAASRIHETAAQADSVLAICNTIESSRQLTQAVSQHSDAVHLGPVLESILTAPDANVAESEMNPGE
IVSEVLETVGIEDQCSDEPTARNQEVPSPQGPFVLTLNSRYRPFDRQVIIQLAEQLSTGPVPFILISTQAIEAGVDISFEM
VYRDIAPLDSIVQAAGRCNRAYEWGKNGGQVTIWTLAPTGPDAANPPAYWVYERGSTDAGMPDHLRLISDVLNKV
PGQRDIADIHLSKHAVDRYFEQLSRRSLDDGSIRDHIDHAEGRWLSQQSLIGGYETADVLVAVSESESQTLDRITQMF
TDGNPRAYDRLDDLSHLRVSLPAKIIDENPKLTRIDGQGRKDDGVNVFRFNGTGGLTYTLEDGGLRATEESIQDRFTI
> 4sin CRISPR-associated RecB family exonuclease at CLYQDICWM"
MTELSTVDRYIRDEREPGREPETRITGLMIQYYHVCQRELWFMAHGIDIDRETTNIQRGTHVDETSYQDSRQSFMIDNRIQL
DVLESGDIMEVKVSSTLEKPARMQLVFYLWFLDNIYDVDKDGVLAYPTERKRETIQLDAANIEAVENTIRGILDVVNR
> 1sin CRISPR-associated protein Cas1 at 486393..487385
MTKPNHHIFADGELSRSESTLRIDTLEGDTEYLPVESVDSLYLHGQIDFNTRTLGLLNEHGVPLHIFGWKDYYKGSYLPKRGQ
VSGNTVVEQVRAYDDRRRLNIGQKMIRASIHNMRRNLVYYDGRRGDFSDAIASLDEFKDETADTDDINQLRAVEGN
ARSTYYDCFDQILRDPFELSKREYNPPTNEANALVSFLNGMVYTTSVSAIRKTALDPTIGFVHEPGERRFTLSLDIADIF
KPILADRLIFRLVNRQQLSLSDFESELEGCLLTESGRMTVLEAFEETLDKTIEHPRLERKVSFKTLVQTDVYSLKKHILTG
ESYHPTERWW
Haloarcula vallismortis
Nothin
Haloferax denitrificans
> 01den CRISPR-associated protein, TM1801 family at 3381..4460
MSDTNDAVTNRSEIVFLYDAVDANPNGNPLSGSNRPRIDPQTQQAIVTDVRLKRYLRDQLDDDGHGVYIRNVQREGNQY
TRGELLEDRLKDVEPDEYDLDDGEESERFRNDVFGEFLDNSVDVRYFGATMSVDTDDVYAKHLPDHFTGPVQFSPG
KSVHAVNENEEYDSLTSVIATQENKQQGGFDLDDHRIQYGLIRFHGLVDEHGAADTNLTTGDVERLDTLCWRAIKN
QTISRSKIGQEPRLYCRVEYADESFHLGGLDRDLVLDDDRSKPDKELRTVRDLTLEIDGFVDRLAAASARINRIRIVAS
DVLDISYDGDVGSSELLYGALREAVGDDAVEVVDVYEEHAETLPAGDAA
> 00den CRISPR-associated protein, TM1800 family at 4462..5259
MEQESLDEWVDGGDDGRPERCLSFTVSGPWGHFRRVEGNVVKQTYRIIPRTTVAGLLAAVLGIERDGYYDLFAPGSSGIAIE
PVRAVRTLNMPMNTLSTASGNLQSLNGRGKISVKLPNPTALRQQHNYEVLVEPAYRVDVWLAETARYRELREMLEA
GKSHYVPSLGLSEHLAEIDYHGEFDVESASGAGRVEVDSAVPNAVDDVVLREGTRCQVEESPAFMRVDAGGRTTTEF
TTYAYNPDAEPLVVDGVDSVEVDGRNVVFV
> 3den CRISPR-associated helicase Cas3 at 5259..7904
MTERYSHPPNDVHDGVPLVDHLGDVAERVGYVVPADAKTPAGEPLRAVVETLAYVHDFGKATTYFQDYLLRSVEPRYEQY
RYHAPLGSFAAYYALSAQNFDPETCLAGFVAVAKHHGRLPDVAAYVFSRANRRENVSRGNQSTAEMQQVAIAKQL
KDIDEHAPELAREVFQSATDGEGSWSSFRGSFRELLTEVKKSTGSSATAITRETLSERCYGLVLECWGSLVLADKTSAA
AAASGSNASAGTYDAEKPTFERLEEYIESIERTADADRDGNRSERLNFHRAKARASVLSNVESFADADGGVATVTLPT
GMGKTLTGLSAALSVRDQLGGGRVVYALPFTSIIDQVVAEVEEIYQVDTTGRLLTAHHHLSEATIVDESDESADEADA
NDDVAGMLAESWRAGMTVTTFVQLFESLAGPANRQSMKLPALRDSIIVLDEPQSLPLDWWKLVPRLVRMLTEQYNA
TVIAMTATQPQLFDDATELVSAPETYFEATERVQYELDDSTTRYIDSREEPKSYAEAASAIVDETTDANGTNSDEAGE
TESVLAICNTIDSAQALTTHVTETLSDAISVGSVYADVLENADRNSTDIEVATVAKRVADAGGRPVLHLSTRLRPVDR
LRLIETAKLLTERGCSLVVVSTQLVEAGVDISFDRVYRDLAPIDSIVQAAGRCNRSFERDRGHVVVWWLEAPDEQTKT
PSEAVYNRGTALLPVATETLDDIGGTEGTIPEATVSKTAVEEYYRRLHDEKNIGKDAYVEYVNEARADELGELSLIEQR
RSVEVVVCRTTEERERVEAVRAAWQDYEFETVRRLMDSLKEASVSVPIYRGDSKEAKALSGLTRIYEDTETRWIDTRD
ARHGSYFDSTTGLAAESSVDNRIL
> 4den CRISPR-associated RecB family exonuclease at 7901..8449
MTTKDPVDRLLATARGTPVDEPFRVTGVMMQYYYVCERELWFESRSLEIDRENATVVRGMRVDETAYDEKRESLRLGMISL
DLLDDGRVVEVKPSSALTEPAEMQLSYYLWYLDRVAGVRRDGVLAHPRERRRESVELTEERAKKVESSIRRIHELVRRS
SPPPAERKPFCESCAYHDFCWC
> 1den CRISPR-associated protein Cas1 at 8440..9444
MVLTMDRNYHVFSDGRLERNDDTLRLVTEAGDKKYVPIENAEAFFLHGQIDFNTRLMSFLNQRTVALHVFGWEDYYAGSV
MPKRGQTSGRTVVEQVRAYEDSDHRRRLAAAMVSASIHNMRTNVVYYNNRDRELETEIDDLDAASARVDQTRPID
ELMGVEATARKAYYRSFNQILPDEFRLERREYNPPPNEVNSLISFGNALVYANCVSAIRATALDPSISYLHEPGERRYSL
SVDLADLFKPVLADRVLFRLINRKQITPSDFETDLGSCLLDEDGRRTYTKAFEETLERTVEHPKLNRKVSYQYLMRLEA
YKLKKHLLTGEEYEPFERWW
> 2den CRISPR-associated protein Cas2 at 9446..9706
MYVVMVYDLEAERTYKALKLGRRYLTHVQNSVLEGEISEGDLATLRNEVEDLLKSGESVIIYELSSDALLNRSVYGNDPTDEK
RFL
Haloferax mediteranei
> 2med CRISPR-associated protein Cas2 at complement(183157..183420)
MVYIIVVYDMRADRTRLMLNFLRKYLTHVQNSVFEGEVTEGDLETIRNHTQTLLNPDESTIIYRIGSEKYVDRTVIGEDPTDE
SRFL
> 1med CRISPR-associated protein Cas1 at complement(183424..184416)
MDRNYHIFSDGCLERHNDTVRLVTLDDEKKYLPIEKAEAIYLHGQIDYNTRLISFLNKHGTALHIFGWKDYYAGSVMPKRG
QTSGRTLVEQVRAYDSPAQRTDIARKFVDGSIHNMRANVSYYNSRGHDFDSELASLDAAGARLTETTAVEEIMGVE
ATARRAYYSTFDSILPDGFVFNGRRYNPPTNEVNSLISFGNSLVYANVVSGIRATALDPAVSFLHEPGERRYSLALDIA
DLFKPLLADRVTFRLLNRQQLTPADFETDLNSCLLTEHGRKTFSKAFEETLEQTVEHPRLNRKVSYQYLLRIEAYKLKK
HLLTGEEYVPFKRWW
> 4den CRISPR-associated RecB family exonuclease at CAYHDFCWSC"
MTDVDPVQRLLRTARDDARDESFRVTGVMMQYYVVCKRELWFHSRHIEIDRGNSAIVRGTHIDETAYSDKRRHVSIDSTIAI
DVLDDGRVMEVKPSSALVEPAKLQLLYYLWYLKHVVGVEKSGVLAHPTERKREDVELTDETEQKVEDAIRGVHEIIAR
> 3den CRISPR-associated helicase Cas3 at complement(184969..187563)
MTAEYERRYSHPAEDGRPAVLLFEHLRDVRDRVDMVIPEGATTPEGKPLCGVIRRLALVHDFGKATTFFQQYIGAQLGQPT
HDKLRYHAPLGSLAAYYVLRETGHSTATCLAGFVAVAKHHGRLPNVVEYVFKRMASPDPEKWMADKKQVENIHKN
APRLATAIFEEATGDDDAWLDFAQSCVNDESLFTEIADHVTRNGERPITEPTFLTDEFYGLMLECWGTLVLADKTSA
AGAPQASSVYDATNPRTADLTQYIDNLGDGNTDPDGSRTEQLNYYRSRARQDVLDSVTEFVESESDVATITLPTGM
GKTLTGLNAALEIRDQTGGDRIVYALPFTSIIDQVGAEVQDIFDTDGSDGIVALHHHLSDTRFGYSDGDDDASDLND
DIAGMLGESWRAGLTVTTFVQLFESLAGPRNTQSMKIPALRGNVIVLDEPQSLPLDWWKLVPRLVDVLTEQYGATVIS
MTATQPELFPAPMSLVSDAERYFTVAERVQYHLHDSVERFLRGEEQPLEYNDAANELVEVAQSGDSLLAICNTIDSA
RVLANAVTERIQAVNLAEQYFESLRNGSSDPVAETVQLVRQSSKQAFVHLSTRLRPTDRLALIRIIKQLRASGSPVIAVT
TQLVEAGVDISFENVYRDLAPVDSIVQAAGRCNRSFERELGAVTVWWLTQPAEQRHTPAVAVYDTQGPSLTPVTAS
ALDSVRDGQTKLAGQSVARAAVQEYYGTLHKEKNVGREEYHEYVDDADAESLGRLSLITQTKTVDVLICVTEADQT
LVESLEGAYENYDFQEVKRLLNATKPLRVSIPIYRDDSPEANVVTELRPLAGRKDESSIRVLDAGTRDFEKYFDHTTGF
VVADSTVEDRFL
> 00den CRISPR-associated protein, TM1800 family at complement(187565..188371)
MTQQTLTEWNDTQNESGTAGPPRCLSLTVRGPWGHFRRVEGNIVKQTYRIIPRTTVAGLLAAVLGIERDGYYELFARGRSAI
AIEPVAPLRTVNMPVNTLSTADESMKSLNPRGKISIKLPDPTKPRQQHNYEVLVDPAYRLYVWMSDSHWFETLHETL
DEGKSHYVPSLGLSEYLAEITYHGRFEVESGPTDTAVAVDSAVPNAVDHVVPDAESRCQIEESPAFMTVDGGGRTTT
DFTSYTYNPDAGPVRVRNPDTAIVDGNTVMFV
> 01den CRISPR-associated protein, TM1801 family at complement(188374..189429)
MTVHTDPVENRSEIVFLYDAVDANPNGNPLSGANRPRIDPQTQQAIVTDVRLKRYLRDQLDADGHGVYIRNVQEEGEQYT
REKLLEDRLKEVDPDEYDDDGELAGVVFQAFLEESTDVRYFGATMSVDLDGKYGSLPDHFTGPVQFSPGKSMHAVN
ENEEYDSLTSVIATQTGKEQGGFGLDDHRIQYGLIRFHGLVDEHAAEDTALTAEDVERLDTLCWRAIKNQTISRSKIG
QEPRFYLRVEYATESFHLGGLDKDLELDRTDGRTKSDDELRNVRDLTLSVDSLVDRLERSTNRIERVHVTASDVLSVS
HGDEVGGPEVLYEALEDRLGTDAVHVIDVYDEHVETLPN
Haloferax mucosum
> 01muc CRISPR-associated protein, TM1801 family at 3755..4810
MSEHSDHVENRSEIVFLYDAVDANPNGNPLSGANRPRIDPQTNQAIVTDVRLKRYLRDQLDADGHGVYIRNVQEEGNRYT
REKLLENRLKEVDPDEYDDDEELSEAVYRTFLEESVDVRYFGATMSVDLDDRYGSLPDHFTGPVQFSPGKSMHSVNE
NEEYDSLTSVIATQDDKQQGGFDLDDHRIQYGLIRFHGLVDEHAAADTNLTTEDVERLDTLCWRAIKNQTISRSKV
GQEPRLYLRVEYATDSFHLGGLDKDLDLDRKDGRTKPDDQLRTVRDLTLSVDSLVARLEKSANRIERVHVAASDVLS
VSHDDDVGGPEVLYDALADRLGTDAVHTIDVYDEHTTALPN
> 00muc CRISPR-associated protein, TM1800 family at 4813..5619
MDQQTLSKWDDSQGESETADPPRCLSLTIRGPWGHFRRVEGNIVKQTYRIIPRTTVAGLLAAVLGIERDGYYELFGPGHSAV
AIEPVEPLRTLNLPVNTLSTANESMKSMNAKGKISIKIPDPTKPRQQHNYEVLVDPAYRLYVWLRDSDWFDMLHEML
DEGKSHYVPSLGLSEYLAEIEYHGQFTVETGPADSVVAVDSAVPNAIDRIVPDAETRCQIEESPGFMTSDTGGRTTTG
FTSYAYNPDAGPVNVRNPDTHIIDGNTVMFV
> 3muc CRISPR-associated helicase Cas3 at 5678..8215
MLLFAHLKDVRDRIDMVVPEDTKTPDGMLLSGVIQRLALVHDVGKATTFFQQYIGESPGKPRYEKLRYHAPLGSLAAYYVL
DETGHSTATCLAGFVAVAKHHGRLPNVVEYLFNRTARPDPEKWDPVKAQISDIHEHAPKLVTAIFQEATGDADAWQ
NFAKACVNDESLFTEIADHITTNGERPITDPSFLTDEFYGLLLECWGTLVLADKTSAAGAPQSSTVYDGTTPKTTTLA
EYIDEIEDPNANPDGSQTERLNYFRSRARKDVLDSVSVFIESESNVATITLPTGMGKTLTGLNAALEIRDRTDRDRIVY
ALPFTSIIDQVGAEVQDIFDTDGTDGLLALHHHLSDTRFGYRDSDDDVSDLNDDIAGMLGESWRAGIMVTTFVQLF
ESLAGPRNTQSMKIPALRESVVVLDEPQSLPLDWWKLVPRLVAVLTEQYDATVISMTATQPELFSDSVSLVSDPEQYFSAAERVRYHLHDSVERFLQGDEQALEYDVAAAEIADVAASGESLLAICNTIDSARKLAEAVTDRIQSVSVAEQYFASLKSGS
ADPVEDTVRQIIQSPKQAFVHLSTRLRPTDRLALIRIIKKLRSAGYAVTAVTTQLVEAGVDISFENVYRDLAPVDSIVQA
AGRCNRSFEHERGDVTVWWLAQPAAQTHTPAVAVYDLQGPSLTPVTAKALDSVRQGQTTLPGKSVARTAVQNYY
DRLHTEKNVGKEAYSAYVDTADAESLGRLSLITQTKTVDVLICVTEADHELVSSLETAYEKYEFEEVKRLLTATKPLRV
SIPIYRDDSPEADAVTSLRPLADREEESQIRVLSAETRAFENYFDRSTGFVVTDRTVEDRFL
> 4muc CRISPR-associated RecB family exonuclease at 8212..8766
MTELDPVERLLQTARDETRDDSFRVTGVMMQYYVVCKRELWFHSRHIEIDRGNEAIVRGTHVDETAYSEKRRHVSIDSTIAI
DVLDDGRVLEVKPSSSLVEPAKLQLLYYLWYLKHVVGVEKSGVLAHPTERKREDVELTSAAEQRVEDAIRGVYEIITTE
SPPPAVQKPVCGSCAYHDFCWSC
> 1muc CRISPR-associated protein Cas1 at 8726..9760
MARVRITTSAGVANMDRNYHIFSDGCLERHNDTVRLVTLDDEKKYLPIEKAEAIFLHGQIDYNTRLVSFLNQHGTAIHVFG
WKDYYAGSIMPKRGQTSGRTLVEQVRAYDSTARRTDIARKFVEGSIHNMRANVSYYNSRGHEFDTELASLDAAADR
LTEADGVQESMGVEATARRAYYSTFDSILPDGFVFNGRQYNPPTNEVNSLISFGNSLVYANVVSAIRATALDPAVSYL
HEPGERRYSLALDIADLFKPLLADRVLFRLLNRKQLTPADFETDLNSCLLTEDGRKTFSKAFEETLEQTIEHPELNRKVS
YQYLLRIEAYKLKKHLLTGEEYVPFKRWW
> 2muc CRISPR-associated protein Cas2 at 9764..10027
MVYIIVVYDMRADRTRLMLNFLRKYLTHVQNSVFEGAVTEGDLETIRNHTQSLLKPDESAIIYRIGSDKYVERTVIGDDPTDD
SQFL
Haloferax sulfurifontis
> 4sul CRISPR-associated RecB family exonuclease at 8856..9479
MTGSSFKDELVYVSALNEFVYCPRRYYYQRYYDEIGEPYELVDGRSKHKHASRRGSWVTERYFRSDDLGMHGKIDLVDTET
GTPTPVERKRSESGSYYSSDEVQLAGYCMLLEDAIDESVNVGYIYLYSTDTRHSVRITQDHREAVREILRRIRSMSVDSI
PPLTQNQSKCEACSARTYCMPGETAVLSPSEAEGTGWEGTTPGDLI
> 1sul CRISPR-associated protein Cas1 at 9446..10471
MGGNDTRRFDMKESQGIFEDSVVYVTTQGSQVRIDGGQVVIYDVEGDDGELGAFPVEKLDTINVFGGVNFSTPFVATANE
HGIVLNYFTQNGKYRGSFVPERNTIAEVRRAQYALTGRDELAISKAMIAAKIRNARTILSRKGVRGTTVLKDLGERVSS
ASNKDDLRGIEGEAAERYFSRLDETLVDNWTFDTRSKRPPEDHINSLLSLTYVMMKNEVLSALRCYNLDPFLGVLHA
DRHGRPSLALDLQEEFRPLFCDAFVTRLVNRGTITHDQFGNDNRLTDDGFQAYLDKFDGYMNEELTHPYFEYRVSR
RKAIRQQVILLRKAITGELDDYHALEVSR
> 2sul CRISPR-associated protein Cas2 at 10480..10740
MTYDVSDDTNRRRVYRTLERYGAWRQYSVFELDVSKSERVELEDELESHIEPADGDRVRLYRLCEACQEATTDLGNEPPDE
QSNVI
Haloferax volcanii
Nothin'
Halorhabdus utahensis
> 2uta CRISPR-associated protein Cas2 at complement(2149888..2150151)
MVYVVAVYDVEADRTYLFLNFLRRYLTHVQNSVFEGEITEGDLEEVKGKLDSMLEPGESVIVYRMSSEQYVSRTVYGEDPTE
DSQFL
> 1 uta CRISPR-associated protein Cas1 at complement(2150153..2151145)
MNDNYHIFSDGRVERHNDTVRLVTEDDEKKYLPIENAEALYLHGQIDFNTRVISFLDDHGVAMHVFGWNDYYSGSIMPER
GQTSGQTVVEQVRAYDDEAHRGNIAREIVAGSIHNMRANVTYYDNRDYDLSATLESLDRRRDEIKSVASVEEAMGV
EASARRAYYAIFDQILPDAFVFGGRKYNPPNNKVNSLISFGNSLVYANIVSAIRATALDPTISYLHEPGERRYSLALDLA
DLFKPVLTDRVVFRLVNRGQLSDDDFDSEMNACLLTESGRETFSKEFEQTLDRTIEHPNLNRKVSYQYLLRVEAYKLK
KHLLTGESYESFERWW
> 4uta CRISPR-associated RecB family exonuclease at complement(2151149..2151727)
MSGHTDERTGETDPVDLLLESARDEAVESSFHVTGVMMQYYEVCERELWFESRNIEIDRENPNVVRGTHVDETAYDEKRRN
LSIDGRIAPDLLDDGRVVEVKPSSTLVEPARLQLLYYLWYLDRVVGVEKEGVLAHPTERKRESVELTDETVQQVEDAIR
GIYDVVRSETPPPATEKPFCESCAYYDFCWSC
> 3uta CRISPR-associated helicase Cas3 at complement(2151724..2154318)
MAERYSHPPEGGREGVALEVHLADVADRVAHVVPDDATTPTDGSLRSVVETLAWVHDFGKATTYFQEYLLEDSEADPPML
RHHAPIGSFAAYHALDTQGFDTETCLAGFVAVAKHHGRLPNVAEYVVDRTHRRDGDSRQNSVEKRQTVVLKQIGD
IHDTVPDLAETVFENATGGSGGWESFVRSYQSGSLLSEIEETVGTQTAGRGVDPDALSSSCYGLVLQCWSALVLADK
TSAAGAANESETYAPSQPGFETISDYIEDLEAGVDADKAGTKTERLNYHRSDARKNVLDNVASFAESGGGVATLTLP
TGMGKTLTGLSAAFALRDALGGNRVVYGLPFTSIIDQVVDEIQEIYETDTAGRLLTAHHHLSETTIRDTDDQSADDA
DRNDDVAGMLGESWRAGVTVTTFVQLFESLAGPQNRQSMKLPALRDAVVVLDEPQSLPLDWWKLVPRLVDVLTEQ
YNATVIAMTATQPRLFEDEFELVDDPDRYFEVVRRVSYELDDSTERYIESQSEPKSYAAAANELRAAVESGQTTLAVC
NTIDSARELTEQVGDGSFVDVGRLYDDELQEAGSADDVDPVELAKRVAATDDNALLHLSTRLRPADRLTLIETAKAL
TERGHPTLAISTQLVEAGVDISFDRVFRDLAPIDSIVQAAGRCNRSFEREQGVVTVWWLDVPDEQSKTPAEAVYNRG
TTLLPTVADTLRQIRDESGSLSETDVARRGVEWYFERLREDKDVGKQTYADWVDDAKAKELGTISLIDEQLSAEIVVT
RTPAERERAEAIRNAQRNFEFETLGQLVDETKPLRISVPYYSEDSETADAITDLPPLVEDEGIYELDVQQNPSHFDRTT
GFVVPEASVDHQFL
> 00uta CRISPR-associated protein, TM1800 family at complement(2154319..2155119)
MTMDQESSNGRTGTDGSDLDRCLSFEIRGPWGHFRRVEGNVVKQTYRIVPRTTVAGLIAAVLGIDRDGYYDLFGPEVSAIAI
QPVEELRTVNMPMNTLSTAAGDLTSLNPRGKISIKLPNPTKLRQQHNYEVLVDPAYRIDVALADDERYEQLRETLAA
GKSHYVPSLGLSEYLAEIDYLGEFDVKPGPASGTIAVDSAVPDAMDDVVLDPETRCQIEQSPAFMASDGSGRTTTEYT
TYTYNPDAEPLQVRDVPTSRVDDRTVVFV
> 01uta CRISPR-associated protein, TM1801 family at complement(2155116..2156195)
MSEPTQTVENRSEIVFLYDAVDANPNGNPLSGSNRPRIDPQTQQAIVTDVRLKRYLRDQLDDDGHGVYIRNVQEEGTQYT
RAELLEDRLKAVDPDDYDLDDDEAAAQFRDDVFGEYLEESADVRYFGATMSVDTDNAYAKHLPDHFTGPVQFSPG
KSIHAVNENEEYDSLTSVIATQEGKEQGGFDLDDHRIQYGLIRFHGLVDEHGAADTNLTRADVERLDTLCWRAIKNQ
TISRSKVGQEPRLYCRVEYGEESYHLGGLDKDLTLDDEASKDHDELRNIRDLTLEIDDFVDRISNASDQIERIRVVASD
VLELSHGTDSGGPDLLYDALRTAIGPDRVDVVDVYDEYPETLPQSTGE