Difference between revisions of "Cas-Protein aa sequences"
Line 13: | Line 13: | ||
[[Image:Cas33.png]] | [[Image:Cas33.png]] | ||
+ | |||
+ | [[Protein Sequences for Cas-proteins]] | ||
== Halomicrobium mukohataei: == | == Halomicrobium mukohataei: == |
Revision as of 17:43, 12 November 2009
Contents
Phylogenetic Tree
Cas3 protein alignment
Protein Sequences for Cas-proteins
Halomicrobium mukohataei:
>2muk CRISPR-associated protein Cas2 at complement(1004454..1004717)
MVYVVVVYDMEADRTHKMLKFLRRYLTHVQNSVLEGDVTEGDLEKIRSGVDDLLKPGESTIIYQISSEKLVDRSVFGDDPA
ADDQFL
> 1muk CRISPR-associated protein Cas1 at complement(1004721..1005719)
MIMNDNYHVFSDGRIERHDDTVRVITDDGEKKYLPVENAEAIFLHGQIEYNTRFVSFLNQEGVAVHVFGWHDHYAGSIMP
KRGQTSGQTLVDQVRAYDDPAHRLELAQAFVDGSIHNMRANVTYYDGRGHDFEDVLAELTEARSSLDRMETIDET
MGVEARARKAYYSTFDEILPDEFVFGGRQYDPPNNEVNSLISFGNSLVYANVVSAIRATALDPTVSFLHEPGERRYSLA
LDIADLFKPLLADRVIFRLVNRGQLTSDDFEAEMNACLLNEHGRKTYSKAYEETLDETIEHPDLGKKVSYQYLLRVEV
YKLKKHLLTGEEYVPFQRWW
> 4muk CRISPR-associated RecB family exonuclease at CAYHDFCWSC"
MTDSSGDPVDRFLAAARDESAELPFRLTGVMFQYYVVCERELWFLSRDVEIDRDTPAIVRGSDVDDSAYADKRRDVRVDGI
IAIDVLDSGEILEVKPSSSMTEPARLQLLFYLWYLDRVTGVEKTGVLAHPAEKRRETVELTPETSAEVESAIEGIRAVV
> 3muk CRISPR-associated helicase Cas3 at complement(1006273..1008843)
MGTEPISHPATDGDEATRLLDHLDDVAGRAESVVSADATTIGGDPLPEMTAVVARCHDFGKATTWFQAYVVGERDASDR
TNHSLLGAYLGYYVLDRLGYDSEDCLAALVALAKHHGQLPDVEEYVDGVSRFENTSEASNERQRILIEQVGNVDDH
RRQFAQSFVADATDGNGSWEEFAQSIEDESLFDTVHEHVSLFGFGSDRSAPSEEFYGLLLQLWSGLVFADKTVAAGIE
NGDLDGSEPDARLLSEYIADLGGDTDEDGQAAALDTLRGEARDDVLGGVERFRDSEVSIATLTLPTGMGKTLTGLS
AALALRDEGERVVYALPFTSVIDQVADELADVFDTDARDDLLTIHHHLAETVTKLGDPDEDPDEYARLAEMVAESW
RSGMTLTTFVQLFESLVGPSNTQSMKLPALYDSVIVLDEPQALPMQWWKLVRRIATILTEEYDATIVAMTATQPRAT
DEETASQALFDDAFELVDDVDRYFGHFERVEYDVHDSVLAFDDTDAIVDYETAGRTILDETGRSESTLAICNTIDSAT
ALTDAIEEREAVVDVGRCLELELDDGADVDALVERVESTLSSNERALVHLSTRLRPRDRLALVEATKRLTERDVPVLA
VSTQLIEAGVDISFDRVYRDIAPIDSVVQAAGRCNRSFESDLGTVTVWWLAPPAGTTTTPAQAVYDSEGVSTISLTAR
TLDAIGADDGTVAEQTMTRDAVEHYYGLVADRNPGDPEYVKWVDEANADALGGLSLIGQRESVDVVVCRTDGDR
ELADAMVAALDEFDYDAFGDYREAAKDITVSLPIYSRDSTEAETVRNLEPLGDADLRVLRRARGTSYFDETKGLAVD
EPCVDDRFL
> 00muk CRISPR-associated protein, TM1800 family at complement(1008843..1009520)
MKQTYRVIPRTTVAGLIAAMLGIERDGYYDLFAPGESLVAIEPTSELRTMKLPMNTLSTADEHMASLNPRGKLSIKLPDPSKP
RQQHNYEVLVEPAYRIDVWLADDERYDRLRSLLESGESYYVPSLGLSEHLATIDYHGEFPVEHGPDGETVAIDSTVPE
AVDSIVPDPETRYQIEQTPAFMERDDGGRTTSAFVSYAYNPDGGSLRVADVSTYSVDDRAVVFT
> 01muk CRISPR-associated protein, TM1801 family at complement(1009635..1010690)
MSEHYPTVSNRSEIVFAYDAVDANPNGNPLSGANRPRIDPHTDQAIVTDVRLKRYLRDQLQDDGHGVYIRNVKEDDGDQ
ATREDLLEDRLKDIDLDDVDEADIENAVFGQFLENSADVRYFGATMSIDMDDEKVDHLPDHFTGPVQFSPGKSLHR
VMENEEYNSLTSVIATGDDKAQGGFDLDDHRIQYAFIGFHGLVDEHGAEGTLLTDGDVRRLDTLCWRALKNQTISR
SKVGQEPRLYLRVEYADESFHLGGLDQDIDLDSSESAPVEEIRNVRDICVDVSALLERLDAASDRIDTVHVVASDVLEL
SVDGETGGPEFLYDALESRVGSESVREIDVYEDAKATMPEE
Haloarcula californiae:
Need to find Cas1 and Cas2
> 3cal CRISPR-associated helicase Cas3 at complement(1844..4447)
MTFEQYISHPAKTDDGEPTLLIGEGGQFDQDGHLQTVANRMVEACRGQTLADGTPAEPVAEVIGLTHDFAKLTHWAQKH
LRCQPFQHSDEYRYHAFPGALVTLYCLLNRRDGTGPLKDDHAAEVATLVVAGHHDIQSPPEPSKLAKNYGRDTLEV
QETYKRITEQFEDIGDRVPERADQIIHKATDGEGSWEDFREWHADRTAPIDGPHHHLIYFAQIGDRDTRDGYYSDVV
RLWTALKFADQTDASGLKNEDIGGTLLDRSELTRHIEDLDKGENVLAELNDLRNKARQEVTENVETLVKSNDVGLIT
LPTGFGKTYAGISAGLRAANINDSRLVYALPYTSILDQTASEIQSVFGVSPYSRAFTLHHHLSNTYTGLGDHYTDADI
GRSPGALHAESWLSGVTLTTTVQLFESLAAPTARQATRVPALHEAVVVIDEPQAIPEDWWQIIPELVELLVDSYNATVI
LMTATQPGLVKYGSNTLNTRELTDATDKYTDFLADHPRVRYRLHDTVRTDVGDEYATLDYATAGSRVSGAAEGG
RDVLAVCNTRASAEELYRSVTATVNTKEKVPVELGHLLHDYVEETGELPSPVELRRFAIDAVAERDVATLYAFLSGDV
RPDDRKLIIDTLYDDEIGDEDEPEPLLDSDWSVILVSTSVVEAGVDVSFDTVFRDYAPIPNIVQSGGRCNRSFDGETGD
VVVWRLAEPENGSAIPSLVIHGADGGDQLPLLLATGNVLRRHAARDGTIDESTMVSDTVSEFYESLFEGPLNPGNERL
ADAISSASMSELEGEHMIDEIEDYEDVVACLTDEERDDLLSSDPEAVSIRGHPGAQVSTDLEAWTKKVTIGNSQYLLV
DALSGSYHPVFGVR
> 00cal CRISPR-associated protein, TM1800 family at complement(4458..5279)
MTQQDLTDYASEGGDSSSSRYIPDTCIGFDVTADFAHFRKVGNNSAKPSYRIPPRTTVAGLLAGILGMPRDSYYDLFSPARS
AIAIVPKGLPHTYTMGITTVNTKADDAIQYLPQEKHYTKSAEMLTPESYVKYDRQRDTYEMLVDPEYRVYVALSDQN
SYNELRERLETSRYHYSPALGLSECIADIRNVEIHTVGPGIEDAVDSAAYDDSEVVPKPGVTIKRERAPLYMESTDGGR
RTTEFGNITYAAGDDRLPVDESRTHTVGEHQVAFY
> 01cal CRISPR-associated protein, TM1801 family at complement(5304..6332)
MTDTNDATIQNRSEIAFVIDAKDTNPNGDPLTADNEPRIDPVTGQCVVTDVRLKRYLRDQLVEDDHVVLIANPNDEVLTR
KEMYDAVESEMGVSTDEAEPEELLEAFVKTAADVRYFGATISLDTDLAEDLPNQFEGPVQFNHGRSYHEVARNTESK
QLATVIANEDDDGGKKDQGTFATDNRISYGVIGFGGRINDNAAKDTHLTEDDVERLDTLCWRALKNQTVTRSKAG
QQPRLYIRVEYKQDGFEIGRLNDRIGVDSDLPEDEIRGTDDFNLDVSELVTTLADNDARIDTVHITADSAVTFALPDG
ETGDREALYTVLNDILGAEAVDAYDVYERYVN
Haloarcula sinaiiensis
> 01sin CRISPR-associated protein, TM1801 family at 481071..482093
MTTLNRSELLFVYDAQDCNPNGNPIGDNRPRRDPDTGKGIITDVRLKRYLRDQLQDDGFDIYVKKIAGESRTRTTLIKDVL
GGVSDAEDLEDIEDIGESFLEAATDVRYFGATLSFEASDDEEDEAFREALNSAFPNHYQGPVQFLPAKSLNEVEENEEY
DSLTSVISTGEGNRQGGFDLDDKRIKYGIFPFYGLVDNHGAETTNLSAADVERLDTLCWRALKNQTTSRSKLGQEPR
LYLRVEYAEDDYHIGGLQNLLDLDGGDNLLRSISDVVLDVSDLLSTLDKNRDRIETIHLIADDRMTLDTGDEAISGDQ
LATELDSRGLDVHEIDVVDERDLAR
> 00sin CRISPR-associated protein, TM1800 family at 482111..482908
MSPQIDADGIPDRCLSFTVSSTWGHFKRVGRTVTKQTYRIPPRTTVAGMLAAIVGAERNSYYETFGEDNAAIAITPESDLRTI
NIPTTGLGTDPDQDVTTTAKKRRNYSLTYQETTGDRQLHAYEVLADPSYRIDVALEDEEFYQKLHDHLEAGTSVYPP
SLGKSEYLATIENVQAGQEPEPASSSGPYDIDSIVPIELADAIPQGGVAYESERSPAVMERHQGGRRTTRFDDYVFTRR
SDGTVKTDAGTDVKPVSVGNRIVVFR
> 3sin CRISPR-associated helicase Cas3 at 482948..485755
MDLPLISHPDVDENDAYPSSQLTDDGALRLDAHNRTVGDRAVRLFGPDDDRTQYLRIAASLHDFGKVTPQFQAHVRPTE
NYDGPEDEKVHARLGALATWYVLEETDAPPRDQLAATLAVARHHQALPNAAQYTGETLARAVEASADVLQAQINR
IDETWPEAADDLFRCTGSDGSSWAEFAEWARSGAATAALQDCSVRETLSGVEPTPSRLPDFLYDRTIHYWAALTLAD
KSHAMGLSEERVFDFDTLDLETLERHINTLRQQEASSLHEAQLNDERERARRQALRGTHEWLNQEQTDIATLTLPTG
LGKTFTGLSAAFEARDILDETDTEHPDNPRPVIYALPYTSIIEQTRALFEDPELWGADPKKSALTVHHYLSETVVYGDE
YDAADVDESDAGEAAQFLGETWRDGTILTTFVQLFESLTGPSNRQGLKLPSLDSALIILDEPQALPKDWWDGIERLLQ
LLTGEYGAKVIAMTATQPSLFREMGTSSLLELGAAHAQTDCSHCRRQPAYETELPPISQESYFNEADRVRYTIDESALS
HRLETEEEFVEYDSAASRIHETAAQADSVLAICNTIESSRQLTQAVSQHSDAVHLGPVLESILTAPDANVAESEMNPGE
IVSEVLETVGIEDQCSDEPTARNQEVPSPQGPFVLTLNSRYRPFDRQVIIQLAEQLSTGPVPFILISTQAIEAGVDISFEM
VYRDIAPLDSIVQAAGRCNRAYEWGKNGGQVTIWTLAPTGPDAANPPAYWVYERGSTDAGMPDHLRLISDVLNKV
PGQRDIADIHLSKHAVDRYFEQLSRRSLDDGSIRDHIDHAEGRWLSQQSLIGGYETADVLVAVSESESQTLDRITQMF
TDGNPRAYDRLDDLSHLRVSLPAKIIDENPKLTRIDGQGRKDDGVNVFRFNGTGGLTYTLEDGGLRATEESIQDRFTI
> 4sin CRISPR-associated RecB family exonuclease at CLYQDICWM"
MTELSTVDRYIRDEREPGREPETRITGLMIQYYHVCQRELWFMAHGIDIDRETTNIQRGTHVDETSYQDSRQSFMIDNRIQL
DVLESGDIMEVKVSSTLEKPARMQLVFYLWFLDNIYDVDKDGVLAYPTERKRETIQLDAANIEAVENTIRGILDVVNR
> 1sin CRISPR-associated protein Cas1 at 486393..487385
MTKPNHHIFADGELSRSESTLRIDTLEGDTEYLPVESVDSLYLHGQIDFNTRTLGLLNEHGVPLHIFGWKDYYKGSYLPKRGQ
VSGNTVVEQVRAYDDRRRLNIGQKMIRASIHNMRRNLVYYDGRRGDFSDAIASLDEFKDETADTDDINQLRAVEGN
ARSTYYDCFDQILRDPFELSKREYNPPTNEANALVSFLNGMVYTTSVSAIRKTALDPTIGFVHEPGERRFTLSLDIADIF
KPILADRLIFRLVNRQQLSLSDFESELEGCLLTESGRMTVLEAFEETLDKTIEHPRLERKVSFKTLVQTDVYSLKKHILTG
ESYHPTERWW
Haloarcula vallismortis
Nothin
Haloferax denitrificans
> 01den CRISPR-associated protein, TM1801 family at 3381..4460
MSDTNDAVTNRSEIVFLYDAVDANPNGNPLSGSNRPRIDPQTQQAIVTDVRLKRYLRDQLDDDGHGVYIRNVQREGNQY
TRGELLEDRLKDVEPDEYDLDDGEESERFRNDVFGEFLDNSVDVRYFGATMSVDTDDVYAKHLPDHFTGPVQFSPG
KSVHAVNENEEYDSLTSVIATQENKQQGGFDLDDHRIQYGLIRFHGLVDEHGAADTNLTTGDVERLDTLCWRAIKN
QTISRSKIGQEPRLYCRVEYADESFHLGGLDRDLVLDDDRSKPDKELRTVRDLTLEIDGFVDRLAAASARINRIRIVAS
DVLDISYDGDVGSSELLYGALREAVGDDAVEVVDVYEEHAETLPAGDAA
> 00den CRISPR-associated protein, TM1800 family at 4462..5259
MEQESLDEWVDGGDDGRPERCLSFTVSGPWGHFRRVEGNVVKQTYRIIPRTTVAGLLAAVLGIERDGYYDLFAPGSSGIAIE
PVRAVRTLNMPMNTLSTASGNLQSLNGRGKISVKLPNPTALRQQHNYEVLVEPAYRVDVWLAETARYRELREMLEA
GKSHYVPSLGLSEHLAEIDYHGEFDVESASGAGRVEVDSAVPNAVDDVVLREGTRCQVEESPAFMRVDAGGRTTTEF
TTYAYNPDAEPLVVDGVDSVEVDGRNVVFV
> 3den CRISPR-associated helicase Cas3 at 5259..7904
MTERYSHPPNDVHDGVPLVDHLGDVAERVGYVVPADAKTPAGEPLRAVVETLAYVHDFGKATTYFQDYLLRSVEPRYEQY
RYHAPLGSFAAYYALSAQNFDPETCLAGFVAVAKHHGRLPDVAAYVFSRANRRENVSRGNQSTAEMQQVAIAKQL
KDIDEHAPELAREVFQSATDGEGSWSSFRGSFRELLTEVKKSTGSSATAITRETLSERCYGLVLECWGSLVLADKTSAA
AAASGSNASAGTYDAEKPTFERLEEYIESIERTADADRDGNRSERLNFHRAKARASVLSNVESFADADGGVATVTLPT
GMGKTLTGLSAALSVRDQLGGGRVVYALPFTSIIDQVVAEVEEIYQVDTTGRLLTAHHHLSEATIVDESDESADEADA
NDDVAGMLAESWRAGMTVTTFVQLFESLAGPANRQSMKLPALRDSIIVLDEPQSLPLDWWKLVPRLVRMLTEQYNA
TVIAMTATQPQLFDDATELVSAPETYFEATERVQYELDDSTTRYIDSREEPKSYAEAASAIVDETTDANGTNSDEAGE
TESVLAICNTIDSAQALTTHVTETLSDAISVGSVYADVLENADRNSTDIEVATVAKRVADAGGRPVLHLSTRLRPVDR
LRLIETAKLLTERGCSLVVVSTQLVEAGVDISFDRVYRDLAPIDSIVQAAGRCNRSFERDRGHVVVWWLEAPDEQTKT
PSEAVYNRGTALLPVATETLDDIGGTEGTIPEATVSKTAVEEYYRRLHDEKNIGKDAYVEYVNEARADELGELSLIEQR
RSVEVVVCRTTEERERVEAVRAAWQDYEFETVRRLMDSLKEASVSVPIYRGDSKEAKALSGLTRIYEDTETRWIDTRD
ARHGSYFDSTTGLAAESSVDNRIL
> 4den CRISPR-associated RecB family exonuclease at 7901..8449
MTTKDPVDRLLATARGTPVDEPFRVTGVMMQYYYVCERELWFESRSLEIDRENATVVRGMRVDETAYDEKRESLRLGMISL
DLLDDGRVVEVKPSSALTEPAEMQLSYYLWYLDRVAGVRRDGVLAHPRERRRESVELTEERAKKVESSIRRIHELVRRS
SPPPAERKPFCESCAYHDFCWC
> 1den CRISPR-associated protein Cas1 at 8440..9444
MVLTMDRNYHVFSDGRLERNDDTLRLVTEAGDKKYVPIENAEAFFLHGQIDFNTRLMSFLNQRTVALHVFGWEDYYAGSV
MPKRGQTSGRTVVEQVRAYEDSDHRRRLAAAMVSASIHNMRTNVVYYNNRDRELETEIDDLDAASARVDQTRPID
ELMGVEATARKAYYRSFNQILPDEFRLERREYNPPPNEVNSLISFGNALVYANCVSAIRATALDPSISYLHEPGERRYSL
SVDLADLFKPVLADRVLFRLINRKQITPSDFETDLGSCLLDEDGRRTYTKAFEETLERTVEHPKLNRKVSYQYLMRLEA
YKLKKHLLTGEEYEPFERWW
> 2den CRISPR-associated protein Cas2 at 9446..9706
MYVVMVYDLEAERTYKALKLGRRYLTHVQNSVLEGEISEGDLATLRNEVEDLLKSGESVIIYELSSDALLNRSVYGNDPTDEK
RFL
Haloferax mediteranei
> 2med CRISPR-associated protein Cas2 at complement(183157..183420)
MVYIIVVYDMRADRTRLMLNFLRKYLTHVQNSVFEGEVTEGDLETIRNHTQTLLNPDESTIIYRIGSEKYVDRTVIGEDPTDE
SRFL
> 1med CRISPR-associated protein Cas1 at complement(183424..184416)
MDRNYHIFSDGCLERHNDTVRLVTLDDEKKYLPIEKAEAIYLHGQIDYNTRLISFLNKHGTALHIFGWKDYYAGSVMPKRG
QTSGRTLVEQVRAYDSPAQRTDIARKFVDGSIHNMRANVSYYNSRGHDFDSELASLDAAGARLTETTAVEEIMGVE
ATARRAYYSTFDSILPDGFVFNGRRYNPPTNEVNSLISFGNSLVYANVVSGIRATALDPAVSFLHEPGERRYSLALDIA
DLFKPLLADRVTFRLLNRQQLTPADFETDLNSCLLTEHGRKTFSKAFEETLEQTVEHPRLNRKVSYQYLLRIEAYKLKK
HLLTGEEYVPFKRWW
> 4med CRISPR-associated RecB family exonuclease at CAYHDFCWSC"
MTDVDPVQRLLRTARDDARDESFRVTGVMMQYYVVCKRELWFHSRHIEIDRGNSAIVRGTHIDETAYSDKRRHVSIDSTIAI
DVLDDGRVMEVKPSSALVEPAKLQLLYYLWYLKHVVGVEKSGVLAHPTERKREDVELTDETEQKVEDAIRGVHEIIAR
> 3med CRISPR-associated helicase Cas3 at complement(184969..187563)
MTAEYERRYSHPAEDGRPAVLLFEHLRDVRDRVDMVIPEGATTPEGKPLCGVIRRLALVHDFGKATTFFQQYIGAQLGQPT
HDKLRYHAPLGSLAAYYVLRETGHSTATCLAGFVAVAKHHGRLPNVVEYVFKRMASPDPEKWMADKKQVENIHKN
APRLATAIFEEATGDDDAWLDFAQSCVNDESLFTEIADHVTRNGERPITEPTFLTDEFYGLMLECWGTLVLADKTSA
AGAPQASSVYDATNPRTADLTQYIDNLGDGNTDPDGSRTEQLNYYRSRARQDVLDSVTEFVESESDVATITLPTGM
GKTLTGLNAALEIRDQTGGDRIVYALPFTSIIDQVGAEVQDIFDTDGSDGIVALHHHLSDTRFGYSDGDDDASDLND
DIAGMLGESWRAGLTVTTFVQLFESLAGPRNTQSMKIPALRGNVIVLDEPQSLPLDWWKLVPRLVDVLTEQYGATVIS
MTATQPELFPAPMSLVSDAERYFTVAERVQYHLHDSVERFLRGEEQPLEYNDAANELVEVAQSGDSLLAICNTIDSA
RVLANAVTERIQAVNLAEQYFESLRNGSSDPVAETVQLVRQSSKQAFVHLSTRLRPTDRLALIRIIKQLRASGSPVIAVT
TQLVEAGVDISFENVYRDLAPVDSIVQAAGRCNRSFERELGAVTVWWLTQPAEQRHTPAVAVYDTQGPSLTPVTAS
ALDSVRDGQTKLAGQSVARAAVQEYYGTLHKEKNVGREEYHEYVDDADAESLGRLSLITQTKTVDVLICVTEADQT
LVESLEGAYENYDFQEVKRLLNATKPLRVSIPIYRDDSPEANVVTELRPLAGRKDESSIRVLDAGTRDFEKYFDHTTGF
VVADSTVEDRFL
> 00med CRISPR-associated protein, TM1800 family at complement(187565..188371)
MTQQTLTEWNDTQNESGTAGPPRCLSLTVRGPWGHFRRVEGNIVKQTYRIIPRTTVAGLLAAVLGIERDGYYELFARGRSAI
AIEPVAPLRTVNMPVNTLSTADESMKSLNPRGKISIKLPDPTKPRQQHNYEVLVDPAYRLYVWMSDSHWFETLHETL
DEGKSHYVPSLGLSEYLAEITYHGRFEVESGPTDTAVAVDSAVPNAVDHVVPDAESRCQIEESPAFMTVDGGGRTTT
DFTSYTYNPDAGPVRVRNPDTAIVDGNTVMFV
> 01med CRISPR-associated protein, TM1801 family at complement(188374..189429)
MTVHTDPVENRSEIVFLYDAVDANPNGNPLSGANRPRIDPQTQQAIVTDVRLKRYLRDQLDADGHGVYIRNVQEEGEQYT
REKLLEDRLKEVDPDEYDDDGELAGVVFQAFLEESTDVRYFGATMSVDLDGKYGSLPDHFTGPVQFSPGKSMHAVN
ENEEYDSLTSVIATQTGKEQGGFGLDDHRIQYGLIRFHGLVDEHAAEDTALTAEDVERLDTLCWRAIKNQTISRSKIG
QEPRFYLRVEYATESFHLGGLDKDLELDRTDGRTKSDDELRNVRDLTLSVDSLVDRLERSTNRIERVHVTASDVLSVS
HGDEVGGPEVLYEALEDRLGTDAVHVIDVYDEHVETLPN
Haloferax mucosum
> 01muc CRISPR-associated protein, TM1801 family at 3755..4810
MSEHSDHVENRSEIVFLYDAVDANPNGNPLSGANRPRIDPQTNQAIVTDVRLKRYLRDQLDADGHGVYIRNVQEEGNRYT
REKLLENRLKEVDPDEYDDDEELSEAVYRTFLEESVDVRYFGATMSVDLDDRYGSLPDHFTGPVQFSPGKSMHSVNE
NEEYDSLTSVIATQDDKQQGGFDLDDHRIQYGLIRFHGLVDEHAAADTNLTTEDVERLDTLCWRAIKNQTISRSKV
GQEPRLYLRVEYATDSFHLGGLDKDLDLDRKDGRTKPDDQLRTVRDLTLSVDSLVARLEKSANRIERVHVAASDVLS
VSHDDDVGGPEVLYDALADRLGTDAVHTIDVYDEHTTALPN
> 00muc CRISPR-associated protein, TM1800 family at 4813..5619
MDQQTLSKWDDSQGESETADPPRCLSLTIRGPWGHFRRVEGNIVKQTYRIIPRTTVAGLLAAVLGIERDGYYELFGPGHSAV
AIEPVEPLRTLNLPVNTLSTANESMKSMNAKGKISIKIPDPTKPRQQHNYEVLVDPAYRLYVWLRDSDWFDMLHEML
DEGKSHYVPSLGLSEYLAEIEYHGQFTVETGPADSVVAVDSAVPNAIDRIVPDAETRCQIEESPGFMTSDTGGRTTTG
FTSYAYNPDAGPVNVRNPDTHIIDGNTVMFV
> 3muc CRISPR-associated helicase Cas3 at 5678..8215
MLLFAHLKDVRDRIDMVVPEDTKTPDGMLLSGVIQRLALVHDVGKATTFFQQYIGESPGKPRYEKLRYHAPLGSLAAYYVL
DETGHSTATCLAGFVAVAKHHGRLPNVVEYLFNRTARPDPEKWDPVKAQISDIHEHAPKLVTAIFQEATGDADAWQ
NFAKACVNDESLFTEIADHITTNGERPITDPSFLTDEFYGLLLECWGTLVLADKTSAAGAPQSSTVYDGTTPKTTTLA
EYIDEIEDPNANPDGSQTERLNYFRSRARKDVLDSVSVFIESESNVATITLPTGMGKTLTGLNAALEIRDRTDRDRIVY
ALPFTSIIDQVGAEVQDIFDTDGTDGLLALHHHLSDTRFGYRDSDDDVSDLNDDIAGMLGESWRAGIMVTTFVQLF
ESLAGPRNTQSMKIPALRESVVVLDEPQSLPLDWWKLVPRLVAVLTEQYDATVISMTATQPELFSDSVSLVSDPEQYFSAAERVRYHLHDSVERFLQGDEQALEYDVAAAEIADVAASGESLLAICNTIDSARKLAEAVTDRIQSVSVAEQYFASLKSGS
ADPVEDTVRQIIQSPKQAFVHLSTRLRPTDRLALIRIIKKLRSAGYAVTAVTTQLVEAGVDISFENVYRDLAPVDSIVQA
AGRCNRSFEHERGDVTVWWLAQPAAQTHTPAVAVYDLQGPSLTPVTAKALDSVRQGQTTLPGKSVARTAVQNYY
DRLHTEKNVGKEAYSAYVDTADAESLGRLSLITQTKTVDVLICVTEADHELVSSLETAYEKYEFEEVKRLLTATKPLRV
SIPIYRDDSPEADAVTSLRPLADREEESQIRVLSAETRAFENYFDRSTGFVVTDRTVEDRFL
> 4muc CRISPR-associated RecB family exonuclease at 8212..8766
MTELDPVERLLQTARDETRDDSFRVTGVMMQYYVVCKRELWFHSRHIEIDRGNEAIVRGTHVDETAYSEKRRHVSIDSTIAI
DVLDDGRVLEVKPSSSLVEPAKLQLLYYLWYLKHVVGVEKSGVLAHPTERKREDVELTSAAEQRVEDAIRGVYEIITTE
SPPPAVQKPVCGSCAYHDFCWSC
> 1muc CRISPR-associated protein Cas1 at 8726..9760
MARVRITTSAGVANMDRNYHIFSDGCLERHNDTVRLVTLDDEKKYLPIEKAEAIFLHGQIDYNTRLVSFLNQHGTAIHVFG
WKDYYAGSIMPKRGQTSGRTLVEQVRAYDSTARRTDIARKFVEGSIHNMRANVSYYNSRGHEFDTELASLDAAADR
LTEADGVQESMGVEATARRAYYSTFDSILPDGFVFNGRQYNPPTNEVNSLISFGNSLVYANVVSAIRATALDPAVSYL
HEPGERRYSLALDIADLFKPLLADRVLFRLLNRKQLTPADFETDLNSCLLTEDGRKTFSKAFEETLEQTIEHPELNRKVS
YQYLLRIEAYKLKKHLLTGEEYVPFKRWW
> 2muc CRISPR-associated protein Cas2 at 9764..10027
MVYIIVVYDMRADRTRLMLNFLRKYLTHVQNSVFEGAVTEGDLETIRNHTQSLLKPDESAIIYRIGSDKYVERTVIGDDPTDD
SQFL
Haloferax sulfurifontis
> 4sul CRISPR-associated RecB family exonuclease at 8856..9479
MTGSSFKDELVYVSALNEFVYCPRRYYYQRYYDEIGEPYELVDGRSKHKHASRRGSWVTERYFRSDDLGMHGKIDLVDTET
GTPTPVERKRSESGSYYSSDEVQLAGYCMLLEDAIDESVNVGYIYLYSTDTRHSVRITQDHREAVREILRRIRSMSVDSI
PPLTQNQSKCEACSARTYCMPGETAVLSPSEAEGTGWEGTTPGDLI
> 1sul CRISPR-associated protein Cas1 at 9446..10471
MGGNDTRRFDMKESQGIFEDSVVYVTTQGSQVRIDGGQVVIYDVEGDDGELGAFPVEKLDTINVFGGVNFSTPFVATANE
HGIVLNYFTQNGKYRGSFVPERNTIAEVRRAQYALTGRDELAISKAMIAAKIRNARTILSRKGVRGTTVLKDLGERVSS
ASNKDDLRGIEGEAAERYFSRLDETLVDNWTFDTRSKRPPEDHINSLLSLTYVMMKNEVLSALRCYNLDPFLGVLHA
DRHGRPSLALDLQEEFRPLFCDAFVTRLVNRGTITHDQFGNDNRLTDDGFQAYLDKFDGYMNEELTHPYFEYRVSR
RKAIRQQVILLRKAITGELDDYHALEVSR
> 2sul CRISPR-associated protein Cas2 at 10480..10740
MTYDVSDDTNRRRVYRTLERYGAWRQYSVFELDVSKSERVELEDELESHIEPADGDRVRLYRLCEACQEATTDLGNEPPDE
QSNVI
Haloferax volcanii
Nothin'
Halorhabdus utahensis
> 2uta CRISPR-associated protein Cas2 at complement(2149888..2150151)
MVYVVAVYDVEADRTYLFLNFLRRYLTHVQNSVFEGEITEGDLEEVKGKLDSMLEPGESVIVYRMSSEQYVSRTVYGEDPTE
DSQFL
> 1uta CRISPR-associated protein Cas1 at complement(2150153..2151145)
MNDNYHIFSDGRVERHNDTVRLVTEDDEKKYLPIENAEALYLHGQIDFNTRVISFLDDHGVAMHVFGWNDYYSGSIMPER
GQTSGQTVVEQVRAYDDEAHRGNIAREIVAGSIHNMRANVTYYDNRDYDLSATLESLDRRRDEIKSVASVEEAMGV
EASARRAYYAIFDQILPDAFVFGGRKYNPPNNKVNSLISFGNSLVYANIVSAIRATALDPTISYLHEPGERRYSLALDLA
DLFKPVLTDRVVFRLVNRGQLSDDDFDSEMNACLLTESGRETFSKEFEQTLDRTIEHPNLNRKVSYQYLLRVEAYKLK
KHLLTGESYESFERWW
> 4uta CRISPR-associated RecB family exonuclease at complement(2151149..2151727)
MSGHTDERTGETDPVDLLLESARDEAVESSFHVTGVMMQYYEVCERELWFESRNIEIDRENPNVVRGTHVDETAYDEKRRN
LSIDGRIAPDLLDDGRVVEVKPSSTLVEPARLQLLYYLWYLDRVVGVEKEGVLAHPTERKRESVELTDETVQQVEDAIR
GIYDVVRSETPPPATEKPFCESCAYYDFCWSC
> 3uta CRISPR-associated helicase Cas3 at complement(2151724..2154318)
MAERYSHPPEGGREGVALEVHLADVADRVAHVVPDDATTPTDGSLRSVVETLAWVHDFGKATTYFQEYLLEDSEADPPML
RHHAPIGSFAAYHALDTQGFDTETCLAGFVAVAKHHGRLPNVAEYVVDRTHRRDGDSRQNSVEKRQTVVLKQIGD
IHDTVPDLAETVFENATGGSGGWESFVRSYQSGSLLSEIEETVGTQTAGRGVDPDALSSSCYGLVLQCWSALVLADK
TSAAGAANESETYAPSQPGFETISDYIEDLEAGVDADKAGTKTERLNYHRSDARKNVLDNVASFAESGGGVATLTLP
TGMGKTLTGLSAAFALRDALGGNRVVYGLPFTSIIDQVVDEIQEIYETDTAGRLLTAHHHLSETTIRDTDDQSADDA
DRNDDVAGMLGESWRAGVTVTTFVQLFESLAGPQNRQSMKLPALRDAVVVLDEPQSLPLDWWKLVPRLVDVLTEQ
YNATVIAMTATQPRLFEDEFELVDDPDRYFEVVRRVSYELDDSTERYIESQSEPKSYAAAANELRAAVESGQTTLAVC
NTIDSARELTEQVGDGSFVDVGRLYDDELQEAGSADDVDPVELAKRVAATDDNALLHLSTRLRPADRLTLIETAKAL
TERGHPTLAISTQLVEAGVDISFDRVFRDLAPIDSIVQAAGRCNRSFEREQGVVTVWWLDVPDEQSKTPAEAVYNRG
TTLLPTVADTLRQIRDESGSLSETDVARRGVEWYFERLREDKDVGKQTYADWVDDAKAKELGTISLIDEQLSAEIVVT
RTPAERERAEAIRNAQRNFEFETLGQLVDETKPLRISVPYYSEDSETADAITDLPPLVEDEGIYELDVQQNPSHFDRTT
GFVVPEASVDHQFL
> 00uta CRISPR-associated protein, TM1800 family at complement(2154319..2155119)
MTMDQESSNGRTGTDGSDLDRCLSFEIRGPWGHFRRVEGNVVKQTYRIVPRTTVAGLIAAVLGIDRDGYYDLFGPEVSAIAI
QPVEELRTVNMPMNTLSTAAGDLTSLNPRGKISIKLPNPTKLRQQHNYEVLVDPAYRIDVALADDERYEQLRETLAA
GKSHYVPSLGLSEYLAEIDYLGEFDVKPGPASGTIAVDSAVPDAMDDVVLDPETRCQIEQSPAFMASDGSGRTTTEYT
TYTYNPDAEPLQVRDVPTSRVDDRTVVFV
> 01uta CRISPR-associated protein, TM1801 family at complement(2155116..2156195)
MSEPTQTVENRSEIVFLYDAVDANPNGNPLSGSNRPRIDPQTQQAIVTDVRLKRYLRDQLDDDGHGVYIRNVQEEGTQYT
RAELLEDRLKAVDPDDYDLDDDEAAAQFRDDVFGEYLEESADVRYFGATMSVDTDNAYAKHLPDHFTGPVQFSPG
KSIHAVNENEEYDSLTSVIATQEGKEQGGFDLDDHRIQYGLIRFHGLVDEHGAADTNLTRADVERLDTLCWRAIKNQ
TISRSKVGQEPRLYCRVEYGEESYHLGGLDKDLTLDDEASKDHDELRNIRDLTLEIDDFVDRISNASDQIERIRVVASD
VLELSHGTDSGGPDLLYDALRTAIGPDRVDVVDVYDEYPETLPQSTGE