Protein Sequences for Cas-proteins

From GcatWiki

Halomicrobium mukohataei

> 2muk CRISPR-associated protein Cas2 at complement(1004454..1004717)
MVYVVVVYDMEADRTHKMLKFLRRYLTHVQNSVLEGDVTEGDLEKIRSGVDDLLKPGESTIIYQISSEKLVDRSVFGDDPA
ADDQFL
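
The header lines on this page share one layout: an ID, a description, the word "at", and a nucleotide range that is sometimes wrapped in complement(...). As a minimal sketch, assuming that layout holds and that each range includes the stop codon (neither is stated on the page), a header can be split into fields and the listed sequence length compared with the length the coordinates imply. The names `HEADER_RE`, `parse_header`, and `expected_protein_length` are hypothetical helpers, not part of any existing tool.

```python
import re

# Pattern inferred from the headers on this page (an assumption, not a formal spec):
#   > ID description at [complement(]start..end[)]
HEADER_RE = re.compile(
    r"^>\s*(?P<id>\S+)\s+(?P<desc>.+?)\s+at\s+"
    r"(?P<comp>complement\()?(?P<start>\d+)\.\.(?P<end>\d+)\)?\s*$"
)


def parse_header(line):
    """Return the header fields as a dict, or None if the line does not fit the pattern."""
    m = HEADER_RE.match(line)
    if m is None:
        return None
    return {
        "id": m.group("id"),
        "description": m.group("desc"),
        "strand": "-" if m.group("comp") else "+",
        "start": int(m.group("start")),
        "end": int(m.group("end")),
    }


def expected_protein_length(start, end):
    """Residues implied by an inclusive nucleotide range.

    Assumes the range covers a whole reading frame including the stop codon.
    """
    n_nt = end - start + 1
    if n_nt % 3 != 0:
        raise ValueError("range length is not a multiple of 3")
    return n_nt // 3 - 1


header = "> 2muk CRISPR-associated protein Cas2 at complement(1004454..1004717)"
sequence = (
    "MVYVVVVYDMEADRTHKMLKFLRRYLTHVQNSVLEGDVTEGDLEKIRSGVDDLLKPGESTIIYQISSEKLVDRSVFGDDPA"
    "ADDQFL"
)

fields = parse_header(header)
implied = expected_protein_length(fields["start"], fields["end"])
print(fields["id"], fields["strand"], "listed:", len(sequence), "implied:", implied)
```

A mismatch between the two lengths, or a header for which `parse_header` returns None, points to a record that may have been truncated or damaged when it was pasted.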

> 1muk CRISPR-associated protein Cas1 at complement(1004721..1005719)
MIMNDNYHVFSDGRIERHDDTVRVITDDGEKKYLPVENAEAIFLHGQIEYNTRFVSFLNQEGVAVHVFGWHDHYAGSIMP
KRGQTSGQTLVDQVRAYDDPAHRLELAQAFVDGSIHNMRANVTYYDGRGHDFEDVLAELTEARSSLDRMETIDET
MGVEARARKAYYSTFDEILPDEFVFGGRQYDPPNNEVNSLISFGNSLVYANVVSAIRATALDPTVSFLHEPGERRYSLA
LDIADLFKPLLADRVIFRLVNRGQLTSDDFEAEMNACLLNEHGRKTYSKAYEETLDETIEHPDLGKKVSYQYLLRVEV
YKLKKHLLTGEEYVPFQRWW

> 4muk CRISPR-associated RecB family exonuclease at [location missing] (C-terminal residues: ...CAYHDFCWSC)
MTDSSGDPVDRFLAAARDESAELPFRLTGVMFQYYVVCERELWFLSRDVEIDRDTPAIVRGSDVDDSAYADKRRDVRVDGI
IAIDVLDSGEILEVKPSSSMTEPARLQLLFYLWYLDRVTGVEKTGVLAHPAEKRRETVELTPETSAEVESAIEGIRAVV

> 3muk CRISPR-associated helicase Cas3 at complement(1006273..1008843)
MGTEPISHPATDGDEATRLLDHLDDVAGRAESVVSADATTIGGDPLPEMTAVVARCHDFGKATTWFQAYVVGERDASDR
TNHSLLGAYLGYYVLDRLGYDSEDCLAALVALAKHHGQLPDVEEYVDGVSRFENTSEASNERQRILIEQVGNVDDH
RRQFAQSFVADATDGNGSWEEFAQSIEDESLFDTVHEHVSLFGFGSDRSAPSEEFYGLLLQLWSGLVFADKTVAAGIE
NGDLDGSEPDARLLSEYIADLGGDTDEDGQAAALDTLRGEARDDVLGGVERFRDSEVSIATLTLPTGMGKTLTGLS
AALALRDEGERVVYALPFTSVIDQVADELADVFDTDARDDLLTIHHHLAETVTKLGDPDEDPDEYARLAEMVAESW
RSGMTLTTFVQLFESLVGPSNTQSMKLPALYDSVIVLDEPQALPMQWWKLVRRIATILTEEYDATIVAMTATQPRAT
DEETASQALFDDAFELVDDVDRYFGHFERVEYDVHDSVLAFDDTDAIVDYETAGRTILDETGRSESTLAICNTIDSAT
ALTDAIEEREAVVDVGRCLELELDDGADVDALVERVESTLSSNERALVHLSTRLRPRDRLALVEATKRLTERDVPVLA
VSTQLIEAGVDISFDRVYRDIAPIDSVVQAAGRCNRSFESDLGTVTVWWLAPPAGTTTTPAQAVYDSEGVSTISLTAR
TLDAIGADDGTVAEQTMTRDAVEHYYGLVADRNPGDPEYVKWVDEANADALGGLSLIGQRESVDVVVCRTDGDR
ELADAMVAALDEFDYDAFGDYREAAKDITVSLPIYSRDSTEAETVRNLEPLGDADLRVLRRARGTSYFDETKGLAVD
EPCVDDRFL

> 00muk CRISPR-associated protein, TM1800 family at complement(1008843..1009520)
MKQTYRVIPRTTVAGLIAAMLGIERDGYYDLFAPGESLVAIEPTSELRTMKLPMNTLSTADEHMASLNPRGKLSIKLPDPSKP
RQQHNYEVLVEPAYRIDVWLADDERYDRLRSLLESGESYYVPSLGLSEHLATIDYHGEFPVEHGPDGETVAIDSTVPE
AVDSIVPDPETRYQIEQTPAFMERDDGGRTTSAFVSYAYNPDGGSLRVADVSTYSVDDRAVVFT

> 01muk CRISPR-associated protein, TM1801 family at complement(1009635..1010690)
MSEHYPTVSNRSEIVFAYDAVDANPNGNPLSGANRPRIDPHTDQAIVTDVRLKRYLRDQLQDDGHGVYIRNVKEDDGDQ
ATREDLLEDRLKDIDLDDVDEADIENAVFGQFLENSADVRYFGATMSIDMDDEKVDHLPDHFTGPVQFSPGKSLHR
VMENEEYNSLTSVIATGDDKAQGGFDLDDHRIQYAFIGFHGLVDEHGAEGTLLTDGDVRRLDTLCWRALKNQTISR
SKVGQEPRLYLRVEYADESFHLGGLDQDIDLDSSESAPVEEIRNVRDICVDVSALLERLDAASDRIDTVHVVASDVLEL
SVDGETGGPEFLYDALESRVGSESVREIDVYEDAKATMPEE

Haloarcula californiae

Cas1 and Cas2 still need to be found.

> 3cal CRISPR-associated helicase Cas3 at complement(1844..4447)
MTFEQYISHPAKTDDGEPTLLIGEGGQFDQDGHLQTVANRMVEACRGQTLADGTPAEPVAEVIGLTHDFAKLTHWAQKH
LRCQPFQHSDEYRYHAFPGALVTLYCLLNRRDGTGPLKDDHAAEVATLVVAGHHDIQSPPEPSKLAKNYGRDTLEV
QETYKRITEQFEDIGDRVPERADQIIHKATDGEGSWEDFREWHADRTAPIDGPHHHLIYFAQIGDRDTRDGYYSDVV
RLWTALKFADQTDASGLKNEDIGGTLLDRSELTRHIEDLDKGENVLAELNDLRNKARQEVTENVETLVKSNDVGLIT
LPTGFGKTYAGISAGLRAANINDSRLVYALPYTSILDQTASEIQSVFGVSPYSRAFTLHHHLSNTYTGLGDHYTDADI
GRSPGALHAESWLSGVTLTTTVQLFESLAAPTARQATRVPALHEAVVVIDEPQAIPEDWWQIIPELVELLVDSYNATVI
LMTATQPGLVKYGSNTLNTRELTDATDKYTDFLADHPRVRYRLHDTVRTDVGDEYATLDYATAGSRVSGAAEGG
RDVLAVCNTRASAEELYRSVTATVNTKEKVPVELGHLLHDYVEETGELPSPVELRRFAIDAVAERDVATLYAFLSGDV
RPDDRKLIIDTLYDDEIGDEDEPEPLLDSDWSVILVSTSVVEAGVDVSFDTVFRDYAPIPNIVQSGGRCNRSFDGETGD
VVVWRLAEPENGSAIPSLVIHGADGGDQLPLLLATGNVLRRHAARDGTIDESTMVSDTVSEFYESLFEGPLNPGNERL
ADAISSASMSELEGEHMIDEIEDYEDVVACLTDEERDDLLSSDPEAVSIRGHPGAQVSTDLEAWTKKVTIGNSQYLLV
DALSGSYHPVFGVR

> 00cal CRISPR-associated protein, TM1800 family at complement(4458..5279)
MTQQDLTDYASEGGDSSSSRYIPDTCIGFDVTADFAHFRKVGNNSAKPSYRIPPRTTVAGLLAGILGMPRDSYYDLFSPARS
AIAIVPKGLPHTYTMGITTVNTKADDAIQYLPQEKHYTKSAEMLTPESYVKYDRQRDTYEMLVDPEYRVYVALSDQN
SYNELRERLETSRYHYSPALGLSECIADIRNVEIHTVGPGIEDAVDSAAYDDSEVVPKPGVTIKRERAPLYMESTDGGR
RTTEFGNITYAAGDDRLPVDESRTHTVGEHQVAFY

> 01cal CRISPR-associated protein, TM1801 family at complement(5304..6332)
MTDTNDATIQNRSEIAFVIDAKDTNPNGDPLTADNEPRIDPVTGQCVVTDVRLKRYLRDQLVEDDHVVLIANPNDEVLTR
KEMYDAVESEMGVSTDEAEPEELLEAFVKTAADVRYFGATISLDTDLAEDLPNQFEGPVQFNHGRSYHEVARNTESK
QLATVIANEDDDGGKKDQGTFATDNRISYGVIGFGGRINDNAAKDTHLTEDDVERLDTLCWRALKNQTVTRSKAG
QQPRLYIRVEYKQDGFEIGRLNDRIGVDSDLPEDEIRGTDDFNLDVSELVTTLADNDARIDTVHITADSAVTFALPDG
ETGDREALYTVLNDILGAEAVDAYDVYERYVN
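
Which Cas proteins are still missing for a species can be tallied from the record IDs. The sketch below assumes that the numeric prefix of each ID encodes the protein type (1 = Cas1, 2 = Cas2, 3 = Cas3, 4 = RecB family exonuclease, 00 = TM1800 family, 01 = TM1801 family) and that the letters after it are a species tag. That reading is inferred from the headers on this page, not documented anywhere on it. `PREFIX_TO_PROTEIN`, `inventory`, and `missing` are hypothetical names.

```python
import re

# Assumed meaning of the numeric ID prefixes, inferred from this page's headers.
PREFIX_TO_PROTEIN = {
    "1": "Cas1",
    "2": "Cas2",
    "3": "Cas3",
    "4": "RecB family exonuclease",
    "00": "TM1800 family",
    "01": "TM1801 family",
}

# ID = numeric prefix followed by a lowercase species tag, e.g. "3cal" or "00cal".
ID_RE = re.compile(r"^>\s*(\d+)([a-z]+)\b")


def inventory(header_lines):
    """Map each species tag to the set of protein types seen in its headers."""
    found = {}
    for line in header_lines:
        m = ID_RE.match(line)
        if m is None:
            continue  # not a record header
        prefix, tag = m.groups()
        found.setdefault(tag, set()).add(PREFIX_TO_PROTEIN.get(prefix, prefix))
    return found


def missing(found, tag):
    """Protein types from PREFIX_TO_PROTEIN with no record for the given tag."""
    return sorted(set(PREFIX_TO_PROTEIN.values()) - found.get(tag, set()))


californiae_headers = [
    "> 3cal CRISPR-associated helicase Cas3 at complement(1844..4447)",
    "> 00cal CRISPR-associated protein, TM1800 family at complement(4458..5279)",
    "> 01cal CRISPR-associated protein, TM1801 family at complement(5304..6332)",
]

found = inventory(californiae_headers)
print(missing(found, "cal"))
```

Running the same function over every header on the page gives a per-species checklist. A type reported as missing only means no record is listed here, not that the genome lacks the gene.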

Haloarcula sinaiiensis

> 01sin CRISPR-associated protein, TM1801 family at 481071..482093
MTTLNRSELLFVYDAQDCNPNGNPIGDNRPRRDPDTGKGIITDVRLKRYLRDQLQDDGFDIYVKKIAGESRTRTTLIKDVL
GGVSDAEDLEDIEDIGESFLEAATDVRYFGATLSFEASDDEEDEAFREALNSAFPNHYQGPVQFLPAKSLNEVEENEEY
DSLTSVISTGEGNRQGGFDLDDKRIKYGIFPFYGLVDNHGAETTNLSAADVERLDTLCWRALKNQTTSRSKLGQEPR
LYLRVEYAEDDYHIGGLQNLLDLDGGDNLLRSISDVVLDVSDLLSTLDKNRDRIETIHLIADDRMTLDTGDEAISGDQ
LATELDSRGLDVHEIDVVDERDLAR

> 00sin CRISPR-associated protein, TM1800 family at 482111..482908
MSPQIDADGIPDRCLSFTVSSTWGHFKRVGRTVTKQTYRIPPRTTVAGMLAAIVGAERNSYYETFGEDNAAIAITPESDLRTI
NIPTTGLGTDPDQDVTTTAKKRRNYSLTYQETTGDRQLHAYEVLADPSYRIDVALEDEEFYQKLHDHLEAGTSVYPP
SLGKSEYLATIENVQAGQEPEPASSSGPYDIDSIVPIELADAIPQGGVAYESERSPAVMERHQGGRRTTRFDDYVFTRR
SDGTVKTDAGTDVKPVSVGNRIVVFR

> 3sin CRISPR-associated helicase Cas3 at 482948..485755
MDLPLISHPDVDENDAYPSSQLTDDGALRLDAHNRTVGDRAVRLFGPDDDRTQYLRIAASLHDFGKVTPQFQAHVRPTE
NYDGPEDEKVHARLGALATWYVLEETDAPPRDQLAATLAVARHHQALPNAAQYTGETLARAVEASADVLQAQINR
IDETWPEAADDLFRCTGSDGSSWAEFAEWARSGAATAALQDCSVRETLSGVEPTPSRLPDFLYDRTIHYWAALTLAD
KSHAMGLSEERVFDFDTLDLETLERHINTLRQQEASSLHEAQLNDERERARRQALRGTHEWLNQEQTDIATLTLPTG
LGKTFTGLSAAFEARDILDETDTEHPDNPRPVIYALPYTSIIEQTRALFEDPELWGADPKKSALTVHHYLSETVVYGDE
YDAADVDESDAGEAAQFLGETWRDGTILTTFVQLFESLTGPSNRQGLKLPSLDSALIILDEPQALPKDWWDGIERLLQ
LLTGEYGAKVIAMTATQPSLFREMGTSSLLELGAAHAQTDCSHCRRQPAYETELPPISQESYFNEADRVRYTIDESALS
HRLETEEEFVEYDSAASRIHETAAQADSVLAICNTIESSRQLTQAVSQHSDAVHLGPVLESILTAPDANVAESEMNPGE
IVSEVLETVGIEDQCSDEPTARNQEVPSPQGPFVLTLNSRYRPFDRQVIIQLAEQLSTGPVPFILISTQAIEAGVDISFEM
VYRDIAPLDSIVQAAGRCNRAYEWGKNGGQVTIWTLAPTGPDAANPPAYWVYERGSTDAGMPDHLRLISDVLNKV
PGQRDIADIHLSKHAVDRYFEQLSRRSLDDGSIRDHIDHAEGRWLSQQSLIGGYETADVLVAVSESESQTLDRITQMF
TDGNPRAYDRLDDLSHLRVSLPAKIIDENPKLTRIDGQGRKDDGVNVFRFNGTGGLTYTLEDGGLRATEESIQDRFTI

> 4sin CRISPR-associated RecB family exonuclease at [location missing] (C-terminal residues: ...CLYQDICWM)
MTELSTVDRYIRDEREPGREPETRITGLMIQYYHVCQRELWFMAHGIDIDRETTNIQRGTHVDETSYQDSRQSFMIDNRIQL
DVLESGDIMEVKVSSTLEKPARMQLVFYLWFLDNIYDVDKDGVLAYPTERKRETIQLDAANIEAVENTIRGILDVVNR

> 1sin CRISPR-associated protein Cas1 at 486393..487385
MTKPNHHIFADGELSRSESTLRIDTLEGDTEYLPVESVDSLYLHGQIDFNTRTLGLLNEHGVPLHIFGWKDYYKGSYLPKRGQ
VSGNTVVEQVRAYDDRRRLNIGQKMIRASIHNMRRNLVYYDGRRGDFSDAIASLDEFKDETADTDDINQLRAVEGN
ARSTYYDCFDQILRDPFELSKREYNPPTNEANALVSFLNGMVYTTSVSAIRKTALDPTIGFVHEPGERRFTLSLDIADIF
KPILADRLIFRLVNRQQLSLSDFESELEGCLLTESGRMTVLEAFEETLDKTIEHPRLERKVSFKTLVQTDVYSLKKHILTG
ESYHPTERWW


Haloarcula vallismortis

No Cas proteins found.

Haloferax denitrificans

> 01den CRISPR-associated protein, TM1801 family at 3381..4460
MSDTNDAVTNRSEIVFLYDAVDANPNGNPLSGSNRPRIDPQTQQAIVTDVRLKRYLRDQLDDDGHGVYIRNVQREGNQY
TRGELLEDRLKDVEPDEYDLDDGEESERFRNDVFGEFLDNSVDVRYFGATMSVDTDDVYAKHLPDHFTGPVQFSPG
KSVHAVNENEEYDSLTSVIATQENKQQGGFDLDDHRIQYGLIRFHGLVDEHGAADTNLTTGDVERLDTLCWRAIKN
QTISRSKIGQEPRLYCRVEYADESFHLGGLDRDLVLDDDRSKPDKELRTVRDLTLEIDGFVDRLAAASARINRIRIVAS
DVLDISYDGDVGSSELLYGALREAVGDDAVEVVDVYEEHAETLPAGDAA

> 00den CRISPR-associated protein, TM1800 family at 4462..5259
MEQESLDEWVDGGDDGRPERCLSFTVSGPWGHFRRVEGNVVKQTYRIIPRTTVAGLLAAVLGIERDGYYDLFAPGSSGIAIE
PVRAVRTLNMPMNTLSTASGNLQSLNGRGKISVKLPNPTALRQQHNYEVLVEPAYRVDVWLAETARYRELREMLEA
GKSHYVPSLGLSEHLAEIDYHGEFDVESASGAGRVEVDSAVPNAVDDVVLREGTRCQVEESPAFMRVDAGGRTTTEF
TTYAYNPDAEPLVVDGVDSVEVDGRNVVFV

> 3den CRISPR-associated helicase Cas3 at 5259..7904
MTERYSHPPNDVHDGVPLVDHLGDVAERVGYVVPADAKTPAGEPLRAVVETLAYVHDFGKATTYFQDYLLRSVEPRYEQY
RYHAPLGSFAAYYALSAQNFDPETCLAGFVAVAKHHGRLPDVAAYVFSRANRRENVSRGNQSTAEMQQVAIAKQL
KDIDEHAPELAREVFQSATDGEGSWSSFRGSFRELLTEVKKSTGSSATAITRETLSERCYGLVLECWGSLVLADKTSAA
AAASGSNASAGTYDAEKPTFERLEEYIESIERTADADRDGNRSERLNFHRAKARASVLSNVESFADADGGVATVTLPT
GMGKTLTGLSAALSVRDQLGGGRVVYALPFTSIIDQVVAEVEEIYQVDTTGRLLTAHHHLSEATIVDESDESADEADA
NDDVAGMLAESWRAGMTVTTFVQLFESLAGPANRQSMKLPALRDSIIVLDEPQSLPLDWWKLVPRLVRMLTEQYNA
TVIAMTATQPQLFDDATELVSAPETYFEATERVQYELDDSTTRYIDSREEPKSYAEAASAIVDETTDANGTNSDEAGE
TESVLAICNTIDSAQALTTHVTETLSDAISVGSVYADVLENADRNSTDIEVATVAKRVADAGGRPVLHLSTRLRPVDR
LRLIETAKLLTERGCSLVVVSTQLVEAGVDISFDRVYRDLAPIDSIVQAAGRCNRSFERDRGHVVVWWLEAPDEQTKT
PSEAVYNRGTALLPVATETLDDIGGTEGTIPEATVSKTAVEEYYRRLHDEKNIGKDAYVEYVNEARADELGELSLIEQR
RSVEVVVCRTTEERERVEAVRAAWQDYEFETVRRLMDSLKEASVSVPIYRGDSKEAKALSGLTRIYEDTETRWIDTRD
ARHGSYFDSTTGLAAESSVDNRIL

> 4den CRISPR-associated RecB family exonuclease at 7901..8449
MTTKDPVDRLLATARGTPVDEPFRVTGVMMQYYYVCERELWFESRSLEIDRENATVVRGMRVDETAYDEKRESLRLGMISL
DLLDDGRVVEVKPSSALTEPAEMQLSYYLWYLDRVAGVRRDGVLAHPRERRRESVELTEERAKKVESSIRRIHELVRRS
SPPPAERKPFCESCAYHDFCWC

> 1den CRISPR-associated protein Cas1 at 8440..9444
MVLTMDRNYHVFSDGRLERNDDTLRLVTEAGDKKYVPIENAEAFFLHGQIDFNTRLMSFLNQRTVALHVFGWEDYYAGSV
MPKRGQTSGRTVVEQVRAYEDSDHRRRLAAAMVSASIHNMRTNVVYYNNRDRELETEIDDLDAASARVDQTRPID
ELMGVEATARKAYYRSFNQILPDEFRLERREYNPPPNEVNSLISFGNALVYANCVSAIRATALDPSISYLHEPGERRYSL
SVDLADLFKPVLADRVLFRLINRKQITPSDFETDLGSCLLDEDGRRTYTKAFEETLERTVEHPKLNRKVSYQYLMRLEA
YKLKKHLLTGEEYEPFERWW

> 2den CRISPR-associated protein Cas2 at 9446..9706
MYVVMVYDLEAERTYKALKLGRRYLTHVQNSVLEGEISEGDLATLRNEVEDLLKSGESVIIYELSSDALLNRSVYGNDPTDEK
RFL

Haloferax mediterranei

> 2med CRISPR-associated protein Cas2 at complement(183157..183420)
MVYIIVVYDMRADRTRLMLNFLRKYLTHVQNSVFEGEVTEGDLETIRNHTQTLLNPDESTIIYRIGSEKYVDRTVIGEDPTDE
SRFL

> 1med CRISPR-associated protein Cas1 at complement(183424..184416)
MDRNYHIFSDGCLERHNDTVRLVTLDDEKKYLPIEKAEAIYLHGQIDYNTRLISFLNKHGTALHIFGWKDYYAGSVMPKRG
QTSGRTLVEQVRAYDSPAQRTDIARKFVDGSIHNMRANVSYYNSRGHDFDSELASLDAAGARLTETTAVEEIMGVE
ATARRAYYSTFDSILPDGFVFNGRRYNPPTNEVNSLISFGNSLVYANVVSGIRATALDPAVSFLHEPGERRYSLALDIA
DLFKPLLADRVTFRLLNRQQLTPADFETDLNSCLLTEHGRKTFSKAFEETLEQTVEHPRLNRKVSYQYLLRIEAYKLKK
HLLTGEEYVPFKRWW

> 4med CRISPR-associated RecB family exonuclease at [location missing] (C-terminal residues: ...CAYHDFCWSC)
MTDVDPVQRLLRTARDDARDESFRVTGVMMQYYVVCKRELWFHSRHIEIDRGNSAIVRGTHIDETAYSDKRRHVSIDSTIAI
DVLDDGRVMEVKPSSALVEPAKLQLLYYLWYLKHVVGVEKSGVLAHPTERKREDVELTDETEQKVEDAIRGVHEIIAR

> 3med CRISPR-associated helicase Cas3 at complement(184969..187563)
MTAEYERRYSHPAEDGRPAVLLFEHLRDVRDRVDMVIPEGATTPEGKPLCGVIRRLALVHDFGKATTFFQQYIGAQLGQPT
HDKLRYHAPLGSLAAYYVLRETGHSTATCLAGFVAVAKHHGRLPNVVEYVFKRMASPDPEKWMADKKQVENIHKN
APRLATAIFEEATGDDDAWLDFAQSCVNDESLFTEIADHVTRNGERPITEPTFLTDEFYGLMLECWGTLVLADKTSA
AGAPQASSVYDATNPRTADLTQYIDNLGDGNTDPDGSRTEQLNYYRSRARQDVLDSVTEFVESESDVATITLPTGM
GKTLTGLNAALEIRDQTGGDRIVYALPFTSIIDQVGAEVQDIFDTDGSDGIVALHHHLSDTRFGYSDGDDDASDLND
DIAGMLGESWRAGLTVTTFVQLFESLAGPRNTQSMKIPALRGNVIVLDEPQSLPLDWWKLVPRLVDVLTEQYGATVIS
MTATQPELFPAPMSLVSDAERYFTVAERVQYHLHDSVERFLRGEEQPLEYNDAANELVEVAQSGDSLLAICNTIDSA
RVLANAVTERIQAVNLAEQYFESLRNGSSDPVAETVQLVRQSSKQAFVHLSTRLRPTDRLALIRIIKQLRASGSPVIAVT
TQLVEAGVDISFENVYRDLAPVDSIVQAAGRCNRSFERELGAVTVWWLTQPAEQRHTPAVAVYDTQGPSLTPVTAS
ALDSVRDGQTKLAGQSVARAAVQEYYGTLHKEKNVGREEYHEYVDDADAESLGRLSLITQTKTVDVLICVTEADQT
LVESLEGAYENYDFQEVKRLLNATKPLRVSIPIYRDDSPEANVVTELRPLAGRKDESSIRVLDAGTRDFEKYFDHTTGF
VVADSTVEDRFL

> 00med CRISPR-associated protein, TM1800 family at complement(187565..188371)
MTQQTLTEWNDTQNESGTAGPPRCLSLTVRGPWGHFRRVEGNIVKQTYRIIPRTTVAGLLAAVLGIERDGYYELFARGRSAI
AIEPVAPLRTVNMPVNTLSTADESMKSLNPRGKISIKLPDPTKPRQQHNYEVLVDPAYRLYVWMSDSHWFETLHETL
DEGKSHYVPSLGLSEYLAEITYHGRFEVESGPTDTAVAVDSAVPNAVDHVVPDAESRCQIEESPAFMTVDGGGRTTT
DFTSYTYNPDAGPVRVRNPDTAIVDGNTVMFV

> 01med CRISPR-associated protein, TM1801 family at complement(188374..189429)
MTVHTDPVENRSEIVFLYDAVDANPNGNPLSGANRPRIDPQTQQAIVTDVRLKRYLRDQLDADGHGVYIRNVQEEGEQYT
REKLLEDRLKEVDPDEYDDDGELAGVVFQAFLEESTDVRYFGATMSVDLDGKYGSLPDHFTGPVQFSPGKSMHAVN
ENEEYDSLTSVIATQTGKEQGGFGLDDHRIQYGLIRFHGLVDEHAAEDTALTAEDVERLDTLCWRAIKNQTISRSKIG
QEPRFYLRVEYATESFHLGGLDKDLELDRTDGRTKSDDELRNVRDLTLSVDSLVDRLERSTNRIERVHVTASDVLSVS
HGDEVGGPEVLYEALEDRLGTDAVHVIDVYDEHVETLPN

Haloferax mucosum

> 01muc CRISPR-associated protein, TM1801 family at 3755..4810
MSEHSDHVENRSEIVFLYDAVDANPNGNPLSGANRPRIDPQTNQAIVTDVRLKRYLRDQLDADGHGVYIRNVQEEGNRYT
REKLLENRLKEVDPDEYDDDEELSEAVYRTFLEESVDVRYFGATMSVDLDDRYGSLPDHFTGPVQFSPGKSMHSVNE
NEEYDSLTSVIATQDDKQQGGFDLDDHRIQYGLIRFHGLVDEHAAADTNLTTEDVERLDTLCWRAIKNQTISRSKV
GQEPRLYLRVEYATDSFHLGGLDKDLDLDRKDGRTKPDDQLRTVRDLTLSVDSLVARLEKSANRIERVHVAASDVLS
VSHDDDVGGPEVLYDALADRLGTDAVHTIDVYDEHTTALPN

> 00muc CRISPR-associated protein, TM1800 family at 4813..5619
MDQQTLSKWDDSQGESETADPPRCLSLTIRGPWGHFRRVEGNIVKQTYRIIPRTTVAGLLAAVLGIERDGYYELFGPGHSAV
AIEPVEPLRTLNLPVNTLSTANESMKSMNAKGKISIKIPDPTKPRQQHNYEVLVDPAYRLYVWLRDSDWFDMLHEML
DEGKSHYVPSLGLSEYLAEIEYHGQFTVETGPADSVVAVDSAVPNAIDRIVPDAETRCQIEESPGFMTSDTGGRTTTG
FTSYAYNPDAGPVNVRNPDTHIIDGNTVMFV

> 3muc CRISPR-associated helicase Cas3 at 5678..8215
MLLFAHLKDVRDRIDMVVPEDTKTPDGMLLSGVIQRLALVHDVGKATTFFQQYIGESPGKPRYEKLRYHAPLGSLAAYYVL
DETGHSTATCLAGFVAVAKHHGRLPNVVEYLFNRTARPDPEKWDPVKAQISDIHEHAPKLVTAIFQEATGDADAWQ
NFAKACVNDESLFTEIADHITTNGERPITDPSFLTDEFYGLLLECWGTLVLADKTSAAGAPQSSTVYDGTTPKTTTLA
EYIDEIEDPNANPDGSQTERLNYFRSRARKDVLDSVSVFIESESNVATITLPTGMGKTLTGLNAALEIRDRTDRDRIVY
ALPFTSIIDQVGAEVQDIFDTDGTDGLLALHHHLSDTRFGYRDSDDDVSDLNDDIAGMLGESWRAGIMVTTFVQLF
ESLAGPRNTQSMKIPALRESVVVLDEPQSLPLDWWKLVPRLVAVLTEQYDATVISMTATQPELFSDSVSLVSDPEQYFS
AAERVRYHLHDSVERFLQGDEQALEYDVAAAEIADVAASGESLLAICNTIDSARKLAEAVTDRIQSVSVAEQYFASLKSGS
ADPVEDTVRQIIQSPKQAFVHLSTRLRPTDRLALIRIIKKLRSAGYAVTAVTTQLVEAGVDISFENVYRDLAPVDSIVQA
AGRCNRSFEHERGDVTVWWLAQPAAQTHTPAVAVYDLQGPSLTPVTAKALDSVRQGQTTLPGKSVARTAVQNYY
DRLHTEKNVGKEAYSAYVDTADAESLGRLSLITQTKTVDVLICVTEADHELVSSLETAYEKYEFEEVKRLLTATKPLRV
SIPIYRDDSPEADAVTSLRPLADREEESQIRVLSAETRAFENYFDRSTGFVVTDRTVEDRFL

> 4muc CRISPR-associated RecB family exonuclease at 8212..8766
MTELDPVERLLQTARDETRDDSFRVTGVMMQYYVVCKRELWFHSRHIEIDRGNEAIVRGTHVDETAYSEKRRHVSIDSTIAI
DVLDDGRVLEVKPSSSLVEPAKLQLLYYLWYLKHVVGVEKSGVLAHPTERKREDVELTSAAEQRVEDAIRGVYEIITTE
SPPPAVQKPVCGSCAYHDFCWSC

> 1muc CRISPR-associated protein Cas1 at 8726..9760
MARVRITTSAGVANMDRNYHIFSDGCLERHNDTVRLVTLDDEKKYLPIEKAEAIFLHGQIDYNTRLVSFLNQHGTAIHVFG
WKDYYAGSIMPKRGQTSGRTLVEQVRAYDSTARRTDIARKFVEGSIHNMRANVSYYNSRGHEFDTELASLDAAADR
LTEADGVQESMGVEATARRAYYSTFDSILPDGFVFNGRQYNPPTNEVNSLISFGNSLVYANVVSAIRATALDPAVSYL
HEPGERRYSLALDIADLFKPLLADRVLFRLLNRKQLTPADFETDLNSCLLTEDGRKTFSKAFEETLEQTIEHPELNRKVS
YQYLLRIEAYKLKKHLLTGEEYVPFKRWW

> 2muc CRISPR-associated protein Cas2 at 9764..10027
MVYIIVVYDMRADRTRLMLNFLRKYLTHVQNSVFEGAVTEGDLETIRNHTQSLLKPDESAIIYRIGSDKYVERTVIGDDPTDD
SQFL

Haloferax sulfurifontis

> 4sul CRISPR-associated RecB family exonuclease at 8856..9479
MTGSSFKDELVYVSALNEFVYCPRRYYYQRYYDEIGEPYELVDGRSKHKHASRRGSWVTERYFRSDDLGMHGKIDLVDTET
GTPTPVERKRSESGSYYSSDEVQLAGYCMLLEDAIDESVNVGYIYLYSTDTRHSVRITQDHREAVREILRRIRSMSVDSI
PPLTQNQSKCEACSARTYCMPGETAVLSPSEAEGTGWEGTTPGDLI

> 1sul CRISPR-associated protein Cas1 at 9446..10471
MGGNDTRRFDMKESQGIFEDSVVYVTTQGSQVRIDGGQVVIYDVEGDDGELGAFPVEKLDTINVFGGVNFSTPFVATANE
HGIVLNYFTQNGKYRGSFVPERNTIAEVRRAQYALTGRDELAISKAMIAAKIRNARTILSRKGVRGTTVLKDLGERVSS
ASNKDDLRGIEGEAAERYFSRLDETLVDNWTFDTRSKRPPEDHINSLLSLTYVMMKNEVLSALRCYNLDPFLGVLHA
DRHGRPSLALDLQEEFRPLFCDAFVTRLVNRGTITHDQFGNDNRLTDDGFQAYLDKFDGYMNEELTHPYFEYRVSR
RKAIRQQVILLRKAITGELDDYHALEVSR

> 2sul CRISPR-associated protein Cas2 at 10480..10740
MTYDVSDDTNRRRVYRTLERYGAWRQYSVFELDVSKSERVELEDELESHIEPADGDRVRLYRLCEACQEATTDLGNEPPDE
QSNVI

Haloferax volcanii

No Cas proteins found.

Halorhabdus utahensis

> 2uta CRISPR-associated protein Cas2 at complement(2149888..2150151)
MVYVVAVYDVEADRTYLFLNFLRRYLTHVQNSVFEGEITEGDLEEVKGKLDSMLEPGESVIVYRMSSEQYVSRTVYGEDPTE
DSQFL

> 1uta CRISPR-associated protein Cas1 at complement(2150153..2151145)
MNDNYHIFSDGRVERHNDTVRLVTEDDEKKYLPIENAEALYLHGQIDFNTRVISFLDDHGVAMHVFGWNDYYSGSIMPER
GQTSGQTVVEQVRAYDDEAHRGNIAREIVAGSIHNMRANVTYYDNRDYDLSATLESLDRRRDEIKSVASVEEAMGV
EASARRAYYAIFDQILPDAFVFGGRKYNPPNNKVNSLISFGNSLVYANIVSAIRATALDPTISYLHEPGERRYSLALDLA
DLFKPVLTDRVVFRLVNRGQLSDDDFDSEMNACLLTESGRETFSKEFEQTLDRTIEHPNLNRKVSYQYLLRVEAYKLK
KHLLTGESYESFERWW

> 4uta CRISPR-associated RecB family exonuclease at complement(2151149..2151727)
MSGHTDERTGETDPVDLLLESARDEAVESSFHVTGVMMQYYEVCERELWFESRNIEIDRENPNVVRGTHVDETAYDEKRRN
LSIDGRIAPDLLDDGRVVEVKPSSTLVEPARLQLLYYLWYLDRVVGVEKEGVLAHPTERKRESVELTDETVQQVEDAIR
GIYDVVRSETPPPATEKPFCESCAYYDFCWSC

> 3uta CRISPR-associated helicase Cas3 at complement(2151724..2154318)
MAERYSHPPEGGREGVALEVHLADVADRVAHVVPDDATTPTDGSLRSVVETLAWVHDFGKATTYFQEYLLEDSEADPPML
RHHAPIGSFAAYHALDTQGFDTETCLAGFVAVAKHHGRLPNVAEYVVDRTHRRDGDSRQNSVEKRQTVVLKQIGD
IHDTVPDLAETVFENATGGSGGWESFVRSYQSGSLLSEIEETVGTQTAGRGVDPDALSSSCYGLVLQCWSALVLADK
TSAAGAANESETYAPSQPGFETISDYIEDLEAGVDADKAGTKTERLNYHRSDARKNVLDNVASFAESGGGVATLTLP
TGMGKTLTGLSAAFALRDALGGNRVVYGLPFTSIIDQVVDEIQEIYETDTAGRLLTAHHHLSETTIRDTDDQSADDA
DRNDDVAGMLGESWRAGVTVTTFVQLFESLAGPQNRQSMKLPALRDAVVVLDEPQSLPLDWWKLVPRLVDVLTEQ
YNATVIAMTATQPRLFEDEFELVDDPDRYFEVVRRVSYELDDSTERYIESQSEPKSYAAAANELRAAVESGQTTLAVC
NTIDSARELTEQVGDGSFVDVGRLYDDELQEAGSADDVDPVELAKRVAATDDNALLHLSTRLRPADRLTLIETAKAL
TERGHPTLAISTQLVEAGVDISFDRVFRDLAPIDSIVQAAGRCNRSFEREQGVVTVWWLDVPDEQSKTPAEAVYNRG
TTLLPTVADTLRQIRDESGSLSETDVARRGVEWYFERLREDKDVGKQTYADWVDDAKAKELGTISLIDEQLSAEIVVT
RTPAERERAEAIRNAQRNFEFETLGQLVDETKPLRISVPYYSEDSETADAITDLPPLVEDEGIYELDVQQNPSHFDRTT
GFVVPEASVDHQFL

> 00uta CRISPR-associated protein, TM1800 family at complement(2154319..2155119)
MTMDQESSNGRTGTDGSDLDRCLSFEIRGPWGHFRRVEGNVVKQTYRIVPRTTVAGLIAAVLGIDRDGYYDLFGPEVSAIAI
QPVEELRTVNMPMNTLSTAAGDLTSLNPRGKISIKLPNPTKLRQQHNYEVLVDPAYRIDVALADDERYEQLRETLAA
GKSHYVPSLGLSEYLAEIDYLGEFDVKPGPASGTIAVDSAVPDAMDDVVLDPETRCQIEQSPAFMASDGSGRTTTEYT
TYTYNPDAEPLQVRDVPTSRVDDRTVVFV

> 01uta CRISPR-associated protein, TM1801 family at complement(2155116..2156195)
MSEPTQTVENRSEIVFLYDAVDANPNGNPLSGSNRPRIDPQTQQAIVTDVRLKRYLRDQLDDDGHGVYIRNVQEEGTQYT
RAELLEDRLKAVDPDDYDLDDDEAAAQFRDDVFGEYLEESADVRYFGATMSVDTDNAYAKHLPDHFTGPVQFSPG
KSIHAVNENEEYDSLTSVIATQEGKEQGGFDLDDHRIQYGLIRFHGLVDEHGAADTNLTRADVERLDTLCWRAIKNQ
TISRSKVGQEPRLYCRVEYGEESYHLGGLDKDLTLDDEASKDHDELRNIRDLTLEIDDFVDRISNASDQIERIRVVASD
VLELSHGTDSGGPDLLYDALRTAIGPDRVDVVDVYDEYPETLPQSTGE