Forum "Phylogenetic analysis"

Thread subject: Outgroup problem

dama
17 Dec 2008 17:23
Contribution: Relevant
During my review, I was advised to use the archaea as the outgroup. However, my alignment is poor when the outgroup is included, and my sequence ends up on the outside of the tree if I remove the outgroup.
Should I keep the archaeal outgroup or remove it?
Hingamp_BC08
17 Dec 2008 19:59
Game master
Your reasoning is easy to follow in your annotation sheet. It is true that with the archaea the multiple alignment becomes rather borderline (the alignment is already not easy without an outgroup). However, would the small, strongly conserved "PSK" domain at the C-terminus remain perfectly conserved if you removed just one archaeon from your outgroup: "Methanocorpusculum"? If so, I would opt for that alignment to infer the trees. If the ORF remains marginal to the study group, so be it; at least you will have tried everything!

Warning! Your tree with the outgroup is not rooted on that outgroup! This may have misled you when interpreting it. (See my other message, "Explorer des arbres très rapidement" ("Exploring trees very quickly"), on how to root a tree.)
dama
18 Dec 2008 11:33
Unevaluated contribution
The "PSK" domain has 3 identities when I remove all the archaea.
As soon as I try to add an archaeon to the alignment, the "PSK" domain drops to only 2 or 1 identities.
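
To make the idea of "identities" in a domain concrete, here is a minimal Python sketch that counts the alignment columns in which every sequence carries the same residue. The function name `count_identical_columns` and the toy sequences are assumptions made for illustration; they are not the thread's data, and this is not how Phylogeny.fr or T-Coffee computes its consensus line.

```python
def count_identical_columns(aligned, start=0, end=None):
    """Count columns in aligned[start:end] where all sequences share one residue.

    `aligned` maps sequence names to gapped sequences of equal length.
    Columns containing a gap ('-') are never counted as identical.
    """
    seqs = list(aligned.values())
    if not seqs:
        return 0
    length = len(seqs[0])
    if any(len(s) != length for s in seqs):
        raise ValueError("all aligned sequences must have the same length")
    end = length if end is None else end
    count = 0
    for col in range(start, end):
        residues = {s[col] for s in seqs}
        if len(residues) == 1 and "-" not in residues:
            count += 1
    return count


# Toy alignment (hypothetical sequences, invented for this example).
toy = {
    "bact_1": "VPSKTY",
    "bact_2": "APSKIP",
    "bact_3": "GPSKSY",
}
# Same alignment with one hypothetical outgroup sequence added.
with_outgroup = dict(toy, arch_1="LSAKTY")

print(count_identical_columns(toy))            # 3
print(count_identical_columns(with_outgroup))  # 1
```

Comparing the count with and without a candidate outgroup sequence, over the same column range, gives a quick numerical check of what is otherwise read off the consensus line by eye.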

Having tried everything, I think it is better not to use any outgroup.
What do you think?
Hingamp_BC08
18 Dec 2008 13:16
Game master
This is indeed a really tricky case! I took all your sequences in FASTA format (from your notebook) and submitted them to Phylogeny.fr with multiple alignment by T-Coffee (for very divergent sequences this method is sometimes more accurate, though also a bit slower). I then progressively removed the sequences that seemed too distant (the T-Coffee "heat map" is really very useful for examining the alignment closely in this kind of difficult case). I am left with an acceptable alignment with two archaea for rooting (tree by PhyML). Your ORF nests within the bacteria; it remains to be seen whether it is closer to one branch or another. Here is the URL of the analysis I ran on Phylogeny.fr:
http://www.phylogeny.fr/version2_cgi/alacarte.cgi?workflow_id=a9dc68dbbf5c4ac5f8ba367cd32d6805&tab_index=3
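
As a rough illustration of removing overly distant sequences step by step, the Python sketch below drops the sequence with the lowest mean pairwise identity, one at a time, until every remaining sequence reaches a threshold. This is only a crude stand-in: it uses plain percent identity rather than the T-Coffee heat-map score, it does not realign after each removal (which one would do in practice), and the function names, the `threshold` value and the `keep` parameter are all assumptions made for this example.

```python
from itertools import combinations


def pairwise_identity(a, b):
    """Fraction of identical residues over columns where neither sequence has a gap."""
    pairs = [(x, y) for x, y in zip(a, b) if x != "-" and y != "-"]
    if not pairs:
        return 0.0
    return sum(x == y for x, y in pairs) / len(pairs)


def mean_identity(aligned):
    """Mean pairwise identity of each sequence to all the others."""
    totals = {name: 0.0 for name in aligned}
    for (n1, s1), (n2, s2) in combinations(aligned.items(), 2):
        ident = pairwise_identity(s1, s2)
        totals[n1] += ident
        totals[n2] += ident
    others = max(len(aligned) - 1, 1)
    return {name: total / others for name, total in totals.items()}


def prune_divergent(aligned, threshold=0.3, keep=()):
    """Remove the most divergent sequence repeatedly until all reach `threshold`.

    Names listed in `keep` (for example outgroup sequences) are never removed.
    Returns the remaining alignment and the list of removed names, in order.
    """
    current = dict(aligned)
    removed = []
    while len(current) > 2:
        means = mean_identity(current)
        candidates = {n: m for n, m in means.items() if n not in keep}
        if not candidates:
            break
        worst = min(candidates, key=candidates.get)
        if candidates[worst] >= threshold:
            break
        removed.append(worst)
        del current[worst]
    return current, removed


# Toy alignment (hypothetical sequences, invented for this example).
toy = {"a": "MRILAV", "b": "MRILIY", "c": "MHILIY", "x": "WQ-GGC"}
kept, removed = prune_divergent(toy, threshold=0.3)
print(sorted(kept))  # ['a', 'b', 'c']
print(removed)       # ['x']
```

Passing the outgroup names through `keep` protects them from removal, which matters here because the outgroup is expected to be the most distant part of the alignment.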

The tree:
                                                                                                    -------0.5-----

                      +---------------------Clostridium_spiroforme_DSM_1552_gi_169350044
                   +--+
                   |  +--------------Syntrophus_aciditrophicus_SB_gi_85858228
                  ++
                  |+---------------Clostridium_sporogenes_ATCC_15579_gi_187778256
                  |
                  |            +-----Frankia_sp._EAN1pec_gi_158313994
                  |            |
                  |        +---+
                  |        |   +----------------------Micrococcus_luteus_NCTC_2665_gi_177670690
                  |        |
                  |        |-------------------Acidobacteria_bacterium_Ellin345_gi_94970332
                  |      +-+
                  |      | |      +---Trichodesmium_erythraeum_IMS101_gi_113475218
                  |      | |      |
   +--------------+      | +------+ +---------Nostoc_punctiforme_PCC_73102_gi_186684139
   |              |      |        +-+
   |              |      |          +----Microcystis_aeruginosa_NIES-843_gi_166365055
   |              |      |
   |              |  +---+                             +Mycobacterium_sp._MCS_gi_108797917
   |              |  |   |                +------------+
   |              |  |   |                |            +Mycobacterium_sp._KMS_gi_119867013
   |              |  |   |     +----------+
   |              |  |   |     |          |
   |              |  |   |+----+          +--------Streptomyces_sp._SPB74_gi_197339188
   |              |  |   ||    |
   |              |  |   ++    +----------Methylokorus_infernorum_V4_gi_189220239
   |              +--+    |
+-+                 |    |  +-----Acidothermus_cellulolyticus_11B_gi_117927655
| |                 |    +--+
| |                 |       +--------------GOS_678020_Traduction_3-392_sens_direct
| |                 |
| |                 |      +--------------Geobacillus_sp._Y412MC10_gi_192811390
| |                 |      |
| |                 |      |                +-------Victivallis_vadensis_ATCC_BAA-548_gi_150383573
| |                 +------+        +-------+
| |                        |        |       |
| |                        +--------+       +---------------------------Agrobacterium_tumefaciens_str._C58_gi_159186334
| |                                 |
| |                                 +-------------Acidobacteria_bacterium_Ellin345_gi_94970849
| |
| +--Methanosarcina_barkeri_str._Fusaro_gi_73668661
|
+--Methanosarcina_mazei_Go1_gi_21227245


The multiple alignment (the conserved regions are easier to see in color on the site):
CLUSTAL FORMAT for T-COFFEE Version_6.85 [http://www.tcoffee.org] [MODE:  ], CPU=16.55 sec, SCORE=64, Nseq=21, Len=578

gi|1179276      --------------------------------------MRIL---AVVPHFVPDV-APTG
gi|1134752      --------------------------------------MRIL---IYSYNYNPEL-IGIA
gi|1866841      --------------------------------------MHIL---IYSYNYHPEP-IGIA
gi|1663650      --------------------------------------MRIL---LYSYNYYPEP-IGIA
gi|1503835      -------------------------------------MRTLF---FFTEYYEPAP-NSTA
gi|1583139      MPDDSRPSPVAPARPWPSTTRSAPTPSRQVSSGPPSGWPRIL---LVTHYFPPEV-GAPQ
gi|1973391      --------------------------------------------------MP--------
gi|9497084      ----------------------------------------------MFSDIEHEP-IG--
gi|1776706      -------------------------------MTTEPRRPRLL---VVSHSYRPER-TPPA
gi|1693500      ------------------------------MTTKRIEKKNII---RLYAYYTPEI-TAST
gi|8585822      --------------------------------------MRIL---EIGMFYEPDL-GPGA
gi|1087979      -----------------------------------------M---ILGLNYPPEP-TGIA
gi|1198670      ----------------------------------MRKRPDVL---ILGLNYPPEP-TGIA
gi|1892202      -----------------------------------MAAYRIL---FLGINYWPEK-TGIG
gi|1928113      ------------------------------------MKKEIV---FVANYFHPDY-ASSG
gi|9497033      --------------------------------------MKFL---ILSQYFYPEV-GATQ
gi|1591863      ----------------------------------------MI---FLNRYGFPDQ-SATS
gi|7366866      ----------------------------------MV--KKVNNIFLVYYG-SFNAKSGSN
gi|2122724      ------------------------------------------MKLFISYGMQLDLFNGSN
gi|1877782      --------------------------------------MKIL---FLTQYCPPEV-GAPQ
GOS_678020      ------------------------------------------------------------
                                                                            

gi|1179276      IIAARLIEEIGGLGHDIHVVTSLP--WYE-RHAVEPEWRGRVVHRQWRP-WGSVTRVYPF
gi|1134752      PLMTELAEGFAKRGHQVRVVTGMP--NYP-ERKVYDGYKGKFFLTEYKN-GVTVQRSYIH
gi|1866841      PLMTELAEGLVNRGHEVRVITGMP--NYP-EREIYDGYRGQWYVTEQKN-GVTIQRSYIR
gi|1663650      PLMTELAEGLVKRGHQVRVLTAFP--WYP-NSEIDPEYRGKIYLEEERN-GVKIQRSYVW
gi|1503835      YFLTRIIGVARRFHGKIQVICATP---------------GPQTPSEDTE-NFSVFRVAVG
gi|1583139      ARLSETARAWAQAGADVTVLTGMP--NHP-TGIVPPSYRGAARRVEHSD-GYRIVRTWLY
gi|1973391      ------------------------------------------------------------
gi|9497084      ---------------NVRFVTG-P------------------------------------
gi|1776706      RRWGAVVAALRSAGWDVDVVAPGG--VRP-----F----------AGPD-GERVLPTPRL
gi|1693500      HLVNDLEKTLVDNDFQIDCVTPTP--SRGLEQDIVDNYKDIRY-EEKYDGKIRVHRFK-L
gi|8585822      PLCTMLSRELASLGHQVTALAAVP--HYP-TGQVQRAFRGRWLRTTVED-GVEVVRVGLP
gi|1087979      PYTGALASGLNSLGRHVTAQVAHP--HYP-EWVVREGY-GQWSRVEQLD-GVEVRRLLHY
gi|1198670      PYTGALASGLNSLGRHVTAQVAHP--HYP-EWVVREGY-GQWSRVEQLD-GVEVRRLLHY
gi|1892202      IFNTGRCEFLSSKGYEVSMITAFP--YYP-FWEIPAEYRGKWFKDESRN-GVKILRSFIY
gi|1928113      QLLTELCLEL-QHDFDVTVIAVQP--ENA-NHV----QKKRMFEYDQLE-RIRIIRLRTP
gi|9497033      TRLAGTAAEIVLAGHEVEVLTGLP--NAP-AGKIFPDYRGRFYMRDEWN-GCPVHRTWLY
gi|1591863      RMISSLAMSMAQRGMNVTVVTSREIHNHP---------GSVLPATETVR-GIHIQRLSSG
gi|7366866      IHILELLRNLKKYTD-IVL--FVP---------------GQKSVDRTLP-GIKCVPVID-
gi|2122724      VHTIELLNNLKKLGVDVLL--FSR---------------SSKNRSYKNP-NIIEVPSTHF
gi|1877782      NRIFEFAKQLKKFDHEVTILTAMP--NYP-KGEVFDGYKGKKIVKEELE-GIKIVRTSIY
GOS_678020      ------------------------------------------------------------
                                                                            

gi|1179276      PT-ADKRNIPKRALGFAAFCGL----SAVAA-S-IGGRL-DVAFA-MTPPLPMAATGWLA
gi|1134752      -VRGSKPGVLARLLLDGSFIVS----SLWQA-F-NGWKP-EIIFA-TTPPILISLPVSFY
gi|1866841      -I-KSKPNLLDRLLLELSFIFT----SLPQA-F-RGERP-DVMIL-TVPPLLGILPATIF
gi|1663650      -A-RPQRSLKNRILFELSFVFL----SFFQA-L-KGEKP-DLIFL-TVPGLPVCVPAALL
gi|1503835      -H-GDKNNLKQRIQKFLVISWH----FFRLAFL-HVRRN-DVVFA-VTNPAFIIFILAVL
gi|1583139      -A-TPNEGVLRKTIGHISFTLS----SVLLG-G-RLAGPADVVVV-SSPTFFPLGSAWWL
gi|1973391      -------------------------------------PS-DAVHA-QMPSLAGGVLAARL
gi|9497084      --KYERKTAARRFKTWFKYCWQ----ATRLA-F-RTKGDPKLFIV-AQPPF-LSLLGYLQ
gi|1776706      -P-LGEAGRNGRFAEAVVHALL----AIPRG-M-ATRRP-DVVVA-TVPALPVVVPGVVL
gi|1693500      LP-E-NSVVIKRIIRYLLLNIK----QYRTA-R-NLSNC-DVILA-GSTPPTQGIVAALL
gi|8585822      -S-VNRADLVQRFTQFFCYQVG----AALSG-L--TKRY-DVVLA-ANPFLMVWLPFALL
gi|1087979      -V-PKSPRGLRRLLSETSFGLR----LLF---A-RWGRP-RIVIT-VSPSLFSAALAALR
gi|1198670      -V-PKSPRGLRRLLSETSFGLR----LLF---A-RWGRP-RIVIT-VSPSLFSAALAALR
gi|1892202      -V-PKKVSTLSRIVHEASFLFS----SLLRA-L-LVKKP-DLLFI-VSPPLGLGLSAFFL
gi|1928113      -Q-VNKRSKFSRILFILSYFLL----AVIAL-L-RIKKV-DVIYTISSPPIIGGLIGAIG
gi|9497033      -A-ATGKS-LARVMNYGSFALA----SWWEA-R-RVKKP-DYVFV-ESPPLTTAWPGLRI
gi|1591863      -R-FGRHNLMRRSIDYLLFQVL----AFAWL-LRNVRAT-DTVIVCTDPPLL-SVMSSIA
gi|7366866      ----------NKYLVQPSYEFMLSFYLLY--SC-IRNRP-DVLYL-RQNSFP--FFPIFL
gi|2122724      Q--FLFSNYLNIFTYQLSLFLY----LIY--YT-IKLKP-DLFYA-RLSGSG--ASSTIV
gi|1877782      -A-TKDKSFVKRLRNYLSFTFS----SVFTG-SKYIDNQ-DVIIT-ESPPLFLGWSGYVL
GOS_678020      ------------------------------------------------------------
                                                                            

gi|1179276      SLA--RRCPLVLNVQDVFPDVAV-ELGLLR-N--P--AL-IRAAAALERWSYARCAAVTV
gi|1134752      GLF--SKSSVVLNIQDIVSEAAV-RVGLVNKN--S--WI-VSLAQAVEKLAYFKADKISV
gi|1866841      GWL--YNCPIVLNVQDILPEAAV-RIGLLK-N--K--WM-IRTLAALEKFAYRTAHTISV
gi|1663650      SKL--YGVPIILNLQDILPDAAV-HVGLLT-N--E--KM-IKVFSRLEKFAYQTASKISV
gi|1503835      RSF--RRFEYILLAYDIFPENLV-AAGLAR-Q--KS-FH-YRVVKKIFDWSYSRADRVIV
gi|1583139      ARR--WRARLVVEVRDLWPAIFT-QLGVIK-N--R--RV-IAALERLELAAYRAADAVVT
gi|1973391      ARR--WKVPYVPVVQDLMGAAAA-QSGISG-G--D--RA-AKVAARAESFALRRATLVGI
gi|9497084      KKL--MGRRYFLWIDDVWPDIIV-GQKMREGS--S--WG-IRLWAGFNRVTFRHAEHVFT
gi|1776706      SRL--WRRPLVLEMRDAWPDLAH-EAGVHQ----G--LL-GAAMERVVTGGQRAARLVVT
gi|1693500      GKK--LCLPVVYIVQDIFPDSLV-STGISS-E--K--SLFFKIGKLIEKYTYKHADKIIV
gi|8585822      GAL--KHKPIVYIVQDLYPDVGI-KLGVFK-N--R--FV-ISAATALERYCLVNSDVVHI
gi|1087979      IRLTPRRPPFIVWIQDIYTLGLA-ETG-EG-G--G--FA-ATLTRWVESRTLRSADRVVV
gi|1198670      IRLTPRRPPFIVWIQDIYTLGLA-ETG-EG-G--G--FA-ATLTRWVESRTLRSADRVVV
gi|1892202      SKL--WNIPYIFHVPDLQPDAAR-DLGMIN-S--N--LF-FDILYKIEKFAYDNAAKVST
gi|1928113      KVL--KRGKLVYNIMDFNPEQAE-AISYTN-R--K--WL-FRLAKRMDNLSCRIADHIIT
gi|9497033      AKK--LGARLIFNVSDLWPDSVR-DLGVMS-D--G--RA-FRTLEKMEIGIYRKSFAVTA
gi|1591863      IRL--KGALMVNWIMDLFPETAI-ELGFFR-K--RARWL-APWLTRARNWSLRSPGMVVC
gi|7366866      CKI--LKIPSIVEVNGIVLDELKVDPNSQS-FAYR--VF-SHLALRSENFNYKHCDRIVS
gi|2122724      SSI--LGIPQVGEVNGITIDEMI-IQGSSK-S--K-----IKIAQLIESINLKGCSKLIA
gi|1877782      SKR--KKAKFIFNVSDLWPESAV-KLDVLH-N--K--FL-IKASTWLEEFCYKKAAAVTG
GOS_678020      ------------------------------------------------------------
                                                                            

gi|1179276      LSEDMRANIAA--KVRDPRRVHVIPNFVDVAGITPG-ERE----NSYRREYGL-AGKTVV
gi|1134752      ITEDFVTKLVE--QGVSKDRIVCISNWVDINFIRPL-NK-N---NYFRAEHNL-QDKFVV
gi|1866841      IADGFRENLVN--KGVPVNKIVCIPNWVNVNFIHPL-PKQN---NSWISSHQL-DGKFIV
gi|1663650      IADGFTKNLLT--KNVPSQKIIEIPNWVDVSFIKPL-PKNN---NYFRQENHL-EGKFVV
gi|1503835      IGRDMANILKN--KGVENRRLLLIPNWSDCSRIQPTSPRNN----PYLCQLGI-QNKKVF
gi|1583139      VTDGFRDDIVR--RGIPAEKVHVIPNGVDLDRFQPG-EPAS---AEVRARLGAGPDDILV
gi|1973391      IHETFRAKVE-A-LGVDPDRIRLVPNWSHVTS--PAGDRAA---TRA--RLGWAPDETVV
gi|9497084      LGPYMRDKVRQ--YVPENIPITIIPTWVDIDSIRPI-PKEQ---NPFAAEHGL-GDKLTV
gi|1776706      VTDGFAETLR-G-RGVR--VVRTLGNGVDLARLAVA-PRRE---R--------AAGELHV
gi|1693500      ICDEFKHNLVD--KGVLAEKIKVIYNWINADEVIPI-SRNS---NKLFEEYNLDKNNFFV
gi|8585822      ISDSFRPSLRA--LGVSDDKMALVYNWVDTDLVRPL-PRN----NAFAQEHDL-GGRFVI
gi|1087979      IHHRFADYVARE-LGVKASDVVVVRNWAHLAMEMPV-SSAS---AKL--ALGWPSGVILA
gi|1198670      IHHRFADYVARE-LGVKASDVVVVRNWAHLAMEMPV-SSAS---AKL--ALGWPSGVILA
gi|1892202      LTPTMRQKIIS--KGIAAEKVLLLPDWADPLLFSLP-ERNGA--LKFREKYGL-EERIIV
gi|1928113      VGQDMQETLNSRFRGRRVPSNSVINNWTDEQDIKPL-PRSDANVSKFLKDHDL-QGKFIV
gi|9497033      VTEGIRDRIINV-KGIPAEKVLFLPNGADTDTYRPL-PPD----TELAAKLGL-TGKKVV
gi|1591863      PTEKMAEFLFT--QGIAKDRVSVLHHWSDGEEIYPV-LPEG---NSLRKAWGL-QDVFVV
gi|7366866      VTDKLRDELVRL-YSVPESKIYVINNGANTDVFKPL-GLEQ-----TREKLQLENSKKYV
gi|2122724      VTDGVKKGLMEI-YFIPESKIVVINNGANTELFIPM-DKNK-----VKKELNLDSTLHYI
gi|1877782      QTKGIVDNIVN--RGFDKNKVHLITNGVDTEFFKKE-NR-D---EKLREEWGL-KDKFAV
GOS_678020      ------------------------------------------------------------
                                                                            

gi|1179276      MYAGNVGMSQSLDLLIAAARDFR-DRDDVVFVVNGTGSTLPDWQRLAD--G-LPNIRFVP
gi|1134752      IYSGNIALTQGLETVVKAAASLKE-KSEISFVILGEETARQQLQECCNNYQ-ADNILLLP
gi|1866841      LYSGNIALTQGLETVIEAAVCLRH-IKEIVFVIVGESRALQRLQEYCLLHG-ADNVLLLP
gi|1663650      LYSGNIALTQPLETLIDAAVYLVD-IPEIKVVIVGKKEALDRLEKYRQQQG-ASNVLLLP
gi|1503835      LFAGNIGRVQGINNLLTAITLVKSK-QAV-FLFIGSGAMADTVRMQQAESR-YHNIYYLE
gi|1583139      LYVGAHGISQGLTSIADAAARLAEKAPAIRFAFVGEGADKQRLTDHVGQLG-LTNTTLAP
gi|1973391      VHSGNMGLKQGLEVLVEAAR----RDPAVRFVLMGDGNQRAHLEEL--GKG-VPNLDFPP
gi|9497084      LYSGNLGLTHDIQSILEAARILRN-EVSLHFMIIGAGPQWDSIERSIKEHQ-DANVTLLP
gi|1776706      LYLGNMGESQGLERLIDAAATLRRTRPGVRVRLVGEGTRRPALEARNAELG-SPAEILGP
gi|1693500      TYAGNMGKAQDIDTIINVAKIMQ-EYKDIKFILFGSGDGKKYYENLINSEK-INNITILP
gi|8585822      LYAGNIGFSQDLDKVLDAAQALA-DQDEILFLFIGDGVGREPLIADAKKRR-LTNVKFLH
gi|1087979      VHTGNMGLKQGLENIVDAAREADERSAPVHFLLVGDGGERRKLMER--AHG-ISRITFVG
gi|1198670      VHTGNMGLKQGLENIVDAAREADERSAPVHFLLVGDGGERRKLMER--AHG-ISRITFVG
gi|1892202      AHTGNMGVKQGLDIVIEAAERLKEK-KGIVFLLVGDGADRRRLEEKAQSLN-LPQLKFIP
gi|1928113      MYSGNLGLYYDLENIIRVASDFKD-HPDILFVFIGDGAMKPEMQRYVEEKG-LRNVRFLP
gi|9497033      YFAGTLGYAQGLHSVITAAETLQRNQPLVHFLFIGEGPEKPRLKEAVATKG-LRNVSFVD
gi|1591863      GYSGNFGRAHDFGTMLAAAKRLE-HRPDIRFLLIGGGHQHAAVKTVVQDLG-LQNVIFKP
gi|7366866      CFVGNLAAWQGVEFLIHASPLILEKCPDTHFLIVGDGVMKDKLMETASKLELSDKFTFTG
gi|2122724      CFVGNLIPWQGVEYLIRAAPLILKEFADARFLIVGDGIMKKEWMKLADDLGLLDNFIFTG
gi|1877782      CYAGIHGLAQGLEVIINAAELLK-EERDIQFVFIGDGPEKSKLMTMVKEKK-LTNISFQP
GOS_678020      -----------------------------------------------------PNLRFGD
                                                                            

gi|1179276      LQPVERLPEVLAAADIHVVP-LKK-GLARS--SV-PSKTYSILAAARPVVASVDEGSEVA
gi|1134752      LVPREKLPEMLSAADVGLVI-QKK-TVTAF--NL-PSKIPVILASGRPIIASVPDTGTAM
gi|1866841      LQPREKLPEMLAAADVGLIV-QKR-NIISF--NM-PSKIPLLLASGRPIVGSVPATGTAA
gi|1663650      FQPREKLPEMLAAADVGMVM-QKH-NVISF--NM-PSKIQVLLASGRAIIASVAADGTAA
gi|1503835      PLPLEKQPEFLNACDVAIVT-LGT-NMLGL--GV-PSKSYFSMAAGKPLLYIGEHDSEIA
gi|1583139      AVPRADMATLLASADICLVP-LRDVPLFDT--FI-PSKMFELLAAGRPVIGSVR--GEAA
gi|1973391      PADDADFMDVLAAADVLAVT-QRA-SVLDM--SV-PSKLTSYFAAARPVLASVAAEGGTA
gi|9497084      LQPIDVLPFSLATADIAIAS-LEQ-GIEGV--SM-PSKTYYSMAAGSAIVGICETNSDLA
gi|1776706      -VHGAAVAQQYAWADTLVVA-LRP-DWPSFRHTV-PSKTYEVLAVGRHVTGLVT--GEAA
gi|1693500      IQPQNRVSEVYSLGNVSIVS-CKK-GAGKT--AL-PSKTWSIMATATAVITNFDKDSELN
gi|8585822      YQPRERLPEVLACADVSLVI-LRE-GIGTA--SV-PSKALSILASGRPMVASVDEGSDLW
gi|1087979      PLSDADYRLALSAADVLLVN-EKP-GISSM--AV-PSKLTSYFHAGRPVIAATDADGITA
gi|1198670      PLSDADYRLALSAADVLLVN-EKP-GISSM--AV-PSKLTSYFHAGRPVIAATDADGITA
gi|1892202      LLPKEEYLALLSAADIALVT-QKK-EVGDI--VF-PSKVMTYLSAAKPVIGAVNKNSEVA
gi|1928113      FQPKENIKYSLCAADVHLVV-NQK-GIKGV--SV-PSKIYGVMAAGKPILGVLEQGSEAA
gi|9497033      AVPAKEISRYASIAMCGLVQ-LLDIPLFEG--AR-PGKTTAIMSCGRPVIYAV--RGEGV
gi|1591863      LQPVENLAESLSVADVHLVS-LLP-ELEHC--II-PSKFYGIMAAGRPTIFIGDPDGEVP
gi|7366866      RIPYEQVPLYINAADVCVAPFIKE-RNSKI--GLSALKTYEYLACGKPIVASG--ISGVK
gi|2122724      RIPYEKVPVYINASDICVAPFIKE-RNSKI--GLSALKTYEYLACGKPLVASA--IPGVK
gi|1877782      VQLKPNMPRIIASMDATVVP-LKKLDLFKG--AL-PSKMFEALASELPIVLAV--EGEAE
GOS_678020      YQPAERLAEVLASADIHVVT-LRR-GLGHV--SV-PSKTYAVLAAGRPVLAAIDADTEVP
                                                   . *    :                

gi|1179276      RIVHDSGAGIAVPPDDEKAFVAAIRALVE-DP-QRAAVMGAAGRTFIEKAATPRQVAEAY
gi|1134752      RVVKESGGGIVVTPEDFSALAQAILELYE-NP-KKLEELGQQGRKYAEENFGSKNALNSY
gi|1866841      RAIKLSGGGIIVEPESPDAMAAAVHDLYA-NP-TFCTRLGNAGRQFAEENYSLEQALDRY
gi|1663650      RAIERSGGGLVVTPEDPEALATAILKLYK-NP-DLATILGEKGRQYAEENYAFEKTLDQY
gi|1503835      QVIAEEQCGWQVEPHEPARLAELIDLICA-LPEEKLTAAGQAARRTAEKRFSETVIQNRY
gi|1583139      RILAEAG-AVVVPPEDPDALAEAVLDAAT-DP-GRDVDMGRTARQYVAQHFDRSMLAQRY
gi|1973391      HEVLRSGAGVLVAPEDPGALLKEVRRLAD-DP-AQARALGDAGPRHVAAHLTRAAGLARV
gi|9497084      HVVLSNQCGGVVRPKSPEALAELILRMAT-DR-EQLGRLRENARHAAVNCYSRSANTPKL
gi|1776706      RTLEAAGG-ADVIGPDVDDLVRHLTALAD-DP-HR-TDVGAGGRDWVRRHADLPAVARRY
gi|1693500      NIINDSKSGIACESGNVMEIKHAILKLYD-DR-ALCSKMGNNGREYIKRNLDSNMCTKKY
gi|8585822      NLVKEAEAGLCIPPGSSNDLVQAILTLKQ-DK-DLRERLGNNGRTWAEKHHSPRVAARRF
gi|1087979      SEVLAAGAGIVVPAGEHSALLDAVLDLGD-DP-AAAARFGRNGRRYRESVLDEQVAIAQW
gi|1198670      SEVLAAGAGIVVPAGEHSALLDAVLDLGD-DP-AAAARFGRNGRRYRESVLDEQVAIAQW
gi|1892202      KVILEAGAGRVVPAEDSKAMAEAVGELAA-DA-QLRQRFGEEGRAYALRTWQKKKVLEEM
gi|1928113      VLIKESGCGVVVEPQQYREISQHIANMYACGT-ETLELTGRGGRLYLERYLAKAQSIDKY
gi|9497033      RLMERSNAGWVIPPMDPDALVGAILEMVA-NP-DEVQRRGENGRRYIEQHMTWEILVRDW
gi|1591863      RILRAKRCGSNVEIGETDKLTGIIEELCD-DP-DTTKAMGDAARRLLCTDYSREKAADAW
gi|7366866      DLIEASGGGISVTPENPKQLATAVIRLLL-DE-NTRVLMGEKGRRYVVENHSWDGVARKI
gi|2122724      DLIELSGGGIAVTPENSEELAAAVIKLLR-DE-NSRKLMGEKGRKYIVKNHSWNSVAKKV
gi|1877782      KLINEANAGITVEPENAKEIAQAVLKLYK-DK-ELKQKLGENGRRYVIEHYSRESITKKL
GOS_678020      RILAASGAGRRVEPDDAAAFGSALREMLA-DP-AALARMGHDARRWVESHASPASVAARY
                  :            .   :   :                  .                

gi|1179276      VRLFSEVRRE-------------------------------------------------G
gi|1134752      EALFAEILSCQ---------------------------------------------E---
gi|1866841      EWLFYHILANR---------------------------------------------K---
gi|1663650      ENLFSQVVS---------------------------------------------------
gi|1503835      GELFTSLNHQE---------------------------------------------K---
gi|1583139      HDLLLGLLTGRPGDAVSRDEAALGPGSPRDQGARNEPGQERVVAPMPPLEEQSPVPS---
gi|1973391      DALIEEALGVW---------------------------------------------R---
gi|9497084      RAILEGKVEPVA------------------QG------------------------QS--
gi|1776706      ERLLRRIAGR--------------------------------------------------
gi|1693500      IQVLNEAISIK---------------------------------------------K---
gi|8585822      ESLLRQAIACK---------------------------------------------TRDG
gi|1087979      ESLVETTIAGA---------------------------------------------R---
gi|1198670      ESLVETTIAGA---------------------------------------------R---
gi|1892202      EKAVEDLLK---------------------------------------------------
gi|1928113      RTLLQSI-----------------------------------------------------
gi|9497033      LSQLERLSAAT---------------------------------------------R---
gi|1591863      AALIAGLQTVE---------------------------------------------P---
gi|7366866      LDICKDII----------------------------------------------------
gi|2122724      LSVCNEASKSH-------------------------------------------------
gi|1877782      ETILLNLL----------------------------------------------------
GOS_678020      EALIASLAS---------------------------------------------------
                                                                            

gi|1179276      -------------------------------------R
gi|1134752      --R--------G-----EKK---KGKG-----------
gi|1866841      --S--------NVGILPKLDSKESVVD----------A
gi|1663650      --------------------------------------
gi|1503835      --K-----------------------------------
gi|1583139      --P--------R-----PARPGPTDPHLAIDTRGR-SA
gi|1973391      ---------------------------T----------
gi|9497084      --------------------------------------
gi|1776706      --------------------------------------
gi|1693500      --DR----------------------------------
gi|8585822      ICPAKASTERSS-----LISGSQNEIDGLNRGG-KDGT
gi|1087979      --P--------E-----QAEGGQSEAEV----------
gi|1198670      --P--------E-----QAEGGQSEAEV----------
gi|1892202      --K-----------------------------------
gi|1928113      --------------------------------------
gi|9497033      -----------------------A--------------
gi|1591863      --------SRPH-----LAQGISP--------------
gi|7366866      --------------------------------------
gi|2122724      --------------------------------------
gi|1877782      -----------------------K--------------
GOS_678020      -------------------------------------R
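
To work with an alignment pasted as text like the one above, a minimal Python sketch of a CLUSTAL-format reader follows. It is a simplified parser written for illustration (the function name `parse_clustal` and the toy input are assumptions): it skips the header, blank lines and consensus lines, and concatenates the blocks for each label. Labels are used exactly as they appear, so two sequences whose truncated labels were identical would be merged by mistake.

```python
def parse_clustal(text):
    """Parse CLUSTAL-format text into a dict of {label: gapped sequence}."""
    seqs = {}
    for line in text.splitlines():
        # Skip blank lines, the header line, and consensus lines
        # (consensus lines start with whitespace instead of a label).
        if not line.strip() or line.startswith("CLUSTAL") or line[0].isspace():
            continue
        parts = line.split()
        if len(parts) < 2:
            continue
        name, chunk = parts[0], parts[1]
        # Each block contributes one chunk per sequence; join them in order.
        seqs[name] = seqs.get(name, "") + chunk
    return seqs


# Toy input (hypothetical sequences, invented for this example).
sample = "\n".join([
    "CLUSTAL FORMAT for T-COFFEE (toy example)",
    "",
    "seq_a      MRIL--AV",
    "seq_b      MHILIYSY",
    "           * **",
    "",
    "seq_a      PHFV",
    "seq_b      NYNP",
    "",
])

alignment = parse_clustal(sample)
print(alignment["seq_a"])  # MRIL--AVPHFV
print(alignment["seq_b"])  # MHILIYSYNYNP
```

The resulting dictionary can be passed directly to column-wise or pairwise comparisons on the gapped sequences.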